Aidan Erickson committed on
Commit cbfdce9 · verified · 1 Parent(s): 2d0811d

Create README.md

Files changed (1)
  1. README.md +24 -0
README.md ADDED
@@ -0,0 +1,24 @@
+ ---
+ base_model:
+ - google/gemma-3-27b-it
+ ---
+ # Grass Keyframe Annotation Model
+ #### A targeted distillation of Gemini 2.5 Pro.
+
+ This model produces disentangled, precise descriptions of video keyframes. Its descriptive output fields are intended for downstream tasks such as classification, clustering and, by extension, search.
+
+ Given a keyframe image, the model returns output fields that look something like the following.
+
+ ```
+ {
+ "description": "A detailed, factual account of what is visibly happening in the image (4 sentences max). Only mention concrete elements or actions that are clearly shown. Do not include anything about how the image is styled, shot, or composed.",
+ "objects": ["object1 with relevant visual details", "object2 with relevant visual details", ...],
+ "actions": ["action1 with participants and context", "action2 with participants and context", ...],
+ "environment": "Detailed factual description of the setting and atmosphere based on visible cues (e.g., interior of a classroom with fluorescent lighting, or outdoor forest path with snow-covered trees).",
+ "content_type": "The type of content it is, e.g. 'real-world footage', 'video game', 'animation', 'cartoon', 'CGI', 'VTuber', etc.",
+ "specific_style": "Specific genre, aesthetic, or platform style (e.g., anime, 3D animation, mobile gameplay, vlog, tutorial, news broadcast, etc.)",
+ "production_quality": "Visible production level: e.g., 'professional studio', 'amateur handheld', 'webcam recording', 'TV broadcast', etc.",
+ "summary": "One clear, comprehensive sentence summarizing the visual content of the frame.",
+ "logos": ["logo1 with visual description", "logo2 with visual description", ...]
+ }
+ ```
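
For downstream use, it can help to check each response against the field list before indexing it. The following is a minimal sketch, not part of the README: it assumes the model returns the fields as a JSON object in plain text, possibly wrapped in a Markdown code fence, and the names `EXPECTED_FIELDS` and `parse_annotation` are hypothetical helpers introduced here for illustration.

```python
import json

# Field names and value types are taken from the template in the README.
# EXPECTED_FIELDS and parse_annotation are hypothetical names for this sketch.
EXPECTED_FIELDS = {
    "description": str,
    "objects": list,
    "actions": list,
    "environment": str,
    "content_type": str,
    "specific_style": str,
    "production_quality": str,
    "summary": str,
    "logos": list,
}

# Built programmatically so this example contains no literal fence marker.
FENCE = "`" * 3


def parse_annotation(raw: str) -> dict:
    """Parse one model response and validate it against EXPECTED_FIELDS."""
    lines = raw.strip().splitlines()

    # Assumption: the response may be wrapped in a Markdown code fence.
    if lines and lines[0].startswith(FENCE):
        lines = lines[1:]
    if lines and lines[-1].strip() == FENCE:
        lines = lines[:-1]

    # json.JSONDecodeError is a subclass of ValueError, so callers can
    # catch ValueError for both malformed JSON and schema problems.
    data = json.loads("\n".join(lines))
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object at the top level")

    for name, expected_type in EXPECTED_FIELDS.items():
        if name not in data:
            raise ValueError(f"missing field: {name}")
        value = data[name]
        if not isinstance(value, expected_type):
            raise ValueError(f"field {name!r} should be {expected_type.__name__}")
        # List fields are expected to hold plain strings.
        if expected_type is list and not all(isinstance(v, str) for v in value):
            raise ValueError(f"field {name!r} should contain only strings")

    return data
```

Calling `parse_annotation(response_text)` returns the parsed dictionary or raises `ValueError`, so a batch job can skip or retry malformed responses before the fields reach classification, clustering, or search.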