teysty committed
Commit bd135ee (verified)
1 Parent(s): a405c82

Upload VJEPA2ForVideoClassification

Files changed (3)
  1. README.md +199 -0
  2. config.json +394 -0
  3. model.safetensors +3 -0
README.md ADDED
@@ -0,0 +1,199 @@
+ ---
+ library_name: transformers
+ tags: []
+ ---
+
+ # Model Card for Model ID
+
+ <!-- Provide a quick summary of what the model is/does. -->
+
+
+
+ ## Model Details
+
+ ### Model Description
+
+ <!-- Provide a longer summary of what this model is. -->
+
+ This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.
+
+ - **Developed by:** [More Information Needed]
+ - **Funded by [optional]:** [More Information Needed]
+ - **Shared by [optional]:** [More Information Needed]
+ - **Model type:** [More Information Needed]
+ - **Language(s) (NLP):** [More Information Needed]
+ - **License:** [More Information Needed]
+ - **Finetuned from model [optional]:** [More Information Needed]
+
+ ### Model Sources [optional]
+
+ <!-- Provide the basic links for the model. -->
+
+ - **Repository:** [More Information Needed]
+ - **Paper [optional]:** [More Information Needed]
+ - **Demo [optional]:** [More Information Needed]
+
+ ## Uses
+
+ <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
+
+ ### Direct Use
+
+ <!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
+
+ [More Information Needed]
+
+ ### Downstream Use [optional]
+
+ <!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
+
+ [More Information Needed]
+
+ ### Out-of-Scope Use
+
+ <!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
+
+ [More Information Needed]
+
+ ## Bias, Risks, and Limitations
+
+ <!-- This section is meant to convey both technical and sociotechnical limitations. -->
+
+ [More Information Needed]
+
+ ### Recommendations
+
+ <!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
+
+ Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
+
+ ## How to Get Started with the Model
+
+ Use the code below to get started with the model.
+
+ [More Information Needed]
+
+ ## Training Details
+
+ ### Training Data
+
+ <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
+
+ [More Information Needed]
+
+ ### Training Procedure
+
+ <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
+
+ #### Preprocessing [optional]
+
+ [More Information Needed]
+
+
+ #### Training Hyperparameters
+
+ - **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
+
+ #### Speeds, Sizes, Times [optional]
+
+ <!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
+
+ [More Information Needed]
+
+ ## Evaluation
+
+ <!-- This section describes the evaluation protocols and provides the results. -->
+
+ ### Testing Data, Factors & Metrics
+
+ #### Testing Data
+
+ <!-- This should link to a Dataset Card if possible. -->
+
+ [More Information Needed]
+
+ #### Factors
+
+ <!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
+
+ [More Information Needed]
+
+ #### Metrics
+
+ <!-- These are the evaluation metrics being used, ideally with a description of why. -->
+
+ [More Information Needed]
+
+ ### Results
+
+ [More Information Needed]
+
+ #### Summary
+
+
+
+ ## Model Examination [optional]
+
+ <!-- Relevant interpretability work for the model goes here -->
+
+ [More Information Needed]
+
+ ## Environmental Impact
+
+ <!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
+
+ Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
+
+ - **Hardware Type:** [More Information Needed]
+ - **Hours used:** [More Information Needed]
+ - **Cloud Provider:** [More Information Needed]
+ - **Compute Region:** [More Information Needed]
+ - **Carbon Emitted:** [More Information Needed]
+
+ ## Technical Specifications [optional]
+
+ ### Model Architecture and Objective
+
+ [More Information Needed]
+
+ ### Compute Infrastructure
+
+ [More Information Needed]
+
+ #### Hardware
+
+ [More Information Needed]
+
+ #### Software
+
+ [More Information Needed]
+
+ ## Citation [optional]
+
+ <!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
+
+ **BibTeX:**
+
+ [More Information Needed]
+
+ **APA:**
+
+ [More Information Needed]
+
+ ## Glossary [optional]
+
+ <!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
+
+ [More Information Needed]
+
+ ## More Information [optional]
+
+ [More Information Needed]
+
+ ## Model Card Authors [optional]
+
+ [More Information Needed]
+
+ ## Model Card Contact
+
+ [More Information Needed]
config.json ADDED
@@ -0,0 +1,394 @@
+ {
+   "architectures": [
+     "VJEPA2ForVideoClassification"
+   ],
+   "attention_dropout": 0.0,
+   "attention_probs_dropout_prob": 0.0,
+   "crop_size": 256,
+   "drop_path_rate": 0.0,
+   "frames_per_clip": 16,
+   "hidden_act": "gelu",
+   "hidden_dropout_prob": 0.0,
+   "hidden_size": 1024,
+   "id2label": {
+     "0": "Approaching [something] with your camera",
+     "1": "Attaching [something] to [something]",
+     "2": "Bending [something] so that it deforms",
+     "3": "Bending [something] until it breaks",
+     "4": "Burying [something] in [something]",
+     "5": "Closing [something]",
+     "6": "Covering [something] with [something]",
+     "7": "Digging [something] out of [something]",
+     "8": "Dropping [something] behind [something]",
+     "9": "Dropping [something] in front of [something]",
+     "10": "Dropping [something] into [something]",
+     "11": "Dropping [something] next to [something]",
+     "12": "Dropping [something] onto [something]",
+     "13": "Failing to put [something] into [something] because [something] does not fit",
+     "14": "Folding [something]",
+     "15": "Hitting [something] with [something]",
+     "16": "Holding [something]",
+     "17": "Holding [something] behind [something]",
+     "18": "Holding [something] in front of [something]",
+     "19": "Holding [something] next to [something]",
+     "20": "Holding [something] over [something]",
+     "21": "Laying [something] on the table on its side, not upright",
+     "22": "Letting [something] roll along a flat surface",
+     "23": "Letting [something] roll down a slanted surface",
+     "24": "Letting [something] roll up a slanted surface, so it rolls back down",
+     "25": "Lifting a surface with [something] on it but not enough for it to slide down",
+     "26": "Lifting a surface with [something] on it until it starts sliding down",
+     "27": "Lifting [something] up completely without letting it drop down",
+     "28": "Lifting [something] up completely, then letting it drop down",
+     "29": "Lifting [something] with [something] on it",
+     "30": "Lifting up one end of [something] without letting it drop down",
+     "31": "Lifting up one end of [something], then letting it drop down",
+     "32": "Moving away from [something] with your camera",
+     "33": "Moving [part] of [something]",
+     "34": "Moving [something] across a surface until it falls down",
+     "35": "Moving [something] across a surface without it falling down",
+     "36": "Moving [something] and [something] away from each other",
+     "37": "Moving [something] and [something] closer to each other",
+     "38": "Moving [something] and [something] so they collide with each other",
+     "39": "Moving [something] and [something] so they pass each other",
+     "40": "Moving [something] away from [something]",
+     "41": "Moving [something] away from the camera",
+     "42": "Moving [something] closer to [something]",
+     "43": "Moving [something] down",
+     "44": "Moving [something] towards the camera",
+     "45": "Moving [something] up",
+     "46": "Opening [something]",
+     "47": "Picking [something] up",
+     "48": "Piling [something] up",
+     "49": "Plugging [something] into [something]",
+     "50": "Plugging [something] into [something] but pulling it right out as you remove your hand",
+     "51": "Poking a hole into [some substance]",
+     "52": "Poking a hole into [something soft]",
+     "53": "Poking a stack of [something] so the stack collapses",
+     "54": "Poking a stack of [something] without the stack collapsing",
+     "55": "Poking [something] so it slightly moves",
+     "56": "Poking [something] so lightly that it doesn't or almost doesn't move",
+     "57": "Poking [something] so that it falls over",
+     "58": "Poking [something] so that it spins around",
+     "59": "Pouring [something] into [something]",
+     "60": "Pouring [something] into [something] until it overflows",
+     "61": "Pouring [something] onto [something]",
+     "62": "Pouring [something] out of [something]",
+     "63": "Pretending or failing to wipe [something] off of [something]",
+     "64": "Pretending or trying and failing to twist [something]",
+     "65": "Pretending to be tearing [something that is not tearable]",
+     "66": "Pretending to close [something] without actually closing it",
+     "67": "Pretending to open [something] without actually opening it",
+     "68": "Pretending to pick [something] up",
+     "69": "Pretending to poke [something]",
+     "70": "Pretending to pour [something] out of [something], but [something] is empty",
+     "71": "Pretending to put [something] behind [something]",
+     "72": "Pretending to put [something] into [something]",
+     "73": "Pretending to put [something] next to [something]",
+     "74": "Pretending to put [something] on a surface",
+     "75": "Pretending to put [something] onto [something]",
+     "76": "Pretending to put [something] underneath [something]",
+     "77": "Pretending to scoop [something] up with [something]",
+     "78": "Pretending to spread air onto [something]",
+     "79": "Pretending to sprinkle air onto [something]",
+     "80": "Pretending to squeeze [something]",
+     "81": "Pretending to take [something] from [somewhere]",
+     "82": "Pretending to take [something] out of [something]",
+     "83": "Pretending to throw [something]",
+     "84": "Pretending to turn [something] upside down",
+     "85": "Pulling [something] from behind of [something]",
+     "86": "Pulling [something] from left to right",
+     "87": "Pulling [something] from right to left",
+     "88": "Pulling [something] onto [something]",
+     "89": "Pulling [something] out of [something]",
+     "90": "Pulling two ends of [something] but nothing happens",
+     "91": "Pulling two ends of [something] so that it gets stretched",
+     "92": "Pulling two ends of [something] so that it separates into two pieces",
+     "93": "Pushing [something] from left to right",
+     "94": "Pushing [something] from right to left",
+     "95": "Pushing [something] off of [something]",
+     "96": "Pushing [something] onto [something]",
+     "97": "Pushing [something] so it spins",
+     "98": "Pushing [something] so that it almost falls off but doesn't",
+     "99": "Pushing [something] so that it falls off the table",
+     "100": "Pushing [something] so that it slightly moves",
+     "101": "Pushing [something] with [something]",
+     "102": "Putting [number of] [something] onto [something]",
+     "103": "Putting [something] and [something] on the table",
+     "104": "Putting [something] behind [something]",
+     "105": "Putting [something] in front of [something]",
+     "106": "Putting [something] into [something]",
+     "107": "Putting [something] next to [something]",
+     "108": "Putting [something] on a flat surface without letting it roll",
+     "109": "Putting [something] on a surface",
+     "110": "Putting [something] on the edge of [something] so it is not supported and falls down",
+     "111": "Putting [something] onto a slanted surface but it doesn't glide down",
+     "112": "Putting [something] onto [something]",
+     "113": "Putting [something] onto [something else that cannot support it] so it falls down",
+     "114": "Putting [something similar to other things that are already on the table]",
+     "115": "Putting [something] that can't roll onto a slanted surface, so it slides down",
+     "116": "Putting [something] that can't roll onto a slanted surface, so it stays where it is",
+     "117": "Putting [something that cannot actually stand upright] upright on the table, so it falls on its side",
+     "118": "Putting [something] underneath [something]",
+     "119": "Putting [something] upright on the table",
+     "120": "Putting [something], [something] and [something] on the table",
+     "121": "Removing [something], revealing [something] behind",
+     "122": "Rolling [something] on a flat surface",
+     "123": "Scooping [something] up with [something]",
+     "124": "Showing a photo of [something] to the camera",
+     "125": "Showing [something] behind [something]",
+     "126": "Showing [something] next to [something]",
+     "127": "Showing [something] on top of [something]",
+     "128": "Showing [something] to the camera",
+     "129": "Showing that [something] is empty",
+     "130": "Showing that [something] is inside [something]",
+     "131": "[Something] being deflected from [something]",
+     "132": "[Something] colliding with [something] and both are being deflected",
+     "133": "[Something] colliding with [something] and both come to a halt",
+     "134": "[Something] falling like a feather or paper",
+     "135": "[Something] falling like a rock",
+     "136": "Spilling [something] behind [something]",
+     "137": "Spilling [something] next to [something]",
+     "138": "Spilling [something] onto [something]",
+     "139": "Spinning [something] so it continues spinning",
+     "140": "Spinning [something] that quickly stops spinning",
+     "141": "Spreading [something] onto [something]",
+     "142": "Sprinkling [something] onto [something]",
+     "143": "Squeezing [something]",
+     "144": "Stacking [number of] [something]",
+     "145": "Stuffing [something] into [something]",
+     "146": "Taking [one of many similar things on the table]",
+     "147": "Taking [something] from [somewhere]",
+     "148": "Taking [something] out of [something]",
+     "149": "Tearing [something] into two pieces",
+     "150": "Tearing [something] just a little bit",
+     "151": "Throwing [something]",
+     "152": "Throwing [something] against [something]",
+     "153": "Throwing [something] in the air and catching it",
+     "154": "Throwing [something] in the air and letting it fall",
+     "155": "Throwing [something] onto a surface",
+     "156": "Tilting [something] with [something] on it slightly so it doesn't fall down",
+     "157": "Tilting [something] with [something] on it until it falls off",
+     "158": "Tipping [something] over",
+     "159": "Tipping [something] with [something in it] over, so [something in it] falls out",
+     "160": "Touching (without moving) [part] of [something]",
+     "161": "Trying but failing to attach [something] to [something] because it doesn't stick",
+     "162": "Trying to bend [something unbendable] so nothing happens",
+     "163": "Trying to pour [something] into [something], but missing so it spills next to it",
+     "164": "Turning [something] upside down",
+     "165": "Turning the camera downwards while filming [something]",
+     "166": "Turning the camera left while filming [something]",
+     "167": "Turning the camera right while filming [something]",
+     "168": "Turning the camera upwards while filming [something]",
+     "169": "Twisting (wringing) [something] wet until water comes out",
+     "170": "Twisting [something]",
+     "171": "Uncovering [something]",
+     "172": "Unfolding [something]",
+     "173": "Wiping [something] off of [something]",
+     "174": "fall",
+     "175": "before-fall",
+     "176": "after-fall"
+   },
+   "image_size": 256,
+   "in_chans": 3,
+   "initializer_range": 0.02,
+   "label2id": {
+     "Approaching [something] with your camera": 0,
+     "Attaching [something] to [something]": 1,
+     "Bending [something] so that it deforms": 2,
+     "Bending [something] until it breaks": 3,
+     "Burying [something] in [something]": 4,
+     "Closing [something]": 5,
+     "Covering [something] with [something]": 6,
+     "Digging [something] out of [something]": 7,
+     "Dropping [something] behind [something]": 8,
+     "Dropping [something] in front of [something]": 9,
+     "Dropping [something] into [something]": 10,
+     "Dropping [something] next to [something]": 11,
+     "Dropping [something] onto [something]": 12,
+     "Failing to put [something] into [something] because [something] does not fit": 13,
+     "Folding [something]": 14,
+     "Hitting [something] with [something]": 15,
+     "Holding [something]": 16,
+     "Holding [something] behind [something]": 17,
+     "Holding [something] in front of [something]": 18,
+     "Holding [something] next to [something]": 19,
+     "Holding [something] over [something]": 20,
+     "Laying [something] on the table on its side, not upright": 21,
+     "Letting [something] roll along a flat surface": 22,
+     "Letting [something] roll down a slanted surface": 23,
+     "Letting [something] roll up a slanted surface, so it rolls back down": 24,
+     "Lifting [something] up completely without letting it drop down": 27,
+     "Lifting [something] up completely, then letting it drop down": 28,
+     "Lifting [something] with [something] on it": 29,
+     "Lifting a surface with [something] on it but not enough for it to slide down": 25,
+     "Lifting a surface with [something] on it until it starts sliding down": 26,
+     "Lifting up one end of [something] without letting it drop down": 30,
+     "Lifting up one end of [something], then letting it drop down": 31,
+     "Moving [part] of [something]": 33,
+     "Moving [something] across a surface until it falls down": 34,
+     "Moving [something] across a surface without it falling down": 35,
+     "Moving [something] and [something] away from each other": 36,
+     "Moving [something] and [something] closer to each other": 37,
+     "Moving [something] and [something] so they collide with each other": 38,
+     "Moving [something] and [something] so they pass each other": 39,
+     "Moving [something] away from [something]": 40,
+     "Moving [something] away from the camera": 41,
+     "Moving [something] closer to [something]": 42,
+     "Moving [something] down": 43,
+     "Moving [something] towards the camera": 44,
+     "Moving [something] up": 45,
+     "Moving away from [something] with your camera": 32,
+     "Opening [something]": 46,
+     "Picking [something] up": 47,
+     "Piling [something] up": 48,
+     "Plugging [something] into [something]": 49,
+     "Plugging [something] into [something] but pulling it right out as you remove your hand": 50,
+     "Poking [something] so it slightly moves": 55,
+     "Poking [something] so lightly that it doesn't or almost doesn't move": 56,
+     "Poking [something] so that it falls over": 57,
+     "Poking [something] so that it spins around": 58,
+     "Poking a hole into [some substance]": 51,
+     "Poking a hole into [something soft]": 52,
+     "Poking a stack of [something] so the stack collapses": 53,
+     "Poking a stack of [something] without the stack collapsing": 54,
+     "Pouring [something] into [something]": 59,
+     "Pouring [something] into [something] until it overflows": 60,
+     "Pouring [something] onto [something]": 61,
+     "Pouring [something] out of [something]": 62,
+     "Pretending or failing to wipe [something] off of [something]": 63,
+     "Pretending or trying and failing to twist [something]": 64,
+     "Pretending to be tearing [something that is not tearable]": 65,
+     "Pretending to close [something] without actually closing it": 66,
+     "Pretending to open [something] without actually opening it": 67,
+     "Pretending to pick [something] up": 68,
+     "Pretending to poke [something]": 69,
+     "Pretending to pour [something] out of [something], but [something] is empty": 70,
+     "Pretending to put [something] behind [something]": 71,
+     "Pretending to put [something] into [something]": 72,
+     "Pretending to put [something] next to [something]": 73,
+     "Pretending to put [something] on a surface": 74,
+     "Pretending to put [something] onto [something]": 75,
+     "Pretending to put [something] underneath [something]": 76,
+     "Pretending to scoop [something] up with [something]": 77,
+     "Pretending to spread air onto [something]": 78,
+     "Pretending to sprinkle air onto [something]": 79,
+     "Pretending to squeeze [something]": 80,
+     "Pretending to take [something] from [somewhere]": 81,
+     "Pretending to take [something] out of [something]": 82,
+     "Pretending to throw [something]": 83,
+     "Pretending to turn [something] upside down": 84,
+     "Pulling [something] from behind of [something]": 85,
+     "Pulling [something] from left to right": 86,
+     "Pulling [something] from right to left": 87,
+     "Pulling [something] onto [something]": 88,
+     "Pulling [something] out of [something]": 89,
+     "Pulling two ends of [something] but nothing happens": 90,
+     "Pulling two ends of [something] so that it gets stretched": 91,
+     "Pulling two ends of [something] so that it separates into two pieces": 92,
+     "Pushing [something] from left to right": 93,
+     "Pushing [something] from right to left": 94,
+     "Pushing [something] off of [something]": 95,
+     "Pushing [something] onto [something]": 96,
+     "Pushing [something] so it spins": 97,
+     "Pushing [something] so that it almost falls off but doesn't": 98,
+     "Pushing [something] so that it falls off the table": 99,
+     "Pushing [something] so that it slightly moves": 100,
+     "Pushing [something] with [something]": 101,
+     "Putting [number of] [something] onto [something]": 102,
+     "Putting [something similar to other things that are already on the table]": 114,
+     "Putting [something that cannot actually stand upright] upright on the table, so it falls on its side": 117,
+     "Putting [something] and [something] on the table": 103,
+     "Putting [something] behind [something]": 104,
+     "Putting [something] in front of [something]": 105,
+     "Putting [something] into [something]": 106,
+     "Putting [something] next to [something]": 107,
+     "Putting [something] on a flat surface without letting it roll": 108,
+     "Putting [something] on a surface": 109,
+     "Putting [something] on the edge of [something] so it is not supported and falls down": 110,
+     "Putting [something] onto [something else that cannot support it] so it falls down": 113,
+     "Putting [something] onto [something]": 112,
+     "Putting [something] onto a slanted surface but it doesn't glide down": 111,
+     "Putting [something] that can't roll onto a slanted surface, so it slides down": 115,
+     "Putting [something] that can't roll onto a slanted surface, so it stays where it is": 116,
+     "Putting [something] underneath [something]": 118,
+     "Putting [something] upright on the table": 119,
+     "Putting [something], [something] and [something] on the table": 120,
+     "Removing [something], revealing [something] behind": 121,
+     "Rolling [something] on a flat surface": 122,
+     "Scooping [something] up with [something]": 123,
+     "Showing [something] behind [something]": 125,
+     "Showing [something] next to [something]": 126,
+     "Showing [something] on top of [something]": 127,
+     "Showing [something] to the camera": 128,
+     "Showing a photo of [something] to the camera": 124,
+     "Showing that [something] is empty": 129,
+     "Showing that [something] is inside [something]": 130,
+     "Spilling [something] behind [something]": 136,
+     "Spilling [something] next to [something]": 137,
+     "Spilling [something] onto [something]": 138,
+     "Spinning [something] so it continues spinning": 139,
+     "Spinning [something] that quickly stops spinning": 140,
+     "Spreading [something] onto [something]": 141,
+     "Sprinkling [something] onto [something]": 142,
+     "Squeezing [something]": 143,
+     "Stacking [number of] [something]": 144,
+     "Stuffing [something] into [something]": 145,
+     "Taking [one of many similar things on the table]": 146,
+     "Taking [something] from [somewhere]": 147,
+     "Taking [something] out of [something]": 148,
+     "Tearing [something] into two pieces": 149,
+     "Tearing [something] just a little bit": 150,
+     "Throwing [something]": 151,
+     "Throwing [something] against [something]": 152,
+     "Throwing [something] in the air and catching it": 153,
+     "Throwing [something] in the air and letting it fall": 154,
+     "Throwing [something] onto a surface": 155,
+     "Tilting [something] with [something] on it slightly so it doesn't fall down": 156,
+     "Tilting [something] with [something] on it until it falls off": 157,
+     "Tipping [something] over": 158,
+     "Tipping [something] with [something in it] over, so [something in it] falls out": 159,
+     "Touching (without moving) [part] of [something]": 160,
+     "Trying but failing to attach [something] to [something] because it doesn't stick": 161,
+     "Trying to bend [something unbendable] so nothing happens": 162,
+     "Trying to pour [something] into [something], but missing so it spills next to it": 163,
+     "Turning [something] upside down": 164,
+     "Turning the camera downwards while filming [something]": 165,
+     "Turning the camera left while filming [something]": 166,
+     "Turning the camera right while filming [something]": 167,
+     "Turning the camera upwards while filming [something]": 168,
+     "Twisting (wringing) [something] wet until water comes out": 169,
+     "Twisting [something]": 170,
+     "Uncovering [something]": 171,
+     "Unfolding [something]": 172,
+     "Wiping [something] off of [something]": 173,
+     "[Something] being deflected from [something]": 131,
+     "[Something] colliding with [something] and both are being deflected": 132,
+     "[Something] colliding with [something] and both come to a halt": 133,
+     "[Something] falling like a feather or paper": 134,
+     "[Something] falling like a rock": 135,
+     "after-fall": 176,
+     "before-fall": 175,
+     "fall": 174
+   },
+   "layer_norm_eps": 1e-06,
+   "mlp_ratio": 4,
+   "model_type": "vjepa2",
+   "num_attention_heads": 16,
+   "num_hidden_layers": 24,
+   "num_pooler_layers": 3,
+   "patch_size": 16,
+   "pred_hidden_size": 384,
+   "pred_mlp_ratio": 4.0,
+   "pred_num_attention_heads": 12,
+   "pred_num_hidden_layers": 12,
+   "pred_num_mask_tokens": 10,
+   "pred_zero_init_mask_tokens": true,
+   "problem_type": "single_label_classification",
+   "qkv_bias": true,
+   "torch_dtype": "float32",
+   "transformers_version": "4.56.0.dev0",
+   "tubelet_size": 2,
+   "use_SiLU": false,
+   "wide_SiLU": true
+ }
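A few quantities implied by this config can be sanity-checked with plain Python. The sketch below is illustrative only and is not part of the commit. It copies a handful of values from the diff, assumes the encoder cuts each clip into non-overlapping tubelets of `tubelet_size` frames by `patch_size` x `patch_size` pixels (an assumption about the architecture, not something the config states), and uses a five-entry excerpt of `id2label` in place of the full 177-entry map. The helper names `tokens_per_clip`, `head_dim`, and `invert_labels` are hypothetical.

```python
# Values copied from the config.json diff above.
config = {
    "crop_size": 256,
    "frames_per_clip": 16,
    "hidden_size": 1024,
    "num_attention_heads": 16,
    "patch_size": 16,
    "tubelet_size": 2,
}

# Excerpt of id2label only; the full map in the diff has keys "0" through "176".
id2label_excerpt = {
    "0": "Approaching [something] with your camera",
    "173": "Wiping [something] off of [something]",
    "174": "fall",
    "175": "before-fall",
    "176": "after-fall",
}


def tokens_per_clip(cfg):
    """Patch tokens per clip, ASSUMING non-overlapping tubelets."""
    patches_per_side = cfg["crop_size"] // cfg["patch_size"]
    temporal_steps = cfg["frames_per_clip"] // cfg["tubelet_size"]
    return temporal_steps * patches_per_side * patches_per_side


def head_dim(cfg):
    """Width of one attention head, assuming hidden_size is split evenly."""
    return cfg["hidden_size"] // cfg["num_attention_heads"]


def invert_labels(id2label):
    """Build label2id from id2label, raising on duplicate label strings."""
    label2id = {}
    for idx, label in id2label.items():
        if label in label2id:
            raise ValueError(f"duplicate label: {label!r}")
        label2id[label] = int(idx)
    return label2id


print(tokens_per_clip(config))                  # 2048
print(head_dim(config))                         # 64
print(invert_labels(id2label_excerpt)["fall"])  # 174
```

Running `invert_labels` on the full `id2label` map and comparing the result with `label2id` is a quick way to confirm the two maps are inverses. Note that `label2id` is sorted by label string, so its ids do not appear in numeric order.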
model.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:720160d7a8f7a8d66439e40e5c0640040159769b3668dadea4343d77b15a62f1
+ size 1502030892
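The three added lines are a Git LFS pointer, not the weights themselves: the pointer records the spec version, the SHA-256 of the real file, and its size in bytes. As a rough illustration, the sketch below parses the pointer text and derives an upper bound on the parameter count by assuming every byte stores a `float32` value, as `torch_dtype` in config.json suggests. The real count is somewhat lower because a safetensors file also carries a header. `parse_lfs_pointer` is a hypothetical helper, not part of any library.

```python
def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    hash_algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": hash_algo,
        "digest": digest,
        "size_bytes": int(fields["size"]),
    }


# Pointer text copied from the model.safetensors diff above.
pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:720160d7a8f7a8d66439e40e5c0640040159769b3668dadea4343d77b15a62f1
size 1502030892
"""

info = parse_lfs_pointer(pointer)

# Upper bound only: assumes 4 bytes per parameter and ignores the file header.
approx_params = info["size_bytes"] // 4

print(info["hash_algo"])   # sha256
print(info["size_bytes"])  # 1502030892
print(approx_params)       # 375507723
```

The file is therefore about 1.5 GB, which corresponds to at most roughly 375 million `float32` parameters.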