camenduru commited on
Commit
4ff4b2a
·
1 Parent(s): 7e7082a

thanks to muelea ❤

Browse files
essentials/.DS_Store ADDED
Binary file (8.2 kB). View file
 
essentials/body_model_utils/lowres_smplx.pkl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7916f2de2839f3c9c7414368f779bd1db4df20882f6fbf95b7137bec7e30599d
3
+ size 175766
essentials/body_model_utils/smpl_to_smplx.pkl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e8917053cc980c1218453a563378c9e7d277e71c49def3a26ab72afb19c7b994
3
+ size 578366660
essentials/body_model_utils/smplx_faces.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c4a6a7ad34078a6b091ef40445655edefe55ff5a654c2fb315e7f5f48cb01c56
3
+ size 251627
essentials/body_model_utils/smplx_inner_mouth_bounds.pkl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b59408d8e6882b62a0604498380be93fdcd2fedd83af3efbd50e4a5cfce8344b
3
+ size 115
essentials/body_models/smil/smil_packed_info.pth ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9fbeebd143f53bf7ec7c3873420ea220ed75927af0ca382e13aafaf1b33ff992
3
+ size 20232886
essentials/body_models/smil/smil_web.pkl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8f2d2eb4c6b545e37ac8f6228f823d470b11786a50c95781dbd1e9aaa9f233e3
3
+ size 90318333
essentials/body_models/smil/smplx_kid_template.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:bdce4f5886b9ddcb6da3ee0f70ae636b1aa1292f2b379c4c3149fce8abc0a604
3
+ size 251528
essentials/body_models/smpl/SMPL_NEUTRAL.pkl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:814b40834cd955f67fece3129d04a24699fd94bf9fcc0f74f84288f078728217
3
+ size 20232766
essentials/body_models/smpla/SMPLA_NEUTRAL.pkl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:87853127bf09585c4d60be65471add87c8248cb1bdf13ea568b0f13031f5750d
3
+ size 21142479
essentials/body_models/smplx/SMPLX_NEUTRAL.npz ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:376021446ddc86e99acacd795182bbef903e61d33b76b9d8b359c2b0865bd992
3
+ size 108752058
essentials/body_models/smplx/SMPLX_NEUTRAL.pkl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:381c808965deb4f5e845f8c3eddb0cd69930cc72e5774ce4f34c4ce3cf058361
3
+ size 544173380
essentials/buddi/.DS_Store ADDED
Binary file (6.15 kB). View file
 
essentials/buddi/buddi_cond_bev.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5fbac46d4ad4624f652cbc4ff1b6722ef69dcde5acf89c3bd57fd36984951dfb
3
+ size 30586515
essentials/buddi/buddi_cond_bev.yaml ADDED
@@ -0,0 +1,1024 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ batch_size: 512
2
+ device: cuda
3
+ body_model:
4
+ type: smplx
5
+ smpl_family_folder: essentials/body_models
6
+ smpl:
7
+ num_vertices: 6890
8
+ init:
9
+ ext: pkl
10
+ batch_size: 1
11
+ gender: neutral
12
+ age: adult
13
+ kid_template_path: essentials/body_models/smil/smplx_kid_template.npy
14
+ betas:
15
+ dim: 10
16
+ create: true
17
+ body_pose:
18
+ create: true
19
+ global_orient:
20
+ create: true
21
+ transl:
22
+ create: true
23
+ joint_mapper:
24
+ use: false
25
+ type: smpl_to_openpose
26
+ smpl_to_openpose:
27
+ use_hands: true
28
+ use_face: true
29
+ use_face_contour: false
30
+ openpose_format: coco25
31
+ smplh:
32
+ num_vertices: 6890
33
+ init:
34
+ ext: pkl
35
+ batch_size: 1
36
+ gender: neutral
37
+ age: adult
38
+ kid_template_path: essentials/body_models/smil/smplx_kid_template.npy
39
+ betas:
40
+ dim: 10
41
+ create: true
42
+ body_pose:
43
+ create: true
44
+ global_orient:
45
+ create: true
46
+ transl:
47
+ create: true
48
+ joint_mapper:
49
+ use: false
50
+ type: smpl_to_openpose
51
+ smpl_to_openpose:
52
+ use_hands: true
53
+ use_face: true
54
+ use_face_contour: false
55
+ openpose_format: coco25
56
+ left_hand_pose:
57
+ use_pca: true
58
+ num_pca_comps: 6
59
+ flat_hand_mean: false
60
+ create: true
61
+ right_hand_pose:
62
+ use_pca: true
63
+ num_pca_comps: 6
64
+ flat_hand_mean: false
65
+ create: true
66
+ use_compressed: true
67
+ smplx:
68
+ num_vertices: 10475
69
+ init:
70
+ ext: npz
71
+ batch_size: 1
72
+ gender: neutral
73
+ age: kid
74
+ kid_template_path: essentials/body_models/smil/smplx_kid_template.npy
75
+ betas:
76
+ dim: 10
77
+ create: true
78
+ body_pose:
79
+ create: true
80
+ global_orient:
81
+ create: true
82
+ transl:
83
+ create: true
84
+ joint_mapper:
85
+ use: true
86
+ type: smpl_to_openpose
87
+ smpl_to_openpose:
88
+ use_hands: false
89
+ use_face: false
90
+ use_face_contour: false
91
+ openpose_format: coco25
92
+ left_hand_pose:
93
+ use_pca: true
94
+ num_pca_comps: 6
95
+ flat_hand_mean: false
96
+ create: true
97
+ right_hand_pose:
98
+ use_pca: true
99
+ num_pca_comps: 6
100
+ flat_hand_mean: false
101
+ create: true
102
+ use_compressed: true
103
+ hand_vertex_ids_path: ''
104
+ expression:
105
+ dim: 10
106
+ create: true
107
+ jaw_pose:
108
+ create: true
109
+ leye_pose:
110
+ create: true
111
+ reye_pose:
112
+ create: true
113
+ datasets:
114
+ processing:
115
+ normalization_mean:
116
+ - 0.485
117
+ - 0.456
118
+ - 0.406
119
+ normalization_std:
120
+ - 0.229
121
+ - 0.224
122
+ - 0.225
123
+ resolution: 224
124
+ use: false
125
+ load_image: false
126
+ train_names:
127
+ - chi3d
128
+ - hi4d
129
+ - flickrci3ds
130
+ train_composition:
131
+ - 0.2
132
+ - 0.2
133
+ - 0.6
134
+ augmentation:
135
+ use: true
136
+ mirror: 0.5
137
+ noise: 0.4
138
+ rotation: 30.0
139
+ scale: 0.25
140
+ swap: 0.5
141
+ val_names:
142
+ - hi4d
143
+ - chi3d
144
+ - flickrci3ds
145
+ test_names: []
146
+ flickrci3dsd:
147
+ body_model_path: essentials/body_models
148
+ original_data_folder: datasets/original/FlickrCI3D_Signatures
149
+ imar_vision_datasets_tools_folder: essentials/imar_vision_datasets_tools
150
+ processed_data_folder: datasets/processed/FlickrCI3D_Signatures
151
+ max_count_regions_in_contact: 25
152
+ number_of_regions: 75
153
+ image_folder: images
154
+ bev_folder: bev
155
+ openpose_folder: keypoints/keypoints
156
+ vitpose_folder: vitpose
157
+ vitdet_folder: vitdet
158
+ image_format: png
159
+ pseudogt_folder: pseudogt/summaries
160
+ overfit: false
161
+ overfit_num_samples: 12
162
+ init_pose_from_bev: false
163
+ features:
164
+ is_itw: true
165
+ has_dhhc_class: false
166
+ has_dhhc_sig: true
167
+ has_dsc_sig: false
168
+ has_dsc_class: false
169
+ has_gt_kpts: false
170
+ has_op_kpts: true
171
+ has_gt_joints: false
172
+ has_gender: false
173
+ has_gt_smpl_shape: false
174
+ has_gt_smpl_pose: false
175
+ has_pgt_smpl_shape: false
176
+ has_pgt_smpl_pose: false
177
+ flickrci3ds:
178
+ body_model_path: essentials/body_models
179
+ original_data_folder: datasets/original/FlickrCI3D_Signatures
180
+ imar_vision_datasets_tools_folder: essentials/imar_vision_datasets_tools
181
+ processed_data_folder: datasets/processed/FlickrCI3D_Signatures
182
+ max_count_regions_in_contact: 25
183
+ number_of_regions: 75
184
+ image_folder: images
185
+ bev_folder: bev
186
+ openpose_folder: openpose
187
+ vitpose_folder: vitpose
188
+ vitposeplus_folder: vitposeplus
189
+ vitdet_folder: vitdet
190
+ image_format: png
191
+ pseudogt_folder: hhcs_opti/run4_20231015/results
192
+ overfit: false
193
+ overfit_num_samples: 12
194
+ adult_only: false
195
+ child_only: false
196
+ features:
197
+ is_itw: true
198
+ has_dhhc_class: false
199
+ has_dhhc_sig: true
200
+ has_dsc_sig: false
201
+ has_dsc_class: false
202
+ has_gt_kpts: false
203
+ has_op_kpts: true
204
+ has_gt_joints: false
205
+ has_gender: false
206
+ has_gt_smpl_shape: false
207
+ has_gt_smpl_pose: false
208
+ has_pgt_smpl_shape: false
209
+ has_pgt_smpl_pose: false
210
+ flickrci3ds_adult:
211
+ body_model_path: essentials/body_models
212
+ original_data_folder: datasets/original/FlickrCI3D_Signatures
213
+ imar_vision_datasets_tools_folder: essentials/imar_vision_datasets_tools
214
+ processed_data_folder: datasets/processed/FlickrCI3D_Signatures
215
+ max_count_regions_in_contact: 25
216
+ number_of_regions: 75
217
+ image_folder: images
218
+ bev_folder: bev
219
+ openpose_folder: openpose
220
+ vitpose_folder: vitpose
221
+ vitposeplus_folder: vitposeplus
222
+ vitdet_folder: vitdet
223
+ image_format: png
224
+ pseudogt_folder: pseudogt/summaries
225
+ overfit: false
226
+ overfit_num_samples: 12
227
+ adult_only: true
228
+ child_only: false
229
+ features:
230
+ is_itw: true
231
+ has_dhhc_class: false
232
+ has_dhhc_sig: true
233
+ has_dsc_sig: false
234
+ has_dsc_class: false
235
+ has_gt_kpts: false
236
+ has_op_kpts: true
237
+ has_gt_joints: false
238
+ has_gender: false
239
+ has_gt_smpl_shape: false
240
+ has_gt_smpl_pose: false
241
+ has_pgt_smpl_shape: false
242
+ has_pgt_smpl_pose: false
243
+ flickrci3ds_child:
244
+ body_model_path: essentials/body_models
245
+ original_data_folder: datasets/original/FlickrCI3D_Signatures
246
+ imar_vision_datasets_tools_folder: essentials/imar_vision_datasets_tools
247
+ processed_data_folder: datasets/processed/FlickrCI3D_Signatures
248
+ max_count_regions_in_contact: 25
249
+ number_of_regions: 75
250
+ image_folder: images
251
+ bev_folder: bev
252
+ openpose_folder: openpose
253
+ vitpose_folder: vitpose
254
+ vitposeplus_folder: vitposeplus
255
+ vitdet_folder: vitdet
256
+ image_format: png
257
+ pseudogt_folder: pseudogt/summaries
258
+ overfit: false
259
+ overfit_num_samples: 12
260
+ adult_only: false
261
+ child_only: true
262
+ features:
263
+ is_itw: true
264
+ has_dhhc_class: false
265
+ has_dhhc_sig: true
266
+ has_dsc_sig: false
267
+ has_dsc_class: false
268
+ has_gt_kpts: false
269
+ has_op_kpts: true
270
+ has_gt_joints: false
271
+ has_gender: false
272
+ has_gt_smpl_shape: false
273
+ has_gt_smpl_pose: false
274
+ has_pgt_smpl_shape: false
275
+ has_pgt_smpl_pose: false
276
+ flickrci3dc:
277
+ original_data_folder: datasets/original/FlickrCI3D_Classification
278
+ imar_vision_datasets_tools_folder: essentials/imar_vision_datasets_tools
279
+ processed_data_folder: datasets/processed/FlickrCI3D_Classification
280
+ image_folder: images
281
+ image_format: png
282
+ bev_folder: bev
283
+ features:
284
+ is_itw: true
285
+ has_dhhc_class: true
286
+ has_dhhc_sig: false
287
+ has_dsc_sig: false
288
+ has_dsc_class: false
289
+ has_gt_kpts: false
290
+ has_op_kpts: true
291
+ has_gt_joints: false
292
+ has_gender: false
293
+ has_gt_smpl_shape: false
294
+ has_gt_smpl_pose: false
295
+ has_pgt_smpl_shape: false
296
+ has_pgt_smpl_pose: false
297
+ overfit: false
298
+ overfit_num_samples: 64
299
+ chi3d:
300
+ original_data_folder: datasets/original/CHI3D
301
+ processed_data_folder: datasets/processed/CHI3D
302
+ imar_vision_datasets_tools_folder: essentials/imar_vision_datasets_tools
303
+ max_count_regions_in_contact: 25
304
+ number_of_regions: 75
305
+ image_folder: images
306
+ bev_folder: bev
307
+ openpose_folder: keypoints/keypoints
308
+ vitpose_folder: vitpose
309
+ vitdet_folder: vitdet
310
+ image_format: png
311
+ pseudogt_folder: pseudogt/summaries
312
+ overfit: false
313
+ overfit_num_samples: 12
314
+ load_single_camera: false
315
+ load_from_scratch_single_camera: false
316
+ load_contact_frame_only: true
317
+ load_unit_glob_and_transl: false
318
+ features:
319
+ is_itw: false
320
+ has_dhhc_class: false
321
+ has_dhhc_sig: true
322
+ has_dsc_sig: false
323
+ has_dsc_class: false
324
+ has_gt_kpts: false
325
+ has_op_kpts: true
326
+ has_gt_joints: false
327
+ has_gender: false
328
+ has_gt_smpl_shape: false
329
+ has_gt_smpl_pose: false
330
+ has_pgt_smpl_shape: false
331
+ has_pgt_smpl_pose: false
332
+ hi4d:
333
+ original_data_folder: datasets/original/Hi4D
334
+ processed_data_folder: datasets/processed/Hi4D
335
+ image_folder: images
336
+ bev_folder: bev
337
+ openpose_folder: keypoints/keypoints
338
+ vitpose_folder: keypoints/vitposeplus
339
+ image_format: jpg
340
+ overfit: false
341
+ overfit_num_samples: 12
342
+ load_single_camera: false
343
+ load_from_scratch_single_camera: false
344
+ load_unit_glob_and_transl: false
345
+ features:
346
+ is_itw: false
347
+ has_dhhc_class: false
348
+ has_dhhc_sig: true
349
+ has_dsc_sig: false
350
+ has_dsc_class: false
351
+ has_gt_kpts: false
352
+ has_op_kpts: true
353
+ has_gt_joints: false
354
+ has_gender: false
355
+ has_gt_smpl_shape: false
356
+ has_gt_smpl_pose: false
357
+ has_pgt_smpl_shape: false
358
+ has_pgt_smpl_pose: false
359
+ demo:
360
+ original_data_folder: ''
361
+ number_of_regions: 75
362
+ image_folder: images
363
+ bev_folder: bev
364
+ openpose_folder: keypoints/keypoints
365
+ vitpose_folder: vitpose
366
+ image_format: png
367
+ image_name_select: ''
368
+ has_gt_contact_annotation: false
369
+ imar_vision_datasets_tools_folder: essentials/imar_vision_datasets_tools
370
+ unique_keypoint_match: true
371
+ camera:
372
+ type: perspective
373
+ perspective:
374
+ afov_horizontal: 60.0
375
+ pitch: 0.0
376
+ yaw: 0.0
377
+ roll: 0.0
378
+ tx: 0.0
379
+ ty: 0.0
380
+ tz: 0.0
381
+ iw: 224
382
+ ih: 224
383
+ model:
384
+ regressor:
385
+ type: diffusion_transformer
386
+ optimizer:
387
+ type: adam
388
+ adam:
389
+ lr: 0.0001
390
+ weight_decay: 0.0
391
+ losses:
392
+ debug: false
393
+ keypoint2d:
394
+ type: l2
395
+ weight:
396
+ - 0.0
397
+ squared: true
398
+ translated: false
399
+ weighted: true
400
+ d1_aggregation: sum
401
+ init_pose:
402
+ type: l2
403
+ weight:
404
+ - 0.0
405
+ squared: true
406
+ translated: false
407
+ weighted: false
408
+ d1_aggregation: sum
409
+ init_shape:
410
+ type: l2
411
+ weight:
412
+ - 0.0
413
+ squared: true
414
+ translated: false
415
+ weighted: false
416
+ d1_aggregation: sum
417
+ kl:
418
+ type: ''
419
+ weight:
420
+ - 0.0
421
+ anneal_start: 0
422
+ anneal_end: -1
423
+ pseudogt_pose:
424
+ type: l2
425
+ weight:
426
+ - 10.0
427
+ squared: true
428
+ translated: false
429
+ weighted: false
430
+ d1_aggregation: sum
431
+ pseudogt_shape:
432
+ type: l2
433
+ weight:
434
+ - 0.1
435
+ squared: true
436
+ translated: false
437
+ weighted: false
438
+ d1_aggregation: sum
439
+ pseudogt_transl:
440
+ type: l2
441
+ weight:
442
+ - 1.0
443
+ squared: true
444
+ translated: false
445
+ weighted: false
446
+ d1_aggregation: sum
447
+ pseudogt_v2v:
448
+ type: l2
449
+ weight:
450
+ - 1000.0
451
+ squared: true
452
+ translated: false
453
+ weighted: false
454
+ d1_aggregation: sum
455
+ pseudogt_j2j:
456
+ type: l2
457
+ weight:
458
+ - 0.0
459
+ squared: true
460
+ translated: false
461
+ weighted: false
462
+ d1_aggregation: sum
463
+ hhc_contact:
464
+ type: hhcmap
465
+ weight:
466
+ - 0.0
467
+ region_aggregation_type: sum
468
+ r2r_dist_type: min
469
+ squared_dist: true
470
+ region_to_vertex: essentials/contact/flickrci3ds_r75_rid_to_smplx_vid.pkl
471
+ hhc_contact_general:
472
+ type: hhcgen
473
+ weight:
474
+ - 0.0
475
+ region_aggregation_type: sum
476
+ r2r_dist_type: min
477
+ squared_dist: true
478
+ body_model_utils_folder: essentials/body_model_utils
479
+ model_type: smplx
480
+ pose_prior:
481
+ type: gmm
482
+ weight:
483
+ - 0.0
484
+ prior_folder: essentials/priors
485
+ num_gaussians: 8
486
+ epsilon: 1.0e-16
487
+ use_merged: true
488
+ shape_prior:
489
+ type: l2
490
+ weight:
491
+ - 0.0
492
+ squared: true
493
+ translated: false
494
+ weighted: false
495
+ d1_aggregation: sum
496
+ ground_plane:
497
+ type: l2
498
+ weight:
499
+ - 0.0
500
+ squared: true
501
+ translated: false
502
+ weighted: false
503
+ d1_aggregation: sum
504
+ cmap:
505
+ type: cmap
506
+ weight:
507
+ - 0.0
508
+ r2r_dist_type: test
509
+ cmap_heat_smpl:
510
+ type: ''
511
+ weight:
512
+ - 0.0
513
+ cmap_heat_token:
514
+ type: ''
515
+ weight:
516
+ - 0.0
517
+ cmap_binary_smpl:
518
+ type: ''
519
+ weight:
520
+ - 0.0
521
+ cmap_binary_token:
522
+ type: ''
523
+ weight:
524
+ - 0.0
525
+ diffusion_prior_orient:
526
+ type: l2
527
+ weight:
528
+ - 0.0
529
+ squared: true
530
+ translated: false
531
+ weighted: false
532
+ d1_aggregation: sum
533
+ diffusion_prior_pose:
534
+ type: l2
535
+ weight:
536
+ - 0.0
537
+ squared: true
538
+ translated: false
539
+ weighted: false
540
+ d1_aggregation: sum
541
+ diffusion_prior_shape:
542
+ type: l2
543
+ weight:
544
+ - 0.0
545
+ squared: true
546
+ translated: false
547
+ weighted: false
548
+ d1_aggregation: sum
549
+ diffusion_prior_scale:
550
+ type: l2
551
+ weight:
552
+ - 0.0
553
+ squared: true
554
+ translated: false
555
+ weighted: false
556
+ d1_aggregation: sum
557
+ diffusion_prior_transl:
558
+ type: l2
559
+ weight:
560
+ - 0.0
561
+ squared: true
562
+ translated: false
563
+ weighted: false
564
+ d1_aggregation: sum
565
+ diffusion_prior_v2v:
566
+ type: l2
567
+ weight:
568
+ - 0.0
569
+ squared: true
570
+ translated: false
571
+ weighted: false
572
+ d1_aggregation: sum
573
+ vae_prior_latent:
574
+ type: l2
575
+ weight:
576
+ - 0.0
577
+ squared: true
578
+ translated: false
579
+ weighted: false
580
+ d1_aggregation: sum
581
+ experiment:
582
+ contact_rep: bev_contact_heat
583
+ rotrep: sixd
584
+ in_data: pgt
585
+ token_setup: H0PH1P
586
+ num_contact_tokens: 0
587
+ smpl_mask: []
588
+ relative_transl: true
589
+ relative_orient: false
590
+ guidance_params:
591
+ - bev
592
+ guidance_param_nc: 0.5
593
+ guidance_all_nc: 0.2
594
+ guidance_no_nc: 0.7
595
+ hmr:
596
+ load_pretrained: true
597
+ pretrained_type: resnet50
598
+ smpl_mean_params: essentials/spin/smpl_mean_params.npz
599
+ resnet:
600
+ depth: 50
601
+ pretrained: true
602
+ mlp:
603
+ in_channels: 10
604
+ hidden_channels:
605
+ - 8
606
+ - 6
607
+ - 4
608
+ - 2
609
+ bias: true
610
+ dropout: 0.0
611
+ vit:
612
+ image_size: 224
613
+ patch_size: 16
614
+ num_classes: 1000
615
+ dim: 1024
616
+ depth: 6
617
+ heads: 8
618
+ mlp_dim: 2048
619
+ dropout: 0.1
620
+ emb_dropout: 0.1
621
+ dim_head: 64
622
+ bev:
623
+ bv_with_fv_condition: true
624
+ add_offsetmap: true
625
+ add_depth_encoding: true
626
+ backbone_type: hrnet_32
627
+ hrnet_pretrain_path: hrbet_32.pth
628
+ params_num: 146
629
+ outmap_size: 128
630
+ cam_dim: 3
631
+ num_center_maps: 1
632
+ coord_maps_size: 128
633
+ centermap_size: 64
634
+ head_block_num: 2
635
+ head_num_channels: 128
636
+ bv_center_num_block: 2
637
+ bv_center_momentum: 0.1
638
+ transformer_dropout_ratio: 0.2
639
+ transformer_num_channels: 512
640
+ max_person: 12
641
+ conf_thresh: 0.1
642
+ ae:
643
+ in_channels: 5625
644
+ num_neurons: 4000
645
+ hidden_channels: 1000
646
+ dropout: 0.1
647
+ transformer:
648
+ dim: 146
649
+ depth: 6
650
+ heads: 1
651
+ mlp_dim: 2048
652
+ dropout: 0.1
653
+ diffusion_transformer:
654
+ dim: 152
655
+ depth: 6
656
+ heads: 8
657
+ mlp_dim: 500
658
+ dropout: 0.1
659
+ use_positional_encoding: false
660
+ use_positional_embedding: false
661
+ use_human_embedding: true
662
+ use_param_embedding: true
663
+ max_tokens: 100
664
+ use_cross_attention: false
665
+ share_linear_layers: false
666
+ encode_target: false
667
+ hhcc:
668
+ pretrain_path: essentials/bev/BEV.pth
669
+ vae_mlp:
670
+ d_model: null
671
+ d_latent: 64
672
+ d_features: 256
673
+ n_enc_layers: 2
674
+ n_dec_layers: 2
675
+ dropout: 0.1
676
+ embed_features: false
677
+ embed_id: false
678
+ enc_pos: false
679
+ autoencoder:
680
+ in_channels: 5625
681
+ hidden_channels: 1000
682
+ depth: 4
683
+ ae_mlp:
684
+ d_model: null
685
+ d_latent: 64
686
+ d_features: 256
687
+ n_enc_layers: 2
688
+ n_dec_layers: 2
689
+ dropout: 0.1
690
+ embed_features: false
691
+ embed_id: false
692
+ enc_pos: false
693
+ optimization:
694
+ type: smplifydc
695
+ print_loss: true
696
+ render_iters: false
697
+ use_gt_contact_map: true
698
+ use_vae: false
699
+ pretrained_vae_cfg: ''
700
+ pretrained_vae_ckpt: ''
701
+ use_diffusion: false
702
+ pretrained_diffusion_model_cfg: ''
703
+ pretrained_diffusion_model_ckpt: ''
704
+ sds_type: fixed
705
+ sds_t_fixed: 20
706
+ sds_t_range:
707
+ - 25
708
+ - 75
709
+ sds_t_adaptive_i:
710
+ - 1.0
711
+ - 0.8
712
+ - 0.6
713
+ - 0.4
714
+ - 0.2
715
+ sds_t_adaptive_t:
716
+ - 100
717
+ - 80
718
+ - 60
719
+ - 40
720
+ - 20
721
+ optimizer:
722
+ type: adam
723
+ adam:
724
+ lr: 1.0
725
+ weight_decay: 0.0
726
+ losses:
727
+ debug: false
728
+ keypoint2d:
729
+ type: l2
730
+ weight:
731
+ - 0.0
732
+ squared: true
733
+ translated: false
734
+ weighted: true
735
+ d1_aggregation: sum
736
+ init_pose:
737
+ type: l2
738
+ weight:
739
+ - 0.0
740
+ squared: true
741
+ translated: false
742
+ weighted: false
743
+ d1_aggregation: sum
744
+ init_shape:
745
+ type: l2
746
+ weight:
747
+ - 0.0
748
+ squared: true
749
+ translated: false
750
+ weighted: false
751
+ d1_aggregation: sum
752
+ kl:
753
+ type: ''
754
+ weight:
755
+ - 0.0
756
+ anneal_start: 0
757
+ anneal_end: -1
758
+ pseudogt_pose:
759
+ type: l2
760
+ weight:
761
+ - 0.0
762
+ squared: true
763
+ translated: false
764
+ weighted: false
765
+ d1_aggregation: sum
766
+ pseudogt_shape:
767
+ type: l2
768
+ weight:
769
+ - 0.0
770
+ squared: true
771
+ translated: false
772
+ weighted: false
773
+ d1_aggregation: sum
774
+ pseudogt_transl:
775
+ type: l2
776
+ weight:
777
+ - 0.0
778
+ squared: true
779
+ translated: false
780
+ weighted: false
781
+ d1_aggregation: sum
782
+ pseudogt_v2v:
783
+ type: l2
784
+ weight:
785
+ - 0.0
786
+ squared: true
787
+ translated: false
788
+ weighted: false
789
+ d1_aggregation: sum
790
+ pseudogt_j2j:
791
+ type: l2
792
+ weight:
793
+ - 0.0
794
+ squared: true
795
+ translated: false
796
+ weighted: false
797
+ d1_aggregation: sum
798
+ hhc_contact:
799
+ type: hhcmap
800
+ weight:
801
+ - 0.0
802
+ region_aggregation_type: sum
803
+ r2r_dist_type: min
804
+ squared_dist: true
805
+ region_to_vertex: essentials/contact/flickrci3ds_r75_rid_to_smplx_vid.pkl
806
+ hhc_contact_general:
807
+ type: hhcgen
808
+ weight:
809
+ - 0.0
810
+ region_aggregation_type: sum
811
+ r2r_dist_type: min
812
+ squared_dist: true
813
+ body_model_utils_folder: essentials/body_model_utils
814
+ model_type: smplx
815
+ pose_prior:
816
+ type: gmm
817
+ weight:
818
+ - 0.0
819
+ prior_folder: essentials/priors
820
+ num_gaussians: 8
821
+ epsilon: 1.0e-16
822
+ use_merged: true
823
+ shape_prior:
824
+ type: l2
825
+ weight:
826
+ - 0.0
827
+ squared: true
828
+ translated: false
829
+ weighted: false
830
+ d1_aggregation: sum
831
+ ground_plane:
832
+ type: l2
833
+ weight:
834
+ - 0.0
835
+ squared: true
836
+ translated: false
837
+ weighted: false
838
+ d1_aggregation: sum
839
+ cmap:
840
+ type: cmap
841
+ weight:
842
+ - 0.0
843
+ r2r_dist_type: test
844
+ cmap_heat_smpl:
845
+ type: ''
846
+ weight:
847
+ - 0.0
848
+ cmap_heat_token:
849
+ type: ''
850
+ weight:
851
+ - 0.0
852
+ cmap_binary_smpl:
853
+ type: ''
854
+ weight:
855
+ - 0.0
856
+ cmap_binary_token:
857
+ type: ''
858
+ weight:
859
+ - 0.0
860
+ diffusion_prior_orient:
861
+ type: l2
862
+ weight:
863
+ - 0.0
864
+ squared: true
865
+ translated: false
866
+ weighted: false
867
+ d1_aggregation: sum
868
+ diffusion_prior_pose:
869
+ type: l2
870
+ weight:
871
+ - 0.0
872
+ squared: true
873
+ translated: false
874
+ weighted: false
875
+ d1_aggregation: sum
876
+ diffusion_prior_shape:
877
+ type: l2
878
+ weight:
879
+ - 0.0
880
+ squared: true
881
+ translated: false
882
+ weighted: false
883
+ d1_aggregation: sum
884
+ diffusion_prior_scale:
885
+ type: l2
886
+ weight:
887
+ - 0.0
888
+ squared: true
889
+ translated: false
890
+ weighted: false
891
+ d1_aggregation: sum
892
+ diffusion_prior_transl:
893
+ type: l2
894
+ weight:
895
+ - 0.0
896
+ squared: true
897
+ translated: false
898
+ weighted: false
899
+ d1_aggregation: sum
900
+ diffusion_prior_v2v:
901
+ type: l2
902
+ weight:
903
+ - 0.0
904
+ squared: true
905
+ translated: false
906
+ weighted: false
907
+ d1_aggregation: sum
908
+ vae_prior_latent:
909
+ type: l2
910
+ weight:
911
+ - 0.0
912
+ squared: true
913
+ translated: false
914
+ weighted: false
915
+ d1_aggregation: sum
916
+ smplifydc:
917
+ use_contact: true
918
+ hhcs:
919
+ use_contact: true
920
+ max_iters:
921
+ - 100
922
+ - 100
923
+ num_prev_steps: 100
924
+ slope_tol: -1.0e-05
925
+ diffusion:
926
+ steps: 1000
927
+ noise_schedule: cosine
928
+ rescale_timesteps: false
929
+ timestep_respacing: ''
930
+ loss_type: custom
931
+ model_mean_type: start_x
932
+ model_var_type: fixed_large
933
+ training:
934
+ max_epochs: 5000
935
+ max_duration: .inf
936
+ num_workers: 8
937
+ pin_memory: false
938
+ shuffle_train: true
939
+ pretrained: ''
940
+ clip_grad_norm: 1.0
941
+ train: true
942
+ eval_val: false
943
+ eval_test: false
944
+ evaluation:
945
+ checkpoint_metric: total_loss
946
+ metrics:
947
+ - v2v
948
+ - mpjpe
949
+ - pa_mpjpe
950
+ - pairwise_pa_mpjpe
951
+ per_person_metrics:
952
+ - v2v
953
+ - mpjpe
954
+ - scale_mpjpe
955
+ - pa_mpjpe
956
+ generative_metrics:
957
+ - gen_diversity
958
+ - gen_fid
959
+ - gen_contact_and_isect
960
+ - gen_tsne
961
+ num_samples: 512
962
+ v2v:
963
+ name: PointError
964
+ alignment: root
965
+ mpjpe:
966
+ name: PointError
967
+ alignment: root
968
+ scale_mpjpe:
969
+ name: PointError
970
+ alignment: scale
971
+ pa_mpjpe:
972
+ name: PointError
973
+ alignment: procrustes
974
+ pairwise_pa_mpjpe:
975
+ name: PointError
976
+ alignment: procrustes
977
+ cmap_dist:
978
+ name: ContactMapDistError
979
+ region_to_vertex: essentials/contact/flickrci3ds_r75_rid_to_smplx_vid.pkl
980
+ cmap_iou:
981
+ name: ContactIOU
982
+ gen_diversity:
983
+ name: GenDiversity
984
+ gen_fid:
985
+ fid_model_path: ''
986
+ fid_model_cfg: ''
987
+ name: GenFID
988
+ gen_contact_and_isect:
989
+ name: GenContactIsect
990
+ gen_tsne:
991
+ name: GentSNE
992
+ visualization:
993
+ renderer:
994
+ type: pytorch3d
995
+ image_height: 224
996
+ image_width: 224
997
+ mesh_color: light_blue
998
+ pytorch3d:
999
+ blur_radius: 0.0
1000
+ faces_per_pixel: 5
1001
+ light_location:
1002
+ - - 0.0
1003
+ - 0.0
1004
+ - -0.5
1005
+ plotter:
1006
+ iw: 224
1007
+ ih: 224
1008
+ style:
1009
+ type: latex
1010
+ logging:
1011
+ base_folder: /is/cluster/work/lmueller2/results/HHC/diffusion/cvpr2024
1012
+ run: 2023_11_11_run_buddi_cfg_config_buddi_v02_cond_bev_017
1013
+ images_folder: images
1014
+ summaries_folder: summaries
1015
+ checkpoint_folder: checkpoints
1016
+ validation_folder: validation
1017
+ result_folder: results
1018
+ summaries_freq: 100.0
1019
+ checkpoint_freq: 100.0
1020
+ logger_level: INFO
1021
+ logger: wandb
1022
+ project_name: HumanHumanContactGenerativeModel
1023
+ run_id: lscw0945
1024
+ wandb_api_key_path: .wandb/api.txt
essentials/buddi/buddi_unconditional.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:038025897b5a3d1ff3ae4e10e5ab84dba070661bd5000681eb8cf5cde6dbeb28
3
+ size 28715795
essentials/buddi/buddi_unconditional.yaml ADDED
@@ -0,0 +1,1023 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ batch_size: 512
2
+ device: cuda
3
+ body_model:
4
+ type: smplx
5
+ smpl_family_folder: essentials/body_models
6
+ smpl:
7
+ num_vertices: 6890
8
+ init:
9
+ ext: pkl
10
+ batch_size: 1
11
+ gender: neutral
12
+ age: adult
13
+ kid_template_path: essentials/body_models/smil/smplx_kid_template.npy
14
+ betas:
15
+ dim: 10
16
+ create: true
17
+ body_pose:
18
+ create: true
19
+ global_orient:
20
+ create: true
21
+ transl:
22
+ create: true
23
+ joint_mapper:
24
+ use: false
25
+ type: smpl_to_openpose
26
+ smpl_to_openpose:
27
+ use_hands: true
28
+ use_face: true
29
+ use_face_contour: false
30
+ openpose_format: coco25
31
+ smplh:
32
+ num_vertices: 6890
33
+ init:
34
+ ext: pkl
35
+ batch_size: 1
36
+ gender: neutral
37
+ age: adult
38
+ kid_template_path: essentials/body_models/smil/smplx_kid_template.npy
39
+ betas:
40
+ dim: 10
41
+ create: true
42
+ body_pose:
43
+ create: true
44
+ global_orient:
45
+ create: true
46
+ transl:
47
+ create: true
48
+ joint_mapper:
49
+ use: false
50
+ type: smpl_to_openpose
51
+ smpl_to_openpose:
52
+ use_hands: true
53
+ use_face: true
54
+ use_face_contour: false
55
+ openpose_format: coco25
56
+ left_hand_pose:
57
+ use_pca: true
58
+ num_pca_comps: 6
59
+ flat_hand_mean: false
60
+ create: true
61
+ right_hand_pose:
62
+ use_pca: true
63
+ num_pca_comps: 6
64
+ flat_hand_mean: false
65
+ create: true
66
+ use_compressed: true
67
+ smplx:
68
+ num_vertices: 10475
69
+ init:
70
+ ext: npz
71
+ batch_size: 1
72
+ gender: neutral
73
+ age: kid
74
+ kid_template_path: essentials/body_models/smil/smplx_kid_template.npy
75
+ betas:
76
+ dim: 10
77
+ create: true
78
+ body_pose:
79
+ create: true
80
+ global_orient:
81
+ create: true
82
+ transl:
83
+ create: true
84
+ joint_mapper:
85
+ use: true
86
+ type: smpl_to_openpose
87
+ smpl_to_openpose:
88
+ use_hands: false
89
+ use_face: false
90
+ use_face_contour: false
91
+ openpose_format: coco25
92
+ left_hand_pose:
93
+ use_pca: true
94
+ num_pca_comps: 6
95
+ flat_hand_mean: false
96
+ create: true
97
+ right_hand_pose:
98
+ use_pca: true
99
+ num_pca_comps: 6
100
+ flat_hand_mean: false
101
+ create: true
102
+ use_compressed: true
103
+ hand_vertex_ids_path: ''
104
+ expression:
105
+ dim: 10
106
+ create: true
107
+ jaw_pose:
108
+ create: true
109
+ leye_pose:
110
+ create: true
111
+ reye_pose:
112
+ create: true
113
+ datasets:
114
+ processing:
115
+ normalization_mean:
116
+ - 0.485
117
+ - 0.456
118
+ - 0.406
119
+ normalization_std:
120
+ - 0.229
121
+ - 0.224
122
+ - 0.225
123
+ resolution: 224
124
+ use: false
125
+ load_image: false
126
+ train_names:
127
+ - chi3d
128
+ - hi4d
129
+ - flickrci3ds
130
+ train_composition:
131
+ - 0.2
132
+ - 0.2
133
+ - 0.6
134
+ augmentation:
135
+ use: true
136
+ mirror: 0.5
137
+ noise: 0.4
138
+ rotation: 30.0
139
+ scale: 0.25
140
+ swap: 0.5
141
+ val_names:
142
+ - hi4d
143
+ - chi3d
144
+ - flickrci3ds
145
+ test_names: []
146
+ flickrci3dsd:
147
+ body_model_path: essentials/body_models
148
+ original_data_folder: datasets/original/FlickrCI3D_Signatures
149
+ imar_vision_datasets_tools_folder: essentials/imar_vision_datasets_tools
150
+ processed_data_folder: datasets/processed/FlickrCI3D_Signatures
151
+ max_count_regions_in_contact: 25
152
+ number_of_regions: 75
153
+ image_folder: images
154
+ bev_folder: bev
155
+ openpose_folder: keypoints/keypoints
156
+ vitpose_folder: vitpose
157
+ vitdet_folder: vitdet
158
+ image_format: png
159
+ pseudogt_folder: pseudogt/summaries
160
+ overfit: false
161
+ overfit_num_samples: 12
162
+ init_pose_from_bev: false
163
+ features:
164
+ is_itw: true
165
+ has_dhhc_class: false
166
+ has_dhhc_sig: true
167
+ has_dsc_sig: false
168
+ has_dsc_class: false
169
+ has_gt_kpts: false
170
+ has_op_kpts: true
171
+ has_gt_joints: false
172
+ has_gender: false
173
+ has_gt_smpl_shape: false
174
+ has_gt_smpl_pose: false
175
+ has_pgt_smpl_shape: false
176
+ has_pgt_smpl_pose: false
177
+ flickrci3ds:
178
+ body_model_path: essentials/body_models
179
+ original_data_folder: datasets/original/FlickrCI3D_Signatures
180
+ imar_vision_datasets_tools_folder: essentials/imar_vision_datasets_tools
181
+ processed_data_folder: datasets/processed/FlickrCI3D_Signatures
182
+ max_count_regions_in_contact: 25
183
+ number_of_regions: 75
184
+ image_folder: images
185
+ bev_folder: bev
186
+ openpose_folder: openpose
187
+ vitpose_folder: vitpose
188
+ vitposeplus_folder: vitposeplus
189
+ vitdet_folder: vitdet
190
+ image_format: png
191
+ pseudogt_folder: hhcs_opti/run4_20231015/results
192
+ overfit: false
193
+ overfit_num_samples: 12
194
+ adult_only: false
195
+ child_only: false
196
+ features:
197
+ is_itw: true
198
+ has_dhhc_class: false
199
+ has_dhhc_sig: true
200
+ has_dsc_sig: false
201
+ has_dsc_class: false
202
+ has_gt_kpts: false
203
+ has_op_kpts: true
204
+ has_gt_joints: false
205
+ has_gender: false
206
+ has_gt_smpl_shape: false
207
+ has_gt_smpl_pose: false
208
+ has_pgt_smpl_shape: false
209
+ has_pgt_smpl_pose: false
210
+ flickrci3ds_adult:
211
+ body_model_path: essentials/body_models
212
+ original_data_folder: datasets/original/FlickrCI3D_Signatures
213
+ imar_vision_datasets_tools_folder: essentials/imar_vision_datasets_tools
214
+ processed_data_folder: datasets/processed/FlickrCI3D_Signatures
215
+ max_count_regions_in_contact: 25
216
+ number_of_regions: 75
217
+ image_folder: images
218
+ bev_folder: bev
219
+ openpose_folder: openpose
220
+ vitpose_folder: vitpose
221
+ vitposeplus_folder: vitposeplus
222
+ vitdet_folder: vitdet
223
+ image_format: png
224
+ pseudogt_folder: pseudogt/summaries
225
+ overfit: false
226
+ overfit_num_samples: 12
227
+ adult_only: true
228
+ child_only: false
229
+ features:
230
+ is_itw: true
231
+ has_dhhc_class: false
232
+ has_dhhc_sig: true
233
+ has_dsc_sig: false
234
+ has_dsc_class: false
235
+ has_gt_kpts: false
236
+ has_op_kpts: true
237
+ has_gt_joints: false
238
+ has_gender: false
239
+ has_gt_smpl_shape: false
240
+ has_gt_smpl_pose: false
241
+ has_pgt_smpl_shape: false
242
+ has_pgt_smpl_pose: false
243
+ flickrci3ds_child:
244
+ body_model_path: essentials/body_models
245
+ original_data_folder: datasets/original/FlickrCI3D_Signatures
246
+ imar_vision_datasets_tools_folder: essentials/imar_vision_datasets_tools
247
+ processed_data_folder: datasets/processed/FlickrCI3D_Signatures
248
+ max_count_regions_in_contact: 25
249
+ number_of_regions: 75
250
+ image_folder: images
251
+ bev_folder: bev
252
+ openpose_folder: openpose
253
+ vitpose_folder: vitpose
254
+ vitposeplus_folder: vitposeplus
255
+ vitdet_folder: vitdet
256
+ image_format: png
257
+ pseudogt_folder: pseudogt/summaries
258
+ overfit: false
259
+ overfit_num_samples: 12
260
+ adult_only: false
261
+ child_only: true
262
+ features:
263
+ is_itw: true
264
+ has_dhhc_class: false
265
+ has_dhhc_sig: true
266
+ has_dsc_sig: false
267
+ has_dsc_class: false
268
+ has_gt_kpts: false
269
+ has_op_kpts: true
270
+ has_gt_joints: false
271
+ has_gender: false
272
+ has_gt_smpl_shape: false
273
+ has_gt_smpl_pose: false
274
+ has_pgt_smpl_shape: false
275
+ has_pgt_smpl_pose: false
276
+ flickrci3dc:
277
+ original_data_folder: datasets/original/FlickrCI3D_Classification
278
+ imar_vision_datasets_tools_folder: essentials/imar_vision_datasets_tools
279
+ processed_data_folder: datasets/processed/FlickrCI3D_Classification
280
+ image_folder: images
281
+ image_format: png
282
+ bev_folder: bev
283
+ features:
284
+ is_itw: true
285
+ has_dhhc_class: true
286
+ has_dhhc_sig: false
287
+ has_dsc_sig: false
288
+ has_dsc_class: false
289
+ has_gt_kpts: false
290
+ has_op_kpts: true
291
+ has_gt_joints: false
292
+ has_gender: false
293
+ has_gt_smpl_shape: false
294
+ has_gt_smpl_pose: false
295
+ has_pgt_smpl_shape: false
296
+ has_pgt_smpl_pose: false
297
+ overfit: false
298
+ overfit_num_samples: 64
299
+ chi3d:
300
+ original_data_folder: datasets/original/CHI3D
301
+ processed_data_folder: datasets/processed/CHI3D
302
+ imar_vision_datasets_tools_folder: essentials/imar_vision_datasets_tools
303
+ max_count_regions_in_contact: 25
304
+ number_of_regions: 75
305
+ image_folder: images
306
+ bev_folder: bev
307
+ openpose_folder: keypoints/keypoints
308
+ vitpose_folder: vitpose
309
+ vitdet_folder: vitdet
310
+ image_format: png
311
+ pseudogt_folder: pseudogt/summaries
312
+ overfit: false
313
+ overfit_num_samples: 12
314
+ load_single_camera: false
315
+ load_from_scratch_single_camera: false
316
+ load_contact_frame_only: true
317
+ load_unit_glob_and_transl: true
318
+ features:
319
+ is_itw: false
320
+ has_dhhc_class: false
321
+ has_dhhc_sig: true
322
+ has_dsc_sig: false
323
+ has_dsc_class: false
324
+ has_gt_kpts: false
325
+ has_op_kpts: true
326
+ has_gt_joints: false
327
+ has_gender: false
328
+ has_gt_smpl_shape: false
329
+ has_gt_smpl_pose: false
330
+ has_pgt_smpl_shape: false
331
+ has_pgt_smpl_pose: false
332
+ hi4d:
333
+ original_data_folder: datasets/original/Hi4D
334
+ processed_data_folder: datasets/processed/Hi4D
335
+ image_folder: images
336
+ bev_folder: bev
337
+ openpose_folder: keypoints/keypoints
338
+ vitpose_folder: keypoints/vitposeplus
339
+ image_format: jpg
340
+ overfit: false
341
+ overfit_num_samples: 12
342
+ load_single_camera: false
343
+ load_from_scratch_single_camera: false
344
+ load_unit_glob_and_transl: true
345
+ features:
346
+ is_itw: false
347
+ has_dhhc_class: false
348
+ has_dhhc_sig: true
349
+ has_dsc_sig: false
350
+ has_dsc_class: false
351
+ has_gt_kpts: false
352
+ has_op_kpts: true
353
+ has_gt_joints: false
354
+ has_gender: false
355
+ has_gt_smpl_shape: false
356
+ has_gt_smpl_pose: false
357
+ has_pgt_smpl_shape: false
358
+ has_pgt_smpl_pose: false
359
+ demo:
360
+ original_data_folder: ''
361
+ number_of_regions: 75
362
+ image_folder: images
363
+ bev_folder: bev
364
+ openpose_folder: keypoints/keypoints
365
+ vitpose_folder: vitpose
366
+ image_format: png
367
+ image_name_select: ''
368
+ has_gt_contact_annotation: false
369
+ imar_vision_datasets_tools_folder: essentials/imar_vision_datasets_tools
370
+ unique_keypoint_match: true
371
+ camera:
372
+ type: perspective
373
+ perspective:
374
+ afov_horizontal: 60.0
375
+ pitch: 0.0
376
+ yaw: 0.0
377
+ roll: 0.0
378
+ tx: 0.0
379
+ ty: 0.0
380
+ tz: 0.0
381
+ iw: 224
382
+ ih: 224
383
+ model:
384
+ regressor:
385
+ type: diffusion_transformer
386
+ optimizer:
387
+ type: adam
388
+ adam:
389
+ lr: 0.0001
390
+ weight_decay: 0.0
391
+ losses:
392
+ debug: false
393
+ keypoint2d:
394
+ type: l2
395
+ weight:
396
+ - 0.0
397
+ squared: true
398
+ translated: false
399
+ weighted: true
400
+ d1_aggregation: sum
401
+ init_pose:
402
+ type: l2
403
+ weight:
404
+ - 0.0
405
+ squared: true
406
+ translated: false
407
+ weighted: false
408
+ d1_aggregation: sum
409
+ init_shape:
410
+ type: l2
411
+ weight:
412
+ - 0.0
413
+ squared: true
414
+ translated: false
415
+ weighted: false
416
+ d1_aggregation: sum
417
+ kl:
418
+ type: ''
419
+ weight:
420
+ - 0.0
421
+ anneal_start: 0
422
+ anneal_end: -1
423
+ pseudogt_pose:
424
+ type: l2
425
+ weight:
426
+ - 10.0
427
+ squared: true
428
+ translated: false
429
+ weighted: false
430
+ d1_aggregation: sum
431
+ pseudogt_shape:
432
+ type: l2
433
+ weight:
434
+ - 0.1
435
+ squared: true
436
+ translated: false
437
+ weighted: false
438
+ d1_aggregation: sum
439
+ pseudogt_transl:
440
+ type: l2
441
+ weight:
442
+ - 1.0
443
+ squared: true
444
+ translated: false
445
+ weighted: false
446
+ d1_aggregation: sum
447
+ pseudogt_v2v:
448
+ type: l2
449
+ weight:
450
+ - 100.0
451
+ squared: true
452
+ translated: false
453
+ weighted: false
454
+ d1_aggregation: sum
455
+ pseudogt_j2j:
456
+ type: l2
457
+ weight:
458
+ - 0.0
459
+ squared: true
460
+ translated: false
461
+ weighted: false
462
+ d1_aggregation: sum
463
+ hhc_contact:
464
+ type: hhcmap
465
+ weight:
466
+ - 0.0
467
+ region_aggregation_type: sum
468
+ r2r_dist_type: min
469
+ squared_dist: true
470
+ region_to_vertex: essentials/contact/flickrci3ds_r75_rid_to_smplx_vid.pkl
471
+ hhc_contact_general:
472
+ type: hhcgen
473
+ weight:
474
+ - 0.0
475
+ region_aggregation_type: sum
476
+ r2r_dist_type: min
477
+ squared_dist: true
478
+ body_model_utils_folder: essentials/body_model_utils
479
+ model_type: smplx
480
+ pose_prior:
481
+ type: gmm
482
+ weight:
483
+ - 0.0
484
+ prior_folder: essentials/priors
485
+ num_gaussians: 8
486
+ epsilon: 1.0e-16
487
+ use_merged: true
488
+ shape_prior:
489
+ type: l2
490
+ weight:
491
+ - 0.0
492
+ squared: true
493
+ translated: false
494
+ weighted: false
495
+ d1_aggregation: sum
496
+ ground_plane:
497
+ type: l2
498
+ weight:
499
+ - 0.0
500
+ squared: true
501
+ translated: false
502
+ weighted: false
503
+ d1_aggregation: sum
504
+ cmap:
505
+ type: cmap
506
+ weight:
507
+ - 0.0
508
+ r2r_dist_type: test
509
+ cmap_heat_smpl:
510
+ type: ''
511
+ weight:
512
+ - 0.0
513
+ cmap_heat_token:
514
+ type: ''
515
+ weight:
516
+ - 0.0
517
+ cmap_binary_smpl:
518
+ type: ''
519
+ weight:
520
+ - 0.0
521
+ cmap_binary_token:
522
+ type: ''
523
+ weight:
524
+ - 0.0
525
+ diffusion_prior_orient:
526
+ type: l2
527
+ weight:
528
+ - 0.0
529
+ squared: true
530
+ translated: false
531
+ weighted: false
532
+ d1_aggregation: sum
533
+ diffusion_prior_pose:
534
+ type: l2
535
+ weight:
536
+ - 0.0
537
+ squared: true
538
+ translated: false
539
+ weighted: false
540
+ d1_aggregation: sum
541
+ diffusion_prior_shape:
542
+ type: l2
543
+ weight:
544
+ - 0.0
545
+ squared: true
546
+ translated: false
547
+ weighted: false
548
+ d1_aggregation: sum
549
+ diffusion_prior_scale:
550
+ type: l2
551
+ weight:
552
+ - 0.0
553
+ squared: true
554
+ translated: false
555
+ weighted: false
556
+ d1_aggregation: sum
557
+ diffusion_prior_transl:
558
+ type: l2
559
+ weight:
560
+ - 0.0
561
+ squared: true
562
+ translated: false
563
+ weighted: false
564
+ d1_aggregation: sum
565
+ diffusion_prior_v2v:
566
+ type: l2
567
+ weight:
568
+ - 0.0
569
+ squared: true
570
+ translated: false
571
+ weighted: false
572
+ d1_aggregation: sum
573
+ vae_prior_latent:
574
+ type: l2
575
+ weight:
576
+ - 0.0
577
+ squared: true
578
+ translated: false
579
+ weighted: false
580
+ d1_aggregation: sum
581
+ experiment:
582
+ contact_rep: bev_contact_heat
583
+ rotrep: sixd
584
+ in_data: pgt
585
+ token_setup: H0PH1P
586
+ num_contact_tokens: 0
587
+ smpl_mask: []
588
+ relative_transl: true
589
+ relative_orient: false
590
+ guidance_params: []
591
+ guidance_param_nc: 0.5
592
+ guidance_all_nc: 0.1
593
+ guidance_no_nc: 0.8
594
+ hmr:
595
+ load_pretrained: true
596
+ pretrained_type: resnet50
597
+ smpl_mean_params: essentials/spin/smpl_mean_params.npz
598
+ resnet:
599
+ depth: 50
600
+ pretrained: true
601
+ mlp:
602
+ in_channels: 10
603
+ hidden_channels:
604
+ - 8
605
+ - 6
606
+ - 4
607
+ - 2
608
+ bias: true
609
+ dropout: 0.0
610
+ vit:
611
+ image_size: 224
612
+ patch_size: 16
613
+ num_classes: 1000
614
+ dim: 1024
615
+ depth: 6
616
+ heads: 8
617
+ mlp_dim: 2048
618
+ dropout: 0.1
619
+ emb_dropout: 0.1
620
+ dim_head: 64
621
+ bev:
622
+ bv_with_fv_condition: true
623
+ add_offsetmap: true
624
+ add_depth_encoding: true
625
+ backbone_type: hrnet_32
626
+ hrnet_pretrain_path: hrbet_32.pth
627
+ params_num: 146
628
+ outmap_size: 128
629
+ cam_dim: 3
630
+ num_center_maps: 1
631
+ coord_maps_size: 128
632
+ centermap_size: 64
633
+ head_block_num: 2
634
+ head_num_channels: 128
635
+ bv_center_num_block: 2
636
+ bv_center_momentum: 0.1
637
+ transformer_dropout_ratio: 0.2
638
+ transformer_num_channels: 512
639
+ max_person: 12
640
+ conf_thresh: 0.1
641
+ ae:
642
+ in_channels: 5625
643
+ num_neurons: 4000
644
+ hidden_channels: 1000
645
+ dropout: 0.1
646
+ transformer:
647
+ dim: 146
648
+ depth: 6
649
+ heads: 1
650
+ mlp_dim: 2048
651
+ dropout: 0.1
652
+ diffusion_transformer:
653
+ dim: 152
654
+ depth: 6
655
+ heads: 8
656
+ mlp_dim: 500
657
+ dropout: 0.1
658
+ use_positional_encoding: false
659
+ use_positional_embedding: false
660
+ use_human_embedding: true
661
+ use_param_embedding: true
662
+ max_tokens: 100
663
+ use_cross_attention: false
664
+ share_linear_layers: false
665
+ encode_target: false
666
+ hhcc:
667
+ pretrain_path: essentials/bev/BEV.pth
668
+ vae_mlp:
669
+ d_model: null
670
+ d_latent: 64
671
+ d_features: 256
672
+ n_enc_layers: 2
673
+ n_dec_layers: 2
674
+ dropout: 0.1
675
+ embed_features: false
676
+ embed_id: false
677
+ enc_pos: false
678
+ autoencoder:
679
+ in_channels: 5625
680
+ hidden_channels: 1000
681
+ depth: 4
682
+ ae_mlp:
683
+ d_model: null
684
+ d_latent: 64
685
+ d_features: 256
686
+ n_enc_layers: 2
687
+ n_dec_layers: 2
688
+ dropout: 0.1
689
+ embed_features: false
690
+ embed_id: false
691
+ enc_pos: false
692
+ optimization:
693
+ type: smplifydc
694
+ print_loss: true
695
+ render_iters: false
696
+ use_gt_contact_map: true
697
+ use_vae: false
698
+ pretrained_vae_cfg: ''
699
+ pretrained_vae_ckpt: ''
700
+ use_diffusion: false
701
+ pretrained_diffusion_model_cfg: ''
702
+ pretrained_diffusion_model_ckpt: ''
703
+ sds_type: fixed
704
+ sds_t_fixed: 20
705
+ sds_t_range:
706
+ - 25
707
+ - 75
708
+ sds_t_adaptive_i:
709
+ - 1.0
710
+ - 0.8
711
+ - 0.6
712
+ - 0.4
713
+ - 0.2
714
+ sds_t_adaptive_t:
715
+ - 100
716
+ - 80
717
+ - 60
718
+ - 40
719
+ - 20
720
+ optimizer:
721
+ type: adam
722
+ adam:
723
+ lr: 1.0
724
+ weight_decay: 0.0
725
+ losses:
726
+ debug: false
727
+ keypoint2d:
728
+ type: l2
729
+ weight:
730
+ - 0.0
731
+ squared: true
732
+ translated: false
733
+ weighted: true
734
+ d1_aggregation: sum
735
+ init_pose:
736
+ type: l2
737
+ weight:
738
+ - 0.0
739
+ squared: true
740
+ translated: false
741
+ weighted: false
742
+ d1_aggregation: sum
743
+ init_shape:
744
+ type: l2
745
+ weight:
746
+ - 0.0
747
+ squared: true
748
+ translated: false
749
+ weighted: false
750
+ d1_aggregation: sum
751
+ kl:
752
+ type: ''
753
+ weight:
754
+ - 0.0
755
+ anneal_start: 0
756
+ anneal_end: -1
757
+ pseudogt_pose:
758
+ type: l2
759
+ weight:
760
+ - 0.0
761
+ squared: true
762
+ translated: false
763
+ weighted: false
764
+ d1_aggregation: sum
765
+ pseudogt_shape:
766
+ type: l2
767
+ weight:
768
+ - 0.0
769
+ squared: true
770
+ translated: false
771
+ weighted: false
772
+ d1_aggregation: sum
773
+ pseudogt_transl:
774
+ type: l2
775
+ weight:
776
+ - 0.0
777
+ squared: true
778
+ translated: false
779
+ weighted: false
780
+ d1_aggregation: sum
781
+ pseudogt_v2v:
782
+ type: l2
783
+ weight:
784
+ - 0.0
785
+ squared: true
786
+ translated: false
787
+ weighted: false
788
+ d1_aggregation: sum
789
+ pseudogt_j2j:
790
+ type: l2
791
+ weight:
792
+ - 0.0
793
+ squared: true
794
+ translated: false
795
+ weighted: false
796
+ d1_aggregation: sum
797
+ hhc_contact:
798
+ type: hhcmap
799
+ weight:
800
+ - 0.0
801
+ region_aggregation_type: sum
802
+ r2r_dist_type: min
803
+ squared_dist: true
804
+ region_to_vertex: essentials/contact/flickrci3ds_r75_rid_to_smplx_vid.pkl
805
+ hhc_contact_general:
806
+ type: hhcgen
807
+ weight:
808
+ - 0.0
809
+ region_aggregation_type: sum
810
+ r2r_dist_type: min
811
+ squared_dist: true
812
+ body_model_utils_folder: essentials/body_model_utils
813
+ model_type: smplx
814
+ pose_prior:
815
+ type: gmm
816
+ weight:
817
+ - 0.0
818
+ prior_folder: essentials/priors
819
+ num_gaussians: 8
820
+ epsilon: 1.0e-16
821
+ use_merged: true
822
+ shape_prior:
823
+ type: l2
824
+ weight:
825
+ - 0.0
826
+ squared: true
827
+ translated: false
828
+ weighted: false
829
+ d1_aggregation: sum
830
+ ground_plane:
831
+ type: l2
832
+ weight:
833
+ - 0.0
834
+ squared: true
835
+ translated: false
836
+ weighted: false
837
+ d1_aggregation: sum
838
+ cmap:
839
+ type: cmap
840
+ weight:
841
+ - 0.0
842
+ r2r_dist_type: test
843
+ cmap_heat_smpl:
844
+ type: ''
845
+ weight:
846
+ - 0.0
847
+ cmap_heat_token:
848
+ type: ''
849
+ weight:
850
+ - 0.0
851
+ cmap_binary_smpl:
852
+ type: ''
853
+ weight:
854
+ - 0.0
855
+ cmap_binary_token:
856
+ type: ''
857
+ weight:
858
+ - 0.0
859
+ diffusion_prior_orient:
860
+ type: l2
861
+ weight:
862
+ - 0.0
863
+ squared: true
864
+ translated: false
865
+ weighted: false
866
+ d1_aggregation: sum
867
+ diffusion_prior_pose:
868
+ type: l2
869
+ weight:
870
+ - 0.0
871
+ squared: true
872
+ translated: false
873
+ weighted: false
874
+ d1_aggregation: sum
875
+ diffusion_prior_shape:
876
+ type: l2
877
+ weight:
878
+ - 0.0
879
+ squared: true
880
+ translated: false
881
+ weighted: false
882
+ d1_aggregation: sum
883
+ diffusion_prior_scale:
884
+ type: l2
885
+ weight:
886
+ - 0.0
887
+ squared: true
888
+ translated: false
889
+ weighted: false
890
+ d1_aggregation: sum
891
+ diffusion_prior_transl:
892
+ type: l2
893
+ weight:
894
+ - 0.0
895
+ squared: true
896
+ translated: false
897
+ weighted: false
898
+ d1_aggregation: sum
899
+ diffusion_prior_v2v:
900
+ type: l2
901
+ weight:
902
+ - 0.0
903
+ squared: true
904
+ translated: false
905
+ weighted: false
906
+ d1_aggregation: sum
907
+ vae_prior_latent:
908
+ type: l2
909
+ weight:
910
+ - 0.0
911
+ squared: true
912
+ translated: false
913
+ weighted: false
914
+ d1_aggregation: sum
915
+ smplifydc:
916
+ use_contact: true
917
+ hhcs:
918
+ use_contact: true
919
+ max_iters:
920
+ - 100
921
+ - 100
922
+ num_prev_steps: 100
923
+ slope_tol: -1.0e-05
924
+ diffusion:
925
+ steps: 1000
926
+ noise_schedule: cosine
927
+ rescale_timesteps: false
928
+ timestep_respacing: ''
929
+ loss_type: custom
930
+ model_mean_type: start_x
931
+ model_var_type: fixed_large
932
+ training:
933
+ max_epochs: 5000
934
+ max_duration: .inf
935
+ num_workers: 8
936
+ pin_memory: false
937
+ shuffle_train: true
938
+ pretrained: ''
939
+ clip_grad_norm: 1.0
940
+ train: true
941
+ eval_val: false
942
+ eval_test: false
943
+ evaluation:
944
+ checkpoint_metric: total_loss
945
+ metrics:
946
+ - v2v
947
+ - mpjpe
948
+ - pa_mpjpe
949
+ - pairwise_pa_mpjpe
950
+ per_person_metrics:
951
+ - v2v
952
+ - mpjpe
953
+ - scale_mpjpe
954
+ - pa_mpjpe
955
+ generative_metrics:
956
+ - gen_diversity
957
+ - gen_fid
958
+ - gen_contact_and_isect
959
+ - gen_tsne
960
+ num_samples: 512
961
+ v2v:
962
+ name: PointError
963
+ alignment: root
964
+ mpjpe:
965
+ name: PointError
966
+ alignment: root
967
+ scale_mpjpe:
968
+ name: PointError
969
+ alignment: scale
970
+ pa_mpjpe:
971
+ name: PointError
972
+ alignment: procrustes
973
+ pairwise_pa_mpjpe:
974
+ name: PointError
975
+ alignment: procrustes
976
+ cmap_dist:
977
+ name: ContactMapDistError
978
+ region_to_vertex: essentials/contact/flickrci3ds_r75_rid_to_smplx_vid.pkl
979
+ cmap_iou:
980
+ name: ContactIOU
981
+ gen_diversity:
982
+ name: GenDiversity
983
+ gen_fid:
984
+ fid_model_path: ''
985
+ fid_model_cfg: ''
986
+ name: GenFID
987
+ gen_contact_and_isect:
988
+ name: GenContactIsect
989
+ gen_tsne:
990
+ name: GentSNE
991
+ visualization:
992
+ renderer:
993
+ type: pytorch3d
994
+ image_height: 224
995
+ image_width: 224
996
+ mesh_color: light_blue
997
+ pytorch3d:
998
+ blur_radius: 0.0
999
+ faces_per_pixel: 5
1000
+ light_location:
1001
+ - - 0.0
1002
+ - 0.0
1003
+ - -0.5
1004
+ plotter:
1005
+ iw: 224
1006
+ ih: 224
1007
+ style:
1008
+ type: latex
1009
+ logging:
1010
+ base_folder: /is/cluster/work/lmueller2/results/HHC/diffusion/cvpr2024
1011
+ run: 2023_11_13_run_buddi_cfg_config_buddi_v02_014
1012
+ images_folder: images
1013
+ summaries_folder: summaries
1014
+ checkpoint_folder: checkpoints
1015
+ validation_folder: validation
1016
+ result_folder: results
1017
+ summaries_freq: 100.0
1018
+ checkpoint_freq: 100.0
1019
+ logger_level: INFO
1020
+ logger: wandb
1021
+ project_name: HumanHumanContactGenerativeModel
1022
+ run_id: rcbq29q7
1023
+ wandb_api_key_path: .wandb/api.txt
essentials/buddi/config.yaml ADDED
@@ -0,0 +1,1038 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ batch_size: 16
2
+ device: cuda
3
+ body_model:
4
+ type: smplx
5
+ smpl_family_folder: essentials/body_models
6
+ smpl:
7
+ num_vertices: 6890
8
+ init:
9
+ ext: pkl
10
+ batch_size: 1
11
+ gender: neutral
12
+ age: adult
13
+ kid_template_path: essentials/body_models/smil/smplx_kid_template.npy
14
+ betas:
15
+ dim: 10
16
+ create: true
17
+ body_pose:
18
+ create: true
19
+ global_orient:
20
+ create: true
21
+ transl:
22
+ create: true
23
+ joint_mapper:
24
+ use: false
25
+ type: smpl_to_openpose
26
+ smpl_to_openpose:
27
+ use_hands: true
28
+ use_face: true
29
+ use_face_contour: false
30
+ openpose_format: coco25
31
+ smplh:
32
+ num_vertices: 6890
33
+ init:
34
+ ext: pkl
35
+ batch_size: 1
36
+ gender: neutral
37
+ age: adult
38
+ kid_template_path: essentials/body_models/smil/smplx_kid_template.npy
39
+ betas:
40
+ dim: 10
41
+ create: true
42
+ body_pose:
43
+ create: true
44
+ global_orient:
45
+ create: true
46
+ transl:
47
+ create: true
48
+ joint_mapper:
49
+ use: false
50
+ type: smpl_to_openpose
51
+ smpl_to_openpose:
52
+ use_hands: true
53
+ use_face: true
54
+ use_face_contour: false
55
+ openpose_format: coco25
56
+ left_hand_pose:
57
+ use_pca: true
58
+ num_pca_comps: 6
59
+ flat_hand_mean: false
60
+ create: true
61
+ right_hand_pose:
62
+ use_pca: true
63
+ num_pca_comps: 6
64
+ flat_hand_mean: false
65
+ create: true
66
+ use_compressed: true
67
+ smplx:
68
+ num_vertices: 10475
69
+ init:
70
+ ext: npz
71
+ batch_size: 1
72
+ gender: neutral
73
+ age: kid
74
+ kid_template_path: essentials/body_models/smil/smplx_kid_template.npy
75
+ betas:
76
+ dim: 10
77
+ create: true
78
+ body_pose:
79
+ create: true
80
+ global_orient:
81
+ create: true
82
+ transl:
83
+ create: true
84
+ joint_mapper:
85
+ use: true
86
+ type: smpl_to_openpose
87
+ smpl_to_openpose:
88
+ use_hands: false
89
+ use_face: false
90
+ use_face_contour: false
91
+ openpose_format: coco25
92
+ left_hand_pose:
93
+ use_pca: true
94
+ num_pca_comps: 6
95
+ flat_hand_mean: false
96
+ create: true
97
+ right_hand_pose:
98
+ use_pca: true
99
+ num_pca_comps: 6
100
+ flat_hand_mean: false
101
+ create: true
102
+ use_compressed: true
103
+ hand_vertex_ids_path: ''
104
+ expression:
105
+ dim: 10
106
+ create: true
107
+ jaw_pose:
108
+ create: true
109
+ leye_pose:
110
+ create: true
111
+ reye_pose:
112
+ create: true
113
+ datasets:
114
+ processing:
115
+ normalization_mean:
116
+ - 0.485
117
+ - 0.456
118
+ - 0.406
119
+ normalization_std:
120
+ - 0.229
121
+ - 0.224
122
+ - 0.225
123
+ resolution: 224
124
+ use: false
125
+ load_image: false
126
+ train_names: []
127
+ train_composition:
128
+ - 0.2
129
+ - 0.2
130
+ - 0.6
131
+ augmentation:
132
+ use: true
133
+ mirror: 0.5
134
+ noise: 0.4
135
+ rotation: 30.0
136
+ scale: 0.25
137
+ swap: 0.5
138
+ val_names:
139
+ - flickrci3ds
140
+ test_names: []
141
+ flickrci3dsd:
142
+ body_model_path: essentials/body_models
143
+ original_data_folder: datasets/original/FlickrCI3D_Signatures
144
+ imar_vision_datasets_tools_folder: essentials/imar_vision_datasets_tools
145
+ processed_data_folder: datasets/processed/FlickrCI3D_Signatures
146
+ max_count_regions_in_contact: 25
147
+ number_of_regions: 75
148
+ image_folder: images
149
+ bev_folder: bev
150
+ openpose_folder: keypoints/keypoints
151
+ vitpose_folder: vitpose
152
+ vitdet_folder: vitdet
153
+ image_format: png
154
+ pseudogt_folder: pseudogt/summaries
155
+ overfit: false
156
+ overfit_num_samples: 12
157
+ init_pose_from_bev: false
158
+ features:
159
+ is_itw: true
160
+ has_dhhc_class: false
161
+ has_dhhc_sig: true
162
+ has_dsc_sig: false
163
+ has_dsc_class: false
164
+ has_gt_kpts: false
165
+ has_op_kpts: true
166
+ has_gt_joints: false
167
+ has_gender: false
168
+ has_gt_smpl_shape: false
169
+ has_gt_smpl_pose: false
170
+ has_pgt_smpl_shape: false
171
+ has_pgt_smpl_pose: false
172
+ flickrci3ds:
173
+ body_model_path: essentials/body_models
174
+ original_data_folder: datasets/original/FlickrCI3D_Signatures
175
+ imar_vision_datasets_tools_folder: essentials/imar_vision_datasets_tools
176
+ processed_data_folder: datasets/processed/FlickrCI3D_Signatures
177
+ max_count_regions_in_contact: 25
178
+ number_of_regions: 75
179
+ image_folder: images
180
+ bev_folder: bev
181
+ openpose_folder: openpose
182
+ vitpose_folder: vitpose
183
+ vitposeplus_folder: vitposeplus
184
+ vitdet_folder: vitdet
185
+ image_format: png
186
+ pseudogt_folder: hhcs_opti/run4_20231015/results
187
+ overfit: false
188
+ overfit_num_samples: 12
189
+ adult_only: false
190
+ child_only: false
191
+ features:
192
+ is_itw: true
193
+ has_dhhc_class: false
194
+ has_dhhc_sig: true
195
+ has_dsc_sig: false
196
+ has_dsc_class: false
197
+ has_gt_kpts: false
198
+ has_op_kpts: true
199
+ has_gt_joints: false
200
+ has_gender: false
201
+ has_gt_smpl_shape: false
202
+ has_gt_smpl_pose: false
203
+ has_pgt_smpl_shape: false
204
+ has_pgt_smpl_pose: false
205
+ flickrci3ds_adult:
206
+ body_model_path: essentials/body_models
207
+ original_data_folder: datasets/original/FlickrCI3D_Signatures
208
+ imar_vision_datasets_tools_folder: essentials/imar_vision_datasets_tools
209
+ processed_data_folder: datasets/processed/FlickrCI3D_Signatures
210
+ max_count_regions_in_contact: 25
211
+ number_of_regions: 75
212
+ image_folder: images
213
+ bev_folder: bev
214
+ openpose_folder: openpose
215
+ vitpose_folder: vitpose
216
+ vitposeplus_folder: vitposeplus
217
+ vitdet_folder: vitdet
218
+ image_format: png
219
+ pseudogt_folder: pseudogt/summaries
220
+ overfit: false
221
+ overfit_num_samples: 12
222
+ adult_only: true
223
+ child_only: false
224
+ features:
225
+ is_itw: true
226
+ has_dhhc_class: false
227
+ has_dhhc_sig: true
228
+ has_dsc_sig: false
229
+ has_dsc_class: false
230
+ has_gt_kpts: false
231
+ has_op_kpts: true
232
+ has_gt_joints: false
233
+ has_gender: false
234
+ has_gt_smpl_shape: false
235
+ has_gt_smpl_pose: false
236
+ has_pgt_smpl_shape: false
237
+ has_pgt_smpl_pose: false
238
+ flickrci3ds_child:
239
+ body_model_path: essentials/body_models
240
+ original_data_folder: datasets/original/FlickrCI3D_Signatures
241
+ imar_vision_datasets_tools_folder: essentials/imar_vision_datasets_tools
242
+ processed_data_folder: datasets/processed/FlickrCI3D_Signatures
243
+ max_count_regions_in_contact: 25
244
+ number_of_regions: 75
245
+ image_folder: images
246
+ bev_folder: bev
247
+ openpose_folder: openpose
248
+ vitpose_folder: vitpose
249
+ vitposeplus_folder: vitposeplus
250
+ vitdet_folder: vitdet
251
+ image_format: png
252
+ pseudogt_folder: pseudogt/summaries
253
+ overfit: false
254
+ overfit_num_samples: 12
255
+ adult_only: false
256
+ child_only: true
257
+ features:
258
+ is_itw: true
259
+ has_dhhc_class: false
260
+ has_dhhc_sig: true
261
+ has_dsc_sig: false
262
+ has_dsc_class: false
263
+ has_gt_kpts: false
264
+ has_op_kpts: true
265
+ has_gt_joints: false
266
+ has_gender: false
267
+ has_gt_smpl_shape: false
268
+ has_gt_smpl_pose: false
269
+ has_pgt_smpl_shape: false
270
+ has_pgt_smpl_pose: false
271
+ flickrci3dc:
272
+ original_data_folder: datasets/original/FlickrCI3D_Classification
273
+ imar_vision_datasets_tools_folder: essentials/imar_vision_datasets_tools
274
+ processed_data_folder: datasets/processed/FlickrCI3D_Classification
275
+ image_folder: images
276
+ image_format: png
277
+ bev_folder: bev
278
+ features:
279
+ is_itw: true
280
+ has_dhhc_class: true
281
+ has_dhhc_sig: false
282
+ has_dsc_sig: false
283
+ has_dsc_class: false
284
+ has_gt_kpts: false
285
+ has_op_kpts: true
286
+ has_gt_joints: false
287
+ has_gender: false
288
+ has_gt_smpl_shape: false
289
+ has_gt_smpl_pose: false
290
+ has_pgt_smpl_shape: false
291
+ has_pgt_smpl_pose: false
292
+ overfit: false
293
+ overfit_num_samples: 64
294
+ chi3d:
295
+ original_data_folder: datasets/original/CHI3D
296
+ processed_data_folder: datasets/processed/CHI3D
297
+ imar_vision_datasets_tools_folder: essentials/imar_vision_datasets_tools
298
+ max_count_regions_in_contact: 25
299
+ number_of_regions: 75
300
+ image_folder: images
301
+ bev_folder: bev
302
+ openpose_folder: keypoints/keypoints
303
+ vitpose_folder: vitpose
304
+ vitdet_folder: vitdet
305
+ image_format: png
306
+ pseudogt_folder: pseudogt/summaries
307
+ overfit: false
308
+ overfit_num_samples: 12
309
+ load_single_camera: false
310
+ load_from_scratch_single_camera: false
311
+ load_contact_frame_only: true
312
+ load_unit_glob_and_transl: true
313
+ features:
314
+ is_itw: false
315
+ has_dhhc_class: false
316
+ has_dhhc_sig: true
317
+ has_dsc_sig: false
318
+ has_dsc_class: false
319
+ has_gt_kpts: false
320
+ has_op_kpts: true
321
+ has_gt_joints: false
322
+ has_gender: false
323
+ has_gt_smpl_shape: false
324
+ has_gt_smpl_pose: false
325
+ has_pgt_smpl_shape: false
326
+ has_pgt_smpl_pose: false
327
+ hi4d:
328
+ original_data_folder: datasets/original/Hi4D
329
+ processed_data_folder: datasets/processed/Hi4D
330
+ image_folder: images
331
+ bev_folder: bev
332
+ openpose_folder: keypoints/keypoints
333
+ vitpose_folder: keypoints/vitposeplus
334
+ image_format: jpg
335
+ overfit: false
336
+ overfit_num_samples: 12
337
+ load_single_camera: false
338
+ load_from_scratch_single_camera: false
339
+ load_unit_glob_and_transl: true
340
+ features:
341
+ is_itw: false
342
+ has_dhhc_class: false
343
+ has_dhhc_sig: true
344
+ has_dsc_sig: false
345
+ has_dsc_class: false
346
+ has_gt_kpts: false
347
+ has_op_kpts: true
348
+ has_gt_joints: false
349
+ has_gender: false
350
+ has_gt_smpl_shape: false
351
+ has_gt_smpl_pose: false
352
+ has_pgt_smpl_shape: false
353
+ has_pgt_smpl_pose: false
354
+ demo:
355
+ original_data_folder: ''
356
+ number_of_regions: 75
357
+ image_folder: images
358
+ bev_folder: bev
359
+ openpose_folder: keypoints/keypoints
360
+ vitpose_folder: vitpose
361
+ image_format: png
362
+ image_name_select: ''
363
+ has_gt_contact_annotation: false
364
+ imar_vision_datasets_tools_folder: essentials/imar_vision_datasets_tools
365
+ unique_keypoint_match: true
366
+ camera:
367
+ type: perspective
368
+ perspective:
369
+ afov_horizontal: 60.0
370
+ pitch: 0.0
371
+ yaw: 0.0
372
+ roll: 0.0
373
+ tx: 0.0
374
+ ty: 0.0
375
+ tz: 0.0
376
+ iw: 224
377
+ ih: 224
378
+ model:
379
+ regressor:
380
+ type: diffusion_transformer
381
+ optimizer:
382
+ type: adam
383
+ adam:
384
+ lr: 0.0001
385
+ weight_decay: 0.0
386
+ lbfgs:
387
+ lr: 1.0
388
+ losses:
389
+ debug: false
390
+ keypoint2d:
391
+ type: l2
392
+ weight:
393
+ - 0.0
394
+ squared: true
395
+ translated: false
396
+ weighted: true
397
+ d1_aggregation: sum
398
+ init_pose:
399
+ type: l2
400
+ weight:
401
+ - 0.0
402
+ squared: true
403
+ translated: false
404
+ weighted: false
405
+ d1_aggregation: sum
406
+ init_shape:
407
+ type: l2
408
+ weight:
409
+ - 0.0
410
+ squared: true
411
+ translated: false
412
+ weighted: false
413
+ d1_aggregation: sum
414
+ init_transl:
415
+ type: l2
416
+ weight:
417
+ - 0.0
418
+ squared: true
419
+ translated: false
420
+ weighted: false
421
+ d1_aggregation: sum
422
+ kl:
423
+ type: ''
424
+ weight:
425
+ - 0.0
426
+ anneal_start: 0
427
+ anneal_end: -1
428
+ pseudogt_pose:
429
+ type: l2
430
+ weight:
431
+ - 10.0
432
+ squared: true
433
+ translated: false
434
+ weighted: false
435
+ d1_aggregation: sum
436
+ pseudogt_shape:
437
+ type: l2
438
+ weight:
439
+ - 0.1
440
+ squared: true
441
+ translated: false
442
+ weighted: false
443
+ d1_aggregation: sum
444
+ pseudogt_transl:
445
+ type: l2
446
+ weight:
447
+ - 1.0
448
+ squared: true
449
+ translated: false
450
+ weighted: false
451
+ d1_aggregation: sum
452
+ pseudogt_v2v:
453
+ type: l2
454
+ weight:
455
+ - 100.0
456
+ squared: true
457
+ translated: false
458
+ weighted: false
459
+ d1_aggregation: sum
460
+ pseudogt_j2j:
461
+ type: l2
462
+ weight:
463
+ - 0.0
464
+ squared: true
465
+ translated: false
466
+ weighted: false
467
+ d1_aggregation: sum
468
+ hhc_contact:
469
+ type: hhcmap
470
+ weight:
471
+ - 0.0
472
+ region_aggregation_type: sum
473
+ r2r_dist_type: min
474
+ squared_dist: true
475
+ region_to_vertex: essentials/contact/flickrci3ds_r75_rid_to_smplx_vid.pkl
476
+ hhc_contact_general:
477
+ type: hhcgen
478
+ weight:
479
+ - 0.0
480
+ region_aggregation_type: sum
481
+ r2r_dist_type: min
482
+ squared_dist: true
483
+ body_model_utils_folder: essentials/body_model_utils
484
+ model_type: smplx
485
+ pose_prior:
486
+ type: gmm
487
+ weight:
488
+ - 0.0
489
+ prior_folder: essentials/priors
490
+ num_gaussians: 8
491
+ epsilon: 1.0e-16
492
+ use_merged: true
493
+ shape_prior:
494
+ type: l2
495
+ weight:
496
+ - 0.0
497
+ squared: true
498
+ translated: false
499
+ weighted: false
500
+ d1_aggregation: sum
501
+ ground_plane:
502
+ type: l2
503
+ weight:
504
+ - 0.0
505
+ squared: true
506
+ translated: false
507
+ weighted: false
508
+ d1_aggregation: sum
509
+ cmap:
510
+ type: cmap
511
+ weight:
512
+ - 0.0
513
+ r2r_dist_type: test
514
+ cmap_heat_smpl:
515
+ type: ''
516
+ weight:
517
+ - 0.0
518
+ cmap_heat_token:
519
+ type: ''
520
+ weight:
521
+ - 0.0
522
+ cmap_binary_smpl:
523
+ type: ''
524
+ weight:
525
+ - 0.0
526
+ cmap_binary_token:
527
+ type: ''
528
+ weight:
529
+ - 0.0
530
+ diffusion_prior_orient:
531
+ type: l2
532
+ weight:
533
+ - 0.0
534
+ squared: true
535
+ translated: false
536
+ weighted: false
537
+ d1_aggregation: sum
538
+ diffusion_prior_pose:
539
+ type: l2
540
+ weight:
541
+ - 0.0
542
+ squared: true
543
+ translated: false
544
+ weighted: false
545
+ d1_aggregation: sum
546
+ diffusion_prior_shape:
547
+ type: l2
548
+ weight:
549
+ - 0.0
550
+ squared: true
551
+ translated: false
552
+ weighted: false
553
+ d1_aggregation: sum
554
+ diffusion_prior_scale:
555
+ type: l2
556
+ weight:
557
+ - 0.0
558
+ squared: true
559
+ translated: false
560
+ weighted: false
561
+ d1_aggregation: sum
562
+ diffusion_prior_transl:
563
+ type: l2
564
+ weight:
565
+ - 0.0
566
+ squared: true
567
+ translated: false
568
+ weighted: false
569
+ d1_aggregation: sum
570
+ diffusion_prior_v2v:
571
+ type: l2
572
+ weight:
573
+ - 0.0
574
+ squared: true
575
+ translated: false
576
+ weighted: false
577
+ d1_aggregation: sum
578
+ vae_prior_latent:
579
+ type: l2
580
+ weight:
581
+ - 0.0
582
+ squared: true
583
+ translated: false
584
+ weighted: false
585
+ d1_aggregation: sum
586
+ experiment:
587
+ contact_rep: bev_contact_heat
588
+ rotrep: sixd
589
+ in_data: pgt
590
+ token_setup: H0PH1P
591
+ num_contact_tokens: 0
592
+ smpl_mask: []
593
+ relative_transl: true
594
+ relative_orient: false
595
+ guidance_params: []
596
+ guidance_param_nc: 0.5
597
+ guidance_all_nc: 0.1
598
+ guidance_no_nc: 0.8
599
+ hmr:
600
+ load_pretrained: true
601
+ pretrained_type: resnet50
602
+ smpl_mean_params: essentials/spin/smpl_mean_params.npz
603
+ resnet:
604
+ depth: 50
605
+ pretrained: true
606
+ mlp:
607
+ in_channels: 10
608
+ hidden_channels:
609
+ - 8
610
+ - 6
611
+ - 4
612
+ - 2
613
+ bias: true
614
+ dropout: 0.0
615
+ vit:
616
+ image_size: 224
617
+ patch_size: 16
618
+ num_classes: 1000
619
+ dim: 1024
620
+ depth: 6
621
+ heads: 8
622
+ mlp_dim: 2048
623
+ dropout: 0.1
624
+ emb_dropout: 0.1
625
+ dim_head: 64
626
+ bev:
627
+ bv_with_fv_condition: true
628
+ add_offsetmap: true
629
+ add_depth_encoding: true
630
+ backbone_type: hrnet_32
631
+ hrnet_pretrain_path: hrbet_32.pth
632
+ params_num: 146
633
+ outmap_size: 128
634
+ cam_dim: 3
635
+ num_center_maps: 1
636
+ coord_maps_size: 128
637
+ centermap_size: 64
638
+ head_block_num: 2
639
+ head_num_channels: 128
640
+ bv_center_num_block: 2
641
+ bv_center_momentum: 0.1
642
+ transformer_dropout_ratio: 0.2
643
+ transformer_num_channels: 512
644
+ max_person: 12
645
+ conf_thresh: 0.1
646
+ ae:
647
+ in_channels: 5625
648
+ num_neurons: 4000
649
+ hidden_channels: 1000
650
+ dropout: 0.1
651
+ transformer:
652
+ dim: 146
653
+ depth: 6
654
+ heads: 1
655
+ mlp_dim: 2048
656
+ dropout: 0.1
657
+ diffusion_transformer:
658
+ dim: 152
659
+ depth: 6
660
+ heads: 8
661
+ mlp_dim: 500
662
+ dropout: 0.1
663
+ use_positional_encoding: false
664
+ use_positional_embedding: false
665
+ use_human_embedding: true
666
+ use_param_embedding: true
667
+ max_tokens: 100
668
+ use_cross_attention: false
669
+ share_linear_layers: false
670
+ encode_target: false
671
+ hhcc:
672
+ pretrain_path: essentials/bev/BEV.pth
673
+ vae_mlp:
674
+ d_model: null
675
+ d_latent: 64
676
+ d_features: 256
677
+ n_enc_layers: 2
678
+ n_dec_layers: 2
679
+ dropout: 0.1
680
+ embed_features: false
681
+ embed_id: false
682
+ enc_pos: false
683
+ autoencoder:
684
+ in_channels: 5625
685
+ hidden_channels: 1000
686
+ depth: 4
687
+ ae_mlp:
688
+ d_model: null
689
+ d_latent: 64
690
+ d_features: 256
691
+ n_enc_layers: 2
692
+ n_dec_layers: 2
693
+ dropout: 0.1
694
+ embed_features: false
695
+ embed_id: false
696
+ enc_pos: false
697
+ optimization:
698
+ type: smplifydc
699
+ print_loss: true
700
+ render_iters: false
701
+ use_gt_contact_map: true
702
+ use_vae: false
703
+ pretrained_vae_cfg: ''
704
+ pretrained_vae_ckpt: ''
705
+ use_diffusion: false
706
+ pretrained_diffusion_model_cfg: ''
707
+ pretrained_diffusion_model_ckpt: ''
708
+ sds_type: fixed
709
+ sds_t_fixed: 20
710
+ sds_t_range:
711
+ - 25
712
+ - 75
713
+ sds_t_adaptive_i:
714
+ - 1.0
715
+ - 0.8
716
+ - 0.6
717
+ - 0.4
718
+ - 0.2
719
+ sds_t_adaptive_t:
720
+ - 100
721
+ - 80
722
+ - 60
723
+ - 40
724
+ - 20
725
+ optimizer:
726
+ type: adam
727
+ adam:
728
+ lr: 1.0
729
+ weight_decay: 0.0
730
+ lbfgs:
731
+ lr: 1.0
732
+ losses:
733
+ debug: false
734
+ keypoint2d:
735
+ type: l2
736
+ weight:
737
+ - 0.0
738
+ squared: true
739
+ translated: false
740
+ weighted: true
741
+ d1_aggregation: sum
742
+ init_pose:
743
+ type: l2
744
+ weight:
745
+ - 0.0
746
+ squared: true
747
+ translated: false
748
+ weighted: false
749
+ d1_aggregation: sum
750
+ init_shape:
751
+ type: l2
752
+ weight:
753
+ - 0.0
754
+ squared: true
755
+ translated: false
756
+ weighted: false
757
+ d1_aggregation: sum
758
+ init_transl:
759
+ type: l2
760
+ weight:
761
+ - 0.0
762
+ squared: true
763
+ translated: false
764
+ weighted: false
765
+ d1_aggregation: sum
766
+ kl:
767
+ type: ''
768
+ weight:
769
+ - 0.0
770
+ anneal_start: 0
771
+ anneal_end: -1
772
+ pseudogt_pose:
773
+ type: l2
774
+ weight:
775
+ - 0.0
776
+ squared: true
777
+ translated: false
778
+ weighted: false
779
+ d1_aggregation: sum
780
+ pseudogt_shape:
781
+ type: l2
782
+ weight:
783
+ - 0.0
784
+ squared: true
785
+ translated: false
786
+ weighted: false
787
+ d1_aggregation: sum
788
+ pseudogt_transl:
789
+ type: l2
790
+ weight:
791
+ - 0.0
792
+ squared: true
793
+ translated: false
794
+ weighted: false
795
+ d1_aggregation: sum
796
+ pseudogt_v2v:
797
+ type: l2
798
+ weight:
799
+ - 0.0
800
+ squared: true
801
+ translated: false
802
+ weighted: false
803
+ d1_aggregation: sum
804
+ pseudogt_j2j:
805
+ type: l2
806
+ weight:
807
+ - 0.0
808
+ squared: true
809
+ translated: false
810
+ weighted: false
811
+ d1_aggregation: sum
812
+ hhc_contact:
813
+ type: hhcmap
814
+ weight:
815
+ - 0.0
816
+ region_aggregation_type: sum
817
+ r2r_dist_type: min
818
+ squared_dist: true
819
+ region_to_vertex: essentials/contact/flickrci3ds_r75_rid_to_smplx_vid.pkl
820
+ hhc_contact_general:
821
+ type: hhcgen
822
+ weight:
823
+ - 0.0
824
+ region_aggregation_type: sum
825
+ r2r_dist_type: min
826
+ squared_dist: true
827
+ body_model_utils_folder: essentials/body_model_utils
828
+ model_type: smplx
829
+ pose_prior:
830
+ type: gmm
831
+ weight:
832
+ - 0.0
833
+ prior_folder: essentials/priors
834
+ num_gaussians: 8
835
+ epsilon: 1.0e-16
836
+ use_merged: true
837
+ shape_prior:
838
+ type: l2
839
+ weight:
840
+ - 0.0
841
+ squared: true
842
+ translated: false
843
+ weighted: false
844
+ d1_aggregation: sum
845
+ ground_plane:
846
+ type: l2
847
+ weight:
848
+ - 0.0
849
+ squared: true
850
+ translated: false
851
+ weighted: false
852
+ d1_aggregation: sum
853
+ cmap:
854
+ type: cmap
855
+ weight:
856
+ - 0.0
857
+ r2r_dist_type: test
858
+ cmap_heat_smpl:
859
+ type: ''
860
+ weight:
861
+ - 0.0
862
+ cmap_heat_token:
863
+ type: ''
864
+ weight:
865
+ - 0.0
866
+ cmap_binary_smpl:
867
+ type: ''
868
+ weight:
869
+ - 0.0
870
+ cmap_binary_token:
871
+ type: ''
872
+ weight:
873
+ - 0.0
874
+ diffusion_prior_orient:
875
+ type: l2
876
+ weight:
877
+ - 0.0
878
+ squared: true
879
+ translated: false
880
+ weighted: false
881
+ d1_aggregation: sum
882
+ diffusion_prior_pose:
883
+ type: l2
884
+ weight:
885
+ - 0.0
886
+ squared: true
887
+ translated: false
888
+ weighted: false
889
+ d1_aggregation: sum
890
+ diffusion_prior_shape:
891
+ type: l2
892
+ weight:
893
+ - 0.0
894
+ squared: true
895
+ translated: false
896
+ weighted: false
897
+ d1_aggregation: sum
898
+ diffusion_prior_scale:
899
+ type: l2
900
+ weight:
901
+ - 0.0
902
+ squared: true
903
+ translated: false
904
+ weighted: false
905
+ d1_aggregation: sum
906
+ diffusion_prior_transl:
907
+ type: l2
908
+ weight:
909
+ - 0.0
910
+ squared: true
911
+ translated: false
912
+ weighted: false
913
+ d1_aggregation: sum
914
+ diffusion_prior_v2v:
915
+ type: l2
916
+ weight:
917
+ - 0.0
918
+ squared: true
919
+ translated: false
920
+ weighted: false
921
+ d1_aggregation: sum
922
+ vae_prior_latent:
923
+ type: l2
924
+ weight:
925
+ - 0.0
926
+ squared: true
927
+ translated: false
928
+ weighted: false
929
+ d1_aggregation: sum
930
+ smplifydc:
931
+ use_contact: true
932
+ hhcs:
933
+ use_contact: true
934
+ max_iters:
935
+ - 100
936
+ - 100
937
+ num_prev_steps: 100
938
+ slope_tol: -1.0e-05
939
+ diffusion:
940
+ steps: 1000
941
+ noise_schedule: cosine
942
+ rescale_timesteps: false
943
+ timestep_respacing: ''
944
+ loss_type: custom
945
+ model_mean_type: start_x
946
+ model_var_type: fixed_large
947
+ training:
948
+ max_epochs: 5000
949
+ max_duration: .inf
950
+ num_workers: 8
951
+ pin_memory: false
952
+ shuffle_train: true
953
+ pretrained: ''
954
+ clip_grad_norm: 1.0
955
+ train: true
956
+ eval_val: false
957
+ eval_test: false
958
+ evaluation:
959
+ checkpoint_metric: total_loss
960
+ metrics:
961
+ - v2v
962
+ - mpjpe
963
+ - pa_mpjpe
964
+ - pairwise_pa_mpjpe
965
+ per_person_metrics:
966
+ - v2v
967
+ - mpjpe
968
+ - scale_mpjpe
969
+ - pa_mpjpe
970
+ generative_metrics:
971
+ - gen_diversity
972
+ - gen_fid
973
+ - gen_contact_and_isect
974
+ - gen_tsne
975
+ num_samples: 512
976
+ v2v:
977
+ name: PointError
978
+ alignment: root
979
+ mpjpe:
980
+ name: PointError
981
+ alignment: root
982
+ scale_mpjpe:
983
+ name: PointError
984
+ alignment: scale
985
+ pa_mpjpe:
986
+ name: PointError
987
+ alignment: procrustes
988
+ pairwise_pa_mpjpe:
989
+ name: PointError
990
+ alignment: procrustes
991
+ cmap_dist:
992
+ name: ContactMapDistError
993
+ region_to_vertex: essentials/contact/flickrci3ds_r75_rid_to_smplx_vid.pkl
994
+ cmap_iou:
995
+ name: ContactIOU
996
+ gen_diversity:
997
+ name: GenDiversity
998
+ gen_fid:
999
+ fid_model_path: ''
1000
+ fid_model_cfg: ''
1001
+ name: GenFID
1002
+ gen_contact_and_isect:
1003
+ name: GenContactIsect
1004
+ gen_tsne:
1005
+ name: GentSNE
1006
+ visualization:
1007
+ renderer:
1008
+ type: pytorch3d
1009
+ image_height: 224
1010
+ image_width: 224
1011
+ mesh_color: light_blue
1012
+ pytorch3d:
1013
+ blur_radius: 0.0
1014
+ faces_per_pixel: 5
1015
+ light_location:
1016
+ - - 0.0
1017
+ - 0.0
1018
+ - -0.5
1019
+ plotter:
1020
+ iw: 224
1021
+ ih: 224
1022
+ style:
1023
+ type: latex
1024
+ logging:
1025
+ base_folder: essentials
1026
+ run: buddi
1027
+ images_folder: images
1028
+ summaries_folder: summaries
1029
+ checkpoint_folder: checkpoints
1030
+ validation_folder: validation
1031
+ result_folder: results
1032
+ summaries_freq: 100.0
1033
+ checkpoint_freq: 100.0
1034
+ logger_level: INFO
1035
+ logger: tensorbaord
1036
+ project_name: HumanHumanContactGenerativeModel
1037
+ run_id: rcbq29q7
1038
+ wandb_api_key_path: .wandb/api.txt
essentials/buddi/config_latest_run.yaml ADDED
@@ -0,0 +1,1038 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ batch_size: 16
2
+ device: cuda
3
+ body_model:
4
+ type: smplx
5
+ smpl_family_folder: essentials/body_models
6
+ smpl:
7
+ num_vertices: 6890
8
+ init:
9
+ ext: pkl
10
+ batch_size: 1
11
+ gender: neutral
12
+ age: adult
13
+ kid_template_path: essentials/body_models/smil/smplx_kid_template.npy
14
+ betas:
15
+ dim: 10
16
+ create: true
17
+ body_pose:
18
+ create: true
19
+ global_orient:
20
+ create: true
21
+ transl:
22
+ create: true
23
+ joint_mapper:
24
+ use: false
25
+ type: smpl_to_openpose
26
+ smpl_to_openpose:
27
+ use_hands: true
28
+ use_face: true
29
+ use_face_contour: false
30
+ openpose_format: coco25
31
+ smplh:
32
+ num_vertices: 6890
33
+ init:
34
+ ext: pkl
35
+ batch_size: 1
36
+ gender: neutral
37
+ age: adult
38
+ kid_template_path: essentials/body_models/smil/smplx_kid_template.npy
39
+ betas:
40
+ dim: 10
41
+ create: true
42
+ body_pose:
43
+ create: true
44
+ global_orient:
45
+ create: true
46
+ transl:
47
+ create: true
48
+ joint_mapper:
49
+ use: false
50
+ type: smpl_to_openpose
51
+ smpl_to_openpose:
52
+ use_hands: true
53
+ use_face: true
54
+ use_face_contour: false
55
+ openpose_format: coco25
56
+ left_hand_pose:
57
+ use_pca: true
58
+ num_pca_comps: 6
59
+ flat_hand_mean: false
60
+ create: true
61
+ right_hand_pose:
62
+ use_pca: true
63
+ num_pca_comps: 6
64
+ flat_hand_mean: false
65
+ create: true
66
+ use_compressed: true
67
+ smplx:
68
+ num_vertices: 10475
69
+ init:
70
+ ext: npz
71
+ batch_size: 1
72
+ gender: neutral
73
+ age: kid
74
+ kid_template_path: essentials/body_models/smil/smplx_kid_template.npy
75
+ betas:
76
+ dim: 10
77
+ create: true
78
+ body_pose:
79
+ create: true
80
+ global_orient:
81
+ create: true
82
+ transl:
83
+ create: true
84
+ joint_mapper:
85
+ use: true
86
+ type: smpl_to_openpose
87
+ smpl_to_openpose:
88
+ use_hands: false
89
+ use_face: false
90
+ use_face_contour: false
91
+ openpose_format: coco25
92
+ left_hand_pose:
93
+ use_pca: true
94
+ num_pca_comps: 6
95
+ flat_hand_mean: false
96
+ create: true
97
+ right_hand_pose:
98
+ use_pca: true
99
+ num_pca_comps: 6
100
+ flat_hand_mean: false
101
+ create: true
102
+ use_compressed: true
103
+ hand_vertex_ids_path: ''
104
+ expression:
105
+ dim: 10
106
+ create: true
107
+ jaw_pose:
108
+ create: true
109
+ leye_pose:
110
+ create: true
111
+ reye_pose:
112
+ create: true
113
+ datasets:
114
+ processing:
115
+ normalization_mean:
116
+ - 0.485
117
+ - 0.456
118
+ - 0.406
119
+ normalization_std:
120
+ - 0.229
121
+ - 0.224
122
+ - 0.225
123
+ resolution: 224
124
+ use: false
125
+ load_image: false
126
+ train_names: []
127
+ train_composition:
128
+ - 0.2
129
+ - 0.2
130
+ - 0.6
131
+ augmentation:
132
+ use: true
133
+ mirror: 0.5
134
+ noise: 0.4
135
+ rotation: 30.0
136
+ scale: 0.25
137
+ swap: 0.5
138
+ val_names:
139
+ - flickrci3ds
140
+ test_names: []
141
+ flickrci3dsd:
142
+ body_model_path: essentials/body_models
143
+ original_data_folder: datasets/original/FlickrCI3D_Signatures
144
+ imar_vision_datasets_tools_folder: essentials/imar_vision_datasets_tools
145
+ processed_data_folder: datasets/processed/FlickrCI3D_Signatures
146
+ max_count_regions_in_contact: 25
147
+ number_of_regions: 75
148
+ image_folder: images
149
+ bev_folder: bev
150
+ openpose_folder: keypoints/keypoints
151
+ vitpose_folder: vitpose
152
+ vitdet_folder: vitdet
153
+ image_format: png
154
+ pseudogt_folder: pseudogt/summaries
155
+ overfit: false
156
+ overfit_num_samples: 12
157
+ init_pose_from_bev: false
158
+ features:
159
+ is_itw: true
160
+ has_dhhc_class: false
161
+ has_dhhc_sig: true
162
+ has_dsc_sig: false
163
+ has_dsc_class: false
164
+ has_gt_kpts: false
165
+ has_op_kpts: true
166
+ has_gt_joints: false
167
+ has_gender: false
168
+ has_gt_smpl_shape: false
169
+ has_gt_smpl_pose: false
170
+ has_pgt_smpl_shape: false
171
+ has_pgt_smpl_pose: false
172
+ flickrci3ds:
173
+ body_model_path: essentials/body_models
174
+ original_data_folder: datasets/original/FlickrCI3D_Signatures
175
+ imar_vision_datasets_tools_folder: essentials/imar_vision_datasets_tools
176
+ processed_data_folder: datasets/processed/FlickrCI3D_Signatures
177
+ max_count_regions_in_contact: 25
178
+ number_of_regions: 75
179
+ image_folder: images
180
+ bev_folder: bev
181
+ openpose_folder: openpose
182
+ vitpose_folder: vitpose
183
+ vitposeplus_folder: vitposeplus
184
+ vitdet_folder: vitdet
185
+ image_format: png
186
+ pseudogt_folder: hhcs_opti/run4_20231015/results
187
+ overfit: false
188
+ overfit_num_samples: 12
189
+ adult_only: false
190
+ child_only: false
191
+ features:
192
+ is_itw: true
193
+ has_dhhc_class: false
194
+ has_dhhc_sig: true
195
+ has_dsc_sig: false
196
+ has_dsc_class: false
197
+ has_gt_kpts: false
198
+ has_op_kpts: true
199
+ has_gt_joints: false
200
+ has_gender: false
201
+ has_gt_smpl_shape: false
202
+ has_gt_smpl_pose: false
203
+ has_pgt_smpl_shape: false
204
+ has_pgt_smpl_pose: false
205
+ flickrci3ds_adult:
206
+ body_model_path: essentials/body_models
207
+ original_data_folder: datasets/original/FlickrCI3D_Signatures
208
+ imar_vision_datasets_tools_folder: essentials/imar_vision_datasets_tools
209
+ processed_data_folder: datasets/processed/FlickrCI3D_Signatures
210
+ max_count_regions_in_contact: 25
211
+ number_of_regions: 75
212
+ image_folder: images
213
+ bev_folder: bev
214
+ openpose_folder: openpose
215
+ vitpose_folder: vitpose
216
+ vitposeplus_folder: vitposeplus
217
+ vitdet_folder: vitdet
218
+ image_format: png
219
+ pseudogt_folder: pseudogt/summaries
220
+ overfit: false
221
+ overfit_num_samples: 12
222
+ adult_only: true
223
+ child_only: false
224
+ features:
225
+ is_itw: true
226
+ has_dhhc_class: false
227
+ has_dhhc_sig: true
228
+ has_dsc_sig: false
229
+ has_dsc_class: false
230
+ has_gt_kpts: false
231
+ has_op_kpts: true
232
+ has_gt_joints: false
233
+ has_gender: false
234
+ has_gt_smpl_shape: false
235
+ has_gt_smpl_pose: false
236
+ has_pgt_smpl_shape: false
237
+ has_pgt_smpl_pose: false
238
+ flickrci3ds_child:
239
+ body_model_path: essentials/body_models
240
+ original_data_folder: datasets/original/FlickrCI3D_Signatures
241
+ imar_vision_datasets_tools_folder: essentials/imar_vision_datasets_tools
242
+ processed_data_folder: datasets/processed/FlickrCI3D_Signatures
243
+ max_count_regions_in_contact: 25
244
+ number_of_regions: 75
245
+ image_folder: images
246
+ bev_folder: bev
247
+ openpose_folder: openpose
248
+ vitpose_folder: vitpose
249
+ vitposeplus_folder: vitposeplus
250
+ vitdet_folder: vitdet
251
+ image_format: png
252
+ pseudogt_folder: pseudogt/summaries
253
+ overfit: false
254
+ overfit_num_samples: 12
255
+ adult_only: false
256
+ child_only: true
257
+ features:
258
+ is_itw: true
259
+ has_dhhc_class: false
260
+ has_dhhc_sig: true
261
+ has_dsc_sig: false
262
+ has_dsc_class: false
263
+ has_gt_kpts: false
264
+ has_op_kpts: true
265
+ has_gt_joints: false
266
+ has_gender: false
267
+ has_gt_smpl_shape: false
268
+ has_gt_smpl_pose: false
269
+ has_pgt_smpl_shape: false
270
+ has_pgt_smpl_pose: false
271
+ flickrci3dc:
272
+ original_data_folder: datasets/original/FlickrCI3D_Classification
273
+ imar_vision_datasets_tools_folder: essentials/imar_vision_datasets_tools
274
+ processed_data_folder: datasets/processed/FlickrCI3D_Classification
275
+ image_folder: images
276
+ image_format: png
277
+ bev_folder: bev
278
+ features:
279
+ is_itw: true
280
+ has_dhhc_class: true
281
+ has_dhhc_sig: false
282
+ has_dsc_sig: false
283
+ has_dsc_class: false
284
+ has_gt_kpts: false
285
+ has_op_kpts: true
286
+ has_gt_joints: false
287
+ has_gender: false
288
+ has_gt_smpl_shape: false
289
+ has_gt_smpl_pose: false
290
+ has_pgt_smpl_shape: false
291
+ has_pgt_smpl_pose: false
292
+ overfit: false
293
+ overfit_num_samples: 64
294
+ chi3d:
295
+ original_data_folder: datasets/original/CHI3D
296
+ processed_data_folder: datasets/processed/CHI3D
297
+ imar_vision_datasets_tools_folder: essentials/imar_vision_datasets_tools
298
+ max_count_regions_in_contact: 25
299
+ number_of_regions: 75
300
+ image_folder: images
301
+ bev_folder: bev
302
+ openpose_folder: keypoints/keypoints
303
+ vitpose_folder: vitpose
304
+ vitdet_folder: vitdet
305
+ image_format: png
306
+ pseudogt_folder: pseudogt/summaries
307
+ overfit: false
308
+ overfit_num_samples: 12
309
+ load_single_camera: false
310
+ load_from_scratch_single_camera: false
311
+ load_contact_frame_only: true
312
+ load_unit_glob_and_transl: true
313
+ features:
314
+ is_itw: false
315
+ has_dhhc_class: false
316
+ has_dhhc_sig: true
317
+ has_dsc_sig: false
318
+ has_dsc_class: false
319
+ has_gt_kpts: false
320
+ has_op_kpts: true
321
+ has_gt_joints: false
322
+ has_gender: false
323
+ has_gt_smpl_shape: false
324
+ has_gt_smpl_pose: false
325
+ has_pgt_smpl_shape: false
326
+ has_pgt_smpl_pose: false
327
+ hi4d:
328
+ original_data_folder: datasets/original/Hi4D
329
+ processed_data_folder: datasets/processed/Hi4D
330
+ image_folder: images
331
+ bev_folder: bev
332
+ openpose_folder: keypoints/keypoints
333
+ vitpose_folder: keypoints/vitposeplus
334
+ image_format: jpg
335
+ overfit: false
336
+ overfit_num_samples: 12
337
+ load_single_camera: false
338
+ load_from_scratch_single_camera: false
339
+ load_unit_glob_and_transl: true
340
+ features:
341
+ is_itw: false
342
+ has_dhhc_class: false
343
+ has_dhhc_sig: true
344
+ has_dsc_sig: false
345
+ has_dsc_class: false
346
+ has_gt_kpts: false
347
+ has_op_kpts: true
348
+ has_gt_joints: false
349
+ has_gender: false
350
+ has_gt_smpl_shape: false
351
+ has_gt_smpl_pose: false
352
+ has_pgt_smpl_shape: false
353
+ has_pgt_smpl_pose: false
354
+ demo:
355
+ original_data_folder: ''
356
+ number_of_regions: 75
357
+ image_folder: images
358
+ bev_folder: bev
359
+ openpose_folder: keypoints/keypoints
360
+ vitpose_folder: vitpose
361
+ image_format: png
362
+ image_name_select: ''
363
+ has_gt_contact_annotation: false
364
+ imar_vision_datasets_tools_folder: essentials/imar_vision_datasets_tools
365
+ unique_keypoint_match: true
366
+ camera:
367
+ type: perspective
368
+ perspective:
369
+ afov_horizontal: 60.0
370
+ pitch: 0.0
371
+ yaw: 0.0
372
+ roll: 0.0
373
+ tx: 0.0
374
+ ty: 0.0
375
+ tz: 0.0
376
+ iw: 224
377
+ ih: 224
378
+ model:
379
+ regressor:
380
+ type: diffusion_transformer
381
+ optimizer:
382
+ type: adam
383
+ adam:
384
+ lr: 0.0001
385
+ weight_decay: 0.0
386
+ lbfgs:
387
+ lr: 1.0
388
+ losses:
389
+ debug: false
390
+ keypoint2d:
391
+ type: l2
392
+ weight:
393
+ - 0.0
394
+ squared: true
395
+ translated: false
396
+ weighted: true
397
+ d1_aggregation: sum
398
+ init_pose:
399
+ type: l2
400
+ weight:
401
+ - 0.0
402
+ squared: true
403
+ translated: false
404
+ weighted: false
405
+ d1_aggregation: sum
406
+ init_shape:
407
+ type: l2
408
+ weight:
409
+ - 0.0
410
+ squared: true
411
+ translated: false
412
+ weighted: false
413
+ d1_aggregation: sum
414
+ init_transl:
415
+ type: l2
416
+ weight:
417
+ - 0.0
418
+ squared: true
419
+ translated: false
420
+ weighted: false
421
+ d1_aggregation: sum
422
+ kl:
423
+ type: ''
424
+ weight:
425
+ - 0.0
426
+ anneal_start: 0
427
+ anneal_end: -1
428
+ pseudogt_pose:
429
+ type: l2
430
+ weight:
431
+ - 10.0
432
+ squared: true
433
+ translated: false
434
+ weighted: false
435
+ d1_aggregation: sum
436
+ pseudogt_shape:
437
+ type: l2
438
+ weight:
439
+ - 0.1
440
+ squared: true
441
+ translated: false
442
+ weighted: false
443
+ d1_aggregation: sum
444
+ pseudogt_transl:
445
+ type: l2
446
+ weight:
447
+ - 1.0
448
+ squared: true
449
+ translated: false
450
+ weighted: false
451
+ d1_aggregation: sum
452
+ pseudogt_v2v:
453
+ type: l2
454
+ weight:
455
+ - 100.0
456
+ squared: true
457
+ translated: false
458
+ weighted: false
459
+ d1_aggregation: sum
460
+ pseudogt_j2j:
461
+ type: l2
462
+ weight:
463
+ - 0.0
464
+ squared: true
465
+ translated: false
466
+ weighted: false
467
+ d1_aggregation: sum
468
+ hhc_contact:
469
+ type: hhcmap
470
+ weight:
471
+ - 0.0
472
+ region_aggregation_type: sum
473
+ r2r_dist_type: min
474
+ squared_dist: true
475
+ region_to_vertex: essentials/contact/flickrci3ds_r75_rid_to_smplx_vid.pkl
476
+ hhc_contact_general:
477
+ type: hhcgen
478
+ weight:
479
+ - 0.0
480
+ region_aggregation_type: sum
481
+ r2r_dist_type: min
482
+ squared_dist: true
483
+ body_model_utils_folder: essentials/body_model_utils
484
+ model_type: smplx
485
+ pose_prior:
486
+ type: gmm
487
+ weight:
488
+ - 0.0
489
+ prior_folder: essentials/priors
490
+ num_gaussians: 8
491
+ epsilon: 1.0e-16
492
+ use_merged: true
493
+ shape_prior:
494
+ type: l2
495
+ weight:
496
+ - 0.0
497
+ squared: true
498
+ translated: false
499
+ weighted: false
500
+ d1_aggregation: sum
501
+ ground_plane:
502
+ type: l2
503
+ weight:
504
+ - 0.0
505
+ squared: true
506
+ translated: false
507
+ weighted: false
508
+ d1_aggregation: sum
509
+ cmap:
510
+ type: cmap
511
+ weight:
512
+ - 0.0
513
+ r2r_dist_type: test
514
+ cmap_heat_smpl:
515
+ type: ''
516
+ weight:
517
+ - 0.0
518
+ cmap_heat_token:
519
+ type: ''
520
+ weight:
521
+ - 0.0
522
+ cmap_binary_smpl:
523
+ type: ''
524
+ weight:
525
+ - 0.0
526
+ cmap_binary_token:
527
+ type: ''
528
+ weight:
529
+ - 0.0
530
+ diffusion_prior_orient:
531
+ type: l2
532
+ weight:
533
+ - 0.0
534
+ squared: true
535
+ translated: false
536
+ weighted: false
537
+ d1_aggregation: sum
538
+ diffusion_prior_pose:
539
+ type: l2
540
+ weight:
541
+ - 0.0
542
+ squared: true
543
+ translated: false
544
+ weighted: false
545
+ d1_aggregation: sum
546
+ diffusion_prior_shape:
547
+ type: l2
548
+ weight:
549
+ - 0.0
550
+ squared: true
551
+ translated: false
552
+ weighted: false
553
+ d1_aggregation: sum
554
+ diffusion_prior_scale:
555
+ type: l2
556
+ weight:
557
+ - 0.0
558
+ squared: true
559
+ translated: false
560
+ weighted: false
561
+ d1_aggregation: sum
562
+ diffusion_prior_transl:
563
+ type: l2
564
+ weight:
565
+ - 0.0
566
+ squared: true
567
+ translated: false
568
+ weighted: false
569
+ d1_aggregation: sum
570
+ diffusion_prior_v2v:
571
+ type: l2
572
+ weight:
573
+ - 0.0
574
+ squared: true
575
+ translated: false
576
+ weighted: false
577
+ d1_aggregation: sum
578
+ vae_prior_latent:
579
+ type: l2
580
+ weight:
581
+ - 0.0
582
+ squared: true
583
+ translated: false
584
+ weighted: false
585
+ d1_aggregation: sum
586
+ experiment:
587
+ contact_rep: bev_contact_heat
588
+ rotrep: sixd
589
+ in_data: pgt
590
+ token_setup: H0PH1P
591
+ num_contact_tokens: 0
592
+ smpl_mask: []
593
+ relative_transl: true
594
+ relative_orient: false
595
+ guidance_params: []
596
+ guidance_param_nc: 0.5
597
+ guidance_all_nc: 0.1
598
+ guidance_no_nc: 0.8
599
+ hmr:
600
+ load_pretrained: true
601
+ pretrained_type: resnet50
602
+ smpl_mean_params: essentials/spin/smpl_mean_params.npz
603
+ resnet:
604
+ depth: 50
605
+ pretrained: true
606
+ mlp:
607
+ in_channels: 10
608
+ hidden_channels:
609
+ - 8
610
+ - 6
611
+ - 4
612
+ - 2
613
+ bias: true
614
+ dropout: 0.0
615
+ vit:
616
+ image_size: 224
617
+ patch_size: 16
618
+ num_classes: 1000
619
+ dim: 1024
620
+ depth: 6
621
+ heads: 8
622
+ mlp_dim: 2048
623
+ dropout: 0.1
624
+ emb_dropout: 0.1
625
+ dim_head: 64
626
+ bev:
627
+ bv_with_fv_condition: true
628
+ add_offsetmap: true
629
+ add_depth_encoding: true
630
+ backbone_type: hrnet_32
631
+ hrnet_pretrain_path: hrbet_32.pth
632
+ params_num: 146
633
+ outmap_size: 128
634
+ cam_dim: 3
635
+ num_center_maps: 1
636
+ coord_maps_size: 128
637
+ centermap_size: 64
638
+ head_block_num: 2
639
+ head_num_channels: 128
640
+ bv_center_num_block: 2
641
+ bv_center_momentum: 0.1
642
+ transformer_dropout_ratio: 0.2
643
+ transformer_num_channels: 512
644
+ max_person: 12
645
+ conf_thresh: 0.1
646
+ ae:
647
+ in_channels: 5625
648
+ num_neurons: 4000
649
+ hidden_channels: 1000
650
+ dropout: 0.1
651
+ transformer:
652
+ dim: 146
653
+ depth: 6
654
+ heads: 1
655
+ mlp_dim: 2048
656
+ dropout: 0.1
657
+ diffusion_transformer:
658
+ dim: 152
659
+ depth: 6
660
+ heads: 8
661
+ mlp_dim: 500
662
+ dropout: 0.1
663
+ use_positional_encoding: false
664
+ use_positional_embedding: false
665
+ use_human_embedding: true
666
+ use_param_embedding: true
667
+ max_tokens: 100
668
+ use_cross_attention: false
669
+ share_linear_layers: false
670
+ encode_target: false
671
+ hhcc:
672
+ pretrain_path: essentials/bev/BEV.pth
673
+ vae_mlp:
674
+ d_model: null
675
+ d_latent: 64
676
+ d_features: 256
677
+ n_enc_layers: 2
678
+ n_dec_layers: 2
679
+ dropout: 0.1
680
+ embed_features: false
681
+ embed_id: false
682
+ enc_pos: false
683
+ autoencoder:
684
+ in_channels: 5625
685
+ hidden_channels: 1000
686
+ depth: 4
687
+ ae_mlp:
688
+ d_model: null
689
+ d_latent: 64
690
+ d_features: 256
691
+ n_enc_layers: 2
692
+ n_dec_layers: 2
693
+ dropout: 0.1
694
+ embed_features: false
695
+ embed_id: false
696
+ enc_pos: false
697
+ optimization:
698
+ type: smplifydc
699
+ print_loss: true
700
+ render_iters: false
701
+ use_gt_contact_map: true
702
+ use_vae: false
703
+ pretrained_vae_cfg: ''
704
+ pretrained_vae_ckpt: ''
705
+ use_diffusion: false
706
+ pretrained_diffusion_model_cfg: ''
707
+ pretrained_diffusion_model_ckpt: ''
708
+ sds_type: fixed
709
+ sds_t_fixed: 20
710
+ sds_t_range:
711
+ - 25
712
+ - 75
713
+ sds_t_adaptive_i:
714
+ - 1.0
715
+ - 0.8
716
+ - 0.6
717
+ - 0.4
718
+ - 0.2
719
+ sds_t_adaptive_t:
720
+ - 100
721
+ - 80
722
+ - 60
723
+ - 40
724
+ - 20
725
+ optimizer:
726
+ type: adam
727
+ adam:
728
+ lr: 1.0
729
+ weight_decay: 0.0
730
+ lbfgs:
731
+ lr: 1.0
732
+ losses:
733
+ debug: false
734
+ keypoint2d:
735
+ type: l2
736
+ weight:
737
+ - 0.0
738
+ squared: true
739
+ translated: false
740
+ weighted: true
741
+ d1_aggregation: sum
742
+ init_pose:
743
+ type: l2
744
+ weight:
745
+ - 0.0
746
+ squared: true
747
+ translated: false
748
+ weighted: false
749
+ d1_aggregation: sum
750
+ init_shape:
751
+ type: l2
752
+ weight:
753
+ - 0.0
754
+ squared: true
755
+ translated: false
756
+ weighted: false
757
+ d1_aggregation: sum
758
+ init_transl:
759
+ type: l2
760
+ weight:
761
+ - 0.0
762
+ squared: true
763
+ translated: false
764
+ weighted: false
765
+ d1_aggregation: sum
766
+ kl:
767
+ type: ''
768
+ weight:
769
+ - 0.0
770
+ anneal_start: 0
771
+ anneal_end: -1
772
+ pseudogt_pose:
773
+ type: l2
774
+ weight:
775
+ - 0.0
776
+ squared: true
777
+ translated: false
778
+ weighted: false
779
+ d1_aggregation: sum
780
+ pseudogt_shape:
781
+ type: l2
782
+ weight:
783
+ - 0.0
784
+ squared: true
785
+ translated: false
786
+ weighted: false
787
+ d1_aggregation: sum
788
+ pseudogt_transl:
789
+ type: l2
790
+ weight:
791
+ - 0.0
792
+ squared: true
793
+ translated: false
794
+ weighted: false
795
+ d1_aggregation: sum
796
+ pseudogt_v2v:
797
+ type: l2
798
+ weight:
799
+ - 0.0
800
+ squared: true
801
+ translated: false
802
+ weighted: false
803
+ d1_aggregation: sum
804
+ pseudogt_j2j:
805
+ type: l2
806
+ weight:
807
+ - 0.0
808
+ squared: true
809
+ translated: false
810
+ weighted: false
811
+ d1_aggregation: sum
812
+ hhc_contact:
813
+ type: hhcmap
814
+ weight:
815
+ - 0.0
816
+ region_aggregation_type: sum
817
+ r2r_dist_type: min
818
+ squared_dist: true
819
+ region_to_vertex: essentials/contact/flickrci3ds_r75_rid_to_smplx_vid.pkl
820
+ hhc_contact_general:
821
+ type: hhcgen
822
+ weight:
823
+ - 0.0
824
+ region_aggregation_type: sum
825
+ r2r_dist_type: min
826
+ squared_dist: true
827
+ body_model_utils_folder: essentials/body_model_utils
828
+ model_type: smplx
829
+ pose_prior:
830
+ type: gmm
831
+ weight:
832
+ - 0.0
833
+ prior_folder: essentials/priors
834
+ num_gaussians: 8
835
+ epsilon: 1.0e-16
836
+ use_merged: true
837
+ shape_prior:
838
+ type: l2
839
+ weight:
840
+ - 0.0
841
+ squared: true
842
+ translated: false
843
+ weighted: false
844
+ d1_aggregation: sum
845
+ ground_plane:
846
+ type: l2
847
+ weight:
848
+ - 0.0
849
+ squared: true
850
+ translated: false
851
+ weighted: false
852
+ d1_aggregation: sum
853
+ cmap:
854
+ type: cmap
855
+ weight:
856
+ - 0.0
857
+ r2r_dist_type: test
858
+ cmap_heat_smpl:
859
+ type: ''
860
+ weight:
861
+ - 0.0
862
+ cmap_heat_token:
863
+ type: ''
864
+ weight:
865
+ - 0.0
866
+ cmap_binary_smpl:
867
+ type: ''
868
+ weight:
869
+ - 0.0
870
+ cmap_binary_token:
871
+ type: ''
872
+ weight:
873
+ - 0.0
874
+ diffusion_prior_orient:
875
+ type: l2
876
+ weight:
877
+ - 0.0
878
+ squared: true
879
+ translated: false
880
+ weighted: false
881
+ d1_aggregation: sum
882
+ diffusion_prior_pose:
883
+ type: l2
884
+ weight:
885
+ - 0.0
886
+ squared: true
887
+ translated: false
888
+ weighted: false
889
+ d1_aggregation: sum
890
+ diffusion_prior_shape:
891
+ type: l2
892
+ weight:
893
+ - 0.0
894
+ squared: true
895
+ translated: false
896
+ weighted: false
897
+ d1_aggregation: sum
898
+ diffusion_prior_scale:
899
+ type: l2
900
+ weight:
901
+ - 0.0
902
+ squared: true
903
+ translated: false
904
+ weighted: false
905
+ d1_aggregation: sum
906
+ diffusion_prior_transl:
907
+ type: l2
908
+ weight:
909
+ - 0.0
910
+ squared: true
911
+ translated: false
912
+ weighted: false
913
+ d1_aggregation: sum
914
+ diffusion_prior_v2v:
915
+ type: l2
916
+ weight:
917
+ - 0.0
918
+ squared: true
919
+ translated: false
920
+ weighted: false
921
+ d1_aggregation: sum
922
+ vae_prior_latent:
923
+ type: l2
924
+ weight:
925
+ - 0.0
926
+ squared: true
927
+ translated: false
928
+ weighted: false
929
+ d1_aggregation: sum
930
+ smplifydc:
931
+ use_contact: true
932
+ hhcs:
933
+ use_contact: true
934
+ max_iters:
935
+ - 100
936
+ - 100
937
+ num_prev_steps: 100
938
+ slope_tol: -1.0e-05
939
+ diffusion:
940
+ steps: 1000
941
+ noise_schedule: cosine
942
+ rescale_timesteps: false
943
+ timestep_respacing: ''
944
+ loss_type: custom
945
+ model_mean_type: start_x
946
+ model_var_type: fixed_large
947
+ training:
948
+ max_epochs: 5000
949
+ max_duration: .inf
950
+ num_workers: 8
951
+ pin_memory: false
952
+ shuffle_train: true
953
+ pretrained: ''
954
+ clip_grad_norm: 1.0
955
+ train: true
956
+ eval_val: false
957
+ eval_test: false
958
+ evaluation:
959
+ checkpoint_metric: total_loss
960
+ metrics:
961
+ - v2v
962
+ - mpjpe
963
+ - pa_mpjpe
964
+ - pairwise_pa_mpjpe
965
+ per_person_metrics:
966
+ - v2v
967
+ - mpjpe
968
+ - scale_mpjpe
969
+ - pa_mpjpe
970
+ generative_metrics:
971
+ - gen_diversity
972
+ - gen_fid
973
+ - gen_contact_and_isect
974
+ - gen_tsne
975
+ num_samples: 512
976
+ v2v:
977
+ name: PointError
978
+ alignment: root
979
+ mpjpe:
980
+ name: PointError
981
+ alignment: root
982
+ scale_mpjpe:
983
+ name: PointError
984
+ alignment: scale
985
+ pa_mpjpe:
986
+ name: PointError
987
+ alignment: procrustes
988
+ pairwise_pa_mpjpe:
989
+ name: PointError
990
+ alignment: procrustes
991
+ cmap_dist:
992
+ name: ContactMapDistError
993
+ region_to_vertex: essentials/contact/flickrci3ds_r75_rid_to_smplx_vid.pkl
994
+ cmap_iou:
995
+ name: ContactIOU
996
+ gen_diversity:
997
+ name: GenDiversity
998
+ gen_fid:
999
+ fid_model_path: ''
1000
+ fid_model_cfg: ''
1001
+ name: GenFID
1002
+ gen_contact_and_isect:
1003
+ name: GenContactIsect
1004
+ gen_tsne:
1005
+ name: GentSNE
1006
+ visualization:
1007
+ renderer:
1008
+ type: pytorch3d
1009
+ image_height: 224
1010
+ image_width: 224
1011
+ mesh_color: light_blue
1012
+ pytorch3d:
1013
+ blur_radius: 0.0
1014
+ faces_per_pixel: 5
1015
+ light_location:
1016
+ - - 0.0
1017
+ - 0.0
1018
+ - -0.5
1019
+ plotter:
1020
+ iw: 224
1021
+ ih: 224
1022
+ style:
1023
+ type: latex
1024
+ logging:
1025
+ base_folder: essentials
1026
+ run: buddi
1027
+ images_folder: images
1028
+ summaries_folder: summaries
1029
+ checkpoint_folder: checkpoints
1030
+ validation_folder: validation
1031
+ result_folder: results
1032
+ summaries_freq: 100.0
1033
+ checkpoint_freq: 100.0
1034
+ logger_level: INFO
1035
+ logger: tensorbaord
1036
+ project_name: HumanHumanContactGenerativeModel
1037
+ run_id: rcbq29q7
1038
+ wandb_api_key_path: .wandb/api.txt
essentials/contact/flickrci3ds_r75_rid_to_smplx_vid.pkl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:07ae2f8f1bad92d26d66f820ab1b0675949b513d0711e11fe87a554f7a61e7ba
3
+ size 51427
essentials/priors/gmm_08.pkl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e1374908aae055a2afa01a2cd9a169bc6cfec1ceb7aa590e201a47b383060491
3
+ size 839127