2023-11-02 21:53:57.206 | INFO | mmgpt.model.builder:build_model_tokenizer:85 - LlamaTokenizer(name_or_path='/data/hypertext/yuangpeng/huggingface_cache/models--lmsys--vicuna-7b-v15', vocab_size=32000, model_max_length=2048, is_fast=False, padding_side='right', truncation_side='right', special_tokens={'bos_token': AddedToken("", rstrip=False, lstrip=False, single_word=False, normalized=False), 'eos_token': AddedToken("", rstrip=False, lstrip=False, single_word=False, normalized=False), 'unk_token': AddedToken("", rstrip=False, lstrip=False, single_word=False, normalized=False), 'pad_token': ''}, clean_up_tokenization_spaces=False) 2023-11-02 21:54:00.632 | INFO | mmgpt.model.mmgpt.base_mmgpt:build_vision_tokenizer:52 - CLIPImageProcessor { "crop_size": { "height": 448, "width": 448 }, "do_center_crop": true, "do_convert_rgb": true, "do_normalize": true, "do_rescale": true, "do_resize": true, "feature_extractor_type": "CLIPFeatureExtractor", "image_mean": [ 0.48145466, 0.4578275, 0.40821073 ], "image_processor_type": "CLIPImageProcessor", "image_std": [ 0.26862954, 0.26130258, 0.27577711 ], "resample": 3, "rescale_factor": 0.00392156862745098, "size": { "shortest_edge": 448 } } 2023-11-02 21:54:07.019 | INFO | mmgpt.model.mmgpt.base_mmgpt:build_vision_tokenizer:64 - 2 new tokens are added to be trained. 2023-11-02 21:54:07.145 | INFO | mmgpt.model.builder:build_model_tokenizer:148 - MMGPTLlamaForCausalLM( (model): MMGPTLlamaModel( (embed_tokens): Embedding(32003, 4096) (layers): ModuleList( (0-31): 32 x LlamaDecoderLayer( (self_attn): LlamaAttention( (q_proj): Linear(in_features=4096, out_features=4096, bias=False) (k_proj): Linear(in_features=4096, out_features=4096, bias=False) (v_proj): Linear(in_features=4096, out_features=4096, bias=False) (o_proj): Linear(in_features=4096, out_features=4096, bias=False) (rotary_emb): LlamaRotaryEmbedding() ) (mlp): LlamaMLP( (gate_proj): Linear(in_features=4096, out_features=11008, bias=False) (up_proj): Linear(in_features=4096, out_features=11008, bias=False) (down_proj): Linear(in_features=11008, out_features=4096, bias=False) (act_fn): SiLUActivation() ) (input_layernorm): LlamaRMSNorm() (post_attention_layernorm): LlamaRMSNorm() ) ) (norm): LlamaRMSNorm() (vision_tower): CLIPVisionTower( (vision_tower): CLIPVisionModel( (vision_model): CLIPVisionTransformer( (embeddings): CLIPVisionEmbeddings( (patch_embedding): Conv2d(3, 1024, kernel_size=(14, 14), stride=(14, 14), bias=False) (position_embedding): Embedding(1025, 1024) ) (pre_layrnorm): LayerNorm((1024,), eps=1e-05, elementwise_affine=True) (encoder): CLIPEncoder( (layers): ModuleList( (0-23): 24 x CLIPEncoderLayer( (self_attn): CLIPAttention( (k_proj): Linear(in_features=1024, out_features=1024, bias=True) (v_proj): Linear(in_features=1024, out_features=1024, bias=True) (q_proj): Linear(in_features=1024, out_features=1024, bias=True) (out_proj): Linear(in_features=1024, out_features=1024, bias=True) ) (layer_norm1): LayerNorm((1024,), eps=1e-05, elementwise_affine=True) (mlp): CLIPMLP( (activation_fn): QuickGELUActivation() (fc1): Linear(in_features=1024, out_features=4096, bias=True) (fc2): Linear(in_features=4096, out_features=1024, bias=True) ) (layer_norm2): LayerNorm((1024,), eps=1e-05, elementwise_affine=True) ) ) ) (post_layernorm): LayerNorm((1024,), eps=1e-05, elementwise_affine=True) ) ) ) (projector): ConvProjector( (projector): Conv2d(1024, 4096, kernel_size=(3, 3), stride=(2, 2), padding=(1, 1)) ) ) (lm_head): Linear(in_features=4096, out_features=32003, bias=False) ) 2023-11-02 21:54:20.723 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.embed_tokens.weight 2023-11-02 21:54:20.724 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.0.self_attn.q_proj.weight 2023-11-02 21:54:20.724 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.0.self_attn.k_proj.weight 2023-11-02 21:54:20.724 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.0.self_attn.v_proj.weight 2023-11-02 21:54:20.724 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.0.self_attn.o_proj.weight 2023-11-02 21:54:20.724 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.0.mlp.gate_proj.weight 2023-11-02 21:54:20.725 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.0.mlp.up_proj.weight 2023-11-02 21:54:20.725 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.0.mlp.down_proj.weight 2023-11-02 21:54:20.725 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.0.input_layernorm.weight 2023-11-02 21:54:20.725 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.0.post_attention_layernorm.weight 2023-11-02 21:54:20.726 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.1.self_attn.q_proj.weight 2023-11-02 21:54:20.726 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.1.self_attn.k_proj.weight 2023-11-02 21:54:20.726 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.1.self_attn.v_proj.weight 2023-11-02 21:54:20.726 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.1.self_attn.o_proj.weight 2023-11-02 21:54:20.726 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.1.mlp.gate_proj.weight 2023-11-02 21:54:20.726 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.1.mlp.up_proj.weight 2023-11-02 21:54:20.727 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.1.mlp.down_proj.weight 2023-11-02 21:54:20.727 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.1.input_layernorm.weight 2023-11-02 21:54:20.727 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.1.post_attention_layernorm.weight 2023-11-02 21:54:20.727 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.2.self_attn.q_proj.weight 2023-11-02 21:54:20.727 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.2.self_attn.k_proj.weight 2023-11-02 21:54:20.728 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.2.self_attn.v_proj.weight 2023-11-02 21:54:20.728 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.2.self_attn.o_proj.weight 2023-11-02 21:54:20.728 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.2.mlp.gate_proj.weight 2023-11-02 21:54:20.728 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.2.mlp.up_proj.weight 2023-11-02 21:54:20.728 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.2.mlp.down_proj.weight 2023-11-02 21:54:20.728 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.2.input_layernorm.weight 2023-11-02 21:54:20.729 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.2.post_attention_layernorm.weight 2023-11-02 21:54:20.729 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.3.self_attn.q_proj.weight 2023-11-02 21:54:20.729 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.3.self_attn.k_proj.weight 2023-11-02 21:54:20.729 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.3.self_attn.v_proj.weight 2023-11-02 21:54:20.729 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.3.self_attn.o_proj.weight 2023-11-02 21:54:20.730 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.3.mlp.gate_proj.weight 2023-11-02 21:54:20.730 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.3.mlp.up_proj.weight 2023-11-02 21:54:20.730 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.3.mlp.down_proj.weight 2023-11-02 21:54:20.730 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.3.input_layernorm.weight 2023-11-02 21:54:20.730 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.3.post_attention_layernorm.weight 2023-11-02 21:54:20.730 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.4.self_attn.q_proj.weight 2023-11-02 21:54:20.731 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.4.self_attn.k_proj.weight 2023-11-02 21:54:20.731 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.4.self_attn.v_proj.weight 2023-11-02 21:54:20.731 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.4.self_attn.o_proj.weight 2023-11-02 21:54:20.731 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.4.mlp.gate_proj.weight 2023-11-02 21:54:20.731 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.4.mlp.up_proj.weight 2023-11-02 21:54:20.732 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.4.mlp.down_proj.weight 2023-11-02 21:54:20.732 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.4.input_layernorm.weight 2023-11-02 21:54:20.732 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.4.post_attention_layernorm.weight 2023-11-02 21:54:20.732 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.5.self_attn.q_proj.weight 2023-11-02 21:54:20.732 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.5.self_attn.k_proj.weight 2023-11-02 21:54:20.732 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.5.self_attn.v_proj.weight 2023-11-02 21:54:20.733 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.5.self_attn.o_proj.weight 2023-11-02 21:54:20.733 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.5.mlp.gate_proj.weight 2023-11-02 21:54:20.733 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.5.mlp.up_proj.weight 2023-11-02 21:54:20.733 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.5.mlp.down_proj.weight 2023-11-02 21:54:20.733 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.5.input_layernorm.weight 2023-11-02 21:54:20.734 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.5.post_attention_layernorm.weight 2023-11-02 21:54:20.734 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.6.self_attn.q_proj.weight 2023-11-02 21:54:20.734 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.6.self_attn.k_proj.weight 2023-11-02 21:54:20.734 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.6.self_attn.v_proj.weight 2023-11-02 21:54:20.734 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.6.self_attn.o_proj.weight 2023-11-02 21:54:20.734 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.6.mlp.gate_proj.weight 2023-11-02 21:54:20.735 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.6.mlp.up_proj.weight 2023-11-02 21:54:20.735 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.6.mlp.down_proj.weight 2023-11-02 21:54:20.735 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.6.input_layernorm.weight 2023-11-02 21:54:20.735 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.6.post_attention_layernorm.weight 2023-11-02 21:54:20.735 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.7.self_attn.q_proj.weight 2023-11-02 21:54:20.736 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.7.self_attn.k_proj.weight 2023-11-02 21:54:20.736 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.7.self_attn.v_proj.weight 2023-11-02 21:54:20.736 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.7.self_attn.o_proj.weight 2023-11-02 21:54:20.736 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.7.mlp.gate_proj.weight 2023-11-02 21:54:20.736 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.7.mlp.up_proj.weight 2023-11-02 21:54:20.736 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.7.mlp.down_proj.weight 2023-11-02 21:54:20.737 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.7.input_layernorm.weight 2023-11-02 21:54:20.737 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.7.post_attention_layernorm.weight 2023-11-02 21:54:20.737 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.8.self_attn.q_proj.weight 2023-11-02 21:54:20.737 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.8.self_attn.k_proj.weight 2023-11-02 21:54:20.737 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.8.self_attn.v_proj.weight 2023-11-02 21:54:20.738 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.8.self_attn.o_proj.weight 2023-11-02 21:54:20.738 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.8.mlp.gate_proj.weight 2023-11-02 21:54:20.738 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.8.mlp.up_proj.weight 2023-11-02 21:54:20.738 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.8.mlp.down_proj.weight 2023-11-02 21:54:20.738 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.8.input_layernorm.weight 2023-11-02 21:54:20.739 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.8.post_attention_layernorm.weight 2023-11-02 21:54:20.739 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.9.self_attn.q_proj.weight 2023-11-02 21:54:20.739 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.9.self_attn.k_proj.weight 2023-11-02 21:54:20.739 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.9.self_attn.v_proj.weight 2023-11-02 21:54:20.739 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.9.self_attn.o_proj.weight 2023-11-02 21:54:20.739 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.9.mlp.gate_proj.weight 2023-11-02 21:54:20.740 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.9.mlp.up_proj.weight 2023-11-02 21:54:20.740 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.9.mlp.down_proj.weight 2023-11-02 21:54:20.740 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.9.input_layernorm.weight 2023-11-02 21:54:20.740 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.9.post_attention_layernorm.weight 2023-11-02 21:54:20.740 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.10.self_attn.q_proj.weight 2023-11-02 21:54:20.741 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.10.self_attn.k_proj.weight 2023-11-02 21:54:20.741 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.10.self_attn.v_proj.weight 2023-11-02 21:54:20.741 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.10.self_attn.o_proj.weight 2023-11-02 21:54:20.741 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.10.mlp.gate_proj.weight 2023-11-02 21:54:20.741 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.10.mlp.up_proj.weight 2023-11-02 21:54:20.741 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.10.mlp.down_proj.weight 2023-11-02 21:54:20.742 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.10.input_layernorm.weight 2023-11-02 21:54:20.742 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.10.post_attention_layernorm.weight 2023-11-02 21:54:20.742 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.11.self_attn.q_proj.weight 2023-11-02 21:54:20.742 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.11.self_attn.k_proj.weight 2023-11-02 21:54:20.742 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.11.self_attn.v_proj.weight 2023-11-02 21:54:20.742 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.11.self_attn.o_proj.weight 2023-11-02 21:54:20.743 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.11.mlp.gate_proj.weight 2023-11-02 21:54:20.743 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.11.mlp.up_proj.weight 2023-11-02 21:54:20.743 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.11.mlp.down_proj.weight 2023-11-02 21:54:20.743 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.11.input_layernorm.weight 2023-11-02 21:54:20.743 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.11.post_attention_layernorm.weight 2023-11-02 21:54:20.744 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.12.self_attn.q_proj.weight 2023-11-02 21:54:20.744 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.12.self_attn.k_proj.weight 2023-11-02 21:54:20.744 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.12.self_attn.v_proj.weight 2023-11-02 21:54:20.744 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.12.self_attn.o_proj.weight 2023-11-02 21:54:20.744 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.12.mlp.gate_proj.weight 2023-11-02 21:54:20.744 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.12.mlp.up_proj.weight 2023-11-02 21:54:20.745 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.12.mlp.down_proj.weight 2023-11-02 21:54:20.745 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.12.input_layernorm.weight 2023-11-02 21:54:20.745 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.12.post_attention_layernorm.weight 2023-11-02 21:54:20.745 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.13.self_attn.q_proj.weight 2023-11-02 21:54:20.745 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.13.self_attn.k_proj.weight 2023-11-02 21:54:20.746 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.13.self_attn.v_proj.weight 2023-11-02 21:54:20.746 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.13.self_attn.o_proj.weight 2023-11-02 21:54:20.746 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.13.mlp.gate_proj.weight 2023-11-02 21:54:20.746 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.13.mlp.up_proj.weight 2023-11-02 21:54:20.746 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.13.mlp.down_proj.weight 2023-11-02 21:54:20.746 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.13.input_layernorm.weight 2023-11-02 21:54:20.747 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.13.post_attention_layernorm.weight 2023-11-02 21:54:20.747 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.14.self_attn.q_proj.weight 2023-11-02 21:54:20.747 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.14.self_attn.k_proj.weight 2023-11-02 21:54:20.747 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.14.self_attn.v_proj.weight 2023-11-02 21:54:20.747 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.14.self_attn.o_proj.weight 2023-11-02 21:54:20.748 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.14.mlp.gate_proj.weight 2023-11-02 21:54:20.748 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.14.mlp.up_proj.weight 2023-11-02 21:54:20.748 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.14.mlp.down_proj.weight 2023-11-02 21:54:20.748 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.14.input_layernorm.weight 2023-11-02 21:54:20.748 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.14.post_attention_layernorm.weight 2023-11-02 21:54:20.748 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.15.self_attn.q_proj.weight 2023-11-02 21:54:20.749 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.15.self_attn.k_proj.weight 2023-11-02 21:54:20.749 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.15.self_attn.v_proj.weight 2023-11-02 21:54:20.749 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.15.self_attn.o_proj.weight 2023-11-02 21:54:20.749 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.15.mlp.gate_proj.weight 2023-11-02 21:54:20.749 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.15.mlp.up_proj.weight 2023-11-02 21:54:20.749 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.15.mlp.down_proj.weight 2023-11-02 21:54:20.750 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.15.input_layernorm.weight 2023-11-02 21:54:20.750 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.15.post_attention_layernorm.weight 2023-11-02 21:54:20.750 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.16.self_attn.q_proj.weight 2023-11-02 21:54:20.750 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.16.self_attn.k_proj.weight 2023-11-02 21:54:20.750 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.16.self_attn.v_proj.weight 2023-11-02 21:54:20.750 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.16.self_attn.o_proj.weight 2023-11-02 21:54:20.751 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.16.mlp.gate_proj.weight 2023-11-02 21:54:20.751 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.16.mlp.up_proj.weight 2023-11-02 21:54:20.751 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.16.mlp.down_proj.weight 2023-11-02 21:54:20.751 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.16.input_layernorm.weight 2023-11-02 21:54:20.751 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.16.post_attention_layernorm.weight 2023-11-02 21:54:20.752 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.17.self_attn.q_proj.weight 2023-11-02 21:54:20.752 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.17.self_attn.k_proj.weight 2023-11-02 21:54:20.752 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.17.self_attn.v_proj.weight 2023-11-02 21:54:20.752 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.17.self_attn.o_proj.weight 2023-11-02 21:54:20.752 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.17.mlp.gate_proj.weight 2023-11-02 21:54:20.752 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.17.mlp.up_proj.weight 2023-11-02 21:54:20.753 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.17.mlp.down_proj.weight 2023-11-02 21:54:20.753 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.17.input_layernorm.weight 2023-11-02 21:54:20.753 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.17.post_attention_layernorm.weight 2023-11-02 21:54:20.753 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.18.self_attn.q_proj.weight 2023-11-02 21:54:20.753 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.18.self_attn.k_proj.weight 2023-11-02 21:54:20.753 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.18.self_attn.v_proj.weight 2023-11-02 21:54:20.754 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.18.self_attn.o_proj.weight 2023-11-02 21:54:20.754 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.18.mlp.gate_proj.weight 2023-11-02 21:54:20.754 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.18.mlp.up_proj.weight 2023-11-02 21:54:20.754 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.18.mlp.down_proj.weight 2023-11-02 21:54:20.754 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.18.input_layernorm.weight 2023-11-02 21:54:20.755 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.18.post_attention_layernorm.weight 2023-11-02 21:54:20.755 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.19.self_attn.q_proj.weight 2023-11-02 21:54:20.755 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.19.self_attn.k_proj.weight 2023-11-02 21:54:20.755 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.19.self_attn.v_proj.weight 2023-11-02 21:54:20.755 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.19.self_attn.o_proj.weight 2023-11-02 21:54:20.755 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.19.mlp.gate_proj.weight 2023-11-02 21:54:20.756 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.19.mlp.up_proj.weight 2023-11-02 21:54:20.756 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.19.mlp.down_proj.weight 2023-11-02 21:54:20.756 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.19.input_layernorm.weight 2023-11-02 21:54:20.756 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.19.post_attention_layernorm.weight 2023-11-02 21:54:20.756 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.20.self_attn.q_proj.weight 2023-11-02 21:54:20.756 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.20.self_attn.k_proj.weight 2023-11-02 21:54:20.757 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.20.self_attn.v_proj.weight 2023-11-02 21:54:20.757 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.20.self_attn.o_proj.weight 2023-11-02 21:54:20.757 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.20.mlp.gate_proj.weight 2023-11-02 21:54:20.757 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.20.mlp.up_proj.weight 2023-11-02 21:54:20.757 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.20.mlp.down_proj.weight 2023-11-02 21:54:20.758 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.20.input_layernorm.weight 2023-11-02 21:54:20.758 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.20.post_attention_layernorm.weight 2023-11-02 21:54:20.758 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.21.self_attn.q_proj.weight 2023-11-02 21:54:20.758 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.21.self_attn.k_proj.weight 2023-11-02 21:54:20.758 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.21.self_attn.v_proj.weight 2023-11-02 21:54:20.758 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.21.self_attn.o_proj.weight 2023-11-02 21:54:20.759 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.21.mlp.gate_proj.weight 2023-11-02 21:54:20.759 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.21.mlp.up_proj.weight 2023-11-02 21:54:20.759 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.21.mlp.down_proj.weight 2023-11-02 21:54:20.759 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.21.input_layernorm.weight 2023-11-02 21:54:20.759 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.21.post_attention_layernorm.weight 2023-11-02 21:54:20.759 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.22.self_attn.q_proj.weight 2023-11-02 21:54:20.760 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.22.self_attn.k_proj.weight 2023-11-02 21:54:20.760 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.22.self_attn.v_proj.weight 2023-11-02 21:54:20.760 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.22.self_attn.o_proj.weight 2023-11-02 21:54:20.760 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.22.mlp.gate_proj.weight 2023-11-02 21:54:20.760 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.22.mlp.up_proj.weight 2023-11-02 21:54:20.761 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.22.mlp.down_proj.weight 2023-11-02 21:54:20.761 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.22.input_layernorm.weight 2023-11-02 21:54:20.761 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.22.post_attention_layernorm.weight 2023-11-02 21:54:20.761 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.23.self_attn.q_proj.weight 2023-11-02 21:54:20.761 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.23.self_attn.k_proj.weight 2023-11-02 21:54:20.761 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.23.self_attn.v_proj.weight 2023-11-02 21:54:20.762 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.23.self_attn.o_proj.weight 2023-11-02 21:54:20.762 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.23.mlp.gate_proj.weight 2023-11-02 21:54:20.762 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.23.mlp.up_proj.weight 2023-11-02 21:54:20.762 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.23.mlp.down_proj.weight 2023-11-02 21:54:20.762 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.23.input_layernorm.weight 2023-11-02 21:54:20.762 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.23.post_attention_layernorm.weight 2023-11-02 21:54:20.763 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.24.self_attn.q_proj.weight 2023-11-02 21:54:20.763 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.24.self_attn.k_proj.weight 2023-11-02 21:54:20.763 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.24.self_attn.v_proj.weight 2023-11-02 21:54:20.763 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.24.self_attn.o_proj.weight 2023-11-02 21:54:20.763 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.24.mlp.gate_proj.weight 2023-11-02 21:54:20.764 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.24.mlp.up_proj.weight 2023-11-02 21:54:20.764 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.24.mlp.down_proj.weight 2023-11-02 21:54:20.764 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.24.input_layernorm.weight 2023-11-02 21:54:20.764 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.24.post_attention_layernorm.weight 2023-11-02 21:54:20.764 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.25.self_attn.q_proj.weight 2023-11-02 21:54:20.764 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.25.self_attn.k_proj.weight 2023-11-02 21:54:20.765 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.25.self_attn.v_proj.weight 2023-11-02 21:54:20.765 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.25.self_attn.o_proj.weight 2023-11-02 21:54:20.765 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.25.mlp.gate_proj.weight 2023-11-02 21:54:20.765 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.25.mlp.up_proj.weight 2023-11-02 21:54:20.765 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.25.mlp.down_proj.weight 2023-11-02 21:54:20.765 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.25.input_layernorm.weight 2023-11-02 21:54:20.766 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.25.post_attention_layernorm.weight 2023-11-02 21:54:20.766 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.26.self_attn.q_proj.weight 2023-11-02 21:54:20.766 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.26.self_attn.k_proj.weight 2023-11-02 21:54:20.766 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.26.self_attn.v_proj.weight 2023-11-02 21:54:20.766 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.26.self_attn.o_proj.weight 2023-11-02 21:54:20.767 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.26.mlp.gate_proj.weight 2023-11-02 21:54:20.767 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.26.mlp.up_proj.weight 2023-11-02 21:54:20.767 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.26.mlp.down_proj.weight 2023-11-02 21:54:20.767 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.26.input_layernorm.weight 2023-11-02 21:54:20.767 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.26.post_attention_layernorm.weight 2023-11-02 21:54:20.767 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.27.self_attn.q_proj.weight 2023-11-02 21:54:20.768 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.27.self_attn.k_proj.weight 2023-11-02 21:54:20.768 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.27.self_attn.v_proj.weight 2023-11-02 21:54:20.768 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.27.self_attn.o_proj.weight 2023-11-02 21:54:20.768 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.27.mlp.gate_proj.weight 2023-11-02 21:54:20.768 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.27.mlp.up_proj.weight 2023-11-02 21:54:20.768 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.27.mlp.down_proj.weight 2023-11-02 21:54:20.769 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.27.input_layernorm.weight 2023-11-02 21:54:20.769 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.27.post_attention_layernorm.weight 2023-11-02 21:54:20.769 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.28.self_attn.q_proj.weight 2023-11-02 21:54:20.769 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.28.self_attn.k_proj.weight 2023-11-02 21:54:20.769 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.28.self_attn.v_proj.weight 2023-11-02 21:54:20.770 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.28.self_attn.o_proj.weight 2023-11-02 21:54:20.770 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.28.mlp.gate_proj.weight 2023-11-02 21:54:20.770 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.28.mlp.up_proj.weight 2023-11-02 21:54:20.770 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.28.mlp.down_proj.weight 2023-11-02 21:54:20.770 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.28.input_layernorm.weight 2023-11-02 21:54:20.770 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.28.post_attention_layernorm.weight 2023-11-02 21:54:20.771 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.29.self_attn.q_proj.weight 2023-11-02 21:54:20.771 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.29.self_attn.k_proj.weight 2023-11-02 21:54:20.771 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.29.self_attn.v_proj.weight 2023-11-02 21:54:20.771 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.29.self_attn.o_proj.weight 2023-11-02 21:54:20.771 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.29.mlp.gate_proj.weight 2023-11-02 21:54:20.771 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.29.mlp.up_proj.weight 2023-11-02 21:54:20.772 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.29.mlp.down_proj.weight 2023-11-02 21:54:20.772 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.29.input_layernorm.weight 2023-11-02 21:54:20.772 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.29.post_attention_layernorm.weight 2023-11-02 21:54:20.772 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.30.self_attn.q_proj.weight 2023-11-02 21:54:20.772 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.30.self_attn.k_proj.weight 2023-11-02 21:54:20.773 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.30.self_attn.v_proj.weight 2023-11-02 21:54:20.773 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.30.self_attn.o_proj.weight 2023-11-02 21:54:20.773 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.30.mlp.gate_proj.weight 2023-11-02 21:54:20.773 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.30.mlp.up_proj.weight 2023-11-02 21:54:20.773 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.30.mlp.down_proj.weight 2023-11-02 21:54:20.773 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.30.input_layernorm.weight 2023-11-02 21:54:20.774 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.30.post_attention_layernorm.weight 2023-11-02 21:54:20.774 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.31.self_attn.q_proj.weight 2023-11-02 21:54:20.774 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.31.self_attn.k_proj.weight 2023-11-02 21:54:20.774 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.31.self_attn.v_proj.weight 2023-11-02 21:54:20.774 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.31.self_attn.o_proj.weight 2023-11-02 21:54:20.774 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.31.mlp.gate_proj.weight 2023-11-02 21:54:20.775 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.31.mlp.up_proj.weight 2023-11-02 21:54:20.775 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.31.mlp.down_proj.weight 2023-11-02 21:54:20.775 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.31.input_layernorm.weight 2023-11-02 21:54:20.775 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.layers.31.post_attention_layernorm.weight 2023-11-02 21:54:20.775 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.norm.weight 2023-11-02 21:54:20.775 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.embeddings.class_embedding 2023-11-02 21:54:20.776 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.embeddings.patch_embedding.weight 2023-11-02 21:54:20.776 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.embeddings.position_embedding.weight 2023-11-02 21:54:20.776 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.pre_layrnorm.weight 2023-11-02 21:54:20.776 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.pre_layrnorm.bias 2023-11-02 21:54:20.776 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.0.self_attn.k_proj.weight 2023-11-02 21:54:20.777 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.0.self_attn.k_proj.bias 2023-11-02 21:54:20.777 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.0.self_attn.v_proj.weight 2023-11-02 21:54:20.777 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.0.self_attn.v_proj.bias 2023-11-02 21:54:20.777 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.0.self_attn.q_proj.weight 2023-11-02 21:54:20.777 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.0.self_attn.q_proj.bias 2023-11-02 21:54:20.777 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.0.self_attn.out_proj.weight 2023-11-02 21:54:20.778 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.0.self_attn.out_proj.bias 2023-11-02 21:54:20.778 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.0.layer_norm1.weight 2023-11-02 21:54:20.778 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.0.layer_norm1.bias 2023-11-02 21:54:20.778 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.0.mlp.fc1.weight 2023-11-02 21:54:20.778 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.0.mlp.fc1.bias 2023-11-02 21:54:20.779 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.0.mlp.fc2.weight 2023-11-02 21:54:20.779 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.0.mlp.fc2.bias 2023-11-02 21:54:20.779 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.0.layer_norm2.weight 2023-11-02 21:54:20.779 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.0.layer_norm2.bias 2023-11-02 21:54:20.779 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.1.self_attn.k_proj.weight 2023-11-02 21:54:20.779 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.1.self_attn.k_proj.bias 2023-11-02 21:54:20.780 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.1.self_attn.v_proj.weight 2023-11-02 21:54:20.780 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.1.self_attn.v_proj.bias 2023-11-02 21:54:20.780 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.1.self_attn.q_proj.weight 2023-11-02 21:54:20.780 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.1.self_attn.q_proj.bias 2023-11-02 21:54:20.780 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.1.self_attn.out_proj.weight 2023-11-02 21:54:20.780 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.1.self_attn.out_proj.bias 2023-11-02 21:54:20.781 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.1.layer_norm1.weight 2023-11-02 21:54:20.781 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.1.layer_norm1.bias 2023-11-02 21:54:20.781 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.1.mlp.fc1.weight 2023-11-02 21:54:20.781 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.1.mlp.fc1.bias 2023-11-02 21:54:20.781 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.1.mlp.fc2.weight 2023-11-02 21:54:20.781 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.1.mlp.fc2.bias 2023-11-02 21:54:20.782 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.1.layer_norm2.weight 2023-11-02 21:54:20.782 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.1.layer_norm2.bias 2023-11-02 21:54:20.782 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.2.self_attn.k_proj.weight 2023-11-02 21:54:20.782 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.2.self_attn.k_proj.bias 2023-11-02 21:54:20.782 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.2.self_attn.v_proj.weight 2023-11-02 21:54:20.783 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.2.self_attn.v_proj.bias 2023-11-02 21:54:20.783 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.2.self_attn.q_proj.weight 2023-11-02 21:54:20.783 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.2.self_attn.q_proj.bias 2023-11-02 21:54:20.783 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.2.self_attn.out_proj.weight 2023-11-02 21:54:20.783 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.2.self_attn.out_proj.bias 2023-11-02 21:54:20.783 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.2.layer_norm1.weight 2023-11-02 21:54:20.784 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.2.layer_norm1.bias 2023-11-02 21:54:20.784 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.2.mlp.fc1.weight 2023-11-02 21:54:20.784 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.2.mlp.fc1.bias 2023-11-02 21:54:20.784 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.2.mlp.fc2.weight 2023-11-02 21:54:20.784 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.2.mlp.fc2.bias 2023-11-02 21:54:20.784 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.2.layer_norm2.weight 2023-11-02 21:54:20.785 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.2.layer_norm2.bias 2023-11-02 21:54:20.785 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.3.self_attn.k_proj.weight 2023-11-02 21:54:20.785 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.3.self_attn.k_proj.bias 2023-11-02 21:54:20.785 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.3.self_attn.v_proj.weight 2023-11-02 21:54:20.785 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.3.self_attn.v_proj.bias 2023-11-02 21:54:20.785 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.3.self_attn.q_proj.weight 2023-11-02 21:54:20.786 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.3.self_attn.q_proj.bias 2023-11-02 21:54:20.786 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.3.self_attn.out_proj.weight 2023-11-02 21:54:20.786 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.3.self_attn.out_proj.bias 2023-11-02 21:54:20.786 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.3.layer_norm1.weight 2023-11-02 21:54:20.786 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.3.layer_norm1.bias 2023-11-02 21:54:20.787 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.3.mlp.fc1.weight 2023-11-02 21:54:20.787 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.3.mlp.fc1.bias 2023-11-02 21:54:20.787 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.3.mlp.fc2.weight 2023-11-02 21:54:20.787 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.3.mlp.fc2.bias 2023-11-02 21:54:20.787 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.3.layer_norm2.weight 2023-11-02 21:54:20.787 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.3.layer_norm2.bias 2023-11-02 21:54:20.788 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.4.self_attn.k_proj.weight 2023-11-02 21:54:20.788 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.4.self_attn.k_proj.bias 2023-11-02 21:54:20.788 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.4.self_attn.v_proj.weight 2023-11-02 21:54:20.788 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.4.self_attn.v_proj.bias 2023-11-02 21:54:20.788 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.4.self_attn.q_proj.weight 2023-11-02 21:54:20.788 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.4.self_attn.q_proj.bias 2023-11-02 21:54:20.789 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.4.self_attn.out_proj.weight 2023-11-02 21:54:20.789 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.4.self_attn.out_proj.bias 2023-11-02 21:54:20.789 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.4.layer_norm1.weight 2023-11-02 21:54:20.789 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.4.layer_norm1.bias 2023-11-02 21:54:20.789 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.4.mlp.fc1.weight 2023-11-02 21:54:20.789 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.4.mlp.fc1.bias 2023-11-02 21:54:20.790 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.4.mlp.fc2.weight 2023-11-02 21:54:20.790 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.4.mlp.fc2.bias 2023-11-02 21:54:20.790 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.4.layer_norm2.weight 2023-11-02 21:54:20.790 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.4.layer_norm2.bias 2023-11-02 21:54:20.790 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.5.self_attn.k_proj.weight 2023-11-02 21:54:20.790 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.5.self_attn.k_proj.bias 2023-11-02 21:54:20.791 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.5.self_attn.v_proj.weight 2023-11-02 21:54:20.791 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.5.self_attn.v_proj.bias 2023-11-02 21:54:20.791 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.5.self_attn.q_proj.weight 2023-11-02 21:54:20.791 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.5.self_attn.q_proj.bias 2023-11-02 21:54:20.791 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.5.self_attn.out_proj.weight 2023-11-02 21:54:20.792 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.5.self_attn.out_proj.bias 2023-11-02 21:54:20.792 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.5.layer_norm1.weight 2023-11-02 21:54:20.792 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.5.layer_norm1.bias 2023-11-02 21:54:20.792 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.5.mlp.fc1.weight 2023-11-02 21:54:20.792 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.5.mlp.fc1.bias 2023-11-02 21:54:20.792 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.5.mlp.fc2.weight 2023-11-02 21:54:20.793 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.5.mlp.fc2.bias 2023-11-02 21:54:20.793 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.5.layer_norm2.weight 2023-11-02 21:54:20.793 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.5.layer_norm2.bias 2023-11-02 21:54:20.793 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.6.self_attn.k_proj.weight 2023-11-02 21:54:20.793 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.6.self_attn.k_proj.bias 2023-11-02 21:54:20.793 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.6.self_attn.v_proj.weight 2023-11-02 21:54:20.794 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.6.self_attn.v_proj.bias 2023-11-02 21:54:20.794 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.6.self_attn.q_proj.weight 2023-11-02 21:54:20.794 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.6.self_attn.q_proj.bias 2023-11-02 21:54:20.794 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.6.self_attn.out_proj.weight 2023-11-02 21:54:20.794 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.6.self_attn.out_proj.bias 2023-11-02 21:54:20.794 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.6.layer_norm1.weight 2023-11-02 21:54:20.795 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.6.layer_norm1.bias 2023-11-02 21:54:20.795 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.6.mlp.fc1.weight 2023-11-02 21:54:20.795 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.6.mlp.fc1.bias 2023-11-02 21:54:20.795 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.6.mlp.fc2.weight 2023-11-02 21:54:20.795 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.6.mlp.fc2.bias 2023-11-02 21:54:20.796 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.6.layer_norm2.weight 2023-11-02 21:54:20.796 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.6.layer_norm2.bias 2023-11-02 21:54:20.796 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.7.self_attn.k_proj.weight 2023-11-02 21:54:20.796 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.7.self_attn.k_proj.bias 2023-11-02 21:54:20.796 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.7.self_attn.v_proj.weight 2023-11-02 21:54:20.796 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.7.self_attn.v_proj.bias 2023-11-02 21:54:20.797 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.7.self_attn.q_proj.weight 2023-11-02 21:54:20.797 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.7.self_attn.q_proj.bias 2023-11-02 21:54:20.797 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.7.self_attn.out_proj.weight 2023-11-02 21:54:20.797 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.7.self_attn.out_proj.bias 2023-11-02 21:54:20.797 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.7.layer_norm1.weight 2023-11-02 21:54:20.797 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.7.layer_norm1.bias 2023-11-02 21:54:20.798 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.7.mlp.fc1.weight 2023-11-02 21:54:20.798 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.7.mlp.fc1.bias 2023-11-02 21:54:20.798 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.7.mlp.fc2.weight 2023-11-02 21:54:20.798 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.7.mlp.fc2.bias 2023-11-02 21:54:20.798 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.7.layer_norm2.weight 2023-11-02 21:54:20.798 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.7.layer_norm2.bias 2023-11-02 21:54:20.799 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.8.self_attn.k_proj.weight 2023-11-02 21:54:20.799 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.8.self_attn.k_proj.bias 2023-11-02 21:54:20.799 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.8.self_attn.v_proj.weight 2023-11-02 21:54:20.799 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.8.self_attn.v_proj.bias 2023-11-02 21:54:20.799 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.8.self_attn.q_proj.weight 2023-11-02 21:54:20.799 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.8.self_attn.q_proj.bias 2023-11-02 21:54:20.800 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.8.self_attn.out_proj.weight 2023-11-02 21:54:20.800 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.8.self_attn.out_proj.bias 2023-11-02 21:54:20.800 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.8.layer_norm1.weight 2023-11-02 21:54:20.800 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.8.layer_norm1.bias 2023-11-02 21:54:20.800 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.8.mlp.fc1.weight 2023-11-02 21:54:20.801 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.8.mlp.fc1.bias 2023-11-02 21:54:20.801 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.8.mlp.fc2.weight 2023-11-02 21:54:20.801 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.8.mlp.fc2.bias 2023-11-02 21:54:20.801 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.8.layer_norm2.weight 2023-11-02 21:54:20.801 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.8.layer_norm2.bias 2023-11-02 21:54:20.801 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.9.self_attn.k_proj.weight 2023-11-02 21:54:20.802 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.9.self_attn.k_proj.bias 2023-11-02 21:54:20.802 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.9.self_attn.v_proj.weight 2023-11-02 21:54:20.802 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.9.self_attn.v_proj.bias 2023-11-02 21:54:20.802 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.9.self_attn.q_proj.weight 2023-11-02 21:54:20.802 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.9.self_attn.q_proj.bias 2023-11-02 21:54:20.802 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.9.self_attn.out_proj.weight 2023-11-02 21:54:20.803 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.9.self_attn.out_proj.bias 2023-11-02 21:54:20.803 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.9.layer_norm1.weight 2023-11-02 21:54:20.803 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.9.layer_norm1.bias 2023-11-02 21:54:20.803 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.9.mlp.fc1.weight 2023-11-02 21:54:20.803 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.9.mlp.fc1.bias 2023-11-02 21:54:20.803 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.9.mlp.fc2.weight 2023-11-02 21:54:20.804 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.9.mlp.fc2.bias 2023-11-02 21:54:20.804 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.9.layer_norm2.weight 2023-11-02 21:54:20.804 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.9.layer_norm2.bias 2023-11-02 21:54:20.804 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.10.self_attn.k_proj.weight 2023-11-02 21:54:20.804 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.10.self_attn.k_proj.bias 2023-11-02 21:54:20.804 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.10.self_attn.v_proj.weight 2023-11-02 21:54:20.805 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.10.self_attn.v_proj.bias 2023-11-02 21:54:20.805 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.10.self_attn.q_proj.weight 2023-11-02 21:54:20.805 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.10.self_attn.q_proj.bias 2023-11-02 21:54:20.805 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.10.self_attn.out_proj.weight 2023-11-02 21:54:20.805 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.10.self_attn.out_proj.bias 2023-11-02 21:54:20.806 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.10.layer_norm1.weight 2023-11-02 21:54:20.806 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.10.layer_norm1.bias 2023-11-02 21:54:20.806 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.10.mlp.fc1.weight 2023-11-02 21:54:20.806 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.10.mlp.fc1.bias 2023-11-02 21:54:20.806 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.10.mlp.fc2.weight 2023-11-02 21:54:20.806 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.10.mlp.fc2.bias 2023-11-02 21:54:20.807 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.10.layer_norm2.weight 2023-11-02 21:54:20.807 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.10.layer_norm2.bias 2023-11-02 21:54:20.807 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.11.self_attn.k_proj.weight 2023-11-02 21:54:20.807 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.11.self_attn.k_proj.bias 2023-11-02 21:54:20.807 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.11.self_attn.v_proj.weight 2023-11-02 21:54:20.807 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.11.self_attn.v_proj.bias 2023-11-02 21:54:20.808 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.11.self_attn.q_proj.weight 2023-11-02 21:54:20.808 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.11.self_attn.q_proj.bias 2023-11-02 21:54:20.808 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.11.self_attn.out_proj.weight 2023-11-02 21:54:20.808 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.11.self_attn.out_proj.bias 2023-11-02 21:54:20.808 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.11.layer_norm1.weight 2023-11-02 21:54:20.808 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.11.layer_norm1.bias 2023-11-02 21:54:20.809 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.11.mlp.fc1.weight 2023-11-02 21:54:20.809 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.11.mlp.fc1.bias 2023-11-02 21:54:20.809 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.11.mlp.fc2.weight 2023-11-02 21:54:20.809 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.11.mlp.fc2.bias 2023-11-02 21:54:20.809 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.11.layer_norm2.weight 2023-11-02 21:54:20.809 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.11.layer_norm2.bias 2023-11-02 21:54:20.810 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.12.self_attn.k_proj.weight 2023-11-02 21:54:20.810 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.12.self_attn.k_proj.bias 2023-11-02 21:54:20.810 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.12.self_attn.v_proj.weight 2023-11-02 21:54:20.810 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.12.self_attn.v_proj.bias 2023-11-02 21:54:20.810 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.12.self_attn.q_proj.weight 2023-11-02 21:54:20.811 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.12.self_attn.q_proj.bias 2023-11-02 21:54:20.811 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.12.self_attn.out_proj.weight 2023-11-02 21:54:20.811 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.12.self_attn.out_proj.bias 2023-11-02 21:54:20.811 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.12.layer_norm1.weight 2023-11-02 21:54:20.811 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.12.layer_norm1.bias 2023-11-02 21:54:20.811 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.12.mlp.fc1.weight 2023-11-02 21:54:20.812 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.12.mlp.fc1.bias 2023-11-02 21:54:20.812 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.12.mlp.fc2.weight 2023-11-02 21:54:20.812 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.12.mlp.fc2.bias 2023-11-02 21:54:20.812 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.12.layer_norm2.weight 2023-11-02 21:54:20.812 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.12.layer_norm2.bias 2023-11-02 21:54:20.812 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.13.self_attn.k_proj.weight 2023-11-02 21:54:20.813 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.13.self_attn.k_proj.bias 2023-11-02 21:54:20.813 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.13.self_attn.v_proj.weight 2023-11-02 21:54:20.813 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.13.self_attn.v_proj.bias 2023-11-02 21:54:20.813 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.13.self_attn.q_proj.weight 2023-11-02 21:54:20.813 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.13.self_attn.q_proj.bias 2023-11-02 21:54:20.814 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.13.self_attn.out_proj.weight 2023-11-02 21:54:20.814 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.13.self_attn.out_proj.bias 2023-11-02 21:54:20.814 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.13.layer_norm1.weight 2023-11-02 21:54:20.814 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.13.layer_norm1.bias 2023-11-02 21:54:20.814 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.13.mlp.fc1.weight 2023-11-02 21:54:20.814 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.13.mlp.fc1.bias 2023-11-02 21:54:20.815 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.13.mlp.fc2.weight 2023-11-02 21:54:20.815 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.13.mlp.fc2.bias 2023-11-02 21:54:20.815 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.13.layer_norm2.weight 2023-11-02 21:54:20.815 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.13.layer_norm2.bias 2023-11-02 21:54:20.815 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.14.self_attn.k_proj.weight 2023-11-02 21:54:20.815 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.14.self_attn.k_proj.bias 2023-11-02 21:54:20.816 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.14.self_attn.v_proj.weight 2023-11-02 21:54:20.816 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.14.self_attn.v_proj.bias 2023-11-02 21:54:20.816 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.14.self_attn.q_proj.weight 2023-11-02 21:54:20.816 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.14.self_attn.q_proj.bias 2023-11-02 21:54:20.816 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.14.self_attn.out_proj.weight 2023-11-02 21:54:20.816 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.14.self_attn.out_proj.bias 2023-11-02 21:54:20.817 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.14.layer_norm1.weight 2023-11-02 21:54:20.817 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.14.layer_norm1.bias 2023-11-02 21:54:20.817 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.14.mlp.fc1.weight 2023-11-02 21:54:20.817 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.14.mlp.fc1.bias 2023-11-02 21:54:20.817 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.14.mlp.fc2.weight 2023-11-02 21:54:20.817 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.14.mlp.fc2.bias 2023-11-02 21:54:20.818 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.14.layer_norm2.weight 2023-11-02 21:54:20.818 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.14.layer_norm2.bias 2023-11-02 21:54:20.818 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.15.self_attn.k_proj.weight 2023-11-02 21:54:20.818 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.15.self_attn.k_proj.bias 2023-11-02 21:54:20.818 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.15.self_attn.v_proj.weight 2023-11-02 21:54:20.819 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.15.self_attn.v_proj.bias 2023-11-02 21:54:20.819 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.15.self_attn.q_proj.weight 2023-11-02 21:54:20.819 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.15.self_attn.q_proj.bias 2023-11-02 21:54:20.819 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.15.self_attn.out_proj.weight 2023-11-02 21:54:20.819 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.15.self_attn.out_proj.bias 2023-11-02 21:54:20.819 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.15.layer_norm1.weight 2023-11-02 21:54:20.820 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.15.layer_norm1.bias 2023-11-02 21:54:20.820 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.15.mlp.fc1.weight 2023-11-02 21:54:20.820 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.15.mlp.fc1.bias 2023-11-02 21:54:20.820 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.15.mlp.fc2.weight 2023-11-02 21:54:20.820 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.15.mlp.fc2.bias 2023-11-02 21:54:20.820 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.15.layer_norm2.weight 2023-11-02 21:54:20.821 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.15.layer_norm2.bias 2023-11-02 21:54:20.821 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.16.self_attn.k_proj.weight 2023-11-02 21:54:20.821 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.16.self_attn.k_proj.bias 2023-11-02 21:54:20.821 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.16.self_attn.v_proj.weight 2023-11-02 21:54:20.821 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.16.self_attn.v_proj.bias 2023-11-02 21:54:20.821 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.16.self_attn.q_proj.weight 2023-11-02 21:54:20.822 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.16.self_attn.q_proj.bias 2023-11-02 21:54:20.822 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.16.self_attn.out_proj.weight 2023-11-02 21:54:20.822 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.16.self_attn.out_proj.bias 2023-11-02 21:54:20.822 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.16.layer_norm1.weight 2023-11-02 21:54:20.822 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.16.layer_norm1.bias 2023-11-02 21:54:20.823 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.16.mlp.fc1.weight 2023-11-02 21:54:20.823 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.16.mlp.fc1.bias 2023-11-02 21:54:20.823 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.16.mlp.fc2.weight 2023-11-02 21:54:20.823 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.16.mlp.fc2.bias 2023-11-02 21:54:20.823 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.16.layer_norm2.weight 2023-11-02 21:54:20.823 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.16.layer_norm2.bias 2023-11-02 21:54:20.824 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.17.self_attn.k_proj.weight 2023-11-02 21:54:20.824 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.17.self_attn.k_proj.bias 2023-11-02 21:54:20.824 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.17.self_attn.v_proj.weight 2023-11-02 21:54:20.824 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.17.self_attn.v_proj.bias 2023-11-02 21:54:20.824 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.17.self_attn.q_proj.weight 2023-11-02 21:54:20.824 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.17.self_attn.q_proj.bias 2023-11-02 21:54:20.825 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.17.self_attn.out_proj.weight 2023-11-02 21:54:20.825 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.17.self_attn.out_proj.bias 2023-11-02 21:54:20.825 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.17.layer_norm1.weight 2023-11-02 21:54:20.825 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.17.layer_norm1.bias 2023-11-02 21:54:20.825 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.17.mlp.fc1.weight 2023-11-02 21:54:20.825 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.17.mlp.fc1.bias 2023-11-02 21:54:20.826 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.17.mlp.fc2.weight 2023-11-02 21:54:20.826 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.17.mlp.fc2.bias 2023-11-02 21:54:20.826 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.17.layer_norm2.weight 2023-11-02 21:54:20.826 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.17.layer_norm2.bias 2023-11-02 21:54:20.826 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.18.self_attn.k_proj.weight 2023-11-02 21:54:20.826 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.18.self_attn.k_proj.bias 2023-11-02 21:54:20.827 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.18.self_attn.v_proj.weight 2023-11-02 21:54:20.827 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.18.self_attn.v_proj.bias 2023-11-02 21:54:20.827 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.18.self_attn.q_proj.weight 2023-11-02 21:54:20.827 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.18.self_attn.q_proj.bias 2023-11-02 21:54:20.827 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.18.self_attn.out_proj.weight 2023-11-02 21:54:20.827 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.18.self_attn.out_proj.bias 2023-11-02 21:54:20.828 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.18.layer_norm1.weight 2023-11-02 21:54:20.828 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.18.layer_norm1.bias 2023-11-02 21:54:20.828 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.18.mlp.fc1.weight 2023-11-02 21:54:20.828 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.18.mlp.fc1.bias 2023-11-02 21:54:20.828 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.18.mlp.fc2.weight 2023-11-02 21:54:20.828 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.18.mlp.fc2.bias 2023-11-02 21:54:20.829 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.18.layer_norm2.weight 2023-11-02 21:54:20.829 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.18.layer_norm2.bias 2023-11-02 21:54:20.829 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.19.self_attn.k_proj.weight 2023-11-02 21:54:20.829 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.19.self_attn.k_proj.bias 2023-11-02 21:54:20.829 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.19.self_attn.v_proj.weight 2023-11-02 21:54:20.830 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.19.self_attn.v_proj.bias 2023-11-02 21:54:20.830 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.19.self_attn.q_proj.weight 2023-11-02 21:54:20.830 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.19.self_attn.q_proj.bias 2023-11-02 21:54:20.830 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.19.self_attn.out_proj.weight 2023-11-02 21:54:20.830 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.19.self_attn.out_proj.bias 2023-11-02 21:54:20.830 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.19.layer_norm1.weight 2023-11-02 21:54:20.831 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.19.layer_norm1.bias 2023-11-02 21:54:20.831 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.19.mlp.fc1.weight 2023-11-02 21:54:20.831 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.19.mlp.fc1.bias 2023-11-02 21:54:20.831 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.19.mlp.fc2.weight 2023-11-02 21:54:20.831 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.19.mlp.fc2.bias 2023-11-02 21:54:20.831 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.19.layer_norm2.weight 2023-11-02 21:54:20.832 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.19.layer_norm2.bias 2023-11-02 21:54:20.832 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.20.self_attn.k_proj.weight 2023-11-02 21:54:20.832 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.20.self_attn.k_proj.bias 2023-11-02 21:54:20.832 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.20.self_attn.v_proj.weight 2023-11-02 21:54:20.832 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.20.self_attn.v_proj.bias 2023-11-02 21:54:20.832 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.20.self_attn.q_proj.weight 2023-11-02 21:54:20.833 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.20.self_attn.q_proj.bias 2023-11-02 21:54:20.833 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.20.self_attn.out_proj.weight 2023-11-02 21:54:20.833 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.20.self_attn.out_proj.bias 2023-11-02 21:54:20.833 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.20.layer_norm1.weight 2023-11-02 21:54:20.833 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.20.layer_norm1.bias 2023-11-02 21:54:20.833 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.20.mlp.fc1.weight 2023-11-02 21:54:20.834 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.20.mlp.fc1.bias 2023-11-02 21:54:20.834 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.20.mlp.fc2.weight 2023-11-02 21:54:20.834 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.20.mlp.fc2.bias 2023-11-02 21:54:20.834 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.20.layer_norm2.weight 2023-11-02 21:54:20.834 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.20.layer_norm2.bias 2023-11-02 21:54:20.835 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.21.self_attn.k_proj.weight 2023-11-02 21:54:20.835 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.21.self_attn.k_proj.bias 2023-11-02 21:54:20.835 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.21.self_attn.v_proj.weight 2023-11-02 21:54:20.835 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.21.self_attn.v_proj.bias 2023-11-02 21:54:20.835 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.21.self_attn.q_proj.weight 2023-11-02 21:54:20.835 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.21.self_attn.q_proj.bias 2023-11-02 21:54:20.836 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.21.self_attn.out_proj.weight 2023-11-02 21:54:20.836 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.21.self_attn.out_proj.bias 2023-11-02 21:54:20.836 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.21.layer_norm1.weight 2023-11-02 21:54:20.836 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.21.layer_norm1.bias 2023-11-02 21:54:20.836 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.21.mlp.fc1.weight 2023-11-02 21:54:20.836 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.21.mlp.fc1.bias 2023-11-02 21:54:20.837 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.21.mlp.fc2.weight 2023-11-02 21:54:20.837 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.21.mlp.fc2.bias 2023-11-02 21:54:20.837 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.21.layer_norm2.weight 2023-11-02 21:54:20.837 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.21.layer_norm2.bias 2023-11-02 21:54:20.837 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.22.self_attn.k_proj.weight 2023-11-02 21:54:20.837 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.22.self_attn.k_proj.bias 2023-11-02 21:54:20.838 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.22.self_attn.v_proj.weight 2023-11-02 21:54:20.838 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.22.self_attn.v_proj.bias 2023-11-02 21:54:20.838 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.22.self_attn.q_proj.weight 2023-11-02 21:54:20.838 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.22.self_attn.q_proj.bias 2023-11-02 21:54:20.838 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.22.self_attn.out_proj.weight 2023-11-02 21:54:20.838 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.22.self_attn.out_proj.bias 2023-11-02 21:54:20.839 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.22.layer_norm1.weight 2023-11-02 21:54:20.839 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.22.layer_norm1.bias 2023-11-02 21:54:20.839 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.22.mlp.fc1.weight 2023-11-02 21:54:20.839 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.22.mlp.fc1.bias 2023-11-02 21:54:20.839 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.22.mlp.fc2.weight 2023-11-02 21:54:20.839 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.22.mlp.fc2.bias 2023-11-02 21:54:20.840 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.22.layer_norm2.weight 2023-11-02 21:54:20.840 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.22.layer_norm2.bias 2023-11-02 21:54:20.840 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.23.self_attn.k_proj.weight 2023-11-02 21:54:20.840 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.23.self_attn.k_proj.bias 2023-11-02 21:54:20.840 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.23.self_attn.v_proj.weight 2023-11-02 21:54:20.840 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.23.self_attn.v_proj.bias 2023-11-02 21:54:20.841 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.23.self_attn.q_proj.weight 2023-11-02 21:54:20.841 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.23.self_attn.q_proj.bias 2023-11-02 21:54:20.841 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.23.self_attn.out_proj.weight 2023-11-02 21:54:20.841 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.23.self_attn.out_proj.bias 2023-11-02 21:54:20.841 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.23.layer_norm1.weight 2023-11-02 21:54:20.842 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.23.layer_norm1.bias 2023-11-02 21:54:20.842 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.23.mlp.fc1.weight 2023-11-02 21:54:20.842 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.23.mlp.fc1.bias 2023-11-02 21:54:20.842 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.23.mlp.fc2.weight 2023-11-02 21:54:20.842 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.23.mlp.fc2.bias 2023-11-02 21:54:20.842 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.23.layer_norm2.weight 2023-11-02 21:54:20.843 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.encoder.layers.23.layer_norm2.bias 2023-11-02 21:54:20.843 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.post_layernorm.weight 2023-11-02 21:54:20.843 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.vision_tower.vision_tower.vision_model.post_layernorm.bias 2023-11-02 21:54:20.843 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.projector.projector.weight 2023-11-02 21:54:20.843 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: model.projector.projector.bias 2023-11-02 21:54:20.843 | INFO | mmgpt.utils.logger:log_model_parameters:194 - -> Trainable Parameters: lm_head.weight 2023-11-02 21:54:20.847 | INFO | mmgpt.utils.logger:log_model_parameters:199 - >> Total params: 6752.17M 2023-11-02 21:54:20.847 | INFO | mmgpt.utils.logger:log_model_parameters:200 - >> Train params: 6752.17M, Ratio 100.00% 2023-11-02 21:54:20.864 | INFO | mmgpt.data.dataset.pair_webdataset:__init__:53 - 1666666 interleaved (6-merged) image-text pairs (splitted to 48 workers) are sampled from dataset: laion2b_10m_6merge. 2023-11-02 21:54:21.089 | INFO | mmgpt.data.dataset.pair_webdataset:__init__:53 - 833333 interleaved (6-merged) image-text pairs (splitted to 48 workers) are sampled from dataset: grit_5m_6merge. 2023-11-02 21:54:21.099 | INFO | mmgpt.data.dataset.interpair_webdataset:__init__:51 - 500000 interleaved (2-merged) image-text pairs (splitted to 48 workers) are sampled from dataset: track_1m_v1_2merge. 2023-11-02 21:54:21.109 | INFO | mmgpt.data.dataset.interpair_webdataset:__init__:51 - 1250000 interleaved (4-merged) image-text pairs (splitted to 48 workers) are sampled from dataset: det_5m_v1_en_4merge. 2023-11-02 21:54:21.110 | INFO | mmgpt.data.builder:build_dataloader:65 - After processing, totally 4249999 samples are involved. 2023-11-02 21:54:21.264 | INFO | mmgpt.engine.train.trainer:create_optimizer:62 - ->> Number of Optimizer Groups: 50 2023-11-02 21:54:21.265 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 0: 233 groups of parameters maintains a learning rate of 5e-05 2023-11-02 21:54:21.265 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 1: 2 groups of parameters maintains a learning rate of 5e-06 2023-11-02 21:54:21.265 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 2: 6 groups of parameters maintains a learning rate of 4.923854510918059e-06 2023-11-02 21:54:21.265 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 3: 6 groups of parameters maintains a learning rate of 5.470949456575621e-06 2023-11-02 21:54:21.265 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 4: 6 groups of parameters maintains a learning rate of 6.078832729528468e-06 2023-11-02 21:54:21.266 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 5: 6 groups of parameters maintains a learning rate of 6.7542585883649645e-06 2023-11-02 21:54:21.266 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 6: 6 groups of parameters maintains a learning rate of 7.504731764849959e-06 2023-11-02 21:54:21.266 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 7: 6 groups of parameters maintains a learning rate of 8.338590849833288e-06 2023-11-02 21:54:21.266 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 8: 6 groups of parameters maintains a learning rate of 9.265100944259208e-06 2023-11-02 21:54:21.266 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 9: 6 groups of parameters maintains a learning rate of 1.0294556604732453e-05 2023-11-02 21:54:21.266 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 10: 6 groups of parameters maintains a learning rate of 1.1438396227480504e-05 2023-11-02 21:54:21.267 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 11: 6 groups of parameters maintains a learning rate of 1.2709329141645005e-05 2023-11-02 21:54:21.267 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 12: 6 groups of parameters maintains a learning rate of 1.4121476824050005e-05 2023-11-02 21:54:21.267 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 13: 6 groups of parameters maintains a learning rate of 1.5690529804500005e-05 2023-11-02 21:54:21.267 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 14: 6 groups of parameters maintains a learning rate of 1.7433922005000004e-05 2023-11-02 21:54:21.267 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 15: 6 groups of parameters maintains a learning rate of 1.9371024450000006e-05 2023-11-02 21:54:21.267 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 16: 6 groups of parameters maintains a learning rate of 2.1523360500000007e-05 2023-11-02 21:54:21.268 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 17: 6 groups of parameters maintains a learning rate of 2.3914845000000007e-05 2023-11-02 21:54:21.268 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 18: 6 groups of parameters maintains a learning rate of 2.6572050000000003e-05 2023-11-02 21:54:21.268 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 19: 6 groups of parameters maintains a learning rate of 2.9524500000000005e-05 2023-11-02 21:54:21.268 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 20: 6 groups of parameters maintains a learning rate of 3.2805e-05 2023-11-02 21:54:21.268 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 21: 6 groups of parameters maintains a learning rate of 3.6450000000000005e-05 2023-11-02 21:54:21.268 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 22: 6 groups of parameters maintains a learning rate of 4.05e-05 2023-11-02 21:54:21.269 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 23: 6 groups of parameters maintains a learning rate of 4.5e-05 2023-11-02 21:54:21.269 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 24: 6 groups of parameters maintains a learning rate of 5.555555555555556e-05 2023-11-02 21:54:21.269 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 25: 76 groups of parameters maintains a learning rate of 5e-05 2023-11-02 21:54:21.269 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 26: 5 groups of parameters maintains a learning rate of 5e-06 2023-11-02 21:54:21.269 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 27: 10 groups of parameters maintains a learning rate of 4.923854510918059e-06 2023-11-02 21:54:21.269 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 28: 10 groups of parameters maintains a learning rate of 5.470949456575621e-06 2023-11-02 21:54:21.270 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 29: 10 groups of parameters maintains a learning rate of 6.078832729528468e-06 2023-11-02 21:54:21.270 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 30: 10 groups of parameters maintains a learning rate of 6.7542585883649645e-06 2023-11-02 21:54:21.270 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 31: 10 groups of parameters maintains a learning rate of 7.504731764849959e-06 2023-11-02 21:54:21.270 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 32: 10 groups of parameters maintains a learning rate of 8.338590849833288e-06 2023-11-02 21:54:21.270 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 33: 10 groups of parameters maintains a learning rate of 9.265100944259208e-06 2023-11-02 21:54:21.270 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 34: 10 groups of parameters maintains a learning rate of 1.0294556604732453e-05 2023-11-02 21:54:21.271 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 35: 10 groups of parameters maintains a learning rate of 1.1438396227480504e-05 2023-11-02 21:54:21.271 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 36: 10 groups of parameters maintains a learning rate of 1.2709329141645005e-05 2023-11-02 21:54:21.271 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 37: 10 groups of parameters maintains a learning rate of 1.4121476824050005e-05 2023-11-02 21:54:21.271 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 38: 10 groups of parameters maintains a learning rate of 1.5690529804500005e-05 2023-11-02 21:54:21.271 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 39: 10 groups of parameters maintains a learning rate of 1.7433922005000004e-05 2023-11-02 21:54:21.271 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 40: 10 groups of parameters maintains a learning rate of 1.9371024450000006e-05 2023-11-02 21:54:21.272 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 41: 10 groups of parameters maintains a learning rate of 2.1523360500000007e-05 2023-11-02 21:54:21.272 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 42: 10 groups of parameters maintains a learning rate of 2.3914845000000007e-05 2023-11-02 21:54:21.272 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 43: 10 groups of parameters maintains a learning rate of 2.6572050000000003e-05 2023-11-02 21:54:21.272 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 44: 10 groups of parameters maintains a learning rate of 2.9524500000000005e-05 2023-11-02 21:54:21.272 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 45: 10 groups of parameters maintains a learning rate of 3.2805e-05 2023-11-02 21:54:21.273 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 46: 10 groups of parameters maintains a learning rate of 3.6450000000000005e-05 2023-11-02 21:54:21.273 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 47: 10 groups of parameters maintains a learning rate of 4.05e-05 2023-11-02 21:54:21.273 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 48: 10 groups of parameters maintains a learning rate of 4.5e-05 2023-11-02 21:54:21.273 | INFO | mmgpt.engine.train.trainer:create_optimizer:64 - *********>> 49: 10 groups of parameters maintains a learning rate of 5.555555555555556e-05 2023-11-02 21:54:32.290 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 21:54:32.290 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'shelf:[001, 306, 261, 999].') 2023-11-02 21:54:36.938 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 21:54:36.938 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Backpack:[760, 116, 999, 764];Cup:[002, 507, 047, 739];Person:[150, 002, 829, 1000];Necklace:[466, 455, 593, 728];Potted Plant:[808, 002, 916, 091];Chair:[764, 117, 999, 999];Cabinet:[002, 002, 276, 734];Desk:[002, 226, 264, 946];Laptop:[002, 382, 134, 622].') 2023-11-02 21:55:11.374 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 21:55:11.375 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'caption: Prem Chandra [171, 012, 494, 931] demonstrates how she sprays down furniture [436, 347, 997, 996] with an atomized disinfectant [390, 274, 531, 442] using the Clorox 360 machine [015, 507, 396, 997].') 2023-11-02 21:55:49.860 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 21:55:49.861 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[559, 566, 706, 805] A father and [606, 621, 710, 792] daughter sit on [510, 777, 852, 997] rocks looking out over [000, 481, 998, 882] a loch') 2023-11-02 21:56:17.643 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 21:56:17.644 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Street Lights,Ambulance and Car in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Street Lights:[027, 048, 116, 358],[192, 127, 252, 347],[138, 089, 211, 273],[238, 164, 294, 286],[282, 193, 335, 264];Ambulance:[265, 201, 658, 744];Car:[830, 397, 1000, 661],[608, 344, 731, 487],[145, 284, 188, 347],[000, 283, 037, 359].') 2023-11-02 21:56:26.185 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 21:56:26.185 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[646, 314, 731, 596] and track its trajectory.When detailing trajectories in your response, adhere to the Frame t:[xmin,ymin,xmax,ymax] format.', 'This is a personFrame1:[646, 314, 731, 596];Frame2:[180, 269, 266, 502];Frame3:[075, 339, 159, 552];Frame4:[204, 347, 272, 591].') 2023-11-02 21:57:02.140 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 21:57:02.141 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[129, 482, 177, 585],carFrame1:[000, 394, 114, 707],carFrame1:[363, 481, 426, 581],carFrame1:[061, 430, 163, 632],carFrame1:[224, 489, 256, 545] in this video clip.To respond correctly, utilize the specified classFrame t:[xmin,ymin,xmax,ymax] format.', 'carFrame1:[129, 482, 177, 585];Frame2:[081, 497, 144, 637],carFrame1[000, 394, 114, 707];Frame2:No object,carFrame1:[363, 481, 426, 581];Frame2:[364, 484, 436, 593],carFrame1:[061, 430, 163, 632];Frame2:[000, 410, 097, 711],carFrame1:[224, 489, 256, 545];Frame2:[214, 494, 251, 556].') 2023-11-02 21:57:09.808 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 21:57:09.809 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the step backward and stumble and fall?Maintain consistency in your answer by using the TrackiFrame t:[xmin,ymin,xmax,ymax] format for trajectories.', 'There is one step backward and stumble and fall.Track1frame:1:[696, 250, 908, 533];frame:2:[790, 254, 961, 548];frame:3:[803, 261, 957, 559].') 2023-11-02 21:57:18.556 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 21:57:18.556 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Car:[028, 129, 963, 961],[753, 080, 999, 267];SUV:[054, 232, 093, 315];Desk:[138, 271, 163, 304],[163, 266, 176, 304],[213, 268, 243, 308],[253, 262, 271, 300];Chair:[388, 326, 429, 364];Person:[000, 264, 043, 477],[031, 239, 089, 451],[115, 241, 132, 308],[151, 228, 169, 267],[668, 122, 685, 204],[686, 110, 697, 134],[687, 103, 714, 223],[716, 061, 764, 216],[753, 057, 803, 275],[787, 054, 826, 280],[809, 035, 852, 285],[964, 016, 1000, 309],[935, 039, 977, 096],[338, 319, 351, 366],[371, 324, 390, 365],[404, 301, 415, 336],[621, 135, 637, 170];Bracelet:[817, 157, 826, 165];Tent:[000, 000, 998, 331];Sneakers:[967, 282, 990, 309],[028, 445, 043, 461],[000, 455, 012, 477],[054, 423, 066, 451],[076, 423, 089, 438];Glasses:[789, 071, 799, 079];Hat:[716, 087, 730, 104],[037, 239, 056, 256].') 2023-11-02 21:57:34.477 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 21:57:34.477 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[235, 129, 715, 827] and track its trajectory.For clarity, represent trajectories using the Frame t:[xmin,ymin,xmax,ymax] format in your response.', 'This is a iceboatFrame1:[235, 129, 715, 827];Frame2:[470, 216, 835, 739];Frame3:[395, 339, 667, 741];Frame4:[370, 360, 626, 739];Frame5:[350, 362, 595, 731].') 2023-11-02 21:57:40.356 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 21:57:40.356 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[473, 623, 1000, 1000] and track its trajectory.Use the specified Frame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'This is a bicycle by a man on the road with other bicyclesFrame1:[473, 623, 1000, 1000];Frame2:[432, 637, 995, 1000];Frame3:[371, 544, 1000, 1000];Frame4:[332, 698, 1000, 1000].') 2023-11-02 21:57:45.530 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 21:57:45.531 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[080, 437, 148, 546],[469, 301, 501, 447],[645, 334, 794, 847],[776, 372, 903, 730],[859, 599, 999, 999].') 2023-11-02 21:57:52.006 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 21:57:52.007 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'person:[421, 171, 540, 387],[239, 183, 349, 382];wheel:[120, 451, 182, 898],[147, 477, 211, 957],[557, 703, 623, 926],[591, 369, 651, 584],[616, 461, 695, 965];tire:[118, 805, 146, 917],[143, 565, 207, 960],[570, 788, 622, 926],[575, 368, 645, 614],[626, 557, 693, 965];car:[120, 193, 705, 967].') 2023-11-02 21:58:53.933 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 21:58:53.934 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Helmet:[556, 054, 625, 108],[087, 002, 128, 071],[356, 094, 480, 249];Backpack:[509, 002, 653, 075];Motorcycle:[007, 002, 610, 1000],[418, 002, 968, 715],[019, 002, 275, 238].') 2023-11-02 21:59:02.027 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 21:59:02.028 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'One hispanic and one african american males, on [005, 909, 991, 996] a roof wearing [657, 002, 758, 113] [278, 342, 392, 482] hardhats. [121, 169, 461, 998] One man is waving and [501, 003, 779, 997] the other man is holding [469, 478, 589, 649] a drill. Both are wearing [110, 496, 373, 773] [550, 111, 730, 460] saftey harnesses and smiling.') 2023-11-02 21:59:08.745 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 21:59:08.745 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'table:[510, 711, 713, 965];chair:[007, 595, 375, 999];woman:[742, 368, 926, 999];footwear:[523, 857, 595, 905];window:[388, 255, 551, 765],[830, 177, 999, 745].') 2023-11-02 21:59:51.246 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 21:59:51.247 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'flag:[029, 000, 155, 779];man:[000, 591, 168, 998],[162, 597, 349, 998],[334, 632, 480, 994],[400, 163, 620, 998],[606, 884, 647, 998],[636, 819, 731, 998],[724, 878, 780, 998],[752, 813, 869, 998],[839, 206, 998, 998];clothing:[000, 675, 151, 998],[157, 681, 338, 998],[325, 697, 457, 998],[400, 263, 608, 998],[632, 884, 745, 998],[764, 880, 863, 998],[862, 302, 998, 998];human face:[224, 632, 259, 703],[391, 638, 417, 701],[863, 218, 925, 326].') 2023-11-02 21:59:59.522 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 21:59:59.523 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[534, 410, 986, 975],[614, 288, 987, 782],[590, 124, 707, 464],[543, 114, 636, 429],[330, 117, 577, 654],[015, 145, 396, 964],[011, 036, 185, 422],[101, 022, 217, 262];Glasses:[791, 542, 903, 587],[252, 235, 346, 278],[869, 357, 942, 393];Hat:[196, 147, 348, 305];Satchel:[539, 255, 597, 347];Plate:[366, 720, 495, 818],[406, 631, 571, 735],[567, 619, 726, 707],[627, 736, 758, 827],[493, 798, 642, 903];Desk:[183, 477, 780, 980],[477, 385, 606, 510].') 2023-11-02 22:00:06.269 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:00:06.269 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Flower:[775, 548, 925, 722],[613, 505, 718, 615],[540, 462, 613, 557],[478, 431, 548, 516],[437, 435, 480, 487];Vase:[717, 682, 999, 861],[578, 594, 780, 715],[494, 534, 633, 625],[443, 503, 549, 579],[403, 476, 494, 537];Bench:[275, 472, 388, 545],[460, 776, 973, 1000];Truck:[522, 271, 815, 498];Van:[872, 298, 999, 731];Person:[923, 477, 999, 761],[733, 365, 801, 636],[703, 370, 751, 601],[108, 326, 140, 478],[060, 335, 094, 506];Backpack:[694, 410, 744, 504].') 2023-11-02 22:00:53.715 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:00:53.716 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[280, 489, 326, 552] and track its trajectory.All trajectories in your reply should conform to the Frame t:[xmin,ymin,xmax,ymax] pattern.', 'This is a border terrierFrame1:[280, 489, 326, 552];Frame2:[365, 452, 410, 500];Frame3:[333, 481, 366, 527];Frame4:[411, 435, 450, 489].') 2023-11-02 22:01:57.161 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:01:57.161 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the people sitting on a sailboat shifting in circles?Your trajectory details should strictly align with the TrackiFrame t:[xmin,ymin,xmax,ymax] format in your submission.', 'There is one people sitting on a sailboat shifting in circles.Track1frame:1:[406, 688, 558, 925];frame:2:[515, 696, 636, 871];frame:3:[545, 690, 656, 855].') 2023-11-02 22:02:10.650 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:02:10.651 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect ball,woman,office building and clothing in this image.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'ball:[000, 641, 053, 826],[000, 532, 055, 597];woman:[456, 342, 578, 843];office building:[011, 002, 950, 918];clothing:[060, 403, 190, 889],[470, 411, 584, 799].') 2023-11-02 22:02:14.318 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:02:14.319 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[633, 487, 670, 514] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a border terrierFrame1:[633, 487, 670, 514];Frame2:[508, 510, 531, 541];Frame3:[495, 535, 528, 564];Frame4:[518, 518, 545, 547].') 2023-11-02 22:02:23.124 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:02:23.124 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[448, 760, 781, 996] and track its trajectory.For clarity, represent trajectories using the Frame t:[xmin,ymin,xmax,ymax] format in your response.', 'This is a slothFrame1:[448, 760, 781, 996];Frame2:[492, 769, 744, 999];Frame3:[467, 781, 783, 999];Frame4:[489, 760, 808, 965].') 2023-11-02 22:02:35.588 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:02:35.588 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the flock of avian creatures in flight.Your trajectory details should strictly align with the TrackiFrame t:[xmin,ymin,xmax,ymax] format in your submission.', 'There are 11 flock of avian creatures in flight.Track1frame:1:[059, 425, 205, 529];frame:2:[235, 410, 377, 566];frame:3:[182, 279, 344, 491],Track2frame:1:[978, 486, 999, 495],Track3frame:1:[769, 364, 866, 508],Track4frame:1:[754, 458, 935, 634],Track5frame:1:[578, 238, 668, 379];frame:2:[801, 225, 900, 373];frame:3:[804, 284, 876, 353],Track6frame:1:[487, 253, 661, 420];frame:2:[700, 172, 868, 384];frame:3:[666, 268, 847, 376],Track7frame:1:[338, 353, 481, 564];frame:2:[516, 385, 648, 594];frame:3:[506, 432, 643, 531],Track8frame:1:[460, 318, 582, 440];frame:2:[678, 224, 795, 398];frame:3:[659, 306, 751, 370],Track9frame:1:[336, 288, 448, 392];frame:2:[544, 203, 654, 399];frame:3:[492, 372, 641, 457],Track10frame:1:[283, 314, 434, 455];frame:2:[467, 335, 655, 467];frame:3:[452, 250, 629, 385],Track11frame:1:[324, 447, 393, 479];frame:2:[607, 348, 666, 431];frame:3:[433, 348, 610, 393].') 2023-11-02 22:02:57.705 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:02:57.705 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[330, 323, 625, 687] and track its trajectory.All trajectories in your reply should conform to the Frame t:[xmin,ymin,xmax,ymax] pattern.', 'This is a mountain bikeFrame1:[330, 323, 625, 687];Frame2:[307, 345, 645, 716];Frame3:[309, 512, 716, 991];Frame4:[217, 150, 725, 866].') 2023-11-02 22:03:07.544 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:03:07.545 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'table:[020, 741, 361, 980],[000, 573, 150, 796],[173, 502, 466, 745],[500, 530, 999, 998],[724, 415, 899, 525];man:[006, 317, 177, 761],[280, 287, 410, 784],[328, 612, 574, 998],[442, 271, 553, 761],[515, 216, 626, 637],[587, 226, 633, 497],[606, 180, 758, 521];tree:[327, 000, 650, 316];clothing:[000, 356, 182, 760],[240, 363, 306, 463],[295, 306, 416, 780],[305, 782, 581, 998],[456, 304, 558, 796],[520, 254, 613, 534],[560, 755, 705, 998],[605, 250, 767, 504],[784, 287, 856, 386],[936, 283, 999, 417];building:[000, 000, 999, 410].') 2023-11-02 22:03:21.086 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:03:21.086 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Street Lights:[281, 351, 319, 518],[003, 346, 038, 494];Soccer:[460, 687, 532, 792];Sneakers:[594, 690, 636, 750];Person:[430, 083, 656, 750],[139, 217, 282, 679],[376, 414, 413, 582].') 2023-11-02 22:03:53.138 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:03:53.138 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A large group of athletes [001, 447, 181, 577] [624, 422, 739, 514] [328, 634, 491, 806] [332, 232, 483, 332] [829, 372, 966, 484] [390, 734, 576, 944] [541, 567, 700, 766] swimming in open water.') 2023-11-02 22:03:58.669 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:03:58.670 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the motorized-bicycle shift forward?Ensure the trajectories in your answer follow the TrackiFrame t:[xmin,ymin,xmax,ymax] structure.', 'There is no motorized-bicycle shift forward.') 2023-11-02 22:04:48.779 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:04:48.780 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Scooter:[804, 495, 1000, 944],[825, 447, 1000, 737],[404, 567, 774, 998],[121, 564, 498, 999];Car:[000, 378, 350, 806];Truck:[570, 267, 684, 447];Helmet:[342, 362, 421, 462],[540, 335, 626, 442],[621, 351, 668, 413],[803, 357, 861, 403];Gloves:[885, 502, 924, 548],[685, 664, 734, 737],[449, 685, 501, 756],[315, 673, 377, 734];Hat:[545, 401, 614, 453];Boots:[803, 674, 851, 754];Other Shoes:[689, 721, 738, 788];Person:[804, 359, 946, 840],[802, 361, 917, 752],[621, 352, 701, 545],[449, 336, 743, 1000],[167, 363, 449, 1000];Flag:[701, 188, 828, 459].') 2023-11-02 22:06:41.603 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:06:41.604 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect girl,man and woman in this image.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'girl:[130, 131, 998, 999],[164, 265, 596, 999],[607, 250, 999, 999];man:[046, 241, 320, 945],[730, 162, 999, 620];woman:[125, 151, 999, 999],[579, 258, 999, 999].') 2023-11-02 22:06:42.894 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:06:42.895 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Lamp:[960, 489, 998, 623],[901, 491, 939, 624];Person:[162, 664, 216, 842],[289, 671, 340, 788],[750, 647, 839, 966],[831, 625, 900, 966],[967, 635, 1000, 765],[746, 687, 782, 839],[486, 685, 552, 908].') 2023-11-02 22:06:43.966 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:06:43.966 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Glasses:[688, 485, 883, 610],[888, 454, 1000, 598];Person:[865, 141, 999, 817],[695, 290, 996, 997],[690, 641, 863, 911],[002, 174, 133, 572];Desk:[002, 692, 939, 1000];Cup:[123, 632, 235, 871],[638, 784, 778, 1000],[002, 840, 089, 1000];Plate:[432, 650, 543, 888],[002, 782, 141, 884].') 2023-11-02 22:07:08.130 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:07:08.130 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Car:[667, 223, 1000, 436],[202, 213, 707, 470],[065, 256, 315, 452],[716, 209, 1000, 306];Person:[851, 089, 960, 520],[609, 117, 728, 532],[229, 137, 645, 949],[141, 116, 528, 948],[002, 297, 262, 913];Other Balls:[549, 195, 620, 305].') 2023-11-02 22:07:30.797 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:07:30.797 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'person:[480, 799, 533, 979],[530, 799, 583, 983];tree:[000, 000, 081, 085],[462, 781, 488, 845],[480, 746, 520, 831],[498, 682, 563, 809],[738, 000, 999, 379],[880, 607, 908, 878],[908, 606, 976, 881];umbrella:[320, 762, 415, 825].') 2023-11-02 22:08:11.989 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-02 22:08:11.990 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Bottle:[380, 011, 834, 996];Person:[002, 314, 911, 998].') 2023-11-02 22:08:25.823 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:08:25.824 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Person:[107, 110, 999, 889];Tie:[281, 339, 486, 605];Leather Shoes:[676, 790, 917, 893],[889, 762, 997, 865];Chair:[076, 295, 629, 918],[029, 081, 304, 510],[573, 111, 999, 573];shelf:[019, 043, 792, 406];Trash bin Can:[545, 252, 700, 410].') 2023-11-02 22:08:34.797 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:08:34.797 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Leather Shoes:[013, 084, 097, 159];High Heels:[145, 074, 224, 159],[254, 076, 334, 159],[367, 078, 452, 160],[472, 078, 557, 160],[030, 242, 116, 308],[137, 233, 214, 308],[237, 233, 322, 311],[349, 226, 432, 308],[457, 227, 545, 310],[582, 236, 660, 311],[691, 253, 766, 311],[792, 218, 876, 307],[900, 743, 986, 846],[777, 769, 865, 846],[668, 770, 751, 842],[557, 760, 635, 841],[448, 759, 529, 840],[456, 611, 532, 692],[559, 611, 644, 692],[667, 626, 750, 694],[778, 616, 862, 694],[892, 588, 979, 689],[901, 397, 988, 502],[791, 425, 868, 498],[695, 424, 782, 498],[562, 421, 643, 502],[453, 422, 538, 502],[690, 327, 769, 392],[799, 313, 878, 392],[912, 312, 992, 383].') 2023-11-02 22:09:21.659 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:09:21.659 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[214, 162, 597, 833] and track its trajectory.Your response should highlight trajectories using the established Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a pheasantFrame1:[214, 162, 597, 833];Frame2:[295, 161, 413, 763];Frame3:[216, 163, 382, 690];Frame4:[185, 212, 385, 697].') 2023-11-02 22:09:43.903 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:09:43.903 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'overhead bride [386, 275, 681, 934] and groom [358, 264, 541, 795] first dance at the cordelle rustic wedding') 2023-11-02 22:10:15.932 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:10:15.932 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Picture:[273, 373, 307, 430],[270, 429, 306, 523],[383, 442, 420, 539];Person:[391, 229, 457, 380],[491, 254, 604, 635],[599, 209, 706, 554],[766, 229, 814, 480];Boat:[248, 217, 519, 739],[555, 268, 893, 761];Dog:[326, 507, 395, 608],[369, 527, 433, 621].') 2023-11-02 22:10:19.178 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:10:19.179 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the bicycles moving to left.Maintain consistency in your answer by using the TrackiFrame t:[xmin,ymin,xmax,ymax] format for trajectories.', 'There are 3 bicycles moving to left.Track1frame:1:[320, 390, 449, 640];frame:2:[218, 402, 363, 679];frame:3:[052, 427, 161, 748],Track2frame:1:[417, 422, 572, 769];frame:2:[298, 445, 524, 836];frame:3:[071, 480, 410, 939],Track3frame:1:[502, 525, 669, 848];frame:2:[329, 564, 551, 927];frame:3:[034, 655, 338, 999].') 2023-11-02 22:11:01.598 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:11:01.598 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'man:[171, 366, 256, 692],[000, 345, 999, 903];woman:[000, 385, 097, 740],[047, 414, 169, 750],[095, 416, 114, 496],[205, 380, 331, 747],[290, 343, 790, 829],[738, 376, 950, 892],[825, 348, 889, 481],[924, 339, 999, 548],[928, 346, 950, 425];girl:[204, 382, 338, 752],[288, 389, 432, 765],[343, 372, 509, 760],[576, 341, 765, 799],[921, 334, 999, 553];footwear:[205, 643, 253, 753],[346, 652, 396, 730],[399, 682, 438, 760],[477, 609, 525, 684],[585, 739, 650, 796],[663, 771, 733, 835],[730, 824, 799, 899].') 2023-11-02 22:11:05.361 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:11:05.361 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Machinery Vehicle:[315, 267, 937, 751],[078, 444, 434, 772];Street Lights:[243, 314, 268, 449];Truck:[062, 437, 154, 571];Trash bin Can:[067, 575, 084, 619],[044, 572, 068, 620],[020, 573, 043, 623].') 2023-11-02 22:11:08.280 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:11:08.280 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the green bird moving to a plate?Your trajectory details should strictly align with the TrackiFrame t:[xmin,ymin,xmax,ymax] format in your submission.', 'There is one green bird moving to a plate.Track1frame:1:[603, 270, 742, 462];frame:2:[504, 288, 601, 609];frame:3:[451, 381, 565, 651].') 2023-11-02 22:11:39.262 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:11:39.262 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Mighty fjords [007, 054, 991, 357] rise from the sea in the Westfjords Peninsula, northwestern Iceland. The landscape [007, 348, 991, 925] under the fjords is full of brooks and flowers. poster [001, 013, 993, 960]') 2023-11-02 22:11:47.647 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:11:47.647 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A rectangle [036, 145, 958, 863] of dough is spead with finely chopped chicken in buffalo sauce, and sprinkled with shredded cheese.') 2023-11-02 22:11:49.819 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:11:49.820 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'tree:[000, 255, 998, 546].') 2023-11-02 22:11:52.562 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-02 22:11:52.562 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Surfboard,Glasses,Desk and Chair in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Glasses:[259, 469, 359, 543];Desk:[379, 773, 1000, 1000];Chair:[089, 676, 465, 999].') 2023-11-02 22:11:55.008 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:11:55.009 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'woman:[633, 553, 891, 992],[134, 377, 221, 657],[223, 405, 268, 651],[267, 382, 365, 682],[326, 421, 365, 509],[340, 655, 632, 999],[355, 411, 463, 655],[876, 544, 999, 919];man:[643, 545, 908, 980],[418, 395, 495, 549],[485, 392, 511, 496],[501, 351, 563, 490],[526, 342, 653, 743],[639, 359, 741, 688],[645, 388, 665, 452],[746, 349, 856, 630],[812, 322, 899, 593],[889, 320, 953, 565],[925, 344, 983, 542];clothing:[021, 430, 233, 1000],[140, 383, 999, 735],[329, 828, 632, 999],[604, 705, 886, 999],[871, 645, 999, 925];girl:[141, 378, 178, 510],[204, 398, 305, 701],[253, 411, 281, 481],[266, 380, 375, 683],[320, 415, 358, 492],[335, 649, 637, 999],[355, 415, 471, 666],[390, 400, 430, 484],[643, 561, 905, 999],[870, 522, 999, 919];human face:[095, 385, 145, 506],[436, 706, 525, 877],[749, 598, 813, 732],[958, 565, 999, 671].') 2023-11-02 22:12:15.278 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:12:15.278 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[002, 242, 231, 496] Horse lovers will love learning about [141, 681, 353, 998] [404, 567, 612, 709] their favorite animals with this amazing collection of [001, 001, 994, 997] nonfiction horse books! Perfect for ages 5-12!') 2023-11-02 22:12:52.847 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:12:52.847 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2 and frame3, please tell me what this oneframe:1:[457, 304, 596, 877];frame:2:[443, 302, 581, 879];frame:3:[429, 300, 557, 877] is doing?Briefly articulate your response by taking cues from the provided image frames and trajectory coordinates.', 'This is a/an people are walking') 2023-11-02 22:13:45.413 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:13:45.413 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A portion of [511, 578, 702, 919] the border wall is under construction in Guadalupe Canyon, Arizona, which is a wildlife corridor for Mexican gray wolves and endangered jaguars.') 2023-11-02 22:14:28.914 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:14:28.915 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Boat in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Boat:[830, 721, 999, 957],[875, 754, 940, 858],[683, 732, 791, 911],[476, 681, 551, 777],[346, 782, 497, 891],[194, 819, 323, 879],[339, 717, 387, 762],[123, 866, 268, 1000],[263, 905, 375, 980],[052, 719, 093, 759].') 2023-11-02 22:14:30.511 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:14:30.512 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[944, 310, 987, 504] [003, 077, 723, 811] The Metricup volunteer fire brigade were one of many to attend the Carbunup escaped burn.') 2023-11-02 22:14:50.096 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:14:50.097 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'An illustration of people video conferencing on a computer screen [328, 463, 753, 696] with speech bubbles above it and a house [072, 287, 427, 514] behind it.') 2023-11-02 22:15:37.352 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:15:37.352 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track catFrame1:[440, 377, 714, 704],catFrame1:[359, 310, 460, 537],catFrame1:[818, 022, 1000, 185] in this video clip.Do not deviate from the established classFrame t:[xmin,ymin,xmax,ymax] format in your answer.', 'catFrame1:[440, 377, 714, 704];Frame2:[584, 300, 860, 647],catFrame1:[359, 310, 460, 537];Frame2:[453, 225, 592, 483],catFrame1:[818, 022, 1000, 185];Frame2:[912, 002, 1000, 108].') 2023-11-02 22:16:07.902 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-02 22:16:07.903 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'tent:[581, 200, 849, 395];tree:[000, 000, 540, 506],[490, 021, 999, 325].') 2023-11-02 22:16:26.773 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:16:26.773 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[485, 159, 624, 833],blanketFrame1:[543, 243, 609, 534],towelFrame1:[487, 287, 591, 497] in this video clip.Use only the categoryFrame t:[xmin,ymin,xmax,ymax] format to structure your answer.', 'personFrame1:[485, 159, 624, 833];Frame2:[564, 165, 682, 905],blanketFrame1:[543, 243, 609, 534];Frame2:[627, 255, 689, 573],towelFrame1:[487, 287, 591, 497];Frame2:[611, 316, 675, 575].') 2023-11-02 22:16:28.709 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:16:28.710 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Open-book spread [023, 015, 976, 981] of Imaginary Explosions, page 85. Above black-and-white graphic of mountain topography, text reads, “The event is a narrative (or is it)”') 2023-11-02 22:16:47.740 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-02 22:16:47.740 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect man,human face,woman,girl and apple in this image.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'man:[398, 680, 999, 999];human face:[018, 380, 092, 498],[137, 298, 195, 411],[206, 411, 288, 546],[363, 437, 428, 537],[420, 539, 478, 642],[475, 687, 585, 840],[510, 577, 575, 666],[578, 396, 645, 495],[713, 451, 778, 570],[840, 427, 926, 555];woman:[555, 379, 716, 733],[000, 354, 157, 940],[025, 393, 368, 999],[073, 270, 338, 605],[328, 410, 488, 856],[830, 389, 999, 830];girl:[075, 265, 341, 912],[328, 402, 489, 860],[341, 510, 511, 995],[490, 560, 702, 718],[557, 339, 728, 733],[713, 411, 855, 830].') 2023-11-02 22:16:59.778 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:16:59.778 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Bowl:[008, 494, 163, 571],[152, 506, 222, 562],[220, 495, 255, 518],[910, 532, 999, 650],[802, 505, 941, 585],[751, 521, 904, 611];bucket:[251, 718, 311, 896];Tong:[836, 823, 951, 986];Lamp:[669, 001, 763, 137],[454, 232, 494, 288],[499, 164, 543, 223],[168, 133, 226, 298],[201, 232, 243, 346],[219, 332, 256, 373],[431, 308, 509, 405];shelf:[512, 278, 683, 492],[460, 411, 507, 494];Storage box:[566, 337, 604, 398],[595, 332, 684, 387],[596, 428, 661, 462],[639, 443, 686, 474],[507, 477, 540, 505],[486, 503, 522, 539],[452, 653, 485, 757];Fan:[026, 112, 148, 232],[307, 318, 341, 376],[094, 223, 171, 314];Stool:[377, 620, 423, 730];Clock:[351, 332, 389, 388];Power outlet:[791, 389, 820, 416];Carpet:[223, 866, 541, 1000];Trash bin Can:[336, 561, 355, 601],[531, 301, 575, 378];Blackboard:[205, 379, 268, 434];Scale:[660, 520, 762, 622];Pliers:[837, 823, 956, 991];Hat:[397, 439, 439, 478],[425, 482, 455, 523];Boots:[408, 654, 434, 707],[411, 692, 431, 725];Person:[382, 436, 465, 726],[422, 479, 516, 565].') 2023-11-02 22:17:54.970 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:17:54.970 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Decorative element. Colorful flyer [049, 039, 946, 963] with multiple images of aquaculture describes content of Great Lakes Aquaculture Day. All information on flyer is included in text of article.') 2023-11-02 22:18:02.823 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 2 samples! 2023-11-02 22:18:02.824 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Mechanicville to Malta Brewery Ride') 2023-11-02 22:18:07.625 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:18:07.625 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'couch:[473, 088, 918, 960];coffee table:[106, 381, 401, 1000].') 2023-11-02 22:18:29.836 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:18:29.836 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Chair:[071, 525, 396, 998],[431, 534, 769, 998];Frame:[413, 126, 500, 315];Lamp:[001, 013, 062, 099];shelf:[001, 067, 212, 300];Desk:[759, 554, 999, 1000];Person:[476, 330, 748, 987],[257, 151, 534, 908],[093, 363, 398, 1000],[620, 277, 699, 475];Satchel:[557, 792, 742, 956];Coffee Machine:[226, 242, 306, 345].') 2023-11-02 22:18:45.950 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:18:45.950 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Desk:[001, 720, 863, 999],[123, 444, 550, 793],[173, 380, 468, 449];Cabinet:[907, 044, 1000, 526],[459, 217, 507, 377];Person:[332, 343, 415, 471],[193, 283, 285, 483],[245, 285, 273, 336],[130, 403, 187, 468],[001, 261, 059, 438],[001, 337, 164, 619],[001, 563, 039, 818],[178, 426, 526, 913],[391, 001, 896, 1000];Book:[231, 561, 727, 981],[167, 491, 259, 562],[126, 507, 179, 578];Hat:[001, 264, 042, 346];Pen:[221, 760, 260, 905];Moniter:[147, 316, 202, 397],[357, 306, 407, 370].') 2023-11-02 22:19:03.955 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:19:03.955 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'One of [608, 313, 965, 698] Momina’s daughters is preparing naan. Conditions are simple; this family is officially categorized as one in a difficult life situation.') 2023-11-02 22:19:19.234 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:19:19.234 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[208, 071, 665, 797] and track its trajectory.For clarity, represent trajectories using the Frame t:[xmin,ymin,xmax,ymax] format in your response.', 'This is a black-necked cobraFrame1:[208, 071, 665, 797];Frame2:[236, 037, 715, 745];Frame3:[310, 067, 766, 617];Frame4:[292, 000, 778, 479].') 2023-11-02 22:19:45.056 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:19:45.057 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Chair:[739, 325, 856, 381];Couch:[669, 303, 880, 364];Lamp:[913, 220, 955, 327],[617, 228, 662, 322];Desk:[829, 393, 1000, 569];Tea pot:[394, 351, 443, 430];Refrigerator:[001, 152, 231, 846];Oven:[302, 486, 476, 797];Induction Cooker:[306, 414, 516, 514].') 2023-11-02 22:19:46.837 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:19:46.838 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Romantic couple [363, 390, 494, 905] [466, 323, 656, 903] walks in the apple orchard [004, 007, 993, 998] in summer and holding hands [450, 480, 498, 538].') 2023-11-02 22:19:47.436 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:19:47.437 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[836, 351, 999, 623],carFrame1:[354, 346, 449, 480],carFrame1:[497, 284, 623, 491],carFrame1:[000, 376, 100, 597],carFrame1:[191, 378, 388, 586] in this video clip.The prescribed format for your answer is classFrame t:[xmin,ymin,xmax,ymax]. Please follow it closely.', 'carFrame1[836, 351, 999, 623];Frame2:No object,carFrame1:[354, 346, 449, 480];Frame2:[286, 332, 423, 511],carFrame1:[497, 284, 623, 491];Frame2:[497, 273, 628, 486],carFrame1[000, 376, 100, 597];Frame2:No object,carFrame1:[191, 378, 388, 586];Frame2:[000, 377, 314, 711].') 2023-11-02 22:19:57.993 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:19:57.993 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[310, 292, 673, 447] and track its trajectory.For the trajectories included in the answer, please use the format Frame t:[xmin,ymin,xmax,ymax].', 'This is a droneFrame1:[310, 292, 673, 447];Frame2:[317, 392, 690, 549];Frame3:[319, 394, 695, 555];Frame4:[322, 387, 697, 554].') 2023-11-02 22:20:27.815 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:20:27.816 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'SUV:[001, 690, 058, 806],[018, 687, 200, 864],[156, 586, 225, 681],[229, 672, 423, 818],[442, 660, 612, 784],[624, 650, 758, 756],[764, 645, 876, 740],[879, 643, 979, 730],[969, 633, 1000, 716];Car:[041, 671, 151, 728],[151, 658, 262, 755],[265, 642, 412, 716],[351, 626, 469, 702],[949, 563, 998, 621],[669, 567, 763, 640],[616, 583, 725, 654],[509, 571, 600, 624],[540, 593, 648, 669],[485, 603, 608, 685],[423, 617, 553, 677],[763, 565, 833, 634],[830, 562, 870, 626].') 2023-11-02 22:20:46.833 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:20:46.833 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'person:[114, 478, 778, 754],[518, 580, 553, 754];musical instrument:[043, 571, 071, 644],[085, 569, 114, 642],[140, 523, 679, 690],[446, 511, 481, 596],[606, 507, 661, 586],[693, 536, 723, 604],[738, 549, 770, 619].') 2023-11-02 22:20:55.811 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:20:55.811 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'building:[000, 128, 999, 998];window:[000, 958, 143, 996],[000, 572, 156, 923],[000, 281, 171, 510],[854, 268, 999, 513],[869, 569, 999, 923],[888, 947, 999, 998].') 2023-11-02 22:21:45.229 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:21:45.229 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Ring and Glasses in this image.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Ring:[405, 305, 414, 320],[099, 777, 108, 797];Glasses:[766, 180, 839, 215],[551, 227, 626, 260],[439, 246, 489, 277].') 2023-11-02 22:22:04.257 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:22:04.257 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Watchtower [655, 218, 916, 769] near the Iranian border, eastern Turkey in this undated file photo. (Shutter') 2023-11-02 22:22:13.563 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:22:13.563 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Guitar,Backpack and Blackboard/Whiteboard in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Guitar:[392, 692, 977, 952];Backpack:[764, 516, 887, 689];Blackboard:[326, 113, 677, 416].') 2023-11-02 22:23:00.705 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:23:00.705 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Timber decking use on a pedestrian bridge [085, 008, 994, 994] over the river [460, 401, 997, 997]') 2023-11-02 22:23:01.626 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:23:01.626 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'clothing:[572, 366, 998, 998],[000, 226, 080, 522],[036, 303, 291, 998],[286, 436, 614, 998];man:[503, 049, 999, 998];human face:[650, 292, 843, 578].') 2023-11-02 22:23:24.954 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:23:24.954 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Picture:[288, 002, 511, 127];Dinning Table:[081, 778, 787, 999];Chair:[305, 648, 636, 778];Other Fish:[374, 808, 449, 951];Person:[659, 275, 1000, 942],[261, 222, 677, 813];Glasses:[444, 337, 575, 418];Tie:[460, 501, 528, 720];Pie:[131, 788, 278, 857];Basin:[366, 708, 536, 835],[002, 832, 146, 1000],[681, 900, 916, 1000];Chopsticks:[272, 770, 361, 825];Plate:[484, 868, 729, 998],[261, 823, 521, 956],[105, 779, 291, 866],[126, 939, 426, 1000].') 2023-11-02 22:23:39.542 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-02 22:23:39.542 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Bear in this image.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Bear:[365, 307, 812, 731].') 2023-11-02 22:23:39.731 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:23:39.732 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A cardboard cut out of a percentage symbol [013, 199, 284, 886] and a house [260, 218, 580, 892] with the Sanctuary Homes logo on it on a wooden table [002, 650, 995, 995]') 2023-11-02 22:23:44.056 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:23:44.057 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect houseplant,dress,woman and suit in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'dress:[106, 454, 252, 988],[243, 409, 376, 915],[611, 406, 735, 999];woman:[081, 237, 265, 1000],[241, 256, 380, 999],[361, 288, 488, 999],[475, 254, 618, 999],[611, 230, 773, 999];suit:[757, 300, 913, 999].') 2023-11-02 22:23:50.855 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:23:50.855 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Lamp:[200, 001, 657, 358],[751, 328, 841, 414];Mirror:[614, 290, 873, 625];Flower:[685, 489, 805, 609];shelf:[639, 625, 932, 906],[310, 406, 412, 525],[270, 418, 311, 500],[240, 422, 298, 519],[205, 424, 240, 518],[122, 421, 203, 517],[001, 588, 127, 723],[722, 389, 823, 569];Chair:[332, 578, 404, 699],[270, 585, 331, 679],[189, 600, 313, 734],[135, 699, 459, 1000],[605, 708, 894, 999],[522, 603, 649, 770];Towel:[292, 678, 341, 725],[363, 742, 421, 802],[473, 702, 532, 753];Carpet:[053, 851, 752, 995];Wine Glass:[600, 720, 635, 821],[476, 714, 506, 809],[411, 687, 441, 769];Plate:[615, 826, 706, 879];Fork:[593, 850, 668, 901].') 2023-11-02 22:25:11.527 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:25:11.527 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Grabbing late-night tacos and [257, 573, 507, 732] burritos at a place in San Jose. It's always fun to take [422, 149, 796, 995] [006, 238, 479, 996] the girls to new places.") 2023-11-02 22:25:53.718 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:25:53.718 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Surrealist sculpture and [082, 136, 914, 902] functional occasional table is shown from above at a high angle; the work features [289, 337, 722, 903] realistic cast bronze crane legs holding [080, 139, 916, 354] a round wooden, gold-plated tabletop.') 2023-11-02 22:26:19.692 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:26:19.692 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'boat:[017, 282, 1000, 796];person:[000, 000, 181, 753],[295, 000, 429, 333],[486, 000, 535, 098];clothing:[000, 000, 199, 759],[298, 000, 426, 338],[485, 000, 533, 101],[713, 000, 748, 070],[836, 166, 999, 450].') 2023-11-02 22:26:50.133 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:26:50.134 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'thin strips of carrot, cucumber, avocado, and tofu are being placed on [152, 547, 779, 947] a nori sheet with rice') 2023-11-02 22:26:53.619 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:26:53.620 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the the left one of the two parrots standing on the hand in this video clip?As you detail the trajectories in your reply, the TrackiFrame t:[xmin,ymin,xmax,ymax] format is the preferred structure.', 'There is one the left one of the two parrots standing on the hand.Track1frame:1:[022, 408, 709, 999];frame:2:[000, 437, 718, 999];frame:3:[000, 462, 716, 999].') 2023-11-02 22:27:05.340 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:27:05.340 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect beer,suit,bottle and person in this image.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'beer:[255, 059, 634, 349];suit:[574, 000, 977, 286];bottle:[008, 009, 126, 190],[095, 000, 224, 176],[180, 057, 262, 181],[285, 059, 362, 281],[333, 059, 395, 284],[375, 067, 454, 298],[465, 076, 529, 310],[503, 086, 621, 330];person:[000, 000, 493, 334],[590, 000, 999, 298].') 2023-11-02 22:27:42.043 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:27:42.044 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Compact cars [163, 292, 861, 716] such as the Nissan Sentra finished March with an industry-leading value-retention rate of 2.5%, according to the latest numbers from Black Book. Photo courtesy [004, 007, 989, 988] Nissan USA') 2023-11-02 22:28:24.385 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:28:24.385 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[376, 341, 642, 760] and track its trajectory.Any trajectory details should be arranged according to the Frame t:[xmin,ymin,xmax,ymax] format for clarity.', 'This is a grizzlyFrame1:[376, 341, 642, 760];Frame2:[390, 334, 675, 778];Frame3:[447, 170, 640, 767];Frame4:[451, 257, 683, 746].') 2023-11-02 22:28:34.979 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:28:34.980 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Toiletry:[105, 647, 150, 738],[065, 661, 098, 739],[019, 644, 072, 736],[073, 631, 107, 736];Bathtub:[001, 644, 999, 999];Towel:[261, 797, 501, 888].') 2023-11-02 22:28:56.230 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:28:56.230 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[689, 318, 965, 1000],[519, 871, 602, 1000],[557, 226, 701, 999],[529, 291, 616, 486],[065, 191, 270, 997],[892, 053, 981, 346];Sneakers:[196, 904, 257, 996],[132, 867, 192, 947];Hat:[527, 869, 603, 987];Microphone:[612, 457, 688, 541],[104, 305, 159, 415];Tripod:[417, 620, 539, 925];Speaker:[233, 469, 361, 708],[000, 806, 070, 994],[641, 631, 741, 874],[839, 674, 1000, 1000];Guitar:[091, 320, 365, 536],[538, 353, 616, 561],[721, 481, 781, 811],[885, 052, 998, 353];Cymbal:[372, 392, 465, 472];Drum:[370, 581, 531, 824],[488, 519, 550, 582],[408, 497, 496, 609].') 2023-11-02 22:29:14.364 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:29:14.364 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'window:[018, 617, 099, 858],[175, 908, 237, 999],[181, 617, 243, 778],[418, 549, 534, 738],[441, 853, 520, 999],[570, 551, 669, 792],[607, 859, 666, 999],[711, 609, 741, 807],[740, 926, 771, 999].') 2023-11-02 22:29:19.142 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:29:19.143 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[542, 514, 618, 617],carFrame1:[000, 491, 169, 646],carFrame1:[682, 496, 999, 779],carFrame1:[515, 507, 569, 588],carFrame1:[165, 501, 225, 601] in this video clip.Keep your response consistent with the classFrame t:[xmin,ymin,xmax,ymax] format.', 'carFrame1:[542, 514, 618, 617];Frame2:[594, 498, 710, 633],carFrame1:[000, 491, 169, 646];Frame2:[000, 539, 068, 661],carFrame1[682, 496, 999, 779];Frame2:No object,carFrame1:[515, 507, 569, 588];Frame2:[552, 492, 627, 601],carFrame1:[165, 501, 225, 601];Frame2:[064, 482, 180, 642].') 2023-11-02 22:29:35.581 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:29:35.581 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the grey rabbit running left?As you detail the trajectories in your reply, the TrackiFrame t:[xmin,ymin,xmax,ymax] format is the preferred structure.', 'There is no grey rabbit running left.') 2023-11-02 22:29:55.364 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:29:55.364 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[437, 130, 738, 615] [251, 186, 468, 724] a family next to [591, 267, 694, 542] [462, 096, 838, 434] [000, 001, 284, 405] [060, 542, 210, 846] sheep in a painting') 2023-11-02 22:30:55.827 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:30:55.828 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'vídeos de stock, filmes e b-roll de shows exterior shots [257, 157, 638, 994] uk labour party leader jeremy corbyn walking out of polling station after voting in the uk general election 2017 on 8th june... - 2017') 2023-11-02 22:31:14.891 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:31:14.891 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Wild Bird:[192, 557, 209, 568],[064, 563, 071, 568],[147, 327, 172, 344],[842, 657, 846, 662],[859, 657, 870, 662],[379, 607, 398, 622];Boat:[130, 216, 581, 847];Street Lights:[770, 514, 779, 538],[905, 508, 916, 533];Flag:[287, 246, 315, 272],[310, 296, 338, 318];Person:[639, 611, 642, 621],[022, 639, 026, 647];Lifesaver:[413, 591, 423, 620],[293, 599, 312, 627].') 2023-11-02 22:31:54.701 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:31:54.702 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Cake:[488, 625, 685, 835],[237, 611, 426, 807];Glasses:[502, 130, 621, 178];Chair:[344, 398, 406, 542];Candle:[360, 540, 398, 635];Desk:[002, 510, 999, 999];Person:[132, 042, 397, 534],[298, 049, 682, 579],[529, 076, 973, 799];Watch:[783, 632, 816, 684];Bracelet:[742, 638, 814, 713];Plate:[002, 656, 086, 811];Spoon:[027, 641, 075, 726],[002, 621, 060, 731];Cup:[051, 702, 123, 829],[118, 502, 173, 596];Pen:[130, 583, 198, 651].') 2023-11-02 22:32:18.220 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:32:18.220 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Couch:[245, 028, 999, 999].') 2023-11-02 22:33:08.768 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:33:08.768 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'shelf:[002, 585, 216, 994],[678, 589, 964, 997],[422, 528, 482, 846],[482, 530, 552, 854],[002, 030, 161, 378],[161, 042, 364, 234],[324, 059, 436, 348],[436, 060, 552, 345],[552, 030, 791, 364];Lamp:[803, 002, 999, 066];Sink:[713, 545, 899, 609],[838, 585, 1000, 657];Gas stove:[144, 503, 420, 583];Oven:[202, 548, 423, 934];Extractor:[155, 230, 370, 302].') 2023-11-02 22:33:15.455 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:33:15.455 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[583, 427, 615, 493] and track its trajectory.For the trajectories included in the answer, please use the format Frame t:[xmin,ymin,xmax,ymax].', 'This is a border terrierFrame1:[583, 427, 615, 493];Frame2:[660, 483, 700, 525];Frame3:[561, 502, 598, 545];Frame4:[551, 529, 596, 570].') 2023-11-02 22:33:56.356 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-02 22:33:56.357 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Chopsticks:[104, 414, 340, 999];Plate:[026, 192, 995, 894],[577, 151, 942, 268];Dinning Table:[002, 115, 999, 999];Cabinet:[101, 002, 535, 126],[845, 002, 977, 114].') 2023-11-02 22:34:04.063 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:34:04.063 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Chair:[001, 732, 298, 999],[353, 634, 571, 986],[511, 597, 655, 780],[001, 569, 072, 748],[001, 543, 186, 721],[095, 539, 271, 700],[188, 536, 349, 669],[254, 533, 342, 598],[892, 673, 999, 999],[889, 631, 999, 876],[886, 605, 999, 717],[753, 583, 835, 702],[628, 544, 706, 638];Lamp:[015, 138, 254, 290],[145, 196, 348, 323],[242, 239, 412, 347],[312, 280, 466, 369];Person:[740, 521, 788, 570];Desk:[083, 596, 478, 973],[329, 571, 560, 841].') 2023-11-02 22:34:21.964 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:34:21.965 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[000, 000, 784, 998] and track its trajectory.For the trajectories included in the answer, please use the format Frame t:[xmin,ymin,xmax,ymax].', 'This is a horseless carriageFrame1:[000, 000, 784, 998];Frame2:[027, 143, 558, 877];Frame3:[158, 237, 524, 741];Frame4:[297, 262, 586, 657].') 2023-11-02 22:34:30.589 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 2 samples! 2023-11-02 22:34:30.590 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Help, I'm trapped in a females body [119, 431, 764, 912], and the DMs are including complimentary Slip n' slides! 😫") 2023-11-02 22:35:37.476 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:35:37.477 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Slippers and Cow in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Slippers:[331, 478, 404, 723];Cow:[343, 391, 894, 947].') 2023-11-02 22:36:07.415 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:36:07.415 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Knife,Fork and Picture/Frame in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Knife:[710, 805, 794, 851],[220, 781, 373, 810];Fork:[761, 813, 801, 845],[257, 750, 348, 797];Picture:[490, 463, 518, 505].') 2023-11-02 22:36:18.463 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:36:18.463 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[351, 375, 455, 578] and track its trajectory.If you're including trajectory details in your reply, the Frame t:[xmin,ymin,xmax,ymax] format is imperative.", 'This is a urialFrame1:[351, 375, 455, 578];Frame2:[308, 374, 427, 574];Frame3:[303, 374, 424, 574];Frame4:[293, 374, 407, 565].') 2023-11-02 22:36:20.784 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:36:20.784 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'students [766, 428, 846, 745] [269, 415, 374, 791] [359, 383, 448, 773] [055, 406, 171, 844] [700, 428, 782, 753] [834, 443, 919, 752] [164, 407, 278, 816] [628, 421, 709, 756] standing in front of a CTE Works sign holding certificates [714, 503, 767, 559] [301, 480, 359, 543] [843, 487, 899, 542] [643, 501, 698, 562] [780, 495, 835, 552] [386, 494, 441, 555] [081, 536, 153, 612] [203, 521, 266, 588]') 2023-11-02 22:36:29.670 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:36:29.671 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the the tiger running from afar to the vicinity of the water pool, located on the right-hand side..To ensure accuracy, apply the TrackiFrame t:[xmin,ymin,xmax,ymax] template for every trajectory in your response.', 'There is one the tiger running from afar to the vicinity of the water pool, located on the right-hand side..Track1frame:1:[434, 051, 736, 254];frame:2:[440, 055, 748, 252];frame:3:[486, 091, 822, 263].') 2023-11-02 22:36:30.609 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:36:30.609 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Bench:[878, 466, 999, 711];Car:[000, 163, 130, 479],[247, 186, 464, 279],[546, 191, 759, 257],[934, 195, 1000, 265],[854, 184, 979, 265],[778, 180, 953, 263],[696, 176, 804, 229],[538, 169, 617, 217],[563, 141, 601, 170],[536, 138, 567, 171],[510, 149, 563, 193],[449, 152, 539, 224],[434, 143, 484, 195],[410, 135, 443, 166],[344, 137, 391, 193],[343, 133, 378, 179],[306, 123, 339, 169],[272, 124, 295, 167],[235, 121, 282, 174],[221, 128, 249, 193],[082, 111, 135, 152],[034, 111, 115, 161],[000, 114, 100, 170];SUV:[578, 153, 660, 199],[611, 146, 650, 178],[358, 135, 443, 223],[112, 105, 238, 211];Van:[394, 115, 431, 143];Street Lights:[896, 092, 925, 184].') 2023-11-02 22:36:47.368 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:36:47.369 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[353, 413, 724, 676] and track its trajectory.Your response should highlight trajectories using the established Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a leopardFrame1:[353, 413, 724, 676];Frame2:[324, 390, 596, 672];Frame3:[397, 440, 689, 687];Frame4:[410, 436, 642, 731].') 2023-11-02 22:36:48.183 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:36:48.184 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[418, 627, 505, 830],[522, 645, 604, 821],[437, 580, 507, 749];Cabinet:[529, 643, 700, 752],[494, 515, 625, 591];Carpet:[322, 775, 505, 823].') 2023-11-02 22:37:07.318 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:37:07.319 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given frame1: and frame2:,track carFrame1:[739, 823, 1000, 999],carFrame1:[390, 765, 424, 828],carFrame1:[599, 798, 798, 1000],carFrame1:[000, 771, 070, 902],carFrame1:[284, 755, 322, 819],carFrame1:[453, 778, 517, 874],carFrame1:[407, 765, 455, 840] in this video clip.It's imperative that you stick to the categoryFrame t:[xmin,ymin,xmax,ymax] format for your answer.", 'carFrame1[739, 823, 1000, 999];Frame2:No object,carFrame1:[390, 765, 424, 828];Frame2:[400, 758, 448, 830],carFrame1:[599, 798, 798, 1000];Frame2:[773, 818, 999, 1000],carFrame1[000, 771, 070, 902];Frame2:No object,carFrame1:[284, 755, 322, 819];Frame2:[283, 751, 320, 813],carFrame1:[453, 778, 517, 874];Frame2:[487, 778, 591, 899],carFrame1:[407, 765, 455, 840];Frame2:[421, 759, 485, 847].') 2023-11-02 22:37:16.388 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:37:16.389 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'tree:[000, 000, 976, 1000];flower:[387, 377, 591, 603],[000, 844, 212, 999],[263, 418, 397, 544],[310, 592, 440, 750],[465, 000, 581, 201],[473, 689, 574, 815],[627, 084, 874, 194].') 2023-11-02 22:37:30.246 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:37:30.247 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'New special agents and [408, 473, 760, 997] [107, 613, 366, 997] [763, 551, 997, 997] intelligence analysts view an exhibit at the 9/11 Memorial & Museum in New York City on Saturday, March 9, 2019.') 2023-11-02 22:37:34.275 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:37:34.276 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Desk:[001, 504, 267, 785],[251, 428, 406, 675];Cabinet:[526, 298, 824, 484];Storage box:[627, 355, 709, 449],[668, 349, 735, 407];Chair:[714, 629, 875, 999],[650, 558, 795, 757],[532, 554, 679, 762];Person:[135, 745, 337, 999],[493, 742, 720, 1000],[242, 058, 416, 917],[407, 350, 661, 835],[580, 363, 838, 760],[669, 359, 966, 981],[761, 545, 1000, 998],[774, 441, 1000, 931];Leather Shoes:[374, 823, 411, 865];Boots:[415, 642, 498, 757],[479, 728, 553, 835];Sneakers:[705, 907, 784, 975],[609, 671, 671, 732];Satchel:[001, 821, 121, 997].') 2023-11-02 22:38:06.302 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:38:06.303 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Potted Plant:[820, 609, 870, 694],[731, 625, 835, 762],[620, 620, 676, 697],[340, 618, 387, 698],[195, 611, 288, 762],[143, 612, 192, 694],[021, 623, 056, 674].') 2023-11-02 22:38:07.288 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:38:07.288 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'cabinetry:[000, 670, 190, 999],[000, 000, 515, 443],[263, 542, 500, 981];oven:[096, 181, 296, 425],[210, 701, 398, 996];gas stove:[068, 615, 366, 717];refrigerator:[363, 325, 594, 914];countertop:[000, 546, 492, 999],[543, 600, 999, 861];microwave oven:[194, 722, 401, 999].') 2023-11-02 22:38:26.529 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:38:26.529 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Bed:[191, 544, 913, 1000];Cabinet:[001, 002, 222, 787];Carpet:[001, 781, 318, 999];Handbag:[017, 775, 167, 866].') 2023-11-02 22:38:40.416 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-02 22:38:40.416 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'fixed-wing aircraft:[000, 209, 999, 827],[450, 000, 964, 272].') 2023-11-02 22:39:58.886 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-02 22:39:58.887 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[564, 453, 763, 992],[731, 382, 984, 998];man:[060, 313, 360, 975],[000, 541, 055, 998],[000, 402, 073, 892],[034, 392, 085, 493],[045, 413, 137, 773],[334, 233, 574, 998];suit:[000, 545, 027, 871],[000, 492, 064, 972],[000, 461, 079, 891],[065, 450, 130, 821],[339, 433, 569, 998];girl:[555, 447, 753, 998];human face:[006, 398, 039, 472],[065, 418, 086, 488],[100, 443, 147, 527],[190, 348, 290, 527],[363, 282, 455, 475],[590, 478, 679, 655],[741, 422, 840, 605].') 2023-11-02 22:40:16.152 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:40:16.152 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'person:[732, 505, 820, 765],[000, 462, 159, 853],[169, 519, 227, 582],[171, 446, 380, 829],[272, 456, 488, 855],[490, 488, 563, 766],[523, 484, 620, 763],[629, 498, 706, 753],[859, 493, 998, 716],[940, 510, 993, 706];chair:[522, 545, 690, 793],[000, 646, 056, 950],[066, 548, 131, 847],[092, 618, 156, 770],[144, 635, 207, 761],[216, 550, 376, 835],[314, 537, 517, 845],[416, 615, 561, 785],[626, 566, 725, 750],[739, 555, 850, 766],[756, 556, 865, 729],[846, 566, 969, 740],[872, 569, 935, 710];table:[022, 574, 405, 895],[637, 573, 826, 757],[913, 560, 999, 718].') 2023-11-02 22:40:27.752 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:40:27.753 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[174, 471, 254, 549],carFrame1:[544, 476, 580, 562],carFrame1:[313, 466, 380, 542],carFrame1:[000, 418, 121, 590],carFrame1:[239, 473, 282, 534],carFrame1:[602, 461, 732, 617],carFrame1:[670, 476, 881, 689] in this video clip.Ensure you use the exact format categoryFrame t:[xmin,ymin,xmax,ymax] in your response.', 'carFrame1:[174, 471, 254, 549];Frame2:[107, 464, 212, 565],carFrame1:[544, 476, 580, 562];Frame2:[571, 475, 627, 582],carFrame1:[313, 466, 380, 542];Frame2:[228, 456, 341, 575],carFrame1[000, 418, 121, 590];Frame2:No object,carFrame1:[239, 473, 282, 534];Frame2:[199, 472, 246, 545],carFrame1:[602, 461, 732, 617];Frame2:[675, 448, 932, 695],carFrame1:[670, 476, 881, 689];Frame2:[849, 504, 1000, 763].') 2023-11-02 22:40:42.282 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:40:42.282 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[637, 388, 700, 598],personFrame1:[455, 711, 514, 998],personFrame1:[750, 344, 803, 537],personFrame1:[319, 636, 362, 875],personFrame1:[317, 483, 375, 722],personFrame1:[455, 504, 512, 706],personFrame1:[582, 565, 648, 759] in this video clip.The prescribed format for your answer is classFrame t:[xmin,ymin,xmax,ymax]. Please follow it closely.', 'personFrame1:[637, 388, 700, 598];Frame2:[643, 377, 687, 590],personFrame1:[455, 711, 514, 998];Frame2:[485, 726, 564, 998],personFrame1:[750, 344, 803, 537];Frame2:[740, 330, 801, 519],personFrame1:[319, 636, 362, 875];Frame2:[354, 641, 410, 873],personFrame1:[317, 483, 375, 722];Frame2:[296, 463, 350, 673],personFrame1:[455, 504, 512, 706];Frame2:[436, 518, 518, 720],personFrame1:[582, 565, 648, 759];Frame2:[558, 590, 636, 787].') 2023-11-02 22:40:51.781 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:40:51.781 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect man,human face,girl and glasses in this image.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'man:[182, 000, 417, 949],[365, 000, 549, 325],[848, 400, 999, 765];human face:[140, 118, 228, 392],[229, 000, 298, 066],[405, 000, 457, 083],[865, 415, 908, 505];girl:[000, 070, 339, 999],[435, 245, 738, 994],[486, 437, 972, 999];glasses:[116, 207, 218, 262].') 2023-11-02 22:40:59.785 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:40:59.785 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[543, 002, 922, 451] [226, 576, 537, 998] Promotion Concept promo bags in [003, 004, 999, 996] a tree') 2023-11-02 22:41:09.479 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:41:09.480 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[548, 136, 919, 999],[945, 521, 999, 577];Power outlet:[456, 426, 487, 512];Potted Plant:[190, 789, 311, 974];Storage box:[108, 570, 229, 768],[141, 740, 235, 878];Sink:[876, 567, 953, 675];Bottle:[002, 780, 064, 1000],[225, 654, 281, 776],[310, 561, 350, 785];Cup:[408, 666, 460, 788],[472, 535, 520, 673];Cutting:[913, 532, 992, 590];Oven:[927, 359, 1000, 494].') 2023-11-02 22:41:18.202 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:41:18.202 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Hockey Stick:[417, 632, 778, 720],[129, 420, 351, 508],[000, 460, 087, 529];Person:[349, 449, 602, 846],[470, 199, 652, 721],[296, 160, 500, 617];Bottle:[740, 311, 786, 369];Helmet:[541, 197, 593, 304],[541, 449, 594, 511],[414, 162, 461, 223];Gloves:[383, 576, 429, 664],[536, 632, 596, 702];Sneakers:[350, 721, 413, 815],[405, 794, 470, 845],[296, 551, 353, 615],[300, 492, 346, 573].') 2023-11-02 22:41:27.558 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:41:27.558 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Cup:[961, 488, 978, 543];Plate:[965, 529, 993, 555],[935, 395, 982, 421];Person:[000, 420, 200, 1000],[121, 349, 148, 491],[009, 395, 026, 434],[025, 352, 055, 451],[031, 342, 109, 531],[087, 354, 115, 492],[314, 293, 391, 420],[559, 267, 641, 374];Sandals:[970, 705, 999, 735];Chair:[863, 431, 928, 547],[897, 425, 952, 525];Ambulance:[133, 021, 885, 916].') 2023-11-02 22:41:36.092 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:41:36.093 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Carpet:[002, 934, 286, 999],[757, 934, 998, 1000];Cabinet:[783, 431, 999, 912],[715, 484, 788, 769],[586, 513, 714, 751],[341, 592, 651, 988],[140, 585, 303, 785],[001, 442, 154, 852],[151, 564, 217, 681],[302, 642, 345, 713];Storage box:[634, 945, 747, 974],[647, 919, 745, 952],[648, 885, 737, 924],[647, 833, 712, 863];Person:[153, 585, 265, 905];Umbrella:[122, 796, 164, 833];Handbag:[206, 705, 282, 776];High Heels:[429, 813, 471, 870],[574, 825, 625, 877],[817, 747, 865, 780],[833, 762, 881, 793],[859, 772, 905, 802],[468, 641, 516, 679],[508, 642, 558, 678],[445, 715, 490, 753],[862, 446, 906, 475],[899, 778, 948, 809],[663, 855, 730, 896],[649, 841, 717, 876],[924, 715, 971, 747],[868, 709, 936, 739],[482, 732, 533, 782],[588, 821, 639, 857],[847, 764, 894, 797],[820, 754, 871, 785];Boots:[176, 863, 243, 910],[201, 839, 251, 885],[672, 123, 713, 154].') 2023-11-02 22:41:36.927 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:41:36.927 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Wild Bird:[514, 284, 621, 348],[009, 249, 123, 321],[108, 232, 228, 302],[213, 260, 331, 321],[307, 113, 422, 210].') 2023-11-02 22:41:37.817 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:41:37.817 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Fairies clipart sad. The real story [015, 011, 984, 988] of') 2023-11-02 22:41:39.902 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:41:39.902 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'bicycle wheel:[018, 343, 584, 990],[575, 404, 719, 767];bicycle:[001, 112, 716, 999];wheel:[000, 388, 552, 999],[576, 406, 714, 767];tire:[000, 387, 551, 999],[562, 401, 725, 765].') 2023-11-02 22:41:41.446 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:41:41.446 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'desk:[101, 097, 870, 960],[880, 285, 998, 997];table:[091, 139, 921, 958];cupboard:[085, 071, 898, 965];mug:[431, 496, 528, 635],[458, 315, 565, 459];coffee cup:[433, 491, 528, 633],[456, 320, 566, 459].') 2023-11-02 22:41:50.147 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:41:50.147 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[033, 018, 959, 994] Children with Thanakha makeup on [481, 101, 847, 412] their faces') 2023-11-02 22:42:07.325 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:42:07.325 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[093, 430, 303, 808] Penthouse in [008, 004, 994, 998] the city, holiday rental in Rehovot') 2023-11-02 22:42:20.375 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:42:20.376 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[441, 343, 873, 625] and track its trajectory.Your response should highlight trajectories using the established Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a marine iguanaFrame1:[441, 343, 873, 625];Frame2:[444, 291, 873, 531];Frame3:[446, 270, 874, 545];Frame4:[445, 287, 874, 547];Frame5:[435, 293, 873, 540].') 2023-11-02 22:42:36.629 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:42:36.629 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Other Fish:[049, 109, 947, 681];Potted Plant:[264, 022, 304, 108],[302, 021, 387, 115];Vase:[705, 051, 764, 111],[478, 057, 651, 130];Desk:[000, 595, 1000, 999];Bowl:[066, 029, 186, 107].') 2023-11-02 22:43:17.673 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:43:17.674 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'exterior shots of wind turbines [674, 418, 817, 832] [125, 252, 318, 809] and a rainbow [239, 004, 702, 459] in a farm field [005, 794, 994, 995] in northamptonshire, united kingdom - sustainable energy stock videos & royalty-free footage') 2023-11-02 22:44:31.708 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:44:31.709 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'shows [243, 669, 745, 913] the wood ledge shelves with multiple wood frames on [008, 005, 765, 994] gray wall in a hallway') 2023-11-02 22:45:18.105 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:45:18.105 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A group of [415, 282, 555, 635] [674, 231, 822, 951] [474, 555, 582, 981] [664, 602, 777, 995] [837, 201, 967, 745] [795, 564, 975, 995] [548, 536, 690, 996] [393, 536, 499, 997] [187, 234, 304, 983] [290, 289, 414, 950] [024, 240, 155, 996] students protesting for awareness about the climate crisis.') 2023-11-02 22:45:20.576 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-02 22:45:20.577 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect man and microphone in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'man:[144, 100, 610, 989];microphone:[002, 563, 157, 654],[308, 258, 376, 340],[528, 480, 681, 534],[616, 830, 770, 907].') 2023-11-02 22:45:47.149 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:45:47.149 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'boy:[470, 032, 970, 985];human body:[215, 055, 741, 989],[543, 017, 999, 999],[855, 113, 999, 306];human hair:[202, 033, 563, 619],[523, 029, 783, 480],[896, 130, 972, 219];human head:[255, 024, 554, 520],[517, 037, 796, 521],[880, 148, 960, 270];mammal:[108, 000, 703, 966],[000, 283, 143, 537],[526, 026, 999, 999],[852, 135, 997, 314];clothing:[638, 340, 1000, 1000],[169, 465, 663, 995];human eye:[402, 261, 468, 321],[653, 216, 724, 284];human mouth:[434, 389, 511, 458],[621, 382, 701, 448];human ear:[310, 313, 352, 382],[769, 226, 798, 325];man:[538, 030, 999, 999];girl:[161, 003, 665, 999];human face:[353, 122, 541, 511],[538, 098, 778, 508],[891, 164, 954, 258].') 2023-11-02 22:45:59.356 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:45:59.357 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect man,human face,woman and clothing in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'man:[000, 091, 439, 999];human face:[209, 161, 420, 607],[449, 169, 614, 512],[619, 320, 785, 638];woman:[564, 252, 968, 997],[048, 034, 661, 999];clothing:[035, 395, 660, 999],[617, 570, 998, 999].') 2023-11-02 22:46:09.026 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:46:09.026 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A herd of [764, 402, 905, 641] [067, 362, 249, 627] [225, 349, 384, 621] [358, 443, 537, 630] elephants with speech bubbles filled with [175, 081, 994, 384] various charts and data') 2023-11-02 22:46:17.423 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:46:17.423 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'man:[040, 710, 171, 997],[170, 662, 282, 999],[237, 730, 317, 905],[320, 719, 395, 903],[409, 650, 464, 836],[526, 623, 586, 838],[596, 709, 635, 805],[643, 639, 705, 841];clothing:[033, 740, 170, 996],[000, 828, 020, 999],[131, 876, 251, 999],[218, 763, 280, 999],[242, 735, 472, 870],[410, 665, 465, 837],[444, 165, 543, 242],[525, 645, 583, 843],[595, 717, 635, 795],[633, 720, 653, 791],[641, 662, 701, 829],[707, 707, 756, 890],[793, 710, 817, 798],[925, 645, 954, 777];palm tree:[695, 023, 962, 625],[074, 000, 320, 894],[388, 367, 514, 632];woman:[126, 748, 258, 999],[253, 730, 303, 879],[435, 730, 473, 835],[628, 708, 653, 801],[707, 675, 759, 903];building:[000, 101, 273, 720],[235, 203, 432, 692],[331, 227, 983, 770].') 2023-11-02 22:46:35.521 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:46:35.521 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect person,snack and doughnut in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'person:[022, 000, 494, 187],[352, 000, 999, 451];snack:[048, 318, 999, 969].') 2023-11-02 22:46:37.203 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:46:37.203 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[460, 088, 769, 995] daughter whispering to [128, 031, 589, 994] mom on [001, 198, 999, 991] the couch while mom discovers how parents can influence their children') 2023-11-02 22:46:41.615 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:46:41.615 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'human body:[766, 280, 935, 955],[002, 261, 094, 999],[139, 119, 759, 999],[605, 349, 620, 430],[650, 212, 805, 988],[959, 411, 999, 984];human hair:[237, 103, 518, 493],[732, 207, 806, 356],[827, 270, 895, 518];human head:[236, 128, 522, 535],[000, 260, 053, 506],[739, 209, 807, 330],[831, 268, 893, 414];man:[135, 188, 728, 984],[640, 211, 832, 992];glasses:[306, 306, 519, 399];tree:[002, 000, 999, 623],[145, 000, 337, 701];human face:[007, 372, 040, 499],[316, 231, 496, 559];human arm:[051, 586, 108, 784],[143, 464, 545, 999],[564, 637, 746, 999],[648, 370, 681, 711],[746, 386, 810, 784],[777, 434, 859, 604],[868, 436, 928, 609],[950, 439, 999, 699];sports uniform:[666, 340, 825, 919];human nose:[422, 320, 471, 420].') 2023-11-02 22:47:08.042 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:47:08.042 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'plant:[000, 036, 934, 984];food:[176, 318, 267, 429],[201, 231, 281, 326],[250, 271, 340, 376],[342, 497, 446, 617],[343, 615, 457, 753],[406, 042, 490, 137],[415, 126, 489, 215],[426, 535, 534, 671],[640, 778, 728, 887],[646, 666, 732, 770],[710, 730, 810, 850],[806, 388, 842, 464],[848, 413, 935, 504].') 2023-11-02 22:47:20.841 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:47:20.842 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Former World Series co-MVP Curt Schilling [005, 019, 916, 994] "has been advised that ... his employment with ESPN has been terminated," the network said.') 2023-11-02 22:47:23.626 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:47:23.626 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me what is thisFrame1:[430, 630, 527, 934] and track its trajectory.Any trajectory details should be arranged according to the Frame t:[xmin,ymin,xmax,ymax] format for clarity.', 'This is a mountain bikeFrame1:[430, 630, 527, 934];Frame2:[400, 524, 486, 756];Frame3:[388, 540, 463, 743].') 2023-11-02 22:47:43.717 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:47:43.717 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect woman,man,mammal,human head,human face,human nose,vase and computer mouse in this image.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'woman:[468, 325, 960, 1000],[353, 418, 520, 800],[482, 534, 568, 709],[863, 526, 941, 712];man:[000, 181, 547, 999];mammal:[499, 383, 966, 975],[000, 188, 557, 999],[353, 411, 521, 795],[490, 524, 565, 713],[593, 539, 640, 605],[828, 565, 900, 634],[850, 525, 940, 740],[980, 894, 999, 999],[981, 482, 999, 906];human head:[611, 290, 789, 545],[211, 198, 405, 537],[353, 417, 484, 596],[483, 530, 566, 661],[595, 543, 658, 600],[831, 565, 898, 629],[978, 485, 999, 598];human face:[236, 234, 398, 547],[503, 554, 558, 666],[626, 327, 763, 589];human nose:[330, 356, 366, 419].') 2023-11-02 22:47:46.306 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:47:46.306 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[248, 460, 528, 554] and track its trajectory.To maintain consistency, ensure that trajectories in your answer match the Frame t:[xmin,ymin,xmax,ymax] setup.', 'This is a pt boatFrame1:[248, 460, 528, 554];Frame2:[240, 468, 526, 560];Frame3:[245, 468, 539, 572];Frame4:[253, 470, 550, 570].') 2023-11-02 22:48:01.997 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:48:01.997 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Leather Shoes,Glasses,Wine Glass,Cup and Boots in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Leather Shoes:[343, 930, 370, 999];Glasses:[799, 668, 831, 685],[622, 542, 639, 550],[387, 572, 430, 590],[350, 502, 359, 511];Wine Glass:[949, 808, 994, 932],[566, 665, 591, 737],[919, 543, 932, 576],[783, 526, 794, 547],[746, 533, 757, 558],[646, 553, 658, 576],[482, 584, 498, 630],[024, 563, 037, 601],[250, 569, 264, 611];Cup:[593, 676, 619, 720],[568, 685, 595, 715],[641, 631, 662, 693],[685, 631, 705, 670],[842, 576, 856, 596],[743, 548, 751, 558],[322, 613, 341, 641],[273, 653, 295, 684],[177, 579, 193, 604];Boots:[154, 768, 209, 924].') 2023-11-02 22:48:10.328 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:48:10.328 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[112, 066, 627, 972] A person dressed for yoga leans into a pose near [000, 237, 992, 680] a lake at sunrise. Exercising mindfully can increase your likelihood of making your workouts a consistent habit.') 2023-11-02 22:48:15.447 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:48:15.447 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The year’s top quotes [367, 549, 616, 759]: ‘Wear a mask [321, 286, 680, 993]’ and ‘I can’t breathe’') 2023-11-02 22:48:46.362 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:48:46.362 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect girl and clothing in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'girl:[005, 634, 093, 812],[164, 615, 218, 775],[789, 605, 966, 995],[851, 657, 983, 985];clothing:[000, 675, 057, 999],[000, 668, 090, 784],[101, 711, 190, 794],[121, 340, 460, 992],[681, 694, 774, 995],[788, 716, 968, 999],[855, 735, 971, 985].') 2023-11-02 22:48:57.405 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:48:57.406 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'tree:[000, 000, 999, 773].') 2023-11-02 22:49:33.391 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:49:33.392 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Bracelet and Handbag/Satchel in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Bracelet:[471, 615, 544, 647];Handbag:[487, 393, 542, 425].') 2023-11-02 22:49:47.257 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:49:47.258 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'bucket:[679, 465, 693, 491];Person:[283, 415, 297, 462],[986, 424, 999, 570];SUV:[777, 439, 1000, 571],[000, 386, 275, 632],[435, 427, 515, 525],[488, 427, 543, 503],[674, 434, 734, 484];Truck:[786, 374, 1000, 485],[000, 355, 153, 460],[188, 365, 418, 474],[802, 444, 827, 463];Street Lights:[803, 242, 834, 463],[629, 372, 637, 461];Pickup Truck:[271, 415, 465, 569],[508, 422, 555, 473];Car:[579, 440, 607, 466];Desk:[269, 453, 283, 467];Chair:[918, 458, 934, 488];Hat:[986, 424, 1000, 443].') 2023-11-02 22:50:14.799 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:50:14.799 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Congregation members [006, 112, 981, 990] personalized the banner [143, 134, 738, 790] with written messages and signatures. (Stephen F. Brown-Pearn [001, 119, 295, 996]') 2023-11-02 22:50:48.670 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:50:48.670 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[783, 562, 857, 900] and track its trajectory.For the trajectories included in the answer, please use the format Frame t:[xmin,ymin,xmax,ymax].', 'This is a personFrame1:[783, 562, 857, 900];Frame2:[794, 575, 876, 933];Frame3:[781, 593, 858, 912];Frame4:[767, 587, 834, 923];Frame5:[744, 577, 814, 916].') 2023-11-02 22:51:01.024 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:51:01.025 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect hair spray,man,dress,suit and glasses in this image.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'man:[236, 349, 365, 992],[311, 338, 397, 635],[343, 377, 391, 492],[391, 315, 473, 580],[487, 369, 552, 561],[680, 000, 999, 999],[846, 378, 895, 469],[879, 395, 943, 545];dress:[345, 583, 555, 922];suit:[560, 196, 897, 1000];glasses:[876, 202, 984, 280],[420, 462, 505, 506],[611, 229, 728, 264].') 2023-11-02 22:51:15.479 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:51:15.479 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'This white lace midi dress [217, 344, 758, 779] is the perfect summer wardrobe staple for all your warm weather events! It comes in so many fantastic colors too, and is on sale now!') 2023-11-02 22:51:18.250 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:51:18.250 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'food:[745, 641, 849, 752];drink:[888, 758, 954, 808],[955, 880, 999, 998];woman:[000, 175, 162, 998],[200, 000, 520, 450],[436, 314, 741, 998];man:[071, 198, 531, 998],[642, 298, 725, 550],[754, 334, 938, 611];clothing:[074, 461, 544, 998],[223, 157, 516, 448],[441, 520, 750, 998],[587, 504, 714, 628],[641, 350, 725, 545],[806, 477, 882, 608];human face:[278, 015, 368, 158],[316, 247, 497, 591],[522, 322, 632, 511],[798, 370, 842, 457].') 2023-11-02 22:51:38.808 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:51:38.809 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect auto part and person in this image.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'auto part:[149, 432, 275, 504],[182, 383, 577, 519],[554, 275, 832, 500],[670, 447, 976, 593];person:[000, 412, 026, 688],[011, 396, 087, 674],[044, 414, 091, 689],[083, 391, 163, 676],[261, 442, 323, 683],[383, 426, 455, 704],[460, 407, 549, 699],[580, 416, 633, 698],[706, 402, 770, 688],[900, 424, 973, 694].') 2023-11-02 22:51:54.234 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:51:54.235 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Members of the military [512, 227, 600, 772] [408, 177, 573, 889] walk the hallway of Cell Block C in the Camp 5 detention facility in January 2012.') 2023-11-02 22:52:05.605 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:52:05.605 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[191, 027, 814, 995] Jamie Foxx set to star in Netflix series inspired by relationship with his daughter') 2023-11-02 22:52:22.879 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:52:22.880 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the ladies dancing in this video clip?Ensure the trajectories in your answer follow the TrackiFrame t:[xmin,ymin,xmax,ymax] structure.', 'There are 4 ladies dancing.Track1frame:1:[468, 483, 565, 702];frame:2:[285, 452, 375, 656];frame:3:[465, 466, 525, 585],Track2frame:1:[757, 481, 850, 685];frame:2:[760, 462, 851, 683];frame:3:[717, 485, 834, 685],Track3frame:1:[198, 472, 273, 637];frame:2:[184, 458, 250, 677];frame:3:[220, 466, 257, 679],Track4frame:1:[381, 464, 439, 716];frame:2:[232, 452, 285, 662];frame:3:[343, 456, 457, 706].') 2023-11-02 22:52:27.543 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:52:27.543 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the avian creature moving forward.Use the specified TrackiFrame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'There are 3 avian creature moving forward.Track1frame:1:[000, 151, 889, 998];frame:2:[101, 001, 876, 998];frame:3:[157, 001, 868, 998],Track2frame:1:[650, 165, 771, 643];frame:2:[653, 157, 776, 625];frame:3:[648, 144, 771, 607],Track3frame:1:[670, 243, 862, 849];frame:2:[677, 234, 869, 863];frame:3:[669, 224, 863, 904].') 2023-11-02 22:52:28.224 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:52:28.224 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect Leather Shoes,Person,Other Shoes and Street Lights in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[662, 069, 882, 1000],[560, 158, 704, 815],[351, 178, 514, 850];Other Shoes:[573, 752, 602, 800],[639, 758, 666, 814];Street Lights:[650, 041, 688, 272].') 2023-11-02 22:53:01.726 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:53:01.727 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[001, 200, 096, 996],[150, 337, 218, 534],[274, 365, 378, 603],[273, 389, 381, 777],[382, 336, 465, 672],[460, 331, 573, 818],[620, 369, 729, 732],[737, 311, 844, 794];Hat:[001, 204, 080, 284].') 2023-11-02 22:53:03.128 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:53:03.128 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect shorts,human face,boy,man and ladder in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'shorts:[325, 654, 559, 760];human face:[384, 145, 564, 298];boy:[300, 132, 692, 901];man:[214, 156, 295, 434].') 2023-11-02 22:53:30.290 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:53:30.290 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'cat:[180, 000, 768, 999].') 2023-11-02 22:53:37.284 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:53:37.284 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect clothing,chair,table and person in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'clothing:[000, 553, 228, 970],[128, 573, 201, 653],[208, 640, 350, 849],[371, 750, 505, 999],[382, 481, 415, 544],[405, 580, 498, 688],[591, 393, 643, 509],[643, 648, 873, 957],[826, 438, 869, 569],[873, 549, 943, 665],[900, 580, 999, 923];chair:[755, 801, 920, 993],[000, 754, 160, 999],[671, 476, 706, 561],[901, 759, 999, 994];table:[616, 496, 710, 576];person:[000, 486, 123, 662],[000, 460, 253, 999],[076, 377, 110, 442],[126, 500, 212, 654],[167, 497, 221, 574],[184, 474, 222, 545],[198, 560, 366, 866],[231, 500, 283, 591],[233, 528, 293, 626],[267, 436, 313, 516],[301, 486, 345, 580],[317, 501, 372, 606],[368, 525, 401, 591],[376, 455, 460, 586],[378, 689, 522, 999],[405, 541, 504, 725],[588, 377, 654, 556],[648, 531, 868, 962],[699, 521, 737, 684],[775, 537, 818, 627],[828, 406, 873, 571],[858, 500, 947, 661],[878, 480, 999, 924].') 2023-11-02 22:54:05.927 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-02 22:54:05.927 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'clothing:[696, 000, 999, 814].') 2023-11-02 22:54:22.749 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:54:22.750 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'tree:[000, 076, 263, 787],[454, 304, 996, 800];train:[383, 447, 821, 883];car:[000, 603, 042, 884],[197, 758, 309, 855].') 2023-11-02 22:55:00.229 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:55:00.229 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'skull:[554, 151, 768, 484];clothing:[421, 304, 961, 978];human face:[565, 172, 766, 515].') 2023-11-02 22:55:14.735 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:55:14.735 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[468, 587, 532, 772],personFrame1:[646, 423, 695, 562],personFrame1:[428, 515, 479, 666],personFrame1:[703, 376, 764, 505],personFrame1:[507, 430, 543, 586] in this video clip.The prescribed format for your answer is classFrame t:[xmin,ymin,xmax,ymax]. Please follow it closely.', 'personFrame1:[468, 587, 532, 772];Frame2:[456, 573, 512, 769],personFrame1:[646, 423, 695, 562];Frame2:[667, 420, 711, 565],personFrame1:[428, 515, 479, 666];Frame2:[462, 516, 506, 668],personFrame1:[703, 376, 764, 505];Frame2:[707, 372, 747, 505],personFrame1:[507, 430, 543, 586];Frame2:[496, 412, 535, 572].') 2023-11-02 22:55:16.997 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:55:16.998 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Lamp:[045, 069, 107, 203],[123, 059, 180, 193],[211, 070, 266, 206],[280, 069, 335, 207],[371, 075, 426, 213],[435, 079, 486, 214],[536, 080, 593, 213],[602, 080, 659, 219],[693, 084, 747, 215],[768, 088, 816, 224],[821, 088, 872, 225],[882, 093, 934, 229];Guitar:[751, 523, 915, 678],[371, 640, 403, 806],[909, 649, 962, 808];Drum:[579, 714, 635, 787],[476, 734, 583, 895],[446, 735, 481, 788],[241, 741, 317, 850],[565, 786, 621, 900];Violin:[060, 548, 126, 643];Person:[001, 451, 040, 908],[010, 469, 106, 908],[124, 538, 237, 888],[445, 433, 605, 905],[760, 421, 870, 909];Hat:[517, 431, 554, 476];Pickup Truck:[107, 654, 465, 815];Van:[735, 702, 915, 839];Tripod:[281, 648, 329, 914],[483, 526, 578, 918],[205, 525, 271, 921];Speaker:[062, 798, 207, 921],[322, 804, 493, 923],[610, 802, 772, 923],[900, 806, 999, 924];Microphone:[680, 614, 718, 654].') 2023-11-02 22:55:56.059 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:55:56.060 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Flower:[002, 182, 079, 710];Person:[430, 113, 750, 999],[229, 256, 473, 773];Hat:[574, 117, 741, 416].') 2023-11-02 22:56:02.233 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:56:02.234 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[312, 001, 793, 999],[945, 374, 999, 1000],[001, 719, 052, 822];Satchel:[001, 787, 099, 998].') 2023-11-02 22:56:08.086 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:56:08.086 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Potted Plant:[613, 741, 659, 890];Person:[182, 565, 281, 1000];Car:[274, 630, 450, 847].') 2023-11-02 22:56:10.013 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:56:10.013 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the horse running in a circle.Ensure the trajectories in your answer follow the TrackiFrame t:[xmin,ymin,xmax,ymax] structure.', 'There are 4 horse running in a circle.Track1frame:1:[363, 650, 419, 821];frame:2:[363, 650, 419, 821];frame:3:[320, 621, 364, 771],Track2frame:1:[258, 662, 383, 862];frame:2:[258, 662, 383, 862];frame:3:[159, 679, 338, 879],Track3frame:1:[007, 698, 025, 740];frame:2:[007, 698, 025, 740];frame:3:[001, 737, 042, 824],Track4frame:1:[000, 668, 321, 999];frame:2:[000, 668, 321, 999];frame:3:[145, 707, 650, 999].') 2023-11-02 22:56:41.752 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-02 22:56:41.752 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Horse:[127, 105, 866, 1000].') 2023-11-02 22:56:45.833 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:56:45.834 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Potted Plant:[001, 388, 104, 943],[417, 487, 502, 748],[597, 487, 634, 762];Mirror:[062, 158, 166, 746],[188, 231, 275, 727],[287, 280, 352, 707];Bench:[482, 685, 602, 757];Desk:[132, 635, 329, 832];Flower:[220, 463, 268, 622],[193, 463, 235, 616];Vase:[223, 606, 269, 658],[198, 608, 224, 653].') 2023-11-02 22:56:50.780 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:56:50.780 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[301, 517, 381, 622],carFrame1:[808, 493, 965, 612],carFrame1:[690, 529, 768, 591] in this video clip.Adhere strictly to the format categoryFrame t:[xmin,ymin,xmax,ymax] when providing your answer.', 'carFrame1:[301, 517, 381, 622];Frame2:[298, 512, 377, 617],carFrame1[808, 493, 965, 612];Frame2:No object,carFrame1:[690, 529, 768, 591];Frame2:[783, 513, 894, 602].') 2023-11-02 22:56:53.756 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:56:53.756 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Storage box:[926, 813, 1000, 1000];Hat:[079, 201, 207, 334];Boots:[706, 676, 783, 809];Bakset:[927, 814, 999, 1000];Person:[595, 241, 732, 715],[535, 246, 783, 803],[428, 257, 577, 875],[227, 203, 519, 1000].') 2023-11-02 22:56:56.209 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:56:56.209 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Hat:[883, 443, 997, 536],[464, 796, 534, 854],[661, 711, 697, 752],[538, 753, 576, 789],[673, 622, 709, 686],[398, 846, 424, 897];Glasses:[472, 811, 505, 849];Person:[082, 883, 155, 1000],[158, 933, 191, 999],[190, 930, 224, 999],[215, 816, 340, 1000],[296, 920, 351, 999],[294, 831, 331, 927],[331, 850, 417, 999],[440, 794, 561, 999],[426, 803, 497, 996],[495, 748, 560, 887],[538, 752, 574, 840],[546, 763, 609, 950],[562, 658, 689, 1000],[642, 685, 673, 750],[661, 711, 699, 761],[688, 663, 772, 999],[682, 620, 817, 1000],[642, 642, 675, 684],[682, 587, 718, 635],[671, 614, 712, 698],[687, 628, 713, 721],[771, 619, 795, 733],[783, 610, 833, 999],[773, 575, 801, 622],[778, 560, 820, 611],[804, 550, 834, 599],[827, 439, 1000, 998],[819, 464, 909, 999],[842, 564, 910, 657];Flag:[948, 327, 999, 408],[829, 256, 949, 490],[847, 420, 992, 571],[845, 305, 951, 428],[730, 416, 823, 560],[619, 506, 692, 579],[631, 557, 659, 679],[647, 552, 677, 618],[485, 433, 580, 621],[550, 491, 615, 704],[599, 518, 632, 649],[530, 597, 563, 659],[479, 578, 518, 680],[491, 625, 535, 680],[459, 539, 510, 761],[411, 625, 472, 722],[394, 637, 461, 769],[335, 671, 406, 753],[288, 577, 336, 834],[213, 667, 306, 775],[215, 776, 256, 825],[261, 772, 295, 836],[162, 868, 219, 938],[421, 767, 469, 824],[200, 797, 226, 864],[258, 638, 302, 789],[816, 103, 836, 134];Bottle:[830, 704, 860, 816],[684, 738, 705, 797].') 2023-11-02 22:57:15.274 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:57:15.274 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "[003, 031, 674, 998] Fred Davis served a four-game suspension for violating the NFL's substance abuse policy in 2011. [002, 000, 997, 993] (Washington Post/Getty Images") 2023-11-02 22:57:53.435 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:57:53.436 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The groom and [714, 132, 896, 955] [100, 184, 378, 963] [636, 130, 767, 940] [316, 127, 482, 947] [550, 155, 712, 956] [414, 179, 576, 970] his friends, each with [466, 509, 534, 972] [351, 118, 398, 389] [037, 221, 265, 271] [698, 052, 751, 463] [102, 522, 200, 966] [789, 117, 880, 408] a medieval sword') 2023-11-02 22:58:09.857 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:58:09.857 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Cabinet:[000, 000, 982, 968];Lamp:[158, 299, 806, 379];Bottle:[676, 041, 716, 176],[646, 043, 679, 232],[560, 611, 604, 715];Plate:[647, 245, 899, 293],[372, 261, 642, 313],[163, 296, 382, 329],[044, 306, 163, 334],[218, 431, 415, 524],[070, 481, 125, 542],[267, 451, 469, 530],[028, 586, 160, 596],[262, 537, 425, 597],[489, 425, 635, 520],[516, 562, 599, 590],[684, 428, 889, 476],[679, 543, 916, 573],[737, 773, 910, 888],[575, 692, 821, 830],[565, 828, 738, 896],[357, 696, 585, 785],[342, 780, 591, 860],[228, 834, 375, 892],[069, 818, 239, 888];Basin:[240, 790, 291, 836];Lemon:[847, 769, 890, 827],[726, 466, 765, 504],[760, 393, 794, 437],[406, 495, 455, 552],[109, 483, 168, 531],[181, 805, 228, 857],[182, 855, 227, 903];Watermelon:[326, 186, 423, 280];Hamimelon:[606, 171, 670, 267];Toilet Paper:[861, 053, 919, 271];Person:[550, 224, 687, 722].') 2023-11-02 22:58:16.528 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 22:58:16.529 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Photo of a male DJ [323, 369, 493, 752] in front of a painting [301, 167, 800, 745] at the Blanton Museum of Art while he is mixing on his deck.') 2023-11-02 23:00:07.311 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:00:07.312 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect girl,clothing,shelf and human face in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'girl:[243, 000, 998, 996];clothing:[240, 617, 999, 998];shelf:[000, 329, 214, 671],[394, 000, 961, 718];human face:[383, 200, 721, 726].') 2023-11-02 23:00:16.869 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:00:16.870 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Peruvian archaeologists [442, 424, 553, 855] command a drone [366, 341, 578, 605] to search for architectural ruins. (New York Times') 2023-11-02 23:01:09.300 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:01:09.300 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect wheelchair,man and woman in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'wheelchair:[425, 425, 768, 998];man:[000, 000, 078, 759],[474, 034, 712, 445],[474, 030, 715, 960],[665, 418, 734, 614];woman:[000, 000, 173, 889],[160, 000, 429, 998],[494, 231, 836, 998].') 2023-11-02 23:01:12.865 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:01:12.865 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3,frame4 and frame5,what is the track of the black vehicles in the left?Stick to the TrackiFrame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'There is one black vehicles in the left.Track1frame4:[020, 504, 109, 602];frame5:[000, 512, 067, 621].') 2023-11-02 23:01:19.514 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:01:19.514 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Chair:[558, 077, 606, 156],[182, 875, 385, 998],[351, 774, 496, 1000],[463, 703, 565, 903],[540, 658, 597, 812],[108, 766, 159, 849];Cabinet:[037, 485, 106, 621],[125, 432, 170, 533];Umbrella:[001, 053, 531, 930],[293, 269, 632, 577];Person:[002, 716, 160, 996],[001, 681, 116, 961],[118, 604, 282, 844],[123, 480, 243, 723],[331, 611, 474, 907],[298, 572, 365, 753],[354, 560, 403, 644],[431, 582, 531, 889].') 2023-11-02 23:01:28.930 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-02 23:01:28.930 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'person:[086, 215, 213, 991],[210, 252, 320, 942],[318, 328, 428, 940],[403, 320, 494, 927],[507, 328, 595, 942],[589, 290, 681, 962],[672, 302, 770, 957],[754, 308, 835, 962],[841, 252, 953, 966];suit:[686, 265, 778, 936],[585, 363, 686, 907].') 2023-11-02 23:01:35.732 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:01:35.732 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Baozi:[720, 153, 860, 324],[644, 212, 741, 305],[626, 118, 754, 235];Plate:[326, 531, 703, 874],[597, 223, 813, 335],[654, 074, 731, 133];Chopsticks:[472, 511, 762, 821],[858, 243, 999, 295];Cup:[258, 309, 427, 581];Bottle:[441, 002, 529, 323],[510, 002, 611, 285],[608, 002, 660, 136];Dinning Table:[002, 065, 999, 999].') 2023-11-02 23:01:49.021 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:01:49.021 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Mirror:[774, 444, 948, 565];Desk:[001, 846, 998, 1000],[001, 734, 490, 800];Bench:[001, 854, 061, 897],[001, 812, 055, 854];Person:[052, 406, 311, 980],[213, 182, 468, 916],[465, 469, 741, 873],[722, 492, 868, 811],[887, 514, 999, 803],[836, 497, 907, 698],[494, 597, 523, 686],[640, 553, 670, 667],[609, 616, 648, 670],[071, 536, 118, 600];Flag:[966, 268, 998, 413];Hat:[287, 087, 430, 245];Handbag:[502, 669, 779, 975];Bottle:[936, 624, 991, 847];Bread:[110, 905, 213, 971],[958, 872, 998, 941].') 2023-11-02 23:02:03.026 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:02:03.027 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'a twisted braid [277, 059, 711, 997] with a crystal hair piece is a unique and creative idea') 2023-11-02 23:02:04.856 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:02:04.856 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "[018, 028, 839, 995] Chris Brown's doccie is to reveal all") 2023-11-02 23:02:09.077 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:02:09.078 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'sports uniform:[330, 276, 666, 906],[005, 444, 072, 507],[058, 301, 148, 522],[120, 318, 195, 518],[157, 096, 218, 229],[173, 305, 270, 516],[220, 303, 325, 520],[308, 288, 373, 525],[351, 296, 416, 518],[531, 018, 586, 114],[777, 494, 999, 901];baseball bat:[450, 380, 681, 426];man:[000, 000, 037, 083],[013, 123, 086, 288],[016, 418, 068, 519],[018, 072, 058, 159],[043, 091, 079, 241],[058, 292, 154, 530],[067, 065, 100, 142],[068, 411, 120, 538],[085, 114, 142, 271],[105, 016, 151, 144],[116, 278, 195, 532],[123, 006, 190, 156],[124, 214, 195, 296],[156, 055, 216, 256],[166, 288, 260, 528],[180, 000, 210, 107],[204, 076, 259, 218],[216, 006, 260, 100],[235, 266, 329, 515],[253, 074, 286, 196],[280, 053, 342, 271],[306, 264, 384, 521],[321, 179, 680, 888],[336, 016, 400, 180],[359, 182, 410, 273],[360, 282, 438, 521],[400, 006, 456, 175],[401, 196, 448, 266],[441, 030, 493, 203],[528, 000, 590, 133],[536, 269, 591, 526],[555, 189, 606, 256],[572, 129, 603, 241],[595, 061, 625, 137],[618, 046, 675, 242],[683, 397, 766, 530],[690, 167, 736, 250],[690, 411, 999, 908],[694, 013, 738, 184],[733, 051, 789, 235],[745, 170, 789, 252],[751, 000, 789, 067],[781, 002, 812, 144],[787, 165, 828, 252],[803, 429, 843, 526],[828, 174, 869, 244],[857, 000, 933, 161],[874, 186, 915, 250],[901, 149, 940, 244],[925, 074, 963, 176],[925, 000, 991, 130],[926, 165, 982, 244],[962, 132, 999, 240].') 2023-11-02 23:02:17.115 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:02:17.116 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[697, 320, 767, 568],[390, 395, 785, 1000],[001, 416, 599, 1000];Backpack:[176, 595, 269, 992],[424, 560, 631, 897];Glasses:[453, 472, 540, 516].') 2023-11-02 23:02:23.791 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:02:23.792 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'human body:[276, 883, 347, 972],[386, 897, 571, 998],[402, 747, 528, 935],[512, 699, 691, 975],[530, 715, 720, 998],[547, 468, 631, 601],[583, 595, 750, 957],[642, 500, 797, 934],[702, 421, 750, 522],[750, 616, 902, 998],[821, 389, 896, 470],[880, 194, 999, 405];woman:[287, 874, 339, 945],[368, 703, 534, 941],[491, 670, 596, 998],[522, 629, 753, 998],[541, 540, 750, 950],[875, 188, 980, 410];man:[544, 418, 636, 607],[690, 378, 750, 528],[752, 475, 999, 998];vehicle:[448, 497, 529, 581],[476, 373, 565, 524];clothing:[425, 745, 511, 903],[464, 894, 566, 998],[515, 757, 539, 886],[563, 468, 629, 601],[597, 591, 750, 934],[649, 519, 747, 700],[708, 432, 756, 503],[711, 672, 805, 839],[752, 608, 834, 789],[811, 145, 854, 289],[829, 113, 885, 289],[829, 562, 999, 998],[890, 059, 999, 312];building:[000, 097, 470, 998];human arm:[451, 751, 528, 938],[517, 854, 580, 964],[537, 840, 616, 998],[596, 962, 666, 998],[612, 697, 689, 835],[623, 598, 721, 810],[666, 794, 757, 995],[699, 538, 772, 763],[828, 680, 999, 950],[900, 210, 947, 274].') 2023-11-02 23:02:57.493 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:02:57.493 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The taipan [004, 119, 967, 977] is one of the most venomous snakes in the world.') 2023-11-02 23:03:26.919 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:03:26.920 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Person:[539, 567, 557, 656];Umbrella:[682, 659, 788, 786];Desk:[595, 700, 644, 738];Ship:[210, 400, 715, 999];Crane:[688, 058, 760, 323],[763, 001, 810, 418],[940, 003, 974, 247].') 2023-11-02 23:03:51.694 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:03:51.694 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Traffic Light:[212, 683, 231, 724],[179, 681, 197, 718];Flag:[416, 304, 461, 382],[415, 364, 467, 464];Person:[643, 740, 809, 1000],[771, 765, 809, 907],[796, 867, 889, 1000],[538, 630, 661, 965],[539, 761, 665, 1000],[278, 611, 544, 1000],[221, 660, 342, 992],[328, 768, 358, 841],[000, 662, 279, 1000],[000, 796, 033, 921];Hat:[008, 796, 031, 834],[095, 663, 189, 783],[241, 662, 306, 728],[375, 643, 465, 861],[558, 630, 623, 683],[590, 765, 662, 824];Glasses:[102, 780, 181, 812].') 2023-11-02 23:04:03.467 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:04:03.467 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[086, 624, 174, 877],[180, 590, 316, 838],[447, 562, 610, 825],[472, 598, 607, 855],[677, 590, 802, 845],[810, 597, 943, 818].') 2023-11-02 23:04:22.966 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:04:22.966 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'door:[231, 262, 640, 821];jeans:[525, 583, 671, 786];man:[508, 413, 698, 791].') 2023-11-02 23:04:32.543 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:04:32.544 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the moving brown dog.When you incorporate trajectories into your answer, adhere to the TrackiFrame t:[xmin,ymin,xmax,ymax] format.,Present the trajectories in your response using the designated TrackiFrame t:[xmin,ymin,xmax,ymax] layout.', 'There are 3 moving brown dog.Track1frame:1:[422, 344, 648, 538];frame:2:[554, 360, 836, 563];frame:3:[674, 377, 978, 576],Track2frame:1:[459, 284, 672, 403];frame:2:[681, 301, 933, 442];frame:3:[940, 329, 999, 396],Track3frame:1:[636, 264, 681, 341];frame:2:[851, 264, 975, 368].') 2023-11-02 23:04:53.181 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:04:53.182 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Little boy [211, 356, 518, 706] friendly smiling, waking up and stretching himself after sleep in his small bed [124, 495, 842, 909] in a nursery room [003, 015, 988, 993] with colorful toys [293, 216, 366, 288] [662, 377, 732, 451] [560, 486, 668, 618] [710, 105, 932, 710] [425, 165, 591, 414] [821, 703, 974, 874] on a bright sunny morning, isolated vector cartoon illustration') 2023-11-02 23:04:55.674 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:04:55.674 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[370, 372, 568, 995] and track its trajectory.To maintain consistency, ensure that trajectories in your answer match the Frame t:[xmin,ymin,xmax,ymax] setup.', 'This is a personFrame1:[370, 372, 568, 995];Frame2:[398, 369, 617, 995];Frame3:[432, 227, 596, 951];Frame4:[338, 344, 586, 995].') 2023-11-02 23:05:02.381 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:05:02.381 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'bicycle:[345, 329, 778, 930],[106, 279, 566, 671];person:[860, 157, 935, 327],[654, 200, 689, 305],[680, 191, 713, 319],[731, 202, 757, 319],[764, 183, 804, 285],[810, 170, 837, 277];tree:[007, 029, 144, 259],[145, 000, 336, 244],[346, 000, 792, 250];wheel:[360, 625, 575, 912],[106, 368, 179, 479],[134, 360, 187, 445],[162, 395, 247, 511],[184, 418, 276, 554],[205, 444, 315, 600],[237, 478, 376, 668],[318, 363, 351, 463],[355, 382, 400, 492],[400, 397, 437, 537],[458, 411, 566, 570],[660, 493, 799, 732];tire:[364, 605, 573, 915],[106, 370, 186, 480],[166, 388, 265, 520],[186, 406, 280, 553],[205, 434, 329, 595],[237, 470, 381, 665],[319, 365, 361, 462],[350, 380, 394, 464],[394, 393, 446, 510],[455, 415, 570, 572];bicycle wheel:[105, 365, 188, 472],[166, 396, 266, 519],[171, 411, 295, 555],[200, 440, 326, 601],[236, 482, 383, 674],[252, 338, 281, 415],[273, 353, 314, 434],[354, 734, 558, 921],[357, 369, 388, 469],[397, 388, 451, 520],[460, 403, 570, 572],[663, 484, 796, 720].') 2023-11-02 23:06:06.278 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:06:06.278 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'cabinetry:[321, 513, 411, 675],[393, 336, 568, 478];refrigerator:[057, 217, 193, 764],[195, 278, 299, 719];cupboard:[000, 166, 058, 761],[291, 515, 418, 686],[389, 342, 570, 476];countertop:[291, 491, 565, 675];kitchen & dining room table:[408, 539, 741, 905].') 2023-11-02 23:06:37.255 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:06:37.256 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Feature distributions [691, 151, 990, 853] [513, 113, 648, 891] [362, 110, 492, 893] for the Iris flower dataset') 2023-11-02 23:06:46.447 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:06:46.447 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[296, 402, 982, 716] and track its trajectory.Any trajectory details should be arranged according to the Frame t:[xmin,ymin,xmax,ymax] format for clarity.', 'This is a pheasantFrame1:[296, 402, 982, 716];Frame2:[296, 356, 909, 702];Frame3:[335, 343, 903, 691];Frame4:[314, 208, 871, 543];Frame5:[471, 189, 998, 514].') 2023-11-02 23:06:55.198 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:06:55.198 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The emotional intelligence pyramid [018, 009, 983, 991] (9-layer model).') 2023-11-02 23:07:31.230 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:07:31.231 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[232, 225, 271, 310] and track its trajectory.To maintain consistency, ensure that trajectories in your answer match the Frame t:[xmin,ymin,xmax,ymax] setup.', 'This is a person headFrame1:[232, 225, 271, 310];Frame2:[228, 237, 268, 321];Frame3:[253, 204, 295, 293];Frame4:[302, 215, 348, 308];Frame5:[333, 237, 385, 331].') 2023-11-02 23:08:16.158 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:08:16.159 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[233, 115, 946, 717] A scuffed and dirty hockey mask similar to that worn by Jason Vorhees, discarded among [001, 001, 993, 993] some leaves.') 2023-11-02 23:08:33.884 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:08:33.884 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Grilled Barbecue Chicken and Vegetables in Foil - Tender, flavorful chicken covered in sweet barbecue sauce and cooked on [004, 011, 988, 997] the grill inside [015, 008, 586, 492] foil packs with [320, 165, 431, 246] [303, 599, 433, 691] [066, 554, 206, 641] zucchini, bell peppers and [432, 218, 741, 513] asparagus.') 2023-11-02 23:08:52.957 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:08:52.958 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given frame1: and frame2:,track carFrame1:[000, 384, 053, 472],motorcycleFrame1:[712, 355, 773, 432] in this video clip.It's imperative that you stick to the categoryFrame t:[xmin,ymin,xmax,ymax] format for your answer.", 'carFrame1[000, 384, 053, 472];Frame2:No object,motorcycleFrame1:[712, 355, 773, 432];Frame2:[820, 320, 902, 421].') 2023-11-02 23:08:56.774 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:08:56.774 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[815, 756, 939, 997],[328, 676, 428, 977],[623, 705, 656, 907],[712, 713, 785, 983],[951, 721, 999, 985];man:[315, 716, 452, 994],[283, 691, 323, 948],[307, 715, 347, 968],[401, 687, 512, 813],[490, 701, 562, 979],[581, 701, 607, 792],[606, 703, 630, 854],[643, 648, 718, 979],[900, 701, 935, 841];clothing:[645, 674, 738, 978],[290, 729, 310, 929],[307, 750, 327, 861],[323, 763, 430, 996],[399, 700, 588, 797],[493, 747, 554, 988],[532, 783, 618, 992],[604, 737, 627, 836],[627, 729, 650, 830],[713, 767, 781, 972],[796, 758, 865, 927],[817, 822, 937, 986],[900, 729, 930, 845],[942, 787, 999, 988];girl:[946, 721, 999, 985];building:[000, 000, 999, 998].') 2023-11-02 23:09:01.784 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:09:01.784 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The Groom [004, 088, 356, 870] at Great Fosters wedding venue in Egham Surrey gets his Bride [399, 435, 699, 880] and her father smiling on the top table [005, 845, 995, 998] in the Tithe Barn during his entertaining wedding speech') 2023-11-02 23:09:02.532 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:09:02.533 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Setting up and using [118, 231, 883, 824] the Wahoo Tickr Fit armband to measure heart rate during exercise couldn't be easier, making it a top pick among heart rate monitors. - Wahoo Tickr FIT") 2023-11-02 23:09:18.245 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:09:18.245 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'bicycle wheel:[663, 580, 815, 768],[000, 640, 023, 745],[000, 720, 057, 901],[327, 365, 555, 743],[488, 568, 505, 653],[691, 733, 833, 938];bicycle:[000, 443, 137, 901],[616, 471, 807, 761],[692, 670, 998, 936];man:[027, 205, 140, 541],[411, 265, 512, 648],[485, 265, 582, 766],[600, 271, 718, 710],[700, 273, 746, 470],[775, 115, 998, 998];wheel:[316, 363, 560, 746],[672, 568, 807, 766],[686, 731, 836, 935];clothing:[000, 360, 037, 445],[031, 335, 167, 775],[151, 291, 216, 351],[231, 293, 381, 716],[426, 315, 522, 538],[495, 328, 536, 508],[501, 370, 642, 828],[597, 335, 725, 610],[711, 306, 742, 496],[715, 343, 726, 495],[791, 330, 811, 445],[818, 228, 980, 983];tire:[328, 361, 557, 745],[676, 563, 816, 760],[688, 720, 847, 933].') 2023-11-02 23:10:08.353 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-02 23:10:08.353 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'person:[266, 223, 415, 835],[286, 797, 461, 999];desk:[000, 755, 428, 995];human body:[278, 233, 427, 770],[312, 795, 467, 999];table:[000, 715, 517, 999].') 2023-11-02 23:11:01.925 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:11:01.925 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Scrutinize the image and respond to my inquiry: 'QUESTION: Where are [801, 323, 934, 703] and [009, 289, 162, 534] and [136, 406, 292, 898] and [701, 212, 809, 621] and [233, 290, 334, 508] and [883, 264, 999, 497] and [280, 224, 371, 371] and [559, 290, 680, 525] and [533, 201, 602, 394] and [576, 424, 713, 832] and [005, 298, 169, 822] and [283, 451, 460, 998] and [804, 314, 973, 937] and [112, 272, 189, 419] and [406, 246, 484, 403] and [379, 297, 517, 547] , and [426, 180, 493, 328] ? \nANSWER: [801, 323, 934, 703] and [009, 289, 162, 534] and [136, 406, 292, 898] and [701, 212, 809, 621] and [233, 290, 334, 508] and [883, 264, 999, 497] and [280, 224, 371, 371] and [559, 290, 680, 525] and [533, 201, 602, 394] and [576, 424, 713, 832] and [005, 298, 169, 822] and [283, 451, 460, 998] and [804, 314, 973, 937] and [112, 272, 189, 419] and [406, 246, 484, 403] and [379, 297, 517, 547] , and [426, 180, 493, 328] are in a classroom . You should explain the reason for the above answer.'. Make sure you mention the reasoning process and clarify the bounding boxes of relevant objects using square brackets.", '[801, 323, 934, 703] and [009, 289, 162, 534] and [136, 406, 292, 898] and [701, 212, 809, 621] and [233, 290, 334, 508] and [883, 264, 999, 497] and [280, 224, 371, 371] and [559, 290, 680, 525] and [533, 201, 602, 394] and [576, 424, 713, 832] and [005, 298, 169, 822] and [283, 451, 460, 998] and [804, 314, 973, 937] and [112, 272, 189, 419] and [406, 246, 484, 403] and [379, 297, 517, 547] , and [426, 180, 493, 328] are of school age and sitting at school desks . many of the children have textbooks on their tables . [801, 323, 934, 703] and [009, 289, 162, 534] and [136, 406, 292, 898] and [701, 212, 809, 621] and [233, 290, 334, 508] and [883, 264, 999, 497] and [280, 224, 371, 371] and [559, 290, 680, 525] and [533, 201, 602, 394] and [576, 424, 713, 832] and [005, 298, 169, 822] and [283, 451, 460, 998] and [804, 314, 973, 937] and [112, 272, 189, 419] and [406, 246, 484, 403] and [379, 297, 517, 547] , and [426, 180, 493, 328] are wearing school uniforms .') 2023-11-02 23:11:27.385 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:11:27.386 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'man:[210, 114, 295, 386],[060, 000, 143, 106],[331, 160, 451, 387],[424, 213, 486, 380],[461, 134, 547, 413],[504, 237, 571, 385],[558, 182, 631, 397],[611, 497, 689, 875],[628, 180, 708, 382],[708, 198, 793, 389],[884, 000, 961, 134];clothing:[000, 000, 037, 100],[059, 000, 144, 106],[231, 161, 293, 400],[263, 000, 331, 106],[325, 018, 383, 110],[342, 174, 439, 389],[383, 000, 446, 120],[430, 256, 476, 371],[445, 000, 472, 070],[465, 174, 536, 392],[516, 246, 561, 392],[551, 000, 598, 082],[559, 222, 620, 395],[593, 807, 684, 988],[601, 000, 633, 068],[609, 500, 690, 855],[614, 015, 665, 089],[629, 216, 713, 382],[662, 000, 724, 100],[716, 224, 795, 382],[731, 000, 785, 121],[785, 000, 867, 121],[893, 015, 963, 127],[946, 000, 999, 125].') 2023-11-02 23:11:28.239 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:11:28.240 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the the sheep stretching its neck to eat on the far right at the beginning?As you detail the trajectories in your reply, the TrackiFrame t:[xmin,ymin,xmax,ymax] format is the preferred structure.', 'There is one the sheep stretching its neck to eat on the far right at the beginning.Track1frame:1:[527, 474, 752, 738];frame:2:[531, 468, 708, 735];frame:3:[538, 488, 662, 737].') 2023-11-02 23:12:06.619 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:12:06.619 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[774, 705, 813, 878];Street Lights:[268, 452, 325, 742];Car:[256, 739, 354, 814],[203, 734, 271, 802],[156, 730, 214, 791],[114, 727, 163, 784],[079, 727, 122, 777],[060, 730, 086, 772],[001, 724, 063, 794],[734, 706, 841, 788],[939, 728, 993, 813];Stop Sign:[931, 621, 966, 672].') 2023-11-02 23:12:08.717 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:12:08.718 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[456, 456, 539, 668] and track its trajectory.For clarity, represent trajectories using the Frame t:[xmin,ymin,xmax,ymax] format in your response.', 'This is a personFrame1:[456, 456, 539, 668];Frame2:[449, 401, 543, 608];Frame3:[322, 458, 393, 652];Frame4:[285, 465, 351, 631].') 2023-11-02 23:12:12.532 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:12:12.533 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Glasses:[430, 391, 529, 447],[207, 203, 351, 287];Person:[002, 086, 377, 1000],[336, 290, 792, 1000],[301, 391, 410, 762],[119, 295, 204, 399];Necklace:[178, 356, 313, 700],[475, 620, 507, 765],[470, 648, 531, 896];Bracelet:[336, 748, 387, 809].') 2023-11-02 23:12:22.059 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:12:22.060 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'clothing:[529, 214, 905, 965],[000, 418, 506, 864],[504, 520, 682, 878],[837, 411, 886, 540];woman:[000, 050, 614, 999],[021, 249, 134, 487],[614, 183, 990, 880],[824, 327, 913, 529];human face:[343, 177, 455, 461],[687, 219, 816, 432],[833, 338, 875, 401].') 2023-11-02 23:12:51.582 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:12:51.582 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[173, 264, 247, 502],[150, 704, 197, 811],[468, 272, 540, 445],[769, 162, 832, 239].') 2023-11-02 23:12:57.368 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:12:57.369 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Bus:[349, 358, 876, 750],[185, 391, 365, 664],[002, 463, 188, 591];Person:[379, 436, 641, 962],[069, 539, 163, 861],[178, 508, 260, 863].') 2023-11-02 23:12:59.236 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:12:59.236 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'a top view of a gluten free lemon poppy seed bread made with chickpea flour on a dark backgroun [007, 002, 991, 997]') 2023-11-02 23:13:21.220 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:13:21.220 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[055, 148, 920, 858] neon anatomy of an automobile on black background - futuristic car stock pictures, [592, 613, 797, 720] royalty-free photos & images') 2023-11-02 23:14:05.108 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:14:05.109 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Military helicopters [100, 747, 198, 912] [554, 515, 660, 687] [001, 552, 071, 733] [236, 543, 360, 720] [369, 730, 493, 900] [099, 372, 211, 558] [396, 336, 504, 521] [247, 073, 382, 279] fly past the Mexican flag [497, 176, 741, 580] atop the National Palace, during the annual Independence Day military parade [003, 059, 777, 998] in Mexico City's main square, known as the Zocalo, Friday, Sept. 16, 2016.") 2023-11-02 23:14:30.029 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:14:30.029 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, what is this objectframe:1:[040, 385, 232, 973];frame:2:[040, 385, 232, 973];frame:3:[036, 402, 230, 989] doing?Given the image frames and trajectory coordinates, please respond succinctly to the question.Considering the image frames and trajectory coordinates provided, offer a brief answer.', 'This is a/an the giraffes that stay grounded without any movement, maintaining their positions.') 2023-11-02 23:14:32.503 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:14:32.504 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Volleyball:[206, 033, 336, 111];Person:[195, 051, 510, 870],[452, 128, 779, 1000],[291, 001, 415, 195],[429, 001, 580, 191],[826, 001, 910, 037];Handbag:[410, 050, 463, 097].') 2023-11-02 23:14:56.644 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:14:56.645 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Hoverboard,Flask,Cigarette ,Cabinet,Bakset,Glasses,Person and Cup in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Cabinet:[001, 262, 161, 358];Bakset:[067, 383, 215, 421];Glasses:[302, 197, 605, 287];Person:[148, 048, 944, 999];Cup:[094, 381, 155, 442].') 2023-11-02 23:15:17.373 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:15:17.373 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Cat in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Cat:[470, 332, 639, 539].') 2023-11-02 23:15:45.181 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:15:45.181 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[073, 429, 215, 860],[158, 478, 282, 899],[249, 494, 339, 886],[504, 490, 553, 774],[594, 503, 650, 947];man:[087, 424, 123, 556],[140, 437, 201, 623],[320, 567, 415, 806],[377, 424, 514, 999],[606, 454, 700, 979],[620, 403, 781, 999],[739, 443, 852, 999],[800, 357, 988, 999];clothing:[000, 591, 118, 837],[068, 525, 181, 848],[159, 553, 282, 903],[246, 547, 331, 851],[321, 605, 390, 774],[381, 509, 520, 968],[503, 530, 577, 725],[559, 497, 995, 999].') 2023-11-02 23:15:51.716 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:15:51.716 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Get [019, 133, 375, 437] the financial information that you need with [317, 349, 997, 940] Morningstar app for [001, 004, 403, 800] BlackBerry') 2023-11-02 23:16:07.030 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:16:07.030 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect human body,woman and clothing in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'human body:[008, 687, 122, 990],[000, 445, 096, 750],[121, 435, 181, 542],[208, 420, 338, 999],[361, 395, 486, 999],[473, 393, 621, 973],[564, 423, 735, 999],[678, 506, 763, 820],[810, 444, 840, 582],[847, 455, 882, 558],[861, 337, 999, 999];woman:[204, 417, 340, 983],[000, 440, 105, 754],[120, 434, 183, 542],[470, 390, 633, 991];clothing:[872, 401, 993, 865],[000, 511, 098, 770],[000, 730, 133, 999],[123, 482, 188, 561],[197, 506, 333, 999],[358, 469, 491, 999],[476, 485, 589, 958],[564, 485, 778, 999].') 2023-11-02 23:16:15.142 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:16:15.143 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Palm trees [614, 551, 667, 845] [780, 549, 854, 831] [181, 328, 306, 860] [322, 406, 442, 881] [640, 131, 821, 857] [433, 246, 546, 847] are being encroached upon by the rising level of Lago Enriquillo') 2023-11-02 23:16:32.175 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:16:32.175 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The beautiful town of Chania and a visit to [432, 700, 583, 864] [282, 720, 423, 851] [400, 857, 584, 986] the local produce and Farmers Market') 2023-11-02 23:16:42.079 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:16:42.080 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person,Cup,Lamp and Camera in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Person:[000, 000, 407, 1000],[239, 270, 292, 338],[261, 247, 351, 523],[920, 092, 1000, 1000],[966, 433, 999, 709],[750, 105, 879, 726],[670, 126, 771, 378],[579, 236, 899, 999],[422, 107, 622, 930],[553, 178, 636, 701],[397, 210, 471, 707],[377, 241, 424, 586];Cup:[578, 609, 619, 677];Lamp:[591, 099, 616, 160];Camera:[387, 283, 421, 309].') 2023-11-02 23:16:43.706 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-02 23:16:43.706 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'beetle:[578, 114, 849, 750];tree:[000, 000, 556, 588],[909, 406, 999, 555].') 2023-11-02 23:16:48.559 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:16:48.559 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[108, 005, 384, 993] Law enforcement officers work at the scene of a deadly crash in Holtville, Calif., on Tuesday,...') 2023-11-02 23:17:16.438 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:17:16.438 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3,frame4 and frame5,can you tell me the trajectory of the man wearing a white shirt and black pants in this video clip?Ensure the trajectories in your answer follow the TrackiFrame t:[xmin,ymin,xmax,ymax] structure.', 'There is one man wearing a white shirt and black pants.Track1frame1:[625, 261, 1000, 915];frame2:[652, 267, 1000, 938].') 2023-11-02 23:17:17.419 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:17:17.419 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person in this image.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[722, 517, 923, 999],[569, 493, 772, 1000],[374, 577, 618, 1000],[018, 545, 496, 999].') 2023-11-02 23:17:18.348 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:17:18.349 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Sneakers:[002, 862, 098, 970];Person:[002, 266, 503, 999],[288, 325, 1000, 999].') 2023-11-02 23:18:22.228 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:18:22.229 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[414, 287, 500, 772] and track its trajectory.For clarity, represent trajectories using the Frame t:[xmin,ymin,xmax,ymax] format in your response.', 'This is a personFrame1:[414, 287, 500, 772];Frame2:[417, 281, 503, 766];Frame3:[415, 281, 503, 770];Frame4:[416, 287, 503, 770].') 2023-11-02 23:18:33.284 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:18:33.284 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'tree:[425, 334, 456, 434].') 2023-11-02 23:19:13.804 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:19:13.804 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Bench:[051, 863, 523, 961],[087, 930, 684, 997],[001, 597, 248, 631],[001, 563, 239, 595],[001, 534, 235, 563],[806, 530, 978, 561],[824, 558, 996, 592],[856, 594, 967, 627];Person:[009, 569, 106, 634],[124, 574, 197, 630],[370, 516, 455, 593],[499, 457, 631, 594],[648, 523, 736, 592],[001, 693, 096, 827],[014, 676, 088, 793],[073, 656, 128, 751],[209, 658, 252, 713],[253, 642, 328, 741],[254, 676, 303, 750],[208, 682, 290, 785],[163, 703, 242, 827],[049, 710, 242, 875],[001, 849, 139, 1000],[330, 648, 422, 737],[280, 650, 403, 777],[126, 764, 352, 970],[138, 928, 393, 1000],[324, 808, 526, 1000],[336, 683, 568, 866],[480, 661, 539, 736],[487, 693, 586, 793],[482, 724, 684, 937],[569, 786, 850, 1000],[577, 648, 636, 723],[623, 652, 671, 704],[589, 701, 685, 801],[657, 699, 720, 787],[668, 678, 751, 774],[725, 698, 773, 755],[750, 649, 798, 705],[734, 705, 845, 807],[784, 655, 877, 735],[800, 686, 884, 770],[852, 651, 898, 704],[860, 667, 974, 770],[964, 655, 1000, 731],[881, 700, 997, 800],[816, 744, 958, 828],[834, 773, 1000, 1000],[861, 557, 941, 627],[945, 569, 1000, 629].') 2023-11-02 23:19:20.423 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:19:20.423 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'toy:[000, 467, 342, 999],[041, 000, 447, 254],[214, 105, 736, 963],[378, 000, 897, 365],[721, 741, 995, 999],[748, 000, 998, 177].') 2023-11-02 23:19:32.069 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:19:32.069 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'a welcoming boho living room with [381, 003, 995, 609] a mustard wall, [334, 435, 702, 619] neutral furniture, [915, 348, 997, 448] [022, 321, 093, 377] [616, 452, 888, 641] [024, 380, 096, 436] [015, 200, 088, 258] [018, 260, 090, 317] [626, 225, 718, 392] pretty artworks and [429, 188, 506, 246] [001, 698, 110, 912] [035, 474, 098, 573] [084, 391, 190, 593] [293, 343, 364, 489] potted plants') 2023-11-02 23:19:43.551 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:19:43.551 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the person walking in the hallway.The TrackiFrame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'There is one person walking in the hallway.Track1frame1:[951, 245, 1000, 605];frame2:[950, 251, 1000, 617];frame3:[930, 271, 1000, 657].') 2023-11-02 23:20:13.424 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:20:13.424 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the man sitting on the sidelines?For clarity, represent trajectories using the TrackiFrame t:[xmin,ymin,xmax,ymax] format in your response.', 'There is no man sitting on the sidelines.') 2023-11-02 23:20:49.440 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:20:49.440 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Koala campaigners [255, 428, 306, 690] [543, 579, 614, 688] [432, 422, 485, 685] [291, 598, 370, 762] [127, 529, 208, 857] [482, 625, 547, 766] [370, 559, 433, 781] [215, 511, 278, 692] [003, 543, 093, 678] gathered at Uki this week to protest the planned replacement of koala-protection gates at Black Rocks sports field.') 2023-11-02 23:20:52.277 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-02 23:20:52.277 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'human body:[277, 208, 834, 887];footwear:[487, 813, 711, 917];human arm:[306, 268, 469, 485],[427, 315, 860, 644];human leg:[220, 535, 817, 911];man:[227, 128, 869, 908];plant:[004, 165, 415, 903];furniture:[454, 641, 791, 894].') 2023-11-02 23:21:32.414 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:21:32.414 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[051, 287, 511, 998] and track its trajectory.Any trajectory details should be arranged according to the Frame t:[xmin,ymin,xmax,ymax] format for clarity.', 'This is a white rhinocerosFrame1:[051, 287, 511, 998];Frame2:[093, 284, 578, 998];Frame3:[180, 309, 485, 998];Frame4:[231, 377, 514, 998].') 2023-11-02 23:21:56.139 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:21:56.139 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Dog:[165, 196, 602, 768],[560, 054, 999, 710];Plate:[426, 634, 593, 765],[668, 625, 811, 744];Bread:[747, 679, 802, 732].') 2023-11-02 23:22:06.770 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:22:06.771 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Fishers police [001, 360, 361, 699] [513, 344, 996, 787] responded to a call of shots fired Wednesday at Newgistics.') 2023-11-02 23:22:41.217 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:22:41.218 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[283, 243, 560, 938] and track its trajectory.When detailing trajectories in your response, adhere to the Frame t:[xmin,ymin,xmax,ymax] format.', 'This is a personFrame1:[283, 243, 560, 938];Frame2:[287, 340, 526, 894];Frame3:[267, 126, 527, 954];Frame4:[257, 130, 504, 901];Frame5:[271, 233, 526, 888].') 2023-11-02 23:22:49.186 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:22:49.187 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[399, 277, 822, 825] and track its trajectory.Incorporate the Frame t:[xmin,ymin,xmax,ymax] format when presenting trajectories in your response.', 'This is a pheasantFrame1:[399, 277, 822, 825];Frame2:[397, 262, 885, 827];Frame3:[394, 183, 851, 837];Frame4:[393, 122, 752, 830].') 2023-11-02 23:23:05.299 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:23:05.300 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Microphone:[531, 533, 584, 624];Speaker:[103, 641, 235, 942];Guitar:[758, 348, 814, 459];Piano:[314, 455, 481, 684];Person:[575, 514, 972, 998],[768, 195, 890, 712],[495, 432, 712, 1000],[117, 503, 329, 1000],[393, 260, 477, 688],[608, 320, 680, 458];Desk:[879, 513, 1000, 729].') 2023-11-02 23:23:20.315 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:23:20.315 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Person:[598, 378, 626, 442],[623, 384, 655, 451],[698, 290, 724, 368];Speaker:[057, 173, 123, 334],[001, 218, 030, 320];Lamp:[760, 023, 826, 229],[861, 429, 903, 505],[743, 425, 776, 500],[593, 433, 627, 497],[508, 306, 535, 355],[446, 315, 469, 352],[469, 425, 503, 488],[387, 416, 428, 524],[341, 409, 381, 479],[275, 415, 313, 477],[663, 423, 721, 544],[519, 420, 564, 541].') 2023-11-02 23:23:26.306 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:23:26.307 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Street Lights:[213, 518, 267, 748],[407, 646, 445, 725],[476, 052, 642, 719];Car:[000, 765, 056, 830],[040, 760, 088, 821],[179, 750, 267, 805],[303, 762, 611, 915],[827, 650, 1000, 1000],[533, 737, 579, 788],[268, 728, 316, 794];Bicycle:[774, 808, 811, 914];Person:[002, 736, 059, 888],[119, 734, 155, 844],[132, 742, 166, 840],[775, 677, 872, 999];Awning:[797, 599, 862, 654].') 2023-11-02 23:23:42.339 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:23:42.339 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'train:[290, 436, 493, 784],[655, 288, 773, 353].') 2023-11-02 23:23:56.563 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:23:56.563 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given an image,please detect person in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'person:[000, 742, 116, 1000],[032, 770, 206, 1000],[234, 830, 392, 1000],[407, 778, 559, 1000],[600, 820, 774, 1000],[836, 863, 991, 1000],[102, 636, 212, 1000],[185, 571, 317, 1000],[338, 590, 417, 1000],[385, 577, 459, 957],[428, 576, 515, 868],[449, 572, 546, 879],[464, 591, 556, 1000],[503, 590, 583, 955],[542, 573, 610, 920],[600, 579, 697, 956],[629, 596, 727, 1000],[649, 533, 708, 838],[685, 572, 758, 950],[750, 544, 816, 967],[778, 602, 851, 993],[837, 613, 910, 985],[900, 620, 982, 976],[937, 652, 1000, 1000],[934, 568, 994, 915],[833, 575, 919, 923],[942, 537, 999, 833],[865, 549, 920, 888],[813, 545, 885, 912],[842, 518, 888, 804],[747, 560, 822, 866],[426, 547, 525, 866],[287, 565, 374, 927],[185, 604, 277, 946],[099, 558, 203, 943],[027, 516, 090, 756],[104, 536, 147, 745],[369, 475, 402, 627],[398, 460, 437, 620],[437, 458, 469, 619],[510, 492, 568, 705],[233, 508, 290, 754],[216, 524, 252, 757],[184, 600, 246, 892],[252, 449, 334, 665],[489, 550, 534, 647].') 2023-11-02 23:24:13.938 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:24:13.939 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'These numbers are all WRONG! Since this poster [191, 046, 830, 912] was printed, another 92 children have died from baby') 2023-11-02 23:24:28.196 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:24:28.197 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "[762, 428, 925, 851] Army officers outside [243, 057, 784, 461] the Castle entrance during the First World War. They manned the defences in case enemy ships tried to attack Portsmouth's naval base.") 2023-11-02 23:25:03.045 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:25:03.046 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[770, 284, 814, 330] and track its trajectory.For clarity, represent trajectories using the Frame t:[xmin,ymin,xmax,ymax] format in your response.', 'This is a brown airplane flying in the airFrame1:[770, 284, 814, 330];Frame2:[760, 295, 802, 343];Frame3:[759, 311, 797, 358];Frame4:[762, 331, 796, 379];Frame5:[765, 340, 798, 393].') 2023-11-02 23:25:06.455 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:25:06.456 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Sneakers:[547, 720, 619, 761],[291, 811, 362, 850],[394, 826, 509, 869];Other Shoes:[003, 905, 103, 946];Hat:[242, 323, 347, 390];Person:[003, 295, 102, 945],[033, 353, 171, 676],[158, 320, 513, 867],[382, 331, 618, 762],[616, 347, 791, 694],[687, 322, 974, 710],[905, 315, 999, 584],[888, 333, 954, 567].') 2023-11-02 23:25:08.812 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:25:08.813 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'person:[083, 407, 269, 604],[296, 485, 344, 611],[338, 441, 406, 562],[345, 480, 468, 920],[426, 481, 533, 941],[566, 506, 644, 667],[627, 498, 706, 674],[687, 442, 733, 627],[766, 533, 961, 744];building:[000, 000, 999, 658].') 2023-11-02 23:25:10.990 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:25:10.991 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[442, 171, 938, 999],[134, 072, 568, 1000];Air Conditioner:[007, 057, 199, 281];Bed:[002, 714, 236, 999];Pillow:[002, 668, 088, 794];Hanger:[002, 903, 123, 999].') 2023-11-02 23:25:45.536 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:25:45.537 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'cabinetry:[531, 090, 977, 350],[391, 122, 610, 358],[410, 477, 999, 817],[929, 115, 999, 363];gas stove:[562, 479, 910, 535];cupboard:[384, 099, 999, 360],[398, 535, 558, 805],[900, 537, 997, 828];countertop:[412, 480, 999, 549];building:[000, 000, 999, 999];drawer:[418, 546, 543, 599];kitchen appliance:[547, 414, 912, 826].') 2023-11-02 23:25:51.694 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:25:51.694 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[298, 083, 546, 750] and track its trajectory.Your response should highlight trajectories using the established Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a legless lizardFrame1:[298, 083, 546, 750];Frame2:[334, 112, 545, 825];Frame3:[301, 156, 517, 835];Frame4:[378, 197, 550, 964].') 2023-11-02 23:25:59.799 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:25:59.799 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Doug’s Fish Fry occupies three storefronts in downtown Skaneateles, including a separate entrance [569, 550, 655, 840] for the ice cream counter [645, 389, 889, 840].') 2023-11-02 23:26:12.692 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:26:12.693 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Person:[003, 175, 999, 999].') 2023-11-02 23:27:09.089 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:27:09.090 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'cooperative extension wildlife staff [679, 307, 849, 997] [412, 299, 618, 998] [097, 240, 304, 998] viewing wildlife at a lake') 2023-11-02 23:27:14.657 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:27:14.658 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[234, 491, 312, 545] and track its trajectory.Ensure that each trajectory within your response is articulated using the Frame t:[xmin,ymin,xmax,ymax] specification.', 'This is a scotch terrierFrame1:[234, 491, 312, 545];Frame2:[470, 481, 548, 560];Frame3:[515, 437, 572, 522];Frame4:[490, 443, 533, 508];Frame5:[440, 445, 475, 518].') 2023-11-02 23:27:16.420 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:27:16.420 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'SES volunteers [701, 380, 850, 778] [585, 342, 747, 834] [336, 169, 505, 701] [157, 202, 358, 879] fill sandbags [003, 526, 246, 711] [002, 739, 539, 996] for an upcoming storm') 2023-11-02 23:27:22.503 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:27:22.504 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Portrait painting of a musician in a cape holding [011, 442, 993, 983] a didgeridoo.') 2023-11-02 23:27:32.377 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:27:32.377 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Slippers:[337, 691, 473, 752];Handbag:[736, 575, 941, 804];Chair:[737, 292, 1000, 775],[332, 443, 711, 942],[034, 255, 355, 768],[278, 191, 459, 419];Laptop:[566, 211, 820, 385],[332, 229, 561, 352];Person:[635, 077, 963, 722],[054, 027, 474, 757];Desk:[340, 303, 783, 790].') 2023-11-02 23:27:35.688 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:27:35.688 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'person:[112, 000, 999, 998];dessert:[265, 361, 528, 662];mug:[590, 340, 886, 693];coffee cup:[590, 335, 882, 702];baked goods:[263, 365, 521, 686].') 2023-11-02 23:27:51.796 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:27:51.796 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'People [339, 318, 471, 597] [298, 352, 385, 579] [587, 193, 793, 592] [000, 128, 324, 996] [360, 311, 674, 996] hold up their fists [001, 128, 033, 217] [620, 192, 679, 271] after protesting near the spot where George Floyd died while in custody of the Minneapolis Police, on May 26, 2020 in...') 2023-11-02 23:28:07.082 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:28:07.082 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, what is this objectframe:1:[000, 019, 607, 619];frame:2:[000, 020, 660, 679];frame:3:[000, 011, 745, 638] doing?Given the image frames and trajectory coordinates, please respond succinctly to the question.Considering the image frames and trajectory coordinates provided, offer a brief answer.', 'This is a/an the first car moving to the right.') 2023-11-02 23:28:56.121 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:28:56.122 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The 52-year old son [006, 069, 865, 995] of father (?) and mother(?) Amit Singhal in 2021 photo. Amit Singhal earned a million dollar salary - leaving the net worth at million in 2021') 2023-11-02 23:29:01.209 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-02 23:29:01.210 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Person:[835, 173, 933, 502],[317, 191, 365, 261],[298, 317, 408, 418].') 2023-11-02 23:29:05.459 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:29:05.459 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[589, 419, 646, 516],carFrame1:[689, 413, 839, 543] in this video clip.The prescribed format for your answer is classFrame t:[xmin,ymin,xmax,ymax]. Please follow it closely.', 'carFrame1:[589, 419, 646, 516];Frame2:[618, 425, 687, 538],carFrame1:[689, 413, 839, 543];Frame2:[753, 415, 979, 593].') 2023-11-02 23:29:08.189 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:29:08.190 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Chair,Person and Soccer in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Chair:[567, 495, 641, 624],[697, 500, 770, 649],[846, 515, 919, 654],[920, 562, 990, 682],[863, 690, 1000, 773],[767, 853, 904, 999],[503, 820, 644, 999],[266, 788, 435, 972],[879, 768, 1000, 874];Person:[617, 635, 678, 705],[687, 495, 774, 623].') 2023-11-02 23:29:09.154 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:29:09.154 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Our hearts are with NHL star Jordan Staal [173, 007, 684, 996], who has suffered a devastating loss') 2023-11-02 23:29:12.824 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:29:12.824 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'clothing:[732, 401, 862, 987],[001, 235, 471, 999],[346, 490, 372, 556],[380, 481, 456, 674],[441, 746, 512, 859],[507, 698, 595, 861],[541, 607, 603, 861],[590, 682, 649, 821],[628, 659, 690, 780];woman:[495, 690, 605, 861],[625, 636, 688, 810];tree:[000, 000, 804, 559],[263, 090, 753, 481],[523, 221, 770, 538],[737, 323, 783, 491].') 2023-11-02 23:29:25.161 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:29:25.162 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Person:[192, 143, 457, 1000],[563, 118, 875, 1000];Bracelet:[364, 651, 401, 716];Moniter:[363, 373, 450, 499],[160, 455, 217, 719],[002, 386, 080, 505];Keyboard:[085, 671, 143, 721];Cabinet:[002, 237, 999, 550],[849, 421, 914, 661],[002, 499, 085, 603];Desk:[011, 685, 225, 999].') 2023-11-02 23:29:31.185 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:29:31.186 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Illustration of hands pointing to [310, 152, 668, 994] a young woman of colour for a story about how racism impacts mental health and wellbeing.') 2023-11-02 23:29:58.160 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:29:58.160 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Mirror:[022, 121, 262, 581];Toilet:[399, 501, 485, 733];Sink:[158, 557, 339, 685];Trash bin Can:[365, 582, 412, 681].') 2023-11-02 23:30:09.998 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:30:09.998 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Person:[222, 205, 410, 632],[347, 239, 564, 778],[480, 240, 563, 350],[559, 301, 741, 529],[567, 294, 664, 428],[916, 423, 1000, 757],[003, 273, 079, 691];Desk:[716, 474, 967, 689],[586, 414, 875, 594];Storage box:[094, 364, 237, 415];Cabinet:[081, 329, 258, 429],[574, 231, 678, 356];Lamp:[081, 035, 147, 098],[362, 043, 430, 096];Handbag:[329, 422, 390, 513].') 2023-11-02 23:31:17.063 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:31:17.064 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[572, 374, 599, 454];Bicycle:[561, 399, 610, 452];Cow:[287, 457, 457, 719];Barrel:[976, 339, 999, 366].') 2023-11-02 23:31:25.619 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:31:25.620 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[304, 505, 417, 700] and track its trajectory.Stick to the Frame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'This is a guitar played by a boyFrame1:[304, 505, 417, 700];Frame2:[303, 505, 417, 708];Frame3:[304, 505, 415, 702];Frame4:[306, 500, 421, 702];Frame5:[304, 497, 421, 697].') 2023-11-02 23:31:47.672 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:31:47.672 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given frame1: and frame2:,track carFrame1:[000, 606, 184, 829] in this video clip.It's imperative that you stick to the categoryFrame t:[xmin,ymin,xmax,ymax] format for your answer.", 'carFrame1[000, 606, 184, 829];Frame2:No object.') 2023-11-02 23:32:14.969 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:32:14.969 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'wheel:[372, 456, 541, 949],[561, 312, 616, 438],[573, 763, 708, 901],[800, 763, 894, 986],[886, 749, 958, 948];tire:[377, 465, 536, 988],[590, 766, 713, 915],[800, 757, 895, 986],[893, 746, 958, 951];person:[166, 463, 209, 666],[196, 460, 245, 603],[198, 454, 227, 634],[228, 440, 254, 660],[240, 460, 271, 639],[267, 463, 304, 639],[465, 229, 519, 429],[478, 218, 542, 465],[534, 233, 556, 358],[537, 218, 587, 489],[566, 219, 623, 328],[951, 692, 986, 852];land vehicle:[364, 196, 955, 979];flower:[026, 681, 381, 955],[234, 687, 273, 752],[582, 595, 718, 755],[718, 592, 813, 766],[859, 575, 921, 730].') 2023-11-02 23:32:16.778 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:32:16.778 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[148, 194, 676, 748] and track its trajectory.For the trajectories included in the answer, please use the format Frame t:[xmin,ymin,xmax,ymax].', 'This is a pt boatFrame1:[148, 194, 676, 748];Frame2:[147, 197, 684, 762];Frame3:[133, 187, 680, 730];Frame4:[117, 187, 671, 723].') 2023-11-02 23:32:23.352 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:32:23.353 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Person:[194, 283, 260, 438],[238, 274, 283, 429],[018, 309, 069, 395];Book:[471, 855, 780, 999];Flower:[449, 592, 685, 878];Vase:[517, 747, 624, 863];Desk:[317, 744, 799, 999];Chair:[756, 618, 901, 1000],[261, 674, 392, 1000];Couch:[054, 529, 492, 999];Pillow:[121, 727, 165, 828],[375, 659, 429, 729];Clock:[879, 183, 964, 324];Picture:[524, 277, 563, 371],[470, 320, 506, 412],[401, 285, 453, 392],[323, 288, 384, 414],[170, 239, 303, 486],[030, 441, 081, 537],[002, 290, 099, 420].') 2023-11-02 23:32:41.993 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:32:41.993 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Frame:[263, 118, 393, 321],[780, 234, 882, 401];Lamp:[010, 203, 076, 807];Carpet:[007, 695, 999, 999];Cabinet:[467, 443, 642, 639],[929, 234, 999, 482];Pillow:[913, 498, 999, 594],[734, 491, 827, 563],[573, 675, 736, 798];Couch:[673, 456, 999, 735],[091, 598, 877, 1000];Chair:[704, 384, 836, 495];Moniter:[477, 299, 636, 453].') 2023-11-02 23:32:43.827 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:32:43.827 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Chair:[848, 476, 1000, 945];Flower:[002, 658, 700, 1000],[704, 512, 794, 627];Desk:[002, 849, 1000, 998];Person:[536, 208, 933, 867],[157, 190, 490, 844].') 2023-11-02 23:33:04.551 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:33:04.551 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'PLAYER- [085, 727, 889, 934] bright letters and rims mashine [220, 302, 509, 732] [408, 039, 725, 335] black wheels on a white background') 2023-11-02 23:33:17.039 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:33:17.039 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[211, 761, 235, 826] and track its trajectory.Ensure that each trajectory within your response is articulated using the Frame t:[xmin,ymin,xmax,ymax] specification.', 'This is a border collieFrame1:[211, 761, 235, 826];Frame2:[248, 631, 261, 700];Frame3:[322, 473, 338, 531];Frame4:[418, 287, 439, 334];Frame5:[445, 196, 457, 245].') 2023-11-02 23:33:50.713 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:33:50.713 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Book,Cabinet and Computer Box in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Book:[036, 648, 105, 688];shelf:[066, 493, 179, 690].') 2023-11-02 23:33:57.078 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:33:57.078 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '3 men [806, 285, 901, 909] [229, 283, 317, 885] [508, 272, 597, 878] and 3 women [033, 274, 147, 992] [640, 265, 740, 997] [360, 310, 454, 997] smiling behind archery targets [532, 210, 633, 638] [678, 216, 786, 638] [377, 213, 480, 635] [222, 212, 335, 633] ahead of the competition') 2023-11-02 23:34:02.522 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:34:02.522 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Person:[002, 246, 779, 1000],[772, 911, 849, 999].') 2023-11-02 23:34:16.874 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:34:16.875 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[000, 000, 620, 363],[000, 165, 746, 700],[191, 700, 701, 770];fashion accessory:[166, 241, 213, 321],[323, 168, 578, 412],[350, 454, 399, 527],[389, 672, 421, 737],[436, 000, 459, 065],[456, 322, 576, 484],[508, 320, 581, 420],[533, 322, 585, 456];man:[000, 673, 837, 998],[096, 520, 728, 731];tree:[690, 000, 783, 212],[694, 435, 999, 923],[705, 218, 759, 344],[711, 333, 831, 570];human face:[495, 000, 615, 043],[585, 314, 732, 494],[609, 585, 720, 697],[615, 700, 688, 743],[676, 834, 809, 993].') 2023-11-02 23:34:18.721 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:34:18.722 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'tropical fish [805, 252, 877, 328] [113, 365, 233, 536] [177, 259, 282, 445] [491, 315, 685, 569] [198, 472, 428, 729] [026, 385, 139, 560] over coral reef [000, 016, 997, 998] with a scuba diver [197, 102, 399, 232] - scuba diving stock pictures, royalty-free photos [599, 622, 794, 730] & images') 2023-11-02 23:34:52.307 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:34:52.307 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Spring Rolls,Person and Car in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Person:[215, 375, 263, 597];Car:[824, 399, 1000, 654].') 2023-11-02 23:34:58.330 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:34:58.330 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Guitar:[210, 445, 403, 783],[724, 471, 1000, 760];Cymbal:[002, 799, 283, 886],[333, 882, 461, 969],[392, 475, 472, 538];Drum:[610, 565, 712, 710],[452, 669, 595, 818],[676, 789, 763, 936];Hat:[647, 368, 725, 444];Person:[213, 208, 402, 863],[591, 369, 720, 628],[712, 271, 922, 732];Microphone:[254, 314, 299, 375];Speaker:[213, 239, 276, 343].') 2023-11-02 23:35:28.332 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:35:28.332 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Person:[219, 559, 296, 936],[316, 441, 456, 939],[469, 519, 610, 939],[438, 365, 566, 779].') 2023-11-02 23:35:49.963 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:35:49.963 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[779, 177, 999, 694],carFrame1:[702, 364, 767, 445] in this video clip.When answering, maintain the format classFrame t:[xmin,ymin,xmax,ymax] without deviation.', 'carFrame1[779, 177, 999, 694];Frame2:No object,carFrame1:[702, 364, 767, 445];Frame2:[729, 363, 807, 455].') 2023-11-02 23:35:58.193 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:35:58.193 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'man:[916, 729, 993, 936];girl:[007, 462, 187, 759],[284, 175, 339, 299],[525, 351, 609, 559],[531, 179, 611, 331],[713, 706, 758, 969],[763, 696, 831, 975],[926, 719, 995, 935];tree:[000, 019, 025, 093],[000, 330, 280, 652],[189, 777, 225, 861],[189, 000, 271, 116],[228, 760, 263, 857],[278, 023, 395, 124],[280, 007, 306, 099],[283, 297, 420, 504],[310, 383, 352, 498],[375, 338, 421, 480],[389, 000, 456, 128],[421, 303, 578, 471],[453, 000, 566, 136],[575, 330, 627, 402],[698, 580, 810, 680],[773, 048, 903, 161],[928, 652, 999, 680];clothing:[006, 593, 157, 762],[106, 475, 210, 659],[129, 138, 210, 274],[158, 527, 305, 737],[215, 839, 299, 934],[286, 101, 604, 327],[410, 454, 438, 591],[466, 706, 575, 983],[530, 425, 593, 548],[558, 391, 611, 552],[568, 630, 681, 983],[609, 352, 673, 546],[654, 093, 695, 224],[666, 354, 694, 479],[704, 067, 737, 224],[713, 626, 993, 967],[836, 085, 887, 280],[837, 339, 895, 491];human face:[048, 504, 107, 637],[134, 408, 190, 492],[155, 082, 185, 154],[202, 397, 269, 547].') 2023-11-02 23:36:25.627 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:36:25.627 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[412, 338, 450, 395],carFrame1:[656, 323, 822, 417],personFrame1:[771, 332, 819, 575],personFrame1:[836, 302, 886, 533],personFrame1:[897, 294, 936, 535],bicycleFrame1:[862, 488, 965, 681],carFrame1:[222, 345, 382, 445] in this video clip.Your response should conform strictly to the categoryFrame t:[xmin,ymin,xmax,ymax] format.', 'carFrame1:[412, 338, 450, 395];Frame2:[414, 337, 449, 395],carFrame1:[656, 323, 822, 417];Frame2:[656, 321, 814, 417],personFrame1:[771, 332, 819, 575];Frame2:[841, 318, 885, 565],personFrame1:[836, 302, 886, 533];Frame2:[808, 298, 851, 540],personFrame1:[897, 294, 936, 535];Frame2:[872, 290, 915, 529],bicycleFrame1:[862, 488, 965, 681];Frame2:[862, 476, 965, 682],carFrame1:[222, 345, 382, 445];Frame2:[283, 346, 402, 434].') 2023-11-02 23:36:26.588 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:36:26.588 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[002, 241, 095, 998],[014, 309, 158, 883],[002, 198, 120, 621],[502, 087, 622, 693],[538, 032, 782, 998],[738, 120, 826, 597],[780, 149, 850, 556],[897, 167, 1000, 684],[897, 325, 999, 806];Hockey Stick:[300, 574, 570, 939],[109, 502, 336, 729],[339, 437, 510, 745].') 2023-11-02 23:36:39.931 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:36:39.932 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'step 3.2 - enter your permanent residence address - filling out irs form [008, 007, 989, 975] 8233') 2023-11-02 23:37:06.356 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:37:06.356 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Person:[002, 236, 296, 749],[639, 104, 870, 999],[941, 463, 1000, 982];Piano:[631, 475, 908, 659];Cymbal:[337, 369, 483, 455];Drum:[215, 604, 301, 683];Microphone:[947, 201, 1000, 272];Speaker:[838, 003, 999, 152],[496, 289, 613, 577].') 2023-11-02 23:37:16.733 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:37:16.733 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Desk:[598, 748, 999, 999];Pillow:[316, 534, 561, 728],[113, 605, 360, 779],[000, 577, 152, 796];Couch:[000, 469, 729, 999].') 2023-11-02 23:37:28.416 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:37:28.416 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Lamp:[883, 005, 959, 204];Frame:[819, 301, 903, 390];Person:[885, 753, 998, 999],[779, 738, 984, 999],[683, 502, 792, 999],[604, 494, 702, 962],[519, 677, 653, 999],[425, 506, 552, 851],[354, 664, 553, 1000],[279, 462, 388, 990],[927, 300, 996, 387],[853, 168, 927, 297];Hat:[720, 496, 779, 570].') 2023-11-02 23:37:44.475 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:37:44.476 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[383, 090, 637, 332] raspberry mummy pies on [004, 007, 995, 998] a white surface with [173, 666, 345, 770] [853, 077, 996, 185] [392, 654, 607, 781] [002, 359, 174, 485] plastic spiders and bats') 2023-11-02 23:38:20.454 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:38:20.455 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Desk:[193, 626, 944, 996];Chair:[348, 508, 444, 700];Person:[108, 277, 167, 525],[001, 190, 102, 1000],[113, 138, 323, 936],[446, 149, 762, 807],[428, 270, 518, 653],[345, 430, 427, 600],[885, 274, 924, 543],[910, 125, 999, 536],[735, 278, 998, 996],[275, 303, 328, 522],[328, 386, 371, 478],[366, 386, 404, 444],[409, 386, 441, 472];Belt:[008, 610, 064, 645];Handbag:[075, 581, 192, 836],[447, 552, 578, 804],[848, 417, 893, 499];Glasses:[569, 219, 638, 268];Hat:[826, 373, 852, 410];Plate:[414, 807, 502, 918];Spoon:[432, 839, 494, 879];Basin:[732, 495, 790, 558],[735, 928, 813, 990].') 2023-11-02 23:38:32.531 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:38:32.531 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Police [134, 575, 262, 959] [720, 119, 996, 979] [389, 504, 597, 949] launch the raid on Ming Moon, and take away a man from inside') 2023-11-02 23:38:41.118 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:38:41.118 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'person:[430, 560, 681, 980],[082, 240, 353, 900].') 2023-11-02 23:39:00.328 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:39:00.328 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Foto de [176, 299, 524, 996] jack russell terrier, sitting on [200, 664, 456, 992] a toilet seat with digestion problems or constipation looking very sad and toilet paper rolls everywhere - Imagen libre de derechos') 2023-11-02 23:40:21.511 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:40:21.511 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[443, 461, 507, 675] and track its trajectory.For clarity, represent trajectories using the Frame t:[xmin,ymin,xmax,ymax] format in your response.', 'This is a mouflonFrame1:[443, 461, 507, 675];Frame2:[454, 456, 514, 675];Frame3:[453, 447, 513, 677];Frame4:[447, 423, 507, 653];Frame5:[442, 403, 508, 638].') 2023-11-02 23:40:29.952 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-02 23:40:29.953 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Flag:[228, 360, 248, 406],[169, 348, 184, 381],[335, 391, 355, 466],[373, 404, 402, 445];Boat:[429, 397, 511, 505],[512, 475, 564, 550],[681, 432, 766, 553];Machinery Vehicle:[642, 285, 699, 504],[571, 350, 627, 502].') 2023-11-02 23:40:34.071 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:40:34.072 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Street Lights:[903, 254, 967, 883],[774, 663, 795, 887];Pickup Truck:[386, 895, 527, 1000];Traffic cone:[481, 991, 489, 1000],[552, 965, 565, 999],[693, 959, 700, 993],[777, 961, 783, 989],[894, 958, 902, 982],[292, 958, 300, 982];Person:[663, 915, 685, 992],[789, 901, 821, 986],[907, 891, 929, 966];Clock:[460, 495, 500, 561],[505, 495, 538, 562];Hat:[914, 891, 925, 904].') 2023-11-02 23:40:38.646 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:40:38.646 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'dress:[307, 353, 616, 962],[518, 505, 613, 784],[581, 447, 723, 921];woman:[309, 180, 623, 974],[000, 308, 120, 999],[087, 245, 190, 999],[520, 375, 620, 811],[571, 303, 738, 926],[626, 661, 852, 999],[678, 311, 766, 666],[791, 240, 968, 999],[935, 305, 999, 999];man:[142, 179, 368, 977];human face:[030, 328, 075, 436],[128, 264, 184, 363],[266, 233, 315, 366],[310, 308, 346, 381],[428, 235, 488, 345],[562, 418, 612, 484],[630, 341, 686, 438],[691, 352, 720, 439],[853, 272, 903, 376],[961, 397, 999, 474].') 2023-11-02 23:41:04.049 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:41:04.049 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[934, 646, 982, 911],carFrame1:[235, 683, 270, 743],carFrame1:[858, 660, 999, 775] in this video clip.Use only the categoryFrame t:[xmin,ymin,xmax,ymax] format to structure your answer.', 'personFrame1:[934, 646, 982, 911];Frame2:[845, 643, 914, 903],carFrame1:[235, 683, 270, 743];Frame2:[235, 685, 272, 743],carFrame1:[858, 660, 999, 775];Frame2:[810, 657, 1000, 775].') 2023-11-02 23:41:11.315 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:41:11.316 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Horse:[121, 491, 167, 646];Cow:[718, 571, 889, 890],[664, 574, 736, 769],[645, 549, 678, 588],[625, 569, 678, 778],[568, 636, 664, 833],[561, 575, 623, 768],[602, 553, 640, 614],[574, 551, 635, 630],[516, 559, 574, 655],[506, 584, 564, 661],[332, 603, 364, 646],[285, 616, 349, 716],[245, 595, 305, 784],[255, 576, 318, 623];Chair:[062, 796, 704, 1000];Person:[125, 451, 183, 590],[268, 634, 503, 852],[785, 512, 883, 753],[732, 739, 999, 999],[939, 680, 967, 756].') 2023-11-02 23:41:16.623 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:41:16.624 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[177, 342, 620, 943] a Chinese man relaxing and easy-to-do Qigong exercise') 2023-11-02 23:41:17.654 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:41:17.654 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[650, 320, 909, 894] and track its trajectory.If you're including trajectory details in your reply, the Frame t:[xmin,ymin,xmax,ymax] format is imperative.", 'This is a whitetail prairie dogFrame1:[650, 320, 909, 894];Frame2:[634, 398, 909, 875];Frame3:[562, 431, 905, 841];Frame4:[543, 426, 893, 836];Frame5:[535, 427, 863, 847].') 2023-11-02 23:41:32.880 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-02 23:41:32.881 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'woman:[002, 256, 433, 1000],[542, 516, 999, 999],[657, 260, 999, 999];glasses:[545, 250, 710, 293];man:[000, 259, 315, 854],[264, 206, 829, 999].') 2023-11-02 23:42:02.715 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:42:02.715 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect table,computer monitor,desk,guitar,person and facial tissue holder in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'table:[000, 815, 439, 999],[425, 542, 998, 985];computer monitor:[618, 392, 904, 597];desk:[476, 539, 998, 825];guitar:[094, 250, 578, 703],[436, 478, 627, 903];person:[005, 125, 511, 889].') 2023-11-02 23:42:08.727 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:42:08.728 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Refrigerator,Chair,Cabinet and Desk in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Refrigerator:[000, 381, 120, 832];Chair:[485, 565, 537, 645],[546, 564, 599, 657],[630, 653, 821, 1000],[654, 679, 821, 999];Cabinet:[000, 243, 177, 474],[113, 327, 251, 771];Desk:[294, 568, 470, 790].') 2023-11-02 23:42:14.358 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:42:14.358 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[729, 473, 854, 877] [001, 471, 073, 893] [119, 448, 268, 896] [454, 464, 550, 948] [356, 473, 457, 885] [896, 435, 998, 886] [255, 463, 352, 899] [655, 476, 755, 882] Police personnel stand guard during a farmers protest against the passage of two controversial farm bills by the country’s parliament in Bangalore, India.') 2023-11-02 23:42:53.439 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-02 23:42:53.439 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Desk:[271, 472, 541, 887];Chair:[523, 314, 972, 971],[125, 224, 465, 639];Vase:[355, 105, 406, 191];Flower:[087, 003, 215, 125];Dog:[192, 681, 567, 1000],[031, 455, 231, 762];Cup:[451, 492, 501, 579],[403, 429, 449, 506];Person:[204, 115, 413, 712];Book:[233, 391, 352, 468];Sandals:[317, 656, 361, 714].') 2023-11-02 23:43:07.324 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:43:07.325 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the person sits on bicycle and moves, then stops and gets off in this video clip?When detailing trajectories in your response, adhere to the TrackiFrame t:[xmin,ymin,xmax,ymax] format.', 'There is one person sits on bicycle and moves, then stops and gets off.Track1frame:1:[000, 286, 003, 350];frame:2:[000, 273, 003, 347];frame:3:[000, 269, 002, 365].') 2023-11-02 23:43:19.528 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:43:19.528 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'flower:[826, 698, 877, 783].') 2023-11-02 23:43:20.177 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:43:20.177 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect man,table,footwear,woman and ski in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'man:[471, 253, 570, 689],[605, 263, 716, 648];table:[129, 543, 533, 944];footwear:[220, 820, 283, 867];woman:[158, 316, 283, 866],[259, 313, 402, 790],[386, 288, 476, 546],[651, 498, 840, 977],[677, 291, 819, 600].') 2023-11-02 23:43:27.973 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:43:27.974 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[435, 177, 675, 756] and track its trajectory.To maintain consistency, ensure that trajectories in your answer match the Frame t:[xmin,ymin,xmax,ymax] setup.', 'This is a pheasantFrame1:[435, 177, 675, 756];Frame2:[464, 116, 685, 759];Frame3:[425, 108, 660, 725];Frame4:[435, 090, 670, 709].') 2023-11-02 23:43:41.387 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:43:41.388 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '"Were getting on like a storm, arent we Clive", [002, 002, 957, 993] Barry Gibb says sarcastically, adding: "In fact I might just leave." Pictured on the Clive Anderson show in 1997') 2023-11-02 23:43:46.894 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:43:46.895 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[331, 077, 477, 355] and track its trajectory.For the trajectories included in the answer, please use the format Frame t:[xmin,ymin,xmax,ymax].', 'This is a eyeFrame1:[331, 077, 477, 355];Frame2:[321, 076, 452, 362];Frame3:[293, 082, 420, 362];Frame4:[342, 079, 469, 354];Frame5:[376, 079, 503, 355].') 2023-11-02 23:44:58.051 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:44:58.051 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'human hair:[773, 497, 994, 962],[260, 319, 322, 391],[280, 377, 341, 506],[429, 422, 601, 855],[610, 345, 699, 456],[773, 330, 846, 425],[854, 360, 945, 501];human head:[458, 415, 608, 635],[064, 308, 207, 495],[263, 317, 316, 391],[281, 356, 341, 467],[362, 330, 454, 455],[462, 360, 515, 435],[546, 229, 588, 296],[552, 374, 611, 460],[608, 348, 702, 482],[716, 107, 769, 185],[725, 369, 781, 445],[773, 327, 848, 424],[786, 469, 999, 968],[839, 356, 949, 495];mammal:[363, 427, 627, 943],[000, 212, 999, 995],[699, 101, 803, 395];man:[708, 094, 819, 378],[000, 309, 401, 827],[027, 346, 084, 455],[235, 314, 358, 506],[307, 325, 485, 677],[359, 220, 458, 377],[495, 225, 611, 414],[581, 351, 706, 506],[728, 324, 856, 606],[754, 356, 954, 666],[786, 458, 999, 999];clothing:[715, 117, 810, 339],[000, 478, 392, 812],[226, 392, 356, 515],[305, 445, 473, 681],[356, 681, 693, 941],[357, 265, 458, 388],[493, 275, 608, 414],[592, 454, 676, 504],[690, 426, 755, 490],[729, 420, 848, 615];human arm:[298, 540, 398, 760],[406, 282, 448, 355],[438, 408, 467, 478],[512, 292, 597, 366],[691, 182, 755, 305],[748, 516, 796, 670].') 2023-11-02 23:45:46.662 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:45:46.662 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect poster and human face in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'poster:[000, 000, 998, 998];human face:[860, 042, 998, 404].') 2023-11-02 23:46:01.193 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:46:01.193 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the bird moving around and pecking ground?Use the specified TrackiFrame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'There are 11 bird moving around and pecking ground.Track1frame:1:[881, 046, 975, 139];frame:2:[958, 082, 999, 151],Track2frame:1:[647, 099, 666, 134];frame:2:[647, 109, 670, 133];frame:3:[648, 101, 663, 131],Track3frame:1:[486, 276, 732, 706];frame:2:[543, 216, 827, 702];frame:3:[511, 258, 750, 614],Track4frame:1:[084, 203, 547, 672];frame:2:[083, 195, 504, 672];frame:3:[132, 193, 431, 673],Track5frame:1:[053, 447, 575, 998];frame:2:[055, 402, 578, 998];frame:3:[061, 336, 580, 998],Track6frame:1:[000, 804, 092, 998];frame:2:[000, 905, 056, 998],Track7frame:1:[468, 236, 586, 409];frame:2:[468, 235, 567, 410];frame:3:[497, 236, 584, 410],Track8frame:1:[227, 056, 340, 225];frame:2:[237, 052, 335, 223];frame:3:[226, 051, 332, 236],Track9frame:1:[609, 281, 682, 377];frame:2:[600, 251, 715, 383];frame:3:[390, 135, 538, 392],Track10frame:1:[642, 206, 999, 928];frame:2:[581, 391, 999, 927];frame:3:[608, 345, 999, 932],Track11frame:1:[451, 084, 475, 175];frame:2:[450, 086, 504, 178];frame:3:[455, 081, 494, 177].') 2023-11-02 23:46:05.933 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-02 23:46:05.934 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Refrigerator:[822, 349, 999, 1000];Extractor:[337, 487, 483, 625];Hat:[037, 003, 129, 182];shelf:[194, 011, 351, 527],[259, 115, 400, 572],[386, 209, 482, 675],[474, 332, 570, 657],[847, 694, 1000, 1000],[064, 710, 222, 1000],[214, 765, 281, 1000],[269, 810, 441, 1000],[206, 568, 367, 795];Person:[454, 487, 642, 999].') 2023-11-02 23:46:07.778 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:46:07.778 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[157, 112, 269, 257],carFrame1:[587, 097, 622, 153],carFrame1:[688, 130, 766, 251],carFrame1:[314, 107, 367, 206],carFrame1:[723, 122, 846, 297],carFrame1:[259, 110, 332, 224],carFrame1:[802, 175, 904, 373] in this video clip.When answering, maintain the format classFrame t:[xmin,ymin,xmax,ymax] without deviation.', 'carFrame1:[157, 112, 269, 257];Frame2:[000, 100, 124, 345],carFrame1:[587, 097, 622, 153];Frame2:[593, 095, 635, 160],carFrame1:[688, 130, 766, 251];Frame2:[748, 151, 870, 328],carFrame1:[314, 107, 367, 206];Frame2:[228, 104, 308, 222],carFrame1:[723, 122, 846, 297];Frame2:[855, 231, 902, 376],carFrame1:[259, 110, 332, 224];Frame2:[115, 106, 253, 260],carFrame1[802, 175, 904, 373];Frame2:No object.') 2023-11-02 23:46:15.121 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:46:15.121 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'man:[487, 160, 904, 978],[118, 000, 637, 999];human face:[487, 052, 616, 417],[699, 262, 808, 453];glasses:[443, 165, 615, 267],[695, 335, 816, 379].') 2023-11-02 23:46:28.899 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:46:28.900 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[355, 403, 414, 511] and track its trajectory.To maintain consistency, ensure that trajectories in your answer match the Frame t:[xmin,ymin,xmax,ymax] setup.', 'This is a border terrierFrame1:[355, 403, 414, 511];Frame2:[358, 417, 420, 522];Frame3:[457, 451, 492, 495];Frame4:[519, 457, 602, 562];Frame5:[428, 534, 468, 637].') 2023-11-02 23:46:41.494 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:46:41.495 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Stool:[119, 645, 228, 713],[065, 626, 177, 684],[003, 669, 060, 749];Desk:[003, 602, 099, 708];Person:[266, 339, 441, 728],[195, 359, 425, 752];Cup:[003, 571, 049, 607],[644, 328, 700, 376];Trolley:[442, 209, 917, 797];Handbag:[464, 332, 568, 410].') 2023-11-02 23:47:33.530 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:47:33.530 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'interior shots of princess diana [586, 216, 741, 994], princess of wales, looking at boards [212, 012, 831, 838] and being shown round the red cross headquarters during royal tour on 8... - japan stock videos & royalty-free footage [308, 607, 427, 875]') 2023-11-02 23:47:34.865 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:47:34.865 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Desk:[049, 563, 183, 674],[002, 646, 403, 1000],[493, 781, 1000, 1000],[172, 584, 598, 930],[546, 558, 741, 787],[570, 512, 801, 713],[734, 658, 999, 817],[876, 574, 999, 665];Handbag:[414, 555, 526, 650];Person:[761, 543, 945, 839],[090, 438, 242, 641],[002, 430, 158, 657];Whiteboard:[425, 302, 627, 446].') 2023-11-02 23:47:35.621 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:47:35.621 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Mirror:[400, 026, 746, 773],[884, 200, 924, 615];Cabinet:[528, 378, 638, 571],[929, 514, 998, 650],[217, 504, 355, 626];Hanger:[939, 353, 996, 377],[939, 413, 996, 437];Picture:[964, 280, 1000, 352];Lamp:[012, 167, 336, 213],[474, 165, 693, 310],[264, 039, 356, 121];Person:[401, 123, 608, 880],[507, 167, 653, 751];High Heels:[452, 837, 543, 880].') 2023-11-02 23:47:38.757 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:47:38.757 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[564, 475, 665, 616] and track its trajectory.Ensure that each trajectory within your response is articulated using the Frame t:[xmin,ymin,xmax,ymax] specification.', 'This is a cockroachFrame1:[564, 475, 665, 616];Frame2:[573, 500, 687, 633];Frame3:[556, 502, 659, 612];Frame4:[528, 558, 637, 695];Frame5:[537, 558, 651, 695].') 2023-11-02 23:48:33.583 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:48:33.583 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Sector Optics F1 rangefinder [009, 018, 876, 908] has a clean, ultra-light, timeless shape flashlight that sits naturally in the palm of your hand.') 2023-11-02 23:48:46.288 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:48:46.288 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'human body:[601, 065, 814, 664],[000, 665, 252, 999],[005, 287, 293, 637],[119, 590, 520, 999],[264, 257, 540, 628],[418, 572, 679, 920],[669, 535, 999, 936],[673, 626, 995, 999],[750, 266, 999, 618];human hair:[026, 676, 251, 1000],[097, 280, 163, 370],[202, 626, 301, 884],[291, 588, 441, 844],[376, 258, 446, 374],[443, 763, 557, 911],[476, 815, 683, 999],[495, 578, 642, 836],[697, 619, 884, 886],[721, 066, 735, 158],[781, 542, 906, 728],[799, 264, 872, 347];man:[018, 278, 256, 673],[122, 585, 486, 999],[265, 255, 538, 637],[558, 046, 808, 632],[693, 621, 999, 999],[778, 551, 993, 920],[790, 260, 964, 614];clothing:[585, 119, 816, 651],[000, 874, 039, 949],[012, 393, 275, 647],[023, 943, 112, 999],[123, 835, 491, 999],[266, 372, 535, 625],[426, 816, 672, 927],[677, 881, 985, 999],[690, 712, 999, 953],[747, 391, 954, 614],[943, 418, 999, 616];woman:[433, 579, 672, 916],[480, 822, 678, 999];human face:[108, 303, 170, 426],[390, 272, 450, 396],[666, 088, 725, 208],[811, 288, 875, 399];human arm:[016, 438, 191, 650],[166, 426, 265, 650],[236, 820, 308, 934],[265, 397, 343, 646],[554, 215, 671, 412],[710, 232, 816, 459],[765, 409, 862, 635],[861, 426, 957, 638].') 2023-11-02 23:48:54.756 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:48:54.756 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[532, 004, 995, 998] A young boy plays with [126, 318, 579, 712] a sparkler before the firework display at Hopkins Park on July 4th.') 2023-11-02 23:48:55.231 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:48:55.232 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Person:[089, 452, 241, 694],[218, 349, 431, 873],[381, 507, 461, 634],[456, 335, 677, 632],[667, 324, 780, 605],[736, 379, 836, 565],[831, 362, 985, 576];Air Conditioner:[018, 343, 068, 386],[137, 314, 190, 363],[790, 273, 856, 328],[377, 479, 436, 515],[219, 474, 267, 515],[073, 175, 120, 229].') 2023-11-02 23:48:57.007 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:48:57.008 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[707, 175, 783, 636] and track its trajectory.For clarity, represent trajectories using the Frame t:[xmin,ymin,xmax,ymax] format in your response.', 'This is a personFrame1:[707, 175, 783, 636];Frame2:[715, 197, 792, 655];Frame3:[726, 178, 806, 650];Frame4:[740, 181, 815, 656];Frame5:[732, 175, 815, 656].') 2023-11-02 23:49:09.245 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:49:09.245 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Desk:[003, 003, 993, 807];Person:[114, 693, 569, 999],[385, 502, 489, 586],[466, 514, 534, 589],[529, 510, 604, 582],[593, 471, 684, 572],[244, 001, 787, 080];Cup:[152, 514, 276, 604],[076, 476, 192, 548],[213, 044, 321, 166];Cell Phone:[712, 723, 1000, 978];Laptop:[102, 055, 994, 681].') 2023-11-02 23:49:15.863 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-02 23:49:15.863 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'shelf:[000, 001, 276, 311];Desk:[000, 531, 170, 948],[510, 755, 701, 1000];Power outlet:[507, 272, 616, 311],[657, 275, 792, 320];Chair:[120, 846, 215, 999];Storage box:[057, 764, 214, 1000];Extention Cord:[507, 270, 616, 308],[654, 273, 795, 323];Gloves:[625, 667, 783, 784],[402, 858, 507, 986];Person:[132, 128, 785, 998].') 2023-11-02 23:49:23.440 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-02 23:49:23.441 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Tie:[738, 643, 763, 759];Satchel:[574, 660, 607, 761],[551, 629, 585, 770];Street Lights:[156, 393, 201, 666];Person:[692, 577, 793, 994],[556, 578, 649, 1000],[429, 574, 563, 1000],[417, 583, 477, 861],[399, 603, 432, 748],[359, 585, 398, 859],[290, 571, 414, 999],[283, 597, 307, 693],[247, 595, 274, 703],[211, 594, 238, 698],[188, 592, 208, 695].') 2023-11-02 23:49:24.787 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:49:24.787 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'person:[071, 338, 246, 648],[000, 763, 420, 998],[295, 425, 528, 650];human body:[086, 345, 250, 653],[303, 422, 536, 640];mammal:[076, 350, 251, 641],[000, 773, 420, 998],[296, 417, 531, 646];human arm:[053, 857, 445, 998],[065, 401, 121, 506],[211, 410, 241, 492],[503, 472, 538, 586];human hand:[180, 837, 426, 987];ball:[831, 566, 930, 632],[890, 583, 991, 653],[926, 603, 998, 675];human leg:[000, 901, 235, 998],[111, 492, 198, 646],[175, 488, 246, 646],[288, 506, 483, 651],[398, 501, 483, 646];bowling equipment:[588, 513, 676, 540],[693, 513, 760, 543],[765, 491, 998, 815].') 2023-11-02 23:49:42.225 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:49:42.226 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Demonstrators [432, 033, 995, 995] light candles [133, 363, 437, 702] during a protest about the death of nine people during a police raid at a Funk Party at Paraisopolis slum last weekend....') 2023-11-02 23:49:48.035 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-02 23:49:48.037 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Boat:[541, 659, 678, 758],[776, 664, 927, 756],[595, 605, 758, 681],[598, 571, 716, 621],[685, 454, 806, 590],[506, 601, 589, 649],[472, 582, 584, 637],[468, 572, 543, 601],[466, 533, 528, 563],[235, 483, 311, 521],[291, 491, 337, 516],[242, 469, 350, 508],[296, 466, 379, 504];Person:[131, 632, 147, 668].') 2023-11-02 23:49:52.057 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-02 23:49:52.057 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Bench:[678, 628, 829, 727];Person:[639, 518, 701, 748],[585, 526, 639, 746],[495, 511, 558, 734];Trash bin Can:[386, 611, 433, 702].') 2023-11-02 23:50:12.915 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-02 23:50:12.915 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Paddle in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Paddle:[222, 454, 302, 490].') 2023-11-02 23:50:22.581 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:50:22.582 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Guitar:[004, 002, 762, 886];Stool:[190, 751, 722, 998];Desk:[608, 278, 1000, 1000];Person:[624, 226, 999, 616].') 2023-11-02 23:50:56.294 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:50:56.295 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[493, 241, 519, 292] and track its trajectory.Ensure the trajectories in your answer follow the Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a faceFrame1:[493, 241, 519, 292];Frame2:[489, 243, 514, 294];Frame3:[489, 239, 515, 290];Frame4:[485, 232, 512, 289];Frame5:[490, 237, 514, 288].') 2023-11-02 23:52:05.018 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:52:05.019 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Van:[009, 431, 604, 762],[729, 431, 999, 676];Backpack:[245, 584, 451, 781];Satchel:[158, 703, 248, 828];Luggage:[017, 794, 160, 999];Person:[198, 496, 422, 1000],[467, 608, 625, 785];Tent:[683, 479, 801, 547],[543, 483, 702, 548].') 2023-11-02 23:52:31.247 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:52:31.247 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'woman:[435, 364, 541, 867],[183, 419, 212, 590],[531, 382, 639, 850],[660, 402, 796, 972];man:[076, 330, 203, 909],[208, 356, 315, 852],[243, 270, 300, 366],[293, 297, 429, 956],[376, 268, 416, 365],[436, 354, 538, 783];footwear:[325, 873, 386, 970],[559, 741, 581, 808];jeans:[092, 564, 189, 858],[231, 583, 303, 834],[311, 568, 426, 911],[691, 663, 780, 951];human leg:[103, 554, 138, 876],[148, 541, 185, 857],[235, 574, 288, 855],[280, 574, 301, 751],[312, 574, 373, 958],[370, 545, 433, 899],[438, 602, 491, 895],[491, 610, 563, 899],[563, 572, 609, 855],[563, 581, 626, 809],[677, 665, 730, 973],[733, 652, 780, 973];human hair:[326, 295, 365, 366],[458, 358, 505, 435],[561, 380, 605, 453],[672, 394, 743, 500];human arm:[063, 426, 108, 561],[183, 424, 216, 534],[204, 440, 243, 652],[293, 428, 350, 690],[391, 409, 430, 648],[440, 453, 483, 687],[517, 442, 544, 648],[531, 453, 605, 566],[601, 453, 636, 559],[653, 493, 708, 745],[749, 497, 799, 704].') 2023-11-02 23:52:39.590 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:52:39.591 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'person:[057, 106, 404, 496],[092, 534, 348, 909],[500, 067, 871, 496],[512, 514, 879, 939];clothing:[132, 375, 317, 496],[157, 768, 379, 921],[510, 429, 778, 501],[539, 875, 856, 950],[846, 667, 932, 934];human face:[056, 153, 131, 401],[309, 596, 354, 751],[515, 601, 739, 923],[664, 164, 867, 446],[860, 515, 935, 625].') 2023-11-02 23:52:45.179 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:52:45.179 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[130, 249, 546, 694] and track its trajectory.For the trajectories included in the answer, please use the format Frame t:[xmin,ymin,xmax,ymax].', "This is a przewalski's horseFrame1:[130, 249, 546, 694];Frame2:[194, 275, 647, 747];Frame3:[201, 257, 645, 749];Frame4:[196, 275, 630, 787].") 2023-11-02 23:53:05.602 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:53:05.602 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[378, 555, 613, 997] [608, 016, 997, 996] [001, 220, 383, 995] People holding [594, 548, 673, 698] mobile phones are silhouetted against [008, 004, 990, 989] a backdrop projected with [397, 186, 587, 499] the Twitter logo in this illustration picture taken September 27, 2013') 2023-11-02 23:53:36.470 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:53:36.471 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'boy:[714, 568, 843, 953],[032, 216, 121, 450],[045, 101, 137, 313],[068, 529, 226, 902],[106, 210, 205, 446],[124, 085, 224, 327],[201, 101, 295, 363],[201, 531, 333, 882],[218, 205, 302, 491],[295, 093, 379, 333],[297, 205, 386, 458],[319, 492, 430, 880],[379, 102, 440, 314],[425, 537, 523, 743],[447, 094, 519, 307],[502, 539, 621, 896],[597, 560, 713, 909];girl:[010, 385, 122, 832],[093, 356, 186, 700],[178, 368, 281, 778],[283, 349, 382, 631],[380, 370, 471, 634],[458, 368, 546, 616],[467, 219, 558, 460],[501, 382, 613, 665],[525, 105, 582, 312],[545, 221, 623, 476],[587, 103, 651, 278],[611, 227, 703, 489],[614, 392, 715, 674],[649, 110, 723, 287],[670, 231, 749, 454],[685, 363, 801, 672],[728, 110, 814, 334],[758, 240, 821, 443],[790, 401, 863, 743];clothing:[016, 160, 941, 935].') 2023-11-02 23:54:07.638 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:54:07.638 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'woman:[013, 302, 248, 840],[330, 391, 523, 818],[494, 304, 613, 502],[585, 286, 629, 414],[603, 560, 963, 999],[820, 318, 858, 410];food:[113, 835, 238, 932];table:[000, 698, 503, 999],[461, 487, 651, 737];man:[238, 278, 474, 708],[419, 305, 786, 999],[493, 306, 612, 502],[603, 548, 963, 999],[840, 310, 925, 590];girl:[335, 401, 514, 836];clothing:[008, 463, 258, 719],[259, 363, 371, 721],[338, 571, 520, 815],[503, 377, 626, 505],[550, 525, 767, 999],[578, 322, 634, 415],[701, 849, 968, 999],[750, 392, 867, 622],[798, 378, 873, 500],[841, 355, 924, 542];human face:[075, 363, 146, 505],[352, 329, 397, 428],[353, 411, 426, 573],[521, 333, 557, 408],[625, 331, 730, 569],[661, 622, 761, 934].') 2023-11-02 23:54:11.365 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:54:11.365 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Person:[359, 379, 425, 621],[431, 377, 495, 613],[539, 382, 604, 622];Backpack:[551, 426, 583, 488].') 2023-11-02 23:55:11.109 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:55:11.110 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the use its nose to help baby elephant come out from the water then walk away in this video clip?Maintain consistency in your answer by using the TrackiFrame t:[xmin,ymin,xmax,ymax] format for trajectories.', 'There is one use its nose to help baby elephant come out from the water then walk away.Track1frame:1:[277, 200, 651, 603];frame:2:[284, 190, 631, 600];frame:3:[297, 186, 617, 589].') 2023-11-02 23:55:17.645 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:55:17.645 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[238, 453, 621, 996] Bride and [577, 440, 818, 997] groom strolling along [000, 508, 998, 996] the Adriatic sea at a Croatia destination wedding by destination wedding planner Mango Muse Events') 2023-11-02 23:55:30.408 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:55:30.409 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[370, 190, 412, 344] [199, 173, 254, 349] [618, 511, 713, 832] [440, 213, 487, 383] [832, 632, 929, 925] [049, 144, 125, 308] [213, 614, 280, 859] Youngsters fly [541, 468, 591, 622] [137, 662, 210, 795] [019, 465, 082, 542] kites on a ground in Rawalpindi on Friday despite a ban imposed by the city administration. — APP') 2023-11-02 23:56:37.392 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:56:37.392 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'human body:[573, 450, 632, 703],[585, 094, 641, 265],[872, 502, 933, 758];human leg:[150, 348, 298, 737],[211, 505, 275, 699],[406, 512, 445, 873],[411, 291, 457, 844],[573, 619, 591, 708],[711, 584, 738, 674],[870, 653, 882, 738],[890, 644, 908, 756],[895, 657, 916, 758];mammal:[568, 000, 903, 696],[880, 496, 928, 766];girl:[866, 500, 935, 758];clothing:[555, 486, 640, 687].') 2023-11-02 23:57:15.786 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:57:15.786 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'person:[014, 197, 097, 568],[083, 176, 133, 471],[116, 167, 180, 530],[184, 187, 244, 573],[219, 172, 288, 560],[292, 183, 324, 309],[294, 208, 373, 635],[362, 178, 446, 611],[486, 196, 582, 690],[603, 210, 763, 934].') 2023-11-02 23:57:29.057 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:57:29.057 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[859, 522, 937, 898] [957, 527, 997, 899] [132, 514, 220, 914] [218, 491, 318, 909] [036, 486, 139, 919] [320, 512, 413, 907] [750, 520, 842, 897] [578, 511, 668, 897] [668, 510, 756, 897] French soldiers from the 17th parachute regiment participate in a ceremony in June in Montauban, France.') 2023-11-02 23:57:38.323 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:57:38.324 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[525, 595, 609, 706] and track its trajectory.For clarity, represent trajectories using the Frame t:[xmin,ymin,xmax,ymax] format in your response.', 'This is a eastern grey squirrelFrame1:[525, 595, 609, 706];Frame2:[415, 439, 540, 656];Frame3:[453, 439, 551, 639];Frame4:[523, 368, 653, 591].') 2023-11-02 23:58:15.300 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:58:15.300 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Central American migrants [368, 351, 541, 519] [601, 238, 734, 382] sleep atop a moving freight train [126, 008, 996, 949] as it heads north from Arriaga toward Chahuites, Mexico.') 2023-11-02 23:58:37.642 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:58:37.643 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'State selector popup [290, 324, 706, 668] for first time visitor which can be switched of from plugin settings') 2023-11-02 23:58:46.183 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:58:46.184 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[270, 233, 752, 726];Helmet:[270, 267, 373, 435];Boots:[640, 301, 750, 442];Motorcycle:[355, 237, 899, 725].') 2023-11-02 23:58:58.697 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:58:58.698 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Moon phases vector cartoon flat lunar cycle icons isolated on [009, 009, 993, 996] a starry sky') 2023-11-02 23:59:05.739 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-02 23:59:05.739 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Handbag,Slippers and Street Lights in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Handbag:[336, 447, 511, 558];Slippers:[489, 929, 601, 968];Street Lights:[858, 001, 966, 322].') 2023-11-02 23:59:16.112 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:59:16.113 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person and Bow Tie in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Person:[263, 124, 915, 1000],[395, 311, 587, 827],[002, 247, 271, 833];Bow Tie:[521, 391, 582, 443].') 2023-11-02 23:59:20.483 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-02 23:59:20.483 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Lamp:[252, 018, 407, 308],[451, 327, 490, 442];Person:[597, 418, 995, 1000],[649, 388, 957, 946],[924, 454, 997, 543],[191, 589, 241, 654],[326, 597, 377, 633],[597, 418, 995, 1000],[649, 388, 957, 946],[924, 454, 997, 543],[191, 589, 241, 654],[326, 597, 377, 633],[024, 777, 111, 880],[046, 822, 120, 884],[003, 844, 094, 996],[018, 767, 343, 998],[175, 655, 435, 998],[350, 761, 484, 878],[417, 794, 563, 995],[465, 747, 730, 998],[633, 742, 692, 823];Car:[128, 645, 427, 762],[283, 625, 508, 704],[483, 583, 628, 662];Machinery Vehicle:[137, 295, 437, 689];Truck:[125, 508, 344, 659].') 2023-11-03 00:00:08.420 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:00:08.421 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect necklace,clothing,human face,tree and man in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'clothing:[872, 406, 987, 712],[000, 379, 895, 999];human face:[000, 332, 036, 399],[076, 354, 120, 418],[128, 333, 166, 399],[215, 330, 257, 414],[293, 352, 341, 416],[553, 381, 599, 460],[604, 370, 639, 449],[658, 370, 703, 447],[717, 383, 758, 471],[773, 361, 809, 438],[915, 393, 955, 487];tree:[001, 068, 091, 252];man:[138, 291, 288, 938],[000, 289, 085, 999],[052, 324, 161, 968],[116, 306, 183, 947],[283, 328, 371, 968],[358, 348, 457, 997],[475, 389, 568, 621],[532, 368, 613, 614],[560, 352, 720, 714],[618, 350, 858, 726],[770, 320, 893, 574],[871, 337, 999, 698].') 2023-11-03 00:00:16.226 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:00:16.226 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[461, 000, 923, 540] and track its trajectory.When detailing trajectories in your response, adhere to the Frame t:[xmin,ymin,xmax,ymax] format.', 'This is a manateeFrame1:[461, 000, 923, 540];Frame2:[396, 022, 915, 664];Frame3:[356, 039, 923, 766];Frame4:[323, 084, 936, 841];Frame5:[325, 087, 944, 825].') 2023-11-03 00:00:25.175 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:00:25.175 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Chair:[108, 687, 261, 1000],[018, 743, 052, 898],[142, 619, 226, 677],[443, 517, 528, 665],[510, 525, 590, 675],[602, 537, 682, 678],[667, 605, 794, 859],[725, 583, 842, 809],[876, 572, 971, 814],[855, 636, 999, 824];Potted Plant:[004, 448, 084, 635],[830, 547, 999, 886];Plate:[544, 373, 571, 420];Desk:[061, 670, 317, 924],[773, 628, 951, 853].') 2023-11-03 00:00:44.687 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:00:44.688 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[317, 323, 586, 755] and track its trajectory.Ensure the trajectories in your answer follow the Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a grizzlyFrame1:[317, 323, 586, 755];Frame2:[313, 231, 538, 765];Frame3:[426, 321, 610, 844];Frame4:[386, 238, 608, 819].') 2023-11-03 00:00:59.895 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:00:59.896 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Towel:[237, 602, 586, 912],[895, 542, 1000, 611],[001, 506, 043, 565],[001, 395, 108, 445],[196, 311, 287, 365],[712, 302, 821, 341],[956, 433, 997, 462];Chair:[336, 056, 427, 100],[507, 054, 592, 084],[686, 059, 796, 244],[590, 060, 759, 244],[454, 081, 586, 235],[262, 077, 437, 253],[189, 069, 322, 248],[225, 062, 312, 229],[806, 094, 946, 316],[830, 074, 930, 209],[909, 100, 999, 346],[001, 177, 243, 355],[699, 174, 972, 337];Flower:[266, 216, 727, 415];Vase:[373, 377, 624, 501];Frame:[720, 317, 923, 506];Knife:[739, 715, 872, 929],[089, 642, 774, 940],[079, 345, 158, 386];Spoon:[800, 744, 940, 914],[125, 340, 290, 377],[056, 350, 114, 381];Fork:[001, 697, 158, 892],[925, 403, 1000, 469],[041, 592, 095, 621],[023, 542, 118, 608],[927, 672, 1000, 733];Cup:[643, 450, 817, 673],[317, 472, 408, 558],[060, 447, 225, 653],[276, 453, 367, 523],[831, 283, 942, 362],[228, 393, 302, 455],[112, 283, 222, 417];Plate:[001, 529, 057, 645],[908, 545, 1000, 706],[001, 388, 069, 449].') 2023-11-03 00:01:04.582 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:01:04.582 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Person:[014, 307, 298, 720],[365, 002, 973, 960],[355, 480, 502, 728],[528, 482, 667, 732],[730, 381, 926, 740],[909, 401, 1000, 733];Leather Shoes:[444, 803, 552, 958];Hat:[806, 382, 865, 417].') 2023-11-03 00:01:27.901 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:01:27.902 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'man:[174, 360, 466, 958],[051, 412, 266, 871];clothing:[132, 357, 491, 905],[055, 465, 204, 842],[150, 412, 245, 443];tree:[043, 000, 998, 550];flower:[380, 572, 556, 753].') 2023-11-03 00:01:30.502 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:01:30.502 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[875, 213, 944, 555],[719, 241, 793, 551],[515, 191, 596, 503],[316, 183, 384, 442],[357, 386, 469, 697],[317, 380, 401, 687].') 2023-11-03 00:01:58.914 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 00:01:58.914 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[248, 069, 791, 946];Hat:[326, 071, 553, 146];Belt:[320, 410, 574, 468];Sneakers:[358, 733, 480, 812],[600, 877, 750, 941].') 2023-11-03 00:02:20.715 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:02:20.715 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'more than ideas for the coolest black and white nursery cool') 2023-11-03 00:02:39.526 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:02:39.526 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[406, 585, 731, 750] and track its trajectory.For the trajectories included in the answer, please use the format Frame t:[xmin,ymin,xmax,ymax].', 'This is a pebaFrame1:[406, 585, 731, 750];Frame2:[543, 502, 864, 662];Frame3:[375, 527, 689, 687];Frame4:[521, 560, 809, 735].') 2023-11-03 00:03:05.023 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:03:05.023 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[509, 543, 593, 959],[023, 729, 075, 997],[039, 595, 122, 959],[368, 595, 406, 820],[484, 571, 511, 758],[510, 605, 532, 712];clothing:[136, 614, 203, 954],[041, 916, 091, 997],[046, 818, 090, 980],[095, 602, 161, 997],[178, 789, 227, 997],[211, 877, 289, 997],[260, 837, 316, 997],[304, 700, 405, 997],[520, 837, 595, 997],[572, 533, 628, 827],[594, 822, 660, 997],[675, 696, 723, 966],[681, 818, 776, 997],[927, 818, 956, 985],[953, 636, 999, 849];flag:[178, 327, 206, 588],[297, 318, 357, 602],[374, 490, 429, 681],[395, 421, 424, 602],[411, 000, 557, 466],[465, 641, 535, 985],[584, 088, 711, 452];man:[000, 576, 016, 799],[000, 490, 027, 748],[042, 476, 075, 617],[092, 516, 114, 708],[125, 444, 170, 703],[130, 471, 202, 997],[185, 626, 246, 997],[186, 744, 288, 997],[264, 521, 295, 763],[289, 490, 415, 997],[314, 552, 332, 681],[375, 748, 486, 997],[426, 507, 455, 590],[445, 507, 515, 997],[468, 526, 489, 672],[496, 502, 526, 622],[558, 430, 603, 607],[585, 571, 683, 997],[587, 430, 603, 540],[600, 471, 638, 631],[671, 531, 723, 944],[678, 531, 813, 997],[763, 602, 791, 832],[775, 820, 872, 997],[853, 526, 863, 662],[879, 626, 966, 997],[890, 531, 941, 648],[951, 547, 998, 803];girl:[000, 377, 999, 997],[040, 593, 120, 997];human face:[000, 837, 023, 954],[000, 354, 999, 995],[021, 787, 050, 935],[050, 619, 068, 734],[220, 660, 240, 791],[393, 614, 404, 708],[393, 760, 439, 978],[506, 614, 520, 693],[573, 459, 586, 543],[658, 645, 681, 837],[668, 600, 696, 712],[890, 712, 927, 870],[894, 543, 914, 624].') 2023-11-03 00:05:08.453 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:05:08.454 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[512, 341, 545, 436],personFrame1:[154, 487, 191, 609],personFrame1:[283, 334, 316, 433] in this video clip.Your response should conform strictly to the categoryFrame t:[xmin,ymin,xmax,ymax] format.', 'personFrame1[512, 341, 545, 436];Frame2:No object,personFrame1[154, 487, 191, 609];Frame2:No object,personFrame1:[283, 334, 316, 433];Frame2:[270, 326, 313, 427].') 2023-11-03 00:05:40.645 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:05:40.646 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Inspired by the classic BLT, this easy recipe ups your pasta salad game with [065, 022, 995, 418] rotini pasta, [775, 720, 990, 787] [435, 178, 609, 245] [467, 785, 642, 920] grape tomatoes, [207, 222, 382, 327] thick sliced bacon, green onions, cubes of [587, 184, 712, 251] [441, 677, 620, 788] [741, 777, 876, 862] sharp cheddar & a simple homemade Ranch dressing!') 2023-11-03 00:06:14.956 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:06:14.956 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '5. How is the conflict [003, 005, 997, 994] resolved') 2023-11-03 00:06:42.759 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:06:42.760 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Cup:[304, 533, 443, 936];Bottle:[373, 257, 470, 819];Satchel:[895, 351, 974, 535];Cell Phone:[580, 875, 724, 927];Person:[461, 145, 856, 774];Desk:[190, 728, 999, 999],[797, 535, 999, 736];shelf:[026, 409, 111, 904];Storage box:[819, 349, 900, 539];Potted Plant:[054, 314, 087, 411].') 2023-11-03 00:06:49.017 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:06:49.018 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the man in black pushing a car.Maintain consistency in your answer by using the TrackiFrame t:[xmin,ymin,xmax,ymax] format for trajectories.', 'There is one man in black pushing a car.Track1frame:1:[366, 379, 414, 545];frame:2:[378, 385, 423, 550];frame:3:[388, 393, 433, 558].') 2023-11-03 00:06:50.701 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:06:50.702 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect woman,man,clothing and human face in this image.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'woman:[437, 418, 538, 761],[606, 451, 782, 995],[723, 418, 805, 823],[738, 680, 998, 996],[822, 401, 902, 575],[918, 393, 998, 653];man:[171, 320, 371, 980],[000, 378, 998, 998];clothing:[630, 543, 778, 920],[000, 558, 040, 735],[000, 826, 052, 940],[003, 466, 093, 576],[032, 750, 106, 890],[033, 638, 087, 760],[083, 645, 115, 715],[105, 741, 163, 845],[107, 615, 148, 761],[111, 463, 197, 646],[135, 783, 202, 908],[163, 423, 223, 501],[185, 461, 335, 998],[291, 476, 332, 546],[325, 705, 387, 831],[345, 403, 436, 735],[346, 578, 400, 715],[423, 486, 448, 570],[446, 465, 535, 698],[453, 436, 526, 470],[527, 460, 575, 620],[556, 548, 626, 750],[560, 470, 636, 560],[603, 406, 627, 473],[733, 491, 805, 805],[793, 588, 821, 721],[793, 445, 831, 661],[828, 446, 903, 561],[835, 570, 900, 710],[860, 436, 900, 515],[905, 643, 973, 728],[923, 473, 977, 536],[927, 445, 998, 635],[982, 630, 998, 750];human face:[256, 391, 321, 460],[670, 473, 727, 558],[736, 853, 796, 996],[740, 851, 858, 998].') 2023-11-03 00:07:57.324 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:07:57.325 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'toy:[287, 273, 345, 412],[155, 188, 262, 525],[217, 651, 631, 916],[224, 274, 626, 576],[347, 919, 644, 999],[376, 086, 661, 421],[689, 092, 803, 239],[703, 274, 745, 406];man:[000, 588, 155, 1000],[861, 000, 953, 230];clothing:[736, 425, 914, 972],[000, 700, 070, 999],[828, 012, 853, 203],[861, 000, 951, 240],[948, 102, 999, 908];human face:[048, 775, 128, 879],[739, 308, 820, 500].') 2023-11-03 00:08:20.300 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:08:20.301 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'human head:[600, 098, 745, 334],[028, 098, 185, 343],[251, 266, 331, 429],[374, 454, 448, 609],[388, 291, 462, 437],[451, 373, 537, 549],[485, 184, 557, 343],[514, 351, 602, 502],[891, 008, 997, 394];mammal:[022, 103, 357, 1000],[057, 000, 231, 343],[237, 257, 385, 995],[331, 270, 460, 493],[334, 004, 491, 386],[351, 463, 445, 995],[425, 360, 585, 995],[457, 175, 608, 416],[568, 094, 828, 995],[628, 025, 997, 995],[657, 000, 774, 115],[702, 094, 748, 206],[717, 227, 871, 914];man:[557, 133, 842, 811],[014, 000, 240, 347],[331, 000, 491, 334],[454, 188, 608, 407];clothing:[025, 163, 308, 931],[071, 000, 217, 339],[245, 424, 365, 995],[288, 587, 442, 995],[340, 000, 488, 390],[417, 532, 585, 995],[460, 287, 617, 394],[574, 313, 831, 995],[820, 364, 997, 995];boy:[242, 266, 371, 995],[340, 287, 462, 489],[420, 377, 568, 995];woman:[634, 017, 997, 995],[660, 000, 777, 223];human hair:[031, 094, 140, 270],[380, 459, 445, 523],[400, 287, 465, 339],[457, 364, 540, 450],[731, 236, 777, 364],[900, 000, 997, 141];girl:[340, 459, 460, 995],[640, 000, 997, 995];human face:[080, 120, 191, 334],[265, 283, 334, 412],[368, 484, 440, 609],[388, 304, 448, 420],[451, 403, 528, 553],[488, 206, 545, 330],[534, 373, 585, 493],[602, 163, 702, 347],[897, 094, 997, 390];human arm:[000, 347, 360, 742],[074, 000, 242, 072],[222, 339, 340, 665],[345, 643, 425, 845],[348, 021, 440, 175],[417, 557, 457, 978],[437, 038, 494, 171],[562, 566, 588, 888],[651, 000, 728, 115],[682, 008, 777, 128],[697, 412, 842, 995].') 2023-11-03 00:08:39.039 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:08:39.040 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'furniture:[341, 748, 714, 848].') 2023-11-03 00:09:13.049 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:09:13.049 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Hat:[653, 066, 790, 224];Person:[564, 066, 977, 1000];Storage box:[854, 884, 928, 973].') 2023-11-03 00:09:19.554 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 00:09:19.554 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'snake:[000, 000, 944, 877].') 2023-11-03 00:10:01.217 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:10:01.217 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Potted Plant:[002, 094, 165, 1000];Picture:[313, 543, 412, 741];Person:[067, 218, 296, 997],[257, 228, 481, 996],[462, 134, 733, 994],[701, 224, 915, 999],[502, 516, 575, 688].') 2023-11-03 00:10:06.850 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:10:06.851 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Couch:[632, 877, 991, 1000],[318, 880, 627, 1000],[001, 863, 345, 998];Stool:[854, 709, 927, 823];Carpet:[001, 762, 1000, 1000];Moniter:[349, 357, 551, 430];Person:[111, 383, 243, 920],[190, 297, 309, 606],[208, 337, 308, 724],[299, 396, 361, 589],[345, 353, 447, 553],[264, 520, 422, 838],[429, 373, 521, 570],[509, 406, 577, 724],[560, 361, 617, 491],[544, 432, 632, 754],[604, 450, 724, 753],[665, 433, 761, 826],[731, 425, 858, 891],[692, 285, 831, 464],[765, 335, 878, 871];Desk:[522, 750, 752, 929],[203, 768, 437, 935],[409, 681, 541, 797].') 2023-11-03 00:10:40.152 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 00:10:40.153 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'clothing:[446, 501, 621, 749],[144, 556, 517, 999];human hair:[333, 334, 471, 569],[486, 424, 555, 506],[588, 575, 895, 999];human head:[084, 515, 183, 702],[322, 333, 470, 591],[493, 427, 560, 553];mammal:[000, 515, 180, 999],[138, 324, 532, 999],[433, 424, 606, 710],[526, 571, 943, 994];man:[143, 344, 508, 849],[445, 428, 608, 765].') 2023-11-03 00:11:01.370 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:11:01.370 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Glasses in this image.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Glasses:[453, 301, 612, 401].') 2023-11-03 00:11:06.037 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:11:06.038 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Cabinet:[157, 558, 256, 681],[488, 622, 590, 730],[001, 587, 060, 702];Potted Plant:[397, 001, 999, 918];Person:[419, 526, 490, 692],[257, 186, 315, 244];Trash bin Can:[362, 633, 486, 832].') 2023-11-03 00:11:17.455 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:11:17.455 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[421, 475, 502, 564] and track its trajectory.Use the specified Frame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'This is a fireboatFrame1:[421, 475, 502, 564];Frame2:[421, 475, 506, 564];Frame3:[420, 479, 521, 576];Frame4:[430, 479, 542, 585];Frame5:[441, 479, 568, 594].') 2023-11-03 00:11:55.411 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:11:55.411 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Indian schoolchildren [345, 674, 503, 996] [648, 636, 795, 998] [134, 639, 292, 997] hold masks with the face [755, 463, 839, 659] [139, 471, 229, 675] [585, 490, 673, 688] of Bill Gates [405, 358, 481, 513] [502, 255, 565, 411] [470, 461, 561, 670] [332, 505, 417, 718].') 2023-11-03 00:12:00.592 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:12:00.592 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Bracelet in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Bracelet:[090, 443, 256, 535],[028, 560, 199, 694],[777, 577, 943, 709].') 2023-11-03 00:13:19.686 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:13:19.687 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Street Lights:[755, 513, 826, 740],[514, 509, 588, 668];Flag:[357, 782, 455, 986].') 2023-11-03 00:13:28.259 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:13:28.260 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[010, 214, 199, 909],[000, 488, 057, 775],[352, 281, 500, 901],[485, 401, 599, 864],[716, 321, 846, 879],[209, 494, 266, 766],[453, 524, 490, 749];Hat:[013, 135, 118, 345],[733, 256, 799, 399];Other Shoes:[032, 863, 104, 901],[120, 872, 168, 912],[374, 861, 421, 903],[419, 846, 452, 902],[764, 731, 814, 795];Dinning Table:[037, 603, 313, 805];Chair:[188, 590, 283, 815],[253, 592, 347, 809],[000, 585, 059, 801];Desk:[678, 639, 894, 811],[488, 621, 589, 767].') 2023-11-03 00:13:38.017 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 00:13:38.018 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Person:[010, 181, 183, 567].') 2023-11-03 00:13:42.712 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:13:42.712 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Car:[000, 701, 066, 823],[292, 710, 520, 805];Street Lights:[603, 249, 664, 776];Person:[532, 676, 568, 766],[876, 687, 914, 777].') 2023-11-03 00:13:45.336 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:13:45.337 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Aprilia’s working on a small capacity sportbike [091, 232, 929, 917] to compete with KTM’s 390 Duke and RC') 2023-11-03 00:13:54.889 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:13:54.889 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "American Lawn Mower 1204-14's good blade [228, 005, 988, 882] and simple design make it a worthy investment.") 2023-11-03 00:14:06.515 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:14:06.515 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[071, 738, 206, 918],carFrame1:[000, 805, 082, 995],truckFrame1:[123, 758, 320, 972] in this video clip.Adhere strictly to the format categoryFrame t:[xmin,ymin,xmax,ymax] when providing your answer.', 'carFrame1:[071, 738, 206, 918];Frame2:[227, 728, 362, 916],carFrame1:[000, 805, 082, 995];Frame2:[039, 787, 180, 976],truckFrame1:[123, 758, 320, 972];Frame2:[276, 740, 488, 989].') 2023-11-03 00:14:09.804 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:14:09.804 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Lamp:[356, 001, 524, 184],[394, 183, 484, 310];Frame:[393, 370, 478, 515].') 2023-11-03 00:14:21.668 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:14:21.668 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is thisFrame1:[353, 001, 475, 261] and track its trajectory.To maintain consistency, ensure that trajectories in your answer match the Frame t:[xmin,ymin,xmax,ymax] setup.', 'This is a raccoonFrame1:[353, 001, 475, 261];Frame2:[355, 001, 482, 413];Frame3:[341, 001, 509, 476].') 2023-11-03 00:14:45.182 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:14:45.183 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Launch of the special death dying and bereavement edition of Studies; [481, 003, 998, 444] an Irish quarterly review December 2009.') 2023-11-03 00:14:59.716 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:14:59.716 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'human body:[147, 150, 850, 996],[000, 504, 073, 859],[000, 284, 180, 995],[065, 273, 363, 999],[577, 414, 731, 645],[844, 499, 999, 999];human hair:[310, 083, 601, 455],[000, 504, 053, 608],[048, 304, 189, 506],[163, 301, 189, 429],[576, 429, 728, 658],[877, 504, 999, 832];human head:[335, 060, 593, 573],[000, 490, 056, 734],[054, 292, 176, 602],[156, 276, 342, 444],[596, 430, 708, 635],[843, 492, 999, 738];mammal:[178, 121, 853, 986],[000, 494, 071, 830],[000, 296, 263, 996],[060, 302, 400, 999],[573, 426, 726, 657],[845, 503, 999, 999];glasses:[350, 297, 574, 433],[055, 447, 120, 528];human mouth:[440, 464, 523, 534];woman:[575, 433, 728, 652],[765, 479, 999, 999];man:[000, 295, 199, 897],[039, 286, 423, 999],[116, 051, 900, 999];human face:[000, 533, 039, 707],[050, 397, 145, 614],[351, 143, 589, 630],[613, 460, 705, 641],[841, 539, 928, 735];human arm:[000, 322, 096, 447],[120, 368, 404, 999],[743, 691, 840, 999],[831, 951, 896, 999],[850, 863, 873, 976],[973, 830, 999, 999];human nose:[446, 349, 501, 447].') 2023-11-03 00:15:47.077 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:15:47.078 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2 and frame3, please tell me what this oneframe:1:[523, 208, 540, 224];frame:2:[565, 091, 573, 134];frame:3:[571, 094, 580, 146] is doing?With reference to the supplied image frames and trajectory coordinates, please formulate a concise response.', 'This is a/an the cows exiting the vehicle.') 2023-11-03 00:16:33.733 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:16:33.734 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[019, 021, 981, 937] 1 euro Owl Coin which has [259, 246, 630, 649] the “S” in one of the stars') 2023-11-03 00:16:45.155 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:16:45.155 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[446, 251, 552, 780] and track its trajectory.For the trajectories included in the answer, please use the format Frame t:[xmin,ymin,xmax,ymax].', 'This is a personFrame1:[446, 251, 552, 780];Frame2:[438, 221, 562, 810];Frame3:[438, 190, 555, 772];Frame4:[440, 159, 550, 748];Frame5:[427, 160, 546, 789].') 2023-11-03 00:16:45.993 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:16:45.993 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'man:[016, 106, 362, 1000],[459, 224, 982, 999];shirt:[116, 291, 230, 575],[670, 419, 784, 865];human face:[113, 169, 221, 361],[653, 237, 766, 440].') 2023-11-03 00:17:06.837 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:17:06.837 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Chair:[629, 402, 999, 967],[001, 412, 376, 987];Person:[510, 089, 879, 943],[087, 083, 551, 961],[309, 918, 553, 999];Leather Shoes:[413, 874, 551, 956],[317, 844, 412, 923],[667, 870, 749, 934],[596, 825, 690, 919],[313, 927, 412, 997].') 2023-11-03 00:17:18.880 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:17:18.881 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'human body:[670, 164, 870, 923],[060, 095, 170, 293],[190, 091, 300, 300],[298, 088, 419, 298],[400, 083, 541, 300],[555, 069, 693, 291];human head:[736, 172, 811, 291],[097, 089, 156, 189],[238, 082, 296, 180],[340, 084, 401, 182],[440, 079, 500, 175],[598, 069, 668, 166];mammal:[673, 195, 863, 860],[058, 087, 197, 292],[183, 077, 316, 292],[294, 079, 416, 294],[400, 077, 543, 299],[553, 065, 698, 292];man:[665, 164, 860, 908];jeans:[702, 507, 830, 911];human face:[099, 108, 141, 189],[238, 108, 285, 177],[341, 096, 383, 179],[605, 080, 651, 170],[734, 168, 795, 277].') 2023-11-03 00:17:36.067 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 00:17:36.068 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person and Hat in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Person:[000, 777, 267, 999],[196, 756, 307, 978],[357, 735, 445, 950],[367, 798, 578, 992],[436, 829, 591, 999],[561, 845, 719, 982],[824, 770, 901, 917];Hat:[823, 768, 899, 817],[441, 799, 497, 854],[196, 755, 245, 805],[229, 570, 294, 671],[182, 564, 233, 654].') 2023-11-03 00:18:32.944 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:18:32.944 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Chair in this image.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Chair:[624, 492, 775, 639],[223, 511, 349, 964],[223, 632, 600, 999],[817, 809, 1000, 1000].') 2023-11-03 00:19:20.807 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:19:20.807 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Hockey Stick:[345, 615, 450, 753],[113, 373, 267, 724],[044, 856, 145, 1000];Helmet:[413, 497, 465, 569],[313, 405, 366, 491],[207, 515, 256, 566],[118, 653, 177, 716];Gloves:[202, 623, 252, 694],[141, 856, 174, 910],[079, 895, 142, 969],[255, 692, 316, 769],[416, 648, 465, 697];Sneakers:[505, 748, 557, 836],[165, 225, 222, 272];Person:[408, 496, 574, 867],[268, 404, 491, 832],[192, 517, 399, 879],[083, 654, 358, 1000],[000, 018, 109, 295],[054, 001, 194, 285],[141, 001, 271, 269],[213, 001, 314, 237],[284, 001, 446, 229],[424, 001, 491, 210],[528, 001, 610, 172].') 2023-11-03 00:19:41.549 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:19:41.550 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'woman:[104, 063, 998, 997];fashion accessory:[508, 154, 574, 294],[561, 115, 625, 219],[574, 185, 665, 299],[736, 820, 786, 997];girl:[120, 061, 998, 997];flower:[511, 122, 663, 303];human face:[216, 190, 562, 857].') 2023-11-03 00:20:03.751 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:20:03.751 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A person [589, 146, 727, 568] successfully conquers a mountain of fear, rejection, evil, and temptation through love: It alludes to how the love of Christ helps overcome all obstacles and turmoil of the world.') 2023-11-03 00:20:11.413 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:20:11.414 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'How to use a curling iron [535, 303, 922, 532] in Nevis using a type A power adapter [372, 402, 502, 581]') 2023-11-03 00:20:13.332 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:20:13.333 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Carpet:[003, 817, 998, 1000].') 2023-11-03 00:20:21.034 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:20:21.035 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Group of [683, 125, 792, 760] [162, 144, 318, 765] [428, 125, 567, 855] multi ethnic kids wearing [210, 188, 263, 253] [438, 225, 481, 282] [466, 202, 521, 269] [716, 206, 770, 276] face masks performing yoga in the school garden. Primary education social distancing health safety during Covid19 Coronavirus pandemic.') 2023-11-03 00:20:31.412 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:20:31.412 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[492, 034, 996, 993] Theo van Gogh, aged 13, and 32. The first picture was originally identified as Vincent van Gogh') 2023-11-03 00:20:48.210 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:20:48.211 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the two rabbits playfully chase each other in circles and move from right to left. in this video clip?Ensure the trajectories in your answer follow the TrackiFrame t:[xmin,ymin,xmax,ymax] structure.', 'There are 2 two rabbits playfully chase each other in circles and move from right to left..Track1frame:1:[570, 325, 704, 536];frame:2:[570, 325, 704, 536];frame:3:[633, 250, 730, 553],Track2frame:1:[640, 281, 777, 484];frame:2:[640, 281, 777, 484];frame:3:[591, 286, 709, 487].') 2023-11-03 00:20:56.471 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:20:56.472 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'In this Tuesday, Nov. 6, 2018, file photo, voters [176, 250, 289, 683] [502, 312, 608, 844] [784, 437, 909, 959] [357, 272, 466, 801] stand in line to cast their ballots at P.S. 22, in the Prospect Heights neighborhood in the Brooklyn borough of New York.') 2023-11-03 00:21:16.474 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:21:16.474 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'human head:[459, 541, 603, 720],[000, 825, 058, 909],[109, 713, 205, 870],[111, 220, 177, 335],[256, 674, 314, 776],[331, 713, 403, 811],[384, 143, 519, 398],[449, 587, 491, 667],[657, 779, 729, 916],[687, 555, 727, 664],[878, 412, 967, 590],[955, 548, 997, 688],[958, 447, 997, 538];mammal:[046, 199, 272, 972],[000, 174, 997, 996],[386, 146, 554, 566];man:[044, 220, 256, 937],[000, 178, 997, 996],[379, 146, 559, 587];clothing:[058, 286, 226, 776],[000, 888, 088, 996],[118, 797, 279, 996],[331, 762, 461, 996],[354, 370, 699, 996],[696, 625, 797, 877],[769, 835, 820, 996],[832, 472, 997, 968];human hair:[000, 825, 051, 888],[109, 716, 198, 797],[389, 146, 501, 286],[477, 552, 587, 720],[662, 765, 720, 856],[689, 552, 722, 625],[888, 412, 965, 486],[967, 531, 997, 618];human face:[125, 769, 170, 870],[128, 230, 170, 325],[282, 699, 312, 772],[384, 185, 494, 391],[501, 580, 575, 713],[671, 800, 727, 898],[892, 444, 960, 576],[958, 458, 995, 559],[969, 545, 997, 681].') 2023-11-03 00:21:23.993 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:21:23.994 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[586, 321, 777, 748] and track its trajectory.Your response should highlight trajectories using the established Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a personFrame1:[586, 321, 777, 748];Frame2:[573, 344, 871, 768];Frame3:[588, 247, 879, 687];Frame4:[611, 204, 839, 713].') 2023-11-03 00:21:26.162 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:21:26.162 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect flower,woman,human nose and glasses in this image.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'flower:[493, 049, 596, 164],[140, 000, 999, 999];woman:[144, 096, 796, 976];human nose:[256, 440, 375, 620];glasses:[091, 370, 480, 577].') 2023-11-03 00:21:30.626 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:21:30.626 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[814, 307, 982, 1000],[668, 343, 854, 999],[594, 236, 693, 762],[524, 151, 689, 455],[486, 339, 637, 1000],[314, 334, 494, 999],[432, 203, 532, 558],[309, 276, 393, 547],[129, 331, 309, 999],[038, 307, 205, 1000],[039, 368, 079, 531];Hat:[124, 307, 173, 355],[204, 331, 259, 385],[383, 335, 435, 388],[739, 343, 793, 391];Sneakers:[920, 918, 974, 999],[821, 892, 879, 957],[733, 949, 793, 999];Other Shoes:[038, 949, 094, 1000].') 2023-11-03 00:22:15.665 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:22:15.666 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Depiction of [611, 329, 942, 919] Maimonides teaching [167, 398, 337, 986] [010, 376, 184, 990] students about the 'measure of man' in [006, 006, 990, 994] an illuminated manuscript.") 2023-11-03 00:22:31.840 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:22:31.840 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[504, 031, 619, 153] [512, 378, 616, 489] Sticky notes decorate [002, 006, 994, 996] a fridge as we keep track of all the items being sold on Craigslist') 2023-11-03 00:22:40.278 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:22:40.279 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Person:[274, 447, 468, 863],[395, 272, 634, 766],[935, 457, 999, 760],[900, 075, 1000, 358],[269, 247, 308, 310];Trash bin Can:[721, 283, 807, 432],[632, 281, 724, 438],[558, 251, 637, 446],[110, 312, 142, 405],[036, 309, 113, 463];Dog:[473, 540, 771, 999];Other Shoes:[002, 912, 045, 958];Sneakers:[317, 817, 397, 863];Sandals:[456, 709, 500, 763];Backpack:[888, 122, 975, 256];Laptop:[917, 294, 999, 389];Desk:[199, 286, 322, 335],[002, 386, 100, 661],[756, 356, 999, 681];Chair:[220, 282, 251, 337],[793, 341, 917, 592];Carpet:[550, 445, 920, 529],[577, 507, 1000, 938].') 2023-11-03 00:22:41.173 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:22:41.174 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'woman:[720, 329, 806, 837];man:[000, 490, 096, 742],[000, 599, 146, 999],[289, 272, 358, 612],[306, 218, 568, 999],[351, 308, 468, 886],[544, 231, 758, 999],[618, 290, 768, 901],[778, 141, 808, 209],[787, 239, 841, 540],[842, 130, 877, 200],[903, 235, 968, 567];tree:[000, 209, 160, 492],[121, 177, 259, 481],[246, 106, 413, 407],[689, 160, 728, 270],[743, 000, 855, 064],[908, 000, 999, 463];clothing:[000, 553, 098, 739],[060, 737, 368, 999],[291, 308, 351, 602],[304, 299, 533, 998],[567, 308, 765, 999],[792, 262, 849, 531],[908, 268, 965, 549].') 2023-11-03 00:23:12.863 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:23:12.863 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[134, 412, 546, 858] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a blue bus running on the mountain roadFrame1:[134, 412, 546, 858];Frame2:[143, 406, 541, 852];Frame3:[147, 408, 539, 852];Frame4:[164, 402, 532, 851];Frame5:[170, 397, 528, 847].') 2023-11-03 00:23:23.451 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:23:23.451 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Potted Plant:[033, 139, 242, 624],[430, 016, 992, 763];Person:[164, 247, 415, 996],[392, 340, 552, 1000],[492, 276, 830, 996];Glasses:[614, 358, 715, 409];Tie:[309, 472, 351, 731],[604, 485, 680, 725];Wine Glass:[948, 844, 987, 938];Speaker:[473, 257, 529, 372].') 2023-11-03 00:24:18.238 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:24:18.239 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[717, 561, 775, 635],[039, 525, 068, 577];Bicycle:[623, 594, 697, 664],[816, 565, 858, 607];Traffic Sign:[455, 447, 524, 481];Stool:[001, 630, 028, 662];Storage box:[260, 652, 323, 687].') 2023-11-03 00:24:35.607 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:24:35.607 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, what is this objectframe:1:[001, 001, 840, 998];frame:2:[000, 001, 840, 998];frame:3:[000, 001, 840, 998] doing?Ensure your answer is to the point, taking into account the provided image frames and trajectory coordinates.', 'This is a/an minivan with three puppies') 2023-11-03 00:24:49.034 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:24:49.034 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'bicycle:[493, 495, 884, 869];bicycle wheel:[516, 677, 643, 881],[748, 665, 855, 834];person:[448, 177, 818, 750];wheel:[517, 684, 638, 880],[756, 665, 855, 844];bicycle helmet:[437, 152, 579, 415];footwear:[667, 682, 770, 845],[671, 572, 766, 692].') 2023-11-03 00:24:55.353 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:24:55.354 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the dog run ahead?As you detail the trajectories in your reply, the TrackiFrame t:[xmin,ymin,xmax,ymax] format is the preferred structure.', 'There is one dog run ahead.Track1frame:1:[370, 106, 569, 205];frame:2:[383, 083, 583, 189];frame:3:[394, 111, 604, 214].') 2023-11-03 00:25:29.251 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:25:29.252 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person,Glasses and Handbag/Satchel in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Person:[538, 042, 881, 902],[084, 064, 459, 996];Glasses:[542, 090, 625, 121];Handbag:[707, 138, 934, 367].') 2023-11-03 00:25:37.088 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:25:37.088 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Speaker,Potted Plant and Bakset in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Speaker:[139, 576, 200, 906],[802, 588, 864, 932],[456, 131, 550, 203];Potted Plant:[790, 435, 1000, 999];Bakset:[725, 151, 821, 234].') 2023-11-03 00:26:35.304 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:26:35.305 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Sneakers:[141, 705, 194, 744],[541, 829, 599, 878],[446, 842, 484, 880],[589, 791, 651, 848],[619, 732, 642, 766],[727, 627, 765, 670];Chair:[705, 460, 727, 588],[772, 468, 818, 592];Stool:[949, 603, 1000, 658];Person:[588, 603, 897, 922],[831, 287, 1000, 773],[708, 322, 804, 695],[622, 296, 717, 710],[613, 320, 642, 384],[589, 310, 620, 355],[548, 273, 642, 765],[428, 322, 599, 880],[461, 294, 527, 391],[386, 286, 463, 746],[438, 285, 480, 471],[354, 295, 409, 727],[348, 306, 374, 372],[343, 283, 383, 364],[283, 289, 389, 745],[267, 276, 305, 695],[217, 272, 291, 763],[178, 286, 232, 679],[114, 248, 221, 754],[114, 253, 197, 720],[025, 272, 128, 717],[769, 420, 798, 479],[694, 368, 724, 551],[946, 415, 1000, 556],[975, 413, 999, 506].') 2023-11-03 00:26:37.098 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:26:37.099 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the boy in gray squatting and standing up?Stick to the TrackiFrame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'There is one boy in gray squatting and standing up.Track1frame:1:[236, 125, 512, 997];frame:2:[231, 114, 512, 995];frame:3:[223, 095, 505, 997].') 2023-11-03 00:26:40.118 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:26:40.118 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Necklace,Trash bin Can and Glasses in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Necklace:[392, 528, 443, 572];Trash bin Can:[009, 753, 113, 872];Glasses:[617, 165, 670, 212].') 2023-11-03 00:26:53.493 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:26:53.494 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'human body:[077, 512, 173, 1000],[038, 157, 084, 264],[068, 393, 097, 522],[136, 280, 151, 369];man:[065, 547, 171, 1000],[046, 158, 096, 260],[051, 385, 120, 650],[130, 282, 155, 355];clothing:[090, 595, 175, 910],[061, 420, 101, 600];boy:[081, 552, 186, 999];tree:[069, 000, 999, 575],[817, 001, 999, 545];train:[000, 000, 077, 999],[084, 010, 683, 737].') 2023-11-03 00:27:21.745 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:27:21.745 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[386, 361, 555, 631],carFrame1:[268, 484, 393, 588],truckFrame1:[920, 329, 999, 494] in this video clip.Adhere strictly to the format categoryFrame t:[xmin,ymin,xmax,ymax] when providing your answer.', 'carFrame1:[386, 361, 555, 631];Frame2:[392, 371, 558, 643],carFrame1:[268, 484, 393, 588];Frame2:[170, 510, 333, 626],truckFrame1:[920, 329, 999, 494];Frame2:[964, 344, 999, 516].') 2023-11-03 00:27:22.597 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 00:27:22.598 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect woman,human body,human leg and girl in this image.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[392, 249, 941, 950];human body:[381, 270, 938, 991],[267, 725, 382, 905];human leg:[273, 791, 350, 902],[470, 646, 709, 999],[524, 643, 985, 999];girl:[348, 271, 998, 999].') 2023-11-03 00:27:29.405 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:27:29.405 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the yellow lizard on the back-side of another lizard?To ensure accuracy, apply the TrackiFrame t:[xmin,ymin,xmax,ymax] template for every trajectory in your response.', 'There is one yellow lizard on the back-side of another lizard.Track1frame:1:[000, 172, 532, 975];frame:2:[000, 248, 560, 998];frame:3:[248, 153, 621, 998].') 2023-11-03 00:28:14.782 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:28:14.783 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'tree:[043, 857, 108, 972],[000, 315, 176, 375],[000, 377, 999, 719],[024, 699, 070, 817],[129, 719, 265, 833],[241, 846, 593, 999],[247, 338, 363, 397],[549, 734, 599, 805],[732, 850, 767, 925],[862, 745, 928, 794];building:[137, 455, 952, 650],[259, 699, 571, 869],[600, 701, 838, 831],[767, 873, 940, 969],[851, 703, 966, 782].') 2023-11-03 00:28:39.029 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 00:28:39.030 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Fishing Rod,Laptop,Person and Tie in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Laptop:[222, 600, 554, 1000],[199, 539, 394, 667];Person:[109, 143, 1000, 999],[307, 126, 693, 884],[932, 363, 1000, 545],[921, 001, 995, 118],[654, 001, 781, 234],[687, 001, 790, 129],[577, 001, 673, 192],[632, 001, 676, 147],[495, 001, 605, 266],[404, 001, 467, 129],[302, 001, 384, 090],[168, 001, 247, 102],[056, 001, 195, 150],[001, 001, 062, 274],[001, 001, 193, 448],[150, 655, 218, 764];Tie:[035, 133, 126, 264],[679, 054, 723, 171].') 2023-11-03 00:28:40.959 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:28:40.960 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "[026, 302, 923, 674] A Sukhoi Su-24 fighter jet takes off from the Hmeymim air base near Latakia, Syria, in this handout photograph released by Russia's Defence Ministry on October 22, 2015. reuters") 2023-11-03 00:28:43.990 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:28:43.990 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, can you tell me what thisframe:1:[000, 233, 351, 488];frame:2:[000, 220, 217, 480];frame:3:[000, 215, 046, 436] is?Tailor your answer briefly, using the image frames and trajectory coordinates given.', 'This is a/an the van going to the left.') 2023-11-03 00:29:23.593 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:29:23.593 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'tree:[024, 408, 268, 984],[000, 000, 455, 549];building:[001, 897, 090, 999],[208, 056, 998, 999].') 2023-11-03 00:29:42.898 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:29:42.899 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'house:[000, 760, 453, 999],[100, 567, 132, 637],[132, 556, 160, 656],[154, 570, 168, 660],[170, 561, 206, 651],[206, 574, 234, 655],[234, 553, 271, 655],[288, 575, 303, 646],[305, 579, 328, 647],[324, 560, 375, 660],[381, 551, 441, 636],[505, 465, 577, 506],[688, 693, 999, 999],[691, 579, 724, 644],[725, 553, 756, 647],[820, 616, 883, 689].') 2023-11-03 00:30:29.659 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 00:30:29.659 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[000, 305, 436, 999],[099, 088, 394, 919],[411, 033, 679, 829],[427, 001, 886, 999],[808, 092, 968, 999];Stuffed Toy:[396, 797, 529, 978],[270, 337, 380, 641];Storage box:[374, 010, 598, 159],[374, 134, 568, 268],[068, 001, 386, 134],[070, 119, 373, 276];Desk:[866, 573, 1000, 1000];Extention Cord:[962, 387, 999, 458];Glasses:[077, 475, 147, 569].') 2023-11-03 00:30:32.663 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:30:32.664 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'bicycle:[214, 598, 537, 901],[000, 901, 106, 998],[351, 559, 547, 692],[415, 671, 942, 998],[523, 694, 933, 833];bicycle wheel:[291, 730, 486, 903],[189, 708, 354, 808],[393, 622, 491, 667],[434, 901, 788, 998],[701, 753, 893, 848];wheel:[286, 731, 486, 897],[191, 708, 368, 804],[419, 888, 770, 998],[714, 742, 923, 824];person:[523, 296, 644, 722],[754, 289, 886, 739];tire:[000, 893, 121, 998],[188, 708, 378, 809],[271, 756, 510, 910],[340, 647, 478, 691],[385, 608, 493, 667],[426, 894, 766, 998],[708, 743, 933, 815].') 2023-11-03 00:30:36.959 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:30:36.959 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Backpack:[097, 546, 179, 674];Sneakers:[178, 741, 250, 776],[247, 728, 295, 755],[476, 715, 506, 741],[470, 699, 503, 720];Other Shoes:[597, 696, 619, 718],[676, 734, 700, 761],[624, 741, 645, 765];Trash bin Can:[738, 598, 838, 779],[833, 605, 937, 791];Telephone:[892, 495, 947, 516];Carpet:[000, 613, 640, 911];Cabinet:[292, 406, 399, 598];Chair:[398, 437, 484, 609];Desk:[742, 447, 843, 746],[841, 488, 999, 730],[510, 467, 555, 610];Storage box:[844, 312, 870, 340],[872, 314, 895, 343],[951, 311, 975, 344],[924, 316, 948, 343],[897, 312, 923, 343];TV:[861, 377, 897, 472];Keyboard:[936, 481, 970, 504];Person:[598, 239, 765, 765],[522, 429, 620, 718],[470, 576, 609, 748],[580, 406, 632, 674],[126, 219, 295, 776],[031, 450, 116, 741],[000, 650, 082, 1000],[573, 241, 611, 274],[577, 289, 606, 334],[571, 346, 609, 378],[578, 390, 600, 420].') 2023-11-03 00:30:40.056 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:30:40.056 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Moniter:[224, 460, 357, 646],[369, 443, 486, 613];Hat:[070, 329, 169, 430],[467, 557, 571, 716];Satchel:[785, 815, 906, 999],[021, 565, 225, 953];Mirror:[700, 106, 1000, 627];Storage box:[226, 725, 310, 847],[037, 469, 103, 546];shelf:[000, 732, 162, 999];Person:[019, 322, 238, 762],[484, 157, 671, 1000],[836, 213, 999, 999];Ballon:[939, 265, 999, 558],[880, 001, 961, 103];Bottle:[436, 686, 477, 787],[429, 618, 494, 775].') 2023-11-03 00:30:46.892 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:30:46.892 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'violin:[685, 020, 916, 923];sunglasses:[335, 219, 423, 372],[609, 303, 676, 408];cello:[656, 010, 963, 998];woman:[520, 212, 850, 960];tree:[000, 000, 998, 568],[002, 525, 453, 813];human face:[286, 224, 420, 440],[612, 279, 679, 424].') 2023-11-03 00:31:05.918 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:31:05.918 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'human body:[713, 439, 824, 848],[070, 452, 087, 515],[215, 450, 256, 575],[287, 461, 308, 547],[315, 465, 378, 707],[362, 460, 406, 594],[397, 456, 476, 692],[574, 497, 675, 781],[865, 665, 999, 999];man:[573, 455, 670, 795],[011, 434, 043, 541],[143, 438, 171, 501],[216, 432, 253, 575],[394, 427, 485, 690],[863, 329, 999, 999];building:[000, 000, 033, 384],[124, 179, 226, 317],[211, 000, 253, 405],[250, 000, 353, 401],[351, 000, 999, 384];human arm:[460, 468, 480, 578],[626, 505, 651, 597],[766, 515, 813, 616],[865, 742, 999, 999].') 2023-11-03 00:31:27.606 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:31:27.606 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[625, 496, 757, 635],carFrame1:[000, 320, 356, 818],busFrame1:[448, 449, 492, 509],carFrame1:[607, 481, 647, 539],busFrame1:[000, 202, 235, 482] in this video clip.Use only the categoryFrame t:[xmin,ymin,xmax,ymax] format to structure your answer.', 'carFrame1:[625, 496, 757, 635];Frame2:[629, 487, 775, 639],carFrame1[000, 320, 356, 818];Frame2:No object,busFrame1:[448, 449, 492, 509];Frame2:[438, 439, 496, 508],carFrame1:[607, 481, 647, 539];Frame2:[621, 472, 671, 540],busFrame1[000, 202, 235, 482];Frame2:No object.') 2023-11-03 00:31:35.184 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:31:35.184 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[326, 245, 666, 694],[067, 163, 247, 580];Dog:[139, 378, 411, 629];Boat:[054, 406, 803, 855];Glasses:[446, 306, 506, 335],[101, 234, 158, 271];Hat:[416, 246, 536, 370];Paddle:[392, 358, 957, 686],[001, 187, 185, 714],[176, 581, 383, 689].') 2023-11-03 00:31:52.038 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 00:31:52.039 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'building:[011, 000, 871, 990];window:[022, 163, 197, 461],[000, 651, 080, 908],[052, 028, 203, 126],[089, 000, 221, 045],[502, 000, 557, 106],[509, 644, 616, 939].') 2023-11-03 00:32:21.848 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 00:32:21.848 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Desk in this image.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Desk:[092, 548, 883, 832],[002, 553, 240, 999].') 2023-11-03 00:32:23.563 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:32:23.564 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Skiboard:[357, 618, 567, 703];Helmet:[444, 187, 485, 263];Sneakers:[466, 611, 515, 667],[429, 592, 481, 648];Person:[647, 243, 706, 441],[394, 229, 437, 392],[405, 188, 519, 654].') 2023-11-03 00:33:06.346 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:33:06.346 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'woman:[000, 386, 072, 568],[087, 427, 177, 711],[111, 387, 228, 532],[211, 320, 276, 564],[398, 316, 467, 482],[451, 355, 547, 589],[485, 274, 535, 480],[536, 738, 776, 998],[541, 344, 631, 617],[561, 585, 733, 915],[645, 374, 740, 698],[677, 246, 736, 380],[713, 527, 852, 992],[826, 421, 986, 994],[866, 210, 906, 387];man:[000, 659, 180, 998],[043, 327, 121, 662],[101, 303, 170, 421],[131, 517, 471, 998],[250, 306, 326, 802],[271, 231, 327, 325],[301, 386, 378, 636],[361, 242, 401, 318],[391, 241, 480, 365],[438, 254, 487, 386],[543, 267, 588, 369],[552, 248, 642, 461],[716, 205, 772, 442],[735, 297, 902, 551],[812, 408, 990, 998],[813, 231, 903, 393],[822, 171, 876, 288],[896, 203, 957, 322],[902, 199, 992, 457];clothing:[000, 578, 045, 877],[000, 431, 065, 568],[000, 798, 178, 998],[052, 380, 111, 546],[098, 338, 182, 404],[100, 508, 166, 715],[157, 798, 376, 998],[165, 431, 218, 544],[190, 357, 226, 436],[207, 382, 262, 531],[245, 378, 318, 677],[297, 587, 547, 998],[298, 487, 367, 647],[323, 342, 395, 495],[365, 265, 400, 333],[396, 361, 472, 495],[442, 284, 481, 359],[453, 448, 526, 610],[496, 320, 533, 446],[518, 286, 548, 365],[551, 291, 595, 374],[551, 427, 631, 621],[553, 896, 761, 998],[562, 288, 637, 369],[620, 418, 682, 576],[637, 474, 747, 741],[662, 299, 760, 397],[687, 297, 736, 393],[711, 233, 770, 367],[730, 384, 910, 525],[733, 649, 838, 998],[807, 499, 992, 998],[812, 276, 900, 391],[902, 239, 945, 320],[910, 267, 988, 355],[912, 762, 998, 998],[917, 290, 998, 455];human face:[415, 516, 461, 645],[478, 372, 512, 440],[566, 365, 602, 429],[615, 871, 707, 998],[801, 338, 846, 421].') 2023-11-03 00:33:16.046 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:33:16.046 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the those poultry that ate the food in the red pot?Your trajectory details should strictly align with the TrackiFrame t:[xmin,ymin,xmax,ymax] format in your submission.', 'There are 8 those poultry that ate the food in the red pot.Track1frame:1:[355, 414, 497, 797];frame:2:[364, 420, 496, 790];frame:3:[348, 323, 494, 784],Track2frame:1:[553, 232, 689, 650];frame:2:[539, 237, 689, 653];frame:3:[572, 245, 678, 629],Track3frame:1:[413, 001, 496, 103];frame:2:[404, 001, 482, 092];frame:3:[338, 001, 437, 074],Track4frame:1:[224, 146, 426, 451];frame:2:[223, 156, 428, 451];frame:3:[225, 140, 441, 450],Track5frame:1:[190, 046, 393, 247];frame:2:[170, 033, 367, 304];frame:3:[169, 069, 336, 338],Track6frame:1:[451, 009, 533, 320];frame:2:[453, 009, 533, 321];frame:3:[454, 008, 535, 320],Track7frame:1:[617, 310, 927, 746];frame:2:[574, 370, 929, 862];frame:3:[476, 473, 876, 908],Track8frame:1:[886, 762, 999, 998];frame:2:[831, 768, 999, 998];frame:3:[873, 669, 999, 998].') 2023-11-03 00:33:34.584 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:33:34.584 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Person:[147, 157, 768, 1000],[860, 162, 999, 544];Chair:[195, 134, 812, 995];Desk:[782, 515, 1000, 1000],[724, 314, 910, 453];Storage box:[702, 091, 906, 323],[763, 476, 907, 599];Broom:[002, 441, 088, 918];Bottle:[019, 002, 186, 162],[898, 228, 948, 319].') 2023-11-03 00:33:37.662 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:33:37.662 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[410, 454, 660, 718] and track its trajectory.Ensure that each trajectory within your response is articulated using the Frame t:[xmin,ymin,xmax,ymax] specification.', 'This is a mountain bikeFrame1:[410, 454, 660, 718];Frame2:[205, 472, 417, 720];Frame3:[172, 329, 308, 618];Frame4:[129, 488, 402, 862];Frame5:[362, 369, 589, 705].') 2023-11-03 00:34:15.909 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:34:15.909 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Hollywood star Joaquin Phoenix [003, 037, 331, 996] among a number of celebrities calling on mistreated goats [399, 181, 891, 696] at an East Yorkshire farm to be rescued') 2023-11-03 00:34:24.602 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 00:34:24.602 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Person:[512, 333, 701, 624],[851, 388, 963, 715],[719, 395, 786, 557],[202, 525, 372, 806],[365, 040, 424, 277],[425, 259, 503, 440],[837, 428, 897, 622].') 2023-11-03 00:34:53.691 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:34:53.692 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[118, 182, 604, 424] The bill for cakes and bubbles - hotel cafe royal london') 2023-11-03 00:35:32.058 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:35:32.059 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'strawberry:[160, 051, 790, 273],[061, 346, 422, 628],[597, 326, 962, 643].') 2023-11-03 00:35:49.943 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:35:49.943 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'a vibrant tropical wedding invitation suite [150, 044, 820, 723] in emerald and mustard, with amazing tropical and stripe prints') 2023-11-03 00:36:04.292 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:36:04.292 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Chair:[087, 551, 259, 777];Desk:[001, 616, 103, 812];Pillow:[782, 491, 908, 632],[805, 509, 993, 669],[684, 485, 789, 612],[671, 494, 815, 617];Bed:[315, 594, 999, 998];Nightstand:[583, 566, 647, 613];Towel:[520, 543, 610, 667].') 2023-11-03 00:36:06.419 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:36:06.419 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '"Romero Creations Tiny Tenor 6 String Guilele Daniel Ho Signature RC-TT6-SM ""Trestles"""') 2023-11-03 00:36:13.665 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:36:13.666 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[046, 391, 875, 847] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a peludoFrame1:[046, 391, 875, 847];Frame2:[071, 089, 625, 975];Frame3:[559, 247, 904, 997];Frame4:[485, 137, 831, 997].') 2023-11-03 00:36:48.649 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:36:48.649 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect Street Lights and Trash bin Can in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Street Lights:[947, 457, 971, 657];Trash bin Can:[822, 580, 845, 640].') 2023-11-03 00:37:19.853 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:37:19.854 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me what is thisFrame1:[319, 433, 415, 752] and track its trajectory.When detailing trajectories in your response, adhere to the Frame t:[xmin,ymin,xmax,ymax] format.', 'This is a black storkFrame1:[319, 433, 415, 752];Frame2:[306, 418, 401, 748];Frame3:[306, 418, 401, 748].') 2023-11-03 00:37:27.948 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:37:27.948 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[782, 497, 953, 791],[014, 000, 223, 260],[021, 254, 209, 491],[025, 491, 195, 750],[031, 754, 206, 997],[267, 014, 470, 258],[275, 256, 440, 506],[279, 752, 451, 997],[281, 502, 457, 758],[514, 014, 721, 247],[517, 241, 687, 500],[528, 493, 703, 747],[534, 747, 703, 997],[765, 000, 962, 256],[775, 254, 951, 502],[779, 752, 951, 997];human face:[081, 022, 151, 135],[093, 512, 154, 637],[096, 760, 167, 868],[117, 287, 159, 402],[334, 012, 406, 147],[339, 506, 409, 627],[343, 758, 409, 864],[353, 264, 407, 393],[589, 760, 657, 866],[590, 029, 657, 143],[593, 275, 659, 383],[837, 268, 903, 364],[848, 756, 912, 875],[850, 502, 903, 620],[853, 027, 914, 145].') 2023-11-03 00:37:43.171 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:37:43.171 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[189, 253, 661, 588] A digger attempts to remove [000, 373, 786, 998] earth around the bow of the Ever Given, which is blocking [325, 385, 993, 996] the Suez Canal, Egypt (25 March 2021') 2023-11-03 00:38:07.235 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:38:07.235 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[152, 164, 253, 519],[675, 251, 709, 378],[795, 249, 833, 414],[288, 087, 311, 145];Frame:[761, 418, 837, 542];Cabinet:[711, 529, 836, 772],[212, 137, 449, 556],[338, 000, 470, 151],[127, 558, 507, 999];Chair:[619, 393, 656, 470],[000, 515, 143, 999],[000, 616, 260, 999],[539, 506, 686, 810];Desk:[283, 790, 998, 1000],[505, 622, 717, 812];Mirror:[521, 155, 840, 509];Wine Glass:[833, 724, 923, 962],[803, 686, 882, 910],[557, 735, 642, 986];Jug:[608, 665, 802, 951],[365, 029, 388, 063];Bowl:[581, 689, 669, 763],[909, 365, 968, 422];Plate:[254, 559, 405, 603],[466, 508, 540, 526],[905, 422, 957, 437],[937, 081, 999, 167];Cup:[726, 482, 748, 522],[716, 487, 737, 540],[910, 704, 963, 782],[921, 135, 952, 167],[935, 000, 983, 046];Bottle:[895, 518, 957, 677],[972, 125, 998, 168],[411, 018, 439, 059],[422, 109, 440, 142].') 2023-11-03 00:38:12.948 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:38:12.948 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Artistic Toronto Lifestyle Photography showing [541, 565, 647, 955] A little girl in [547, 640, 628, 818] a yellow dress eats [565, 626, 589, 673] an ice cream on [361, 671, 791, 887] the steps of a row house in St. John's, Newfoundland") 2023-11-03 00:38:29.280 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:38:29.280 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Bench:[278, 581, 1000, 865];Picture:[392, 076, 603, 586],[879, 051, 998, 584];Person:[285, 273, 599, 870];Necklace:[447, 411, 472, 464];Luggage:[377, 351, 531, 871];Sandals:[554, 822, 601, 886],[287, 817, 369, 869];Cup:[767, 910, 824, 958].') 2023-11-03 00:38:32.885 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:38:32.885 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Cherry,Guitar,Drum and Speaker in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Guitar:[006, 357, 904, 1000];Drum:[671, 417, 841, 545],[875, 465, 1000, 567],[711, 532, 999, 783];Speaker:[844, 306, 999, 476].') 2023-11-03 00:39:07.094 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:39:07.094 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'An image of a pillow that says We Are Just Family with [000, 004, 604, 997] black and white scarf in background and [570, 531, 913, 890] a necklace and [614, 410, 747, 533] coffee on the side.') 2023-11-03 00:39:08.721 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:39:08.721 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Person:[001, 499, 054, 924],[036, 516, 139, 944],[206, 511, 353, 998],[580, 460, 690, 961],[877, 666, 999, 936];Handbag:[151, 866, 237, 997],[328, 728, 400, 864],[561, 928, 726, 1000].') 2023-11-03 00:39:43.109 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:39:43.109 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'stairs:[000, 472, 999, 920];tree:[139, 159, 209, 444],[191, 000, 411, 361],[460, 391, 580, 473],[655, 014, 893, 366].') 2023-11-03 00:39:51.527 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:39:51.528 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[550, 354, 582, 397] and track its trajectory.When detailing trajectories in your response, adhere to the Frame t:[xmin,ymin,xmax,ymax] format.', 'This is a faceFrame1:[550, 354, 582, 397];Frame2:[512, 352, 545, 395];Frame3:[451, 347, 484, 391];Frame4:[412, 350, 445, 393];Frame5:[364, 345, 396, 389].') 2023-11-03 00:40:01.953 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:40:01.953 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'a refined moody bedroom with [553, 213, 994, 728] black walls and [003, 003, 997, 318] a ceiling, with [392, 057, 730, 358] a tassel chandelier, [735, 522, 983, 725] a fireplace and [000, 586, 722, 996] a metal bed plus [816, 374, 937, 493] [000, 286, 164, 582] [684, 371, 751, 451] [613, 379, 678, 451] artworks') 2023-11-03 00:40:14.318 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:40:14.318 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'clothing:[000, 378, 410, 1000];man:[000, 106, 567, 998];human face:[221, 135, 430, 520].') 2023-11-03 00:41:00.701 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:41:00.701 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[417, 319, 501, 554],[755, 343, 849, 584];Sneakers:[000, 510, 024, 543],[050, 519, 081, 543],[509, 501, 534, 558],[526, 544, 572, 583],[603, 575, 638, 601],[763, 538, 791, 582];Gloves:[831, 462, 849, 486];Soccer:[498, 525, 518, 558].') 2023-11-03 00:41:19.882 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:41:19.882 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[860, 722, 894, 861],personFrame1:[076, 394, 112, 488],personFrame1:[639, 598, 672, 720] in this video clip.Use only the categoryFrame t:[xmin,ymin,xmax,ymax] format to structure your answer.', 'personFrame1:[860, 722, 894, 861];Frame2:[828, 720, 875, 856],personFrame1[076, 394, 112, 488];Frame2:No object,personFrame1:[639, 598, 672, 720];Frame2:[614, 598, 651, 712].') 2023-11-03 00:41:25.027 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:41:25.028 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Picture:[275, 001, 409, 245],[456, 068, 546, 265];Lamp:[081, 254, 144, 543],[553, 172, 709, 396];Potted Plant:[115, 407, 200, 541];Nightstand:[031, 520, 230, 772];Carpet:[188, 779, 435, 893];Bed:[167, 303, 947, 912];Pillow:[623, 309, 744, 453];Chair:[582, 296, 793, 472];Slippers:[671, 873, 774, 937].') 2023-11-03 00:41:32.658 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 00:41:32.659 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'wine:[164, 000, 483, 998],[483, 000, 830, 998],[868, 170, 925, 550],[920, 178, 991, 550].') 2023-11-03 00:41:38.368 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:41:38.368 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect willow,human face,woman,girl and clothing in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'human face:[040, 518, 105, 634],[131, 678, 216, 846],[229, 502, 295, 614],[379, 493, 418, 570],[468, 421, 562, 607],[550, 520, 612, 638],[613, 423, 658, 518],[703, 471, 750, 580],[817, 537, 880, 642];woman:[000, 493, 129, 827],[044, 555, 316, 999],[211, 476, 351, 916],[331, 449, 373, 640],[535, 500, 666, 846],[723, 467, 954, 999];girl:[006, 488, 127, 830],[048, 546, 328, 984],[206, 472, 339, 937],[637, 453, 769, 974];clothing:[324, 505, 664, 1000],[046, 430, 081, 494],[061, 783, 323, 986],[219, 598, 329, 841],[292, 567, 330, 635],[343, 515, 471, 665],[605, 501, 676, 658],[647, 580, 763, 999],[731, 594, 928, 999],[789, 540, 825, 619],[890, 415, 999, 799].') 2023-11-03 00:42:09.034 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:42:09.035 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'a box [000, 174, 996, 782] of F Plus dugouts on red couch [004, 000, 995, 992]') 2023-11-03 00:42:29.413 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:42:29.413 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[005, 072, 958, 997] the top of a double chocolate banana bread, with [122, 080, 492, 916] sliced bananas') 2023-11-03 00:42:58.878 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:42:58.878 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Storage box:[489, 227, 562, 298],[657, 232, 734, 306],[820, 137, 899, 207],[492, 297, 559, 367],[657, 131, 725, 202];Person:[258, 296, 511, 919];Bowl:[599, 860, 999, 1000].') 2023-11-03 00:43:01.330 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:43:01.330 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the white pigeons move with head up?When detailing trajectories in your response, adhere to the TrackiFrame t:[xmin,ymin,xmax,ymax] format.', 'There are 2 white pigeons move with head up.Track1frame:1:[469, 145, 551, 323];frame:2:[475, 144, 563, 319];frame:3:[482, 145, 564, 322],Track2frame:1:[371, 141, 436, 304];frame:2:[369, 147, 430, 279];frame:3:[368, 147, 424, 312].') 2023-11-03 00:43:34.575 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:43:34.575 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Chair:[048, 438, 091, 595],[066, 427, 128, 565];Flag:[063, 073, 126, 530],[001, 075, 058, 575],[009, 033, 066, 413];Person:[102, 191, 198, 681],[132, 079, 366, 1000],[304, 213, 367, 671],[327, 253, 417, 689],[391, 186, 452, 343],[458, 164, 562, 490],[372, 237, 543, 999],[536, 280, 727, 1000],[541, 199, 648, 677],[635, 185, 672, 298],[671, 193, 775, 703],[862, 160, 978, 704],[944, 181, 995, 611];Glasses:[456, 309, 519, 359];Tie:[259, 235, 289, 292];Shovel:[682, 707, 708, 759];Knife:[324, 699, 357, 747];Plate:[051, 690, 133, 735],[164, 672, 226, 712],[239, 697, 302, 744],[204, 700, 248, 738],[100, 709, 137, 750],[001, 705, 048, 751],[009, 721, 094, 760],[072, 731, 128, 775],[139, 735, 217, 777],[207, 725, 277, 770],[167, 760, 250, 812],[084, 765, 169, 816],[002, 751, 095, 813];Cell Phone:[691, 231, 718, 297];Projector:[534, 002, 588, 064];Cake:[349, 664, 635, 781].') 2023-11-03 00:43:45.614 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:43:45.615 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[442, 443, 498, 606] and track its trajectory.When detailing trajectories in your response, adhere to the Frame t:[xmin,ymin,xmax,ymax] format.', 'This is a personFrame1:[442, 443, 498, 606];Frame2:[462, 460, 512, 610];Frame3:[471, 410, 523, 627];Frame4:[471, 410, 528, 627];Frame5:[471, 422, 528, 627].') 2023-11-03 00:44:23.091 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:44:23.092 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "a group of men [774, 310, 868, 599] [376, 308, 563, 995] [487, 361, 654, 997] [760, 385, 996, 994] [565, 335, 819, 994] and women [067, 421, 217, 998] [230, 395, 402, 997] dressed in black [276, 527, 403, 998] [024, 481, 135, 994] [614, 479, 818, 995] [389, 426, 561, 997] [810, 563, 998, 996] singing together with banner [277, 010, 697, 482] at back saying 'Commoners'") 2023-11-03 00:44:49.353 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:44:49.353 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'shelf:[560, 086, 868, 392];Person:[369, 014, 595, 637],[602, 144, 801, 589],[695, 262, 924, 748],[732, 398, 1000, 961],[940, 780, 999, 999],[615, 862, 957, 1000],[241, 701, 521, 1000],[001, 409, 467, 1000],[100, 459, 356, 753],[138, 245, 384, 721];Slippers:[744, 684, 788, 749],[695, 594, 734, 648];Sandals:[604, 468, 634, 516];Backpack:[001, 332, 087, 420];Satchel:[079, 292, 158, 426],[868, 246, 973, 454];Blackboard:[001, 103, 261, 356],[580, 001, 876, 113];Board Eraser:[715, 079, 765, 112];Bowl:[308, 416, 368, 483].') 2023-11-03 00:44:59.473 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:44:59.474 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Dumpling,Faucet,Coffee Machine and Sink in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Faucet:[421, 531, 491, 633];Coffee Machine:[001, 508, 057, 639];Sink:[354, 628, 589, 780].') 2023-11-03 00:45:26.467 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:45:26.468 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[050, 466, 127, 743],[134, 463, 234, 748],[226, 507, 292, 781],[319, 515, 380, 752],[510, 519, 601, 778],[690, 546, 744, 766],[738, 516, 807, 768],[798, 519, 865, 770],[899, 549, 948, 778].') 2023-11-03 00:45:29.437 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:45:29.437 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Digital profiling: [197, 035, 616, 998] [345, 023, 841, 995] Kaspersky Lab experts uncover how much [582, 875, 617, 998] your smart watch can tell about you') 2023-11-03 00:45:42.965 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:45:42.965 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect house,mammal and person in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'house:[132, 360, 499, 691],[035, 074, 134, 876],[451, 375, 604, 737],[604, 542, 676, 727],[686, 060, 942, 729];mammal:[127, 531, 188, 849],[188, 590, 208, 716],[206, 596, 232, 702],[233, 539, 278, 834],[300, 571, 318, 735];person:[111, 459, 175, 859],[184, 607, 207, 714],[209, 601, 226, 691],[233, 554, 280, 828],[300, 575, 324, 731],[315, 624, 341, 731].') 2023-11-03 00:45:52.163 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:45:52.164 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[341, 367, 468, 852] [057, 373, 166, 858] [737, 400, 870, 856] [558, 378, 687, 852] [464, 371, 581, 852] [620, 396, 754, 856] [242, 367, 370, 853] [150, 372, 273, 858] Schoolchildren wearing [825, 279, 863, 336] [687, 433, 727, 487] [606, 422, 645, 479] [290, 409, 330, 467] [199, 418, 237, 472] [808, 440, 852, 499] [524, 414, 564, 472] [118, 417, 159, 479] [413, 414, 454, 474] face masks walk across a street in Jinan, China, amid heavy air pollution in 2015.') 2023-11-03 00:46:33.458 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:46:33.458 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, can you tell me what thisframe:1:[451, 001, 580, 305];frame:2:[463, 202, 600, 498];frame:3:[528, 176, 654, 412] is?Please briefly answer the question based on the provided image frames and trajectory coordinates.', 'This is a/an man in blue clothes riding a bicycle') 2023-11-03 00:46:44.418 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:46:44.419 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'People [208, 440, 329, 998] [390, 464, 491, 889] [608, 447, 739, 930] [293, 553, 413, 996] [511, 459, 619, 900] gather beneath a 26-foot statue [429, 050, 600, 552] inspired by the iconic kiss between a nurse [832, 426, 987, 995] and a sailor [468, 436, 540, 783] in Times Square August 12, 2010 in New York City....') 2023-11-03 00:47:02.375 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:47:02.376 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3,frame4 and frame5,can you point out the trajectory of the left cars in the counter direction of ours?Stick to the TrackiFrame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'There is one left cars in the counter direction of ours.Track1frame4:[156, 522, 316, 834];frame5:[036, 522, 283, 954].') 2023-11-03 00:47:08.343 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:47:08.344 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The nursery [015, 008, 988, 989] is a restful space with pale-pink walls and gray accents.') 2023-11-03 00:47:38.112 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:47:38.112 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[149, 199, 903, 881] The Nike+ FuelBand is an activity tracker that can be worn on the wrist and integrated with Apple iPhone or iPad device.') 2023-11-03 00:48:43.702 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:48:43.703 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A sheet pan [006, 036, 928, 998] with two forks [346, 316, 992, 569] [370, 513, 886, 916], full of roasted cauliflower [353, 227, 506, 317] [045, 532, 207, 633] [516, 082, 703, 176], broccoli [139, 759, 282, 845] [653, 851, 844, 962] [004, 450, 208, 542] [285, 111, 476, 223], brussle sprouts [239, 857, 378, 933], carrots [300, 909, 440, 977] [509, 196, 658, 284], sweet potatoes, and sausage') 2023-11-03 00:49:44.145 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:49:44.145 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Person:[155, 318, 866, 999],[166, 302, 823, 1000],[039, 537, 149, 736].') 2023-11-03 00:50:01.838 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:50:01.839 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the zebra move from left-side to right-side.Maintain consistency in your answer by using the TrackiFrame t:[xmin,ymin,xmax,ymax] format for trajectories.', 'There are 3 zebra move from left-side to right-side.Track1frame:1:[028, 198, 855, 997];frame:2:[021, 220, 863, 997];frame:3:[053, 118, 884, 997],Track2frame:1:[000, 160, 231, 997];frame:2:[000, 176, 225, 997];frame:3:[000, 070, 219, 997],Track3frame:1:[235, 271, 807, 997];frame:2:[198, 558, 796, 997];frame:3:[173, 277, 748, 997].') 2023-11-03 00:50:03.731 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:50:03.732 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[142, 329, 358, 503] [538, 670, 815, 875] [111, 639, 449, 997] [638, 311, 813, 524] Bears waiting for the stock market to pullback') 2023-11-03 00:50:41.691 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:50:41.692 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'person:[000, 685, 075, 999],[000, 462, 076, 607],[000, 540, 150, 921],[049, 746, 266, 999],[075, 481, 175, 704],[123, 554, 301, 999],[230, 494, 381, 938],[296, 744, 520, 999],[325, 560, 482, 918],[364, 518, 463, 712],[459, 356, 515, 502],[503, 516, 635, 763],[505, 584, 627, 999],[532, 765, 742, 999],[627, 532, 748, 807],[658, 603, 788, 999],[761, 556, 866, 908],[769, 720, 960, 999],[852, 618, 965, 934],[893, 565, 963, 760],[955, 778, 999, 999];clothing:[000, 569, 030, 705],[000, 861, 059, 999],[000, 489, 071, 639],[004, 637, 133, 939],[048, 901, 279, 999],[086, 541, 174, 711],[138, 631, 293, 999],[158, 602, 246, 661],[246, 559, 357, 919],[300, 889, 498, 999],[333, 649, 476, 920],[368, 607, 456, 682],[470, 366, 505, 482],[513, 677, 623, 999],[521, 581, 629, 741],[533, 861, 728, 999],[635, 617, 712, 861],[661, 703, 789, 999],[763, 863, 963, 999],[775, 617, 861, 899],[871, 703, 978, 905],[910, 639, 963, 733],[986, 877, 999, 989];fixed-wing aircraft:[208, 077, 820, 375].') 2023-11-03 00:50:57.062 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:50:57.062 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'beer:[144, 000, 281, 129],[616, 000, 759, 272];food:[254, 338, 428, 522],[291, 378, 723, 740],[350, 299, 594, 474],[518, 303, 650, 403],[661, 343, 750, 507].') 2023-11-03 00:51:24.960 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 00:51:24.961 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'boy:[193, 412, 455, 986];man:[422, 211, 934, 984];clothing:[175, 552, 463, 990],[548, 444, 966, 999];human face:[309, 495, 423, 693],[612, 278, 712, 519].') 2023-11-03 00:51:34.958 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:51:34.958 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Couch:[085, 701, 446, 958];shelf:[001, 351, 086, 999];Stool:[449, 747, 492, 781];Person:[577, 583, 637, 803].') 2023-11-03 00:51:45.180 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:51:45.180 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the elephant play then shift behind?Ensure the trajectories in your answer follow the TrackiFrame t:[xmin,ymin,xmax,ymax] structure.', 'There is one elephant play then shift behind.Track1frame:1:[696, 441, 790, 675];frame:2:[682, 440, 790, 666];frame:3:[675, 453, 797, 674].') 2023-11-03 00:51:47.623 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:51:47.623 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'man:[035, 015, 773, 983];human face:[209, 000, 700, 936].') 2023-11-03 00:52:15.413 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:52:15.413 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'black and white photograph of [405, 042, 612, 752] [151, 073, 325, 751] [764, 077, 989, 834] [608, 125, 777, 843] five individuals in a row wearing [430, 162, 606, 423] [322, 124, 480, 373] [796, 204, 932, 435] [175, 191, 321, 409] [616, 232, 766, 435] white blouses and [328, 281, 488, 608] [189, 339, 319, 743] [817, 377, 990, 803] [622, 373, 780, 782] [472, 316, 617, 690] dark skirts holding or sitting on [396, 390, 614, 962] [039, 343, 356, 936] [161, 376, 555, 948] [746, 333, 946, 985] bicycles outdoors') 2023-11-03 00:52:16.988 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:52:16.990 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, can you tell me what thisframe:1:[449, 685, 651, 872];frame:2:[471, 683, 648, 862];frame:3:[434, 677, 643, 879] is?Briefly articulate your response by taking cues from the provided image frames and trajectory coordinates.', 'This is a/an the small white dog is crawling to the left.') 2023-11-03 00:52:46.651 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:52:46.651 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Sneakers:[942, 839, 999, 889],[740, 768, 854, 814],[494, 838, 600, 900],[382, 848, 472, 873],[363, 815, 410, 843],[161, 845, 223, 873],[099, 847, 141, 874],[040, 847, 085, 873];Leather Shoes:[662, 841, 734, 865];Person:[396, 318, 853, 897],[892, 266, 999, 889],[911, 526, 1000, 639],[854, 550, 969, 875],[001, 487, 084, 793],[030, 641, 221, 874],[135, 657, 300, 867],[551, 641, 738, 867].') 2023-11-03 00:53:22.078 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:53:22.078 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Trolley and Boat in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Trolley:[413, 667, 475, 740];Boat:[899, 675, 1000, 701].') 2023-11-03 00:53:22.996 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:53:22.997 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Banner of antalya city in flat line trendy style. [059, 357, 937, 815] all buildings separated and customizable. line art.') 2023-11-03 00:53:24.066 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:53:24.067 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Grapefruit,Bus and Street Lights in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Bus:[088, 283, 875, 881];Street Lights:[452, 205, 522, 328].') 2023-11-03 00:53:51.724 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:53:51.724 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Sailboat:[826, 659, 871, 767],[632, 668, 680, 784],[382, 631, 422, 713],[263, 577, 340, 791],[240, 497, 328, 699],[000, 536, 022, 699];Boat:[672, 664, 807, 755].') 2023-11-03 00:54:14.622 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:54:14.622 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'human body:[738, 478, 996, 1000],[201, 652, 393, 999],[315, 520, 711, 868],[623, 675, 778, 908];human hair:[550, 531, 710, 743],[223, 425, 308, 521],[223, 490, 315, 641],[534, 477, 624, 528],[820, 438, 971, 622];human head:[826, 450, 974, 710],[028, 862, 093, 997],[118, 769, 171, 915],[199, 424, 306, 598],[235, 495, 329, 652],[490, 388, 620, 636],[568, 529, 685, 736];man:[792, 438, 1000, 993],[023, 805, 122, 999],[074, 426, 305, 862],[122, 811, 186, 999],[158, 493, 488, 999],[301, 384, 631, 999],[565, 523, 778, 912];clothing:[810, 543, 1000, 995],[188, 645, 369, 998],[214, 674, 258, 808],[418, 619, 714, 998],[625, 670, 765, 909];human arm:[778, 679, 963, 941],[045, 705, 210, 833],[080, 943, 143, 999],[186, 720, 337, 944],[318, 563, 415, 638],[341, 642, 621, 844],[657, 818, 738, 928],[710, 761, 800, 935];human hand:[321, 597, 438, 724],[077, 718, 131, 816],[156, 832, 202, 918],[308, 564, 391, 655],[697, 732, 796, 874];human face:[023, 892, 095, 980],[197, 455, 228, 605],[492, 480, 575, 633],[554, 619, 608, 739],[838, 553, 895, 703];microphone:[167, 751, 189, 933],[303, 495, 345, 644].') 2023-11-03 00:54:55.459 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:54:55.460 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[165, 617, 256, 673] [059, 592, 144, 649] [237, 671, 343, 735] Flowers are placed on and next to [000, 106, 996, 807] a fence outside the Las Vegas Village across from [227, 065, 716, 219] Mandalay Bay Resort and Casino as a tribute to those killed...') 2023-11-03 00:55:09.786 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:55:09.787 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Class of 2019 students [499, 313, 685, 667] [399, 312, 576, 677] laugh as they enjoy the Dumbo the Flying Elephant [002, 490, 885, 998] ride.') 2023-11-03 00:55:36.836 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:55:36.837 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'a moody bedroom with [000, 002, 997, 554] grey walls, [093, 277, 997, 862] a black bed, [002, 458, 170, 670] nightstand and [561, 004, 832, 172] a pendant lamp, pink and black bedding and [023, 672, 310, 986] a pink ottoman') 2023-11-03 00:56:09.679 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:56:09.679 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Chinese operating system [001, 091, 994, 943] that’s trying to replace Windows for homeland users') 2023-11-03 00:56:11.732 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:56:11.733 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Portrait [213, 108, 798, 977] of a young woman in a beautiful English dress. The light scheme of Rembrandt, where the only light source - light from th. Portrait of a young woman in') 2023-11-03 00:56:25.802 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:56:25.802 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Desk:[181, 656, 997, 1000];Person:[714, 375, 999, 825],[511, 520, 749, 865],[668, 001, 820, 586],[529, 168, 679, 670],[436, 185, 577, 760],[295, 106, 463, 838],[258, 422, 478, 884],[134, 354, 403, 940],[001, 185, 209, 998];Hat:[600, 522, 687, 607],[853, 372, 918, 462],[038, 189, 202, 343],[360, 424, 480, 543],[478, 186, 553, 245].') 2023-11-03 00:56:32.505 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:56:32.505 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Cup:[001, 003, 096, 297];Basin:[004, 138, 154, 358],[091, 218, 895, 852],[916, 163, 995, 408];Plate:[083, 030, 316, 162],[878, 245, 995, 465],[849, 551, 995, 993],[199, 715, 817, 959];Pasta:[143, 079, 836, 801].') 2023-11-03 00:56:51.083 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:56:51.084 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect human face,human hand,human head,woman,human arm and human leg in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'human face:[000, 035, 065, 195],[000, 000, 913, 276],[240, 000, 317, 108],[577, 005, 632, 106],[623, 091, 722, 270],[740, 031, 821, 214];human hand:[207, 410, 260, 475],[347, 386, 400, 455],[400, 399, 455, 507],[461, 296, 493, 375],[705, 809, 772, 970],[860, 643, 912, 792],[948, 309, 998, 389];human head:[730, 007, 842, 194],[000, 000, 077, 229],[000, 147, 080, 533],[245, 000, 337, 108],[571, 000, 636, 106],[621, 041, 738, 276],[867, 000, 938, 046];woman:[000, 141, 207, 998],[000, 001, 228, 779],[108, 000, 233, 302],[138, 000, 440, 998],[383, 000, 483, 570],[443, 000, 515, 305],[477, 000, 650, 916],[728, 000, 910, 916],[825, 000, 998, 718];human arm:[123, 274, 218, 774],[097, 619, 213, 992],[123, 013, 181, 242],[146, 173, 268, 481],[191, 031, 233, 173],[217, 059, 243, 145],[336, 154, 441, 444],[397, 052, 443, 507],[465, 052, 493, 380],[487, 076, 512, 283],[521, 110, 575, 365],[546, 305, 741, 983],[706, 292, 807, 964],[857, 231, 915, 794],[951, 067, 998, 389];human leg:[125, 130, 998, 998].') 2023-11-03 00:56:51.476 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:56:51.477 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Street Lights in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Street Lights:[071, 002, 301, 1000].') 2023-11-03 00:56:58.743 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:56:58.744 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Fire Truck in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Fire Truck:[000, 003, 844, 903].') 2023-11-03 00:57:08.520 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:57:08.521 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[519, 200, 552, 343] and track its trajectory.Stick to the Frame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'This is a personFrame1:[519, 200, 552, 343];Frame2:[436, 238, 475, 400];Frame3:[508, 341, 555, 468];Frame4:[617, 365, 685, 595];Frame5:[625, 355, 704, 595].') 2023-11-03 00:57:14.639 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:57:14.639 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Sneakers,Hockey Stick and Gloves in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Sneakers:[706, 779, 793, 914],[656, 831, 811, 971];Hockey Stick:[158, 454, 677, 580];Gloves:[655, 368, 739, 534],[425, 441, 530, 563].') 2023-11-03 00:57:53.776 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:57:53.776 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Glasses:[023, 151, 076, 234];Person:[001, 035, 076, 725],[363, 472, 548, 731],[081, 382, 379, 733],[074, 450, 169, 572],[554, 537, 682, 1000],[685, 679, 766, 870],[637, 661, 685, 777],[759, 721, 849, 853],[852, 697, 981, 1000],[928, 629, 1000, 999],[530, 008, 586, 479],[638, 001, 730, 496],[738, 471, 830, 697];Flag:[031, 468, 081, 558],[378, 546, 442, 655],[805, 635, 866, 722],[337, 521, 387, 633],[852, 643, 961, 749],[928, 434, 974, 740],[001, 581, 616, 998],[598, 773, 846, 1000].') 2023-11-03 00:57:58.377 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:57:58.378 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[360, 312, 552, 505] and track its trajectory.Ensure the trajectories in your answer follow the Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a autogiroFrame1:[360, 312, 552, 505];Frame2:[382, 275, 578, 500];Frame3:[348, 296, 545, 530];Frame4:[337, 304, 540, 547].') 2023-11-03 00:57:59.158 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:57:59.158 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[612, 450, 977, 1000] and track its trajectory.When detailing trajectories in your response, adhere to the Frame t:[xmin,ymin,xmax,ymax] format.', 'This is a gray coin being placed on a planeFrame1:[612, 450, 977, 1000];Frame2:[596, 437, 941, 993];Frame3:[602, 396, 943, 850];Frame4:[586, 328, 929, 900].') 2023-11-03 00:58:01.745 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:58:01.746 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'snack:[648, 424, 865, 680],[272, 515, 888, 851];tableware:[060, 343, 984, 921];sushi:[369, 540, 591, 757],[662, 430, 828, 617].') 2023-11-03 00:58:29.261 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:58:29.262 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[002, 007, 995, 987] graph showing the number of people in state prison and local jails per 100,000 residents in New Mexico from 1978 to 2015') 2023-11-03 00:58:44.703 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:58:44.703 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video clip including frame1,frame2 and frame3, what is this objectframe:1:[445, 311, 626, 617];frame:2:[464, 312, 649, 636];frame:3:[448, 254, 651, 603] doing?Using the given image frames and trajectory coordinates, please provide a concise answer.Refer to the supplied image frames and trajectory coordinates for a short response to the question.Please give a succinct reply by referring to the image frames and trajectory coordinates provided.Make sure your response is brief and references the image frames and trajectory coordinates you've been given.Draw upon the provided image frames and trajectory coordinates for a concise answer to the question.Relying on the image frames and trajectory coordinates shared, please keep your answer brief.", 'This is a/an four domestic fowl participating in joint motion.') 2023-11-03 00:58:49.543 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:58:49.543 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A picnic with ham, butter [385, 540, 477, 642], ciabatta, grapes [627, 702, 867, 886] and juice [535, 262, 625, 569]') 2023-11-03 00:58:51.586 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:58:51.586 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Side Table in this image.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Side Table:[443, 491, 567, 656].') 2023-11-03 00:58:55.405 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:58:55.406 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Backpack:[283, 376, 343, 490],[277, 492, 334, 655],[410, 471, 488, 704],[601, 500, 647, 615],[648, 473, 727, 615],[705, 419, 795, 590];Sneakers:[391, 856, 446, 890],[435, 918, 485, 968],[758, 889, 831, 937],[822, 883, 869, 931],[715, 803, 755, 834];Bottle:[451, 813, 481, 902];Person:[883, 256, 999, 1000],[436, 353, 637, 997],[109, 307, 215, 598].') 2023-11-03 00:59:21.952 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 00:59:21.952 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Umbrella:[860, 489, 999, 584];Car:[001, 568, 182, 679],[325, 548, 707, 767],[655, 579, 925, 731],[810, 573, 1000, 733].') 2023-11-03 00:59:31.976 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:59:31.976 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'boy:[036, 065, 313, 993];human body:[223, 124, 579, 984],[045, 062, 331, 993],[545, 783, 584, 994],[574, 783, 598, 984],[585, 779, 629, 994],[610, 056, 953, 999],[641, 000, 999, 993];human hair:[293, 099, 448, 293],[092, 040, 273, 314],[544, 765, 582, 831],[772, 051, 836, 178];mammal:[200, 152, 580, 955],[000, 053, 329, 999],[463, 668, 521, 817],[505, 746, 565, 999],[545, 784, 594, 999],[575, 771, 594, 865],[583, 773, 626, 999],[606, 057, 951, 999],[640, 000, 999, 999],[648, 688, 699, 999];man:[239, 116, 568, 975],[625, 051, 950, 999],[640, 000, 999, 999];human arm:[711, 044, 947, 763],[197, 318, 264, 482],[226, 504, 328, 999],[466, 318, 590, 774],[583, 837, 607, 932],[606, 361, 706, 999],[607, 787, 629, 895],[910, 337, 955, 704];human hand:[519, 694, 593, 795],[605, 930, 656, 999],[643, 704, 790, 945];tree:[468, 000, 945, 728];sports uniform:[030, 387, 322, 999],[211, 276, 556, 999],[625, 310, 959, 999],[854, 000, 999, 999].') 2023-11-03 00:59:41.006 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:59:41.006 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Stroller:[169, 424, 270, 699],[471, 521, 581, 690];Sneakers:[163, 903, 221, 964],[270, 878, 325, 934],[560, 698, 601, 745],[566, 789, 624, 837],[595, 819, 669, 882];Backpack:[002, 316, 109, 655];Person:[539, 246, 670, 881],[520, 237, 602, 749],[467, 289, 514, 463],[414, 273, 518, 655],[340, 460, 425, 722],[234, 449, 335, 933],[209, 242, 326, 485],[002, 205, 222, 998].') 2023-11-03 00:59:52.990 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 00:59:52.991 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect human ear,human mouth,boy,human head and can opener in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'human ear:[136, 460, 243, 608],[856, 458, 905, 551];human mouth:[453, 626, 674, 686];boy:[104, 183, 953, 718];human head:[099, 045, 883, 722].') 2023-11-03 01:00:01.133 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:00:01.134 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A cow herd of [002, 481, 104, 792] [289, 250, 565, 934] [548, 283, 778, 887] [086, 344, 401, 925] [745, 347, 961, 745] elephants cross [000, 443, 999, 998] the plains, with [524, 649, 624, 882] the youngster member protected by the rest of the her') 2023-11-03 01:00:09.665 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:00:09.666 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'man:[102, 433, 242, 1000],[000, 360, 036, 460],[010, 313, 047, 394],[028, 348, 078, 497],[060, 263, 111, 387],[136, 374, 231, 565],[156, 291, 183, 394],[206, 326, 236, 521],[218, 303, 244, 481],[262, 313, 300, 469],[290, 311, 317, 374],[300, 356, 345, 480],[321, 414, 416, 769],[341, 360, 397, 468],[352, 285, 405, 430],[411, 497, 507, 725],[413, 386, 461, 477],[469, 302, 507, 430],[505, 305, 530, 409],[520, 493, 613, 999],[544, 419, 570, 488],[623, 377, 676, 487],[777, 305, 798, 390],[820, 370, 873, 651],[908, 421, 972, 823],[925, 441, 999, 879],[974, 414, 999, 511];jeans:[538, 840, 618, 996],[051, 704, 116, 897],[147, 783, 236, 999],[267, 675, 360, 758],[328, 564, 418, 759],[590, 883, 693, 999],[831, 530, 877, 649];woman:[001, 415, 061, 745],[055, 414, 143, 738],[113, 287, 139, 407],[141, 359, 174, 440],[245, 448, 281, 569],[259, 508, 363, 758],[263, 483, 327, 609],[399, 288, 436, 456],[415, 512, 445, 592],[455, 297, 482, 424],[471, 481, 538, 633],[648, 424, 717, 664],[687, 362, 710, 434],[718, 397, 788, 717],[804, 370, 829, 484];tree:[000, 000, 999, 825].') 2023-11-03 01:00:14.728 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:00:14.729 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[823, 207, 986, 722],[136, 105, 409, 982],[482, 247, 533, 617],[486, 234, 603, 612];clothing:[136, 156, 340, 907],[489, 375, 594, 637],[840, 430, 993, 691];man:[135, 108, 413, 973];girl:[477, 230, 605, 622],[480, 250, 536, 622],[832, 208, 994, 690];human face:[191, 154, 231, 234],[478, 277, 501, 354],[502, 277, 530, 357].') 2023-11-03 01:00:55.361 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:00:55.361 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'person:[153, 513, 371, 889],[271, 555, 489, 891],[485, 585, 686, 785],[672, 547, 790, 790],[719, 692, 987, 964];human head:[220, 521, 315, 646];mammal:[170, 515, 373, 891],[300, 580, 488, 836],[493, 609, 693, 812],[663, 554, 790, 768];clothing:[166, 637, 371, 877],[334, 700, 484, 805],[496, 687, 686, 815].') 2023-11-03 01:01:04.965 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:01:04.965 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'chair:[209, 546, 384, 922],[000, 705, 132, 999],[143, 519, 288, 859],[320, 507, 419, 782],[380, 527, 534, 861],[623, 621, 897, 999],[746, 563, 968, 999],[901, 897, 999, 999];office building:[082, 010, 909, 936];table:[751, 658, 1000, 982],[105, 481, 334, 719],[231, 558, 436, 861].') 2023-11-03 01:01:41.718 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:01:41.719 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'tree:[339, 298, 460, 638],[000, 327, 093, 592],[003, 450, 225, 709],[126, 416, 208, 550],[158, 570, 243, 635],[245, 526, 322, 637],[443, 432, 525, 670],[510, 518, 560, 647],[592, 495, 688, 671],[703, 573, 765, 681],[721, 520, 788, 652],[780, 513, 851, 681],[838, 534, 940, 675],[936, 510, 999, 685].') 2023-11-03 01:02:28.291 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:02:28.292 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'human body:[060, 128, 697, 986];human hair:[224, 024, 439, 261];human head:[220, 017, 455, 403];man:[034, 122, 546, 935];glasses:[249, 219, 442, 327];human hand:[181, 784, 434, 921],[480, 785, 615, 964];christmas tree:[696, 000, 999, 854];human ear:[235, 170, 275, 253];human face:[270, 115, 429, 421];human nose:[353, 287, 399, 355].') 2023-11-03 01:02:58.551 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:02:58.551 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[590, 488, 732, 669] A woman checks [648, 488, 678, 541] her phone and lounges in [576, 545, 740, 702] a beach chair next to [305, 171, 354, 318] a sign that reminds beachgoers to practice physical distancing on May 31, 2020 in Ocean City.') 2023-11-03 01:03:39.726 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:03:39.727 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect human face,drum,boy and clothing in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'human face:[051, 018, 179, 112],[442, 219, 638, 421],[656, 070, 826, 193],[689, 000, 811, 050],[918, 016, 998, 134];drum:[000, 600, 325, 909];boy:[553, 041, 931, 449],[628, 000, 947, 250];clothing:[000, 338, 281, 854],[000, 100, 284, 500],[126, 360, 998, 999],[198, 023, 623, 633],[724, 200, 955, 375],[775, 040, 964, 288].') 2023-11-03 01:03:43.550 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:03:43.551 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Migrant tents [083, 435, 138, 499] [333, 448, 425, 553] [194, 450, 274, 540] [758, 526, 888, 733] [871, 522, 985, 756] [668, 511, 775, 700] [489, 486, 633, 654] under a bridge [015, 026, 985, 546] in Europe') 2023-11-03 01:04:14.709 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:04:14.710 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Person:[057, 071, 130, 126],[202, 154, 261, 202],[051, 262, 122, 310],[215, 340, 275, 392],[229, 535, 285, 589],[050, 643, 101, 692],[189, 734, 241, 776],[190, 836, 258, 885],[071, 899, 120, 946],[715, 443, 781, 496],[686, 088, 752, 136],[872, 340, 919, 394],[883, 530, 932, 574],[742, 634, 792, 684],[873, 696, 932, 761],[728, 827, 786, 880],[703, 262, 770, 311].') 2023-11-03 01:04:27.206 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:04:27.206 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'man:[722, 493, 981, 992],[000, 525, 347, 999],[347, 650, 481, 924],[550, 664, 622, 792],[787, 567, 911, 659];clothing:[722, 575, 989, 990],[000, 617, 348, 999],[348, 680, 472, 926],[454, 696, 504, 842],[555, 682, 618, 767],[681, 699, 733, 767].') 2023-11-03 01:04:31.928 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:04:31.929 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video clip including frame1,frame2 and frame3,what is thisFrame1:[294, 280, 654, 758] and track its trajectory.If you're including trajectory details in your reply, the Frame t:[xmin,ymin,xmax,ymax] format is imperative.", 'This is a personFrame1:[294, 280, 654, 758];Frame2:[281, 247, 618, 688];Frame3:[268, 270, 577, 696].') 2023-11-03 01:04:58.884 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:04:58.884 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Fan,Lamp,Side Table and Blackboard/Whiteboard in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Fan:[321, 001, 622, 099];Lamp:[435, 001, 499, 146],[745, 491, 839, 653];Side Table:[734, 624, 837, 785].') 2023-11-03 01:06:07.368 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:06:07.369 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect jeans,woman,man and footwear in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'jeans:[033, 506, 222, 898],[215, 436, 365, 800],[745, 793, 903, 999];woman:[363, 110, 999, 999];man:[000, 088, 287, 998];footwear:[028, 881, 136, 986],[148, 880, 246, 963],[216, 788, 293, 848],[290, 783, 372, 868].') 2023-11-03 01:06:20.294 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:06:20.294 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Horse in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Horse:[315, 298, 740, 957].') 2023-11-03 01:06:32.719 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:06:32.719 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[566, 400, 745, 703] and track its trajectory.Incorporate the Frame t:[xmin,ymin,xmax,ymax] format when presenting trajectories in your response.', 'This is a antelopeFrame1:[566, 400, 745, 703];Frame2:[416, 480, 528, 721];Frame3:[242, 573, 488, 804];Frame4:[346, 462, 507, 718].') 2023-11-03 01:06:38.428 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:06:38.429 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[393, 204, 615, 682] A woman with [521, 201, 583, 288] blue hair carries [404, 315, 516, 482] a paper bag and a pair of [272, 804, 372, 888] shoes as she steps among hundreds of pairs of [094, 628, 148, 672] [420, 689, 492, 741] [918, 735, 993, 795] [171, 722, 263, 772] [819, 600, 876, 645] [757, 653, 821, 706] [859, 933, 945, 996] [617, 772, 693, 846] [021, 546, 089, 597] [777, 755, 851, 821] [618, 657, 691, 729] shoes laid out in a grid in the public square.') 2023-11-03 01:06:54.776 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 01:06:54.776 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person in this image.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Person:[182, 337, 258, 663].') 2023-11-03 01:08:15.275 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:08:15.276 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Cup,Spoon and Plate in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Cup:[372, 133, 542, 221];Spoon:[002, 739, 068, 990],[065, 770, 197, 992];Plate:[501, 765, 998, 852].') 2023-11-03 01:08:22.669 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:08:22.670 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Guitar:[447, 401, 906, 623];Drum:[823, 117, 999, 557],[815, 001, 957, 093],[716, 001, 826, 060],[734, 037, 952, 270];Microphone:[920, 410, 948, 447];Watch:[289, 533, 323, 572];Person:[054, 092, 443, 999],[419, 074, 717, 999].') 2023-11-03 01:08:23.370 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:08:23.371 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[434, 420, 477, 491] in this video clip.Do not deviate from the established classFrame t:[xmin,ymin,xmax,ymax] format in your answer.', 'carFrame1:[434, 420, 477, 491];Frame2:[431, 407, 476, 482].') 2023-11-03 01:08:33.507 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:08:33.508 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'If you don’t sign up your kids for cryonics then you are a lousy parent. [133, 369, 866, 577] Eliezer Yudkowsky Picture Quote') 2023-11-03 01:08:46.954 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:08:46.955 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Flower,Cup and Desk in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Flower:[058, 207, 196, 419];Cup:[002, 900, 090, 998];Desk:[120, 505, 332, 726],[818, 298, 893, 415],[166, 106, 251, 206].') 2023-11-03 01:08:51.051 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:08:51.051 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Fan,Desk,Person,Radiator and Towel in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Fan:[140, 388, 214, 471];Desk:[186, 233, 306, 370],[097, 306, 289, 515],[038, 456, 264, 801],[001, 700, 170, 999],[425, 568, 873, 920],[811, 822, 997, 999],[742, 237, 916, 447],[477, 197, 647, 373],[648, 182, 796, 327];Person:[320, 404, 429, 826],[283, 302, 380, 655],[246, 311, 327, 600],[391, 262, 489, 608],[360, 147, 423, 410],[501, 243, 605, 588],[485, 235, 545, 456],[562, 158, 641, 318],[810, 417, 970, 661],[150, 235, 223, 325];Radiator:[028, 485, 082, 714];Towel:[876, 585, 998, 804].') 2023-11-03 01:08:54.574 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:08:54.574 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'human head:[496, 181, 641, 420],[265, 234, 383, 481];mammal:[436, 234, 738, 888],[000, 000, 118, 570],[060, 226, 424, 912],[104, 000, 258, 522],[413, 000, 575, 413],[413, 000, 580, 537],[750, 000, 986, 474];human arm:[641, 452, 757, 784],[038, 000, 101, 374],[131, 000, 218, 128],[249, 481, 330, 738],[354, 435, 516, 750],[420, 000, 514, 185],[476, 025, 518, 131],[495, 035, 567, 153];human hand:[714, 610, 765, 798],[087, 300, 118, 376],[278, 664, 314, 743],[355, 662, 415, 745],[359, 682, 414, 745];human leg:[034, 230, 112, 567],[061, 670, 317, 904],[094, 214, 256, 515],[162, 353, 205, 474],[296, 690, 418, 911],[407, 153, 465, 426],[433, 164, 493, 396],[445, 196, 512, 526],[490, 672, 583, 945],[595, 656, 739, 936],[737, 198, 850, 414],[814, 138, 998, 478];human hair:[261, 236, 390, 544],[490, 173, 649, 403];girl:[063, 235, 426, 914],[360, 170, 775, 948];footwear:[062, 767, 175, 906],[303, 697, 384, 900],[516, 700, 580, 945],[600, 808, 737, 922];human face:[276, 306, 355, 452],[533, 270, 612, 408].') 2023-11-03 01:09:05.362 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:09:05.362 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[084, 007, 918, 977] Detox SuperTonic for diet support: organic, probiotic. 11 servings per bottle. No shipping charges with minimum. Pure juice of raw fermented sauerkraut, unpasteurized, kosher, vegan, gluten free.') 2023-11-03 01:10:46.107 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:10:46.108 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A young man [599, 085, 996, 767] chats to women [001, 328, 283, 995] [248, 016, 477, 495] making tortilla [591, 446, 790, 595] [497, 435, 644, 516] [364, 461, 428, 515] in a cafe') 2023-11-03 01:10:49.435 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:10:49.435 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the cow staying then moving forward in this video clip?For clarity, represent trajectories using the TrackiFrame t:[xmin,ymin,xmax,ymax] format in your response.', 'There is one cow staying then moving forward.Track1frame:1:[367, 292, 648, 678];frame:2:[367, 292, 648, 678];frame:3:[364, 290, 647, 680].') 2023-11-03 01:11:16.439 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:11:16.439 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect Desk and Chair in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Desk:[000, 502, 512, 944],[427, 438, 597, 525],[741, 559, 903, 694],[734, 445, 787, 507],[927, 502, 995, 582],[880, 465, 999, 496];Chair:[743, 420, 786, 500],[739, 441, 791, 559],[757, 496, 864, 742],[754, 610, 942, 998].') 2023-11-03 01:11:20.043 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:11:20.043 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Gloves:[886, 448, 969, 549],[388, 487, 442, 576],[102, 370, 145, 451];Boots:[635, 747, 686, 835];Person:[368, 425, 687, 834],[115, 286, 349, 830],[601, 305, 721, 836],[677, 199, 839, 825],[814, 253, 999, 817],[814, 243, 901, 615],[670, 250, 726, 583],[485, 295, 591, 607],[427, 318, 510, 518],[403, 229, 513, 627],[312, 293, 446, 815];Frame:[389, 003, 637, 322].') 2023-11-03 01:11:37.300 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:11:37.301 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'person:[261, 435, 352, 865],[523, 376, 656, 956],[612, 345, 686, 575],[968, 320, 999, 537];clothing:[253, 450, 359, 848],[537, 461, 656, 942],[627, 408, 680, 571],[974, 397, 999, 527];horse:[172, 408, 371, 888],[490, 424, 766, 824],[494, 547, 722, 899],[840, 381, 999, 891];human hair:[286, 400, 349, 495].') 2023-11-03 01:13:00.373 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:13:00.374 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Guitar,Desk and Person in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Guitar:[239, 491, 377, 970],[574, 343, 820, 713];Desk:[312, 575, 385, 733];Person:[002, 322, 139, 714],[153, 138, 380, 999],[503, 202, 760, 1000],[623, 251, 750, 858].') 2023-11-03 01:13:13.802 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:13:13.802 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the the chopper in flight, gearing up for a landing maneuver.?Always default to the TrackiFrame t:[xmin,ymin,xmax,ymax] pattern when discussing trajectories in your answer.', 'There is one the chopper in flight, gearing up for a landing maneuver..Track1frame:1:[568, 409, 791, 505];frame:2:[586, 413, 813, 523];frame:3:[592, 404, 818, 526].') 2023-11-03 01:13:27.223 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:13:27.223 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[225, 143, 704, 826] and track its trajectory.Any trajectory details should be arranged according to the Frame t:[xmin,ymin,xmax,ymax] format for clarity.', 'This is a european hareFrame1:[225, 143, 704, 826];Frame2:[293, 135, 873, 812];Frame3:[111, 273, 612, 825];Frame4:[264, 251, 588, 821];Frame5:[349, 113, 582, 800].') 2023-11-03 01:13:31.914 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:13:31.914 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Hat:[067, 025, 503, 329];Person:[002, 022, 852, 998],[002, 612, 050, 878],[793, 238, 1000, 742],[564, 386, 769, 736],[697, 374, 816, 777],[457, 471, 526, 579],[531, 377, 600, 552],[542, 361, 645, 600],[833, 446, 917, 991],[002, 369, 180, 789],[002, 228, 043, 507];Umbrella:[002, 004, 513, 357],[599, 002, 991, 114];Camera:[793, 280, 970, 467].') 2023-11-03 01:14:38.319 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:14:38.319 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect woman,human face and mammal in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'woman:[315, 477, 603, 928],[104, 173, 208, 370],[220, 225, 265, 353],[261, 102, 467, 831];human face:[000, 233, 018, 361],[099, 262, 156, 364],[164, 242, 205, 319],[353, 189, 434, 333],[356, 559, 393, 739],[386, 561, 421, 739],[480, 229, 521, 311],[493, 102, 535, 233],[730, 262, 769, 338],[979, 388, 999, 457];mammal:[743, 280, 986, 923],[000, 211, 091, 942],[000, 457, 448, 999],[081, 235, 204, 866],[103, 163, 205, 355],[145, 211, 271, 488],[222, 221, 273, 405],[254, 088, 475, 838],[298, 279, 343, 357],[359, 482, 440, 843],[363, 484, 585, 920],[417, 187, 473, 421],[423, 785, 737, 999],[456, 173, 546, 492],[493, 019, 711, 841],[642, 269, 749, 567],[690, 205, 745, 330],[727, 227, 830, 467],[760, 189, 813, 269],[769, 315, 845, 488],[966, 280, 999, 492].') 2023-11-03 01:15:28.529 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:15:28.530 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Cow in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Cow:[082, 002, 633, 872],[341, 002, 907, 744].') 2023-11-03 01:16:09.877 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:16:09.878 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Picture,Pumpkin,Lamp,shelf and Bottle in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Picture:[376, 308, 425, 374];Pumpkin:[526, 557, 556, 605],[495, 543, 521, 586];Lamp:[442, 001, 524, 215],[399, 209, 434, 325];shelf:[439, 305, 463, 399],[699, 267, 840, 769];Bottle:[443, 366, 520, 574].') 2023-11-03 01:16:11.796 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:16:11.796 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "I'm [304, 052, 987, 992] a policeman and I helped a kid come out of the closet to his parents today. I wish I had someone to do the same for me back in the day") 2023-11-03 01:16:43.767 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:16:43.768 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Bottle:[611, 521, 619, 545],[355, 413, 361, 440],[345, 413, 351, 442],[080, 532, 091, 542],[682, 543, 692, 553];Volleyball:[254, 248, 284, 293],[104, 340, 113, 353];Person:[755, 439, 948, 868],[011, 385, 110, 766],[170, 306, 230, 594],[211, 301, 357, 739],[289, 367, 338, 525],[338, 362, 402, 526];Umbrella:[515, 178, 770, 534],[199, 150, 454, 527],[000, 169, 182, 385];Bracelet:[285, 333, 298, 345];Chair:[564, 449, 605, 552],[630, 447, 672, 554],[300, 476, 337, 523],[364, 479, 385, 517],[069, 439, 106, 540],[000, 439, 029, 531];Desk:[290, 436, 391, 535];Hat:[190, 305, 207, 321],[039, 386, 073, 426],[268, 345, 292, 386],[309, 367, 321, 382],[360, 361, 376, 374],[755, 440, 800, 469];Glasses:[191, 318, 207, 328],[060, 401, 072, 415],[268, 360, 285, 368],[762, 468, 771, 484];Backpack:[124, 468, 165, 535];Sneakers:[215, 568, 230, 593],[178, 570, 190, 592];Other Shoes:[321, 511, 327, 525],[382, 508, 394, 526];Satchel:[338, 485, 370, 538],[632, 509, 666, 561].') 2023-11-03 01:16:56.812 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:16:56.813 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'hot toys spider-man into [207, 018, 837, 865] the spider-verse spider-gwen figure - hooded with web') 2023-11-03 01:17:16.233 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:17:16.234 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Clock:[596, 012, 682, 129],[598, 139, 679, 255];Pillow:[877, 744, 1000, 999],[166, 402, 322, 582],[491, 385, 548, 554],[367, 254, 408, 328];Couch:[562, 615, 999, 1000];Bed:[203, 506, 878, 901];Towel:[705, 321, 832, 414];Storage box:[345, 738, 556, 999];Nightstand:[175, 628, 215, 980].') 2023-11-03 01:17:46.868 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:17:46.868 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'human body:[086, 785, 284, 992],[000, 840, 136, 999],[273, 788, 999, 999],[315, 767, 341, 834];person:[002, 761, 999, 999],[228, 529, 277, 596],[271, 525, 318, 598].') 2023-11-03 01:17:53.333 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:17:53.333 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me what is thisFrame1:[412, 187, 484, 600] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a personFrame1:[412, 187, 484, 600];Frame2:[406, 190, 481, 600];Frame3:[397, 162, 463, 569].') 2023-11-03 01:18:15.319 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:18:15.319 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me what is thisFrame1:[373, 231, 769, 463] and track its trajectory.To maintain consistency, ensure that trajectories in your answer match the Frame t:[xmin,ymin,xmax,ymax] setup.', 'This is a berlinFrame1:[373, 231, 769, 463];Frame2:[325, 244, 721, 476];Frame3:[259, 269, 661, 504].') 2023-11-03 01:18:55.330 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:18:55.331 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'woman:[056, 681, 246, 999],[164, 438, 258, 822],[233, 581, 338, 999],[235, 463, 288, 604],[340, 443, 417, 727],[415, 561, 506, 737],[611, 561, 692, 751],[645, 408, 728, 692],[697, 551, 806, 999],[743, 421, 856, 673];clothing:[000, 729, 038, 897],[064, 484, 190, 753],[069, 731, 250, 999],[131, 500, 195, 751],[160, 513, 270, 762],[216, 630, 366, 818],[235, 545, 297, 610],[240, 649, 346, 984],[293, 454, 380, 632],[320, 630, 428, 745],[351, 513, 402, 714],[392, 489, 433, 716],[414, 429, 460, 530],[414, 643, 505, 727],[428, 460, 535, 689],[525, 462, 628, 630],[529, 605, 608, 714],[581, 412, 643, 568],[597, 628, 700, 743],[641, 441, 666, 553],[655, 484, 728, 685],[707, 623, 807, 820],[712, 645, 888, 929],[756, 491, 833, 649].') 2023-11-03 01:19:07.279 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:19:07.279 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3,frame4 and frame5,can you tell me the trajectory of the woman wearing white shirt and blue pants in this video clip?Ensure the trajectories in your answer follow the TrackiFrame t:[xmin,ymin,xmax,ymax] structure.', 'There is one woman wearing white shirt and blue pants.Track1frame1:[323, 303, 696, 808];frame2:[314, 300, 678, 809].') 2023-11-03 01:19:09.590 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:19:09.590 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Ismail Hakkı Izmirli [326, 089, 674, 942] was a historian of religion known for his vast knowledge about the Islamic ontology in the last periods of the Ottoman Empire.') 2023-11-03 01:19:22.189 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:19:22.190 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'human hair:[361, 013, 708, 518];human head:[358, 048, 717, 590];human arm:[208, 498, 466, 973],[711, 612, 973, 999];human hand:[771, 824, 975, 1000],[316, 484, 504, 702];human eye:[395, 274, 456, 342];boy:[207, 024, 970, 999];human mouth:[436, 450, 500, 553];human ear:[620, 381, 709, 518];mammal:[205, 012, 975, 999];clothing:[228, 516, 794, 999];frog:[033, 298, 198, 621];human face:[381, 137, 683, 632];human nose:[430, 304, 496, 422].') 2023-11-03 01:19:25.056 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:19:25.057 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[465, 248, 998, 1000],[040, 029, 565, 994];Lamp:[767, 496, 832, 595];Desk:[913, 660, 999, 849];Chair:[002, 807, 049, 1000];Tie:[324, 445, 430, 1000];Glasses:[303, 188, 460, 277].') 2023-11-03 01:19:33.583 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:19:33.584 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Truck:[724, 451, 1000, 746],[040, 417, 471, 731];Car:[637, 579, 694, 662],[464, 584, 493, 647];SUV:[474, 527, 654, 761];Street Lights:[570, 173, 593, 531];Person:[268, 740, 388, 1000],[000, 519, 053, 687],[000, 469, 046, 856],[309, 526, 356, 736],[671, 501, 833, 999],[679, 472, 809, 998],[845, 520, 948, 1000],[930, 488, 999, 999];Tent:[045, 496, 108, 530],[464, 530, 525, 650].') 2023-11-03 01:19:40.231 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:19:40.231 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[359, 648, 406, 708],[318, 651, 361, 716];Umbrella:[287, 613, 398, 641];Traffic Light:[138, 439, 169, 480],[940, 396, 980, 455];Traffic Sign:[456, 502, 525, 523],[512, 488, 602, 509];Street Lights:[504, 354, 642, 639],[938, 485, 999, 679];Pickup Truck:[001, 651, 269, 712];Bus:[458, 565, 918, 721];Car:[938, 665, 999, 707].') 2023-11-03 01:20:06.976 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:20:06.977 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[606, 650, 705, 853] and track its trajectory.Stick to the Frame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'This is a personFrame1:[606, 650, 705, 853];Frame2:[623, 675, 707, 866];Frame3:[623, 725, 694, 861];Frame4:[627, 665, 695, 861];Frame5:[627, 667, 695, 861].') 2023-11-03 01:21:22.440 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:21:22.440 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Lamp in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Lamp:[027, 016, 081, 052],[913, 003, 962, 039].') 2023-11-03 01:21:25.037 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:21:25.037 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Tricycle:[144, 448, 853, 953];Scooter:[604, 330, 661, 424];Motorcycle:[665, 330, 743, 422];Bakset:[689, 552, 826, 613];Handbag:[534, 504, 670, 660];Hat:[496, 401, 617, 491],[290, 235, 400, 313];Mask:[548, 466, 588, 504];Slippers:[354, 698, 420, 733],[309, 758, 378, 803];Person:[497, 402, 696, 622],[265, 234, 427, 735];Bottle:[480, 575, 524, 644];Pot:[688, 509, 773, 564];Lamp:[071, 191, 092, 224],[566, 035, 609, 094];Potted Plant:[430, 288, 520, 388],[512, 291, 589, 395].') 2023-11-03 01:21:37.655 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:21:37.656 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[265, 100, 722, 598] Natalia Kanem is seen on [107, 006, 862, 646] a computer monitor addresses delegates in the virtual event.') 2023-11-03 01:21:42.270 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:21:42.271 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Visitors [035, 033, 299, 626] [272, 111, 630, 689] [422, 078, 992, 832] filling the jars [627, 746, 798, 895] [214, 837, 365, 967] with objects during the opening reception.') 2023-11-03 01:22:06.755 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:22:06.756 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'person:[612, 310, 765, 881],[001, 326, 099, 731],[090, 341, 173, 718],[130, 316, 226, 718],[198, 318, 262, 714],[244, 306, 311, 751],[283, 308, 373, 787],[353, 318, 473, 836],[440, 293, 616, 876],[830, 191, 999, 996];sports equipment:[028, 664, 061, 734],[071, 655, 110, 726],[218, 322, 253, 391],[261, 312, 296, 384],[298, 545, 453, 609],[300, 316, 343, 395],[356, 516, 384, 580],[396, 739, 426, 832],[443, 735, 471, 828],[445, 560, 493, 629],[468, 461, 555, 806],[472, 499, 495, 578],[475, 305, 513, 387],[633, 298, 681, 447],[696, 439, 725, 503],[696, 522, 806, 725],[711, 500, 748, 581],[866, 897, 930, 988],[908, 193, 973, 337],[918, 876, 999, 980],[976, 504, 999, 590];footwear:[443, 738, 473, 804],[863, 874, 928, 981],[921, 883, 999, 947].') 2023-11-03 01:22:08.372 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 01:22:08.373 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Boat:[342, 443, 409, 492].') 2023-11-03 01:22:28.645 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:22:28.646 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'sports uniform:[740, 324, 873, 832],[000, 386, 084, 713],[417, 640, 699, 840],[540, 476, 704, 750];woman:[000, 342, 105, 730],[016, 333, 057, 439],[035, 299, 076, 407],[053, 265, 094, 356],[131, 369, 178, 458],[221, 363, 278, 450],[246, 109, 333, 276],[307, 255, 361, 360],[392, 362, 433, 446],[399, 292, 448, 379],[456, 303, 500, 403],[497, 332, 551, 424],[508, 265, 562, 335],[590, 363, 646, 446],[629, 310, 663, 389],[633, 226, 691, 353],[701, 356, 751, 454],[759, 226, 813, 391],[773, 326, 814, 399],[915, 229, 981, 373];man:[000, 341, 109, 720],[004, 108, 096, 312],[059, 330, 120, 414],[068, 361, 130, 454],[085, 258, 138, 409],[098, 197, 145, 307],[159, 105, 239, 294],[166, 294, 227, 404],[170, 367, 229, 455],[214, 323, 277, 416],[264, 314, 333, 409],[313, 292, 373, 384],[334, 373, 396, 452],[371, 606, 696, 831],[419, 318, 448, 412],[430, 353, 495, 452],[457, 211, 519, 353],[493, 477, 732, 795],[502, 286, 557, 396],[536, 345, 599, 450],[566, 219, 632, 335],[639, 337, 703, 446],[706, 333, 865, 813],[713, 223, 774, 380],[824, 310, 882, 393],[838, 351, 893, 448],[951, 248, 999, 407].') 2023-11-03 01:22:32.237 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:22:32.238 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the parked white car without shifting in this video clip?When you incorporate trajectories into your answer, adhere to the TrackiFrame t:[xmin,ymin,xmax,ymax] format.,Present the trajectories in your response using the designated TrackiFrame t:[xmin,ymin,xmax,ymax] layout.', 'There is one parked white car without shifting.Track1frame:1:[535, 001, 935, 188];frame:2:[537, 001, 937, 187];frame:3:[540, 001, 940, 186].') 2023-11-03 01:22:35.458 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:22:35.458 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Couple [302, 193, 698, 960] holds hands and gently looks at each other. man [516, 220, 687, 804] and woman posing with surfboards') 2023-11-03 01:22:53.069 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:22:53.069 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'BattleBots live audience with a Shatter sign [445, 365, 598, 542]') 2023-11-03 01:22:55.528 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:22:55.528 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3,frame4 and frame5,what is the track of the left cars in the counter direction of ours?For the trajectories included in the answer, please use the format TrackiFrame t:[xmin,ymin,xmax,ymax].', 'There are 2 left cars in the counter direction of ours.Track1frame1:[008, 493, 104, 602];frame2:[000, 496, 047, 608],Track2frame3:[392, 477, 436, 586];frame4:[351, 472, 417, 632];frame5:[250, 488, 375, 762].') 2023-11-03 01:23:21.930 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:23:21.930 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[149, 194, 407, 745] and track its trajectory.When detailing trajectories in your response, adhere to the Frame t:[xmin,ymin,xmax,ymax] format.', 'This is a giraffeFrame1:[149, 194, 407, 745];Frame2:[159, 193, 431, 716];Frame3:[267, 252, 499, 734];Frame4:[372, 270, 555, 748];Frame5:[372, 293, 568, 752].') 2023-11-03 01:23:36.054 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:23:36.054 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Street Lights:[175, 010, 253, 226];Truck:[215, 440, 455, 648];Car:[171, 498, 209, 578];Van:[011, 477, 080, 566];Tent:[816, 495, 913, 639],[007, 166, 551, 1000],[000, 421, 051, 519];Person:[293, 552, 339, 696],[047, 541, 079, 646],[036, 506, 065, 626],[000, 505, 018, 599];Awning:[513, 343, 646, 477],[630, 309, 986, 491];Bench:[778, 636, 844, 733],[737, 661, 791, 778],[670, 694, 740, 838],[628, 703, 701, 871],[696, 658, 749, 757],[605, 634, 661, 717],[570, 630, 628, 722];Desk:[288, 733, 527, 987];Flower:[058, 604, 079, 672],[030, 628, 047, 663],[009, 729, 048, 823];Potted Plant:[774, 205, 806, 263],[801, 227, 820, 259].') 2023-11-03 01:23:59.942 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 01:23:59.943 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Chair in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Chair:[349, 812, 594, 1000].') 2023-11-03 01:24:00.876 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:24:00.876 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'How to tell students [005, 055, 332, 992] what to look for without telling them what to see is the dilemma of teaching. - Lascelles Abercrombie [344, 015, 993, 989]') 2023-11-03 01:24:12.010 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:24:12.010 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'boy:[441, 445, 722, 991],[128, 385, 213, 630],[131, 465, 295, 998],[308, 393, 505, 998],[607, 226, 861, 750],[691, 451, 882, 918];human body:[007, 488, 116, 956],[000, 465, 998, 998],[416, 681, 708, 998],[640, 236, 868, 551];man:[393, 518, 795, 1000],[000, 435, 027, 570],[000, 670, 032, 991],[125, 386, 238, 855],[131, 455, 311, 998],[276, 446, 345, 993],[605, 210, 867, 705],[620, 430, 738, 783],[687, 448, 873, 911],[766, 418, 926, 898],[903, 450, 973, 998];human hair:[000, 426, 020, 505],[000, 525, 083, 681],[030, 463, 105, 538],[102, 445, 153, 518],[160, 385, 207, 450],[206, 465, 265, 548],[336, 425, 395, 491],[463, 456, 507, 521],[498, 480, 638, 606],[507, 445, 602, 500],[608, 225, 686, 328],[612, 431, 660, 521],[790, 463, 856, 540],[896, 451, 953, 521],[970, 468, 998, 531];clothing:[000, 236, 998, 998],[517, 668, 722, 998];human arm:[000, 830, 043, 998],[016, 641, 107, 741],[067, 535, 118, 715],[258, 591, 292, 761],[410, 675, 665, 998],[686, 456, 796, 625],[893, 520, 931, 820],[938, 621, 998, 705];human hand:[082, 531, 117, 608],[310, 800, 337, 891],[413, 681, 493, 806],[902, 745, 927, 818].') 2023-11-03 01:24:15.880 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:24:15.880 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Bench:[150, 580, 422, 922],[381, 532, 505, 673],[622, 525, 774, 661],[667, 581, 905, 933].') 2023-11-03 01:24:34.990 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:24:34.991 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Flag:[378, 113, 558, 306];Person:[473, 216, 545, 341],[576, 196, 670, 339],[616, 213, 717, 399],[510, 288, 592, 402],[417, 257, 509, 407],[697, 136, 849, 471],[823, 475, 999, 809],[656, 440, 853, 842],[494, 518, 699, 896],[225, 252, 350, 903],[000, 194, 194, 961],[001, 355, 135, 1000];Glasses:[232, 323, 278, 351];Slippers:[132, 832, 163, 874],[111, 878, 129, 955];Sneakers:[262, 808, 295, 845],[264, 857, 311, 903];Satchel:[460, 661, 522, 807];Hat:[765, 137, 820, 186];Canned:[796, 699, 823, 746];Lifesaver:[839, 251, 936, 441];Desk:[424, 742, 1000, 998];Bench:[326, 579, 864, 923],[337, 458, 853, 634],[340, 393, 719, 494],[337, 338, 655, 413],[312, 299, 442, 349].') 2023-11-03 01:25:36.166 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:25:36.167 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'person:[000, 796, 041, 936],[048, 754, 073, 868],[168, 750, 370, 928],[452, 765, 476, 978];skyscraper:[000, 237, 999, 551],[076, 404, 106, 495];tree:[241, 405, 297, 480],[258, 534, 337, 686],[347, 528, 431, 700],[360, 430, 381, 495],[441, 488, 652, 554],[443, 414, 590, 459],[639, 700, 670, 767],[655, 651, 724, 774],[665, 447, 839, 572].') 2023-11-03 01:25:38.042 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:25:38.042 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me what is thisFrame1:[525, 470, 559, 541] and track its trajectory.Your response should highlight trajectories using the established Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a handFrame1:[525, 470, 559, 541];Frame2:[664, 413, 699, 466];Frame3:[550, 483, 575, 533].') 2023-11-03 01:25:38.924 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:25:38.925 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'overhead view of vegan waffles [327, 247, 930, 534] [003, 325, 485, 757] [126, 432, 996, 855] quartered on a white plate [000, 203, 992, 939] and topped with plum jam and vegan whipped cream [670, 439, 995, 579] [002, 409, 378, 541] [465, 254, 770, 410] [344, 572, 710, 726]') 2023-11-03 01:25:59.163 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:25:59.164 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'suit:[033, 431, 369, 1000],[374, 315, 970, 999];man:[358, 078, 941, 920],[028, 375, 373, 999];human face:[161, 393, 261, 583],[594, 081, 723, 348].') 2023-11-03 01:26:20.501 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 01:26:20.501 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person in this image.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Person:[422, 230, 639, 996].') 2023-11-03 01:26:29.731 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:26:29.731 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Jellyfish,Lemon,Person and Pepper in this image.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Lemon:[545, 434, 750, 710];Person:[678, 001, 1000, 117];Pepper:[327, 638, 581, 730].') 2023-11-03 01:27:00.460 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:27:00.461 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'window:[855, 060, 989, 493],[210, 088, 768, 479];chair:[133, 535, 340, 952],[203, 477, 332, 842],[238, 465, 313, 590],[538, 484, 683, 739],[585, 461, 672, 627];tree:[503, 200, 698, 319];kitchen & dining room table:[268, 505, 606, 906].') 2023-11-03 01:27:06.500 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:27:06.501 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Bus:[253, 497, 1000, 833];Street Lights:[360, 230, 430, 508];Car:[158, 586, 253, 644],[000, 556, 054, 610];Pickup Truck:[049, 557, 148, 621];Speed Limit Sign:[037, 455, 057, 487];Bench:[040, 708, 150, 764];Person:[172, 621, 214, 742].') 2023-11-03 01:27:07.439 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:27:07.440 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A wonderful tricycle [035, 426, 943, 991] being enjoyed by a young child [397, 095, 960, 935] at Little Cedars Nursery.') 2023-11-03 01:27:14.439 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:27:14.439 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[265, 307, 933, 758] and track its trajectory.Your response should highlight trajectories using the established Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a tankFrame1:[265, 307, 933, 758];Frame2:[264, 312, 922, 751];Frame3:[264, 310, 896, 745];Frame4:[259, 306, 864, 732];Frame5:[338, 296, 920, 722].') 2023-11-03 01:27:48.763 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:27:48.763 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'pastry:[122, 291, 819, 769];snack:[091, 294, 840, 755];dessert:[088, 291, 846, 761].') 2023-11-03 01:27:58.340 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:27:58.340 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'bicycle wheel:[700, 732, 791, 875],[016, 543, 109, 674],[031, 268, 135, 397],[159, 603, 241, 730],[163, 367, 246, 434],[247, 613, 354, 742],[301, 432, 378, 520],[405, 434, 508, 553],[407, 640, 522, 785],[520, 674, 636, 815],[699, 548, 803, 629],[898, 635, 977, 725];tire:[050, 774, 186, 924],[018, 540, 108, 668],[036, 262, 138, 398],[161, 592, 241, 733],[163, 369, 250, 434],[244, 157, 307, 241],[247, 602, 353, 745],[302, 430, 381, 522],[403, 438, 507, 553],[409, 640, 523, 788],[505, 275, 564, 359],[521, 676, 636, 820],[688, 731, 803, 878],[691, 524, 808, 629],[758, 360, 876, 500],[891, 633, 978, 723];bicycle:[000, 222, 135, 419],[018, 485, 243, 740],[156, 257, 377, 540],[236, 530, 528, 795],[395, 340, 807, 656],[525, 595, 806, 889];wheel:[016, 537, 118, 679],[029, 262, 141, 406],[084, 806, 190, 902],[159, 587, 245, 740],[166, 367, 242, 435],[236, 159, 316, 243],[253, 603, 362, 734],[301, 432, 379, 519],[402, 437, 501, 556],[403, 642, 526, 792],[503, 264, 570, 356],[520, 685, 636, 818],[689, 724, 806, 884],[696, 548, 797, 630],[748, 356, 872, 505],[889, 632, 977, 724].') 2023-11-03 01:28:20.370 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:28:20.370 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Hat:[777, 249, 806, 278],[680, 223, 708, 248],[406, 306, 460, 365];Sneakers:[607, 659, 641, 709],[454, 664, 520, 714],[292, 655, 336, 721],[148, 548, 210, 630];Helmet:[381, 516, 433, 577];Belt:[542, 470, 564, 497];Person:[364, 306, 641, 714],[150, 516, 466, 730],[647, 225, 735, 365],[754, 250, 836, 373];Baseball Glove:[354, 497, 415, 569].') 2023-11-03 01:28:21.302 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:28:21.302 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[035, 578, 088, 805],personFrame1:[223, 181, 262, 379],personFrame1:[069, 629, 122, 863],personFrame1:[488, 834, 523, 1000],personFrame1:[422, 100, 460, 258],personFrame1:[399, 292, 437, 479],personFrame1:[706, 525, 759, 749] in this video clip.Your response should conform strictly to the categoryFrame t:[xmin,ymin,xmax,ymax] format.', 'personFrame1:[035, 578, 088, 805];Frame2:[050, 583, 096, 809],personFrame1:[223, 181, 262, 379];Frame2:[226, 176, 264, 375],personFrame1:[069, 629, 122, 863];Frame2:[076, 638, 134, 875],personFrame1:[488, 834, 523, 1000];Frame2:[487, 832, 523, 1000],personFrame1[422, 100, 460, 258];Frame2:No object,personFrame1:[399, 292, 437, 479];Frame2:[403, 297, 440, 486],personFrame1:[706, 525, 759, 749];Frame2:[707, 525, 760, 749].') 2023-11-03 01:28:46.234 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:28:46.234 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Drawing of young men [445, 082, 584, 777] [641, 367, 805, 983] [376, 443, 525, 894] [250, 346, 396, 881] [516, 369, 679, 918] kneeling with hands [479, 240, 523, 308] behind their heads [648, 070, 717, 238] [187, 031, 259, 162] [308, 346, 372, 467] [441, 451, 494, 608] [340, 048, 409, 184] [577, 373, 640, 515] [696, 367, 755, 500] in front of riot police [177, 033, 298, 799]') 2023-11-03 01:28:51.801 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:28:51.802 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Bench:[642, 744, 718, 832];Air Conditioner:[741, 602, 772, 643];Lamp:[874, 347, 894, 381];Trash bin Can:[869, 738, 903, 798];SUV:[000, 689, 070, 863],[950, 669, 999, 779];Fire Hydrant:[241, 781, 268, 863];Traffic Light:[417, 496, 447, 587],[609, 509, 633, 615];Street Lights:[156, 058, 339, 875].') 2023-11-03 01:29:15.670 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:29:15.671 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect footwear,hiking equipment,person and magpie in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'footwear:[547, 630, 642, 677],[645, 625, 725, 677];hiking equipment:[591, 492, 745, 543],[649, 402, 725, 455];person:[578, 388, 793, 676].') 2023-11-03 01:29:16.847 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:29:16.847 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[459, 152, 681, 849],[103, 118, 187, 319];Guitar:[323, 202, 524, 475];Drum:[362, 417, 489, 718],[324, 296, 431, 424].') 2023-11-03 01:29:17.838 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:29:17.839 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect Helmet and Pickup Truck in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Helmet:[560, 457, 603, 539];Pickup Truck:[668, 567, 855, 746].') 2023-11-03 01:29:23.160 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 01:29:23.160 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'food:[139, 326, 826, 788].') 2023-11-03 01:29:33.230 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:29:33.230 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, can you tell me what thisframe:1:[001, 539, 453, 997];frame:2:[001, 537, 459, 997];frame:3:[001, 543, 454, 997] is?Briefly articulate your response by taking cues from the provided image frames and trajectory coordinates.', 'This is a/an human sitting on the far left-side') 2023-11-03 01:30:14.165 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:30:14.166 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'woman:[171, 241, 633, 1000],[533, 338, 822, 995],[677, 422, 999, 999];girl:[156, 220, 603, 999],[557, 348, 996, 995],[717, 424, 999, 999];clothing:[005, 530, 098, 999],[167, 605, 534, 999],[543, 703, 737, 999];human face:[376, 273, 568, 659],[564, 470, 743, 828],[710, 492, 951, 957].') 2023-11-03 01:30:42.692 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:30:42.693 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect human head,man,human hair and human body in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'human head:[575, 406, 648, 507],[064, 241, 113, 338],[133, 316, 185, 409],[181, 309, 232, 388],[228, 299, 278, 388],[314, 239, 363, 328],[323, 360, 369, 466],[396, 301, 455, 391],[447, 259, 496, 345],[529, 280, 571, 366],[600, 287, 638, 366],[685, 247, 730, 334],[735, 241, 785, 342],[841, 299, 900, 409];man:[007, 242, 146, 884],[193, 299, 291, 878],[269, 238, 406, 840],[388, 255, 530, 855],[590, 283, 693, 865],[675, 246, 749, 872],[821, 273, 876, 427];human hair:[576, 399, 650, 525],[135, 312, 190, 380],[185, 305, 235, 387],[318, 350, 368, 423],[398, 299, 452, 398],[838, 292, 901, 367];human body:[810, 289, 946, 988],[000, 326, 823, 947],[812, 315, 857, 429],[897, 354, 957, 607].') 2023-11-03 01:30:52.718 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:30:52.718 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A series [403, 105, 855, 991] [013, 094, 402, 996] of banners in rainbow colors feature symbols [253, 716, 332, 833] [358, 483, 420, 604] [640, 745, 695, 820] [580, 272, 637, 398] [644, 257, 699, 343] from different religions') 2023-11-03 01:31:43.460 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:31:43.461 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[448, 358, 731, 994];man:[580, 149, 1000, 1000],[000, 299, 378, 999],[000, 552, 439, 999],[320, 271, 378, 386],[351, 288, 455, 678],[510, 292, 620, 560];clothing:[000, 590, 355, 999],[000, 524, 460, 999],[364, 371, 465, 666],[445, 676, 619, 999],[513, 339, 623, 569],[580, 494, 999, 999];human face:[263, 414, 376, 774],[386, 315, 426, 384],[573, 479, 676, 685],[666, 302, 728, 675].') 2023-11-03 01:31:46.149 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:31:46.150 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'woman:[701, 218, 903, 921],[000, 291, 061, 976];trousers:[698, 641, 895, 965];girl:[743, 359, 914, 997];tree:[000, 000, 999, 293],[619, 000, 957, 273].') 2023-11-03 01:31:54.417 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:31:54.418 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[320, 464, 440, 716] and track its trajectory.Your response should highlight trajectories using the established Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a trolleybusFrame1:[320, 464, 440, 716];Frame2:[327, 450, 449, 717];Frame3:[345, 431, 477, 724];Frame4:[346, 405, 528, 735];Frame5:[345, 390, 580, 755].') 2023-11-03 01:31:56.498 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:31:56.498 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[630, 790, 679, 847] and track its trajectory.If you're including trajectory details in your reply, the Frame t:[xmin,ymin,xmax,ymax] format is imperative.", 'This is a person headFrame1:[630, 790, 679, 847];Frame2:[411, 846, 459, 908];Frame3:[466, 848, 515, 917];Frame4:[418, 937, 440, 998];Frame5:[467, 889, 505, 931].') 2023-11-03 01:32:43.181 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:32:43.181 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'human hair:[530, 590, 681, 793],[000, 736, 027, 842],[000, 828, 175, 999],[065, 771, 166, 901],[183, 719, 303, 999],[325, 742, 406, 939],[409, 672, 645, 974],[653, 840, 787, 999],[653, 743, 818, 999],[678, 639, 772, 771];human head:[529, 593, 681, 775],[000, 584, 997, 999];person:[000, 498, 999, 999],[521, 570, 669, 805];mammal:[000, 492, 999, 999],[400, 684, 667, 999];clothing:[073, 636, 139, 812],[079, 731, 184, 953],[333, 727, 423, 987],[658, 837, 786, 999],[787, 724, 849, 787],[938, 822, 999, 999];human arm:[124, 500, 177, 801].') 2023-11-03 01:33:11.175 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:33:11.175 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'bird:[440, 320, 670, 843];tree:[000, 000, 999, 998].') 2023-11-03 01:33:13.212 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:33:13.213 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'bicycle:[000, 417, 098, 690],[084, 450, 198, 690],[180, 440, 274, 725],[268, 431, 350, 669],[375, 440, 476, 695],[516, 451, 614, 700],[614, 434, 706, 708],[780, 435, 872, 706],[931, 455, 999, 735];bicycle wheel:[000, 554, 024, 680],[049, 540, 094, 647],[142, 567, 170, 695],[158, 547, 179, 652],[210, 573, 239, 717],[230, 560, 270, 675],[263, 533, 348, 640],[311, 578, 331, 683],[443, 562, 467, 685],[528, 567, 558, 709],[564, 562, 588, 665],[628, 540, 669, 660],[666, 560, 688, 696],[812, 560, 838, 699],[830, 559, 868, 662],[976, 600, 999, 695];wheel:[000, 547, 027, 681],[049, 529, 094, 644],[133, 555, 170, 692],[153, 530, 178, 654],[215, 571, 239, 721],[215, 555, 267, 668],[266, 535, 348, 640],[310, 550, 328, 681],[430, 544, 445, 649],[445, 561, 466, 681],[525, 561, 560, 705],[564, 550, 588, 661],[628, 539, 676, 658],[669, 570, 691, 695],[810, 564, 836, 697],[818, 555, 869, 670],[978, 608, 999, 702].') 2023-11-03 01:33:38.958 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:33:38.958 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Bicycle:[282, 599, 454, 1000],[112, 409, 120, 435],[078, 404, 087, 430];Street Lights:[881, 334, 888, 411],[780, 325, 785, 408],[063, 286, 073, 358];Pickup Truck:[661, 406, 697, 425];Van:[404, 394, 449, 417];Bus:[309, 396, 348, 415];Car:[056, 396, 067, 412];Person:[276, 397, 463, 935],[111, 397, 121, 424],[119, 396, 125, 418],[105, 394, 111, 419],[096, 391, 104, 421],[077, 388, 089, 430],[059, 391, 066, 416],[048, 393, 054, 423],[414, 405, 419, 418],[655, 400, 661, 416];Necklace:[358, 486, 378, 576];Glasses:[358, 423, 392, 443];Sneakers:[387, 797, 415, 866],[308, 880, 340, 927].') 2023-11-03 01:33:51.353 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:33:51.353 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Vase:[485, 598, 624, 644];Person:[003, 566, 094, 788],[147, 582, 255, 806],[287, 400, 371, 538],[401, 555, 472, 730],[695, 541, 745, 644];Satchel:[208, 662, 275, 712];Backpack:[437, 589, 504, 653].') 2023-11-03 01:34:14.336 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:34:14.336 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect human arm,human head,man and human hair in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'human arm:[201, 626, 280, 968],[088, 662, 191, 812],[140, 299, 215, 412],[145, 526, 225, 688],[158, 746, 201, 818],[159, 540, 196, 682],[202, 638, 238, 781],[245, 307, 274, 407],[336, 302, 391, 390],[411, 577, 461, 993],[438, 567, 500, 756],[440, 311, 483, 418],[493, 623, 543, 822],[588, 631, 679, 818],[606, 627, 638, 714],[620, 241, 648, 411],[626, 583, 665, 806],[684, 662, 741, 849],[702, 285, 729, 405],[795, 664, 848, 853],[835, 673, 929, 864],[915, 660, 966, 888],[933, 627, 984, 862];human head:[270, 378, 383, 568],[106, 550, 162, 642],[195, 251, 240, 326],[225, 530, 279, 633],[433, 453, 493, 569],[493, 451, 558, 552],[539, 495, 581, 585],[560, 503, 631, 623],[665, 223, 703, 291],[675, 517, 721, 602],[746, 541, 811, 657],[778, 478, 841, 600],[854, 524, 913, 638],[925, 488, 990, 600];man:[146, 253, 266, 535],[208, 375, 477, 999],[551, 452, 658, 742],[612, 231, 730, 591],[655, 522, 751, 997],[783, 476, 867, 999];human hair:[275, 389, 376, 574],[080, 532, 157, 660],[100, 555, 160, 642],[193, 248, 243, 316],[223, 528, 286, 602],[431, 449, 495, 561],[476, 446, 558, 634],[520, 501, 580, 657],[528, 508, 633, 701],[661, 223, 703, 286],[703, 542, 812, 698],[838, 528, 916, 791],[895, 488, 983, 681],[962, 495, 999, 646].') 2023-11-03 01:34:17.764 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:34:17.764 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Leather Shoes:[278, 801, 334, 885],[132, 819, 198, 955],[846, 924, 936, 999];Slippers:[801, 811, 859, 897];Belt:[790, 357, 860, 656];Couch:[001, 386, 991, 821];Cabinet:[504, 001, 986, 473],[041, 006, 509, 363];Flower:[330, 599, 577, 939];Vase:[409, 767, 516, 973];Person:[081, 099, 369, 948],[347, 135, 535, 809],[521, 219, 709, 726],[695, 149, 944, 996];Book:[072, 278, 115, 399].') 2023-11-03 01:35:06.379 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:35:06.379 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the green bird walking around, then eating , then flying away?Your trajectory details should strictly align with the TrackiFrame t:[xmin,ymin,xmax,ymax] format in your submission.', 'There is one green bird walking around, then eating , then flying away.Track1frame:1:[429, 545, 569, 998];frame:2:[433, 563, 568, 998];frame:3:[410, 704, 605, 998].') 2023-11-03 01:35:17.102 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:35:17.103 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[164, 447, 350, 709] and track its trajectory.For the trajectories included in the answer, please use the format Frame t:[xmin,ymin,xmax,ymax].', 'This is a alligatorFrame1:[164, 447, 350, 709];Frame2:[170, 462, 350, 666];Frame3:[209, 473, 342, 637];Frame4:[194, 488, 316, 665].') 2023-11-03 01:35:20.886 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:35:20.886 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'It’s all in the detail – when you lift [249, 000, 530, 540] the cover on [123, 519, 543, 996] the good-sized sink you’ll find [277, 186, 485, 553] a utensil rack. We also like the 12V gang of [625, 250, 737, 630] three retractable power sockets in the Shire') 2023-11-03 01:35:35.780 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:35:35.781 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Person:[279, 530, 410, 805],[000, 502, 071, 791];Awning:[025, 220, 610, 401];Other Shoes:[281, 762, 301, 805],[035, 736, 068, 782];Hat:[349, 531, 380, 562];Lamp:[344, 122, 387, 183],[399, 106, 440, 170],[454, 100, 497, 158],[521, 080, 567, 150],[613, 068, 663, 130],[706, 054, 762, 110],[748, 002, 796, 055],[827, 001, 879, 053],[927, 001, 970, 028];Fire Truck:[062, 309, 1000, 774].') 2023-11-03 01:35:41.661 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:35:41.661 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'human body:[718, 223, 983, 938],[264, 374, 344, 842],[287, 267, 521, 998],[452, 364, 572, 998],[459, 276, 685, 998];human head:[804, 169, 965, 420],[294, 372, 341, 455],[351, 259, 430, 429],[450, 360, 506, 482],[529, 274, 631, 455];man:[719, 127, 970, 986],[254, 362, 342, 986],[289, 265, 531, 998],[443, 362, 545, 998],[451, 277, 685, 998];human arm:[735, 622, 844, 1000],[260, 491, 293, 729],[297, 452, 329, 925],[450, 489, 630, 686],[454, 462, 524, 776],[895, 433, 998, 998];human hair:[569, 325, 633, 391];clothing:[256, 461, 349, 820],[288, 404, 475, 998],[471, 464, 544, 998],[553, 406, 678, 998],[710, 442, 946, 995].') 2023-11-03 01:35:42.649 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:35:42.650 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'truck:[206, 651, 960, 990];wheel:[837, 869, 920, 995],[000, 913, 043, 999],[338, 873, 429, 999];traffic light:[331, 085, 431, 257],[411, 117, 478, 301],[572, 237, 618, 377],[618, 240, 650, 385];street light:[071, 687, 106, 751];car:[080, 795, 125, 881].') 2023-11-03 01:36:18.105 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:36:18.105 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Lifestyle blogger Roxanne of Glass of [135, 104, 991, 802] Glam wearing [365, 445, 866, 702] plaid pants, [133, 264, 702, 767] a plaid coat, [278, 109, 602, 236] felt fedora, [767, 628, 995, 763] [643, 658, 907, 796] white booties, and a cable knit sweater') 2023-11-03 01:36:22.633 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:36:22.634 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Fire Extinguisher:[154, 419, 221, 634];Glasses:[436, 390, 520, 462];Satchel:[724, 660, 827, 779];Pot:[038, 580, 196, 716],[608, 731, 702, 794],[855, 680, 962, 830];Cup:[140, 551, 186, 650],[248, 782, 306, 996],[351, 786, 384, 869];Bottle:[148, 793, 189, 965],[405, 681, 520, 969],[197, 687, 241, 873],[154, 726, 195, 856],[180, 666, 213, 855];Blender:[234, 620, 357, 897];Extractor:[863, 369, 998, 482];Gas stove:[785, 732, 994, 973];Oven:[783, 850, 959, 999];shelf:[765, 001, 997, 588],[600, 818, 800, 1000],[962, 902, 994, 1000];Storage box:[761, 617, 845, 768],[294, 746, 366, 810];Person:[230, 306, 618, 999].') 2023-11-03 01:36:52.843 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:36:52.843 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[369, 490, 452, 696],[280, 451, 355, 749],[196, 453, 266, 637],[089, 434, 177, 672];Carpet:[337, 628, 753, 957],[002, 734, 136, 999];Desk:[002, 609, 085, 795],[401, 556, 639, 799],[178, 505, 305, 641],[052, 521, 144, 687];Cabinet:[883, 545, 999, 687];Mirror:[821, 389, 984, 550].') 2023-11-03 01:37:25.951 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:37:25.951 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, can you tell me what thisframe:1:[491, 626, 808, 912];frame:2:[491, 626, 808, 912];frame:3:[508, 640, 806, 919] is?Ensure your answer is to the point, taking into account the provided image frames and trajectory coordinates.', 'This is a/an almsot does not move') 2023-11-03 01:37:31.863 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:37:31.863 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[392, 260, 736, 921],[131, 297, 166, 463],[035, 303, 084, 471];Sneakers:[464, 768, 554, 921],[400, 759, 485, 900];Slide:[069, 129, 850, 999].') 2023-11-03 01:37:38.533 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:37:38.533 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect building,jeans,window and man in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'building:[000, 000, 981, 713];jeans:[306, 746, 422, 986],[074, 618, 100, 702],[100, 611, 127, 735],[145, 615, 175, 712],[200, 620, 224, 683];window:[000, 284, 093, 432],[003, 161, 130, 361],[099, 265, 145, 394],[128, 140, 171, 232],[133, 227, 193, 380],[155, 043, 202, 122],[172, 100, 221, 189],[191, 187, 264, 347],[198, 000, 251, 085],[228, 043, 299, 155],[268, 144, 375, 310],[298, 000, 397, 106],[367, 130, 454, 271],[383, 000, 478, 075],[451, 095, 604, 258],[478, 000, 587, 050],[565, 000, 693, 030],[597, 087, 746, 229],[712, 000, 790, 027],[742, 084, 856, 260],[786, 000, 851, 051],[826, 105, 928, 270],[891, 127, 956, 287];man:[284, 480, 433, 1000],[046, 513, 083, 740],[063, 534, 097, 720],[088, 504, 127, 754],[426, 590, 481, 744],[481, 602, 561, 752],[501, 597, 569, 676],[695, 589, 741, 747].') 2023-11-03 01:37:51.492 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 01:37:51.493 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'man:[432, 065, 966, 993];clothing:[465, 218, 944, 894],[000, 624, 521, 999];woman:[000, 223, 541, 999];human face:[188, 295, 405, 561],[578, 115, 726, 265].') 2023-11-03 01:37:57.934 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:37:57.935 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'An outdoor patio with [490, 147, 842, 288] a big screen, [613, 465, 859, 601] [135, 325, 305, 447] [808, 475, 995, 667] [002, 497, 316, 705] [466, 354, 796, 495] [002, 420, 134, 558] nice furniture and more, [001, 714, 996, 998] a banner reads, "how to create the perfect outdoor space."') 2023-11-03 01:38:03.712 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:38:03.713 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect footwear and woman in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'footwear:[436, 445, 606, 510];woman:[194, 027, 638, 551].') 2023-11-03 01:38:22.136 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 01:38:22.136 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Awning:[787, 462, 979, 595].') 2023-11-03 01:38:24.896 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:38:24.897 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'turkish americans [734, 008, 827, 526] [806, 051, 876, 398] [499, 002, 598, 461] [317, 153, 525, 505] [604, 074, 771, 558] gathered at a vigil for the victims of the istanbul terror attack at the turkish consulate in los angeles. footage includes... - suicide bombing stock videos & royalty-free footage') 2023-11-03 01:38:37.500 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:38:37.501 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the small animal floating in water.Your trajectory details should strictly align with the TrackiFrame t:[xmin,ymin,xmax,ymax] format in your submission.', 'There is one small animal floating in water.Track1frame:1:[503, 575, 528, 593];frame:2:[515, 575, 526, 587];frame:3:[502, 567, 530, 590].') 2023-11-03 01:39:03.098 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:39:03.098 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'clothing:[607, 283, 900, 980],[008, 800, 250, 999];person:[552, 252, 902, 999].') 2023-11-03 01:39:08.915 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:39:08.916 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[505, 476, 516, 494] and track its trajectory.For the trajectories included in the answer, please use the format Frame t:[xmin,ymin,xmax,ymax].', 'This is a microphone held by a lady in silver clothFrame1:[505, 476, 516, 494];Frame2:[518, 477, 530, 497];Frame3:[535, 470, 547, 488];Frame4:[540, 463, 552, 481];Frame5:[539, 452, 551, 470].') 2023-11-03 01:39:15.580 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:39:15.581 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'collage [005, 005, 994, 992] of a mirrored image, showing young woman [004, 047, 510, 998] [490, 047, 995, 996], with brunette hair and blonde balayage, face and body decorated with blue and pink body glitter, creating different shapes [094, 200, 315, 393]') 2023-11-03 01:39:23.522 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:39:23.523 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[003, 008, 538, 995] [463, 052, 993, 994] Identical twins: Jasmine, left, and Amelia Appleby have just celebrated their first birthday') 2023-11-03 01:39:40.264 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 01:39:40.265 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Frame:[072, 678, 237, 831];Person:[518, 331, 999, 999],[418, 475, 601, 834],[001, 612, 114, 959],[602, 663, 928, 933].') 2023-11-03 01:40:04.902 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:40:04.903 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'woman:[416, 119, 636, 488],[000, 178, 266, 998],[113, 149, 521, 998],[487, 090, 862, 998],[746, 165, 851, 384],[777, 230, 999, 998];human hair:[616, 090, 759, 378],[018, 266, 064, 371],[060, 175, 194, 364],[258, 160, 429, 453],[440, 112, 582, 302],[751, 172, 850, 328],[853, 235, 980, 423];clothing:[150, 326, 461, 932],[000, 393, 236, 997],[429, 295, 598, 434],[453, 345, 818, 998],[839, 483, 997, 998];human face:[062, 245, 180, 470],[288, 233, 398, 434],[489, 133, 573, 334],[644, 138, 738, 340],[760, 203, 829, 359],[864, 324, 953, 501].') 2023-11-03 01:40:06.524 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:40:06.524 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'suit:[094, 199, 375, 991],[659, 399, 888, 999];man:[667, 259, 893, 993],[000, 252, 110, 999],[089, 187, 371, 999],[275, 211, 420, 999],[409, 287, 528, 599],[610, 283, 688, 542],[671, 281, 761, 439],[823, 323, 880, 895],[906, 360, 999, 903];woman:[378, 320, 758, 999];human face:[000, 288, 035, 395],[176, 212, 259, 384],[321, 225, 376, 326],[469, 320, 526, 399],[520, 405, 588, 521],[623, 306, 668, 391],[715, 288, 758, 374],[770, 329, 829, 443],[835, 354, 861, 419].') 2023-11-03 01:40:19.496 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:40:19.496 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[074, 427, 441, 860],[471, 209, 727, 937];Slippers:[476, 881, 518, 932],[649, 872, 726, 938].') 2023-11-03 01:40:21.292 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:40:21.293 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'BMW-5-Series-GT-fuel-cell-prototype-2-750x500') 2023-11-03 01:40:27.755 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:40:27.755 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the the lizard crawling backward into the hole..Always default to the TrackiFrame t:[xmin,ymin,xmax,ymax] pattern when discussing trajectories in your answer.', 'There is one the lizard crawling backward into the hole..Track1frame:1:[710, 321, 999, 971];frame:2:[710, 321, 999, 971];frame:3:[685, 326, 999, 987].') 2023-11-03 01:40:39.294 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:40:39.295 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A bride [427, 281, 656, 998] and groom [582, 211, 821, 995] walk in beautiful dappled light next to a lake at the wedding venue Lakeview Manor in Honiton') 2023-11-03 01:40:58.148 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:40:58.148 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[425, 127, 485, 512] and track its trajectory.Use the specified Frame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'This is a personFrame1:[425, 127, 485, 512];Frame2:[425, 127, 485, 512];Frame3:[425, 116, 485, 512];Frame4:[418, 146, 521, 506];Frame5:[422, 337, 586, 507].') 2023-11-03 01:40:58.838 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 01:40:58.839 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect man,clothing,girl and human face in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'man:[000, 050, 997, 998];clothing:[000, 367, 997, 998];girl:[246, 395, 997, 998];human face:[117, 217, 551, 484],[561, 468, 873, 709].') 2023-11-03 01:41:06.839 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:41:06.839 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'person:[156, 389, 307, 836],[293, 402, 375, 830],[353, 434, 439, 836],[368, 426, 520, 838],[508, 432, 604, 849],[598, 445, 683, 855];clothing:[153, 453, 307, 807],[291, 458, 380, 805],[365, 487, 443, 818],[435, 484, 531, 809],[508, 485, 604, 823],[593, 500, 683, 823].') 2023-11-03 01:41:08.303 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:41:08.303 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[400, 041, 534, 915] and track its trajectory.If you're including trajectory details in your reply, the Frame t:[xmin,ymin,xmax,ymax] format is imperative.", 'This is a personFrame1:[400, 041, 534, 915];Frame2:[426, 044, 532, 879];Frame3:[434, 051, 536, 863];Frame4:[435, 042, 546, 872];Frame5:[390, 045, 530, 898].') 2023-11-03 01:41:28.106 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:41:28.107 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'In this Jan. 29, 2010 file photo, [299, 096, 995, 993] director Joel Schumacher attends the premiere of "Twelve"') 2023-11-03 01:41:28.967 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:41:28.967 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Canned:[602, 566, 640, 673],[646, 571, 684, 681];Car:[451, 053, 634, 216];Hat:[899, 381, 998, 493],[135, 298, 246, 389];Satchel:[820, 345, 854, 453];Bottle:[009, 487, 054, 691];Barrel:[673, 553, 825, 836];Cup:[841, 709, 871, 806];Lamp:[261, 089, 319, 187],[317, 007, 424, 119],[862, 185, 903, 304];Frame:[853, 305, 974, 581],[027, 213, 139, 412],[131, 261, 211, 400];Storage box:[543, 392, 589, 476];Person:[146, 294, 302, 667],[305, 382, 399, 577],[350, 295, 506, 592],[745, 343, 854, 712].') 2023-11-03 01:41:38.567 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:41:38.567 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect man,woman and clothing in this image.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'man:[000, 054, 999, 959];woman:[099, 161, 735, 999],[858, 227, 953, 460];clothing:[014, 240, 169, 670],[099, 593, 732, 999],[133, 235, 406, 875],[404, 303, 566, 613],[616, 180, 688, 525],[685, 315, 856, 824],[688, 190, 784, 369],[853, 281, 984, 669],[856, 272, 940, 415],[866, 154, 938, 225],[939, 114, 999, 224].') 2023-11-03 01:41:43.666 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 01:41:43.666 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[003, 051, 1000, 1000].') 2023-11-03 01:41:49.762 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:41:49.762 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person in this image.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[344, 333, 550, 1000],[342, 269, 665, 1000].') 2023-11-03 01:42:04.283 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:42:04.283 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'human body:[557, 254, 848, 1000],[014, 810, 170, 997],[101, 731, 154, 827],[159, 583, 312, 997],[239, 875, 496, 997],[270, 585, 339, 656],[362, 570, 464, 739],[420, 579, 623, 895],[542, 514, 606, 647],[850, 504, 998, 970],[937, 402, 998, 556];human head:[607, 262, 765, 491],[000, 281, 982, 935];mammal:[557, 270, 859, 954],[000, 277, 998, 997];man:[837, 364, 998, 970],[121, 420, 307, 997],[196, 650, 490, 997],[462, 375, 610, 706],[532, 272, 815, 997],[918, 331, 998, 572];human face:[614, 277, 764, 502],[082, 666, 118, 735],[217, 722, 260, 912],[401, 493, 425, 583],[614, 322, 642, 452],[839, 429, 881, 520];human hair:[134, 420, 265, 641],[225, 666, 418, 877],[243, 450, 356, 631],[389, 450, 453, 493],[417, 435, 550, 643],[467, 379, 557, 497],[629, 283, 742, 439],[851, 397, 929, 495];clothing:[165, 479, 845, 997],[887, 512, 998, 960],[942, 408, 998, 558].') 2023-11-03 01:42:37.005 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:42:37.005 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Bench:[859, 880, 967, 973];Person:[867, 849, 906, 973];Motorcycle:[141, 835, 212, 958],[344, 798, 454, 997].') 2023-11-03 01:42:55.553 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:42:55.553 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'plant:[084, 065, 628, 998];building:[401, 000, 999, 998];window:[764, 559, 786, 651],[878, 682, 942, 802].') 2023-11-03 01:43:13.903 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:43:13.903 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect clothing,wheel,tire,tree and car in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'wheel:[601, 540, 698, 651],[225, 548, 313, 661];tire:[213, 535, 316, 651],[000, 879, 040, 999],[608, 537, 698, 656];tree:[751, 471, 960, 690],[003, 004, 543, 458];car:[093, 383, 752, 671].') 2023-11-03 01:43:15.862 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:43:15.863 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[371, 352, 692, 743] and track its trajectory.Your response should highlight trajectories using the established Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a quarter horseFrame1:[371, 352, 692, 743];Frame2:[372, 384, 663, 765];Frame3:[397, 366, 687, 761];Frame4:[421, 352, 729, 780].') 2023-11-03 01:43:21.329 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:43:21.329 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me what is thisFrame1:[379, 354, 564, 825] and track its trajectory.Your response should highlight trajectories using the established Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a personFrame1:[379, 354, 564, 825];Frame2:[315, 302, 624, 868];Frame3:[421, 202, 651, 839].') 2023-11-03 01:43:52.721 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:43:52.721 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[407, 399, 603, 815] [747, 565, 917, 905] These colorful concrete planters and vases add a modern geometric look to your home decor.') 2023-11-03 01:44:12.755 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:44:12.756 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'woman:[863, 083, 1000, 540],[041, 289, 226, 743],[230, 924, 350, 999],[312, 000, 516, 429],[445, 476, 649, 931];clothing:[067, 328, 203, 543],[232, 915, 298, 999],[322, 034, 490, 161],[377, 175, 450, 227],[458, 540, 633, 617],[522, 659, 605, 740],[858, 140, 999, 225],[910, 269, 999, 344].') 2023-11-03 01:44:29.049 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 01:44:29.050 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Cell Phone in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Cell Phone:[434, 387, 485, 415],[155, 346, 212, 381].') 2023-11-03 01:45:10.668 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:45:10.669 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[386, 652, 593, 958] [553, 636, 682, 965] Wounded people in the Nov. 13 Paris attacks wait for the start of a ceremony in the courtyard of the Invalides in Paris,') 2023-11-03 01:45:21.503 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:45:21.503 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'clothing:[791, 430, 974, 895],[000, 609, 345, 932],[001, 544, 091, 723],[072, 361, 198, 609],[121, 359, 248, 632],[204, 322, 308, 525],[251, 282, 467, 688],[303, 338, 615, 582],[328, 501, 649, 875],[599, 423, 798, 836],[749, 317, 828, 521],[806, 284, 999, 642];woman:[000, 487, 353, 933],[248, 209, 620, 566],[348, 401, 648, 901],[603, 322, 818, 718],[644, 183, 818, 508],[795, 125, 999, 652];girl:[000, 413, 091, 727];human face:[000, 435, 048, 545],[139, 283, 185, 356],[203, 507, 260, 638],[208, 257, 255, 343],[250, 239, 307, 352],[374, 193, 439, 316],[460, 239, 521, 362],[461, 444, 543, 628],[538, 242, 591, 358],[591, 281, 643, 393],[663, 202, 721, 314],[672, 346, 739, 481],[821, 375, 889, 513],[849, 156, 915, 287];cake:[356, 843, 581, 931].') 2023-11-03 01:45:55.047 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:45:55.047 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'An early 20thC ornate rectangular five-plate bevelled edged Wall Mirror, the frame [076, 004, 994, 841] formed of - Image 3 of 4') 2023-11-03 01:46:14.990 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:46:14.990 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Street Lights:[200, 587, 269, 873],[669, 585, 734, 877];Person:[252, 807, 319, 960],[453, 534, 504, 595];Flower:[043, 814, 102, 862],[873, 823, 932, 862],[930, 825, 1000, 868];Satchel:[263, 860, 309, 897].') 2023-11-03 01:46:18.569 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:46:18.570 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Lamp:[030, 001, 084, 127],[107, 019, 143, 058],[238, 040, 348, 076];Person:[061, 368, 112, 417],[109, 372, 180, 421],[136, 108, 602, 801],[216, 163, 274, 237],[510, 142, 823, 719],[710, 207, 842, 706],[842, 357, 910, 601],[861, 288, 954, 598],[497, 336, 619, 630];Boots:[210, 662, 301, 794],[323, 656, 368, 774];Sneakers:[577, 637, 692, 717],[656, 619, 700, 665];Cup:[039, 443, 072, 492].') 2023-11-03 01:46:50.380 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:46:50.381 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[414, 177, 548, 326] [628, 426, 765, 558] [074, 467, 209, 582] [655, 599, 830, 697] [451, 371, 618, 508] [312, 534, 470, 684] [058, 293, 288, 389] [504, 710, 643, 800] [062, 644, 203, 753] [715, 290, 918, 388] Strawberries in a group on gray background photo') 2023-11-03 01:47:09.504 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:47:09.505 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[396, 460, 476, 802].') 2023-11-03 01:47:30.033 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:47:30.034 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Potted Plant:[000, 611, 352, 919];Desk:[082, 836, 457, 1000],[393, 680, 639, 951];Storage box:[436, 548, 637, 767];Cup:[268, 773, 354, 919],[600, 781, 643, 886];Canned:[601, 774, 642, 885];Backpack:[646, 340, 997, 985];Hat:[608, 102, 765, 289],[905, 309, 940, 365];Person:[168, 192, 411, 878],[577, 103, 831, 1000],[747, 310, 783, 350],[899, 308, 954, 448],[931, 252, 987, 394].') 2023-11-03 01:48:24.465 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 01:48:24.466 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Person:[152, 223, 562, 999],[475, 176, 964, 1000];Car:[670, 319, 956, 501],[218, 232, 381, 349],[002, 253, 326, 539].') 2023-11-03 01:48:31.482 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:48:31.482 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[374, 281, 562, 1000] in this video clip.Do not deviate from the established classFrame t:[xmin,ymin,xmax,ymax] format in your answer.', 'personFrame1:[374, 281, 562, 1000];Frame2:[367, 298, 602, 1000].') 2023-11-03 01:48:41.150 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:48:41.150 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'person:[182, 000, 811, 371],[197, 000, 999, 803];drink:[000, 036, 061, 186];clothing:[178, 000, 899, 293],[664, 000, 999, 828].') 2023-11-03 01:48:50.275 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:48:50.276 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Car:[047, 129, 868, 878],[000, 277, 315, 624],[000, 302, 106, 398],[062, 276, 097, 311];Street Lights:[082, 159, 109, 270],[055, 192, 083, 295],[882, 130, 913, 289];Truck:[104, 258, 178, 287],[931, 243, 975, 293];Heavy Truck:[810, 215, 999, 268].') 2023-11-03 01:49:15.562 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 01:49:15.562 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'wheel:[458, 590, 654, 849],[085, 478, 145, 650],[961, 450, 999, 575];truck:[000, 109, 990, 866].') 2023-11-03 01:49:21.778 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:49:21.778 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[477, 418, 539, 497],carFrame1:[836, 406, 897, 467],carFrame1:[062, 435, 194, 557] in this video clip.When answering, maintain the format classFrame t:[xmin,ymin,xmax,ymax] without deviation.', 'carFrame1:[477, 418, 539, 497];Frame2:[493, 420, 569, 523],carFrame1:[836, 406, 897, 467];Frame2:[948, 409, 999, 482],carFrame1:[062, 435, 194, 557];Frame2:[000, 448, 101, 624].') 2023-11-03 01:49:23.493 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:49:23.493 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Person:[465, 311, 667, 572].') 2023-11-03 01:49:30.292 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:49:30.292 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Stool,Lamp and Cabinet/shelf in this image.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Stool:[001, 827, 039, 996],[001, 827, 039, 996];Lamp:[617, 117, 784, 188],[317, 158, 485, 203],[021, 001, 368, 063],[001, 095, 045, 143],[617, 117, 784, 188],[317, 158, 485, 203],[021, 001, 368, 063],[001, 095, 045, 143];Cabinet:[306, 248, 398, 361],[394, 254, 464, 352],[462, 260, 512, 353],[596, 260, 642, 362],[625, 258, 691, 321],[686, 258, 765, 310],[760, 255, 850, 343],[669, 421, 742, 526],[306, 248, 398, 361],[394, 254, 464, 352],[462, 260, 512, 353],[596, 260, 642, 362],[625, 258, 691, 321],[686, 258, 765, 310],[760, 255, 850, 343],[669, 421, 742, 526].') 2023-11-03 01:50:14.846 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:50:14.846 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[263, 559, 341, 681],carFrame1:[917, 608, 999, 845],carFrame1:[616, 551, 704, 701],carFrame1:[686, 529, 920, 774],carFrame1:[177, 579, 299, 698] in this video clip.For clarity in your response, always adopt the classFrame t:[xmin,ymin,xmax,ymax] format.', 'carFrame1:[263, 559, 341, 681];Frame2:[200, 569, 291, 719],carFrame1[917, 608, 999, 845];Frame2:No object,carFrame1:[616, 551, 704, 701];Frame2:[722, 482, 999, 790],carFrame1[686, 529, 920, 774];Frame2:No object,carFrame1:[177, 579, 299, 698];Frame2:[000, 573, 212, 774].') 2023-11-03 01:50:19.127 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:50:19.127 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'person:[778, 555, 886, 930],[026, 055, 265, 925],[251, 842, 443, 999],[394, 806, 549, 966],[421, 582, 482, 734],[448, 520, 517, 745],[503, 463, 605, 792],[567, 582, 673, 902],[673, 495, 751, 756],[722, 465, 841, 811];human body:[033, 159, 272, 962],[269, 870, 409, 999],[368, 543, 510, 755],[405, 820, 530, 999],[511, 504, 601, 788],[526, 936, 641, 999],[577, 590, 680, 903],[668, 541, 735, 744],[721, 508, 847, 807],[783, 602, 891, 901];mammal:[017, 107, 265, 905],[263, 851, 408, 999],[356, 462, 900, 903],[398, 815, 536, 982],[510, 901, 641, 999];clothing:[025, 168, 259, 883],[271, 851, 640, 999],[415, 498, 892, 821].') 2023-11-03 01:50:38.759 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:50:38.759 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Tent:[085, 252, 369, 447],[457, 266, 712, 496],[655, 295, 840, 394];Person:[184, 378, 229, 543],[326, 536, 407, 691];Barrel:[467, 662, 518, 752],[203, 485, 241, 542].') 2023-11-03 01:50:39.949 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:50:39.949 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'TV:[181, 212, 350, 550];Remote:[158, 529, 213, 547];Desk:[315, 671, 1000, 1000];Chair:[346, 727, 720, 1000];Mirror:[556, 002, 999, 621];Lamp:[561, 232, 999, 861],[753, 286, 994, 578],[889, 394, 935, 486],[583, 462, 612, 530],[112, 235, 165, 272];Nightstand:[851, 545, 933, 568],[576, 528, 630, 537];Bed:[638, 370, 871, 563];Pillow:[647, 471, 733, 540],[723, 475, 832, 552];Cabinet:[041, 540, 113, 979],[181, 574, 502, 1000];Stool:[000, 678, 041, 857],[018, 717, 042, 886];Frame:[102, 159, 206, 514];Telephone:[901, 534, 935, 557].') 2023-11-03 01:50:40.301 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:50:40.301 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Person:[294, 222, 759, 1000].') 2023-11-03 01:51:07.217 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:51:07.218 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A little girl [234, 048, 705, 957] in a veil [258, 055, 555, 542] holds framed photographs of her late mother [496, 573, 652, 920] and father') 2023-11-03 01:51:18.343 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:51:18.344 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'human hair:[180, 000, 787, 891];human face:[207, 061, 812, 914],[513, 291, 798, 748];human mouth:[734, 569, 768, 642];girl:[151, 000, 796, 999];human nose:[738, 396, 800, 540].') 2023-11-03 01:51:20.791 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:51:20.791 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Crane:[004, 001, 297, 828],[443, 151, 483, 882],[296, 408, 367, 834].') 2023-11-03 01:51:24.303 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 01:51:24.303 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Cabinet/shelf in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Cabinet:[101, 002, 841, 1000].') 2023-11-03 01:51:31.101 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:51:31.101 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[515, 301, 603, 522],personFrame1:[335, 486, 453, 800],personFrame1:[866, 284, 956, 544] in this video clip.Your reply should be in alignment with the classFrame t:[xmin,ymin,xmax,ymax] structure.', 'personFrame1:[515, 301, 603, 522];Frame2:[515, 298, 582, 526],personFrame1:[335, 486, 453, 800];Frame2:[344, 505, 434, 806],personFrame1:[866, 284, 956, 544];Frame2:[875, 300, 955, 551].') 2023-11-03 01:52:25.115 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:52:25.115 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'snack:[161, 637, 321, 783],[000, 410, 159, 772],[080, 902, 159, 999],[164, 400, 366, 764],[228, 879, 335, 999],[355, 461, 464, 563],[356, 840, 468, 931],[360, 554, 490, 683],[479, 377, 557, 456],[484, 829, 592, 922],[495, 955, 634, 999],[497, 884, 616, 964],[624, 782, 685, 875],[638, 436, 711, 504],[662, 520, 755, 595],[681, 739, 740, 832],[697, 905, 798, 997],[719, 385, 771, 449],[750, 334, 999, 544],[753, 672, 999, 935],[777, 441, 861, 503],[786, 494, 868, 571],[796, 851, 878, 999];baked goods:[171, 629, 325, 785],[000, 422, 163, 768],[093, 893, 160, 999],[162, 452, 256, 551],[191, 402, 272, 464],[216, 527, 331, 637],[225, 878, 333, 999],[255, 436, 356, 548],[278, 401, 368, 474],[353, 841, 471, 931],[360, 459, 466, 562],[363, 550, 495, 682],[471, 380, 558, 454],[491, 827, 633, 999],[620, 778, 696, 876],[638, 425, 710, 500],[665, 502, 749, 597],[678, 740, 745, 823],[682, 342, 749, 399],[700, 909, 800, 999],[701, 387, 773, 450],[751, 675, 820, 771],[753, 321, 999, 555],[758, 764, 836, 871],[778, 437, 858, 502],[791, 494, 871, 570],[794, 855, 880, 999],[825, 686, 886, 785],[868, 740, 942, 848],[908, 841, 999, 939];dessert:[000, 416, 156, 770],[080, 784, 646, 995],[166, 398, 508, 752],[625, 671, 989, 999],[686, 346, 999, 573].') 2023-11-03 01:52:32.700 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:52:32.700 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[275, 001, 695, 390] and track its trajectory.Use the specified Frame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'This is a space shuttleFrame1:[275, 001, 695, 390];Frame2:[282, 001, 689, 375];Frame3:[300, 001, 682, 370];Frame4:[309, 034, 679, 387].') 2023-11-03 01:52:51.132 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:52:51.133 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'tower:[000, 000, 997, 996].') 2023-11-03 01:53:11.486 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:53:11.487 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect girl,woman,tree and artichoke in this image.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'girl:[740, 473, 780, 661];woman:[573, 380, 674, 881],[782, 447, 818, 533],[836, 423, 886, 531],[876, 406, 910, 525];tree:[005, 000, 209, 438],[100, 000, 538, 509],[636, 118, 776, 426],[776, 163, 928, 409],[895, 174, 999, 280].') 2023-11-03 01:53:13.215 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:53:13.215 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Potted Plant:[347, 360, 415, 469],[611, 377, 655, 469],[894, 387, 927, 472],[093, 388, 118, 446],[290, 378, 324, 434];Flower:[829, 386, 892, 423],[963, 426, 999, 482];Soccer:[371, 311, 401, 348];Person:[171, 493, 390, 617],[360, 791, 524, 967],[751, 323, 809, 529].') 2023-11-03 01:53:14.296 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:53:14.296 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[732, 105, 765, 287] and track its trajectory.Incorporate the Frame t:[xmin,ymin,xmax,ymax] format when presenting trajectories in your response.', 'This is a personFrame1:[732, 105, 765, 287];Frame2:[730, 072, 767, 254];Frame3:[717, 075, 753, 261];Frame4:[706, 050, 743, 234];Frame5:[695, 008, 728, 180].') 2023-11-03 01:53:25.485 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:53:25.485 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[521, 025, 826, 934] Justin Jackson runs with the ball. The senior running back helped lead the Wildcats to a one-sided win over Purdue last fall.') 2023-11-03 01:53:36.104 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:53:36.104 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[001, 158, 451, 998] A colorist is working on color grading with [044, 362, 296, 650] [658, 249, 986, 736] [338, 192, 674, 658] three professional monitors.') 2023-11-03 01:53:49.901 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:53:49.902 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[485, 368, 591, 626] and track its trajectory.Ensure that each trajectory within your response is articulated using the Frame t:[xmin,ymin,xmax,ymax] specification.', 'This is a jinrikishaFrame1:[485, 368, 591, 626];Frame2:[468, 377, 570, 618];Frame3:[465, 377, 568, 618];Frame4:[463, 377, 568, 618];Frame5:[463, 370, 568, 616].') 2023-11-03 01:53:59.151 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:53:59.152 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Bus:[795, 316, 849, 405];Car:[577, 357, 604, 419];Buttefly:[804, 052, 847, 103];Trash bin Can:[844, 382, 863, 440];Potted Plant:[123, 377, 196, 462],[360, 389, 394, 438];Backpack:[275, 326, 372, 558],[697, 265, 837, 542];Satchel:[680, 481, 762, 578],[854, 310, 879, 387];Sandals:[274, 828, 306, 881],[234, 825, 279, 879];Leather Shoes:[409, 713, 438, 751],[657, 776, 694, 812],[725, 777, 745, 808],[980, 753, 1000, 780];Sneakers:[474, 869, 515, 927],[519, 867, 558, 927];Other Shoes:[676, 837, 733, 912],[725, 883, 782, 927],[894, 504, 907, 531],[878, 510, 890, 529],[615, 453, 623, 462];Glasses:[672, 168, 696, 188],[499, 246, 541, 261];Cell Phone:[244, 490, 275, 536];Person:[199, 229, 315, 883],[395, 136, 506, 752],[430, 220, 596, 928],[595, 337, 624, 464],[674, 161, 795, 928],[622, 135, 746, 812],[855, 274, 924, 532],[964, 261, 1000, 782];Watch:[574, 430, 588, 463],[295, 424, 302, 446];Necklace:[458, 237, 478, 249],[242, 332, 259, 362].') 2023-11-03 01:54:33.001 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:54:33.001 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[347, 264, 764, 1000].') 2023-11-03 01:54:43.978 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 01:54:43.978 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[050, 002, 1000, 704];Drum:[333, 419, 1000, 750],[002, 617, 381, 782];Cymbal:[002, 321, 369, 480].') 2023-11-03 01:54:55.779 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:54:55.779 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[348, 220, 748, 696] a super trendy gold chain necklace with [497, 630, 566, 693] a sphere and [350, 394, 401, 452] a lock is an edgy solution to accessorize your look') 2023-11-03 01:54:59.001 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:54:59.001 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect paper cutter,goggles and glasses in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'goggles:[386, 394, 701, 530];glasses:[395, 363, 742, 506].') 2023-11-03 01:55:04.554 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:55:04.555 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'shorts:[192, 658, 378, 851],[012, 423, 056, 489],[424, 561, 538, 717];human body:[394, 189, 592, 954],[006, 304, 065, 562],[057, 285, 098, 374],[065, 298, 178, 793],[150, 310, 205, 587],[175, 103, 434, 999],[185, 262, 214, 352],[579, 241, 736, 900];woman:[047, 282, 180, 796],[057, 282, 102, 382],[153, 301, 199, 524];human hair:[629, 237, 699, 382],[094, 297, 134, 364],[277, 099, 369, 233],[464, 178, 538, 273];man:[384, 162, 605, 982],[172, 098, 440, 999],[590, 238, 732, 890];footwear:[658, 808, 723, 902];jeans:[608, 622, 710, 888];human leg:[010, 478, 052, 569],[019, 484, 033, 551],[029, 480, 040, 567],[039, 573, 107, 772],[075, 642, 115, 776],[120, 623, 150, 789],[225, 673, 310, 999],[228, 817, 292, 999],[307, 669, 379, 999],[317, 811, 355, 995],[425, 580, 492, 999],[433, 704, 530, 999],[471, 592, 533, 999],[585, 556, 637, 896],[645, 546, 738, 894];human face:[298, 137, 363, 261],[485, 215, 548, 323],[666, 288, 688, 351];human arm:[001, 346, 019, 436],[045, 353, 066, 460],[053, 367, 092, 561],[057, 315, 069, 382],[153, 369, 180, 451],[180, 305, 325, 580],[312, 307, 430, 588],[389, 367, 454, 762],[555, 390, 645, 743],[595, 346, 641, 626].') 2023-11-03 01:55:25.309 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:55:25.309 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'land vehicle:[901, 665, 1000, 877];window:[150, 422, 218, 583],[000, 100, 088, 287],[000, 685, 055, 833],[002, 366, 070, 566],[172, 189, 236, 337],[261, 460, 323, 592],[263, 224, 313, 375],[353, 251, 409, 395],[361, 484, 405, 607],[453, 115, 493, 235],[460, 277, 501, 363],[467, 475, 508, 557],[507, 158, 547, 261],[516, 313, 555, 411],[528, 484, 570, 587],[558, 189, 595, 285],[570, 347, 610, 430],[583, 497, 626, 602],[620, 240, 648, 323],[636, 376, 654, 445],[653, 528, 677, 605],[663, 268, 684, 339],[693, 292, 717, 355],[693, 539, 719, 615],[734, 318, 758, 389];house:[000, 014, 140, 834],[000, 000, 158, 081],[095, 105, 426, 816],[285, 052, 999, 908].') 2023-11-03 01:55:28.925 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:55:28.925 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'For One Person Person Person Mini pot + pot full set / For RICE Single Person / No Electronic ee5436') 2023-11-03 01:55:34.795 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:55:34.796 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect skunk,man,clothing,car and auto part in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'man:[268, 286, 998, 998];clothing:[144, 125, 464, 869],[281, 346, 998, 998];car:[014, 026, 851, 939];auto part:[000, 000, 998, 993].') 2023-11-03 01:55:51.190 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:55:51.191 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Picture:[125, 071, 231, 277];Lamp:[485, 248, 555, 588];Desk:[376, 530, 495, 613];Couch:[001, 549, 476, 954],[479, 472, 996, 914];Pillow:[513, 524, 621, 634];Carpet:[001, 801, 797, 1000],[658, 842, 998, 998].') 2023-11-03 01:55:53.496 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 01:55:53.496 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Chair:[268, 640, 320, 681],[405, 658, 448, 687];Trash bin Can:[951, 654, 999, 801];Stop Sign:[771, 300, 871, 435];Sports Car:[149, 613, 541, 955];Street Lights:[188, 087, 276, 674],[535, 305, 587, 723];Pickup Truck:[111, 663, 148, 694],[564, 692, 699, 718],[827, 687, 874, 715];SUV:[614, 650, 688, 678];Person:[234, 641, 280, 690].') 2023-11-03 01:56:26.032 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:56:26.033 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Person:[003, 080, 949, 1000];Sneakers:[417, 787, 737, 982],[003, 807, 097, 1000].') 2023-11-03 01:56:54.014 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:56:54.015 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Person:[232, 114, 757, 907],[638, 335, 791, 443],[189, 285, 251, 346],[067, 262, 129, 324];Van:[829, 195, 1000, 314];Bench:[001, 563, 530, 1000],[570, 524, 1000, 920];Chair:[087, 273, 141, 325].') 2023-11-03 01:56:57.060 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:56:57.060 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Desk:[303, 803, 999, 999];Sneakers:[742, 769, 818, 819];Hat:[088, 519, 188, 685],[241, 456, 315, 624],[384, 364, 461, 524],[479, 335, 539, 472],[565, 276, 630, 414],[665, 276, 746, 352];Bakset:[608, 644, 732, 815];Person:[002, 606, 205, 999].') 2023-11-03 01:56:57.476 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:56:57.476 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[496, 244, 527, 314] and track its trajectory.Any trajectory details should be arranged according to the Frame t:[xmin,ymin,xmax,ymax] format for clarity.', 'This is a faceFrame1:[496, 244, 527, 314];Frame2:[600, 210, 632, 283];Frame3:[620, 200, 654, 271];Frame4:[699, 309, 732, 384];Frame5:[633, 221, 666, 293].') 2023-11-03 01:57:07.533 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:57:07.533 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'salad in [022, 024, 624, 978] a jar, with [868, 498, 996, 779] yellow squash, red bell pepper, [094, 229, 508, 511] [167, 661, 529, 924] lettuce, garbanzo beans, and dressing') 2023-11-03 01:57:08.556 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:57:08.557 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[202, 154, 873, 707] Chapter 26 DC Circuits. I Junction rule: The sum of currents entering a junction equals the sum of the currents leaving it. [193, 725, 658, 940] 26-3 Kirchhoff’s Rules.') 2023-11-03 01:57:15.026 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:57:15.026 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Lamp:[560, 285, 607, 450];Nightstand:[501, 425, 628, 497];Stool:[154, 646, 413, 999];Bed:[230, 234, 989, 999];Desk:[921, 561, 1000, 799].') 2023-11-03 01:57:24.020 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:57:24.020 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[000, 474, 371, 919] and track its trajectory.Stick to the Frame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'This is a giant pandaFrame1:[000, 474, 371, 919];Frame2:[061, 551, 454, 998];Frame3:[180, 484, 513, 955];Frame4:[228, 507, 533, 998];Frame5:[250, 442, 527, 916].') 2023-11-03 01:57:39.397 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:57:39.398 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'tower:[064, 360, 126, 508],[099, 449, 126, 513],[125, 387, 171, 554],[168, 445, 202, 542],[224, 473, 260, 549],[283, 456, 310, 563],[321, 519, 413, 756],[336, 416, 356, 487],[360, 411, 423, 521],[374, 838, 419, 978],[388, 442, 450, 618],[419, 382, 445, 454],[450, 465, 513, 575],[533, 376, 605, 571],[598, 549, 635, 626],[636, 315, 688, 384],[638, 452, 675, 563],[677, 516, 729, 607],[689, 335, 751, 515],[701, 470, 760, 545],[759, 452, 796, 518],[765, 502, 808, 587],[786, 590, 832, 689],[830, 376, 855, 484],[891, 444, 913, 515],[911, 392, 950, 490],[951, 440, 991, 549];skyscraper:[324, 516, 416, 771],[635, 305, 688, 383],[690, 339, 751, 521],[830, 326, 860, 390].') 2023-11-03 01:57:41.234 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:57:41.235 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Street Lights:[137, 207, 231, 736],[285, 341, 344, 542],[352, 408, 392, 642],[383, 443, 416, 597],[717, 196, 816, 734],[627, 334, 692, 668];Person:[592, 522, 659, 816],[508, 551, 587, 806],[451, 546, 520, 803],[276, 541, 351, 832],[350, 617, 398, 702];Awning:[901, 406, 1000, 462].') 2023-11-03 01:57:43.888 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:57:43.888 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'bird:[339, 291, 666, 482].') 2023-11-03 01:58:08.404 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:58:08.404 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[386, 231, 774, 766] and track its trajectory.Stick to the Frame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'This is a giant pandaFrame1:[386, 231, 774, 766];Frame2:[408, 206, 735, 758];Frame3:[389, 166, 738, 719];Frame4:[432, 184, 709, 712];Frame5:[390, 217, 708, 784].') 2023-11-03 01:58:16.189 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:58:16.189 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A man [652, 611, 684, 682] walks across the ice [003, 609, 994, 996] in Gjoa Haven.') 2023-11-03 01:58:33.899 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 01:58:33.899 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[394, 588, 579, 1000],[566, 687, 743, 1000],[719, 681, 882, 998].') 2023-11-03 01:58:39.693 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:58:39.694 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'cattle:[743, 437, 890, 670],[070, 446, 134, 633],[158, 414, 244, 611],[232, 449, 285, 608].') 2023-11-03 01:58:40.678 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:58:40.678 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2 and frame3, please tell me what this oneframe:1:[496, 001, 998, 998];frame:2:[496, 001, 998, 998];frame:3:[481, 001, 998, 998] is doing?Briefly articulate your response by taking cues from the provided image frames and trajectory coordinates.', 'This is a/an woman watching two frolicking dogs') 2023-11-03 01:58:47.317 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:58:47.318 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'table:[263, 540, 789, 963];clothing:[393, 399, 575, 595],[231, 493, 523, 975],[497, 080, 516, 146],[513, 076, 528, 145],[546, 748, 608, 830],[874, 241, 963, 398];person:[233, 385, 570, 999],[413, 340, 603, 920],[867, 191, 959, 329];bookcase:[000, 000, 479, 869];book:[000, 284, 212, 426],[000, 146, 271, 268],[005, 405, 226, 568],[008, 540, 235, 700],[233, 384, 455, 534],[243, 279, 459, 382],[269, 146, 449, 258],[590, 544, 708, 607];building:[000, 000, 999, 999];shelf:[000, 286, 243, 425],[000, 105, 272, 281],[002, 405, 210, 692],[244, 290, 460, 398],[258, 136, 462, 259],[273, 365, 481, 589],[485, 193, 626, 364],[612, 180, 823, 360],[812, 170, 999, 372].') 2023-11-03 01:58:49.244 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:58:49.244 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the the student dismounting from the rear of the bicycle..Use the specified TrackiFrame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'There is one the student dismounting from the rear of the bicycle..Track1frame:1:[000, 266, 005, 368];frame:2:[000, 273, 003, 347];frame:3:[000, 270, 002, 363].') 2023-11-03 01:58:55.009 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:58:55.010 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'woman:[712, 022, 971, 912],[000, 003, 325, 998],[243, 007, 545, 998],[336, 035, 778, 998];dress:[000, 283, 280, 722],[320, 240, 533, 726],[513, 324, 785, 988];girl:[000, 000, 340, 998],[233, 011, 546, 998],[337, 027, 785, 998],[730, 022, 998, 998];human face:[232, 052, 321, 240],[382, 061, 465, 268],[612, 093, 710, 310],[751, 093, 851, 312].') 2023-11-03 01:59:24.763 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 01:59:24.764 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[347, 499, 395, 688],personFrame1:[448, 497, 501, 691],personFrame1:[564, 422, 601, 603],personFrame1:[576, 556, 620, 734],personFrame1:[579, 479, 621, 683] in this video clip.Do not deviate from the established classFrame t:[xmin,ymin,xmax,ymax] format in your answer.', 'personFrame1:[347, 499, 395, 688];Frame2:[349, 500, 386, 683],personFrame1:[448, 497, 501, 691];Frame2:[446, 498, 501, 691],personFrame1:[564, 422, 601, 603];Frame2:[547, 428, 585, 600],personFrame1:[576, 556, 620, 734];Frame2:[566, 548, 619, 730],personFrame1:[579, 479, 621, 683];Frame2:[566, 479, 616, 677].') 2023-11-03 02:02:20.121 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:02:20.122 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Give your taste buds a kick with [068, 005, 809, 948] this BBQ Cole Slaw - it has tangy bbq sauce as the base for the dressing!') 2023-11-03 02:02:52.579 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:02:52.580 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the monkey walking right on the ground, then sitting.Always default to the TrackiFrame t:[xmin,ymin,xmax,ymax] pattern when discussing trajectories in your answer.', 'There is one monkey walking right on the ground, then sitting.Track1frame:1:[187, 828, 255, 857];frame:3:[223, 784, 293, 821].') 2023-11-03 02:04:05.248 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:04:05.248 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Bicycle,Person and Helmet in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Bicycle:[186, 203, 692, 871];Person:[460, 031, 863, 741];Helmet:[679, 031, 820, 218].') 2023-11-03 02:04:11.212 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:04:11.212 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Flyer [002, 003, 995, 994] for Emissary's first and only show at the Concert Populaire festival. July 3rd 1993.") 2023-11-03 02:04:15.650 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:04:15.650 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Los 202 - An antique late Victorian diamond bow brooch [092, 357, 927, 669] in platinum, designed as a double tied bow, jewelled') 2023-11-03 02:04:22.140 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:04:22.141 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'After 8 months, the Magic Island parking lot [000, 387, 997, 997] is back open') 2023-11-03 02:04:23.833 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:04:23.834 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect human face,glasses and man in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'human face:[025, 309, 281, 591],[223, 574, 388, 827],[283, 270, 390, 474],[402, 445, 560, 746],[573, 225, 716, 467],[616, 508, 797, 824],[884, 346, 961, 470];glasses:[129, 325, 251, 524],[394, 537, 572, 593],[595, 580, 798, 661];man:[000, 185, 999, 999].') 2023-11-03 02:04:58.293 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:04:58.294 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Groom hugging [303, 039, 811, 993] a friend on [150, 337, 999, 995] the dance floor during wedding reception') 2023-11-03 02:05:14.302 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:05:14.302 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A letter [019, 007, 962, 992] from Israel Shreve to his wife in 1776.') 2023-11-03 02:05:18.962 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:05:18.962 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Hunter / gatherers [079, 598, 168, 814] [258, 631, 382, 944] [163, 670, 272, 915] [851, 599, 949, 844] [739, 583, 798, 792] attack a wooly mammoth [256, 104, 904, 818]') 2023-11-03 02:06:14.214 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:06:14.215 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[247, 297, 549, 600] and track its trajectory.Use the specified Frame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'This is a polo ponyFrame1:[247, 297, 549, 600];Frame2:[340, 235, 677, 612];Frame3:[446, 251, 779, 607];Frame4:[596, 238, 837, 594];Frame5:[761, 251, 858, 550].') 2023-11-03 02:06:23.242 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:06:23.242 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "A worker [388, 002, 996, 995] inspects the Starmobile phones [001, 651, 208, 860] [273, 590, 450, 788] [097, 466, 303, 639] [182, 291, 373, 446] [178, 518, 388, 708] [374, 305, 564, 616] [004, 403, 209, 574] [151, 812, 398, 996] [455, 250, 655, 579] assembled in the company's facility.") 2023-11-03 02:06:31.706 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:06:31.706 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'person [242, 448, 662, 998] typing on a computer [222, 129, 762, 818] with question mark background') 2023-11-03 02:06:48.164 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:06:48.164 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Chair:[000, 898, 080, 998],[034, 860, 106, 992],[000, 818, 041, 917],[053, 824, 096, 862],[094, 848, 128, 944],[136, 813, 214, 947],[197, 795, 257, 916],[119, 799, 182, 908];Bench:[281, 828, 438, 969],[288, 769, 472, 828],[573, 861, 799, 1000],[791, 788, 1000, 895];Piano:[526, 737, 647, 881].') 2023-11-03 02:07:24.810 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 02:07:24.811 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'maple:[000, 115, 999, 654].') 2023-11-03 02:07:44.262 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:07:44.262 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Fits 26 to 55 inch flat screen tv up to 88 LBS Load Capacity and [048, 031, 952, 976] 400x400 vesa fixed tv wall mount bracket stand') 2023-11-03 02:08:08.569 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:08:08.569 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'girl:[590, 829, 746, 1000],[153, 841, 313, 999],[320, 714, 522, 999],[908, 772, 999, 999];clothing:[006, 433, 189, 982],[146, 666, 376, 986],[150, 964, 313, 999],[305, 891, 478, 999],[396, 164, 641, 821],[462, 885, 575, 999],[594, 944, 744, 999],[813, 002, 888, 048],[886, 013, 971, 166],[938, 933, 999, 999];woman:[000, 291, 206, 999],[146, 585, 423, 975],[150, 833, 317, 999],[410, 614, 531, 902],[463, 721, 625, 999],[590, 829, 728, 999],[660, 764, 781, 999];human face:[081, 431, 127, 561],[116, 411, 162, 568],[375, 667, 425, 762],[467, 658, 528, 803],[477, 786, 505, 909],[480, 054, 544, 188],[575, 868, 613, 969],[748, 843, 782, 970].') 2023-11-03 02:08:32.899 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:08:32.899 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Other Shoes:[212, 848, 269, 905],[730, 549, 764, 627],[811, 542, 849, 614],[884, 513, 947, 613],[623, 265, 672, 341],[538, 260, 620, 365],[876, 210, 909, 270],[831, 214, 863, 274];Volleyball:[449, 053, 584, 258];Person:[882, 001, 1000, 606],[783, 001, 938, 262],[697, 107, 852, 626],[535, 001, 764, 390],[432, 002, 587, 323],[247, 094, 557, 999],[000, 254, 379, 999].') 2023-11-03 02:08:43.186 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:08:43.186 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[221, 418, 734, 987] and track its trajectory.If you're including trajectory details in your reply, the Frame t:[xmin,ymin,xmax,ymax] format is imperative.", 'This is a alpacaFrame1:[221, 418, 734, 987];Frame2:[209, 320, 710, 956];Frame3:[153, 272, 690, 968];Frame4:[239, 247, 765, 962];Frame5:[243, 233, 771, 943].') 2023-11-03 02:08:51.125 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:08:51.125 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given frame1: and frame2:,track carFrame1:[442, 343, 520, 454],truckFrame1:[943, 230, 999, 592],carFrame1:[283, 377, 461, 581] in this video clip.It's imperative that you stick to the categoryFrame t:[xmin,ymin,xmax,ymax] format for your answer.", 'carFrame1:[442, 343, 520, 454];Frame2:[398, 340, 491, 488],truckFrame1[943, 230, 999, 592];Frame2:No object,carFrame1:[283, 377, 461, 581];Frame2:[000, 394, 373, 762].') 2023-11-03 02:09:08.873 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:09:08.873 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Cup:[179, 629, 237, 680],[586, 514, 654, 577];Plate:[160, 667, 254, 687],[277, 695, 386, 720];Spoon:[098, 671, 181, 691];Other Shoes:[090, 565, 116, 590];Satchel:[624, 782, 788, 998];Dinning Table:[001, 668, 467, 999],[444, 482, 610, 604],[916, 488, 999, 837],[001, 480, 099, 623];Chair:[170, 579, 542, 976],[406, 634, 881, 999],[705, 831, 999, 999];Stool:[806, 514, 999, 820],[407, 504, 547, 703],[037, 495, 159, 632],[165, 494, 213, 587];Person:[120, 400, 834, 1000],[010, 396, 122, 591].') 2023-11-03 02:09:19.038 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:09:19.038 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Helmet in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Helmet:[219, 315, 304, 434].') 2023-11-03 02:09:23.323 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:09:23.324 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect footwear,tree and person in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'footwear:[450, 792, 564, 836],[622, 741, 744, 808],[729, 731, 822, 761];tree:[000, 022, 1000, 453],[569, 234, 997, 468];person:[000, 460, 586, 863],[296, 443, 768, 795],[530, 420, 941, 628].') 2023-11-03 02:10:18.450 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:10:18.450 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Storage box:[001, 162, 442, 613];Couch:[001, 105, 999, 999];Stuffed Toy:[457, 462, 746, 643],[689, 385, 995, 611],[427, 251, 735, 579];Person:[016, 198, 390, 518],[330, 448, 508, 724].') 2023-11-03 02:10:41.988 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:10:41.988 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Person:[631, 494, 940, 1000],[899, 452, 999, 1000],[785, 433, 941, 793],[560, 750, 772, 999],[344, 654, 553, 1000],[196, 517, 362, 999],[022, 646, 219, 951],[002, 871, 034, 1000],[002, 759, 078, 952],[002, 429, 050, 747],[175, 329, 274, 708],[300, 344, 369, 616],[350, 440, 469, 822],[375, 325, 469, 655],[556, 486, 743, 950],[515, 454, 617, 775],[597, 263, 664, 512];Hat:[191, 515, 294, 598],[598, 271, 648, 319];Sneakers:[177, 860, 212, 946],[097, 901, 169, 950].') 2023-11-03 02:11:46.418 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:11:46.419 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'woman:[593, 357, 874, 989],[160, 399, 250, 583],[173, 284, 247, 488],[620, 373, 723, 537],[767, 344, 840, 496];man:[212, 394, 487, 971],[052, 416, 237, 752],[243, 232, 322, 468],[342, 242, 403, 386],[367, 360, 467, 535],[393, 253, 432, 396],[477, 353, 635, 496],[561, 321, 629, 429],[821, 331, 949, 752];clothing:[581, 442, 891, 991],[000, 616, 156, 997],[049, 490, 253, 915],[163, 392, 263, 579],[170, 329, 250, 485],[244, 284, 327, 455],[250, 553, 490, 989],[347, 286, 402, 381],[364, 427, 464, 535],[392, 299, 431, 401],[479, 414, 632, 514],[568, 362, 642, 459],[606, 416, 727, 546],[819, 429, 953, 741],[827, 357, 969, 496],[921, 310, 942, 381],[933, 260, 959, 386];girl:[154, 399, 261, 603],[555, 412, 848, 997];human face:[117, 422, 166, 527],[185, 420, 219, 492],[306, 446, 371, 568],[384, 379, 426, 459],[502, 370, 541, 451],[730, 446, 782, 563],[756, 364, 811, 436],[858, 362, 900, 466].') 2023-11-03 02:12:58.892 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:12:58.892 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'bicycle wheel:[723, 636, 894, 892],[746, 607, 778, 802],[875, 646, 906, 856];woman:[334, 274, 486, 857],[103, 385, 176, 649],[222, 403, 319, 679],[301, 363, 360, 611],[625, 393, 665, 502];man:[452, 195, 610, 871],[000, 393, 158, 693],[152, 376, 239, 641],[214, 367, 256, 437],[218, 395, 284, 450],[597, 396, 672, 662],[639, 378, 676, 452],[699, 387, 764, 521];wheel:[724, 636, 890, 885],[747, 584, 785, 802],[783, 587, 809, 735],[863, 630, 907, 864];clothing:[457, 211, 613, 848],[000, 428, 084, 653],[093, 420, 175, 626],[204, 426, 312, 644],[300, 405, 353, 499],[343, 353, 503, 803],[600, 441, 668, 606],[641, 402, 677, 479],[650, 451, 794, 671],[696, 419, 764, 513],[904, 402, 971, 541];bicycle:[704, 436, 878, 888],[783, 442, 912, 868].') 2023-11-03 02:13:16.088 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:13:16.088 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The bridesmaids [391, 437, 639, 952] [052, 436, 271, 940] [244, 464, 428, 938] [727, 439, 936, 962] [577, 428, 757, 934] were rocking black illusion neckline tops [083, 507, 263, 631] [252, 533, 414, 639] [741, 526, 917, 656] [582, 500, 749, 610] and cropped pants plus monstera leaf bouquets [108, 597, 296, 746] [684, 639, 868, 782] [269, 598, 698, 754]') 2023-11-03 02:13:35.192 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:13:35.192 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Hat:[309, 266, 398, 402];Other Shoes:[451, 748, 594, 869];Chair:[303, 310, 533, 696];Lamp:[713, 142, 859, 388];Person:[454, 363, 979, 966],[462, 275, 812, 927],[404, 255, 661, 837],[263, 266, 433, 780],[827, 002, 1000, 131].') 2023-11-03 02:14:00.049 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:14:00.049 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Plate,Lamp and Person in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Plate:[050, 756, 276, 822];Lamp:[209, 001, 476, 335];Person:[832, 260, 1000, 999],[573, 374, 919, 974],[784, 220, 885, 553],[844, 286, 980, 571],[454, 394, 578, 837],[439, 428, 488, 685],[280, 504, 318, 567],[184, 502, 234, 602],[133, 543, 177, 625],[058, 473, 122, 616],[558, 019, 622, 169],[624, 001, 687, 157],[461, 085, 499, 144],[184, 179, 211, 284].') 2023-11-03 02:15:34.675 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:15:34.676 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'dress:[376, 599, 587, 968];human body:[362, 450, 615, 975],[179, 502, 407, 999],[282, 327, 430, 644],[636, 371, 999, 999],[653, 450, 686, 571],[668, 473, 811, 730],[947, 205, 999, 384];woman:[351, 435, 633, 994],[670, 465, 803, 686];human hair:[414, 430, 568, 689],[251, 549, 299, 620],[298, 500, 395, 628],[690, 466, 775, 604],[794, 380, 923, 536],[941, 455, 999, 523],[949, 200, 999, 344];human head:[790, 378, 936, 589],[298, 498, 402, 650],[325, 327, 376, 415],[425, 425, 575, 630],[700, 469, 769, 569],[936, 467, 999, 534],[952, 200, 999, 375];mammal:[184, 429, 436, 1000],[287, 329, 428, 663],[373, 423, 633, 999],[632, 382, 999, 999],[667, 465, 799, 689],[930, 450, 999, 615],[947, 203, 999, 380];man:[613, 375, 976, 1000],[180, 508, 401, 999],[278, 327, 428, 680];human arm:[585, 704, 647, 860],[280, 437, 358, 549],[294, 639, 396, 828],[362, 575, 392, 726],[362, 430, 431, 550],[640, 657, 790, 999],[668, 575, 701, 678],[710, 587, 806, 686];suit:[177, 598, 394, 999],[283, 408, 431, 679],[637, 560, 995, 993];tableware:[625, 669, 665, 786];human face:[330, 342, 367, 413],[702, 489, 731, 568].') 2023-11-03 02:15:46.482 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:15:46.483 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A living container on the construction site of the basecamp for the German national football team at adidas headquarters. Home Ground [011, 007, 993, 993], football, Germany, adidas headquarters, teamwork, GamePlan A.') 2023-11-03 02:15:53.657 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:15:53.658 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[236, 189, 647, 622],carFrame1:[000, 316, 237, 632],carFrame1:[739, 280, 999, 710] in this video clip.Your reply should be in alignment with the classFrame t:[xmin,ymin,xmax,ymax] structure.', 'carFrame1:[236, 189, 647, 622];Frame2:[236, 189, 647, 622],carFrame1:[000, 316, 237, 632];Frame2:[000, 316, 237, 632],carFrame1:[739, 280, 999, 710];Frame2:[739, 280, 999, 710].') 2023-11-03 02:16:01.888 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:16:01.889 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[012, 013, 884, 996] A blackberry shrub with white flowers and mottled leaves.') 2023-11-03 02:16:16.355 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:16:16.355 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'human hair:[554, 181, 690, 380];mammal:[046, 216, 844, 925];clothing:[029, 255, 578, 930];person:[000, 183, 857, 998];human head:[508, 176, 730, 461],[548, 177, 722, 306].') 2023-11-03 02:16:25.170 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:16:25.170 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'man:[719, 305, 979, 960],[000, 775, 104, 999],[016, 728, 085, 826],[073, 726, 126, 855],[108, 447, 191, 995],[145, 275, 313, 999],[155, 718, 458, 999],[251, 273, 397, 795],[358, 356, 579, 999],[517, 290, 792, 999],[911, 311, 999, 999];clothing:[550, 401, 793, 921],[155, 632, 268, 958],[251, 414, 368, 753],[404, 477, 535, 997],[730, 429, 966, 999];human face:[447, 375, 493, 470],[921, 333, 981, 401].') 2023-11-03 02:16:36.540 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:16:36.541 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A rare George II mahogany chest [073, 079, 870, 909] of drawers of small proportions & superb patina') 2023-11-03 02:16:39.058 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:16:39.058 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Mirror in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Mirror:[815, 151, 936, 321].') 2023-11-03 02:16:40.793 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:16:40.793 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Person:[001, 590, 048, 722],[001, 406, 056, 570],[022, 263, 074, 463],[217, 340, 302, 513],[301, 328, 352, 512],[337, 167, 392, 340],[389, 293, 434, 489],[393, 372, 474, 546],[389, 489, 482, 657],[578, 252, 676, 356],[602, 358, 664, 557],[677, 510, 750, 675],[703, 075, 760, 261],[765, 029, 818, 210],[750, 166, 821, 356],[942, 179, 998, 385].') 2023-11-03 02:16:46.122 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:16:46.122 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, what is this objectframe:1:[326, 338, 439, 579];frame:2:[314, 361, 463, 587];frame:3:[319, 382, 455, 590] doing?Craft a concise reply using the image frames and trajectory specifics you have at hand.', 'This is a/an black bears shifting around') 2023-11-03 02:17:09.952 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:17:09.953 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Bottle [011, 001, 488, 425] of bleach, spray bottle [673, 238, 869, 395], purple t shirt [017, 399, 651, 775] and a printed silhouette [632, 497, 853, 732] of Poppy the troll') 2023-11-03 02:17:24.044 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:17:24.044 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Drum:[618, 538, 685, 661];Belt:[675, 373, 725, 483];Person:[547, 241, 852, 780],[803, 078, 948, 678],[902, 057, 1000, 710],[734, 197, 830, 649],[628, 133, 728, 650],[458, 141, 599, 663],[359, 237, 500, 729],[331, 142, 469, 657],[191, 162, 315, 670],[083, 137, 177, 670],[002, 122, 098, 698],[002, 132, 048, 353].') 2023-11-03 02:18:01.227 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:18:01.227 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Rear view of engineer [365, 246, 477, 876] standing at a wind farm') 2023-11-03 02:18:19.605 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:18:19.606 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'cabinetry:[534, 510, 708, 791],[000, 000, 125, 338],[008, 730, 280, 999],[117, 000, 275, 286],[265, 151, 373, 336],[360, 593, 386, 849],[544, 249, 649, 423],[651, 245, 716, 441],[678, 582, 745, 818];drawer:[538, 550, 685, 787],[015, 739, 260, 999],[362, 595, 380, 660];kitchen appliance:[065, 490, 366, 959],[110, 509, 218, 648],[235, 467, 304, 596],[603, 461, 619, 547],[615, 484, 631, 556],[660, 492, 729, 553];sink:[775, 568, 956, 646];home appliance:[000, 500, 368, 999],[113, 192, 326, 377],[284, 351, 449, 821],[670, 584, 749, 865];countertop:[530, 535, 999, 999];window:[770, 106, 999, 577].') 2023-11-03 02:18:35.261 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:18:35.262 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'animal:[350, 330, 616, 894].') 2023-11-03 02:18:51.159 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:18:51.159 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[290, 112, 925, 656] Terro liquid ant bait is a safe way to get rid of [539, 176, 645, 337] small black ants in [005, 005, 990, 996] the kitchen') 2023-11-03 02:19:05.279 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:19:05.279 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'nothing beats a cozy weekend in with a good book [508, 616, 900, 997] - winter stock videos & royalty-free footage') 2023-11-03 02:19:40.623 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:19:40.624 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Golf Club,Glasses,Hat and Traffic cone in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Golf Club:[280, 430, 457, 870];Glasses:[284, 062, 357, 141];Hat:[228, 033, 357, 150].') 2023-11-03 02:19:48.277 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 02:19:48.277 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'flower:[078, 041, 898, 995].') 2023-11-03 02:20:29.822 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:20:29.823 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Jar [277, 264, 773, 779] of homemade feijoa chutney on a board [002, 607, 994, 995], with vintage spoon [389, 663, 981, 872] and fresh feijoas [005, 399, 320, 843], stacked jars of chutney in the background, and text overlay, feijoa chutney, The Aussie Home Cook.') 2023-11-03 02:20:33.421 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:20:33.421 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, what is this objectframe:1:[171, 269, 216, 366];frame:2:[171, 269, 216, 366];frame:3:[160, 268, 214, 476] doing?Craft a concise reply using the image frames and trajectory specifics you have at hand.', 'This is a/an lady pushing a baby car and walk pass by') 2023-11-03 02:20:37.373 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:20:37.374 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the move to the left-side the move back in this video clip?Your trajectory details should strictly align with the TrackiFrame t:[xmin,ymin,xmax,ymax] format in your submission.', 'There is one move to the left-side the move back.Track1frame:1:[000, 533, 035, 921].') 2023-11-03 02:21:06.712 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:21:06.713 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'boy:[030, 357, 483, 999],[086, 306, 101, 423],[099, 320, 120, 412],[332, 255, 783, 999],[600, 373, 943, 999];woman:[771, 259, 834, 551];man:[045, 305, 079, 436],[068, 296, 089, 428],[157, 298, 205, 484],[256, 293, 271, 392],[261, 300, 285, 401],[329, 303, 356, 388],[384, 284, 411, 380],[422, 293, 478, 482],[599, 274, 628, 413],[627, 282, 649, 407],[662, 280, 681, 357],[706, 244, 778, 551],[828, 287, 840, 380],[841, 284, 858, 368],[853, 287, 868, 399],[870, 280, 904, 368],[880, 278, 912, 401],[904, 289, 931, 368],[977, 280, 999, 432];girl:[674, 265, 725, 382],[761, 270, 833, 552];tree:[000, 000, 152, 268],[016, 000, 401, 378],[717, 114, 794, 277],[761, 136, 851, 293],[821, 106, 906, 286],[883, 000, 999, 222],[896, 085, 999, 293],[944, 000, 999, 162];footwear:[133, 865, 238, 978].') 2023-11-03 02:21:14.611 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:21:14.612 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'woman:[000, 038, 123, 998],[050, 098, 161, 758],[120, 064, 361, 998],[394, 254, 466, 452],[418, 088, 767, 998],[681, 077, 866, 752],[860, 042, 999, 960];girl:[000, 050, 140, 987],[050, 107, 164, 627],[116, 081, 362, 998],[420, 092, 753, 884];clothing:[000, 331, 130, 998],[113, 178, 250, 547],[149, 334, 385, 968],[381, 267, 540, 344],[414, 264, 529, 585],[414, 394, 779, 998],[682, 311, 867, 602],[800, 162, 865, 375],[837, 202, 900, 404],[855, 217, 934, 477],[886, 264, 999, 782];human face:[000, 118, 077, 305],[063, 158, 110, 288],[180, 100, 216, 202],[201, 160, 295, 322],[547, 160, 663, 401],[746, 152, 830, 315],[939, 124, 999, 315].') 2023-11-03 02:21:19.874 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:21:19.875 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Person:[002, 462, 261, 999];Satchel:[025, 601, 212, 999];Camera:[289, 099, 516, 508],[766, 808, 880, 1000],[615, 757, 753, 977],[477, 740, 600, 930],[435, 511, 586, 712],[586, 480, 663, 588],[660, 471, 815, 613],[363, 705, 507, 862];Laptop:[226, 726, 338, 880],[457, 003, 637, 122],[738, 003, 960, 097];Keyboard:[504, 313, 649, 461];shelf:[520, 541, 999, 1000],[002, 161, 374, 513].') 2023-11-03 02:21:26.918 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:21:26.919 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'pacific northwest waterfall [433, 179, 555, 871] in the forest - national forest stock pictures, royalty-free photos & images') 2023-11-03 02:21:43.436 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:21:43.437 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Cell Phone:[383, 027, 509, 158];Mirror:[001, 001, 1000, 999];Person:[107, 000, 690, 943],[515, 032, 710, 347];Necklace:[254, 192, 384, 266],[541, 060, 645, 106];Satchel:[510, 312, 723, 505];Other Shoes:[538, 832, 691, 943],[391, 820, 492, 928].') 2023-11-03 02:22:07.392 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 02:22:07.393 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Trolley:[354, 587, 471, 767],[550, 619, 755, 748];Lamp:[113, 000, 221, 169],[230, 133, 304, 237];Cabinet:[189, 422, 324, 641],[001, 461, 193, 634];Person:[148, 539, 370, 1000],[358, 522, 478, 713],[464, 500, 552, 697],[542, 523, 579, 677],[606, 582, 692, 618],[301, 413, 337, 457],[585, 542, 632, 577];Cell Phone:[136, 659, 196, 690];Sneakers:[164, 944, 265, 991],[263, 956, 315, 1000],[487, 667, 519, 691].') 2023-11-03 02:22:08.470 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:22:08.471 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[514, 165, 553, 241] and track its trajectory.All trajectories in your reply should conform to the Frame t:[xmin,ymin,xmax,ymax] pattern.', 'This is a faceFrame1:[514, 165, 553, 241];Frame2:[490, 170, 527, 252];Frame3:[546, 166, 584, 249];Frame4:[507, 173, 545, 254];Frame5:[515, 173, 556, 258].') 2023-11-03 02:22:12.770 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:22:12.770 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect kitchen utensil,human head,human hand,human face and human arm in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'human head:[000, 607, 033, 723],[245, 536, 291, 630],[271, 534, 353, 645],[440, 472, 501, 580],[512, 398, 585, 508],[517, 684, 675, 938],[560, 495, 618, 599],[597, 338, 674, 457],[718, 304, 777, 398],[739, 413, 827, 549],[863, 300, 953, 442],[868, 256, 942, 321];human hand:[125, 576, 188, 661],[138, 877, 181, 945];human face:[000, 622, 028, 705],[868, 322, 921, 441];human arm:[014, 786, 197, 965],[014, 575, 201, 751],[236, 732, 295, 910],[403, 672, 495, 852],[475, 599, 530, 757],[518, 521, 564, 670],[623, 473, 684, 724],[803, 388, 850, 563],[830, 473, 881, 556],[843, 465, 971, 703].') 2023-11-03 02:22:37.096 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:22:37.096 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[023, 389, 130, 940] [407, 437, 552, 976] [801, 383, 990, 967] [693, 443, 818, 914] [185, 445, 334, 970] Medical personnel prep and conducted nasal swab tests at a drive-through COVID-19 testing site at Dallas College Eastfield Campus in Mesquite on Aug. 3, 2020.') 2023-11-03 02:22:37.998 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:22:37.999 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'man:[466, 347, 702, 960],[201, 863, 226, 950],[308, 780, 366, 981],[343, 709, 437, 984],[434, 892, 458, 967],[671, 908, 707, 999],[710, 909, 753, 988],[755, 908, 797, 999];woman:[762, 905, 798, 996];clothing:[202, 878, 228, 949],[315, 817, 368, 960],[340, 758, 426, 978],[434, 897, 455, 964],[497, 445, 671, 922],[757, 930, 796, 999].') 2023-11-03 02:22:59.673 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:22:59.674 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Sneakers:[355, 614, 395, 658],[619, 665, 686, 721];Gloves:[558, 342, 591, 384],[674, 510, 703, 553],[483, 255, 502, 284],[731, 330, 757, 367],[848, 461, 874, 497],[908, 230, 934, 268];Glasses:[609, 260, 652, 303],[796, 272, 832, 315],[953, 200, 979, 225];Helmet:[580, 207, 668, 343],[523, 187, 561, 260],[773, 228, 847, 346],[947, 168, 983, 234];Other Shoes:[914, 411, 930, 446];Person:[354, 206, 688, 721],[665, 227, 873, 514],[836, 165, 1000, 399],[403, 185, 567, 429],[973, 106, 1000, 169],[940, 095, 973, 163],[919, 089, 944, 154],[885, 099, 915, 170],[865, 128, 890, 175],[829, 133, 864, 181];Motorcycle:[271, 355, 726, 736],[659, 341, 884, 621],[843, 250, 987, 449],[418, 272, 534, 462].') 2023-11-03 02:23:30.329 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:23:30.330 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'man:[772, 700, 981, 996],[000, 174, 172, 560],[036, 000, 153, 128],[051, 116, 160, 243],[113, 191, 296, 568],[128, 729, 344, 999],[173, 151, 322, 272],[213, 000, 296, 112],[301, 128, 389, 257],[312, 211, 466, 632],[322, 000, 410, 128],[368, 698, 567, 999],[420, 241, 580, 676],[450, 068, 609, 287],[538, 253, 670, 608],[563, 356, 776, 989],[577, 112, 720, 262],[648, 172, 881, 599],[717, 097, 817, 270],[794, 172, 999, 562];plant:[000, 000, 999, 999];football helmet:[350, 549, 415, 630].') 2023-11-03 02:23:31.797 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:23:31.798 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'person:[877, 573, 989, 840],[000, 510, 023, 586],[013, 518, 035, 586],[794, 609, 869, 883];human face:[921, 602, 959, 669].') 2023-11-03 02:23:56.532 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:23:56.532 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'man:[016, 300, 302, 975],[005, 155, 069, 525],[104, 142, 245, 292],[206, 355, 765, 996],[325, 182, 460, 375],[520, 347, 999, 999],[565, 184, 677, 334],[596, 313, 987, 999],[671, 168, 745, 332],[700, 277, 998, 899];woman:[069, 101, 150, 275],[102, 278, 249, 624],[206, 285, 300, 583],[217, 140, 305, 295],[298, 280, 399, 535],[311, 143, 383, 287],[345, 239, 438, 415],[366, 340, 826, 999],[416, 184, 499, 364],[459, 182, 538, 351],[480, 145, 559, 319],[837, 295, 999, 629],[884, 219, 956, 355],[943, 287, 999, 563];clothing:[000, 705, 215, 999],[003, 115, 731, 640],[210, 522, 755, 999],[360, 537, 810, 999],[494, 489, 999, 999],[583, 451, 985, 923],[681, 382, 999, 815],[780, 455, 972, 655],[823, 322, 856, 403],[830, 393, 999, 608],[848, 298, 950, 346],[930, 398, 999, 545];human face:[000, 162, 020, 269],[028, 201, 091, 279],[100, 140, 143, 204],[107, 351, 208, 555],[197, 190, 230, 270],[248, 154, 272, 224],[278, 353, 386, 517],[380, 257, 420, 362],[449, 388, 526, 506],[526, 359, 603, 500],[605, 207, 658, 285],[641, 329, 705, 448],[678, 248, 714, 322],[762, 307, 826, 436],[810, 233, 853, 312],[818, 362, 841, 432],[880, 314, 936, 400].') 2023-11-03 02:24:02.470 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:24:02.470 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'subdivision & development - the pines [708, 005, 840, 847] [843, 000, 997, 854] [123, 073, 227, 821] [456, 005, 639, 709] [284, 148, 406, 740] [023, 256, 132, 841]') 2023-11-03 02:24:27.831 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:24:27.831 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '3 young whitetip reef sharks [205, 466, 543, 709] [445, 452, 742, 680] hiding under a rocky ledge, coiba national park panama. - shark stock videos & royalty-free footage') 2023-11-03 02:25:00.370 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:25:00.370 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Traffic Sign in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Traffic Sign:[391, 041, 948, 593].') 2023-11-03 02:25:54.850 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:25:54.851 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, can you tell me what thisframe:1:[168, 394, 276, 680];frame:2:[059, 406, 202, 679];frame:3:[004, 435, 166, 687] is?Ensure your answer is to the point, taking into account the provided image frames and trajectory coordinates.', 'This is a/an move forward away from us, then turn around move from right-side to left-side') 2023-11-03 02:25:56.580 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:25:56.580 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Drawings by Guamán Poma de Ayala showing the planting of potatoes and other tubers [319, 678, 365, 749]') 2023-11-03 02:26:21.955 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:26:21.955 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[480, 761, 540, 896].') 2023-11-03 02:26:29.063 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:26:29.064 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[202, 010, 644, 999] and track its trajectory.Use the specified Frame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'This is a turtleFrame1:[202, 010, 644, 999];Frame2:[192, 104, 668, 994];Frame3:[205, 163, 691, 995];Frame4:[225, 152, 692, 994];Frame5:[194, 132, 683, 994].') 2023-11-03 02:26:39.700 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:26:39.701 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'North Carolina tenant farmer [472, 288, 606, 626], photographed by the WPA in 1936.') 2023-11-03 02:26:54.889 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:26:54.889 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Egyptian protesters [334, 099, 691, 995] throw rocks [669, 019, 708, 077] at security forces [721, 699, 977, 997] [022, 541, 188, 996] during the clearing of one of the two sit-ins of ousted president Morsi supporters, near Rabaa Adawiya mosque.') 2023-11-03 02:27:18.642 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:27:18.642 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'building:[000, 271, 999, 820].') 2023-11-03 02:27:32.812 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:27:32.812 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect suit and man in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'suit:[561, 261, 783, 984],[368, 385, 585, 999],[727, 318, 999, 999];man:[300, 498, 372, 803],[367, 192, 584, 999],[488, 210, 798, 999].') 2023-11-03 02:28:09.667 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:28:09.667 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Scallop,Desk and Plate in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Scallop:[476, 446, 717, 582];Desk:[001, 000, 999, 999];Plate:[534, 022, 856, 121],[001, 423, 999, 999].') 2023-11-03 02:28:27.147 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:28:27.148 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'pillow:[188, 376, 752, 787];furniture:[000, 218, 998, 999].') 2023-11-03 02:28:58.553 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:28:58.554 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Other News – Learners [488, 250, 871, 996] [000, 209, 526, 997] give the gift of life at local blood drive') 2023-11-03 02:28:59.931 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:28:59.932 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Stool:[723, 688, 804, 839];Desk:[751, 378, 900, 487];Bakset:[002, 841, 165, 1000];Person:[059, 364, 335, 1000],[211, 373, 455, 988],[430, 282, 582, 846],[533, 302, 690, 778],[639, 133, 781, 850].') 2023-11-03 02:29:14.135 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:29:14.136 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Chair:[381, 619, 560, 997],[630, 516, 815, 997],[807, 467, 872, 762];Desk:[467, 608, 769, 1000];shelf:[320, 023, 443, 373],[001, 531, 142, 998],[129, 523, 338, 849],[066, 015, 312, 380],[001, 002, 064, 432],[441, 035, 580, 224],[573, 064, 666, 364],[660, 102, 719, 365],[722, 097, 813, 686],[614, 447, 751, 639],[358, 484, 472, 838];Faucet:[188, 423, 224, 486];Gas stove:[445, 465, 610, 523];Tea pot:[452, 410, 493, 494],[079, 467, 114, 520];Pot:[522, 425, 606, 502];Oven:[470, 545, 620, 728];Microwave:[631, 375, 690, 451];Refrigerator:[863, 227, 1000, 891];Coffee Machine:[005, 427, 080, 601];Cup:[056, 565, 091, 633];Extractor:[440, 217, 596, 290];Bakset:[906, 477, 971, 555],[395, 441, 440, 492].') 2023-11-03 02:29:25.824 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:29:25.825 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'waitress talking on the phone [440, 375, 571, 543] at a restaurant - order stock pictures, royalty-free photos [599, 625, 800, 723] & images') 2023-11-03 02:30:19.530 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:30:19.531 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Chair and Laptop in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Chair:[116, 693, 206, 893],[358, 946, 601, 999],[708, 697, 801, 847],[569, 465, 816, 695],[270, 450, 472, 657],[386, 588, 532, 659];Laptop:[611, 839, 818, 943].') 2023-11-03 02:30:28.671 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:30:28.671 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[066, 170, 287, 363];Chair:[001, 607, 160, 983],[123, 606, 362, 986],[351, 608, 582, 988],[580, 609, 822, 989].') 2023-11-03 02:30:55.371 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:30:55.372 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Bench in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Bench:[288, 642, 517, 683].') 2023-11-03 02:30:57.232 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:30:57.232 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Pickup Truck:[382, 210, 926, 454];Van:[384, 160, 859, 345];Truck:[114, 190, 279, 253];SUV:[941, 262, 1000, 412];Bakset:[049, 489, 107, 579];Barrel:[555, 681, 760, 868],[518, 548, 635, 626],[420, 531, 532, 608];Carpet:[689, 807, 1000, 1000];Picture:[697, 445, 808, 600],[659, 432, 769, 583],[921, 425, 1000, 605],[392, 390, 449, 489],[259, 444, 364, 605],[226, 435, 312, 583],[116, 313, 177, 403];Mirror:[547, 327, 664, 490];Cabinet:[746, 368, 907, 581],[231, 273, 343, 448],[104, 227, 198, 377];Desk:[340, 316, 391, 420],[010, 307, 125, 418],[000, 322, 073, 425];Storage box:[802, 267, 858, 310],[701, 188, 738, 222];Flower:[056, 495, 109, 540];Vase:[046, 524, 102, 579];Bench:[708, 507, 1000, 782];Person:[221, 210, 251, 274],[190, 208, 229, 339],[051, 195, 128, 431];Umbrella:[608, 112, 817, 166],[363, 114, 541, 203];Tent:[016, 104, 280, 294],[000, 085, 068, 194].') 2023-11-03 02:31:50.006 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:31:50.007 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[389, 350, 454, 445],truckFrame1:[644, 138, 999, 561],carFrame1:[592, 361, 626, 432],busFrame1:[305, 330, 357, 388],carFrame1:[085, 290, 351, 609] in this video clip.Your reply should be in alignment with the classFrame t:[xmin,ymin,xmax,ymax] structure.', 'carFrame1:[389, 350, 454, 445];Frame2:[398, 348, 460, 439],truckFrame1:[644, 138, 999, 561];Frame2:[725, 023, 999, 708],carFrame1:[592, 361, 626, 432];Frame2:[605, 353, 673, 444],busFrame1[305, 330, 357, 388];Frame2:No object,carFrame1:[085, 290, 351, 609];Frame2:[179, 303, 379, 566].') 2023-11-03 02:31:53.241 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:31:53.241 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the 6 railings placed on the track for hurdles in this video clip?Always default to the TrackiFrame t:[xmin,ymin,xmax,ymax] pattern when discussing trajectories in your answer.', 'There are 8 6 railings placed on the track for hurdles.Track1frame:1:[612, 579, 666, 677];frame:2:[612, 572, 663, 670];frame:3:[613, 570, 666, 666],Track2frame:1:[482, 562, 516, 625];frame:2:[480, 558, 514, 616];frame:3:[481, 550, 517, 612],Track3frame:1:[415, 575, 454, 631];frame:2:[415, 566, 451, 622];frame:3:[416, 560, 455, 620],Track4frame:1:[373, 564, 402, 604];frame:2:[373, 560, 401, 595];frame:3:[373, 550, 402, 593],Track5frame:1:[503, 597, 564, 685];frame:2:[503, 591, 562, 679];frame:3:[504, 585, 565, 675],Track6frame:1:[419, 556, 447, 602];frame:2:[419, 550, 444, 593];frame:3:[419, 541, 446, 589],Track7frame:1:[384, 550, 406, 587];frame:2:[382, 545, 405, 579];frame:3:[384, 539, 407, 577],Track8frame:1:[360, 547, 380, 579];frame:2:[360, 543, 378, 570];frame:3:[360, 535, 381, 566].') 2023-11-03 02:31:57.515 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:31:57.516 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'An extraordinary collection of [004, 017, 993, 790] 133 bottles of wine from the 19th century has been discovered at a Czech castle.') 2023-11-03 02:32:51.612 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:32:51.613 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[187, 408, 293, 550],carFrame1:[559, 423, 640, 497] in this video clip.Your response should conform strictly to the categoryFrame t:[xmin,ymin,xmax,ymax] format.', 'carFrame1:[187, 408, 293, 550];Frame2:[138, 414, 249, 568],carFrame1:[559, 423, 640, 497];Frame2:[709, 433, 807, 532].') 2023-11-03 02:32:57.955 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:32:57.956 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Sandals:[518, 901, 556, 925],[238, 953, 293, 1000];Bench:[599, 800, 693, 913],[433, 825, 684, 951];Potted Plant:[042, 319, 093, 367],[002, 297, 058, 369],[230, 492, 328, 590],[076, 312, 136, 369],[117, 273, 165, 372],[062, 300, 162, 372],[197, 288, 273, 376],[292, 224, 551, 454];Chair:[758, 919, 861, 1000];shelf:[069, 497, 386, 694];Vase:[002, 336, 044, 368],[206, 351, 250, 376],[241, 335, 314, 377],[095, 341, 127, 368];Flower:[139, 207, 228, 376],[212, 154, 337, 335];Blackboard:[519, 462, 659, 606];Person:[145, 668, 293, 1000],[270, 676, 450, 1000],[188, 644, 252, 744],[462, 649, 632, 926],[665, 561, 765, 795],[774, 568, 843, 835],[118, 621, 170, 650],[229, 635, 300, 714],[275, 647, 367, 734],[281, 616, 330, 682];Backpack:[588, 705, 661, 807].') 2023-11-03 02:33:11.877 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:33:11.877 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[779, 062, 974, 998],[605, 080, 805, 1000].') 2023-11-03 02:33:12.563 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:33:12.563 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Bench in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Bench:[471, 715, 794, 844].') 2023-11-03 02:34:10.497 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:34:10.497 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the the car on the move..When you incorporate trajectories into your answer, adhere to the TrackiFrame t:[xmin,ymin,xmax,ymax] format.,Present the trajectories in your response using the designated TrackiFrame t:[xmin,ymin,xmax,ymax] layout.', 'There is one the car on the move..Track1frame:1:[303, 498, 331, 601];frame:2:[308, 497, 325, 593];frame:3:[310, 497, 324, 586].') 2023-11-03 02:34:18.946 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:34:18.946 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'jacket:[020, 291, 210, 707],[341, 220, 403, 325],[393, 244, 507, 402],[696, 547, 921, 955];man:[017, 217, 241, 954],[191, 162, 237, 236],[334, 204, 370, 293],[352, 191, 410, 465],[391, 175, 501, 517],[640, 183, 680, 253],[656, 534, 921, 999],[663, 186, 686, 415],[753, 141, 860, 535];woman:[213, 220, 721, 999],[660, 191, 734, 425],[776, 193, 886, 653],[822, 250, 999, 999],[854, 167, 915, 362],[916, 190, 953, 269];footwear:[958, 844, 983, 931].') 2023-11-03 02:34:35.169 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 02:34:35.169 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'fedora:[003, 000, 999, 483];sun hat:[000, 000, 999, 522];girl:[000, 000, 999, 998];human face:[290, 263, 787, 603].') 2023-11-03 02:35:38.595 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:35:38.595 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Microwave,Storage box,Chair and Clock in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Storage box:[180, 342, 243, 399];Chair:[310, 675, 565, 999],[002, 836, 208, 999],[602, 584, 820, 706],[710, 836, 999, 979],[880, 513, 977, 648],[343, 517, 418, 691];Clock:[185, 134, 218, 184].') 2023-11-03 02:36:15.877 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:36:15.877 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person and Microphone in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Person:[002, 322, 266, 875],[594, 277, 1000, 859];Microphone:[736, 572, 778, 788],[015, 560, 067, 852].') 2023-11-03 02:36:37.909 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:36:37.910 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Yellow house [001, 335, 724, 832] on the island of Suomenlinna seen on a travel date in Finland, my third new country of 2018') 2023-11-03 02:36:59.772 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:36:59.773 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Flower and Desk in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Flower:[385, 355, 450, 498],[550, 311, 669, 586],[591, 798, 641, 965],[542, 603, 576, 668];Desk:[905, 656, 999, 734].') 2023-11-03 02:37:02.072 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:37:02.072 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'i get my awesomeness from my aunt onesie [027, 051, 972, 958]') 2023-11-03 02:38:33.524 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:38:33.524 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'tree:[000, 344, 096, 488],[098, 340, 184, 446],[164, 352, 262, 526],[242, 336, 354, 510],[334, 442, 403, 540],[361, 413, 435, 497],[419, 421, 459, 484],[434, 428, 496, 494],[503, 429, 583, 498],[636, 395, 767, 447],[682, 219, 909, 560],[854, 373, 918, 489],[899, 352, 958, 483].') 2023-11-03 02:38:34.122 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 02:38:34.123 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Fan in this image.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Fan:[453, 120, 585, 188].') 2023-11-03 02:38:35.434 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:38:35.435 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'window:[756, 784, 912, 913],[000, 727, 069, 797],[000, 928, 084, 999],[023, 630, 118, 689],[047, 793, 160, 877],[069, 556, 156, 609],[101, 679, 212, 763],[115, 484, 195, 539],[143, 425, 233, 484],[147, 589, 244, 663],[179, 375, 254, 421],[189, 887, 310, 999],[191, 509, 275, 579],[194, 746, 345, 850],[205, 329, 276, 366],[222, 439, 309, 500],[227, 288, 294, 325],[240, 630, 380, 723],[257, 379, 332, 435],[283, 331, 349, 382],[283, 535, 404, 621],[317, 451, 432, 530],[327, 239, 391, 285],[339, 380, 450, 458],[368, 322, 465, 390],[391, 269, 484, 334],[409, 225, 499, 281],[422, 182, 506, 243],[435, 150, 516, 202],[436, 682, 607, 805],[456, 556, 622, 667],[489, 455, 628, 558],[507, 373, 638, 469],[523, 841, 600, 916],[523, 308, 636, 388],[530, 252, 638, 327],[545, 197, 649, 269],[553, 156, 653, 218],[561, 115, 651, 176],[574, 088, 649, 141],[587, 052, 653, 103],[719, 087, 809, 140],[720, 000, 815, 038],[722, 126, 817, 193],[727, 053, 799, 104],[729, 015, 794, 066],[735, 178, 827, 241],[735, 234, 837, 306],[740, 295, 853, 379],[740, 383, 865, 473],[742, 606, 909, 730],[745, 480, 892, 588],[858, 000, 938, 041],[871, 015, 958, 081],[886, 054, 984, 122],[896, 096, 999, 175],[911, 155, 999, 235],[935, 235, 999, 311],[960, 325, 999, 402];building:[000, 000, 999, 999].') 2023-11-03 02:38:39.542 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:38:39.543 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'building:[000, 060, 411, 998],[459, 573, 999, 998].') 2023-11-03 02:39:12.452 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:39:12.453 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Gold ring Women's 999 foot gold ring [163, 089, 866, 890] to give girlfriend a star and moon ring, gold bracelet, stars [155, 183, 369, 420], 38th day gift") 2023-11-03 02:39:27.590 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:39:27.591 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'unknow artist A Port scene with figures loading [448, 181, 993, 965] a boat') 2023-11-03 02:39:55.845 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:39:55.845 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Group of bundled up pro-life supporters [806, 216, 997, 994] [459, 288, 669, 998] [313, 297, 499, 996] [002, 274, 177, 997] [163, 160, 414, 995] [601, 194, 827, 997] pose with signs [612, 388, 830, 555] [209, 373, 381, 548] [452, 397, 667, 759] [001, 411, 178, 596] [825, 401, 997, 733] [297, 421, 518, 600] on the National Mall, with U.S. Capitol in the background, Jan. 22 the anniversary of Roe v...') 2023-11-03 02:39:57.896 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:39:57.896 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'tree:[000, 000, 252, 549],[001, 314, 093, 591],[083, 310, 265, 415],[246, 315, 378, 439],[314, 296, 409, 465],[382, 306, 456, 346],[433, 325, 521, 458],[460, 314, 543, 348],[530, 228, 581, 341],[564, 319, 631, 340],[569, 253, 616, 330],[638, 269, 748, 327];building:[380, 172, 539, 421],[490, 000, 999, 584].') 2023-11-03 02:40:00.098 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:40:00.098 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Glasses:[198, 458, 342, 509],[438, 250, 542, 282];Hat:[154, 340, 365, 486],[403, 201, 533, 269],[635, 093, 726, 138];Helmet:[758, 054, 828, 111];Other Shoes:[407, 764, 486, 821];Person:[002, 452, 636, 1000],[334, 202, 848, 824],[552, 094, 952, 484],[712, 054, 976, 338];Necklace:[465, 348, 541, 408];Paddle:[078, 952, 315, 999],[192, 512, 1000, 572],[002, 312, 793, 441];Boat:[044, 211, 882, 1000].') 2023-11-03 02:40:54.869 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:40:54.869 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'High Heels:[184, 800, 476, 896],[243, 868, 372, 994];Watch:[506, 459, 567, 506];Person:[187, 072, 825, 1000];Cup:[003, 496, 070, 571].') 2023-11-03 02:42:27.663 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:42:27.663 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'person:[762, 392, 916, 842],[000, 460, 020, 730],[061, 477, 308, 601],[349, 517, 371, 596],[363, 520, 401, 603],[483, 508, 518, 601],[544, 513, 567, 592],[568, 511, 609, 590],[614, 513, 650, 595],[655, 503, 681, 595],[686, 510, 725, 601],[739, 508, 773, 592],[773, 501, 809, 598],[830, 496, 862, 590],[861, 508, 895, 601],[924, 495, 968, 603],[955, 484, 999, 600].') 2023-11-03 02:42:31.288 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:42:31.288 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the person sitting in sailboat.When detailing trajectories in your response, adhere to the TrackiFrame t:[xmin,ymin,xmax,ymax] format.', 'There are 5 person sitting in sailboat.Track1frame:1:[059, 599, 077, 632];frame:2:[059, 599, 077, 632];frame:3:[059, 598, 076, 634],Track2frame:1:[089, 602, 108, 636];frame:2:[089, 602, 108, 636];frame:3:[092, 599, 116, 637],Track3frame:1:[461, 540, 500, 606];frame:2:[461, 540, 500, 606];frame:3:[476, 537, 516, 600],Track4frame:1:[613, 518, 634, 558];frame:2:[613, 518, 634, 558];frame:3:[626, 512, 645, 553],Track5frame:1:[818, 474, 856, 541];frame:2:[818, 474, 856, 541];frame:3:[802, 476, 839, 538].') 2023-11-03 02:43:00.933 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:43:00.934 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'person:[004, 341, 088, 574],[052, 117, 142, 426],[073, 231, 315, 885],[107, 264, 216, 457],[160, 090, 226, 398],[219, 098, 290, 416],[273, 094, 363, 383],[301, 293, 374, 499],[342, 126, 412, 412],[375, 260, 475, 503],[421, 092, 478, 402],[471, 113, 544, 412],[526, 113, 593, 398],[598, 117, 667, 409],[670, 115, 729, 400],[723, 327, 791, 512],[725, 123, 785, 402],[756, 113, 803, 389],[796, 152, 849, 393],[831, 321, 933, 486],[834, 132, 896, 380],[891, 134, 940, 385],[927, 260, 999, 437],[958, 121, 999, 398].') 2023-11-03 02:43:20.051 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:43:20.051 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[435, 251, 651, 444] and track its trajectory.Any trajectory details should be arranged according to the Frame t:[xmin,ymin,xmax,ymax] format for clarity.', 'This is a cockroachFrame1:[435, 251, 651, 444];Frame2:[460, 304, 607, 500];Frame3:[445, 327, 625, 513];Frame4:[389, 402, 596, 612];Frame5:[400, 380, 546, 595].') 2023-11-03 02:43:30.088 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:43:30.089 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[451, 017, 584, 289] 10 Tips to make a successful virtual event') 2023-11-03 02:44:19.031 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:44:19.032 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect stairs,building,bookcase and book in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'stairs:[036, 502, 275, 999];building:[000, 002, 999, 999];bookcase:[800, 683, 983, 977],[226, 698, 743, 999],[246, 479, 260, 580],[256, 239, 269, 370],[385, 497, 399, 628],[391, 266, 401, 397],[433, 510, 498, 639],[436, 300, 495, 416],[509, 510, 514, 642],[531, 329, 604, 432],[533, 524, 607, 650],[620, 348, 699, 460],[625, 537, 703, 655],[710, 370, 776, 475],[716, 550, 786, 656],[791, 390, 851, 482],[800, 562, 863, 660];book:[326, 732, 385, 940],[418, 732, 506, 999],[439, 527, 498, 635],[446, 333, 493, 412],[542, 530, 596, 643],[547, 355, 596, 432],[547, 708, 732, 995],[629, 551, 697, 648],[715, 555, 784, 651],[798, 563, 861, 658],[829, 724, 999, 999].') 2023-11-03 02:44:25.411 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:44:25.411 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'clothing:[673, 508, 945, 950],[013, 545, 243, 844],[026, 135, 096, 374],[118, 114, 198, 340],[184, 030, 234, 254],[212, 591, 456, 868],[287, 041, 357, 285],[373, 043, 435, 224],[451, 078, 556, 195],[538, 019, 603, 170],[565, 527, 716, 792],[600, 050, 661, 169],[740, 043, 811, 222],[862, 495, 999, 832];woman:[053, 796, 188, 999],[154, 061, 205, 345],[190, 399, 263, 580],[211, 466, 455, 950],[233, 405, 340, 710],[501, 743, 802, 999],[571, 273, 618, 351],[571, 301, 641, 395],[805, 276, 850, 415],[828, 292, 980, 536],[840, 381, 999, 839],[857, 223, 882, 298];man:[000, 661, 061, 997],[000, 378, 116, 675],[020, 419, 237, 855],[059, 050, 122, 323],[059, 415, 240, 650],[098, 308, 388, 525],[106, 000, 173, 278],[116, 079, 197, 349],[158, 801, 357, 999],[176, 000, 240, 276],[256, 000, 333, 260],[288, 010, 361, 292],[353, 006, 450, 294],[366, 405, 718, 944],[371, 216, 448, 340],[383, 292, 491, 449],[386, 350, 511, 518],[391, 316, 415, 385],[391, 000, 463, 255],[458, 045, 556, 229],[482, 265, 560, 376],[490, 299, 550, 466],[525, 342, 600, 481],[526, 000, 610, 195],[545, 268, 579, 345],[548, 140, 611, 292],[599, 000, 668, 189],[604, 010, 662, 168],[608, 149, 701, 292],[611, 237, 775, 609],[715, 236, 746, 314],[733, 231, 770, 336],[733, 209, 810, 402],[742, 006, 811, 218],[798, 190, 832, 264],[816, 000, 885, 130],[860, 241, 968, 407],[873, 182, 943, 316],[902, 170, 950, 247],[918, 000, 998, 158],[923, 698, 999, 999],[931, 158, 968, 225],[941, 229, 999, 505],[968, 048, 999, 208],[969, 361, 999, 577];girl:[370, 467, 598, 940];human face:[339, 487, 403, 609],[553, 888, 603, 999],[569, 428, 643, 575],[832, 320, 871, 406],[956, 297, 983, 375].') 2023-11-03 02:44:26.177 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:44:26.177 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[589, 302, 699, 533] and track its trajectory.Ensure the trajectories in your answer follow the Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a airplane colored in white and green flying in the airFrame1:[589, 302, 699, 533];Frame2:[623, 351, 699, 552];Frame3:[585, 366, 652, 573];Frame4:[532, 384, 610, 572];Frame5:[504, 398, 608, 643].') 2023-11-03 02:44:32.563 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 02:44:32.563 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'woman:[006, 404, 191, 693],[179, 622, 580, 999];man:[366, 319, 686, 800],[570, 489, 961, 999],[641, 410, 723, 630];human face:[050, 419, 115, 525],[434, 325, 507, 458],[439, 656, 533, 803],[665, 424, 723, 534],[688, 527, 795, 730];glasses:[684, 590, 806, 635].') 2023-11-03 02:44:51.132 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:44:51.132 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[001, 001, 104, 462],[439, 189, 1000, 730],[001, 001, 104, 462],[439, 189, 1000, 730],[480, 001, 766, 447],[700, 001, 808, 207],[782, 063, 999, 244],[914, 027, 999, 185],[928, 092, 999, 165];Leather Shoes:[468, 508, 567, 572],[437, 635, 537, 736];Chair:[912, 734, 1000, 791],[816, 626, 995, 731],[429, 694, 602, 768];Scissors:[320, 186, 462, 279],[436, 161, 547, 212].') 2023-11-03 02:45:19.004 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:45:19.004 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[373, 331, 651, 500] and track its trajectory.Your response should highlight trajectories using the established Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a dumpcartFrame1:[373, 331, 651, 500];Frame2:[376, 368, 612, 508];Frame3:[412, 402, 668, 545];Frame4:[412, 410, 765, 583].') 2023-11-03 02:45:56.266 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:45:56.266 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "[059, 108, 639, 937] New York Rangers defenseman K'Andre Miller scores on a shot from the point to give New York a 1-0 lead during a game between the Boston Bruins and...") 2023-11-03 02:45:59.015 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:45:59.016 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'When you look inside [019, 016, 981, 961] a PlayStation, you can see the processor and memory chips.') 2023-11-03 02:46:45.019 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:46:45.019 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'person:[406, 438, 496, 736],[000, 755, 103, 947],[060, 517, 126, 734],[136, 874, 202, 958],[136, 472, 230, 724],[175, 961, 250, 999],[176, 915, 245, 966],[214, 858, 283, 940],[219, 751, 289, 803],[270, 942, 338, 992],[299, 790, 378, 953],[334, 858, 428, 999],[416, 785, 489, 874],[485, 822, 571, 911],[610, 813, 702, 999],[745, 467, 843, 745],[745, 787, 880, 999],[893, 813, 951, 989],[901, 801, 999, 999].') 2023-11-03 02:46:46.718 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:46:46.719 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Before & after receiving the customized wheelchair [509, 505, 906, 907] [046, 571, 411, 940]') 2023-11-03 02:46:56.718 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:46:56.718 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Bottle in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Bottle:[305, 907, 367, 1000].') 2023-11-03 02:47:14.017 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:47:14.017 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Villa Lau 6 pax,\xa0Wonderful and comfortable villa in Denia, on the Costa Blanca, Spain with [382, 560, 996, 814] private pool for 6 persons...') 2023-11-03 02:47:20.240 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:47:20.241 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Tent:[926, 142, 999, 273];Person:[947, 177, 978, 277],[931, 175, 955, 263],[000, 220, 064, 549],[053, 248, 114, 503],[053, 211, 131, 482],[280, 100, 416, 275];Handbag:[005, 335, 047, 433];Chair:[506, 197, 605, 318];SUV:[050, 036, 966, 985],[026, 237, 074, 329];Motorcycle:[947, 229, 1000, 481].') 2023-11-03 02:47:29.211 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:47:29.211 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Looking for solutions to school lunches for picky eaters. Read on to find out you'll be surprised by the answer [074, 192, 950, 497]. Hint...I think you know already.") 2023-11-03 02:47:49.371 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:47:49.371 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect clothing,food,woman and tableware in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'clothing:[000, 467, 037, 733],[000, 363, 073, 576],[007, 204, 085, 363],[014, 221, 074, 307],[045, 359, 140, 631],[055, 363, 109, 497],[068, 339, 154, 507],[097, 316, 169, 397],[105, 352, 150, 468],[108, 308, 188, 466],[155, 501, 345, 910],[156, 315, 214, 440],[171, 285, 239, 389],[180, 712, 462, 998],[189, 223, 249, 351],[201, 216, 233, 321],[228, 419, 333, 614],[232, 401, 285, 501],[315, 386, 359, 451],[484, 298, 508, 369],[497, 313, 532, 395],[530, 365, 623, 455],[538, 346, 561, 424],[580, 364, 657, 486],[623, 404, 704, 569],[670, 425, 759, 648],[705, 472, 903, 811],[802, 717, 969, 963];food:[424, 559, 501, 597],[493, 682, 544, 748],[524, 480, 592, 510],[555, 891, 625, 998],[566, 523, 643, 566],[605, 602, 702, 658],[655, 717, 791, 804],[763, 865, 941, 998];woman:[000, 304, 149, 604],[050, 289, 141, 458],[102, 242, 186, 488],[170, 182, 244, 333],[495, 283, 612, 455],[533, 291, 574, 434],[608, 315, 702, 587];tableware:[375, 325, 819, 859],[534, 850, 603, 878],[543, 877, 675, 998],[726, 819, 782, 998],[790, 865, 947, 998].') 2023-11-03 02:47:52.907 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:47:52.908 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Browns quarterback Baker Mayfield [028, 151, 695, 997] catches a pass for a first down during the first half against the Titans [591, 007, 968, 771] on Sunday.') 2023-11-03 02:48:05.549 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:48:05.550 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[539, 636, 645, 835].') 2023-11-03 02:48:37.318 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:48:37.319 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Flower,Person and Laptop in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Flower:[319, 001, 638, 350];Person:[730, 671, 860, 966],[855, 501, 999, 842],[261, 725, 380, 996],[415, 347, 534, 744],[660, 369, 776, 671],[841, 005, 953, 312],[602, 273, 743, 602],[505, 261, 622, 581],[413, 164, 519, 433],[245, 332, 360, 659],[294, 219, 385, 456],[187, 207, 290, 526],[119, 271, 266, 628],[001, 488, 075, 639];Laptop:[027, 361, 081, 477].') 2023-11-03 02:48:43.801 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:48:43.801 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the the human who sits behind the girl and is partially covered by the girl?When detailing trajectories in your response, adhere to the TrackiFrame t:[xmin,ymin,xmax,ymax] format.', 'There is one the human who sits behind the girl and is partially covered by the girl.Track1frame:1:[268, 333, 893, 545];frame:2:[268, 333, 893, 545];frame:3:[268, 341, 895, 543].') 2023-11-03 02:49:18.187 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:49:18.187 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'clothing:[036, 270, 572, 993],[000, 457, 073, 586],[076, 376, 230, 490],[359, 353, 553, 833],[527, 391, 906, 999],[885, 432, 975, 544];man:[000, 229, 128, 583],[000, 103, 563, 999],[087, 235, 238, 488],[368, 144, 555, 822],[510, 391, 556, 539],[540, 180, 953, 999],[775, 320, 902, 533],[875, 334, 999, 546],[926, 383, 999, 609];human face:[000, 290, 088, 452],[158, 245, 229, 391],[283, 193, 431, 426],[408, 187, 458, 340],[503, 406, 542, 488],[629, 359, 655, 442],[648, 201, 752, 413],[773, 332, 830, 437],[946, 365, 985, 444],[953, 424, 999, 577].') 2023-11-03 02:49:19.892 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 02:49:19.892 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[446, 557, 678, 1000],[599, 493, 854, 999],[594, 511, 683, 645],[000, 469, 287, 1000],[233, 573, 405, 1000];Glasses:[882, 572, 944, 593],[658, 518, 753, 538].') 2023-11-03 02:49:25.679 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:49:25.679 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[299, 204, 564, 514] Ekol Inset 8 woodburning stove in a room') 2023-11-03 02:50:08.704 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:50:08.707 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Camera in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Camera:[211, 627, 309, 686].') 2023-11-03 02:50:58.600 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:50:58.601 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[005, 016, 860, 965] 10 easy, healthier popsicle recipes for kids that let you say yes to frozen treats all summer long.') 2023-11-03 02:51:01.355 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:51:01.355 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[439, 273, 721, 770] and track its trajectory.All trajectories in your reply should conform to the Frame t:[xmin,ymin,xmax,ymax] pattern.', 'This is a cutterFrame1:[439, 273, 721, 770];Frame2:[439, 265, 717, 760];Frame3:[436, 260, 716, 754];Frame4:[441, 241, 719, 727];Frame5:[433, 244, 713, 727].') 2023-11-03 02:51:02.183 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:51:02.183 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3,frame4 and frame5,please tell me the trajectory of the a person in a black tshirt and black pants.For the trajectories included in the answer, please use the format TrackiFrame t:[xmin,ymin,xmax,ymax].', 'There is one a person in a black tshirt and black pants.Track1frame1:[519, 209, 1000, 835];frame2:[521, 213, 1000, 845].') 2023-11-03 02:51:34.315 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:51:34.315 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[478, 523, 532, 759],personFrame1:[654, 370, 723, 587],personFrame1:[720, 602, 776, 815],personFrame1:[670, 369, 732, 573],personFrame1:[251, 601, 303, 831],personFrame1:[299, 683, 367, 968],personFrame1:[580, 536, 625, 731] in this video clip.When answering, maintain the format classFrame t:[xmin,ymin,xmax,ymax] without deviation.', 'personFrame1:[478, 523, 532, 759];Frame2:[490, 515, 545, 758],personFrame1:[654, 370, 723, 587];Frame2:[667, 361, 740, 580],personFrame1:[720, 602, 776, 815];Frame2:[718, 620, 760, 819],personFrame1:[670, 369, 732, 573];Frame2:[684, 365, 750, 577],personFrame1:[251, 601, 303, 831];Frame2:[249, 597, 303, 833],personFrame1:[299, 683, 367, 968];Frame2:[275, 680, 341, 959],personFrame1:[580, 536, 625, 731];Frame2:[589, 480, 660, 704].') 2023-11-03 02:51:35.171 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:51:35.171 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'CNC Cutting Machine for Beams (SAW1260) pictures & photos') 2023-11-03 02:51:36.133 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:51:36.133 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'How to remove [650, 001, 997, 089] [300, 001, 665, 118] [001, 229, 135, 512] [334, 457, 738, 743] [449, 047, 980, 322] Muffins that are stuck in [003, 000, 996, 996] the Pan') 2023-11-03 02:52:17.717 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:52:17.717 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Sneakers:[339, 740, 444, 880],[270, 758, 369, 841],[242, 738, 331, 807];Person:[622, 440, 794, 897],[341, 312, 879, 887],[240, 302, 693, 851];Ballon:[393, 527, 475, 670].') 2023-11-03 02:52:41.700 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:52:41.700 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'bicycle:[232, 592, 466, 998],[055, 529, 238, 999],[564, 707, 755, 999],[782, 818, 981, 999];person:[486, 139, 760, 999],[002, 214, 212, 999],[158, 183, 461, 999],[688, 243, 976, 999],[885, 424, 999, 999];bicycle wheel:[085, 723, 123, 999],[145, 712, 225, 999],[255, 803, 291, 999],[331, 775, 440, 999],[680, 926, 743, 999];bicycle helmet:[060, 219, 160, 365],[245, 186, 348, 373],[551, 136, 687, 302],[809, 247, 964, 461];footwear:[064, 934, 115, 999],[223, 785, 281, 982].') 2023-11-03 02:52:44.380 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:52:44.380 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[266, 256, 963, 991] and track its trajectory.Any trajectory details should be arranged according to the Frame t:[xmin,ymin,xmax,ymax] format for clarity.', 'This is a common zebraFrame1:[266, 256, 963, 991];Frame2:[206, 252, 899, 997];Frame3:[170, 275, 786, 989];Frame4:[189, 256, 680, 872];Frame5:[251, 277, 608, 806].') 2023-11-03 02:52:51.482 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:52:51.482 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[307, 016, 670, 997] and track its trajectory.To maintain consistency, ensure that trajectories in your answer match the Frame t:[xmin,ymin,xmax,ymax] setup.', 'This is a slothFrame1:[307, 016, 670, 997];Frame2:[258, 058, 634, 781];Frame3:[188, 143, 556, 800];Frame4:[194, 168, 689, 995].') 2023-11-03 02:53:05.401 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:53:05.401 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Lamp:[480, 202, 544, 337],[373, 666, 598, 1000];Guitar:[437, 588, 566, 653];Drum:[613, 611, 690, 702];Piano:[649, 639, 836, 702];Person:[001, 555, 195, 1000],[344, 476, 400, 738],[444, 500, 537, 672],[752, 486, 838, 861],[778, 459, 843, 630],[699, 505, 758, 654];Head Phone:[012, 720, 092, 813];Speaker:[442, 663, 554, 788],[570, 778, 672, 913],[347, 713, 451, 818],[283, 697, 359, 794];Tripod:[672, 599, 761, 880].') 2023-11-03 02:53:06.283 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:53:06.283 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[167, 265, 514, 913] and track its trajectory.Your response should highlight trajectories using the established Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a boneshakerFrame1:[167, 265, 514, 913];Frame2:[151, 304, 527, 991];Frame3:[125, 254, 537, 965];Frame4:[203, 238, 622, 955];Frame5:[285, 226, 716, 958].') 2023-11-03 02:53:18.700 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 02:53:18.700 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Glasses:[297, 368, 727, 533];Person:[828, 282, 998, 453],[002, 119, 999, 999].') 2023-11-03 02:53:20.656 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:53:20.656 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[003, 272, 988, 998] A man with [002, 550, 264, 779] [715, 556, 995, 775] outstretched arms in front of [004, 003, 995, 998] a theatre with [010, 066, 992, 248] the words Atlantic Theatre visible.') 2023-11-03 02:53:34.772 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:53:34.773 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'tower:[016, 373, 176, 939],[091, 124, 136, 375],[093, 434, 375, 999],[158, 356, 227, 668],[341, 428, 443, 865],[445, 598, 671, 997],[468, 362, 540, 616],[719, 403, 835, 844],[795, 500, 891, 858],[878, 366, 939, 718],[958, 491, 995, 999];skyscraper:[000, 369, 185, 999],[035, 305, 133, 431],[088, 126, 141, 370],[158, 321, 226, 610],[475, 353, 548, 643],[713, 404, 828, 710],[875, 369, 941, 711],[927, 481, 999, 999].') 2023-11-03 02:53:36.594 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:53:36.595 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Hat:[059, 633, 101, 698],[088, 628, 147, 712],[023, 537, 058, 581],[214, 576, 250, 634],[274, 713, 386, 877],[436, 627, 514, 739],[554, 683, 630, 803],[524, 556, 570, 621],[550, 525, 573, 566],[690, 507, 738, 557],[707, 554, 752, 616],[804, 561, 873, 654],[766, 780, 889, 948],[959, 524, 999, 599],[910, 512, 970, 561],[421, 583, 493, 672],[758, 628, 840, 711],[841, 477, 869, 520];Glasses:[552, 724, 574, 764];Gloves:[120, 509, 144, 551];Street Lights:[080, 126, 132, 491],[080, 121, 141, 500],[209, 378, 232, 475],[308, 329, 343, 437];Van:[529, 432, 586, 475];Car:[559, 442, 628, 489],[664, 425, 741, 480],[737, 423, 774, 453];Truck:[717, 311, 999, 517];Person:[020, 927, 184, 1000],[262, 715, 400, 1000],[428, 628, 594, 1000],[546, 682, 663, 999],[621, 639, 725, 878],[573, 588, 648, 709],[661, 677, 802, 999],[761, 782, 890, 1000],[761, 633, 927, 916],[808, 564, 915, 836],[910, 523, 999, 948],[899, 513, 969, 913],[765, 561, 810, 637],[695, 553, 771, 746],[817, 521, 882, 611],[937, 479, 967, 515],[977, 455, 999, 505],[825, 477, 868, 526],[816, 476, 846, 528],[774, 476, 815, 547],[752, 488, 794, 561],[712, 454, 747, 506],[646, 507, 738, 674],[514, 558, 591, 736],[504, 548, 544, 634],[557, 530, 594, 637],[001, 695, 095, 1000],[043, 632, 186, 927],[038, 644, 091, 772],[029, 617, 073, 732],[001, 466, 323, 629],[292, 594, 336, 701],[324, 507, 390, 628],[389, 586, 493, 929],[366, 619, 430, 740],[351, 607, 399, 723],[394, 518, 435, 616],[467, 535, 489, 568],[431, 527, 459, 565],[373, 501, 401, 554],[652, 498, 687, 575];Flag:[849, 329, 913, 483].') 2023-11-03 02:53:54.579 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:53:54.579 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Haley Grace Lowman [369, 203, 533, 964] claps for the football team alongside the other cheerleaders [716, 330, 814, 782] [815, 225, 945, 941] [628, 308, 720, 818] [518, 239, 636, 875].') 2023-11-03 02:54:21.540 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:54:21.540 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Cymbal,Leather Shoes,Slippers and Sandals in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Leather Shoes:[277, 000, 319, 071],[316, 000, 356, 070],[354, 000, 390, 063],[389, 000, 427, 059];Slippers:[805, 058, 868, 131];Sandals:[424, 000, 464, 062],[464, 000, 503, 064],[504, 000, 539, 055],[540, 000, 578, 053],[578, 000, 616, 051],[616, 000, 657, 050],[655, 000, 699, 053],[697, 000, 732, 053],[733, 000, 766, 055],[766, 000, 804, 046],[804, 000, 871, 060],[051, 576, 113, 704],[100, 578, 166, 704],[165, 573, 232, 701],[218, 585, 286, 709],[260, 615, 332, 727],[307, 618, 382, 730],[379, 610, 439, 728],[429, 602, 497, 729],[487, 621, 543, 745],[281, 069, 320, 143],[318, 066, 354, 137],[349, 071, 383, 142],[381, 064, 416, 143],[414, 063, 447, 143],[448, 065, 482, 144],[480, 063, 518, 146],[516, 047, 554, 134],[557, 052, 594, 134],[598, 041, 631, 129],[633, 043, 668, 125],[670, 036, 708, 123],[701, 051, 737, 137],[732, 059, 772, 145],[774, 048, 810, 137].') 2023-11-03 02:54:56.679 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:54:56.679 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[419, 434, 497, 520] and track its trajectory.Ensure the trajectories in your answer follow the Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a drone flying over the roadFrame1:[419, 434, 497, 520];Frame2:[414, 416, 487, 501];Frame3:[433, 420, 500, 501];Frame4:[442, 425, 510, 505].') 2023-11-03 02:55:38.787 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:55:38.788 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Person:[001, 369, 110, 997],[212, 159, 408, 1000],[388, 173, 534, 1000],[501, 162, 820, 1000],[620, 105, 844, 1000],[743, 002, 999, 865];Necklace:[545, 356, 613, 618],[647, 297, 755, 520].') 2023-11-03 02:56:00.559 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:56:00.559 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'wheel:[316, 642, 393, 723],[165, 633, 229, 714],[245, 631, 313, 714],[688, 633, 756, 717],[774, 636, 845, 724],[973, 654, 998, 745];person:[109, 571, 153, 722],[114, 447, 146, 583],[146, 575, 172, 709],[149, 476, 180, 573],[403, 454, 519, 605],[528, 445, 845, 605],[657, 465, 704, 601];land vehicle:[023, 365, 999, 747];tree:[000, 000, 999, 646],[073, 241, 243, 555],[171, 325, 267, 567],[267, 193, 401, 575],[377, 265, 526, 550],[458, 000, 996, 524],[650, 000, 999, 553].') 2023-11-03 02:56:11.009 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:56:11.010 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[601, 473, 652, 558],carFrame1:[578, 470, 609, 545],carFrame1:[620, 455, 739, 593] in this video clip.Keep your response consistent with the classFrame t:[xmin,ymin,xmax,ymax] format.', 'carFrame1:[601, 473, 652, 558];Frame2:[625, 463, 696, 568],carFrame1:[578, 470, 609, 545];Frame2:[597, 460, 637, 545],carFrame1:[620, 455, 739, 593];Frame2:[661, 441, 818, 618].') 2023-11-03 02:56:26.315 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 02:56:26.315 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect boy,toy,human face and pumpkin in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'boy:[000, 000, 999, 999];toy:[729, 224, 840, 485];human face:[420, 092, 746, 663].') 2023-11-03 02:56:30.231 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:56:30.232 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[169, 455, 218, 598],[534, 426, 625, 778];Backpack:[165, 474, 191, 534];Train:[002, 278, 940, 809].') 2023-11-03 02:57:06.755 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:57:06.755 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[501, 234, 573, 739],personFrame1:[614, 223, 696, 717],personFrame1:[335, 225, 448, 708] in this video clip.Your response should conform strictly to the categoryFrame t:[xmin,ymin,xmax,ymax] format.', 'personFrame1:[501, 234, 573, 739];Frame2:[492, 214, 572, 746],personFrame1:[614, 223, 696, 717];Frame2:[612, 214, 689, 687],personFrame1:[335, 225, 448, 708];Frame2:[351, 205, 448, 731].') 2023-11-03 02:57:20.663 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:57:20.663 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Handbag:[360, 626, 456, 891],[428, 623, 501, 885];Desk:[039, 548, 379, 1000],[002, 524, 095, 609];Chair:[945, 511, 999, 710];Stool:[351, 842, 469, 988];Person:[506, 173, 1000, 999],[300, 215, 666, 1000],[306, 331, 447, 541];Watch:[509, 947, 551, 999];Cup:[547, 511, 659, 653],[215, 758, 264, 819],[294, 396, 398, 546],[002, 764, 045, 1000];Spoon:[202, 924, 239, 997];Plate:[092, 922, 265, 1000];Jug:[042, 835, 103, 927];Bowl:[210, 616, 255, 671];Bottle:[231, 484, 264, 530].') 2023-11-03 02:57:21.570 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:57:21.570 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'the pickle co [161, 635, 204, 710] [383, 253, 516, 498] [254, 253, 380, 494] [520, 252, 644, 498] [627, 577, 713, 744] [807, 578, 892, 747] [715, 579, 806, 747]') 2023-11-03 02:57:36.496 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:57:36.497 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Person:[000, 272, 328, 999],[417, 272, 789, 1000];Glasses:[455, 380, 583, 466],[044, 286, 164, 391];Barrel:[407, 353, 470, 536].') 2023-11-03 02:58:00.882 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:58:00.882 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[344, 341, 501, 571],carFrame1:[000, 258, 254, 562],carFrame1:[283, 331, 347, 432] in this video clip.Adhere strictly to the format categoryFrame t:[xmin,ymin,xmax,ymax] when providing your answer.', 'carFrame1:[344, 341, 501, 571];Frame2:[352, 344, 505, 559],carFrame1:[000, 258, 254, 562];Frame2:[013, 263, 270, 553],carFrame1:[283, 331, 347, 432];Frame2:[291, 334, 355, 428].') 2023-11-03 02:58:24.134 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:58:24.135 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Machinery Vehicle:[845, 337, 917, 701];Flag:[552, 781, 571, 854],[329, 438, 337, 446];Person:[547, 793, 561, 840],[392, 762, 400, 782],[675, 701, 683, 737],[043, 751, 051, 774];Lifesaver:[614, 793, 632, 833],[754, 888, 765, 917],[542, 802, 547, 840],[427, 791, 445, 828],[320, 775, 336, 809],[252, 812, 266, 855],[217, 768, 230, 790].') 2023-11-03 02:58:40.573 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:58:40.574 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3,frame4 and frame5,can you tell me the trajectory of the right vehicles in red in this video clip?Ensure the trajectories in your answer follow the TrackiFrame t:[xmin,ymin,xmax,ymax] structure.', 'There is one right vehicles in red.Track1frame2:[644, 536, 945, 1000];frame3:[665, 544, 1000, 1000];frame4:[689, 554, 1000, 1000];frame5:[719, 573, 1000, 1000].') 2023-11-03 02:58:43.320 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:58:43.320 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '20 years 2001-2021 Fast And Furious thank you for the memories signatures shirt [104, 009, 895, 992]') 2023-11-03 02:58:49.167 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:58:49.167 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Desk:[074, 721, 195, 843],[002, 834, 168, 1000];Lamp:[700, 059, 752, 503],[850, 003, 914, 497];Frame:[934, 189, 992, 278],[869, 339, 909, 423],[764, 421, 806, 487];Handbag:[554, 616, 647, 827];Person:[798, 486, 968, 1000],[721, 532, 761, 599],[704, 557, 812, 999],[659, 502, 719, 648],[557, 505, 672, 1000].') 2023-11-03 02:58:50.887 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:58:50.888 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Lamp,Flower and Umbrella in this image.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Lamp:[004, 032, 246, 385],[176, 217, 307, 404];Flower:[001, 467, 389, 995];Umbrella:[832, 106, 999, 547].') 2023-11-03 02:59:05.398 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:59:05.398 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Desk:[002, 106, 1000, 1000];Bracelet:[608, 058, 662, 131];Cucumber:[406, 429, 635, 598];Cup:[912, 411, 999, 552],[323, 287, 411, 392];Plate:[089, 372, 841, 809],[665, 226, 730, 296],[406, 168, 599, 260];Chopsticks:[270, 227, 528, 349];Knife:[344, 218, 550, 332];Bowl:[475, 099, 627, 244],[700, 183, 999, 427],[859, 022, 924, 120];Spoon:[454, 043, 575, 141],[829, 239, 999, 324];Bottle:[888, 002, 965, 202].') 2023-11-03 02:59:19.929 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:59:19.930 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'human body:[540, 270, 616, 449];human hair:[744, 533, 843, 712],[000, 507, 038, 589],[018, 502, 104, 612],[066, 576, 236, 753],[085, 462, 135, 532],[141, 520, 221, 603],[243, 497, 336, 613],[261, 608, 420, 798],[346, 543, 456, 669],[370, 470, 423, 539],[406, 509, 473, 580],[471, 483, 547, 589],[500, 537, 611, 702],[564, 468, 635, 536],[566, 520, 636, 603],[596, 662, 740, 857],[815, 509, 881, 588],[858, 469, 917, 550],[863, 493, 940, 596],[967, 534, 999, 669];human head:[056, 579, 220, 805],[016, 502, 111, 621],[085, 458, 136, 543],[146, 515, 226, 608],[193, 431, 230, 495],[196, 500, 266, 580],[240, 455, 286, 522],[244, 492, 326, 626],[270, 608, 416, 809],[300, 472, 343, 539],[345, 534, 459, 676],[367, 478, 423, 544],[399, 517, 471, 582],[476, 478, 537, 580],[496, 531, 600, 689],[566, 453, 633, 594],[596, 653, 745, 881],[668, 478, 716, 552],[700, 502, 761, 586],[731, 550, 833, 724],[742, 524, 823, 596],[813, 515, 880, 601],[874, 493, 933, 586],[956, 495, 999, 561],[970, 571, 999, 674];woman:[000, 580, 218, 999];mammal:[000, 431, 999, 999],[523, 264, 628, 544];man:[543, 271, 602, 537];clothing:[224, 807, 495, 999],[545, 312, 601, 530];human arm:[546, 322, 606, 394].') 2023-11-03 02:59:20.720 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:59:20.720 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Flag:[786, 571, 801, 607].') 2023-11-03 02:59:23.185 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 02:59:23.186 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'food:[419, 733, 635, 914];person:[055, 268, 435, 981],[301, 000, 617, 763],[550, 069, 825, 859];clothing:[067, 486, 382, 968],[331, 168, 613, 710],[596, 314, 825, 864].') 2023-11-03 02:59:53.808 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 02:59:53.808 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'a shabby chic outdoor bathroom [004, 001, 994, 998] with a shabby bathtub [300, 557, 782, 872] and a free-standing sink [002, 420, 281, 889]') 2023-11-03 03:00:00.984 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:00:00.984 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Desk:[236, 522, 384, 656],[286, 434, 345, 480];Person:[133, 364, 199, 486],[288, 366, 356, 485],[324, 352, 362, 470],[424, 371, 472, 452],[462, 356, 518, 453],[478, 349, 513, 467],[512, 362, 545, 456],[539, 355, 588, 465],[652, 346, 761, 520];Shovel:[879, 470, 912, 597];Rickshaw:[703, 501, 846, 648],[839, 445, 927, 512].') 2023-11-03 03:00:01.943 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:00:01.944 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Person:[572, 121, 807, 999],[253, 424, 585, 1000];Watch:[486, 803, 521, 827],[717, 601, 742, 642];Necklace:[382, 650, 442, 700];Picture:[717, 001, 933, 173],[338, 001, 579, 154],[000, 001, 189, 144];Chair:[506, 651, 573, 945];Desk:[000, 859, 522, 999];Laptop:[174, 807, 388, 910];Tablet:[394, 872, 510, 987];Cell Phone:[450, 732, 530, 815];TV:[082, 877, 387, 1000];Bottle:[077, 706, 128, 875].') 2023-11-03 03:00:12.886 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:00:12.887 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'land vehicle:[200, 321, 918, 931];man:[000, 089, 113, 692],[047, 115, 123, 597],[126, 140, 218, 655],[196, 123, 272, 461],[261, 140, 335, 474],[343, 084, 441, 376],[436, 153, 490, 310],[450, 153, 470, 235],[471, 129, 510, 424],[488, 220, 594, 545],[498, 123, 548, 240],[553, 132, 631, 472],[593, 118, 671, 483],[625, 091, 698, 495],[696, 089, 751, 506],[774, 097, 842, 509],[831, 084, 911, 530],[861, 020, 999, 655];clothing:[000, 161, 103, 669],[001, 000, 608, 217],[063, 181, 111, 587],[125, 199, 211, 621],[195, 172, 267, 433],[263, 172, 317, 473],[321, 178, 353, 240],[348, 153, 422, 373],[480, 171, 506, 239],[491, 295, 611, 577],[515, 168, 543, 233],[583, 174, 626, 424],[626, 165, 661, 373],[642, 128, 700, 383],[693, 142, 748, 391],[796, 147, 840, 405],[838, 125, 916, 394],[907, 090, 999, 616],[916, 214, 943, 399];tire:[163, 426, 232, 676],[354, 579, 477, 933],[676, 527, 798, 793].') 2023-11-03 03:00:14.710 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 03:00:14.710 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Desk:[007, 727, 999, 971];Piano:[554, 748, 1000, 895];Person:[155, 004, 689, 762],[031, 001, 149, 090];Umbrella:[015, 038, 256, 549];Belt:[617, 446, 671, 501];Fire Extinguisher:[817, 001, 920, 512].') 2023-11-03 03:00:16.393 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:00:16.394 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "This week: France's economic crisis, Chinese and Iranian leaders [499, 136, 657, 496] [129, 021, 494, 497] meet, record number of guns detected by TSA and the impact of Winter Storm Jonas.") 2023-11-03 03:00:23.146 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:00:23.147 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Trash bin Can:[913, 336, 974, 393].') 2023-11-03 03:00:47.256 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:00:47.256 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the three birds fight?When detailing trajectories in your response, adhere to the TrackiFrame t:[xmin,ymin,xmax,ymax] format.', 'There are 3 three birds fight.Track1frame:1:[377, 505, 695, 900];frame:2:[377, 505, 695, 900];frame:3:[434, 483, 790, 838],Track2frame:1:[446, 349, 546, 549];frame:2:[446, 349, 546, 549];frame:3:[441, 397, 538, 550],Track3frame:1:[487, 214, 596, 347];frame:2:[487, 214, 596, 347];frame:3:[476, 222, 572, 373].') 2023-11-03 03:00:52.203 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:00:52.203 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[100, 042, 508, 997] Oregon quarterback Tyler Shough threw for 312 yards and four touchdowns on Saturday in a 43-29 win at Washington State.') 2023-11-03 03:00:52.605 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:00:52.606 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A group of law enforcement officers [458, 382, 571, 977] [697, 348, 865, 865] [001, 375, 200, 996] and community partners [393, 424, 493, 997] [292, 432, 438, 997] [625, 380, 708, 790] at the St. Pete Police station stand around a podium [755, 530, 977, 903] to deliver the news about the task force.') 2023-11-03 03:01:05.089 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:01:05.089 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'car:[543, 420, 889, 611].') 2023-11-03 03:01:35.123 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:01:35.124 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Person:[850, 445, 998, 994],[694, 477, 867, 917],[802, 466, 858, 826],[709, 484, 771, 835],[655, 496, 708, 547];Leather Shoes:[925, 962, 956, 995],[791, 882, 818, 911],[746, 852, 769, 885];Helmet:[762, 478, 810, 527],[812, 472, 859, 520];Fire Truck:[000, 226, 707, 925];Car:[652, 548, 825, 803];Trolley:[817, 766, 975, 886];Cup:[893, 604, 916, 636].') 2023-11-03 03:01:36.109 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:01:36.110 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[340, 187, 852, 973] green and blue bar graph mapping the different influential drivers while undergoing a legacy systems transformation') 2023-11-03 03:02:09.066 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:02:09.067 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The blonde [080, 049, 988, 990] in a linen dress [379, 392, 995, 995] and straw hat [264, 040, 872, 399] sits on a viewpoint in the mountains Standard-Bild') 2023-11-03 03:02:28.444 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 03:02:28.444 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[198, 266, 719, 970],[440, 257, 772, 942],[178, 281, 344, 912],[001, 538, 064, 1000];Bracelet:[497, 454, 518, 479];Microphone:[800, 309, 846, 329],[506, 304, 616, 327];Tripod:[405, 601, 624, 992];Speaker:[636, 837, 1000, 999];Glasses:[367, 334, 438, 358];Sneakers:[649, 890, 715, 920],[403, 939, 457, 970];Other Shoes:[513, 914, 570, 939];Boots:[291, 787, 327, 900],[175, 788, 294, 914];Guitar:[258, 423, 781, 644];Piano:[506, 509, 998, 587].') 2023-11-03 03:02:43.551 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:02:43.551 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[000, 295, 200, 811],personFrame1:[499, 286, 595, 772],personFrame1:[029, 263, 257, 855] in this video clip.Adhere strictly to the format categoryFrame t:[xmin,ymin,xmax,ymax] when providing your answer.', 'personFrame1:[000, 295, 200, 811];Frame2:[082, 268, 195, 801],personFrame1:[499, 286, 595, 772];Frame2:[489, 291, 596, 766],personFrame1:[029, 263, 257, 855];Frame2:[143, 276, 267, 847].') 2023-11-03 03:02:58.741 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:02:58.742 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the child riding bicycle and dismounting in this video clip?For clarity, represent trajectories using the TrackiFrame t:[xmin,ymin,xmax,ymax] format in your response.', 'There are 3 child riding bicycle and dismounting.Track1frame:1:[417, 297, 442, 395];frame:2:[434, 294, 460, 429];frame:3:[437, 295, 463, 427],Track2frame:1:[357, 308, 392, 440];frame:2:[360, 306, 387, 438];frame:3:[359, 305, 387, 438],Track3frame:1:[413, 316, 443, 429];frame:2:[407, 301, 435, 429];frame:3:[404, 294, 429, 423].') 2023-11-03 03:03:44.584 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:03:44.584 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Dutch-born businesswoman and [168, 217, 531, 998] former model Wilhelmina Cooper watches as [422, 057, 995, 992] her husband, former television executive Bruce Cooper , plays [128, 439, 727, 944] the trumpet in...') 2023-11-03 03:03:45.582 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:03:45.582 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[234, 419, 530, 929] Extra large, fluffy snickerdoodle cookies and [569, 137, 767, 743] a bottle of milk on [000, 383, 996, 996] a table') 2023-11-03 03:04:32.936 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:04:32.936 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[010, 021, 989, 991] Example of an empty news widget displaying text that says no news in the last three months.') 2023-11-03 03:04:47.671 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:04:47.672 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Lamp:[872, 084, 970, 302],[226, 082, 314, 311],[545, 063, 638, 303];Cabinet:[001, 514, 1000, 937],[561, 424, 804, 536];Person:[259, 321, 502, 998],[032, 395, 113, 530],[472, 393, 561, 513];TV:[843, 419, 933, 514].') 2023-11-03 03:05:03.401 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:05:03.402 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[483, 229, 512, 400] and track its trajectory.All trajectories in your reply should conform to the Frame t:[xmin,ymin,xmax,ymax] pattern.', 'This is a personFrame1:[483, 229, 512, 400];Frame2:[512, 227, 542, 395];Frame3:[567, 218, 600, 368];Frame4:[592, 218, 631, 394];Frame5:[630, 209, 664, 391].') 2023-11-03 03:05:33.559 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:05:33.559 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'person:[000, 371, 019, 488],[000, 483, 109, 638],[000, 628, 138, 999],[019, 355, 070, 455],[087, 417, 187, 619],[146, 357, 180, 426],[163, 415, 229, 540],[215, 384, 271, 498],[238, 367, 285, 461],[260, 371, 291, 452],[280, 240, 315, 305],[287, 346, 316, 438],[569, 254, 601, 317],[578, 353, 623, 428],[608, 369, 656, 454],[636, 382, 685, 459],[646, 401, 701, 484],[685, 419, 734, 523],[710, 408, 783, 553],[724, 361, 761, 430],[731, 442, 830, 593],[755, 373, 786, 459],[802, 500, 926, 673],[843, 392, 891, 476],[862, 411, 898, 483],[943, 173, 980, 338],[955, 423, 991, 498],[963, 417, 999, 532],[971, 196, 999, 338].') 2023-11-03 03:06:26.363 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:06:26.363 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'epic games announces the complete fortnite world cup format [419, 099, 621, 476] and 30 million new york city tournament - fortnite custom games turnier') 2023-11-03 03:06:55.311 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:06:55.311 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Person:[002, 263, 322, 1000],[720, 076, 1000, 999];Stool:[002, 824, 212, 999],[766, 726, 938, 997];Speaker:[818, 905, 999, 999].') 2023-11-03 03:07:00.205 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:07:00.206 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'man:[629, 810, 787, 997],[198, 820, 268, 999],[250, 768, 366, 999],[406, 820, 481, 999],[858, 736, 999, 999];clothing:[256, 837, 363, 985],[000, 699, 157, 999],[130, 840, 186, 999],[202, 854, 245, 999],[242, 860, 288, 999],[348, 899, 393, 999],[409, 862, 470, 999],[433, 847, 550, 999],[547, 912, 574, 999],[557, 905, 635, 999],[625, 857, 773, 999],[850, 809, 999, 999];building:[000, 020, 986, 925];woman:[006, 625, 155, 999],[113, 800, 213, 996],[232, 811, 305, 950],[345, 875, 393, 999];human face:[860, 751, 888, 829].') 2023-11-03 03:07:00.989 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:07:00.990 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Lamp:[584, 065, 642, 226],[399, 001, 474, 058],[330, 001, 381, 125],[856, 001, 911, 209];Stuffed Toy:[110, 202, 368, 988],[337, 330, 556, 1000],[523, 297, 679, 773].') 2023-11-03 03:08:04.056 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:08:04.056 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect houseplant,tree and house in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'houseplant:[115, 655, 271, 795],[275, 726, 820, 999],[649, 731, 999, 975];tree:[011, 445, 156, 693],[139, 035, 363, 359],[313, 150, 456, 257],[462, 035, 644, 205],[640, 000, 853, 125],[828, 000, 999, 393];house:[156, 030, 966, 914].') 2023-11-03 03:08:22.381 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:08:22.381 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Glasses:[235, 580, 333, 617];Picture:[134, 408, 403, 596],[623, 394, 943, 650];Stool:[804, 818, 964, 1000];Person:[069, 336, 908, 916],[392, 090, 879, 802];Drum:[642, 796, 777, 864],[616, 863, 828, 992],[802, 817, 954, 866].') 2023-11-03 03:09:11.021 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:09:11.022 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[380, 134, 594, 735] and track its trajectory.If you're including trajectory details in your reply, the Frame t:[xmin,ymin,xmax,ymax] format is imperative.", 'This is a giant pandaFrame1:[380, 134, 594, 735];Frame2:[433, 135, 689, 541];Frame3:[374, 257, 704, 607];Frame4:[087, 092, 447, 524].') 2023-11-03 03:09:30.945 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:09:30.946 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[300, 191, 787, 765] and track its trajectory.Ensure the trajectories in your answer follow the Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a brown horse ridden by a man in the fieldFrame1:[300, 191, 787, 765];Frame2:[322, 297, 835, 809];Frame3:[318, 309, 705, 970];Frame4:[285, 262, 629, 966];Frame5:[317, 256, 642, 748].') 2023-11-03 03:09:52.426 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:09:52.427 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[793, 004, 998, 993] [001, 004, 233, 993] [585, 148, 707, 481] Three workers inspect products on [124, 477, 940, 996] tables in front of [004, 002, 991, 518] a wall painted with [007, 022, 587, 271] a large Rodeo Plastics logo.') 2023-11-03 03:10:12.115 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:10:12.116 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Ela Orchard's Apple Barn [002, 113, 666, 557] is seen in a photo before the fire.") 2023-11-03 03:10:20.705 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:10:20.705 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[005, 004, 990, 996] interior of a modern laundry room - [466, 132, 649, 877] laundry stock pictures, royalty-free photos & images') 2023-11-03 03:10:21.632 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:10:21.633 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video clip including frame1,frame2 and frame3, what is this objectframe:1:[300, 579, 435, 769];frame:2:[253, 583, 377, 759];frame:3:[246, 577, 359, 740] doing?Using the given image frames and trajectory coordinates, please provide a concise answer.Refer to the supplied image frames and trajectory coordinates for a short response to the question.Please give a succinct reply by referring to the image frames and trajectory coordinates provided.Make sure your response is brief and references the image frames and trajectory coordinates you've been given.Draw upon the provided image frames and trajectory coordinates for a concise answer to the question.Relying on the image frames and trajectory coordinates shared, please keep your answer brief.", 'This is a/an elephant move right-side') 2023-11-03 03:11:41.576 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:11:41.577 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Amazon has come under fire in China for selling [204, 239, 559, 886] t-shirts with slogans in support of the ongoing anti-government protests in Hong Kong.') 2023-11-03 03:12:22.240 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:12:22.241 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[624, 007, 982, 997] Police check a man dressed as [003, 049, 708, 998] Santa Claus as he passes through a metal detector at the U.S. Capitol on Wednesday, December 12, in Washington.') 2023-11-03 03:12:42.158 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:12:42.158 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'footwear:[207, 835, 274, 921],[441, 715, 461, 794];balloon:[156, 092, 197, 163],[192, 128, 232, 196],[233, 134, 306, 230],[285, 198, 333, 272],[333, 240, 366, 306],[550, 000, 720, 171],[838, 028, 913, 146],[852, 000, 950, 087],[912, 031, 999, 204];woman:[000, 400, 083, 834],[215, 387, 249, 741],[232, 342, 288, 762],[317, 358, 371, 714],[365, 350, 411, 652],[406, 358, 451, 620],[449, 370, 485, 630],[506, 462, 676, 998],[709, 370, 753, 736],[863, 352, 909, 593];man:[221, 340, 328, 917],[451, 342, 505, 635],[478, 348, 534, 635],[633, 298, 710, 717];girl:[075, 520, 152, 892],[711, 355, 755, 741],[865, 355, 910, 594];jeans:[653, 568, 719, 717].') 2023-11-03 03:13:11.230 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:13:11.231 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Pizza,Chair and Hat in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Pizza:[435, 431, 528, 476],[340, 591, 429, 667],[264, 555, 386, 616],[287, 729, 481, 871];Chair:[001, 473, 246, 1000],[001, 350, 057, 455],[058, 301, 095, 356],[001, 286, 034, 439];Hat:[368, 197, 432, 250].') 2023-11-03 03:13:25.117 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 03:13:25.117 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Glasses:[884, 261, 974, 321],[665, 297, 824, 381];Lamp:[147, 002, 233, 217];Picture:[263, 261, 393, 445],[468, 278, 580, 391],[002, 241, 060, 356];shelf:[589, 159, 1000, 661];Desk:[209, 612, 374, 961];Person:[002, 354, 390, 999],[335, 271, 501, 1000],[401, 281, 655, 1000];Wine Glass:[659, 455, 720, 634],[408, 410, 464, 531].') 2023-11-03 03:13:39.413 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:13:39.413 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Sneakers:[319, 672, 390, 730];Helmet:[240, 438, 339, 522];Person:[745, 433, 1000, 745],[187, 436, 492, 737],[163, 267, 224, 318].') 2023-11-03 03:13:45.215 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:13:45.216 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Alcott stands out from [258, 470, 389, 756] other authors because [389, 212, 889, 746] her writings have been translated in over fifty languages, and none of her eight books for young adults have been out of print, for example Little Women.') 2023-11-03 03:13:52.405 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:13:52.405 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Picture:[895, 022, 940, 125];Cue:[518, 389, 561, 539];Person:[496, 116, 757, 585],[057, 195, 202, 380],[211, 144, 318, 383],[312, 166, 386, 332],[360, 140, 494, 406];Bracelet:[548, 473, 605, 524];Hat:[496, 117, 607, 264];Belt:[703, 322, 743, 417];Moniter:[837, 181, 911, 296],[611, 001, 736, 119],[442, 007, 547, 121],[293, 030, 390, 152],[199, 078, 258, 160],[114, 094, 176, 178];Billards:[732, 484, 766, 531],[597, 506, 633, 553],[399, 496, 438, 543],[226, 518, 258, 562],[181, 580, 228, 645].') 2023-11-03 03:13:57.288 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:13:57.288 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "[180, 065, 929, 990] Buffalo Bills defensive tackle Kyle Williams (95) congratulates tight end Nick O'Leary (84), after O'Leary scored a touchdown, during the first half of an NFL football game against the Miami Dolph ...") 2023-11-03 03:13:58.854 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:13:58.855 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'plant:[720, 502, 993, 963],[000, 000, 174, 371],[038, 428, 109, 560],[175, 000, 483, 586],[527, 813, 676, 999],[790, 338, 853, 439],[868, 941, 953, 999];man:[000, 362, 300, 999],[283, 401, 527, 999],[433, 352, 671, 999],[522, 412, 706, 958],[550, 392, 778, 908],[649, 364, 846, 870],[749, 421, 914, 800],[813, 412, 945, 802],[876, 441, 968, 769],[910, 458, 963, 684];clothing:[078, 487, 404, 999],[226, 441, 816, 999],[753, 495, 951, 688].') 2023-11-03 03:14:42.216 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:14:42.217 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Person:[148, 001, 697, 999],[616, 455, 739, 645],[000, 299, 030, 481],[000, 525, 238, 999],[882, 001, 1000, 1000];Necklace:[351, 351, 445, 696],[975, 104, 1000, 289];Ring:[148, 375, 177, 415];Bracelet:[200, 467, 239, 517],[210, 514, 256, 557],[888, 605, 926, 634];Umbrella:[584, 122, 754, 239];Hat:[295, 001, 483, 268];Glasses:[299, 105, 444, 184];Leather Shoes:[670, 455, 706, 520],[695, 499, 734, 565];Cell Phone:[100, 661, 200, 742];Cup:[008, 535, 050, 610],[028, 491, 074, 574],[133, 487, 173, 549];Desk:[013, 529, 271, 999];Chair:[643, 567, 747, 854].') 2023-11-03 03:15:23.075 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:15:23.075 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Boat:[400, 278, 861, 410],[770, 337, 1000, 430],[000, 250, 833, 845];Person:[965, 193, 999, 357];Umbrella:[830, 150, 984, 374],[682, 106, 874, 363],[524, 106, 716, 308],[554, 001, 688, 067],[645, 001, 790, 129],[690, 030, 816, 124],[802, 001, 989, 155],[956, 077, 999, 150],[000, 131, 098, 294],[046, 147, 156, 274],[282, 118, 386, 270],[199, 103, 349, 249];Stool:[062, 519, 149, 614];Desk:[886, 236, 968, 331].') 2023-11-03 03:15:23.937 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:15:23.937 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'At the Grammys [163, 002, 248, 106], singer Joy Villa [070, 012, 338, 998] wore a dress [071, 281, 343, 997] hand-painted with the image of a fetus surrounded by a rainbow [151, 733, 343, 996], paired with a purse [103, 527, 200, 768] that read "choose life."') 2023-11-03 03:15:37.779 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:15:37.780 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'human head:[733, 371, 886, 597];mammal:[615, 334, 962, 984];man:[623, 356, 957, 1000];clothing:[652, 520, 899, 952];human hair:[762, 367, 862, 481];human face:[740, 408, 841, 598];human arm:[653, 625, 848, 854].') 2023-11-03 03:15:51.350 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:15:51.350 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Bowl/Basin in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Bowl:[141, 252, 911, 956],[792, 197, 999, 739].') 2023-11-03 03:16:01.813 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:16:01.814 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Head Phone:[102, 362, 166, 439],[293, 390, 343, 456],[566, 435, 582, 476],[847, 472, 857, 501],[876, 469, 885, 494],[897, 481, 906, 504];TV:[171, 344, 295, 537],[359, 389, 447, 538],[497, 393, 565, 540],[605, 428, 663, 540],[679, 423, 730, 540],[908, 414, 922, 426];Mouse:[110, 539, 143, 556];Microphone:[336, 452, 341, 465],[988, 425, 995, 431];Person:[062, 362, 252, 557],[260, 393, 406, 540],[378, 505, 467, 539],[479, 430, 547, 538],[551, 435, 678, 541],[967, 411, 999, 662],[893, 480, 933, 542],[862, 469, 919, 545],[826, 473, 905, 617];Bracelet:[622, 533, 628, 540];Chair:[804, 481, 855, 648],[060, 388, 111, 522],[461, 436, 497, 538];Desk:[861, 540, 1000, 662],[008, 536, 790, 920];Hat:[567, 435, 606, 462];Backpack:[848, 588, 880, 658];Other Shoes:[968, 640, 993, 660].') 2023-11-03 03:16:52.729 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:16:52.729 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'man:[174, 823, 348, 1000],[134, 624, 219, 905],[296, 850, 385, 999],[443, 914, 632, 999];human face:[294, 918, 321, 996].') 2023-11-03 03:16:54.735 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:16:54.735 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'How to use a Schuko Type F power charger [356, 382, 519, 639] for charging your iPad Mini [752, 263, 940, 724] 4 from a Lao power outlet [080, 303, 299, 687]') 2023-11-03 03:17:15.654 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:17:15.654 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'healthy cinnamon apples [001, 002, 974, 995] in a white bowl') 2023-11-03 03:17:50.582 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:17:50.582 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "[164, 290, 270, 610] [583, 344, 704, 695] [668, 440, 787, 958] [036, 349, 200, 776] [198, 377, 328, 955] [775, 361, 932, 709] Staff members at Acorns Children's Hospice in Walsall celebrate after the hospice was saved from the threat of closure.") 2023-11-03 03:18:31.747 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:18:31.747 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[434, 241, 615, 743] and track its trajectory.Ensure that each trajectory within your response is articulated using the Frame t:[xmin,ymin,xmax,ymax] specification.', 'This is a pintoFrame1:[434, 241, 615, 743];Frame2:[381, 238, 702, 700];Frame3:[386, 237, 703, 677];Frame4:[400, 237, 721, 687];Frame5:[401, 242, 736, 692].') 2023-11-03 03:18:40.198 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:18:40.198 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'person:[346, 246, 376, 309],[382, 227, 438, 401],[458, 308, 541, 423],[580, 187, 635, 411],[673, 238, 723, 373],[720, 242, 774, 361],[778, 221, 820, 536];clothing:[377, 280, 446, 404],[459, 335, 545, 430],[576, 231, 651, 430],[781, 252, 823, 534];building:[000, 000, 999, 999];window:[348, 122, 389, 299],[445, 122, 543, 318],[603, 118, 666, 310],[754, 115, 823, 323].') 2023-11-03 03:18:40.693 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:18:40.693 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '1983 Sea Ray boat for sale, model of [106, 371, 966, 824] the boat is SRV 360 Express & [345, 794, 483, 891] Image # 1 of 40') 2023-11-03 03:19:19.143 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:19:19.143 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Now covered with icicles [066, 413, 540, 682], Class A Auto Body in Bemidji, MN, was the site of a large structure fire the evening of Jan. 19. In addition to the auto body shop [097, 555, 497, 916], the building [065, 007, 973, 919] included two apartment units [001, 386, 100, 842].') 2023-11-03 03:21:05.260 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:21:05.261 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Storage box,Sneakers and Toilet in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Storage box:[134, 201, 228, 238],[359, 203, 395, 233],[001, 247, 049, 279],[055, 248, 097, 279],[096, 246, 144, 281],[151, 247, 203, 280],[201, 245, 253, 280],[250, 243, 284, 281],[383, 239, 448, 280],[450, 242, 515, 280],[522, 238, 599, 281],[604, 234, 676, 278],[519, 290, 573, 331],[458, 289, 514, 329],[383, 286, 433, 327],[151, 285, 200, 322],[196, 289, 252, 323],[256, 287, 296, 318],[061, 286, 123, 318],[001, 282, 054, 319],[108, 324, 146, 358],[056, 325, 104, 354],[012, 322, 054, 354],[176, 324, 236, 363],[465, 336, 512, 376],[519, 336, 583, 380],[495, 425, 579, 468],[540, 459, 597, 506],[485, 463, 539, 498],[473, 400, 533, 447],[470, 447, 502, 492];Sneakers:[206, 750, 260, 801],[316, 740, 411, 797],[596, 509, 622, 532],[636, 511, 662, 531];Toilet:[908, 354, 947, 450].') 2023-11-03 03:21:33.594 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:21:33.594 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'shelf:[001, 001, 999, 1000];Basin:[200, 020, 328, 113],[344, 032, 452, 126],[531, 001, 718, 142];Cup:[576, 278, 657, 394],[610, 871, 714, 1000];Bottle:[348, 467, 395, 626],[293, 450, 344, 630],[232, 434, 282, 627],[196, 430, 246, 616],[593, 702, 697, 911],[496, 732, 569, 918],[315, 712, 421, 923].') 2023-11-03 03:22:06.938 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:22:06.938 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'dress:[370, 320, 700, 859];man:[380, 220, 695, 860];flower:[071, 563, 385, 828],[629, 568, 975, 866].') 2023-11-03 03:22:26.377 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 03:22:26.377 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'person:[050, 285, 193, 860],[174, 284, 999, 999];football:[351, 426, 383, 509].') 2023-11-03 03:22:27.492 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:22:27.492 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Person:[000, 110, 012, 133],[693, 188, 707, 246],[780, 190, 791, 219],[859, 196, 881, 257],[904, 198, 921, 264],[919, 194, 941, 266];Awning:[762, 180, 783, 197];Car:[000, 327, 142, 1000],[130, 180, 952, 768],[093, 170, 139, 205],[144, 171, 187, 199];Traffic cone:[570, 263, 578, 284],[531, 252, 538, 288],[496, 264, 506, 293],[486, 267, 493, 286],[471, 267, 478, 280],[493, 246, 500, 258],[486, 237, 494, 261],[469, 243, 478, 267];Truck:[001, 132, 069, 182];Pickup Truck:[071, 162, 142, 196];Street Lights:[280, 111, 304, 170],[413, 101, 438, 180],[174, 126, 195, 163];Bench:[271, 271, 434, 325].') 2023-11-03 03:22:35.211 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:22:35.212 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[476, 288, 781, 595] and track its trajectory.If you're including trajectory details in your reply, the Frame t:[xmin,ymin,xmax,ymax] format is imperative.", 'This is a pademelonFrame1:[476, 288, 781, 595];Frame2:[466, 284, 755, 577];Frame3:[458, 309, 786, 564];Frame4:[457, 293, 806, 566].') 2023-11-03 03:22:53.861 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:22:53.861 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Cucumber,Storage box,Lamp and Fan in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Storage box:[547, 673, 768, 848],[536, 802, 808, 999];Lamp:[421, 001, 570, 042];Fan:[803, 880, 905, 999].') 2023-11-03 03:23:01.470 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:23:01.471 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Potted Plant:[768, 627, 972, 913];Person:[365, 295, 552, 870],[204, 002, 301, 121],[279, 002, 368, 126],[325, 055, 421, 205],[361, 002, 457, 163],[400, 002, 490, 072],[513, 002, 573, 113],[453, 008, 532, 131],[462, 095, 534, 228],[406, 109, 474, 238],[470, 165, 546, 301],[521, 072, 586, 171],[517, 130, 601, 300],[580, 046, 642, 163],[579, 004, 642, 113],[582, 117, 654, 259],[536, 235, 612, 380],[640, 030, 713, 159],[601, 197, 663, 342],[591, 277, 687, 437],[670, 342, 756, 511],[744, 413, 849, 571],[779, 314, 859, 430],[739, 280, 795, 363],[680, 235, 747, 330],[650, 167, 719, 295],[638, 098, 724, 237],[698, 032, 763, 175],[734, 006, 806, 156],[858, 002, 934, 137],[917, 002, 1000, 150],[871, 125, 952, 240],[711, 174, 784, 299],[752, 208, 848, 347],[783, 163, 860, 292],[959, 190, 1000, 287],[897, 202, 964, 306],[928, 288, 999, 397],[852, 234, 939, 349],[843, 279, 923, 441],[854, 372, 940, 497],[941, 427, 999, 551],[839, 474, 929, 616],[909, 540, 987, 693],[675, 280, 751, 402],[928, 351, 995, 488],[947, 660, 999, 735];Tie:[296, 007, 344, 087];Sneakers:[364, 729, 430, 786],[487, 780, 539, 868].') 2023-11-03 03:23:22.616 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:23:22.617 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[421, 353, 689, 875] and track its trajectory.If you're including trajectory details in your reply, the Frame t:[xmin,ymin,xmax,ymax] format is imperative.", 'This is a half trackFrame1:[421, 353, 689, 875];Frame2:[474, 300, 791, 794];Frame3:[403, 381, 751, 893];Frame4:[410, 396, 798, 900];Frame5:[523, 392, 942, 907].') 2023-11-03 03:23:29.365 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:23:29.365 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Person:[002, 003, 998, 1000].') 2023-11-03 03:23:45.721 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:23:45.721 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Restaurant workers [163, 257, 254, 609] [262, 100, 450, 741] rallied in Times Square, calling on state leaders to provide additional support through the winter, Dec. 15, 2020.') 2023-11-03 03:25:45.911 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:25:45.911 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2 and frame3, please tell me what this oneframe:3:[637, 000, 999, 287] is doing?Briefly articulate your response by taking cues from the provided image frames and trajectory coordinates.', 'This is a/an sharks swimming left') 2023-11-03 03:26:07.159 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:26:07.159 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[405, 023, 901, 806] and track its trajectory.Any trajectory details should be arranged according to the Frame t:[xmin,ymin,xmax,ymax] format for clarity.', 'This is a armadilloFrame1:[405, 023, 901, 806];Frame2:[319, 064, 814, 813];Frame3:[315, 060, 768, 875];Frame4:[349, 098, 692, 897];Frame5:[369, 130, 653, 915].') 2023-11-03 03:26:11.359 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:26:11.359 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The Traditional Janitorial Cleaning Cart [221, 050, 789, 954] with zippered vinyl bag collects waste and transports tools for efficient cleaning.') 2023-11-03 03:26:12.869 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:26:12.869 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Flower:[468, 360, 536, 456];Picture:[828, 043, 856, 270];Lamp:[436, 001, 525, 119],[469, 018, 536, 158],[161, 001, 226, 174],[030, 001, 123, 129];Clock:[195, 075, 260, 189];Bottle:[391, 490, 424, 647].') 2023-11-03 03:26:23.826 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:26:23.827 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3,frame4 and frame5,can you tell me the trajectory of the woman wandering in the hallway in this video clip?The TrackiFrame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'There is one woman wandering in the hallway.Track1frame1:[605, 201, 1000, 711];frame2:[607, 190, 1000, 697].') 2023-11-03 03:27:20.260 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 03:27:20.261 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Flower:[129, 575, 235, 691];Flag:[440, 414, 516, 617];Stop Sign:[251, 279, 337, 412];Fire Hydrant:[640, 674, 687, 806].') 2023-11-03 03:27:38.970 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:27:38.971 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'building:[000, 000, 999, 999].') 2023-11-03 03:27:43.172 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:27:43.172 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'house:[000, 000, 818, 940],[770, 290, 977, 674],[809, 346, 871, 660],[961, 403, 999, 639];window:[000, 386, 213, 665],[030, 060, 143, 241],[042, 000, 150, 102],[143, 105, 240, 262],[153, 000, 247, 145],[237, 150, 323, 301],[243, 019, 325, 190],[413, 242, 459, 345],[413, 135, 463, 247],[416, 462, 529, 646],[462, 165, 506, 266],[462, 255, 506, 362],[507, 184, 542, 287],[510, 272, 542, 377],[569, 480, 649, 645],[570, 220, 661, 415],[649, 505, 703, 665],[675, 285, 736, 438],[751, 330, 795, 469],[812, 356, 862, 482],[873, 380, 896, 490],[893, 391, 918, 490],[913, 520, 937, 617],[920, 403, 942, 493],[940, 419, 955, 503].') 2023-11-03 03:27:56.809 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:27:56.809 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'woman:[337, 615, 615, 982],[354, 653, 527, 817];clothing:[684, 629, 904, 909],[000, 649, 999, 996];man:[038, 636, 136, 843],[155, 656, 369, 996],[487, 612, 624, 976],[774, 602, 999, 999];human face:[226, 667, 290, 728],[740, 608, 813, 676],[940, 653, 999, 827].') 2023-11-03 03:28:23.381 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:28:23.381 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the white and orange cat move away from the bench?Stick to the TrackiFrame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'There is one white and orange cat move away from the bench.Track1frame:1:[356, 405, 629, 777];frame:2:[356, 374, 631, 738];frame:3:[329, 381, 609, 739].') 2023-11-03 03:29:01.660 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:29:01.660 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Paddle:[181, 552, 463, 722],[203, 517, 353, 702];Boat:[234, 673, 425, 738];Person:[280, 563, 372, 689],[235, 555, 309, 687].') 2023-11-03 03:29:35.915 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:29:35.915 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect man,human hair,human face,human head,footwear and human arm in this image.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'man:[108, 404, 374, 998],[527, 380, 784, 998],[566, 403, 921, 998],[938, 420, 999, 726];human hair:[198, 412, 308, 569],[415, 390, 522, 528],[761, 395, 890, 544];human face:[259, 442, 318, 553],[434, 421, 485, 493],[629, 450, 688, 518],[763, 442, 823, 582];human head:[208, 407, 320, 562],[417, 394, 501, 516],[623, 380, 709, 520],[754, 399, 885, 583],[964, 421, 999, 566];footwear:[437, 960, 509, 998];human arm:[676, 645, 886, 901],[215, 574, 356, 798],[226, 566, 332, 774],[390, 447, 454, 660],[450, 475, 530, 647],[568, 483, 670, 654],[631, 489, 772, 691].') 2023-11-03 03:29:42.382 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:29:42.382 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video cluo including frame1,frame2 and frame3, please tell me what this oneframe:1:[008, 044, 725, 962];frame:2:[000, 000, 577, 999];frame:3:[129, 019, 744, 998] is doing?Using the given image frames and trajectory coordinates, please provide a concise answer.Refer to the supplied image frames and trajectory coordinates for a short response to the question.Please give a succinct reply by referring to the image frames and trajectory coordinates provided.Make sure your response is brief and references the image frames and trajectory coordinates you've been given.Draw upon the provided image frames and trajectory coordinates for a concise answer to the question.Relying on the image frames and trajectory coordinates shared, please keep your answer brief.", 'This is a/an jump to the left-side') 2023-11-03 03:29:48.446 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:29:48.447 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[137, 281, 817, 971] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a alaskan brown bearFrame1:[137, 281, 817, 971];Frame2:[333, 260, 960, 999];Frame3:[271, 279, 813, 979];Frame4:[163, 272, 727, 983];Frame5:[132, 294, 806, 999].') 2023-11-03 03:30:17.051 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:30:17.052 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'shelf:[715, 407, 886, 535];Person:[845, 289, 999, 890],[401, 333, 587, 558],[274, 250, 371, 469],[001, 218, 212, 643];Cleaning Products:[412, 851, 480, 1000];Hat:[056, 218, 132, 302],[848, 288, 928, 359];Bottle:[389, 517, 427, 682],[731, 613, 773, 719],[262, 575, 303, 649];Basin:[486, 577, 561, 678],[295, 610, 392, 683],[585, 659, 683, 744],[718, 622, 782, 689];Pot:[156, 615, 262, 691],[001, 632, 088, 764],[127, 645, 193, 708],[648, 607, 714, 695],[716, 562, 810, 615];Knife:[199, 658, 282, 786],[193, 747, 242, 798],[169, 738, 233, 796];Oven:[001, 189, 282, 565];Extractor:[001, 002, 790, 142].') 2023-11-03 03:30:23.812 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:30:23.812 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Car:[827, 370, 1000, 592];Street Lights:[805, 262, 852, 362],[581, 299, 634, 374],[262, 256, 359, 382];Bicycle:[011, 411, 054, 486],[095, 425, 130, 464],[205, 402, 241, 443];Person:[264, 256, 628, 968],[641, 322, 922, 800],[890, 348, 940, 377],[114, 353, 209, 499],[001, 395, 049, 543];Awning:[113, 345, 148, 370],[001, 299, 119, 350];Satchel:[579, 526, 704, 680],[679, 526, 781, 592];Sneakers:[738, 738, 788, 784],[694, 743, 737, 801],[508, 907, 598, 968],[343, 798, 394, 857];Hat:[744, 320, 838, 374];Other Shoes:[153, 480, 176, 498];Cell Phone:[780, 477, 821, 495].') 2023-11-03 03:30:47.799 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:30:47.799 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'tree:[734, 179, 974, 535],[284, 253, 732, 507].') 2023-11-03 03:31:10.166 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:31:10.166 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'boy:[266, 245, 402, 537],[461, 247, 603, 733];desk:[021, 570, 249, 970];human body:[460, 251, 595, 773],[224, 333, 685, 999],[278, 236, 402, 541];human hair:[482, 245, 562, 357],[306, 231, 368, 301],[410, 336, 477, 395];human head:[298, 235, 380, 342],[406, 335, 476, 456],[492, 250, 561, 379];man:[219, 302, 575, 979];coffee table:[027, 595, 255, 959];chair:[242, 317, 622, 999];human leg:[341, 593, 685, 916],[429, 753, 529, 999],[475, 535, 530, 725];mammal:[216, 327, 686, 999],[270, 227, 402, 537],[458, 240, 603, 718];clothing:[231, 429, 599, 999],[272, 340, 404, 543],[461, 362, 595, 716];human face:[306, 255, 368, 359],[410, 341, 480, 456],[495, 290, 547, 370];human arm:[218, 458, 378, 738],[269, 361, 331, 463],[345, 349, 403, 449],[458, 355, 522, 468],[530, 395, 593, 505],[531, 518, 608, 764];human hand:[221, 629, 275, 736],[530, 654, 572, 763].') 2023-11-03 03:32:10.637 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:32:10.637 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Storage box,Microphone and Person in this image.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Storage box:[907, 723, 999, 922];Microphone:[737, 345, 790, 423],[074, 342, 155, 446];Person:[625, 168, 989, 1000],[190, 238, 386, 999],[002, 201, 181, 629].') 2023-11-03 03:32:15.898 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:32:15.898 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect shelf and Picture/Frame in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'shelf:[001, 051, 184, 340];Picture:[408, 219, 573, 385],[613, 272, 759, 467].') 2023-11-03 03:32:19.070 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:32:19.071 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect human hair,clothing and woman in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'human hair:[310, 400, 353, 478];clothing:[656, 430, 784, 560],[205, 456, 278, 518],[278, 455, 376, 537],[375, 444, 467, 537],[499, 442, 597, 530];woman:[651, 387, 783, 575],[276, 400, 375, 535].') 2023-11-03 03:32:21.640 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:32:21.641 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[308, 261, 425, 471] and track its trajectory.Your response should highlight trajectories using the established Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a forest goatFrame1:[308, 261, 425, 471];Frame2:[381, 191, 440, 448];Frame3:[371, 181, 429, 459];Frame4:[383, 195, 439, 484].') 2023-11-03 03:32:35.196 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:32:35.196 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[337, 466, 414, 534],carFrame1:[475, 421, 611, 604],carFrame1:[025, 450, 209, 572],carFrame1:[215, 460, 342, 548],carFrame1:[649, 465, 693, 529] in this video clip.Adhere strictly to the format categoryFrame t:[xmin,ymin,xmax,ymax] when providing your answer.', 'carFrame1:[337, 466, 414, 534];Frame2:[286, 456, 379, 532],carFrame1:[475, 421, 611, 604];Frame2:[461, 409, 614, 623],carFrame1:[025, 450, 209, 572];Frame2:[000, 438, 129, 581],carFrame1:[215, 460, 342, 548];Frame2:[142, 447, 291, 551],carFrame1:[649, 465, 693, 529];Frame2:[639, 446, 685, 524].') 2023-11-03 03:32:45.983 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:32:45.983 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Bunch of [499, 283, 630, 519] [645, 223, 791, 492] [409, 370, 478, 579] [494, 318, 573, 638] alapacas on a green grassy hill behind [008, 542, 993, 990] some white electric tape fence') 2023-11-03 03:32:48.580 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:32:48.580 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Cabinet:[002, 200, 998, 921].') 2023-11-03 03:32:52.174 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:32:52.175 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The Great Gatsby [030, 410, 269, 961] [360, 181, 501, 676]: a production that suits all tastes') 2023-11-03 03:32:56.511 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:32:56.511 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Leon High School student Maddelena Kaji [402, 150, 677, 995], 17, speaks on a microphone [596, 234, 681, 352] outside her school during a protest against the reopening of brick and mortar schools.') 2023-11-03 03:33:21.401 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:33:21.402 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'piano [321, 176, 976, 922] in an abandoned adria palace in budapest') 2023-11-03 03:33:37.381 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:33:37.382 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the motorized-bicycle does not shift?The TrackiFrame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'There is one motorized-bicycle does not shift.Track1frame:3:[829, 353, 876, 467].') 2023-11-03 03:34:08.029 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:34:08.030 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Three women [520, 447, 717, 928] [409, 408, 618, 993] and a man [001, 468, 348, 996] having a small tea party beneath blossoming cherry trees.') 2023-11-03 03:34:08.981 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:34:08.981 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me what is thisFrame1:[165, 175, 685, 827] and track its trajectory.Ensure that each trajectory within your response is articulated using the Frame t:[xmin,ymin,xmax,ymax] specification.', 'This is a destroyer escortFrame1:[165, 175, 685, 827];Frame2:[165, 186, 678, 828];Frame3:[169, 186, 674, 827].') 2023-11-03 03:34:27.984 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:34:27.985 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Couch:[621, 194, 999, 786];Chair:[001, 687, 166, 1000];Storage box:[445, 140, 801, 618];Person:[699, 183, 925, 806],[023, 587, 315, 998];Book:[674, 432, 757, 500],[251, 795, 486, 967];Brush:[415, 591, 465, 655];Plate:[652, 910, 798, 1000].') 2023-11-03 03:34:40.202 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:34:40.203 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Cabinet:[000, 116, 248, 432],[000, 433, 127, 733],[126, 429, 209, 674],[199, 428, 255, 623],[000, 670, 147, 999],[105, 640, 213, 917],[187, 578, 275, 789],[785, 774, 998, 997],[918, 656, 999, 777],[824, 604, 979, 776],[705, 533, 947, 996];Storage box:[699, 458, 799, 535],[283, 587, 345, 662],[351, 578, 375, 685];Desk:[674, 598, 706, 703];Chair:[680, 569, 705, 606].') 2023-11-03 03:34:43.104 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:34:43.104 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Flip the tortilla with egg mixture [194, 220, 665, 991] with a hand [320, 511, 575, 995] and a spatula [599, 743, 814, 996]') 2023-11-03 03:34:44.962 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:34:44.963 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[370, 000, 577, 785] and track its trajectory.If you're including trajectory details in your reply, the Frame t:[xmin,ymin,xmax,ymax] format is imperative.", 'This is a raccoonFrame1:[370, 000, 577, 785];Frame2:[361, 000, 576, 768];Frame3:[379, 002, 576, 824];Frame4:[401, 144, 590, 889];Frame5:[398, 062, 590, 859].') 2023-11-03 03:34:45.540 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:34:45.541 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Lamp:[683, 311, 848, 407],[605, 383, 711, 444],[423, 337, 482, 398],[534, 056, 608, 300],[182, 000, 325, 127];Desk:[552, 557, 750, 753];Bench:[258, 519, 360, 638];Frame:[215, 393, 312, 509];Person:[210, 497, 280, 619],[265, 502, 339, 600];Blackboard:[001, 436, 121, 508].') 2023-11-03 03:34:50.595 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:34:50.595 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[005, 003, 986, 992] Nouveau lifestyle advertising poster for the Wimbledon.') 2023-11-03 03:34:54.002 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:34:54.003 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A watercolour illustration of two children [402, 561, 505, 825] escaping the rain by entering an old, spooky abandoned house.') 2023-11-03 03:34:57.535 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:34:57.535 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Leeds Industrial Museum: [004, 006, 989, 964] A map showing all the countries around the world where Leeds-made machines were used.') 2023-11-03 03:34:58.345 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:34:58.345 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect human leg,man,human hair and footwear in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'human leg:[898, 504, 960, 799];man:[000, 161, 123, 785],[120, 187, 241, 681],[124, 024, 174, 163],[129, 198, 319, 799],[270, 000, 364, 130],[290, 148, 381, 344],[355, 150, 415, 316],[387, 190, 510, 370],[452, 163, 519, 360],[660, 184, 777, 350],[719, 174, 803, 377],[781, 158, 871, 363],[803, 079, 874, 179],[921, 065, 980, 177];human hair:[021, 166, 078, 247],[066, 245, 111, 310],[197, 180, 243, 261],[411, 065, 473, 134],[506, 053, 551, 121],[529, 226, 574, 295],[611, 252, 666, 347],[649, 180, 691, 249],[895, 165, 919, 229],[903, 171, 975, 220];footwear:[006, 725, 086, 768],[053, 670, 121, 713],[120, 653, 183, 687],[421, 672, 484, 705],[512, 670, 593, 703].') 2023-11-03 03:35:56.170 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:35:56.171 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Laptop:[404, 615, 637, 819];Mouse:[254, 786, 298, 817];Person:[277, 430, 472, 822];Stuffed Toy:[645, 653, 762, 821];Chair:[064, 689, 266, 812],[819, 705, 952, 820];Desk:[000, 815, 999, 999];Cup:[209, 739, 291, 818];Glasses:[359, 506, 445, 531].') 2023-11-03 03:36:02.954 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:36:02.954 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'street light:[420, 604, 466, 988];house:[000, 357, 999, 998],[505, 340, 999, 998].') 2023-11-03 03:36:18.744 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:36:18.745 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[456, 687, 522, 898] [401, 685, 466, 897] [773, 698, 845, 890] [614, 664, 697, 937] [217, 714, 307, 958] Locals enjoying crepes and coffee at a popular café, housed in a lovely French colonial villa near [003, 014, 932, 822] the Reunification Palace.') 2023-11-03 03:36:31.929 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:36:31.929 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[323, 181, 499, 995] and track its trajectory.If you're including trajectory details in your reply, the Frame t:[xmin,ymin,xmax,ymax] format is imperative.", 'This is a personFrame1:[323, 181, 499, 995];Frame2:[323, 294, 462, 997];Frame3:[321, 381, 495, 995];Frame4:[317, 405, 488, 995].') 2023-11-03 03:36:42.819 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:36:42.820 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given frame1: and frame2:,track personFrame1:[588, 495, 632, 654],personFrame1:[367, 580, 407, 744],personFrame1:[524, 529, 567, 633],personFrame1:[067, 586, 117, 744],personFrame1:[453, 470, 489, 604] in this video clip.It's imperative that you stick to the categoryFrame t:[xmin,ymin,xmax,ymax] format for your answer.", 'personFrame1:[588, 495, 632, 654];Frame2:[589, 490, 621, 645],personFrame1:[367, 580, 407, 744];Frame2:[385, 575, 425, 743],personFrame1:[524, 529, 567, 633];Frame2:[528, 548, 590, 627],personFrame1:[067, 586, 117, 744];Frame2:[080, 581, 136, 740],personFrame1:[453, 470, 489, 604];Frame2:[454, 465, 495, 604].') 2023-11-03 03:37:05.466 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:37:05.467 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[000, 300, 078, 391],carFrame1:[921, 562, 1000, 889],carFrame1:[544, 359, 681, 529],carFrame1:[445, 346, 491, 411],carFrame1:[620, 345, 938, 677] in this video clip.Keep your response consistent with the classFrame t:[xmin,ymin,xmax,ymax] format.', 'carFrame1[000, 300, 078, 391];Frame2:No object,carFrame1[921, 562, 1000, 889];Frame2:No object,carFrame1:[544, 359, 681, 529];Frame2:[630, 351, 888, 661],carFrame1:[445, 346, 491, 411];Frame2:[458, 346, 512, 421],carFrame1:[620, 345, 938, 677];Frame2:[859, 548, 1000, 846].') 2023-11-03 03:37:21.498 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:37:21.498 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Crystian Rivera [195, 252, 530, 913] is on trial in the death of his sister, and today an ER doctor said she indeed was the victim of abuse. KOCO's Michael Seiden has this report.") 2023-11-03 03:37:22.795 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 03:37:22.796 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Chair,Stool,Lamp and Apple in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Chair:[724, 565, 848, 752],[801, 681, 901, 781];Stool:[137, 576, 292, 959],[330, 575, 481, 955];Lamp:[638, 365, 678, 387].') 2023-11-03 03:37:41.728 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:37:41.729 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Cabinet:[002, 244, 142, 776];Desk:[002, 680, 269, 999];Speaker:[123, 225, 220, 416];Moniter:[486, 274, 538, 398];Wine Glass:[189, 765, 248, 1000];Cup:[081, 879, 142, 978];Person:[124, 440, 235, 728],[264, 419, 342, 675],[414, 457, 477, 584],[490, 440, 597, 651],[634, 393, 726, 737],[599, 457, 659, 614],[760, 444, 867, 658],[700, 359, 954, 795],[343, 003, 1000, 999];Bracelet:[416, 771, 478, 904].') 2023-11-03 03:38:20.433 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:38:20.434 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The city of Santa Rosa’s “Fulton Road Reconstruction” project repaired approximately 3,200 lineal feet of [175, 475, 618, 996] 4-lane principal arterial pavement and associated bike lanes.') 2023-11-03 03:38:31.405 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:38:31.406 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[419, 384, 468, 594],[513, 390, 573, 586].') 2023-11-03 03:38:33.585 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:38:33.586 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[496, 617, 581, 843] and track its trajectory.For clarity, represent trajectories using the Frame t:[xmin,ymin,xmax,ymax] format in your response.', 'This is a personFrame1:[496, 617, 581, 843];Frame2:[515, 656, 570, 843];Frame3:[507, 629, 565, 843];Frame4:[474, 605, 525, 836].') 2023-11-03 03:39:19.105 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:39:19.105 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Towel:[665, 467, 740, 741],[406, 474, 447, 577],[394, 478, 458, 646];Picture:[816, 085, 901, 605];Flower:[366, 116, 469, 215];Vase:[344, 549, 411, 748],[420, 532, 473, 658],[854, 650, 983, 811];Cabinet:[219, 691, 998, 998];Mirror:[365, 001, 999, 848];Faucet:[632, 732, 674, 848];Sink:[520, 836, 708, 943];Toilet:[614, 616, 685, 718].') 2023-11-03 03:39:29.472 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:39:29.473 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[533, 347, 579, 417],carFrame1:[454, 366, 535, 486],carFrame1:[512, 359, 549, 432] in this video clip.When answering, maintain the format classFrame t:[xmin,ymin,xmax,ymax] without deviation.', 'carFrame1:[533, 347, 579, 417];Frame2:[536, 345, 581, 422],carFrame1:[454, 366, 535, 486];Frame2:[439, 366, 532, 518],carFrame1:[512, 359, 549, 432];Frame2:[510, 364, 550, 460].') 2023-11-03 03:39:56.703 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:39:56.704 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect man,human face and sports equipment in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'man:[000, 801, 121, 999],[024, 099, 304, 772],[084, 390, 515, 880],[182, 511, 576, 960],[312, 008, 500, 787],[500, 478, 946, 956];human face:[108, 817, 116, 905],[159, 132, 230, 223],[333, 063, 394, 154],[358, 416, 415, 506];sports equipment:[427, 217, 495, 384],[486, 836, 571, 950].') 2023-11-03 03:40:11.070 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:40:11.070 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Baseball Bat:[543, 035, 643, 496],[746, 173, 792, 280];Person:[001, 205, 090, 540],[071, 207, 139, 549],[205, 241, 268, 525],[255, 173, 313, 517],[125, 144, 235, 893],[286, 121, 390, 870],[298, 150, 593, 995],[460, 157, 539, 412],[593, 192, 677, 498],[715, 152, 1000, 859];Helmet:[370, 144, 468, 296],[308, 122, 381, 210],[810, 147, 877, 226],[151, 140, 228, 252];Belt:[344, 770, 533, 821].') 2023-11-03 03:40:43.756 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:40:43.757 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Picture:[807, 310, 845, 370];Person:[851, 348, 1000, 998],[629, 355, 861, 690],[289, 317, 541, 694],[072, 366, 271, 797],[233, 374, 348, 711],[479, 379, 570, 581];Flag:[630, 335, 691, 445];Cup:[133, 781, 176, 848];Plate:[001, 762, 098, 848].') 2023-11-03 03:41:47.093 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:41:47.093 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A photo of two best friends with their arms wrapped around each other looking at the ocean [005, 554, 999, 761]. Text overlay [029, 023, 973, 239] says 51 Amazing Gift Ideas for Best Friends') 2023-11-03 03:42:03.399 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:42:03.400 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Pillow:[182, 543, 250, 644],[229, 527, 286, 600],[263, 529, 336, 607];Stool:[630, 771, 675, 850],[729, 848, 760, 984],[399, 746, 429, 806];Bicycle:[325, 614, 398, 845],[353, 605, 424, 848];Person:[522, 514, 594, 812].') 2023-11-03 03:42:17.596 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 03:42:17.597 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'shelf:[003, 315, 1000, 999].') 2023-11-03 03:42:31.978 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:42:31.979 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3,frame4 and frame5,can you point out the trajectory of the woman wearing a pink scarf and black sweater?Ensure the trajectories in your answer follow the TrackiFrame t:[xmin,ymin,xmax,ymax] structure.', 'There is one woman wearing a pink scarf and black sweater.Track1frame1:[284, 206, 700, 977];frame2:[257, 227, 645, 1000].') 2023-11-03 03:42:54.768 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:42:54.768 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Person:[438, 312, 593, 716],[760, 290, 879, 762],[936, 193, 1000, 768],[002, 210, 333, 1000];Hockey Stick:[316, 856, 563, 999],[374, 504, 482, 700];Gloves:[247, 747, 331, 896];Helmet:[122, 208, 224, 345],[942, 193, 994, 268];Sneakers:[783, 684, 835, 759],[541, 626, 592, 720].') 2023-11-03 03:43:01.698 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:43:01.699 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'human hair:[540, 285, 766, 473],[000, 400, 185, 506],[231, 301, 427, 443];human head:[514, 276, 766, 493],[000, 400, 175, 520],[239, 297, 423, 504],[960, 299, 999, 465];human arm:[034, 451, 248, 657],[328, 540, 492, 860],[417, 821, 657, 908],[473, 470, 999, 998];human hand:[417, 850, 592, 927],[139, 443, 250, 535],[464, 933, 621, 999];person:[000, 397, 181, 735],[000, 301, 550, 850],[472, 296, 999, 999];clothing:[000, 493, 128, 742],[130, 418, 549, 833],[561, 472, 999, 999];human face:[248, 359, 403, 499],[548, 353, 714, 507],[970, 338, 999, 463].') 2023-11-03 03:43:04.577 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:43:04.577 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect trousers,chair and window in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'chair:[135, 462, 243, 736],[168, 480, 301, 780],[215, 502, 362, 835],[276, 399, 333, 464],[405, 646, 651, 999],[425, 401, 461, 469],[440, 421, 517, 630],[449, 483, 571, 787],[467, 427, 573, 667],[581, 350, 682, 530],[600, 578, 777, 929],[604, 329, 743, 471],[727, 526, 865, 809],[796, 419, 963, 616];window:[069, 075, 306, 350].') 2023-11-03 03:43:10.181 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:43:10.181 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'tower:[052, 102, 111, 320],[065, 341, 153, 506],[235, 333, 277, 485],[274, 132, 303, 240],[305, 212, 334, 341],[357, 290, 380, 359],[416, 433, 493, 642],[426, 323, 496, 496],[464, 602, 544, 968],[482, 016, 539, 343],[568, 227, 648, 477],[609, 352, 678, 539],[619, 433, 751, 678],[619, 563, 769, 965],[650, 272, 707, 397],[704, 271, 748, 415],[751, 214, 828, 411],[771, 098, 789, 181],[841, 024, 877, 155];skyscraper:[057, 334, 174, 534],[060, 104, 125, 320],[071, 500, 135, 566],[109, 227, 241, 339],[114, 227, 184, 246],[135, 514, 241, 638],[184, 418, 261, 547],[212, 328, 284, 517],[232, 196, 277, 277],[236, 163, 292, 279],[236, 392, 428, 606],[269, 125, 320, 204],[313, 207, 356, 351],[313, 490, 400, 643],[316, 251, 400, 372],[397, 428, 514, 647],[405, 230, 511, 447],[436, 596, 671, 970],[468, 022, 550, 493],[496, 124, 655, 328],[570, 232, 642, 366],[575, 352, 678, 558],[602, 142, 656, 245],[617, 403, 815, 964],[640, 194, 694, 295],[653, 274, 707, 395],[687, 173, 745, 269],[696, 250, 764, 413],[753, 107, 797, 187],[795, 120, 866, 243],[802, 055, 844, 129],[812, 491, 893, 668],[820, 214, 954, 524],[841, 034, 879, 150],[843, 722, 942, 843],[848, 351, 975, 651],[880, 165, 959, 212],[919, 496, 972, 774].') 2023-11-03 03:43:10.862 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:43:10.863 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Stool:[676, 599, 751, 757];Couch:[152, 459, 631, 849];Pillow:[219, 568, 336, 666],[344, 528, 434, 640],[456, 500, 544, 599];Lamp:[635, 345, 704, 502];shelf:[581, 481, 809, 688],[885, 305, 940, 627],[001, 216, 075, 745];Potted Plant:[513, 126, 598, 264];Picture:[441, 322, 527, 436],[048, 210, 133, 295];Moniter:[894, 828, 999, 999];Refrigerator:[902, 325, 999, 691].') 2023-11-03 03:43:26.652 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:43:26.653 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Boat:[061, 349, 900, 797].') 2023-11-03 03:44:14.117 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:44:14.119 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Desk:[000, 696, 956, 999];Person:[000, 148, 351, 1000],[350, 179, 699, 999],[409, 152, 1000, 1000];Microphone:[406, 355, 509, 524],[517, 747, 636, 820];Laptop:[587, 546, 1000, 865];Speaker:[138, 001, 283, 201];Glasses:[092, 250, 210, 311];Cup:[417, 682, 463, 791],[371, 646, 418, 746],[978, 707, 1000, 815];Bottle:[160, 399, 231, 787],[148, 495, 182, 738].') 2023-11-03 03:44:19.471 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:44:19.471 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[259, 389, 655, 943],[267, 530, 500, 965];Sneakers:[509, 824, 621, 942],[346, 884, 424, 945],[439, 905, 494, 963];Hat:[414, 391, 531, 450].') 2023-11-03 03:44:26.808 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:44:26.808 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect woman and human face in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'woman:[620, 404, 846, 921];human face:[397, 347, 517, 550],[702, 446, 764, 550].') 2023-11-03 03:44:50.555 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:44:50.555 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[106, 102, 472, 909] mother putting [293, 288, 408, 452] christmas gfts out for [554, 327, 782, 545] her children, victorian - christmas past and christmas present') 2023-11-03 03:44:57.451 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:44:57.451 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'man:[226, 175, 649, 538];poster:[132, 162, 788, 676];human face:[370, 202, 491, 303].') 2023-11-03 03:45:08.739 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:45:08.740 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'clothing:[006, 000, 999, 999].') 2023-11-03 03:45:12.342 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:45:12.342 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Soccer:[441, 727, 495, 811];Person:[001, 357, 059, 651],[333, 196, 594, 818],[503, 249, 750, 814],[621, 210, 756, 739];Flag:[010, 541, 037, 627];Sneakers:[622, 691, 657, 735],[671, 723, 705, 806],[510, 778, 551, 811],[404, 764, 435, 813].') 2023-11-03 03:45:32.352 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:45:32.352 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Mirror:[112, 001, 999, 446];Faucet:[112, 049, 198, 196],[523, 211, 688, 477],[867, 143, 963, 352],[327, 032, 392, 150];Sink:[000, 165, 220, 283],[136, 369, 844, 845],[243, 088, 615, 194],[722, 243, 999, 397].') 2023-11-03 03:46:20.556 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:46:20.557 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'plant:[000, 465, 175, 704],[000, 881, 381, 998],[000, 225, 428, 332],[198, 404, 420, 464],[404, 840, 637, 998],[446, 234, 771, 317],[479, 340, 642, 417],[637, 731, 998, 998],[675, 340, 800, 450],[800, 245, 998, 323],[828, 328, 998, 492].') 2023-11-03 03:46:39.146 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:46:39.146 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'maple:[000, 011, 999, 988].') 2023-11-03 03:46:43.855 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:46:43.856 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[402, 594, 459, 795],personFrame1:[415, 585, 465, 825],personFrame1:[539, 595, 587, 834] in this video clip.Your reply should be in alignment with the classFrame t:[xmin,ymin,xmax,ymax] structure.', 'personFrame1:[402, 594, 459, 795];Frame2:[400, 587, 458, 799],personFrame1:[415, 585, 465, 825];Frame2:[403, 584, 466, 824],personFrame1:[539, 595, 587, 834];Frame2:[523, 587, 590, 831].') 2023-11-03 03:47:02.369 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:47:02.369 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'PUBG Mobile India [214, 174, 733, 380] is coming soon, at least it is a registered company now!') 2023-11-03 03:47:16.678 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:47:16.679 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is thisFrame1:[436, 252, 549, 612] and track its trajectory.For the trajectories included in the answer, please use the format Frame t:[xmin,ymin,xmax,ymax].', 'This is a white bear walking on the river bankFrame1:[436, 252, 549, 612];Frame2:[434, 258, 546, 616];Frame3:[443, 244, 544, 643].') 2023-11-03 03:47:20.352 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:47:20.352 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Cabinet:[145, 217, 787, 569];Wine Glass:[033, 847, 094, 993],[180, 781, 264, 994],[302, 839, 373, 996],[103, 600, 131, 665];Plate:[001, 881, 159, 976],[252, 894, 410, 992];Knife:[404, 949, 459, 996].') 2023-11-03 03:47:29.779 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:47:29.780 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'tree:[000, 267, 138, 561],[132, 000, 428, 179],[660, 047, 738, 190],[848, 000, 999, 384],[894, 151, 999, 359];flower:[000, 573, 483, 999];building:[032, 051, 669, 348],[635, 000, 998, 392].') 2023-11-03 03:48:05.679 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:48:05.679 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Desk:[176, 701, 683, 998],[001, 695, 064, 761];Chair:[276, 652, 443, 999],[506, 650, 661, 998],[879, 775, 999, 1000],[028, 645, 112, 952];Person:[001, 498, 089, 999],[112, 260, 270, 915],[255, 291, 384, 901],[612, 263, 762, 964],[724, 262, 908, 997];Book:[350, 700, 426, 767];Sneakers:[001, 948, 043, 999];Handbag:[706, 803, 851, 989].') 2023-11-03 03:48:09.422 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:48:09.423 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'clothing:[292, 441, 991, 971];human arm:[042, 405, 256, 984],[877, 658, 998, 998];human hand:[038, 448, 257, 938];human body:[068, 149, 998, 998];human hair:[488, 135, 735, 296];human head:[492, 141, 756, 533];man:[072, 148, 998, 998];human nose:[590, 327, 642, 402].') 2023-11-03 03:48:34.689 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:48:34.689 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Trash bin Can:[559, 762, 586, 835];Lamp:[766, 332, 784, 402],[567, 199, 584, 276],[320, 268, 341, 333];Potted Plant:[398, 714, 430, 755];Speaker:[539, 556, 556, 578].') 2023-11-03 03:48:35.949 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:48:35.949 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect mammal,human hair and human head in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'mammal:[887, 342, 981, 833],[000, 247, 999, 998],[897, 683, 999, 934],[940, 787, 999, 998];human hair:[317, 802, 417, 986],[000, 790, 036, 871],[000, 289, 999, 711],[066, 714, 115, 789],[118, 836, 182, 945],[162, 878, 240, 991],[405, 718, 450, 795],[428, 780, 514, 925],[456, 692, 500, 758],[492, 838, 520, 904],[520, 727, 572, 853],[664, 688, 709, 771],[763, 729, 819, 811],[767, 752, 857, 945],[805, 822, 888, 966],[831, 754, 873, 824];human head:[428, 787, 482, 945],[000, 297, 999, 998],[336, 806, 410, 910],[356, 724, 403, 806],[447, 705, 497, 777],[516, 727, 564, 830],[604, 648, 660, 751],[683, 407, 717, 475],[726, 380, 782, 453],[776, 759, 848, 872],[804, 831, 875, 960],[828, 748, 875, 816],[969, 341, 999, 409],[974, 689, 999, 770].') 2023-11-03 03:48:47.810 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:48:47.810 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[028, 309, 255, 997] An armed soldier stands beside [260, 406, 746, 764] an electric car in Lualaba, Democratic Republic of Congo, during a ...') 2023-11-03 03:49:12.906 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:49:12.906 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Glasses,Plate and Bowl/Basin in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Glasses:[106, 311, 235, 373];Plate:[427, 796, 636, 916],[562, 769, 706, 851];Basin:[464, 735, 547, 809].') 2023-11-03 03:49:50.351 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:49:50.352 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Boots:[696, 758, 749, 875],[746, 811, 771, 883];Sneakers:[725, 928, 794, 999],[784, 956, 834, 1000];Leather Shoes:[353, 833, 405, 879];Person:[058, 362, 363, 893],[001, 320, 094, 834],[094, 315, 184, 719],[094, 292, 145, 513],[232, 290, 326, 876],[325, 254, 466, 887],[440, 300, 527, 823],[524, 290, 586, 849],[551, 146, 676, 944],[695, 288, 835, 1000],[910, 218, 1000, 1000],[842, 225, 981, 1000],[696, 299, 772, 883],[820, 205, 913, 1000],[809, 214, 847, 348],[781, 124, 856, 287],[659, 284, 696, 851],[310, 273, 361, 410],[165, 306, 207, 405],[001, 370, 026, 423],[429, 269, 486, 391];Necklace:[591, 377, 631, 484];Bottle:[913, 495, 955, 617],[842, 304, 881, 413].') 2023-11-03 03:49:50.766 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 03:49:50.766 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Broccoli,Sneakers,Pickup Truck and SUV in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Sneakers:[363, 619, 410, 649],[410, 655, 463, 696],[441, 656, 492, 698],[726, 730, 757, 773],[776, 728, 813, 774];Pickup Truck:[350, 242, 922, 630];SUV:[000, 290, 163, 646],[120, 336, 219, 532].') 2023-11-03 03:50:21.596 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:50:21.597 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'man:[032, 286, 148, 897],[121, 292, 173, 885],[163, 256, 245, 881],[241, 290, 306, 881],[270, 295, 378, 862],[341, 256, 405, 849],[379, 256, 475, 855],[470, 265, 570, 870],[563, 288, 601, 874],[575, 240, 675, 885],[647, 246, 690, 376],[663, 288, 745, 906],[730, 244, 769, 893],[739, 235, 848, 914],[823, 235, 969, 950];clothing:[033, 379, 160, 876],[114, 365, 190, 868],[158, 331, 274, 876],[238, 359, 314, 860],[279, 379, 380, 849],[351, 309, 407, 808],[378, 327, 479, 823],[465, 354, 576, 835],[556, 345, 606, 825],[574, 309, 680, 857],[635, 345, 681, 862],[648, 356, 743, 885],[729, 318, 780, 878],[734, 324, 853, 911],[828, 339, 965, 921].') 2023-11-03 03:51:03.065 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:51:03.065 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A black and white map [165, 054, 822, 935] of the Greater London area.') 2023-11-03 03:51:10.182 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:51:10.183 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'skyscraper:[000, 226, 742, 999],[713, 858, 811, 999],[769, 000, 999, 999];window:[000, 583, 027, 647],[000, 810, 030, 894],[000, 640, 037, 713],[004, 458, 051, 525],[016, 802, 061, 894],[020, 927, 062, 999],[027, 697, 068, 775],[041, 445, 082, 508],[055, 483, 097, 547],[055, 787, 104, 880],[055, 915, 104, 999],[058, 687, 099, 765],[063, 599, 101, 664],[094, 590, 130, 657],[094, 775, 138, 871],[096, 672, 139, 748],[131, 574, 166, 637],[131, 661, 168, 733],[133, 877, 216, 986],[160, 561, 199, 626],[163, 739, 240, 843],[188, 632, 266, 726],[220, 544, 285, 613],[229, 852, 312, 971],[240, 460, 301, 536],[252, 717, 336, 827],[279, 605, 353, 695],[296, 507, 365, 588],[311, 427, 373, 511],[340, 819, 431, 946],[356, 686, 439, 799],[368, 577, 443, 670],[383, 477, 454, 565],[394, 398, 457, 477],[454, 961, 548, 999],[457, 793, 554, 915],[462, 659, 548, 768],[465, 548, 544, 647],[471, 443, 543, 532],[476, 370, 539, 441],[753, 421, 762, 490],[765, 434, 781, 510],[778, 548, 788, 610],[793, 619, 806, 685],[795, 162, 860, 210],[806, 206, 894, 284],[832, 285, 933, 376],[868, 372, 977, 489],[890, 126, 961, 169],[905, 502, 999, 620],[909, 168, 999, 240],[945, 243, 999, 329].') 2023-11-03 03:51:13.653 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 03:51:13.654 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Bus:[067, 007, 931, 1000];Other Shoes:[002, 812, 065, 859];Hat:[062, 382, 122, 424];Person:[002, 388, 133, 862].') 2023-11-03 03:51:39.719 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:51:39.719 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The human brain starts working the moment you are born and never stops George Jessel Picture Quote [090, 294, 912, 766]') 2023-11-03 03:51:56.609 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:51:56.609 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[438, 339, 561, 681],personFrame1:[263, 347, 370, 693],personFrame1:[392, 375, 489, 655],personFrame1:[053, 324, 169, 688],personFrame1:[545, 324, 657, 657] in this video clip.Do not deviate from the established classFrame t:[xmin,ymin,xmax,ymax] format in your answer.', 'personFrame1:[438, 339, 561, 681];Frame2:[445, 359, 561, 681],personFrame1:[263, 347, 370, 693];Frame2:[264, 367, 369, 691],personFrame1:[392, 375, 489, 655];Frame2:[395, 379, 491, 654],personFrame1:[053, 324, 169, 688];Frame2:[052, 359, 167, 690],personFrame1:[545, 324, 657, 657];Frame2:[565, 327, 636, 655].') 2023-11-03 03:51:57.387 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:51:57.388 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect sports uniform,baseball bat,footwear and baseball glove in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'sports uniform:[322, 484, 546, 934],[017, 345, 075, 645],[081, 326, 164, 640],[197, 340, 280, 659],[277, 361, 367, 640],[339, 581, 396, 661],[342, 326, 449, 675],[524, 598, 604, 687],[660, 350, 750, 704],[721, 661, 878, 996],[780, 364, 835, 694],[844, 548, 992, 928],[864, 402, 974, 715];baseball bat:[471, 256, 550, 458];footwear:[325, 869, 348, 942],[763, 928, 832, 993];baseball glove:[647, 788, 698, 883].') 2023-11-03 03:53:35.463 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:53:35.463 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect woman,man,clothing and human face in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'woman:[181, 595, 370, 893],[221, 699, 410, 999],[263, 527, 328, 646],[312, 507, 394, 738],[368, 505, 522, 830];man:[481, 184, 734, 994],[000, 484, 057, 684],[000, 524, 184, 848],[098, 488, 199, 809],[196, 476, 275, 636],[203, 024, 325, 286],[470, 457, 537, 599],[651, 599, 881, 999],[715, 470, 788, 580],[745, 461, 836, 615],[800, 549, 966, 999],[871, 465, 937, 646],[895, 503, 999, 719];clothing:[486, 286, 741, 970],[000, 348, 999, 999],[221, 173, 310, 309];human face:[005, 518, 045, 591],[056, 539, 116, 645],[189, 670, 246, 760],[208, 045, 281, 205],[222, 499, 250, 562],[228, 791, 306, 931],[268, 533, 296, 606],[427, 583, 480, 681],[512, 251, 578, 416],[676, 676, 768, 861],[785, 487, 821, 568],[804, 599, 851, 720],[881, 482, 931, 570],[908, 553, 942, 636].') 2023-11-03 03:53:36.730 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:53:36.731 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Person:[578, 528, 631, 576],[583, 462, 619, 510],[591, 422, 636, 478],[608, 383, 645, 445],[612, 338, 658, 405],[630, 304, 669, 360],[639, 269, 678, 326],[655, 233, 690, 293],[662, 213, 698, 263],[711, 094, 755, 194],[716, 184, 749, 235],[713, 215, 745, 272],[698, 263, 736, 330],[683, 304, 724, 361],[663, 348, 707, 410],[663, 392, 701, 444],[650, 437, 684, 497],[630, 463, 667, 534],[320, 180, 354, 255].') 2023-11-03 03:53:43.348 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:53:43.348 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'person:[000, 744, 999, 999],[054, 375, 108, 637],[148, 466, 185, 737],[276, 473, 297, 650],[279, 482, 330, 754],[329, 423, 394, 754],[521, 499, 623, 809],[591, 377, 669, 604],[751, 368, 826, 589],[820, 447, 887, 692],[865, 356, 931, 485];drum:[542, 508, 621, 595],[718, 455, 760, 572],[762, 456, 802, 565],[808, 465, 848, 567],[876, 470, 895, 580],[898, 470, 945, 603].') 2023-11-03 03:54:48.288 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:54:48.289 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'person:[000, 230, 061, 385],[000, 827, 102, 997],[000, 000, 174, 063],[000, 431, 224, 804],[044, 106, 105, 197],[062, 253, 122, 388],[062, 401, 215, 611],[088, 657, 131, 835],[114, 756, 335, 997],[137, 784, 171, 847],[140, 111, 154, 190],[142, 238, 290, 383],[172, 114, 324, 159],[187, 000, 272, 060],[201, 880, 332, 997],[215, 408, 398, 753],[262, 000, 340, 071],[264, 233, 414, 352],[292, 385, 631, 736],[328, 131, 492, 177],[334, 000, 422, 083],[334, 860, 440, 997],[414, 241, 530, 378],[415, 000, 522, 086],[421, 657, 655, 997],[477, 786, 522, 857],[501, 365, 644, 456],[507, 147, 617, 187],[511, 725, 541, 865],[512, 000, 621, 111],[530, 233, 610, 373],[531, 857, 578, 974],[580, 885, 625, 992],[615, 258, 712, 352],[640, 494, 758, 616],[640, 352, 832, 578],[672, 784, 711, 934],[697, 241, 901, 332],[698, 177, 838, 220],[722, 885, 754, 997],[728, 781, 767, 875],[750, 327, 882, 522],[757, 878, 790, 951],[765, 000, 908, 152],[774, 812, 810, 878],[801, 852, 837, 997],[824, 756, 854, 880],[830, 337, 970, 492],[855, 840, 890, 961],[855, 002, 952, 162],[857, 761, 894, 835],[888, 697, 911, 761],[890, 246, 954, 337],[902, 604, 924, 680],[905, 682, 930, 753],[905, 751, 940, 852],[911, 332, 998, 380],[922, 032, 998, 167],[922, 472, 998, 591],[938, 565, 960, 647],[938, 824, 971, 921],[938, 931, 975, 997],[940, 659, 961, 758],[942, 243, 998, 324],[964, 375, 998, 469],[970, 746, 995, 845],[977, 588, 998, 657].') 2023-11-03 03:55:25.657 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:55:25.657 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the panda sitting in red chair?As you detail the trajectories in your reply, the TrackiFrame t:[xmin,ymin,xmax,ymax] format is the preferred structure.', 'There is one panda sitting in red chair.Track1frame:1:[244, 413, 419, 617];frame:2:[281, 409, 452, 612];frame:3:[259, 419, 727, 601].') 2023-11-03 03:55:31.477 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:55:31.478 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[148, 501, 500, 647] and track its trajectory.All trajectories in your reply should conform to the Frame t:[xmin,ymin,xmax,ymax] pattern.', 'This is a cheetahFrame1:[148, 501, 500, 647];Frame2:[158, 480, 533, 670];Frame3:[272, 469, 671, 681];Frame4:[296, 465, 649, 693];Frame5:[261, 498, 602, 691].') 2023-11-03 03:55:55.406 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:55:55.406 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Chair:[002, 473, 293, 1000],[908, 453, 1000, 876];Bench:[246, 490, 885, 900],[859, 508, 926, 556];Desk:[817, 550, 973, 825];Potted Plant:[918, 468, 972, 599];Lamp:[928, 221, 960, 308];Leather Shoes:[411, 927, 463, 999],[367, 949, 406, 999];Other Shoes:[554, 914, 599, 988],[535, 879, 574, 964];Sneakers:[773, 863, 858, 921],[735, 875, 776, 947];Person:[655, 342, 874, 946],[474, 394, 671, 990],[275, 380, 503, 1000],[071, 325, 357, 1000],[888, 411, 920, 513],[795, 406, 850, 548];TV:[002, 066, 223, 376],[772, 202, 999, 412],[845, 418, 897, 481];Microphone:[251, 491, 283, 582].') 2023-11-03 03:55:56.260 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:55:56.261 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[650, 333, 752, 912] and track its trajectory.Incorporate the Frame t:[xmin,ymin,xmax,ymax] format when presenting trajectories in your response.', 'This is a personFrame1:[650, 333, 752, 912];Frame2:[717, 391, 831, 995];Frame3:[641, 625, 874, 997];Frame4:[000, 343, 189, 998];Frame5:[048, 077, 175, 479].') 2023-11-03 03:56:03.167 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:56:03.167 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'suit:[501, 220, 623, 775],[060, 277, 258, 786],[336, 310, 443, 711],[581, 322, 695, 786],[661, 329, 813, 872],[672, 333, 723, 414],[795, 306, 968, 859];man:[045, 184, 255, 822],[336, 252, 506, 747],[480, 221, 604, 793],[561, 245, 696, 850],[562, 255, 604, 332],[655, 242, 810, 897],[670, 261, 726, 388],[741, 219, 971, 905];woman:[241, 260, 368, 784],[391, 284, 511, 772],[741, 316, 915, 956];human face:[145, 199, 210, 298],[294, 266, 341, 336],[388, 256, 425, 334],[449, 298, 480, 369],[526, 237, 560, 301],[609, 255, 656, 334],[673, 274, 715, 338],[720, 246, 761, 344],[827, 334, 885, 416],[836, 232, 904, 316].') 2023-11-03 03:56:11.648 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:56:11.648 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[084, 051, 584, 905] The bunny plushie talks, the bunny plushie tail walks, if you look long at the bunny plushie tail, [085, 052, 588, 446] the bunny plushie tail stalks.') 2023-11-03 03:56:26.312 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:56:26.313 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect man,human face and human head in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'man:[108, 031, 586, 999],[539, 132, 663, 989],[622, 106, 944, 999],[631, 181, 716, 729],[886, 212, 999, 680];human face:[435, 034, 571, 265],[066, 114, 165, 310],[235, 209, 353, 453],[591, 198, 655, 318],[666, 207, 703, 279],[830, 151, 911, 297];human head:[424, 035, 556, 257],[051, 214, 076, 291],[057, 091, 161, 303],[212, 112, 371, 429],[384, 158, 440, 235],[576, 139, 655, 308],[655, 189, 708, 291],[814, 114, 927, 320],[878, 227, 937, 322].') 2023-11-03 03:56:31.817 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:56:31.817 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given an image,please detect person in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'person:[001, 829, 093, 1000],[052, 950, 122, 1000],[329, 864, 419, 1000],[327, 818, 388, 1000],[345, 783, 406, 1000],[410, 808, 469, 1000],[453, 860, 530, 1000],[485, 945, 558, 1000],[535, 806, 604, 1000],[973, 716, 1000, 943],[830, 808, 910, 1000],[753, 787, 818, 1000],[686, 806, 748, 1000],[676, 720, 765, 977],[840, 708, 908, 964],[800, 700, 861, 931],[869, 658, 927, 854],[919, 687, 975, 900],[936, 643, 980, 816],[924, 645, 956, 800],[758, 654, 811, 860],[786, 641, 825, 806],[843, 629, 877, 772],[875, 635, 913, 787],[802, 627, 839, 764],[799, 614, 832, 750],[735, 622, 776, 808],[700, 660, 742, 837],[683, 662, 720, 837],[717, 627, 750, 775],[582, 731, 651, 1000],[563, 735, 615, 945],[477, 714, 537, 945],[510, 697, 552, 893],[531, 706, 589, 920],[573, 666, 616, 881],[618, 675, 667, 891],[462, 662, 518, 862],[458, 666, 498, 868],[421, 641, 477, 831],[374, 745, 423, 987],[227, 779, 294, 1000],[191, 739, 266, 1000],[220, 683, 265, 887],[187, 637, 233, 847],[279, 660, 328, 856],[262, 662, 318, 847],[288, 620, 327, 764],[313, 625, 356, 762],[328, 652, 396, 902],[409, 631, 449, 802],[383, 627, 430, 791],[502, 631, 539, 802],[535, 647, 569, 827],[575, 631, 613, 810],[610, 647, 643, 816],[649, 643, 691, 822],[673, 620, 705, 781],[029, 604, 078, 789],[088, 612, 130, 785],[058, 675, 123, 941],[110, 675, 179, 918],[117, 725, 212, 1000],[915, 635, 953, 795],[913, 927, 974, 1000],[911, 552, 970, 614],[219, 527, 440, 614],[084, 533, 219, 629],[001, 514, 075, 643].') 2023-11-03 03:57:38.866 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:57:38.867 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[636, 788, 662, 902],[599, 791, 630, 900],[558, 797, 587, 904],[569, 797, 592, 906],[118, 775, 146, 851],[055, 770, 082, 882],[948, 790, 997, 928];Car:[261, 800, 393, 891],[182, 791, 277, 869];SUV:[083, 767, 156, 832];Traffic Sign:[742, 776, 771, 861].') 2023-11-03 03:58:42.946 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:58:42.947 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[382, 466, 477, 814] and track its trajectory.For the trajectories included in the answer, please use the format Frame t:[xmin,ymin,xmax,ymax].', 'This is a personFrame1:[382, 466, 477, 814];Frame2:[333, 249, 534, 485];Frame3:[350, 354, 500, 548];Frame4:[387, 039, 482, 486];Frame5:[387, 466, 480, 916].') 2023-11-03 03:59:21.051 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:59:21.051 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Person:[018, 475, 183, 839],[173, 475, 257, 756],[000, 549, 025, 794],[360, 501, 390, 571],[904, 485, 955, 560],[829, 494, 858, 548];Umbrella:[000, 427, 097, 515],[778, 491, 820, 516];High Heels:[018, 807, 074, 833],[138, 790, 182, 832];Boots:[197, 664, 231, 754];Handbag:[127, 696, 153, 738],[012, 516, 059, 629];Car:[954, 512, 1000, 661],[833, 525, 980, 638],[702, 514, 851, 608],[657, 510, 742, 587],[376, 483, 433, 542];SUV:[501, 496, 550, 541];Street Lights:[569, 325, 606, 403],[307, 323, 340, 382].') 2023-11-03 03:59:32.763 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:59:32.763 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Car:[889, 739, 999, 805],[807, 735, 918, 800],[675, 773, 782, 851],[742, 855, 895, 961],[828, 692, 922, 743],[784, 656, 867, 698],[677, 625, 746, 662],[641, 652, 712, 699],[630, 716, 713, 780],[545, 616, 611, 662],[670, 573, 737, 607],[518, 589, 570, 620],[861, 408, 936, 462],[633, 549, 686, 574],[614, 527, 662, 550],[498, 563, 548, 590],[477, 537, 523, 564],[577, 497, 613, 518],[535, 503, 580, 525];SUV:[724, 613, 810, 655];Van:[572, 655, 661, 712];Street Lights:[325, 622, 377, 694],[294, 478, 323, 554],[742, 468, 797, 613],[552, 408, 584, 500];Desk:[157, 522, 212, 573].') 2023-11-03 03:59:34.211 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:59:34.212 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'couch:[061, 208, 904, 820];human hair:[045, 234, 180, 411],[733, 325, 820, 440];clothing:[071, 305, 527, 631],[438, 344, 806, 587],[611, 737, 999, 999];human body:[046, 265, 616, 546];human leg:[286, 425, 602, 501],[300, 388, 611, 448],[445, 424, 597, 500],[615, 856, 961, 999],[894, 204, 969, 651],[956, 241, 999, 634];woman:[023, 271, 607, 621];human head:[055, 268, 167, 375],[748, 327, 818, 421];mammal:[053, 265, 619, 624],[170, 000, 238, 072],[228, 051, 291, 149],[238, 000, 322, 065],[278, 032, 350, 151],[286, 179, 346, 287],[350, 155, 478, 426],[444, 265, 542, 368],[447, 325, 841, 584],[570, 000, 639, 151],[606, 000, 666, 151],[613, 740, 999, 999],[907, 000, 999, 640];girl:[020, 270, 611, 620],[437, 323, 814, 575];human face:[083, 275, 166, 371],[750, 339, 806, 415];human arm:[109, 399, 321, 575],[170, 274, 361, 348],[631, 350, 724, 499],[706, 389, 786, 525],[853, 912, 990, 999],[905, 000, 999, 244],[938, 747, 999, 969];human hand:[866, 915, 958, 999],[930, 955, 995, 995],[957, 868, 999, 970].') 2023-11-03 03:59:44.716 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 03:59:44.716 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'woman:[236, 129, 380, 584];table:[263, 544, 999, 998];man:[007, 293, 332, 888],[411, 149, 679, 590],[693, 297, 980, 786];clothing:[000, 438, 316, 920],[252, 224, 364, 582],[435, 224, 655, 598],[669, 417, 977, 806];human face:[219, 334, 247, 465],[283, 157, 327, 234],[510, 204, 568, 294],[764, 352, 806, 466].') 2023-11-03 04:00:09.238 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:00:09.238 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Tissue:[603, 920, 709, 999],[771, 397, 829, 455];Tie:[373, 344, 432, 546],[145, 415, 195, 600];Leather Shoes:[723, 820, 786, 904],[761, 877, 835, 962];Picture:[370, 003, 578, 169],[591, 287, 743, 481];Pillow:[480, 225, 566, 354],[416, 244, 511, 371],[002, 498, 058, 697],[450, 450, 569, 612],[787, 488, 833, 574];Bench:[455, 337, 602, 467];Side Table:[578, 446, 763, 815];Couch:[704, 387, 1000, 999],[002, 372, 632, 986];Coffee Table:[089, 808, 866, 999];Person:[698, 209, 999, 941],[277, 187, 567, 847],[035, 231, 331, 951],[002, 686, 069, 1000];Bottle:[516, 874, 597, 979];Telephone:[610, 429, 713, 511].') 2023-11-03 04:00:11.711 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 04:00:11.712 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Motorcycle:[507, 291, 999, 823];Traffic Light:[010, 091, 043, 196];Stop Sign:[368, 230, 388, 251].') 2023-11-03 04:00:35.279 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:00:35.280 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the the bird flying among the trio, positioned in the middle.?When detailing trajectories in your response, adhere to the TrackiFrame t:[xmin,ymin,xmax,ymax] format.', 'There is one the bird flying among the trio, positioned in the middle..Track1frame:1:[441, 206, 660, 315];frame:2:[444, 200, 667, 319];frame:3:[441, 219, 662, 336].') 2023-11-03 04:00:38.985 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 04:00:38.986 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'bicycle:[221, 163, 870, 836];bicycle wheel:[228, 525, 466, 816],[657, 373, 863, 668];wheel:[232, 511, 470, 806],[652, 378, 857, 673];tire:[202, 481, 473, 815],[651, 361, 857, 663].') 2023-11-03 04:01:21.395 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:01:21.395 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[337, 634, 380, 680] [347, 837, 448, 930] [272, 689, 320, 740] [285, 767, 356, 840] The cars thread their way through [242, 559, 995, 997] the mountainous circuit at the 1972 French Grand Prix.') 2023-11-03 04:01:24.743 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:01:24.744 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'woman:[082, 340, 263, 1000],[895, 115, 999, 793];man:[183, 077, 419, 497],[000, 181, 125, 496];boy:[211, 434, 410, 999],[394, 421, 580, 999],[517, 352, 719, 999],[652, 357, 825, 999],[745, 138, 996, 893];clothing:[000, 251, 123, 495],[048, 493, 250, 999],[190, 162, 413, 467],[209, 553, 400, 999],[386, 515, 583, 999],[511, 456, 723, 999],[667, 484, 829, 739],[742, 262, 999, 850],[898, 227, 999, 740];human face:[000, 191, 040, 291],[123, 403, 188, 510],[290, 059, 338, 159],[316, 532, 385, 635],[435, 537, 505, 603],[540, 451, 602, 524],[701, 384, 761, 469],[833, 225, 901, 296].') 2023-11-03 04:01:33.565 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:01:33.566 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Chair:[761, 698, 848, 826],[648, 697, 734, 812],[189, 704, 344, 871],[064, 724, 232, 922],[000, 862, 033, 963];Desk:[716, 748, 760, 811],[182, 795, 248, 881],[020, 832, 098, 937];Pillow:[684, 700, 719, 744],[787, 700, 833, 750],[222, 729, 272, 785],[088, 766, 157, 821];Candle:[067, 439, 203, 794].') 2023-11-03 04:01:47.625 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 04:01:47.625 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[003, 408, 208, 710],[313, 298, 774, 693],[593, 365, 839, 706],[800, 443, 856, 505];Sneakers:[689, 642, 770, 691],[516, 639, 588, 675];Bench:[809, 596, 1000, 683].') 2023-11-03 04:02:05.360 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 04:02:05.360 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'bookcase:[095, 323, 502, 964],[420, 309, 686, 896],[694, 349, 999, 999].') 2023-11-03 04:02:11.206 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:02:11.207 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'woman:[745, 389, 990, 946],[000, 244, 057, 749],[000, 918, 136, 999],[177, 906, 311, 999],[221, 027, 298, 258],[288, 530, 439, 989],[380, 348, 555, 664],[510, 780, 704, 999],[531, 249, 755, 685],[534, 434, 825, 934],[701, 744, 931, 999],[869, 152, 999, 520];man:[170, 065, 467, 684],[831, 000, 999, 274];clothing:[194, 130, 445, 669],[088, 792, 246, 974],[208, 739, 323, 944],[295, 682, 420, 913],[403, 622, 579, 918],[527, 586, 817, 878],[823, 489, 998, 859],[897, 317, 999, 508];girl:[008, 910, 141, 999],[261, 537, 425, 971],[288, 472, 586, 983],[318, 868, 434, 999],[438, 431, 813, 974],[503, 786, 690, 999],[659, 395, 999, 999],[706, 747, 865, 999];human face:[023, 275, 059, 350],[112, 294, 921, 985],[333, 050, 401, 176],[940, 171, 996, 301].') 2023-11-03 04:02:19.320 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:02:19.321 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the chickens jogging around feasting.When you incorporate trajectories into your answer, adhere to the TrackiFrame t:[xmin,ymin,xmax,ymax] format.,Present the trajectories in your response using the designated TrackiFrame t:[xmin,ymin,xmax,ymax] layout.', 'There are 7 chickens jogging around feasting.Track1frame:1:[506, 301, 621, 648];frame:2:[572, 420, 686, 822];frame:3:[451, 573, 714, 980],Track2frame:1:[385, 316, 496, 672];frame:2:[385, 315, 503, 679];frame:3:[392, 308, 500, 666],Track3frame:1:[333, 209, 408, 538];frame:2:[336, 201, 430, 534];frame:3:[327, 218, 425, 519],Track4frame:1:[093, 077, 282, 294];frame:2:[112, 059, 304, 291];frame:3:[144, 068, 239, 427],Track5frame:1:[278, 337, 414, 763];frame:2:[287, 331, 387, 759];frame:3:[293, 384, 396, 829],Track6frame:1:[000, 469, 183, 809];frame:2:[000, 512, 124, 838];frame:3:[000, 468, 153, 838],Track7frame:1:[140, 315, 297, 752];frame:2:[122, 343, 294, 751];frame:3:[127, 350, 314, 750].') 2023-11-03 04:02:21.815 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:02:21.815 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, can you tell me what thisframe:1:[406, 506, 423, 529];frame:2:[407, 506, 421, 526] is?Tailor your answer briefly, using the image frames and trajectory coordinates given.', 'This is a/an black car shift in front of us then shift straight') 2023-11-03 04:02:31.663 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:02:31.663 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Desk,Trash bin Can and Glasses in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Desk:[743, 136, 905, 226];Trash bin Can:[357, 195, 419, 277];Glasses:[309, 263, 360, 293],[761, 248, 847, 287].') 2023-11-03 04:02:32.599 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:02:32.600 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'person:[000, 185, 477, 777],[452, 000, 744, 292],[705, 000, 998, 423];clothing:[000, 000, 138, 252],[423, 000, 526, 183],[530, 000, 765, 259],[715, 000, 998, 297].') 2023-11-03 04:02:42.412 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:02:42.412 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[413, 541, 434, 583] and track its trajectory.Incorporate the Frame t:[xmin,ymin,xmax,ymax] format when presenting trajectories in your response.', 'This is a lugeFrame1:[413, 541, 434, 583];Frame2:[461, 451, 490, 499];Frame3:[487, 462, 518, 508];Frame4:[483, 424, 500, 473];Frame5:[442, 475, 472, 522].') 2023-11-03 04:02:52.666 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:02:52.666 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'tree:[505, 055, 981, 447],[183, 332, 246, 544],[346, 413, 409, 458];person:[243, 441, 273, 534],[286, 438, 402, 558],[379, 409, 605, 994],[389, 436, 468, 598],[580, 395, 676, 766],[620, 388, 889, 999],[708, 353, 866, 860];jacket:[376, 509, 606, 833],[388, 471, 471, 568],[569, 452, 675, 640],[630, 470, 879, 807],[751, 430, 851, 545];human face:[495, 428, 551, 518],[730, 416, 783, 499].') 2023-11-03 04:02:55.207 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:02:55.208 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[183, 434, 735, 680] and track its trajectory.If you're including trajectory details in your reply, the Frame t:[xmin,ymin,xmax,ymax] format is imperative.", 'This is a airplaneFrame1:[183, 434, 735, 680];Frame2:[112, 456, 709, 706];Frame3:[139, 463, 734, 715];Frame4:[177, 495, 750, 743].') 2023-11-03 04:03:13.361 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:03:13.361 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Allie DeOrio beams with excitement for being recognized as Citizen [103, 004, 905, 996] of the Month in 4th grade. She earned the title [222, 820, 796, 998] for being caring to classmates.') 2023-11-03 04:03:46.107 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:03:46.107 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Realistic neon sign [235, 141, 743, 861] of vinyl for decoration and covering on the wall background. concept [005, 006, 994, 989] of night club, music and dj profession. Premium Vector [446, 652, 584, 794]') 2023-11-03 04:04:09.262 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:04:09.263 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[211, 605, 356, 849],[283, 450, 356, 726].') 2023-11-03 04:04:10.187 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:04:10.187 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Cup and Desk in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Cup:[554, 280, 627, 423];Desk:[914, 340, 1000, 483].') 2023-11-03 04:04:14.183 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:04:14.183 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'This Sohay school room in Dhaka is just 3m by 3m across but it offers hope of a brighter future for all of the children [121, 553, 206, 826] [294, 534, 382, 778] [500, 544, 580, 773] [768, 816, 895, 997] [280, 639, 398, 926] [043, 628, 199, 996] [371, 793, 476, 996] [599, 551, 670, 752] [647, 769, 776, 996] [775, 615, 890, 845] [617, 606, 735, 893] [851, 730, 963, 996] [183, 562, 314, 932] [493, 689, 648, 996] within it.') 2023-11-03 04:04:16.835 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:04:16.835 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[601, 641, 842, 962] and track its trajectory.For the trajectories included in the answer, please use the format Frame t:[xmin,ymin,xmax,ymax].', 'This is a tabbyFrame1:[601, 641, 842, 962];Frame2:[576, 560, 814, 933];Frame3:[431, 612, 690, 995];Frame4:[326, 481, 623, 833].') 2023-11-03 04:04:24.155 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:04:24.156 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Lifesaver in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Lifesaver:[482, 587, 942, 736].') 2023-11-03 04:04:25.077 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:04:25.077 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'muffin:[136, 536, 645, 984],[000, 241, 138, 393],[000, 345, 338, 962],[130, 162, 428, 431],[293, 237, 700, 652],[399, 091, 701, 297],[625, 403, 999, 999],[626, 200, 933, 487],[922, 318, 999, 462];cake:[000, 343, 343, 949],[129, 562, 675, 999],[136, 178, 415, 425],[305, 243, 686, 649],[401, 099, 700, 291],[638, 198, 931, 489],[641, 424, 999, 999],[932, 337, 999, 454].') 2023-11-03 04:04:36.671 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:04:36.672 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'human head:[363, 241, 657, 736],[171, 128, 412, 749];man:[291, 101, 793, 1000],[146, 133, 428, 998];clothing:[270, 538, 865, 984],[145, 700, 350, 998];flower:[030, 047, 117, 188],[000, 380, 048, 468],[021, 216, 101, 333],[280, 048, 345, 141],[390, 000, 680, 429],[901, 007, 998, 998];human face:[246, 306, 393, 792],[386, 369, 626, 826].') 2023-11-03 04:04:37.528 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:04:37.528 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Bracelet:[736, 468, 761, 513];Person:[454, 143, 782, 927];Helmet:[534, 143, 633, 253];Belt:[559, 481, 656, 512];Sneakers:[653, 824, 719, 927],[730, 816, 783, 925].') 2023-11-03 04:04:55.074 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:04:55.074 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Stool:[806, 257, 1000, 1000],[001, 232, 113, 897],[086, 240, 388, 959],[804, 139, 982, 540],[607, 143, 780, 565],[457, 141, 629, 553],[316, 138, 486, 545],[189, 137, 330, 513],[001, 144, 060, 378],[059, 137, 216, 329];Desk:[880, 359, 999, 999],[001, 329, 333, 996].') 2023-11-03 04:05:08.341 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:05:08.341 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'top view, lots [840, 832, 976, 996] [374, 711, 486, 912] [540, 646, 661, 809] [213, 611, 330, 792] [265, 224, 383, 410] [461, 237, 560, 429] [280, 478, 393, 646] [131, 076, 245, 305] of skulls [617, 184, 725, 403] filling the screen [008, 009, 994, 997] - skull stock videos & royalty-free footage') 2023-11-03 04:05:12.814 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 04:05:12.814 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect human leg,human body,footwear and man in this image.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'human leg:[226, 619, 281, 711],[338, 753, 406, 992],[341, 755, 419, 997],[616, 567, 627, 637],[656, 561, 676, 640],[680, 563, 689, 635],[729, 760, 778, 961],[748, 768, 785, 916];human body:[678, 452, 847, 981],[204, 467, 245, 613],[213, 535, 313, 727],[278, 430, 467, 999],[299, 451, 324, 517],[449, 474, 479, 545],[465, 482, 501, 660],[518, 493, 531, 565],[596, 472, 641, 634],[648, 472, 665, 589],[653, 465, 702, 646];footwear:[730, 821, 796, 962];man:[263, 426, 469, 993],[218, 540, 310, 720].') 2023-11-03 04:05:19.719 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:05:19.720 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[106, 177, 770, 881] and track its trajectory.For clarity, represent trajectories using the Frame t:[xmin,ymin,xmax,ymax] format in your response.', 'This is a tankFrame1:[106, 177, 770, 881];Frame2:[107, 200, 754, 913];Frame3:[168, 144, 790, 897];Frame4:[219, 111, 741, 887].') 2023-11-03 04:05:23.548 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:05:23.548 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'With the Chicago Blackhawks [028, 541, 290, 993] struggling mightily this season, is it time for Joel Quenneville [488, 339, 758, 896] to go? John Dietz [370, 034, 584, 633] says no, adding that the head coach deserves one more shot to right the ship next season.') 2023-11-03 04:05:24.442 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:05:24.442 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[504, 513, 591, 573] [609, 489, 645, 575] [263, 525, 313, 579] Five waterbirds are swimming in a dark lake. [003, 300, 995, 575] Water vapour is rising from the surface of the lake.') 2023-11-03 04:05:33.480 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:05:33.480 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect mammal,man and human head in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'mammal:[716, 573, 953, 981],[000, 701, 090, 999],[000, 122, 999, 747],[006, 709, 260, 915],[019, 648, 237, 999],[176, 474, 330, 857],[458, 611, 712, 998],[471, 534, 620, 750],[483, 696, 655, 999],[890, 395, 996, 615],[900, 463, 999, 763];man:[698, 590, 966, 993],[000, 125, 999, 999];human head:[501, 702, 677, 962],[000, 298, 013, 369],[062, 647, 146, 755],[064, 280, 111, 364],[115, 331, 158, 404],[120, 462, 156, 529],[197, 470, 258, 566],[443, 322, 486, 386],[457, 328, 507, 416],[476, 424, 536, 509],[526, 538, 607, 640],[550, 366, 595, 434],[581, 399, 640, 500],[668, 469, 708, 543],[794, 553, 881, 693],[856, 334, 891, 398],[946, 857, 999, 987],[950, 393, 999, 458],[963, 458, 999, 553].') 2023-11-03 04:05:47.875 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 04:05:47.875 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Other Shoes and Book in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Other Shoes:[666, 866, 715, 951],[741, 643, 815, 732];Book:[158, 731, 299, 831].') 2023-11-03 04:05:55.975 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 04:05:55.975 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect human nose,mammal,human eye,man and human hair in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'human nose:[610, 406, 832, 718];mammal:[040, 208, 894, 1000];human eye:[507, 308, 611, 408],[740, 681, 823, 855];man:[000, 145, 940, 1000];human hair:[091, 326, 263, 443].') 2023-11-03 04:05:57.076 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:05:57.076 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'human body:[859, 527, 993, 994],[000, 405, 334, 999],[285, 525, 541, 999],[420, 316, 466, 382],[506, 305, 558, 399],[522, 529, 698, 999],[661, 180, 688, 259],[744, 527, 908, 999],[941, 312, 967, 387],[973, 329, 999, 419];woman:[848, 486, 996, 996],[940, 293, 968, 394];human head:[104, 025, 315, 411],[303, 279, 443, 582],[603, 424, 672, 572],[813, 426, 885, 564],[875, 521, 986, 687];man:[017, 044, 348, 1000],[283, 270, 550, 999],[416, 282, 467, 381],[503, 291, 560, 396],[526, 420, 703, 999],[736, 428, 911, 999];jeans:[573, 811, 679, 983];human hair:[110, 009, 300, 226],[305, 277, 425, 444],[600, 427, 674, 531],[818, 452, 868, 519],[884, 529, 984, 647];tree:[000, 000, 999, 467],[627, 244, 827, 599],[920, 000, 999, 194];human face:[148, 076, 320, 447],[327, 333, 439, 560],[630, 463, 676, 569],[823, 483, 877, 567],[878, 564, 922, 672];human nose:[259, 211, 302, 284].') 2023-11-03 04:06:25.308 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:06:25.309 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'tire:[586, 360, 963, 915],[018, 596, 190, 889],[250, 658, 338, 724];land vehicle:[021, 000, 976, 898];wheel:[017, 593, 189, 895],[460, 213, 573, 285],[596, 391, 977, 901].') 2023-11-03 04:06:35.928 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:06:35.928 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[595, 500, 775, 658] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a deerFrame1:[595, 500, 775, 658];Frame2:[651, 462, 834, 604];Frame3:[567, 458, 762, 610];Frame4:[648, 450, 848, 597].') 2023-11-03 04:06:36.761 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:06:36.762 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the seven chickens?Your trajectory details should strictly align with the TrackiFrame t:[xmin,ymin,xmax,ymax] format in your submission.', 'There are 6 seven chickens.Track1frame:1:[378, 109, 761, 691];frame:2:[378, 109, 761, 691];frame:3:[388, 163, 750, 725],Track2frame:1:[540, 001, 819, 618];frame:2:[540, 001, 819, 618];frame:3:[535, 015, 822, 652],Track3frame:1:[682, 019, 862, 561];frame:2:[682, 019, 862, 561];frame:3:[674, 059, 853, 588],Track4frame:1:[764, 023, 938, 447];frame:2:[764, 023, 938, 447];frame:3:[756, 069, 920, 476],Track5frame:1:[185, 029, 539, 516];frame:2:[185, 029, 539, 516];frame:3:[064, 069, 485, 594],Track6frame:1:[962, 001, 999, 562];frame:2:[962, 001, 999, 562];frame:3:[882, 084, 999, 629].') 2023-11-03 04:06:48.417 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 04:06:48.417 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Potted Plant:[647, 471, 800, 616];Bench:[872, 472, 999, 586],[958, 615, 1000, 679];Coffee Table:[919, 536, 999, 586];Carpet:[001, 607, 1000, 997],[663, 582, 881, 802];Picture:[875, 322, 1000, 412];Person:[001, 375, 203, 899],[189, 204, 692, 1000],[647, 366, 733, 495],[802, 362, 928, 603],[913, 365, 1000, 475];Medal:[509, 643, 563, 707];Watch:[636, 815, 689, 844];Satchel:[078, 578, 150, 640];Sneakers:[144, 853, 203, 895],[001, 784, 089, 832].') 2023-11-03 04:07:25.742 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:07:25.743 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Telltale signs that politician fulfills his promise: People from Florești enjoy walking on [323, 411, 996, 997] well-built roads') 2023-11-03 04:07:32.622 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:07:32.622 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'house:[015, 154, 999, 909];plant:[000, 882, 036, 955],[000, 891, 999, 998],[008, 830, 330, 917],[071, 869, 200, 998],[181, 840, 297, 998],[445, 683, 546, 910],[525, 703, 610, 887],[606, 818, 655, 914],[783, 822, 852, 907],[830, 660, 999, 917];window:[101, 666, 144, 807],[203, 430, 272, 546],[212, 654, 256, 810],[471, 402, 518, 548],[554, 405, 604, 551],[554, 652, 604, 772],[690, 405, 740, 548],[818, 411, 870, 558],[825, 634, 865, 798],[900, 405, 942, 558].') 2023-11-03 04:07:52.494 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:07:52.495 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[280, 354, 662, 978] and track its trajectory.Ensure that each trajectory within your response is articulated using the Frame t:[xmin,ymin,xmax,ymax] specification.', 'This is a bumper carFrame1:[280, 354, 662, 978];Frame2:[299, 386, 626, 926];Frame3:[379, 423, 654, 869];Frame4:[391, 420, 658, 862].') 2023-11-03 04:07:52.801 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:07:52.801 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Golf Club:[167, 633, 201, 727];Person:[753, 445, 813, 769],[580, 468, 637, 737],[300, 489, 381, 806],[258, 480, 320, 769],[164, 504, 253, 741],[056, 492, 120, 735],[001, 460, 059, 800],[831, 463, 882, 675],[787, 422, 828, 653],[242, 533, 263, 605];Hat:[591, 469, 617, 500],[310, 488, 344, 530],[276, 478, 305, 516],[080, 493, 104, 529],[005, 462, 037, 501];Dog:[704, 579, 747, 662].') 2023-11-03 04:08:08.685 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:08:08.685 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'fixed-wing aircraft:[000, 470, 1000, 756].') 2023-11-03 04:08:19.440 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:08:19.440 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[004, 000, 992, 992] All Apologies sheet music for voice, piano or guitar by Nirvana and Kurt Cobain, intermediate skill level') 2023-11-03 04:08:43.532 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:08:43.532 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Crisp [169, 048, 837, 965] green beans, mushrooms, and onions coated with Gruyere cheese, butter, and milk for an extra creamy casserole.') 2023-11-03 04:08:50.994 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:08:50.994 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect chair,table,building and person in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'chair:[262, 579, 416, 883],[182, 537, 267, 728],[196, 556, 279, 748],[413, 570, 496, 777],[619, 556, 678, 652],[692, 665, 999, 998];table:[326, 561, 422, 675],[352, 609, 507, 777];building:[000, 000, 999, 998];person:[696, 320, 960, 915],[247, 502, 285, 695],[264, 408, 321, 502],[266, 507, 356, 659],[427, 471, 485, 567],[625, 439, 714, 638].') 2023-11-03 04:10:13.266 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 04:10:13.267 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Desk in this image.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Desk:[458, 864, 609, 962].') 2023-11-03 04:10:24.301 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:10:24.301 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Lamp:[701, 288, 771, 487],[309, 335, 350, 471],[401, 099, 512, 150],[856, 001, 999, 176];Frame:[391, 194, 563, 475],[152, 583, 183, 635],[182, 585, 221, 630];Chair:[288, 722, 499, 1000],[090, 647, 265, 921],[254, 598, 352, 817],[886, 820, 1000, 1000];Desk:[180, 746, 321, 993];Bench:[001, 629, 187, 856];Pillow:[023, 619, 110, 728],[001, 639, 060, 743],[147, 611, 216, 702],[195, 598, 266, 658],[255, 626, 301, 698];Couch:[001, 873, 092, 999].') 2023-11-03 04:10:30.643 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:10:30.643 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[307, 697, 676, 919] Self-made ceramic flowerpot in front of a white wall background. Authentic and one of a kind handiwork, [123, 199, 801, 920] vase artwork.') 2023-11-03 04:10:36.578 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:10:36.578 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the red fish swim right?For the trajectories included in the answer, please use the format TrackiFrame t:[xmin,ymin,xmax,ymax].', 'There is no red fish swim right.') 2023-11-03 04:10:51.799 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:10:51.800 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Age of Heroes cast members standing in a row in uniform with [053, 384, 211, 688] [786, 427, 997, 632] [216, 403, 380, 538] [535, 344, 805, 624] guns. [006, 335, 993, 896] Snowy mountains can be seen behind.') 2023-11-03 04:12:11.422 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:12:11.422 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'tree:[000, 000, 153, 424];building:[000, 424, 999, 720].') 2023-11-03 04:12:31.673 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:12:31.673 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'human hair:[926, 202, 1000, 326],[056, 063, 305, 410],[401, 203, 476, 305];mammal:[371, 237, 943, 964],[003, 070, 463, 999],[037, 214, 094, 365],[398, 204, 478, 359],[484, 265, 546, 351],[796, 272, 847, 343],[890, 396, 999, 669],[918, 198, 999, 340];human head:[033, 203, 101, 366],[064, 061, 326, 476],[398, 206, 475, 320],[481, 265, 543, 342],[560, 134, 817, 560],[923, 198, 999, 334];clothing:[000, 411, 466, 997],[459, 443, 999, 834],[477, 324, 545, 351],[880, 406, 999, 658];human face:[170, 195, 323, 482],[401, 220, 437, 292],[592, 273, 742, 555].') 2023-11-03 04:13:17.339 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:13:17.339 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[005, 008, 994, 999] room with asymmetrical walls, covered in lime plaster, [000, 003, 992, 316] dark wooden ceiling with several large beams, country kitchen décor, [163, 566, 419, 630] antique sink near [159, 449, 342, 579] a small open window, [507, 715, 997, 996] dining table and [666, 846, 995, 997] chair') 2023-11-03 04:13:42.366 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:13:42.367 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[472, 412, 618, 534] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a bicycle by a boy on the groundFrame1:[472, 412, 618, 534];Frame2:[438, 412, 620, 537];Frame3:[413, 415, 613, 537];Frame4:[413, 416, 613, 538];Frame5:[403, 416, 613, 538].') 2023-11-03 04:13:53.018 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:13:53.018 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'man:[148, 175, 595, 962];clothing:[143, 345, 591, 960];human face:[328, 227, 448, 455].') 2023-11-03 04:14:13.715 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:14:13.715 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[287, 001, 999, 1000],[000, 341, 621, 1000],[000, 432, 205, 841];Necklace:[158, 784, 191, 999];Glasses:[093, 499, 191, 545].') 2023-11-03 04:15:51.668 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:15:51.668 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[722, 564, 1000, 999],[673, 604, 842, 999],[791, 567, 880, 878],[449, 650, 654, 1000],[567, 516, 713, 957],[289, 622, 456, 1000],[422, 542, 510, 898],[135, 620, 323, 1000],[002, 502, 160, 999],[229, 532, 335, 875];Hat:[790, 565, 870, 645];Backpack:[546, 832, 687, 998];Satchel:[002, 561, 110, 784].') 2023-11-03 04:15:54.497 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:15:54.497 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A sandy beach [000, 315, 999, 998], with a folding chair [110, 403, 499, 880], seven throw pillows [611, 495, 783, 624] [417, 628, 678, 802] [161, 798, 396, 972] [667, 666, 845, 797] [490, 532, 672, 691] [253, 613, 447, 769] [628, 589, 796, 749], and tent made of Pendleton by Sunbrella fabrics [360, 120, 972, 811].') 2023-11-03 04:16:07.193 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:16:07.193 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'dog:[219, 271, 900, 999].') 2023-11-03 04:16:08.862 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:16:08.862 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'protesters [583, 276, 672, 547] [181, 232, 386, 997] marching carrying banners and signs [157, 478, 878, 995] to confront soviet troops at the height of the prague spring invasion - traditionally czech stock videos & royalty-free footage') 2023-11-03 04:16:13.479 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:16:13.479 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Wheelchair:[606, 454, 1000, 723],[174, 544, 733, 954],[097, 207, 347, 339];Person:[119, 260, 745, 883],[581, 149, 987, 677];Other Shoes:[842, 617, 923, 673],[785, 616, 860, 672];Sneakers:[443, 813, 541, 883],[356, 811, 449, 881];Basketball:[369, 470, 568, 609].') 2023-11-03 04:16:14.449 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:16:14.450 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3,frame4 and frame5,can you point out the trajectory of the person in red coat?Use the specified TrackiFrame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'There is one person in red coat.Track1frame1:[765, 487, 1000, 1000];frame2:[756, 487, 1000, 1000].') 2023-11-03 04:16:38.195 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:16:38.196 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'clothing:[065, 654, 177, 852],[176, 535, 958, 845];woman:[343, 633, 418, 843],[602, 516, 668, 705],[661, 530, 733, 752],[687, 658, 786, 860],[749, 553, 836, 754],[769, 644, 841, 864],[820, 544, 894, 758],[835, 675, 937, 866];man:[075, 618, 178, 838],[174, 597, 270, 834],[221, 516, 298, 667],[263, 532, 359, 824],[321, 495, 377, 621],[346, 508, 436, 714],[414, 506, 496, 714],[475, 462, 559, 728],[543, 497, 615, 724],[588, 656, 690, 852],[725, 506, 768, 650],[868, 514, 965, 864];human face:[109, 484, 921, 766].') 2023-11-03 04:16:41.955 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:16:41.955 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect car,taxi and skyscraper in this image.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'car:[152, 592, 591, 714],[218, 694, 388, 873],[595, 648, 642, 731];taxi:[220, 647, 365, 850],[592, 641, 646, 736];skyscraper:[123, 458, 165, 643],[138, 440, 218, 608],[251, 365, 327, 654],[285, 429, 387, 661],[346, 248, 430, 636],[403, 190, 521, 654],[523, 212, 718, 647].') 2023-11-03 04:16:48.907 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:16:48.907 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[703, 281, 990, 996] [320, 379, 641, 996] couple drinking [525, 613, 644, 720] coffee in front of [410, 000, 995, 998] kitchen window during an at home engagement session in Shreveport, Louisiana') 2023-11-03 04:17:07.404 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 04:17:07.405 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'American Football:[667, 300, 714, 430];Person:[702, 003, 975, 834],[349, 025, 758, 999],[490, 034, 586, 651],[077, 195, 612, 938];Sneakers:[624, 913, 727, 1000],[348, 831, 417, 999],[462, 864, 547, 921],[902, 704, 941, 834],[077, 883, 128, 946].') 2023-11-03 04:17:19.784 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:17:19.785 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'clothing:[531, 749, 743, 991],[000, 495, 999, 999];woman:[102, 688, 161, 777],[152, 700, 269, 999],[213, 729, 418, 999],[791, 692, 951, 999];man:[005, 665, 082, 759],[206, 659, 271, 877],[273, 635, 329, 750],[283, 619, 351, 761],[371, 617, 401, 684],[387, 782, 543, 999],[391, 655, 569, 861],[491, 640, 558, 778],[520, 555, 557, 646],[551, 602, 711, 857],[676, 638, 817, 880],[762, 515, 806, 638],[910, 519, 999, 648],[939, 659, 999, 803];human face:[013, 662, 034, 747],[148, 709, 180, 801],[231, 668, 253, 731],[297, 772, 352, 875],[386, 805, 474, 967],[513, 652, 550, 737],[594, 630, 633, 698],[751, 681, 796, 760],[796, 720, 823, 822],[803, 730, 817, 818],[945, 691, 999, 820].') 2023-11-03 04:17:20.736 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:17:20.736 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Bakset in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Bakset:[093, 170, 432, 378],[742, 112, 864, 228],[357, 350, 478, 553],[586, 432, 844, 644],[077, 638, 382, 913].') 2023-11-03 04:17:26.326 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:17:26.327 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'baked goods:[043, 207, 847, 938];flower:[015, 585, 130, 661],[015, 659, 154, 823],[058, 464, 315, 738],[078, 743, 322, 980],[256, 058, 727, 373],[265, 592, 388, 698],[272, 801, 614, 999],[459, 454, 590, 558],[504, 631, 623, 707],[556, 294, 860, 658],[640, 818, 864, 978],[777, 695, 957, 857],[782, 592, 912, 701];cake:[166, 241, 750, 871].') 2023-11-03 04:18:33.985 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:18:33.986 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Microphone:[002, 552, 126, 619];Hat:[143, 308, 334, 515];Guitar:[330, 605, 490, 897];Lamp:[058, 153, 202, 477],[002, 309, 055, 468];Person:[117, 307, 406, 1000],[432, 356, 920, 1000];Watch:[157, 920, 204, 997].') 2023-11-03 04:18:36.561 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:18:36.561 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Canned:[920, 523, 1000, 702];Cup:[879, 448, 956, 584],[747, 398, 812, 525],[679, 325, 735, 432],[756, 296, 818, 403],[854, 310, 912, 419],[754, 233, 806, 302],[826, 241, 899, 306],[917, 227, 975, 325],[954, 245, 1000, 431];Bowl:[743, 127, 834, 214],[827, 124, 1000, 264];Person:[038, 002, 748, 752],[057, 002, 738, 216].') 2023-11-03 04:18:38.920 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:18:38.921 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Bowl:[314, 491, 327, 503];Barrel:[297, 434, 315, 477];Banana:[666, 520, 676, 534];Person:[423, 318, 557, 868],[502, 332, 609, 804],[454, 403, 491, 469],[947, 583, 999, 693];Umbrella:[605, 425, 703, 484],[409, 448, 491, 558];Trolley:[365, 614, 545, 795];Bicycle:[316, 471, 391, 654];Motorcycle:[158, 466, 211, 556];Car:[207, 414, 243, 488],[050, 420, 084, 456],[000, 359, 073, 537];Bench:[620, 549, 676, 605];Storage box:[670, 559, 708, 604];Hat:[502, 318, 542, 359],[256, 386, 277, 406],[378, 417, 390, 432];Slippers:[425, 841, 490, 867],[442, 806, 505, 828],[513, 781, 572, 802],[503, 747, 545, 772],[315, 624, 330, 641],[283, 616, 306, 630],[199, 606, 213, 620],[583, 591, 596, 602];Other Shoes:[197, 511, 212, 530],[158, 518, 172, 528];Handbag:[199, 444, 219, 466].') 2023-11-03 04:18:41.158 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:18:41.158 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[128, 704, 173, 953],personFrame1:[197, 445, 243, 664],personFrame1:[732, 523, 783, 749],personFrame1:[235, 146, 273, 338],personFrame1:[328, 703, 373, 934],personFrame1:[418, 329, 461, 525],personFrame1:[490, 844, 532, 1000],personFrame1:[895, 423, 935, 616],personFrame1:[454, 342, 494, 519],personFrame1:[296, 366, 329, 557] in this video clip.Your reply should be in alignment with the classFrame t:[xmin,ymin,xmax,ymax] structure.', 'personFrame1:[128, 704, 173, 953];Frame2:[132, 715, 178, 964],personFrame1:[197, 445, 243, 664];Frame2:[195, 438, 242, 659],personFrame1:[732, 523, 783, 749];Frame2:[737, 526, 787, 753],personFrame1:[235, 146, 273, 338];Frame2:[235, 141, 273, 334],personFrame1:[328, 703, 373, 934];Frame2:[328, 703, 372, 934],personFrame1:[418, 329, 461, 525];Frame2:[420, 336, 464, 537],personFrame1:[490, 844, 532, 1000];Frame2:[491, 849, 535, 1000],personFrame1:[895, 423, 935, 616];Frame2:[895, 423, 935, 616],personFrame1:[454, 342, 494, 519];Frame2:[456, 349, 495, 528],personFrame1:[296, 366, 329, 557];Frame2:[294, 374, 327, 568].') 2023-11-03 04:19:53.045 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:19:53.046 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Sneakers:[261, 816, 354, 981],[549, 539, 654, 686];Person:[218, 111, 574, 998],[155, 029, 654, 857],[461, 002, 591, 536],[059, 002, 172, 321].') 2023-11-03 04:20:02.355 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:20:02.356 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'tree:[927, 000, 999, 580].') 2023-11-03 04:20:27.555 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 04:20:27.556 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Cheerful young people [582, 387, 664, 778] [625, 366, 876, 889] [476, 230, 650, 820] have lunch in the courtyard and have a fun Stock photo © boggy') 2023-11-03 04:20:41.522 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:20:41.522 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'West Ham say the measures will support the entire infrastructure [001, 338, 992, 965] of the club and enable us to retain jobs and continue to pay 100 per cent of staff salaries.') 2023-11-03 04:20:43.339 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:20:43.340 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'houseplant:[111, 243, 533, 808],[285, 738, 396, 827],[364, 249, 940, 918];flowerpot:[000, 213, 196, 329],[122, 641, 313, 792],[285, 742, 388, 826],[384, 656, 664, 920].') 2023-11-03 04:20:58.023 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:20:58.023 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Person:[226, 171, 678, 1000];Hat:[421, 170, 630, 379].') 2023-11-03 04:21:06.821 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:21:06.822 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'man:[000, 187, 306, 395],[114, 340, 140, 443],[205, 333, 232, 443],[263, 333, 282, 415],[277, 328, 302, 442],[302, 318, 327, 415],[317, 321, 349, 436],[333, 320, 353, 423],[357, 321, 378, 428],[400, 853, 569, 976],[404, 321, 423, 412],[423, 323, 443, 410],[437, 321, 459, 412],[450, 329, 469, 417],[474, 320, 500, 443],[524, 323, 553, 447],[550, 286, 671, 370],[583, 329, 612, 475],[627, 455, 712, 755],[631, 338, 658, 442],[648, 845, 736, 999],[654, 339, 692, 445],[656, 385, 712, 528],[666, 267, 686, 357],[682, 262, 711, 367],[712, 344, 746, 483],[754, 381, 801, 543],[795, 371, 837, 439],[809, 823, 951, 999],[836, 396, 889, 621],[874, 341, 898, 437],[878, 786, 988, 999],[896, 334, 923, 404].') 2023-11-03 04:21:18.168 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:21:18.170 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[352, 312, 505, 413] and track its trajectory.When detailing trajectories in your response, adhere to the Frame t:[xmin,ymin,xmax,ymax] format.', 'This is a turtleFrame1:[352, 312, 505, 413];Frame2:[374, 294, 524, 398];Frame3:[364, 289, 535, 385];Frame4:[401, 300, 535, 440];Frame5:[410, 296, 548, 425].') 2023-11-03 04:22:04.611 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 04:22:04.612 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect limousine,sunglasses and man in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'sunglasses:[271, 332, 348, 386],[451, 395, 523, 439];man:[064, 248, 575, 998],[588, 484, 627, 556],[800, 497, 845, 625].') 2023-11-03 04:22:08.980 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:22:08.980 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'sink:[108, 510, 253, 672];cabinetry:[000, 616, 163, 813],[161, 634, 251, 722],[210, 304, 324, 500],[247, 566, 337, 707],[345, 617, 405, 694],[545, 451, 633, 591],[633, 619, 688, 692],[730, 356, 833, 687],[839, 606, 999, 780],[883, 178, 999, 482];countertop:[000, 546, 338, 818],[000, 686, 999, 999],[340, 567, 691, 628],[410, 540, 615, 555],[833, 593, 999, 655].') 2023-11-03 04:22:16.722 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:22:16.723 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'nightstand:[001, 514, 116, 979];table:[003, 574, 113, 972];pillow:[101, 376, 255, 564],[104, 374, 309, 705],[294, 318, 445, 524],[495, 262, 561, 436],[523, 254, 596, 427],[594, 235, 686, 361],[600, 262, 666, 416];house:[000, 000, 999, 996];bed:[038, 195, 971, 998],[496, 156, 999, 776].') 2023-11-03 04:22:43.550 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:22:43.550 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect woman,man and human face in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'woman:[340, 337, 600, 966];man:[000, 125, 225, 961],[538, 114, 999, 999];human face:[066, 210, 158, 360],[448, 369, 530, 536],[745, 162, 796, 407].') 2023-11-03 04:22:48.079 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:22:48.079 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[015, 745, 142, 984],[223, 780, 331, 999],[290, 647, 379, 999],[745, 640, 822, 999];man:[130, 640, 212, 960],[304, 585, 358, 715],[349, 616, 411, 878],[385, 600, 421, 759],[414, 592, 454, 733],[441, 589, 465, 720],[445, 589, 507, 823],[490, 692, 595, 999],[583, 609, 700, 999],[651, 650, 790, 999],[729, 589, 805, 747],[855, 568, 954, 881];clothing:[001, 815, 166, 999],[134, 685, 204, 963],[232, 579, 531, 751],[235, 866, 321, 999],[324, 700, 368, 955],[485, 763, 583, 999],[550, 624, 573, 756],[580, 620, 616, 710],[596, 684, 687, 992],[655, 751, 795, 999],[725, 641, 823, 772],[786, 680, 950, 999],[901, 611, 961, 805],[957, 779, 999, 887];human face:[065, 755, 096, 825],[225, 797, 255, 882],[668, 682, 693, 745],[853, 628, 911, 725].') 2023-11-03 04:22:49.711 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:22:49.712 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Vase,Whiteboard,Stool,Handbag and Potted Plant in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Vase:[424, 736, 567, 890],[199, 552, 298, 639],[220, 877, 256, 936];Whiteboard:[456, 355, 551, 566];Stool:[777, 661, 822, 800];Handbag:[317, 546, 389, 630];Potted Plant:[107, 642, 164, 782].') 2023-11-03 04:23:03.146 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:23:03.146 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'father [008, 421, 201, 893] aims a water gun at children [530, 424, 623, 745] [715, 404, 874, 789] [607, 468, 711, 730] [385, 559, 534, 751] throwing water in a paddling pool - domestic garden stock pictures, royalty-free photos [601, 630, 798, 706] & images') 2023-11-03 04:23:33.127 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:23:33.128 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Congress vice president Rahul Gandhi [326, 225, 580, 993] during a meeting with home-buyers [570, 148, 786, 492] [001, 118, 113, 522] [001, 501, 207, 995] [196, 102, 365, 998] [068, 234, 245, 995] [669, 192, 947, 995] [514, 255, 739, 997] at party headquarter in New Delhi on May 2, 2015.') 2023-11-03 04:23:40.782 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:23:40.782 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'man:[102, 252, 199, 912],[040, 274, 102, 808],[268, 256, 353, 846],[389, 256, 453, 815],[640, 274, 718, 723];jeans:[112, 567, 193, 904],[476, 597, 533, 940],[517, 494, 583, 790];bus:[460, 074, 978, 594];woman:[227, 323, 262, 657],[328, 292, 445, 988],[451, 335, 468, 625],[453, 274, 555, 939],[524, 257, 596, 790],[599, 296, 667, 758];girl:[227, 317, 268, 658],[330, 284, 441, 988],[444, 274, 552, 958],[445, 330, 476, 622],[521, 244, 598, 794],[598, 296, 665, 760];footwear:[170, 839, 224, 904],[330, 868, 357, 934],[495, 836, 510, 899].') 2023-11-03 04:23:44.911 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:23:44.912 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[463, 557, 533, 834],[977, 561, 999, 659],[107, 462, 246, 899],[046, 476, 126, 825];Sneakers:[117, 860, 176, 895],[497, 795, 531, 828],[497, 768, 529, 802];Sports Car:[135, 381, 585, 895],[571, 511, 681, 712],[645, 526, 737, 664],[815, 581, 851, 614].') 2023-11-03 04:23:56.362 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:23:56.363 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'woman:[112, 790, 261, 997],[000, 455, 028, 581],[006, 550, 063, 645],[020, 406, 066, 532],[061, 817, 163, 999],[266, 432, 299, 525],[301, 469, 326, 544],[368, 527, 404, 590],[451, 800, 717, 999],[794, 492, 837, 674],[942, 700, 999, 910];man:[692, 720, 858, 984],[020, 520, 067, 588],[067, 643, 190, 853],[100, 622, 198, 802],[199, 651, 375, 999],[210, 634, 276, 739],[262, 487, 294, 555],[315, 590, 359, 713],[350, 585, 408, 741],[388, 594, 463, 725],[451, 620, 526, 739],[461, 458, 483, 528],[703, 627, 826, 818],[788, 765, 981, 999],[878, 496, 909, 701],[903, 604, 980, 802];girl:[110, 813, 260, 999],[010, 555, 076, 642];human face:[270, 668, 306, 750].') 2023-11-03 04:24:05.259 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:24:05.260 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Lamp:[196, 150, 221, 247];Person:[037, 614, 222, 871],[288, 348, 376, 585],[575, 493, 600, 540],[816, 340, 879, 563];Sneakers:[160, 786, 222, 836],[128, 797, 164, 840];Handbag:[800, 460, 832, 507];Camera:[144, 649, 179, 682];Tripod:[169, 502, 300, 735];Moniter:[625, 406, 661, 500].') 2023-11-03 04:24:16.797 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:24:16.797 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'clothing:[255, 041, 367, 264],[533, 100, 693, 329],[783, 100, 856, 257];table:[336, 605, 518, 999],[505, 617, 705, 999],[833, 570, 999, 992].') 2023-11-03 04:26:19.832 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:26:19.833 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Picture:[421, 444, 577, 537];Person:[458, 475, 495, 520];Chair:[443, 498, 661, 755],[615, 485, 875, 738],[876, 425, 998, 685];Desk:[233, 441, 434, 667].') 2023-11-03 04:26:21.568 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:26:21.568 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Bus:[153, 306, 372, 652],[358, 383, 530, 636],[530, 388, 709, 625],[682, 434, 810, 625],[753, 418, 978, 589],[001, 456, 114, 581].') 2023-11-03 04:27:07.123 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:27:07.124 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[457, 388, 540, 476] in this video clip.Ensure you use the exact format categoryFrame t:[xmin,ymin,xmax,ymax] in your response.', 'carFrame1:[457, 388, 540, 476];Frame2:[631, 382, 725, 475].') 2023-11-03 04:27:48.684 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:27:48.685 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'shelf:[217, 132, 996, 963];Bottle:[310, 128, 459, 319],[471, 101, 614, 307],[631, 099, 763, 300],[740, 017, 966, 300].') 2023-11-03 04:28:10.663 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:28:10.664 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Chair:[001, 779, 170, 995],[149, 782, 440, 998],[448, 786, 729, 998],[711, 777, 998, 1000],[001, 707, 026, 775],[832, 719, 998, 867],[671, 719, 826, 937],[512, 717, 675, 785],[350, 710, 506, 908],[192, 709, 343, 785],[018, 709, 187, 785];Lamp:[736, 279, 760, 319],[230, 282, 254, 332].') 2023-11-03 04:28:36.781 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:28:36.781 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Three children [001, 281, 281, 732] [623, 227, 917, 618] [315, 218, 618, 667] from the Trans Mara region of Kenya sit at a desk [002, 525, 998, 995] doing their schoolwork [369, 549, 540, 788].') 2023-11-03 04:28:49.168 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:28:49.168 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[363, 423, 420, 538],truckFrame1:[199, 324, 433, 443],carFrame1:[268, 409, 384, 560],carFrame1:[625, 436, 676, 538],carFrame1:[000, 315, 281, 695] in this video clip.Do not deviate from the established classFrame t:[xmin,ymin,xmax,ymax] format in your answer.', 'carFrame1:[363, 423, 420, 538];Frame2:[274, 401, 387, 569],truckFrame1[199, 324, 433, 443];Frame2:No object,carFrame1:[268, 409, 384, 560];Frame2:[031, 367, 323, 643],carFrame1:[625, 436, 676, 538];Frame2:[673, 424, 778, 580],carFrame1[000, 315, 281, 695];Frame2:No object.') 2023-11-03 04:29:10.373 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:29:10.374 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[152, 016, 993, 994] Portrait of a suri tribe woman with [642, 309, 932, 537] [690, 237, 814, 342] enlarged earlobes and huge earrings, Omo valley, Kibish, Ethiopia') 2023-11-03 04:29:13.625 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:29:13.626 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'lighthouse:[037, 142, 250, 613];tower:[053, 131, 240, 606].') 2023-11-03 04:29:48.669 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 04:29:48.669 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Person:[793, 282, 950, 998],[729, 478, 917, 1000],[649, 312, 800, 659],[601, 440, 799, 1000],[516, 266, 678, 643],[475, 466, 633, 998],[356, 308, 518, 629],[345, 481, 547, 999],[251, 297, 371, 628],[156, 434, 402, 997],[069, 242, 251, 611],[001, 342, 197, 998].') 2023-11-03 04:29:55.676 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:29:55.677 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Layered strawberry pretzel salad - just look at those delicious layers [006, 189, 991, 876]!') 2023-11-03 04:29:57.999 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:29:58.000 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect jeans,man and girl in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'jeans:[537, 880, 646, 981],[140, 820, 181, 899],[625, 823, 689, 928];man:[000, 730, 045, 898],[062, 698, 151, 905],[132, 696, 193, 868],[314, 700, 475, 999],[459, 703, 504, 801],[480, 714, 535, 925],[482, 705, 546, 933],[666, 680, 787, 969];girl:[126, 760, 198, 905],[620, 718, 705, 940].') 2023-11-03 04:30:09.386 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:30:09.386 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Frame:[820, 012, 999, 235];Chair:[736, 418, 992, 778],[561, 310, 691, 426],[174, 586, 490, 999],[237, 396, 345, 425],[045, 332, 180, 471],[247, 289, 289, 339];Storage box:[684, 379, 747, 475];Desk:[070, 460, 958, 999],[215, 333, 429, 419];Couch:[674, 747, 1000, 1000];Pillow:[772, 759, 1000, 1000];Cabinet:[491, 094, 669, 394];Handbag:[570, 346, 667, 411];Bakset:[522, 442, 670, 591],[749, 408, 921, 517];Barrel:[194, 423, 350, 542],[105, 395, 236, 472];Flask:[616, 142, 643, 200].') 2023-11-03 04:30:21.027 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:30:21.028 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[291, 503, 334, 756],personFrame1:[434, 517, 470, 743],personFrame1:[685, 512, 720, 709],personFrame1:[478, 491, 531, 762],personFrame1:[534, 497, 582, 743],personFrame1:[548, 499, 600, 777],personFrame1:[625, 502, 695, 756] in this video clip.The prescribed format for your answer is classFrame t:[xmin,ymin,xmax,ymax]. Please follow it closely.', 'personFrame1:[291, 503, 334, 756];Frame2:[241, 505, 331, 758],personFrame1:[434, 517, 470, 743];Frame2:[434, 512, 500, 741],personFrame1:[685, 512, 720, 709];Frame2:[678, 511, 722, 717],personFrame1:[478, 491, 531, 762];Frame2:[471, 487, 520, 766],personFrame1:[534, 497, 582, 743];Frame2:[535, 516, 597, 750],personFrame1:[548, 499, 600, 777];Frame2:[514, 500, 599, 785],personFrame1:[625, 502, 695, 756];Frame2:[626, 508, 729, 760].') 2023-11-03 04:31:11.127 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:31:11.127 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'bird:[160, 606, 240, 689].') 2023-11-03 04:31:24.511 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:31:24.512 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "**Susan Sarandon [193, 036, 839, 994], 72** A talented actress and women's rights activist - Susan is a force to be reckoned with!") 2023-11-03 04:31:39.833 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:31:39.834 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Chair:[006, 294, 106, 578],[092, 296, 163, 501],[253, 314, 314, 471],[767, 353, 834, 492];Picture:[394, 127, 446, 219];shelf:[433, 221, 566, 409];Desk:[203, 551, 1000, 1000],[335, 741, 1000, 999];Moniter:[858, 280, 912, 379];Plate:[606, 828, 823, 954];Satchel:[116, 487, 177, 619],[298, 277, 392, 519];Person:[512, 198, 682, 613],[562, 113, 628, 268],[762, 248, 816, 333],[324, 165, 462, 658],[323, 176, 377, 626],[145, 174, 296, 999],[002, 286, 073, 589].') 2023-11-03 04:32:15.730 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:32:15.730 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[647, 683, 764, 997] and track its trajectory.Stick to the Frame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'This is a motor scooterFrame1:[647, 683, 764, 997];Frame2:[686, 700, 810, 998];Frame3:[692, 700, 817, 998];Frame4:[696, 702, 825, 998];Frame5:[700, 702, 828, 998].') 2023-11-03 04:32:40.564 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:32:40.565 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'footwear:[532, 858, 600, 946],[023, 747, 099, 784],[595, 801, 635, 876];jeans:[092, 369, 173, 711],[000, 471, 084, 751],[205, 290, 244, 463],[315, 310, 375, 509],[413, 294, 453, 451],[443, 296, 481, 459],[701, 252, 725, 365],[749, 259, 785, 396],[868, 228, 898, 305];woman:[080, 116, 197, 702],[136, 132, 201, 543],[195, 122, 250, 479],[309, 110, 396, 573],[483, 137, 535, 432],[637, 158, 676, 262],[719, 142, 743, 369];man:[028, 111, 098, 694],[064, 048, 123, 634],[401, 116, 465, 457],[443, 136, 491, 477],[469, 092, 666, 926],[670, 144, 703, 235],[702, 116, 735, 374],[730, 131, 791, 427],[789, 129, 828, 376],[818, 131, 876, 361],[870, 131, 902, 321],[907, 134, 931, 277],[925, 177, 941, 283];girl:[083, 118, 194, 705],[309, 113, 398, 570],[638, 161, 676, 329].') 2023-11-03 04:32:52.092 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:32:52.092 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'woman:[060, 010, 972, 1000];fashion accessory:[296, 000, 870, 325];girl:[086, 000, 999, 997];human face:[025, 263, 183, 610],[259, 139, 700, 768].') 2023-11-03 04:32:53.944 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:32:53.944 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Tahar Ben Jelloun [081, 339, 918, 566] quotes: In the '70s I was in exile; every time I went back I wondered if they'd take my passport away.") 2023-11-03 04:33:18.228 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:33:18.229 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[044, 056, 329, 730];fashion accessory:[039, 555, 104, 656],[185, 073, 316, 212],[489, 823, 694, 923],[600, 772, 795, 871];girl:[041, 057, 460, 738];footwear:[488, 826, 693, 915],[602, 768, 800, 881];human face:[250, 130, 315, 246],[340, 162, 444, 294].') 2023-11-03 04:33:23.837 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 04:33:23.837 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Car:[853, 687, 894, 718];Pickup Truck:[969, 678, 1000, 715],[923, 648, 965, 684].') 2023-11-03 04:33:36.494 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:33:36.494 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[000, 416, 424, 989],carFrame1:[349, 405, 490, 619],carFrame1:[439, 320, 559, 562] in this video clip.For clarity in your response, always adopt the classFrame t:[xmin,ymin,xmax,ymax] format.', 'carFrame1:[000, 416, 424, 989];Frame2:[000, 416, 424, 989],carFrame1:[349, 405, 490, 619];Frame2:[349, 409, 491, 619],carFrame1:[439, 320, 559, 562];Frame2:[439, 320, 559, 562].') 2023-11-03 04:34:11.042 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:34:11.043 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the the black dog under the tree initially occluded by the woman in white in this video clip?Your trajectory details should strictly align with the TrackiFrame t:[xmin,ymin,xmax,ymax] format in your submission.', 'There is one the black dog under the tree initially occluded by the woman in white.Track1frame:1:[339, 163, 403, 355];frame:2:[333, 176, 422, 361];frame:3:[336, 189, 411, 359].') 2023-11-03 04:34:33.744 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 04:34:33.744 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'house:[106, 125, 941, 882],[000, 621, 111, 805],[721, 237, 999, 636];tree:[021, 042, 962, 913];window:[291, 568, 363, 721],[617, 602, 666, 719],[730, 602, 788, 724],[850, 466, 938, 552],[879, 347, 943, 402];stairs:[543, 765, 699, 892].') 2023-11-03 04:34:56.935 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:34:56.935 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[202, 171, 272, 521],[001, 169, 055, 353],[084, 179, 153, 275],[298, 158, 574, 917],[458, 191, 531, 537],[519, 267, 589, 541],[568, 183, 818, 941],[736, 203, 794, 421],[806, 239, 862, 363],[832, 207, 968, 675],[936, 204, 1000, 372];Sneakers:[300, 766, 342, 899],[410, 796, 453, 880].') 2023-11-03 04:35:24.317 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:35:24.318 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Person:[002, 455, 205, 697],[083, 317, 253, 999],[229, 400, 395, 999],[506, 254, 659, 784],[661, 560, 1000, 1000];Guitar:[002, 650, 160, 1000];Desk:[236, 848, 729, 999];Chair:[213, 591, 923, 1000];Backpack:[126, 415, 256, 639].') 2023-11-03 04:35:50.698 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:35:50.699 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Person:[140, 254, 202, 481],[264, 361, 328, 525],[470, 388, 558, 592],[728, 492, 833, 731],[098, 440, 200, 636],[201, 258, 256, 345],[290, 255, 342, 348],[363, 247, 415, 344],[453, 257, 508, 348],[542, 261, 592, 353];Helmet:[729, 491, 761, 546],[478, 389, 502, 429],[106, 439, 135, 481],[140, 305, 163, 341];Hockey Stick:[398, 513, 470, 549].') 2023-11-03 04:36:18.161 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:36:18.161 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Ambulance:[132, 165, 776, 907];Car:[034, 351, 072, 409],[055, 340, 131, 403],[114, 342, 174, 403],[192, 337, 238, 391];Chair:[298, 385, 371, 459],[453, 378, 500, 453].') 2023-11-03 04:37:11.613 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:37:11.615 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Flower,Frame,Lamp,Cup and Sink in this image.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Flower:[731, 482, 791, 576];Picture:[349, 594, 394, 700],[599, 543, 654, 616],[824, 297, 999, 450];Lamp:[518, 173, 558, 291],[320, 108, 373, 262];Cup:[312, 648, 351, 727];Sink:[120, 744, 354, 862].') 2023-11-03 04:37:13.520 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:37:13.521 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'building:[012, 095, 1000, 635];car:[001, 585, 187, 815],[005, 584, 147, 665],[115, 537, 366, 640],[135, 553, 501, 792],[363, 575, 426, 632],[406, 546, 896, 766],[886, 569, 999, 658],[896, 613, 999, 782];wheel:[000, 735, 022, 813],[180, 706, 203, 784],[305, 698, 362, 790],[708, 685, 774, 773],[940, 682, 999, 787];tire:[000, 720, 020, 818],[174, 696, 205, 779],[306, 700, 368, 789],[421, 734, 484, 782],[474, 696, 513, 770],[710, 692, 772, 774],[938, 682, 999, 779].') 2023-11-03 04:37:18.521 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:37:18.522 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Bible Study Worksheet [284, 143, 709, 463]: Your Story within God's Word bible design bible verse purple workbook type womens bible study bible study bible worksheet indesign design [001, 015, 994, 980]") 2023-11-03 04:37:40.082 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:37:40.082 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Clock,Other Shoes and Hat in this image.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Other Shoes:[035, 811, 100, 852];Hat:[481, 051, 571, 153],[097, 200, 177, 258].') 2023-11-03 04:37:50.760 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:37:50.760 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Airplane:[412, 495, 673, 625],[059, 547, 182, 594];Person:[498, 405, 535, 605],[416, 397, 511, 624],[433, 402, 473, 602],[289, 320, 399, 625],[179, 339, 242, 613].') 2023-11-03 04:38:21.165 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:38:21.165 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[272, 417, 643, 598] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a water cartFrame1:[272, 417, 643, 598];Frame2:[266, 397, 638, 593];Frame3:[229, 381, 631, 586];Frame4:[172, 372, 629, 600];Frame5:[169, 369, 647, 600].') 2023-11-03 04:38:46.664 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:38:46.664 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Backpack:[220, 341, 312, 536];Person:[535, 273, 649, 948],[474, 288, 582, 856],[353, 283, 498, 819],[251, 259, 376, 835].') 2023-11-03 04:39:00.628 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:39:00.629 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Person:[525, 356, 645, 906],[475, 474, 542, 854],[378, 285, 508, 880];Necklace:[554, 425, 580, 458];Hat:[430, 287, 471, 324],[491, 466, 531, 506];Sneakers:[613, 869, 648, 905],[549, 836, 587, 863],[512, 809, 539, 851],[477, 765, 507, 824],[437, 832, 482, 863],[393, 841, 420, 878].') 2023-11-03 04:39:01.554 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:39:01.554 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[127, 552, 703, 666] and track its trajectory.For clarity, represent trajectories using the Frame t:[xmin,ymin,xmax,ymax] format in your response.', 'This is a stealth bomberFrame1:[127, 552, 703, 666];Frame2:[122, 551, 677, 665];Frame3:[129, 550, 667, 670];Frame4:[133, 550, 661, 670];Frame5:[132, 550, 660, 670].') 2023-11-03 04:39:53.005 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:39:53.006 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[493, 449, 632, 842] and track its trajectory.To maintain consistency, ensure that trajectories in your answer match the Frame t:[xmin,ymin,xmax,ymax] setup.', 'This is a sloth bearFrame1:[493, 449, 632, 842];Frame2:[479, 445, 634, 824];Frame3:[482, 437, 634, 824];Frame4:[486, 440, 633, 828].') 2023-11-03 04:40:03.487 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:40:03.487 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[006, 380, 994, 625] A guide to catching [003, 786, 282, 900] [275, 790, 837, 951] taxis in Italy without getting ripped off.') 2023-11-03 04:40:18.143 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 04:40:18.144 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'oyster:[011, 027, 988, 981],[397, 651, 723, 990].') 2023-11-03 04:40:33.738 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:40:33.739 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Satchel:[678, 824, 795, 939];Hat:[741, 617, 860, 662];Scale:[173, 712, 334, 805];Motorcycle:[901, 492, 999, 579],[038, 494, 213, 665];Storage box:[166, 714, 339, 898];Stool:[323, 593, 393, 632],[066, 657, 118, 718];Desk:[495, 519, 575, 565];Broccoli:[333, 765, 402, 804];Person:[674, 616, 1000, 864],[857, 530, 968, 597],[507, 476, 552, 578],[453, 446, 501, 605],[404, 454, 491, 621],[306, 513, 416, 614],[017, 514, 093, 586],[396, 604, 614, 999],[001, 614, 104, 816].') 2023-11-03 04:40:47.826 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:40:47.826 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the person moving around?For clarity, represent trajectories using the TrackiFrame t:[xmin,ymin,xmax,ymax] format in your response.', 'There is one person moving around.Track1frame:1:[988, 537, 999, 584];frame:2:[971, 538, 987, 587];frame:3:[953, 535, 974, 575].') 2023-11-03 04:40:49.559 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:40:49.560 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[460, 339, 673, 479];Flag:[001, 889, 064, 997];Sneakers:[642, 361, 671, 401];Motorcycle:[449, 409, 580, 620].') 2023-11-03 04:41:03.270 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:41:03.271 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[700, 488, 901, 642],carFrame1:[585, 487, 645, 569],carFrame1:[632, 473, 738, 597] in this video clip.To respond correctly, utilize the specified classFrame t:[xmin,ymin,xmax,ymax] format.', 'carFrame1:[700, 488, 901, 642];Frame2:[934, 556, 999, 718],carFrame1:[585, 487, 645, 569];Frame2:[651, 493, 767, 608],carFrame1:[632, 473, 738, 597];Frame2:[747, 459, 957, 662].') 2023-11-03 04:41:06.786 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:41:06.786 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Bench:[160, 411, 222, 464];Person:[355, 252, 585, 997],[854, 338, 935, 631],[918, 329, 966, 441],[829, 334, 878, 422],[662, 354, 704, 566],[657, 350, 693, 548],[627, 396, 662, 545],[544, 343, 596, 590],[201, 364, 285, 606],[097, 360, 173, 619];Umbrella:[737, 225, 999, 402],[616, 256, 810, 320];Satchel:[317, 649, 409, 855],[864, 430, 920, 494];Trash bin Can:[689, 467, 737, 569];Crane:[527, 111, 723, 240].') 2023-11-03 04:41:08.388 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:41:08.388 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "[036, 196, 996, 802] Pumpkin cream pie has all the flavor of classic pumpkin pie with a slightly different texture. [074, 193, 794, 581] It's topped with decadent salted caramel whipped cream.") 2023-11-03 04:41:35.889 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:41:35.889 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect girl,footwear,man,human body and human leg in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'girl:[561, 392, 761, 830];footwear:[391, 811, 477, 861],[478, 504, 515, 567],[483, 743, 546, 801],[565, 700, 643, 831];man:[652, 333, 1000, 940],[070, 344, 168, 518],[246, 256, 551, 872],[350, 280, 513, 560],[582, 271, 695, 648],[941, 332, 999, 638];human body:[578, 423, 993, 987],[000, 296, 060, 491],[000, 368, 091, 567],[000, 649, 138, 999],[066, 337, 181, 596],[066, 394, 400, 999],[112, 274, 186, 437],[180, 280, 263, 435],[194, 246, 235, 342],[338, 265, 390, 366],[344, 269, 493, 567],[363, 383, 460, 573],[458, 134, 538, 451],[531, 359, 603, 560],[580, 262, 691, 653],[668, 254, 717, 407],[685, 247, 783, 456],[772, 262, 836, 386],[867, 293, 927, 424],[927, 305, 993, 433],[948, 338, 999, 688];human leg:[363, 615, 470, 864],[391, 540, 541, 800],[475, 272, 488, 413],[494, 263, 531, 437],[563, 599, 658, 828].') 2023-11-03 04:41:58.617 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:41:58.618 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Chair:[219, 083, 371, 317],[524, 125, 606, 308],[579, 126, 734, 366],[743, 181, 828, 348],[880, 207, 999, 346];Person:[000, 130, 079, 353],[037, 087, 181, 345],[076, 000, 173, 230],[067, 254, 432, 952],[616, 074, 734, 360],[686, 000, 802, 288],[768, 004, 878, 352],[876, 063, 982, 344];Sneakers:[074, 804, 133, 936],[338, 885, 426, 945],[029, 320, 080, 357],[000, 319, 032, 349],[134, 314, 164, 347];Hat:[639, 071, 700, 116];Glasses:[908, 093, 939, 107];Other Shoes:[669, 331, 702, 355],[701, 338, 727, 356],[931, 320, 952, 345],[954, 321, 981, 343];Handbag:[656, 202, 699, 240].') 2023-11-03 04:42:10.713 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:42:10.714 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'woman:[014, 381, 143, 986],[138, 392, 240, 826];man:[675, 387, 796, 893],[000, 324, 041, 522],[035, 336, 065, 468],[056, 336, 084, 411];plant:[000, 377, 271, 655],[479, 821, 643, 999],[597, 650, 874, 999],[879, 744, 999, 999];footwear:[150, 755, 177, 830],[182, 747, 203, 829].') 2023-11-03 04:42:21.797 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:42:21.797 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect shelf and Plate in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Cabinet:[002, 003, 244, 999],[871, 003, 1000, 699],[765, 146, 833, 485];Plate:[663, 620, 749, 674],[635, 790, 768, 867].') 2023-11-03 04:42:33.126 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:42:33.126 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'skyscraper:[036, 104, 999, 999];window:[033, 108, 999, 999].') 2023-11-03 04:43:09.248 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:43:09.248 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Mouse:[285, 815, 343, 925];Canned:[542, 315, 636, 476];Desk:[068, 790, 1000, 1000],[351, 552, 841, 796];Chair:[029, 509, 169, 1000];Person:[160, 217, 384, 799],[883, 304, 961, 400],[567, 009, 1000, 831],[219, 092, 732, 913];Watch:[665, 469, 728, 524];Necklace:[519, 450, 574, 528].') 2023-11-03 04:43:29.215 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 04:43:29.216 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Person:[278, 088, 599, 897],[176, 082, 651, 885];Sneakers:[557, 785, 649, 857],[498, 843, 597, 896],[315, 723, 377, 866];Soccer:[410, 782, 489, 906].') 2023-11-03 04:43:38.053 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:43:38.054 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the both pandas are engaged in a battle..For the trajectories included in the answer, please use the format TrackiFrame t:[xmin,ymin,xmax,ymax].', 'There are 2 both pandas are engaged in a battle..Track1frame:1:[356, 264, 699, 980];frame:2:[339, 260, 610, 983];frame:3:[271, 271, 560, 997],Track2frame:1:[437, 193, 804, 868];frame:2:[409, 142, 754, 874];frame:3:[372, 186, 725, 910].') 2023-11-03 04:43:39.449 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:43:39.449 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect Bed,Picture,Gloves and Carpet in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Bed:[196, 300, 825, 880];Picture:[768, 001, 985, 338];Gloves:[888, 252, 929, 308];Carpet:[001, 686, 470, 997].') 2023-11-03 04:43:52.872 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:43:52.872 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is thisFrame1:[333, 095, 369, 156] and track its trajectory.Use the specified Frame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'This is a person headFrame1:[333, 095, 369, 156];Frame2:[468, 436, 493, 476];Frame3:[445, 465, 462, 498].') 2023-11-03 04:44:23.359 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:44:23.359 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect boy,tree,person,wheel and bicycle in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'tree:[011, 306, 121, 385],[150, 308, 383, 445],[456, 314, 668, 460],[531, 370, 665, 506],[648, 091, 999, 600];person:[455, 639, 554, 956],[308, 575, 346, 724],[327, 578, 393, 814],[419, 610, 440, 675],[485, 600, 507, 688],[654, 646, 719, 805],[667, 778, 775, 999],[777, 672, 858, 901];wheel:[339, 745, 352, 850],[502, 873, 523, 985],[671, 759, 683, 833],[678, 919, 703, 999],[770, 831, 801, 940],[813, 864, 858, 988];bicycle:[311, 520, 508, 847],[461, 761, 534, 978],[662, 724, 715, 861],[673, 844, 776, 999],[768, 780, 860, 989].') 2023-11-03 04:44:24.751 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:44:24.751 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Trash bin Can:[403, 560, 436, 617];Barrel:[403, 558, 437, 620].') 2023-11-03 04:44:36.880 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:44:36.880 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Person:[884, 385, 1000, 721],[529, 478, 633, 664],[478, 529, 534, 630],[309, 377, 527, 718],[293, 495, 387, 626];Glasses:[364, 410, 444, 460];Hat:[534, 481, 593, 513],[938, 387, 999, 448],[478, 527, 508, 552].') 2023-11-03 04:44:47.779 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:44:47.779 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'house:[045, 876, 206, 975],[000, 906, 066, 983],[119, 563, 253, 591],[226, 590, 296, 607],[264, 782, 356, 884],[321, 747, 460, 840],[423, 721, 543, 790],[443, 553, 490, 626],[524, 571, 600, 613],[714, 765, 790, 815],[733, 813, 800, 857],[742, 417, 810, 452],[910, 455, 999, 496];tree:[000, 481, 047, 565],[000, 642, 062, 696],[034, 584, 078, 650],[036, 299, 103, 378],[051, 615, 126, 692],[160, 466, 191, 530],[181, 776, 210, 847],[190, 653, 223, 719],[201, 469, 241, 545],[245, 663, 281, 730],[245, 719, 278, 792],[298, 318, 336, 381],[301, 621, 320, 705],[335, 644, 396, 748],[336, 409, 385, 485],[371, 612, 421, 712],[409, 625, 437, 689],[431, 598, 461, 702],[432, 621, 469, 719],[455, 664, 483, 729],[463, 602, 488, 679],[483, 669, 530, 748],[488, 629, 522, 706],[586, 592, 609, 660],[604, 649, 647, 726],[650, 627, 701, 727],[655, 842, 713, 923],[655, 567, 720, 698],[655, 923, 753, 999],[671, 734, 706, 798],[696, 792, 740, 895],[723, 579, 796, 698],[737, 675, 758, 748],[756, 673, 793, 748],[820, 796, 850, 877],[821, 573, 903, 644],[837, 857, 883, 969],[847, 632, 928, 740],[864, 727, 897, 792],[872, 376, 948, 434],[873, 840, 902, 904],[901, 565, 928, 636],[913, 653, 948, 763],[925, 542, 964, 653],[945, 673, 973, 745],[952, 519, 993, 661],[964, 767, 999, 853],[971, 665, 999, 778].') 2023-11-03 04:44:59.409 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:44:59.410 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Person:[410, 367, 547, 954],[630, 341, 755, 923],[919, 390, 986, 684].') 2023-11-03 04:45:16.542 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:45:16.543 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A flatlay of a pair of shorts with white, blue and pink print, with [593, 356, 981, 904] a folded bright blue cardigan, and [571, 719, 617, 802] [515, 680, 821, 919] [521, 721, 575, 803] some jewellery and [211, 598, 430, 730] sunglasses on top.') 2023-11-03 04:45:32.406 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:45:32.406 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "A group of people [739, 330, 832, 806] [604, 321, 718, 809] [536, 313, 593, 775] [582, 294, 648, 780] [779, 292, 911, 824] [670, 290, 747, 776] [149, 294, 277, 851] [010, 275, 143, 900] stand with banners [328, 327, 568, 780] saying 'BDS is here to stay' in front of red smoke [123, 068, 577, 857]") 2023-11-03 04:45:45.160 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:45:45.160 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me what is thisFrame1:[623, 270, 681, 480] and track its trajectory.When detailing trajectories in your response, adhere to the Frame t:[xmin,ymin,xmax,ymax] format.', 'This is a personFrame1:[623, 270, 681, 480];Frame2:[581, 359, 623, 538];Frame3:[425, 373, 503, 602].') 2023-11-03 04:46:07.912 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:46:07.912 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Puppy cream Labrador [202, 100, 832, 966] facing at the camera, isolated on white') 2023-11-03 04:46:19.587 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:46:19.587 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the black adn white rabbit jump around in this video clip?As you detail the trajectories in your reply, the TrackiFrame t:[xmin,ymin,xmax,ymax] format is the preferred structure.', 'There is one black adn white rabbit jump around.Track1frame:3:[016, 425, 108, 562].') 2023-11-03 04:47:04.459 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:47:04.460 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Motorcycle:[337, 577, 352, 603],[351, 578, 365, 609],[481, 594, 554, 740],[548, 612, 608, 709],[685, 613, 781, 775];Street Lights:[846, 296, 887, 662],[143, 283, 200, 603],[094, 415, 123, 484],[073, 460, 091, 502];Pickup Truck:[156, 563, 190, 584];Car:[125, 565, 141, 574];Gloves:[253, 654, 292, 696],[142, 692, 178, 731],[690, 653, 697, 663];Hat:[211, 515, 240, 543],[722, 589, 738, 612];Boots:[219, 899, 297, 974];Other Shoes:[697, 729, 710, 742],[397, 635, 405, 648],[426, 646, 432, 656];Helmet:[576, 586, 588, 604],[507, 560, 521, 578],[470, 579, 477, 589],[440, 565, 452, 578],[413, 571, 423, 586],[397, 570, 404, 581],[389, 573, 394, 581],[313, 562, 319, 572];Person:[022, 484, 297, 986],[187, 515, 258, 640],[310, 561, 321, 584],[691, 588, 738, 743],[552, 586, 598, 685],[482, 560, 548, 675];Flag:[521, 591, 530, 638],[588, 631, 600, 643],[727, 595, 756, 693].') 2023-11-03 04:47:15.080 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:47:15.080 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'shelf:[001, 048, 315, 658],[536, 573, 936, 965],[326, 301, 393, 384],[740, 513, 834, 592],[262, 576, 437, 780],[427, 599, 574, 778];Lamp:[508, 001, 569, 222],[671, 001, 740, 148],[940, 065, 999, 316];Carpet:[268, 808, 548, 897];Desk:[616, 900, 999, 1000].') 2023-11-03 04:47:15.915 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:47:15.915 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Flower:[001, 382, 188, 554];Cabinet:[546, 370, 659, 644],[750, 381, 879, 563],[115, 653, 217, 960];Faucet:[660, 424, 727, 624];Sink:[647, 613, 716, 649],[530, 653, 648, 695];Barrel:[273, 517, 314, 580];Hat:[269, 474, 310, 538];Bottle:[479, 620, 505, 732],[725, 518, 749, 633],[723, 638, 748, 723];Knife:[484, 497, 505, 531];Oven:[438, 393, 509, 542];Plate:[346, 558, 417, 592];Gas stove:[150, 552, 532, 719];TV:[720, 405, 751, 478];Extractor:[134, 149, 540, 393].') 2023-11-03 04:47:30.139 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:47:30.139 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[260, 768, 270, 825] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a border collieFrame1:[260, 768, 270, 825];Frame2:[259, 766, 269, 828];Frame3:[244, 751, 256, 814];Frame4:[234, 751, 258, 797];Frame5:[232, 817, 256, 853].') 2023-11-03 04:47:43.079 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:47:43.079 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Helmet:[964, 107, 996, 171],[924, 111, 964, 180],[871, 151, 910, 206],[818, 137, 856, 201],[875, 214, 942, 321],[845, 183, 895, 257],[760, 093, 805, 176],[668, 093, 706, 148],[631, 084, 671, 160],[608, 204, 668, 310],[567, 203, 620, 282],[554, 082, 596, 150],[514, 087, 553, 148],[460, 152, 497, 222],[422, 103, 460, 168],[384, 090, 423, 162],[391, 163, 440, 238],[284, 189, 354, 288],[451, 343, 523, 431],[379, 285, 450, 410],[287, 284, 347, 401],[223, 075, 266, 141],[062, 109, 090, 150],[575, 636, 642, 754];Hat:[358, 105, 384, 136];Gloves:[249, 294, 284, 340];Sneakers:[957, 612, 985, 660],[923, 567, 951, 644],[730, 544, 790, 616],[543, 752, 578, 799],[528, 848, 557, 880],[345, 807, 394, 894],[469, 724, 504, 804],[385, 575, 418, 694],[376, 667, 408, 744],[360, 687, 390, 731],[254, 631, 281, 724],[154, 638, 201, 700],[089, 625, 128, 729];Person:[001, 105, 035, 429],[065, 115, 105, 426],[015, 084, 094, 430],[094, 116, 162, 426],[227, 080, 270, 211],[127, 066, 255, 430],[092, 194, 349, 720],[301, 079, 378, 328],[251, 290, 412, 740],[367, 166, 474, 363],[371, 287, 580, 799],[345, 345, 579, 886],[540, 547, 794, 787],[358, 105, 386, 152],[535, 203, 621, 549],[383, 094, 433, 227],[554, 205, 727, 654],[412, 104, 482, 357],[459, 155, 508, 275],[491, 088, 579, 429],[548, 085, 613, 211],[607, 087, 710, 289],[664, 090, 738, 452],[725, 094, 829, 454],[815, 140, 873, 454],[870, 154, 926, 225],[817, 186, 901, 536],[832, 213, 987, 662],[922, 112, 968, 295],[951, 104, 1000, 410];American Football:[365, 449, 410, 534].') 2023-11-03 04:48:07.986 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:48:07.986 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'man:[000, 013, 550, 999],[781, 640, 858, 944],[906, 627, 999, 894];swimming pool:[526, 821, 999, 999];human face:[309, 079, 507, 540];glasses:[296, 254, 526, 353].') 2023-11-03 04:48:46.863 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:48:46.864 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect jeans,human leg,human body,human hair,human arm and fashion accessory in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'jeans:[114, 351, 260, 762],[383, 465, 670, 655],[508, 246, 585, 315],[640, 236, 742, 338],[767, 348, 913, 439],[879, 381, 975, 494];human leg:[101, 380, 251, 743],[274, 276, 333, 380],[381, 476, 568, 576],[461, 472, 666, 590],[652, 236, 748, 336],[769, 322, 903, 435],[889, 385, 974, 511];human body:[657, 078, 950, 528],[000, 076, 077, 161],[000, 146, 102, 230],[070, 090, 320, 733],[250, 160, 357, 381],[325, 140, 375, 220],[366, 071, 537, 312],[375, 281, 724, 636],[507, 184, 585, 325],[632, 061, 748, 330],[908, 031, 999, 204],[954, 126, 999, 385];human hair:[082, 085, 220, 217],[275, 157, 341, 207],[424, 068, 512, 147],[642, 266, 726, 323],[722, 007, 888, 128];human arm:[192, 174, 325, 251],[068, 200, 182, 489],[369, 125, 419, 219],[405, 140, 540, 217],[458, 367, 613, 513],[595, 376, 767, 483],[676, 135, 733, 238],[695, 147, 922, 403],[927, 079, 999, 136];fashion accessory:[176, 718, 260, 741],[244, 755, 385, 943],[459, 651, 853, 955],[745, 062, 905, 298],[918, 479, 982, 508],[925, 192, 990, 334].') 2023-11-03 04:48:51.378 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:48:51.378 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person in this image.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[152, 600, 839, 998],[911, 001, 999, 249],[334, 001, 536, 193],[198, 001, 324, 230],[003, 001, 161, 147].') 2023-11-03 04:49:54.335 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:49:54.336 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Sneakers:[468, 572, 495, 662],[385, 503, 432, 567];Person:[384, 431, 684, 679],[002, 249, 080, 616],[065, 263, 132, 549],[158, 237, 223, 555],[240, 254, 321, 555],[320, 265, 385, 558],[398, 249, 472, 562],[502, 275, 572, 472],[575, 274, 645, 569],[758, 265, 832, 578].') 2023-11-03 04:50:23.795 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:50:23.796 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[445, 414, 484, 495] and track its trajectory.To maintain consistency, ensure that trajectories in your answer match the Frame t:[xmin,ymin,xmax,ymax] setup.', 'This is a skibobFrame1:[445, 414, 484, 495];Frame2:[380, 368, 415, 449];Frame3:[359, 352, 418, 444];Frame4:[369, 446, 407, 563].') 2023-11-03 04:50:46.718 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:50:46.718 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect man,jeans and woman in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'man:[568, 520, 658, 882],[445, 558, 483, 704],[534, 546, 555, 641],[842, 569, 865, 648],[941, 569, 961, 670];jeans:[579, 677, 647, 873],[693, 678, 723, 744];woman:[880, 565, 952, 762],[681, 570, 733, 758].') 2023-11-03 04:51:11.503 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:51:11.503 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[185, 615, 275, 762] and track its trajectory.Any trajectory details should be arranged according to the Frame t:[xmin,ymin,xmax,ymax] format for clarity.', 'This is a personFrame1:[185, 615, 275, 762];Frame2:[181, 603, 284, 775];Frame3:[195, 623, 286, 777];Frame4:[168, 583, 270, 788].') 2023-11-03 04:51:32.474 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:51:32.474 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "[551, 211, 964, 936] [003, 190, 463, 934] Two kids with [329, 622, 403, 852] [827, 676, 926, 915] forks and [590, 618, 671, 882] [023, 570, 125, 927] knives raised and full plates of [095, 804, 389, 938] [605, 825, 903, 943] yummy food with [708, 212, 893, 578] [143, 193, 401, 540] mad and frustrated faces - just like when God's kids complained about their food too") 2023-11-03 04:51:56.194 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:51:56.195 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Sneakers,Trombone and Mask in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Sneakers:[002, 818, 076, 876],[763, 909, 801, 956];Trombone:[738, 303, 831, 428].') 2023-11-03 04:52:18.870 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:52:18.870 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'blooming branches in a whimsy vase with leaves and blooms plus [625, 484, 780, 652] [311, 102, 463, 284] [188, 394, 320, 568] [172, 202, 325, 358] [520, 213, 750, 361] [801, 344, 971, 532] vintage family photos on them for a spring wedding') 2023-11-03 04:52:56.195 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:52:56.195 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the the cat standing in the same spot without any movement..For clarity, represent trajectories using the TrackiFrame t:[xmin,ymin,xmax,ymax] format in your response.', 'There is one the cat standing in the same spot without any movement..Track1frame:1:[754, 173, 997, 475];frame:2:[754, 173, 997, 475];frame:3:[771, 175, 998, 477].') 2023-11-03 04:52:57.655 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:52:57.655 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[176, 507, 382, 876],[491, 238, 829, 900],[252, 270, 332, 414],[393, 284, 493, 367];Sneakers:[710, 840, 797, 898],[518, 838, 689, 895];Lifesaver:[228, 319, 361, 359].') 2023-11-03 04:53:04.387 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:53:04.388 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect bookcase,person and human arm in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'bookcase:[076, 004, 568, 1000],[491, 112, 999, 881];person:[576, 365, 747, 798];human arm:[550, 450, 600, 523],[649, 462, 737, 555].') 2023-11-03 04:53:22.567 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:53:22.567 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Sailboat:[166, 432, 501, 773].') 2023-11-03 04:53:27.658 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:53:27.658 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'dessert:[051, 347, 323, 756],[136, 884, 241, 992],[151, 772, 251, 864],[293, 490, 614, 930],[313, 795, 470, 972],[370, 590, 524, 795],[629, 320, 994, 667];baked goods:[064, 308, 308, 776];fast food:[056, 343, 336, 733],[652, 305, 997, 675];vegetable:[295, 498, 606, 942],[359, 590, 524, 818].') 2023-11-03 04:53:36.649 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:53:36.650 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Lifesaver:[411, 498, 443, 516],[482, 479, 513, 497];Flag:[070, 000, 092, 015],[131, 000, 151, 026],[313, 121, 322, 157],[344, 122, 352, 155],[375, 122, 382, 150],[643, 083, 650, 137],[619, 000, 630, 028],[678, 000, 687, 024],[849, 079, 857, 094],[788, 096, 806, 137];Awning:[103, 160, 166, 170],[027, 160, 091, 174],[966, 147, 999, 166],[885, 089, 906, 105],[917, 089, 939, 104];Person:[000, 182, 004, 199],[081, 182, 085, 201],[418, 181, 422, 200],[792, 185, 809, 228],[594, 188, 603, 231];Trash bin Can:[300, 198, 313, 220],[616, 196, 629, 226],[911, 184, 917, 196];Blackboard:[857, 183, 868, 197],[747, 183, 754, 197];Boat:[187, 330, 558, 631];Street Lights:[036, 142, 043, 201],[102, 107, 126, 200],[286, 106, 301, 201];Flower:[047, 146, 062, 163],[146, 131, 162, 154],[187, 133, 204, 156],[215, 131, 231, 158],[261, 132, 276, 157],[326, 130, 340, 152],[357, 132, 372, 151],[386, 131, 401, 151],[422, 127, 438, 157],[456, 127, 468, 151],[465, 154, 483, 181],[505, 147, 519, 166],[901, 152, 920, 174],[991, 141, 999, 161];Vase:[051, 153, 058, 158];Chair:[020, 185, 026, 201];Desk:[112, 190, 128, 201];Lamp:[062, 141, 066, 147];Backpack:[795, 191, 803, 206],[286, 181, 290, 190],[129, 182, 137, 195];Satchel:[813, 180, 816, 189],[557, 182, 560, 190],[538, 192, 545, 200],[494, 191, 500, 197],[319, 188, 326, 200],[280, 191, 284, 197],[273, 207, 278, 214];Other Shoes:[599, 223, 602, 230].') 2023-11-03 04:53:37.196 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:53:37.196 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[185, 354, 535, 733] and track its trajectory.For the trajectories included in the answer, please use the format Frame t:[xmin,ymin,xmax,ymax].', 'This is a chimaeraFrame1:[185, 354, 535, 733];Frame2:[242, 352, 546, 755];Frame3:[287, 342, 567, 626];Frame4:[262, 327, 587, 657];Frame5:[281, 403, 596, 654].') 2023-11-03 04:53:57.408 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:53:57.409 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Gluten free honey pumpkin pie [154, 305, 826, 806] has all the great pumpkin flavor that you crave but without any refined sugars. Lightly sweetened and perfectly slice-able.') 2023-11-03 04:54:08.684 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:54:08.684 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'human face:[445, 000, 979, 746];man:[003, 000, 999, 999].') 2023-11-03 04:54:14.962 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:54:14.963 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'MaxSold Auction: This online auction features artworks [704, 396, 976, 603], salt cellars [090, 348, 314, 603] [365, 643, 606, 917] [364, 350, 599, 605] [072, 647, 303, 910] and spoons [346, 074, 408, 272] [151, 091, 209, 294] [441, 058, 502, 261] [234, 082, 300, 286], napkin rings, Walbrzych China set and a large collection [701, 050, 982, 308] of silverware [701, 684, 987, 967].') 2023-11-03 04:54:37.286 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:54:37.286 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'clothing:[019, 363, 305, 1000],[094, 327, 180, 488],[263, 448, 653, 998],[775, 347, 999, 998],[933, 901, 999, 998];man:[000, 104, 342, 998],[270, 248, 653, 998],[779, 152, 999, 998],[906, 680, 999, 998];tree:[000, 000, 151, 198],[101, 000, 222, 441],[214, 000, 348, 451],[269, 000, 607, 382],[593, 000, 752, 170],[690, 000, 999, 375];human face:[392, 342, 491, 470],[902, 275, 996, 389],[907, 767, 977, 898].') 2023-11-03 04:54:44.181 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:54:44.181 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Leather Shoes:[250, 911, 408, 989],[165, 888, 245, 975];Cell Phone:[686, 583, 741, 599];Chair:[825, 517, 999, 623];Desk:[093, 555, 999, 1000];Person:[020, 213, 408, 990],[371, 282, 638, 576],[001, 384, 034, 536];Bottle:[692, 532, 728, 606],[606, 525, 638, 594];Plate:[738, 586, 825, 608].') 2023-11-03 04:54:55.637 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:54:55.637 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "World's highest mileage Lamborghini Huracan in red sitting by [000, 445, 997, 995] the freeway") 2023-11-03 04:54:56.539 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:54:56.539 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[083, 198, 361, 996] woman in [165, 385, 269, 498] mask outside an medical exam room') 2023-11-03 04:55:10.571 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:55:10.572 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Storage box:[245, 330, 423, 383],[321, 240, 411, 331],[792, 263, 913, 352];Dog:[504, 711, 847, 965];Bowl:[244, 330, 420, 381],[117, 331, 236, 401],[300, 217, 381, 284];Dishwasher:[331, 387, 551, 790];Oven:[003, 522, 327, 999];Person:[436, 029, 835, 759].') 2023-11-03 04:55:25.439 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:55:25.440 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'jake gyllenhaal at [530, 002, 725, 681] the 67th annual tony awards arrivals at radio city music hall on june 09 2013 in new york new york - jake gyllenhaal stock videos & royalty-free footage') 2023-11-03 04:55:30.780 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:55:30.780 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Chair:[116, 409, 327, 657],[328, 394, 546, 606],[432, 431, 631, 750],[304, 549, 577, 931],[744, 456, 954, 807],[713, 566, 997, 950],[852, 669, 998, 999];Desk:[262, 466, 350, 563],[517, 616, 809, 843];Potted Plant:[001, 438, 153, 817].') 2023-11-03 04:55:32.043 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:55:32.043 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Gloves:[667, 665, 759, 779];Helmet:[420, 130, 561, 300],[621, 133, 676, 260];Leather Shoes:[390, 654, 528, 738];Person:[297, 132, 758, 1000],[567, 136, 747, 648].') 2023-11-03 04:55:41.788 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:55:41.789 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Tea pot,Hat and Person in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Tea pot:[806, 445, 855, 497];Hat:[156, 414, 212, 459];Person:[456, 408, 618, 751],[057, 417, 239, 734],[853, 410, 981, 828],[416, 441, 465, 607],[450, 447, 521, 650].') 2023-11-03 04:55:47.210 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:55:47.210 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Stuffed Toy and Hat in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Stuffed Toy:[215, 499, 255, 581];Hat:[669, 753, 760, 894],[752, 815, 809, 894].') 2023-11-03 04:55:59.468 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 04:55:59.469 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Couch,Bottle,Laptop,Cup and Cabinet/shelf in this image.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Couch:[749, 399, 1000, 716],[751, 636, 1000, 1000];Bottle:[935, 567, 963, 659],[972, 556, 997, 632];Laptop:[629, 463, 683, 573];Cup:[923, 590, 942, 633];shelf:[140, 524, 304, 742].') 2023-11-03 04:56:59.347 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:56:59.347 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'dress:[351, 621, 498, 869];cabinetry:[116, 439, 645, 599],[184, 720, 916, 939],[688, 291, 875, 745],[830, 535, 999, 830];woman:[000, 566, 273, 999],[344, 529, 600, 964];man:[594, 563, 737, 999];countertop:[000, 724, 865, 791];building:[000, 000, 999, 999].') 2023-11-03 04:57:11.364 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:57:11.364 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Person:[642, 089, 990, 999],[364, 093, 706, 999],[420, 192, 571, 999],[293, 119, 450, 999],[297, 200, 385, 927],[002, 057, 433, 1000].') 2023-11-03 04:57:32.874 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:57:32.875 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'man:[550, 018, 704, 225],[000, 169, 137, 482],[000, 343, 271, 999],[002, 240, 246, 603],[034, 052, 204, 299],[141, 057, 274, 270],[152, 440, 596, 999],[205, 169, 488, 609],[220, 082, 335, 308],[584, 140, 792, 596],[653, 000, 798, 191],[672, 076, 747, 215],[686, 381, 936, 962],[725, 130, 936, 528],[772, 000, 893, 195],[839, 035, 991, 257],[893, 147, 999, 473];woman:[304, 024, 436, 300],[401, 195, 676, 616],[418, 000, 522, 279],[706, 528, 999, 999];clothing:[000, 251, 150, 457],[000, 334, 150, 598],[000, 435, 257, 976],[038, 088, 144, 284],[140, 110, 290, 298],[158, 536, 602, 999],[216, 266, 475, 649],[234, 191, 328, 288],[316, 062, 423, 248],[412, 056, 502, 277],[414, 270, 644, 615],[514, 132, 623, 298],[556, 054, 680, 255],[579, 203, 755, 589],[656, 002, 792, 165],[676, 138, 737, 439],[686, 526, 999, 978],[720, 222, 959, 516],[751, 867, 986, 999],[752, 015, 887, 177],[860, 106, 992, 217],[913, 180, 999, 455];human face:[102, 081, 150, 177],[166, 290, 237, 437],[183, 393, 223, 520],[240, 121, 274, 228],[338, 217, 406, 343],[484, 000, 518, 085],[487, 258, 564, 343],[624, 041, 670, 123],[685, 000, 725, 067],[698, 240, 778, 314],[789, 472, 839, 575],[814, 200, 897, 298],[844, 060, 864, 125].') 2023-11-03 04:57:49.302 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:57:49.302 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[365, 081, 646, 997] [115, 109, 625, 992] Beautiful woman taking selfies with [001, 129, 098, 555] [612, 063, 885, 994] [042, 005, 234, 498] his friends at a party') 2023-11-03 04:58:07.768 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:58:07.769 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'IFC students [551, 169, 668, 396] [294, 264, 466, 998] assist in the deployment and set-up of a large NASA NPOL radar.') 2023-11-03 04:58:45.605 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:58:45.606 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect bat,sculpture and houseplant in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'sculpture:[355, 321, 822, 881];houseplant:[745, 697, 983, 936].') 2023-11-03 04:59:17.664 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:59:17.665 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[504, 304, 698, 876] [685, 287, 867, 789] [315, 255, 509, 596] [131, 272, 326, 869] Youth climate activists continue hunger strike against coal mine in the UK') 2023-11-03 04:59:20.883 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:59:20.884 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, can you tell me what thisframe:1:[290, 085, 546, 500];frame:2:[396, 173, 581, 374];frame:3:[409, 116, 578, 583] is?Please briefly answer the question based on the provided image frames and trajectory coordinates.', 'This is a/an the newest addition to the group, a panda, joining in the playful activities in the snowy landscape.') 2023-11-03 04:59:38.648 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:59:38.649 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Chair:[041, 469, 328, 996],[614, 580, 899, 996];Person:[065, 359, 439, 996],[464, 595, 517, 765],[479, 608, 517, 765],[581, 456, 939, 998];Bracelet:[339, 687, 357, 736];Bottle:[592, 692, 618, 845],[155, 643, 194, 853];Car:[861, 630, 972, 738],[522, 644, 703, 773],[263, 630, 474, 761],[001, 619, 069, 721];Fire Hydrant:[906, 652, 982, 906].') 2023-11-03 04:59:43.192 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 04:59:43.193 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Person:[264, 137, 686, 695];Sneakers:[593, 577, 686, 668].') 2023-11-03 04:59:49.830 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 04:59:49.831 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect woman,clothing and human face in this image.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'woman:[620, 231, 856, 988];clothing:[646, 367, 843, 921],[000, 229, 353, 998],[208, 220, 430, 998],[388, 272, 610, 998],[411, 451, 690, 998],[802, 252, 999, 998];human face:[127, 108, 210, 268],[257, 145, 342, 281],[418, 142, 521, 322],[491, 331, 582, 472],[657, 258, 763, 414],[832, 095, 926, 283].') 2023-11-03 05:00:15.814 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:00:15.815 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Person:[289, 181, 589, 974];Sneakers:[454, 791, 580, 856],[373, 897, 474, 975].') 2023-11-03 05:00:39.128 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:00:39.129 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'car:[145, 015, 848, 940],[676, 032, 805, 165].') 2023-11-03 05:00:49.598 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:00:49.599 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[352, 262, 997, 669] and track its trajectory.Your response should highlight trajectories using the established Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a personFrame1:[352, 262, 997, 669];Frame2:[375, 321, 994, 611];Frame3:[389, 294, 911, 529];Frame4:[506, 214, 791, 625];Frame5:[470, 375, 759, 667].') 2023-11-03 05:00:53.992 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:00:53.993 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'human body:[000, 529, 129, 998],[000, 228, 999, 931],[177, 611, 233, 998],[194, 618, 359, 998],[318, 627, 518, 998],[501, 662, 624, 998],[591, 820, 726, 998],[592, 651, 657, 823],[712, 765, 818, 998],[779, 705, 871, 998],[843, 646, 873, 765],[884, 622, 952, 998],[903, 647, 999, 998];human hair:[106, 872, 176, 996],[009, 520, 081, 642],[174, 392, 216, 522],[208, 372, 241, 459],[255, 379, 293, 528],[486, 451, 536, 564],[576, 482, 619, 555],[726, 622, 767, 709];human head:[106, 872, 176, 996],[000, 533, 078, 702],[000, 221, 999, 520],[096, 350, 124, 450],[115, 373, 165, 484],[178, 397, 210, 466],[230, 533, 275, 627],[256, 384, 286, 455],[271, 499, 329, 646],[407, 482, 470, 642],[489, 455, 528, 539],[532, 557, 581, 678],[563, 482, 579, 549],[580, 555, 636, 673],[654, 686, 715, 827],[723, 629, 776, 771],[774, 577, 841, 689],[821, 557, 858, 635],[848, 531, 883, 620],[915, 549, 947, 633],[964, 549, 999, 680];man:[345, 442, 500, 978],[000, 339, 058, 589],[014, 323, 088, 495],[051, 339, 139, 566],[069, 368, 208, 934],[278, 313, 345, 439],[286, 355, 343, 588],[335, 372, 393, 513],[395, 364, 455, 490],[459, 419, 511, 502],[518, 386, 548, 464],[535, 379, 603, 551],[622, 402, 663, 500],[633, 471, 740, 709],[708, 433, 767, 635],[776, 442, 815, 624],[796, 408, 837, 513],[845, 422, 887, 533],[846, 528, 896, 998],[911, 580, 999, 998];clothing:[893, 591, 992, 974],[000, 350, 997, 998];human arm:[609, 825, 675, 976],[001, 112, 988, 998];boy:[178, 537, 284, 998],[191, 495, 370, 990],[347, 417, 431, 655],[881, 548, 950, 998];woman:[000, 531, 099, 998],[375, 344, 406, 430],[477, 551, 622, 998],[483, 450, 540, 678],[553, 477, 587, 569],[576, 482, 619, 564],[585, 557, 656, 836],[593, 687, 728, 998],[602, 453, 626, 522],[718, 626, 816, 998],[753, 470, 783, 644],[806, 477, 853, 560],[821, 564, 869, 773];girl:[102, 874, 180, 998],[384, 372, 412, 480].') 2023-11-03 05:01:02.759 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:01:02.759 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[002, 103, 990, 773] Best portable hammock for the beach Reviews') 2023-11-03 05:01:03.209 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:01:03.209 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[348, 124, 776, 1000],[641, 206, 920, 999],[791, 245, 1000, 1000],[372, 337, 477, 910],[234, 360, 411, 999],[248, 551, 424, 999],[103, 577, 267, 999],[000, 605, 141, 999];Bracelet:[356, 709, 384, 743];Car:[015, 477, 246, 611];Other Shoes:[042, 952, 063, 982].') 2023-11-03 05:01:34.411 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:01:34.412 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[139, 121, 768, 912] Junior Anurag Kelkar prepares to shoot a goal during a practice on Thursday, Nov. 21. This is Kelkar’s first season on varsity after two years playing on JV. Photo by Rhonda Mak.') 2023-11-03 05:01:38.010 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:01:38.010 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect glasses,girl and woman in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'glasses:[191, 195, 349, 262],[590, 264, 860, 425];girl:[000, 146, 974, 999];woman:[000, 040, 999, 999].') 2023-11-03 05:01:51.612 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:01:51.612 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[183, 005, 870, 994] Tamannaah Bhatia swears by this hack to get rid of puffiness') 2023-11-03 05:02:01.091 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:02:01.091 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'dog:[044, 038, 691, 566].') 2023-11-03 05:02:04.267 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:02:04.267 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Drum and Person in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Drum:[002, 627, 097, 802],[002, 812, 119, 999];Person:[164, 209, 454, 999],[408, 527, 500, 947],[448, 208, 774, 999],[547, 575, 698, 999],[687, 465, 818, 961],[711, 890, 889, 999].') 2023-11-03 05:02:13.953 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:02:13.954 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Artisan [001, 002, 684, 994] inspecting PONS shoes [481, 168, 670, 344] to make sure only the highest quality avarcas [427, 352, 804, 555] are delivered to you') 2023-11-03 05:02:24.130 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:02:24.131 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[003, 099, 995, 997] The luxurious Volkswagen Passat interior, showcasing the digital cockpit, navigation touchscreen, and [747, 668, 969, 990] leather gear shift.') 2023-11-03 05:02:36.304 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 05:02:36.304 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Street Lights:[640, 472, 669, 557],[655, 453, 690, 556],[678, 412, 725, 545],[731, 322, 806, 505],[847, 002, 919, 532];Traffic cone:[014, 583, 141, 816],[510, 568, 900, 871].') 2023-11-03 05:02:59.940 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:02:59.940 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'tree:[331, 525, 534, 760],[000, 458, 058, 733],[000, 000, 638, 820],[015, 478, 076, 728],[459, 000, 999, 554],[600, 460, 889, 710],[725, 063, 999, 470],[827, 000, 999, 150];window:[131, 427, 206, 571],[112, 619, 124, 716],[128, 312, 203, 379],[129, 617, 156, 713],[163, 615, 185, 711],[338, 653, 355, 716],[368, 414, 394, 515],[392, 603, 418, 672],[404, 410, 430, 508],[438, 593, 474, 675],[441, 400, 466, 503],[496, 253, 603, 500],[507, 582, 541, 682],[709, 341, 732, 405],[812, 548, 835, 620],[823, 315, 856, 441],[848, 530, 895, 658],[869, 298, 903, 434],[939, 303, 981, 429],[967, 520, 999, 646],[993, 300, 999, 414].') 2023-11-03 05:03:01.907 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:03:01.907 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[718, 491, 768, 612] and track its trajectory.To maintain consistency, ensure that trajectories in your answer match the Frame t:[xmin,ymin,xmax,ymax] setup.', 'This is a australian terrierFrame1:[718, 491, 768, 612];Frame2:[671, 497, 728, 579];Frame3:[556, 479, 628, 550];Frame4:[539, 443, 609, 502];Frame5:[482, 454, 539, 516].') 2023-11-03 05:03:42.228 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:03:42.228 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'flag:[045, 302, 245, 638];human head:[648, 290, 831, 610],[146, 557, 203, 642],[241, 513, 281, 592],[280, 507, 314, 572],[348, 507, 400, 571],[426, 500, 472, 577],[471, 492, 514, 569],[523, 504, 568, 581],[569, 497, 618, 569],[626, 530, 656, 594],[805, 486, 845, 596],[853, 515, 908, 598],[908, 504, 947, 585],[943, 480, 998, 572];man:[574, 266, 983, 999],[150, 565, 238, 999],[223, 519, 318, 999],[272, 511, 348, 970],[350, 503, 454, 999],[358, 503, 459, 667],[406, 517, 518, 999],[465, 511, 544, 987],[521, 503, 580, 985],[565, 501, 637, 941],[795, 521, 891, 657],[861, 523, 942, 712],[903, 503, 953, 638],[938, 478, 999, 999];clothing:[565, 495, 966, 1000],[140, 554, 673, 999],[831, 565, 884, 642],[881, 579, 951, 708],[930, 559, 967, 646],[953, 559, 999, 984];mammal:[131, 563, 236, 993],[239, 478, 999, 987];human face:[643, 335, 774, 595];microphone:[394, 639, 538, 792],[458, 604, 615, 763].') 2023-11-03 05:04:02.364 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:04:02.365 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[233, 179, 434, 496] and track its trajectory.Ensure the trajectories in your answer follow the Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a push-bikeFrame1:[233, 179, 434, 496];Frame2:[202, 140, 406, 417];Frame3:[225, 129, 436, 410];Frame4:[248, 114, 470, 395];Frame5:[249, 118, 471, 399].') 2023-11-03 05:04:23.673 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:04:23.673 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Gothic t-shirt [164, 348, 775, 925] with a magic ball [298, 501, 666, 819] "You\'re gonna die soon"') 2023-11-03 05:04:40.414 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 05:04:40.415 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[060, 060, 526, 1000],[664, 241, 1000, 1000].') 2023-11-03 05:05:06.451 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:05:06.451 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect person,footwear and wheel in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'person:[000, 000, 114, 875],[000, 000, 143, 472],[417, 000, 548, 326],[535, 000, 717, 350];footwear:[020, 766, 097, 865],[068, 379, 109, 453];wheel:[130, 476, 286, 762],[655, 182, 790, 373],[671, 349, 858, 565].') 2023-11-03 05:05:31.192 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:05:31.193 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'This file photo taken on February 18, 2017 shows demonstrators dressed as custom officials [420, 006, 810, 995] at a mock customs checkpoint [003, 159, 454, 941] at the northern Ireland - Ireland border crossing') 2023-11-03 05:05:47.779 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:05:47.780 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Bracelet,Glasses and Hat in this image.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Bracelet:[376, 515, 431, 537],[092, 391, 130, 421];Glasses:[397, 145, 447, 164];Hat:[385, 088, 527, 153],[104, 069, 219, 114].') 2023-11-03 05:06:08.044 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:06:08.045 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Young girl [330, 406, 517, 713] practicing her figure skating') 2023-11-03 05:06:12.438 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:06:12.438 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Thanksgiving day table [006, 237, 989, 998] with turkey, pie, green beans [031, 329, 217, 437] all in Handi-Foil Pans [321, 305, 671, 556] [508, 750, 752, 960] [616, 475, 874, 720] [007, 321, 241, 501] [125, 438, 421, 650]') 2023-11-03 05:06:13.437 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:06:13.438 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[866, 469, 1000, 698],carFrame1:[346, 472, 380, 543],carFrame1:[481, 475, 526, 553],carFrame1:[680, 490, 801, 591],carFrame1:[086, 457, 181, 552],carFrame1:[601, 478, 659, 551],carFrame1:[351, 452, 500, 664] in this video clip.Adhere strictly to the format categoryFrame t:[xmin,ymin,xmax,ymax] when providing your answer.', 'carFrame1[866, 469, 1000, 698];Frame2:No object,carFrame1[346, 472, 380, 543];Frame2:No object,carFrame1:[481, 475, 526, 553];Frame2:[487, 465, 530, 543],carFrame1:[680, 490, 801, 591];Frame2:[750, 473, 951, 618],carFrame1:[086, 457, 181, 552];Frame2:[000, 448, 114, 562],carFrame1:[601, 478, 659, 551];Frame2:[631, 462, 702, 552],carFrame1:[351, 452, 500, 664];Frame2:[323, 443, 518, 705].') 2023-11-03 05:06:26.612 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:06:26.613 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Soccer:[272, 661, 326, 741];Person:[837, 103, 972, 628],[525, 199, 712, 879],[312, 134, 422, 682],[322, 145, 604, 753];Sneakers:[397, 653, 423, 685],[461, 631, 500, 732],[567, 704, 607, 759],[678, 839, 713, 883],[855, 549, 889, 619],[909, 595, 936, 630].') 2023-11-03 05:07:24.767 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:07:24.767 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A group of [358, 441, 510, 742] [453, 418, 654, 786] [172, 538, 337, 868] [639, 334, 874, 738] female desert bighorn sheep bask in the early morning light on [002, 002, 995, 997] sandstone cliffs; Colorado National Monument') 2023-11-03 05:07:47.838 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:07:47.839 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'human hair:[036, 166, 291, 415],[000, 108, 117, 286],[365, 354, 446, 438],[627, 466, 709, 585];human head:[031, 170, 304, 414],[000, 108, 125, 353],[328, 353, 438, 442],[531, 429, 596, 490],[630, 471, 707, 582],[863, 451, 910, 516];man:[708, 459, 915, 658],[531, 426, 648, 541];human arm:[250, 430, 696, 782],[000, 350, 408, 677],[684, 536, 744, 651],[880, 388, 998, 767];human body:[001, 106, 410, 999],[309, 358, 488, 755],[536, 443, 645, 524],[615, 463, 742, 822],[703, 451, 891, 589],[891, 380, 998, 853];mammal:[000, 098, 386, 999],[333, 350, 500, 739],[536, 421, 645, 546],[627, 466, 750, 853],[721, 453, 894, 621],[899, 376, 998, 872];clothing:[000, 361, 367, 997],[074, 389, 656, 894],[686, 484, 895, 618],[884, 377, 998, 864].') 2023-11-03 05:08:11.983 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:08:11.983 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "[032, 043, 799, 998] Lyon president Aulas laments failure to sell Depay: It wasn't the best outcome for us") 2023-11-03 05:08:16.745 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:08:16.745 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[183, 001, 969, 1000],microphoneFrame1:[202, 627, 367, 1000] in this video clip.Use only the categoryFrame t:[xmin,ymin,xmax,ymax] format to structure your answer.', 'personFrame1:[183, 001, 969, 1000];Frame2:[314, 001, 874, 1000],microphoneFrame1:[202, 627, 367, 1000];Frame2:[499, 819, 636, 998].') 2023-11-03 05:08:26.154 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:08:26.154 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[083, 057, 668, 998] Bride with [488, 675, 671, 983] cowboy boots dangling her feet near [461, 006, 850, 994] a brook') 2023-11-03 05:08:29.212 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:08:29.213 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Person:[017, 634, 102, 999],[178, 652, 268, 1000],[246, 664, 352, 1000].') 2023-11-03 05:08:52.233 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:08:52.233 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[698, 298, 913, 638],personFrame1:[632, 180, 760, 637],personFrame1:[271, 315, 500, 659],personFrame1:[578, 269, 664, 654],personFrame1:[560, 153, 705, 657] in this video clip.Do not deviate from the established classFrame t:[xmin,ymin,xmax,ymax] format in your answer.', 'personFrame1:[698, 298, 913, 638];Frame2:[719, 301, 890, 639],personFrame1:[632, 180, 760, 637];Frame2:[630, 194, 764, 633],personFrame1:[271, 315, 500, 659];Frame2:[297, 318, 484, 660],personFrame1:[578, 269, 664, 654];Frame2:[579, 262, 677, 654],personFrame1:[560, 153, 705, 657];Frame2:[565, 142, 705, 662].') 2023-11-03 05:09:07.316 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:09:07.317 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Person:[001, 010, 442, 999],[665, 172, 999, 999];Glasses:[242, 062, 274, 131];Hat:[126, 014, 266, 221],[738, 175, 885, 392].') 2023-11-03 05:09:31.677 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:09:31.678 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Other Shoes in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Other Shoes:[951, 597, 986, 645].') 2023-11-03 05:09:59.944 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:09:59.944 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect shelf,Person,Trash bin Can and Zebra in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'shelf:[397, 001, 473, 155];Person:[114, 088, 197, 226];Trash bin Can:[001, 353, 036, 442].') 2023-11-03 05:10:16.033 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 05:10:16.034 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Fan,Satchel and Gloves in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Fan:[100, 064, 151, 166],[029, 057, 082, 231],[000, 061, 019, 139];Satchel:[717, 233, 743, 276];Gloves:[715, 219, 732, 242].') 2023-11-03 05:10:31.045 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:10:31.045 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Trolley:[341, 761, 474, 904];Street Lights:[129, 572, 175, 748];Car:[114, 711, 135, 756];Person:[418, 673, 541, 880],[375, 670, 438, 772],[734, 678, 770, 763];Satchel:[464, 703, 539, 777],[721, 725, 742, 753];Sneakers:[461, 864, 489, 879];Hat:[453, 675, 493, 696];Lamp:[819, 572, 840, 590],[787, 568, 809, 584],[608, 549, 641, 590].') 2023-11-03 05:10:55.374 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:10:55.375 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Flower in this image.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Flower:[403, 477, 482, 604],[837, 526, 903, 593].') 2023-11-03 05:11:10.774 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:11:10.775 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[698, 297, 732, 448],carFrame1:[495, 298, 532, 356],truckFrame1:[484, 287, 516, 343],carFrame1:[337, 283, 377, 362],carFrame1:[000, 239, 061, 612],carFrame1:[030, 232, 267, 522],bicycleFrame1:[881, 369, 962, 499] in this video clip.To respond correctly, utilize the specified classFrame t:[xmin,ymin,xmax,ymax] format.', 'personFrame1:[698, 297, 732, 448];Frame2:[795, 296, 845, 496],carFrame1:[495, 298, 532, 356];Frame2:[498, 290, 537, 352],truckFrame1:[484, 287, 516, 343];Frame2:[483, 274, 518, 331],carFrame1:[337, 283, 377, 362];Frame2:[313, 263, 362, 361],carFrame1[000, 239, 061, 612];Frame2:No object,carFrame1:[030, 232, 267, 522];Frame2:[000, 152, 120, 587],bicycleFrame1[881, 369, 962, 499];Frame2:No object.') 2023-11-03 05:11:11.639 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:11:11.639 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'a modern industrial bathroom with concrete walls [000, 127, 826, 920] and a floor [003, 906, 862, 998], an exposed copper pipe [196, 094, 534, 578] and potted plants [450, 681, 896, 999] is a chic and functional space') 2023-11-03 05:11:52.455 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:11:52.456 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Frame:[042, 003, 570, 716],[133, 339, 384, 767];Flower:[332, 120, 561, 498];Desk:[039, 522, 956, 1000];Bottle:[717, 421, 900, 712].') 2023-11-03 05:12:02.719 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:12:02.720 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'man:[052, 463, 243, 990],[163, 429, 202, 519],[176, 451, 226, 579],[193, 463, 283, 680],[212, 491, 447, 844],[213, 429, 311, 529],[268, 710, 531, 999],[288, 460, 358, 615],[295, 440, 400, 710],[441, 440, 534, 620],[450, 450, 584, 694],[534, 425, 581, 503],[552, 441, 592, 527],[585, 461, 633, 529],[601, 608, 750, 999],[611, 424, 999, 770];human face:[104, 511, 156, 615],[353, 756, 410, 906],[387, 505, 427, 582],[410, 927, 552, 999],[510, 611, 558, 689],[563, 520, 591, 584],[592, 646, 649, 761],[885, 610, 935, 687],[971, 530, 999, 601].') 2023-11-03 05:12:43.566 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:12:43.566 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Frame:[090, 391, 172, 477];Towel:[133, 516, 168, 625];shelf:[013, 510, 064, 750],[178, 491, 334, 715],[182, 582, 312, 998],[927, 495, 989, 548];Carpet:[547, 599, 961, 817],[750, 546, 981, 673];Flower:[722, 413, 761, 467];Lamp:[655, 084, 807, 374],[795, 396, 832, 453];Chair:[858, 480, 949, 614],[518, 474, 604, 544];Couch:[817, 450, 940, 536];Pillow:[842, 451, 893, 511];Oven:[340, 334, 444, 513];Microwave:[343, 200, 446, 333].') 2023-11-03 05:12:44.536 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:12:44.536 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'person:[369, 330, 613, 942],[000, 105, 227, 999],[166, 501, 261, 999],[457, 905, 599, 999],[620, 359, 830, 999];clothing:[630, 420, 811, 898],[000, 426, 226, 999],[152, 567, 241, 766],[295, 470, 691, 999],[896, 462, 999, 808].') 2023-11-03 05:13:06.810 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:13:06.810 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the a lady is talking to the man and then takes up a bag from ground and then walks in the house.As you detail the trajectories in your reply, the TrackiFrame t:[xmin,ymin,xmax,ymax] format is the preferred structure.', 'There is no a lady is talking to the man and then takes up a bag from ground and then walks in the house.') 2023-11-03 05:13:35.312 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:13:35.313 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Traditional Makeup Mirrors Find Lighted Makeup Mirror') 2023-11-03 05:14:08.272 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:14:08.273 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Jeny [316, 060, 699, 966] wearing dark-blue pantyhose [320, 460, 608, 864] with no skirt on the road waits a car') 2023-11-03 05:14:30.128 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:14:30.128 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[364, 426, 400, 540],personFrame1:[259, 375, 294, 456],personFrame1:[353, 380, 387, 483],personFrame1:[127, 491, 167, 616],personFrame1:[346, 666, 379, 805],personFrame1:[810, 544, 855, 675],personFrame1:[891, 627, 941, 781] in this video clip.Your response should conform strictly to the categoryFrame t:[xmin,ymin,xmax,ymax] format.', 'personFrame1[364, 426, 400, 540];Frame2:No object,personFrame1:[259, 375, 294, 456];Frame2:[270, 370, 314, 450],personFrame1[353, 380, 387, 483];Frame2:No object,personFrame1:[127, 491, 167, 616];Frame2:[132, 480, 164, 613],personFrame1:[346, 666, 379, 805];Frame2:[373, 670, 418, 809],personFrame1[810, 544, 855, 675];Frame2:No object,personFrame1[891, 627, 941, 781];Frame2:No object.') 2023-11-03 05:14:37.792 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:14:37.793 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track truckFrame1:[324, 537, 387, 674],personFrame1:[749, 582, 790, 700],personFrame1:[171, 611, 210, 742],truckFrame1:[143, 543, 228, 674],personFrame1:[680, 534, 752, 858],personFrame1:[580, 516, 692, 862],personFrame1:[858, 452, 989, 867],personFrame1:[434, 604, 467, 713],personFrame1:[384, 595, 415, 716],personFrame1:[544, 523, 612, 842] in this video clip.Your response should conform strictly to the categoryFrame t:[xmin,ymin,xmax,ymax] format.', 'truckFrame1:[324, 537, 387, 674];Frame2:[323, 535, 388, 667],personFrame1[749, 582, 790, 700];Frame2:No object,personFrame1[171, 611, 210, 742];Frame2:No object,truckFrame1:[143, 543, 228, 674];Frame2:[147, 546, 228, 658],personFrame1:[680, 534, 752, 858];Frame2:[590, 536, 668, 851],personFrame1:[580, 516, 692, 862];Frame2:[417, 518, 505, 840],personFrame1:[858, 452, 989, 867];Frame2:[730, 448, 854, 871],personFrame1[434, 604, 467, 713];Frame2:No object,personFrame1[384, 595, 415, 716];Frame2:No object,personFrame1:[544, 523, 612, 842];Frame2:[438, 564, 523, 837].') 2023-11-03 05:14:52.270 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:14:52.270 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'suit:[832, 587, 947, 969],[220, 553, 366, 999],[444, 599, 543, 999],[640, 570, 750, 984],[753, 610, 867, 976];woman:[327, 520, 418, 658],[336, 551, 433, 999],[425, 526, 485, 646],[446, 520, 557, 966],[543, 526, 640, 993],[586, 506, 655, 658],[720, 512, 742, 590],[737, 530, 885, 973];man:[000, 437, 117, 884],[211, 473, 376, 999],[389, 460, 445, 641],[643, 485, 751, 999],[728, 497, 774, 640],[828, 506, 949, 964];building:[000, 000, 999, 588];human face:[282, 492, 314, 576],[340, 558, 376, 633],[446, 527, 475, 594],[488, 545, 516, 617],[573, 532, 611, 609],[685, 494, 716, 576],[733, 513, 762, 578],[785, 556, 820, 621],[863, 523, 900, 590].') 2023-11-03 05:15:33.648 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:15:33.648 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person and Gloves in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Person:[908, 250, 1000, 813],[599, 215, 727, 894],[145, 306, 340, 998];Gloves:[234, 576, 292, 640],[144, 679, 173, 748],[954, 473, 986, 550],[595, 474, 630, 540].') 2023-11-03 05:16:31.468 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:16:31.469 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Fan:[333, 002, 667, 153];Lamp:[443, 104, 561, 225],[507, 411, 591, 579],[875, 427, 977, 589];Chair:[188, 474, 436, 874],[691, 554, 999, 1000];Carpet:[195, 790, 695, 997];Couch:[574, 458, 868, 794];Picture:[528, 578, 585, 681],[587, 601, 641, 683],[040, 581, 091, 688],[094, 564, 137, 659];Clock:[421, 222, 508, 720];Flower:[845, 815, 1000, 999];Pillow:[674, 533, 770, 639];Person:[700, 318, 738, 392],[866, 128, 903, 199],[944, 098, 998, 175];Side Table:[028, 645, 192, 926].') 2023-11-03 05:16:34.767 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:16:34.768 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Person:[157, 294, 633, 824],[609, 545, 694, 822],[672, 585, 810, 791];Bicycle:[151, 469, 419, 886];Helmet:[249, 293, 364, 412];Sneakers:[445, 773, 556, 823].') 2023-11-03 05:16:42.456 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:16:42.456 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[130, 586, 263, 999],[395, 512, 503, 882],[535, 395, 689, 828],[448, 232, 598, 626],[793, 209, 941, 629],[274, 164, 370, 233];Car:[002, 002, 802, 642];Scooter:[900, 410, 999, 699],[266, 870, 381, 1000];Chair:[267, 194, 398, 398],[277, 166, 390, 250];Helmet:[350, 834, 440, 961];Handbag:[538, 577, 638, 708],[751, 451, 806, 553],[595, 473, 680, 580].') 2023-11-03 05:17:17.133 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:17:17.134 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[407, 508, 466, 605] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a brown airplane flying in the airFrame1:[407, 508, 466, 605];Frame2:[412, 506, 470, 605];Frame3:[413, 487, 471, 591];Frame4:[409, 473, 467, 584];Frame5:[406, 469, 464, 580].') 2023-11-03 05:17:22.634 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:17:22.635 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Street Lights:[751, 003, 851, 800],[600, 408, 645, 763],[548, 555, 579, 791];Hat:[704, 639, 761, 734],[478, 758, 545, 858],[619, 771, 670, 837],[562, 837, 610, 918],[280, 795, 337, 873],[438, 799, 477, 859],[899, 734, 983, 834];Tent:[753, 571, 907, 797],[629, 650, 709, 789],[561, 719, 618, 807].') 2023-11-03 05:17:26.981 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:17:26.981 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[076, 410, 183, 481],carFrame1:[449, 373, 487, 462],carFrame1:[545, 341, 688, 564],carFrame1:[509, 363, 580, 521],carFrame1:[619, 289, 995, 698] in this video clip.Ensure you use the exact format categoryFrame t:[xmin,ymin,xmax,ymax] in your response.', 'carFrame1:[076, 410, 183, 481];Frame2:[030, 410, 160, 491],carFrame1:[449, 373, 487, 462];Frame2:[460, 376, 509, 472],carFrame1:[545, 341, 688, 564];Frame2:[635, 278, 1000, 690],carFrame1:[509, 363, 580, 521];Frame2:[551, 343, 700, 553],carFrame1:[619, 289, 995, 698];Frame2:[898, 435, 999, 826].') 2023-11-03 05:17:31.281 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:17:31.281 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Signed up to the CBD e-scooter craze [415, 287, 715, 767]? You need to be aware of the physical and financial risks.') 2023-11-03 05:17:50.284 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:17:50.284 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Person:[883, 830, 999, 1000],[354, 002, 1000, 1000],[418, 594, 623, 997],[941, 180, 1000, 311],[523, 234, 636, 315],[260, 142, 667, 933],[210, 153, 365, 863],[002, 002, 288, 998];Hat:[527, 002, 978, 234],[942, 179, 999, 242],[322, 149, 484, 253],[212, 153, 355, 221],[002, 002, 239, 129];Boots:[430, 824, 553, 1000],[284, 786, 367, 859];Handbag:[290, 858, 435, 939];Other Shoes:[125, 961, 199, 999].') 2023-11-03 05:18:03.489 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:18:03.490 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[339, 448, 392, 537],carFrame1:[390, 458, 422, 531],carFrame1:[844, 428, 997, 582] in this video clip.When answering, maintain the format classFrame t:[xmin,ymin,xmax,ymax] without deviation.', 'carFrame1:[339, 448, 392, 537];Frame2:[340, 449, 395, 539],carFrame1:[390, 458, 422, 531];Frame2:[390, 456, 423, 531],carFrame1:[844, 428, 997, 582];Frame2:[687, 439, 950, 584].') 2023-11-03 05:18:24.511 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:18:24.511 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Picture:[947, 001, 999, 221],[482, 001, 948, 176],[000, 001, 240, 174];Desk:[307, 440, 603, 653];Chair:[592, 634, 958, 1000];Person:[000, 137, 128, 911],[072, 255, 311, 922],[034, 001, 266, 372],[214, 401, 526, 1000],[445, 301, 777, 999],[588, 229, 983, 694],[452, 185, 606, 421],[536, 199, 698, 454];Necklace:[424, 184, 551, 392];Hat:[250, 193, 349, 278];Tie:[507, 282, 536, 357];Glasses:[706, 293, 779, 326];Sneakers:[027, 771, 054, 819],[034, 829, 078, 907];Cup:[530, 450, 574, 548],[549, 412, 584, 457],[537, 362, 567, 413],[469, 348, 497, 415],[467, 313, 493, 352],[372, 317, 396, 374].') 2023-11-03 05:19:00.027 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:19:00.028 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[006, 302, 339, 643],[038, 366, 380, 896],[406, 368, 671, 762],[550, 349, 625, 414],[545, 355, 748, 678],[617, 322, 738, 421],[712, 343, 866, 612];Sneakers:[197, 846, 265, 897],[160, 782, 210, 840],[490, 656, 540, 710],[539, 724, 594, 762],[596, 650, 645, 675],[739, 586, 777, 612],[786, 580, 823, 608],[775, 556, 799, 577],[819, 536, 849, 566],[837, 540, 856, 556],[877, 515, 903, 530],[864, 489, 885, 509],[957, 497, 982, 517];Hat:[099, 304, 190, 346],[866, 346, 906, 367];Gloves:[035, 479, 070, 513],[432, 535, 484, 563],[541, 497, 605, 522],[899, 416, 915, 428].') 2023-11-03 05:19:05.709 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:19:05.709 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'building [176, 156, 970, 857] a simple hydroponic system') 2023-11-03 05:19:31.577 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:19:31.578 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'footwear:[613, 826, 672, 938],[072, 904, 110, 988],[357, 869, 387, 942],[393, 861, 432, 942],[431, 791, 477, 865],[453, 779, 492, 845],[531, 843, 581, 932],[733, 697, 765, 768];miniskirt:[592, 475, 695, 625];dress:[338, 384, 462, 679],[483, 345, 592, 692],[707, 337, 795, 526],[846, 220, 951, 529];woman:[000, 143, 106, 986],[000, 268, 107, 998],[000, 125, 138, 819],[341, 190, 467, 951],[423, 123, 507, 861],[486, 156, 618, 945],[585, 158, 753, 912],[675, 143, 731, 662],[696, 149, 810, 774],[832, 123, 946, 770];girl:[000, 268, 110, 992],[341, 186, 467, 942],[346, 130, 947, 951],[431, 121, 510, 860],[485, 151, 622, 930],[585, 154, 758, 902],[706, 136, 805, 753],[832, 111, 946, 757].') 2023-11-03 05:19:49.317 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:19:49.317 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the person sitting on a sailboat turning then moving to the right.Stick to the TrackiFrame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'There is no person sitting on a sailboat turning then moving to the right.') 2023-11-03 05:20:06.153 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:20:06.153 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'bottle:[011, 586, 105, 969],[290, 700, 355, 987];man:[281, 275, 709, 884],[000, 301, 305, 818],[236, 200, 433, 535],[571, 167, 708, 453],[618, 028, 999, 999];wine:[017, 579, 109, 955],[000, 758, 066, 999],[135, 761, 246, 887],[290, 906, 373, 999],[580, 971, 675, 999],[636, 855, 736, 973];clothing:[291, 390, 717, 880],[018, 461, 352, 710],[238, 261, 396, 505],[579, 229, 701, 450],[662, 406, 999, 999];beer:[012, 584, 150, 982],[130, 718, 258, 935],[250, 700, 370, 999],[480, 687, 632, 994],[558, 897, 701, 997],[624, 795, 752, 999];wine glass:[132, 715, 246, 892],[274, 837, 372, 999],[482, 701, 616, 999],[562, 895, 690, 994],[635, 803, 735, 999].') 2023-11-03 05:20:35.164 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:20:35.165 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'clothing:[134, 315, 563, 945],[000, 058, 999, 999];man:[000, 282, 094, 786],[000, 293, 211, 836],[000, 222, 351, 998],[006, 129, 196, 469],[006, 161, 550, 999],[171, 309, 227, 495],[178, 130, 815, 999],[436, 123, 503, 267],[440, 214, 962, 999],[465, 135, 595, 487],[468, 130, 552, 266],[586, 000, 918, 463],[722, 000, 999, 999];human face:[000, 365, 045, 477],[033, 364, 096, 498],[059, 175, 104, 261],[206, 301, 299, 463],[316, 261, 446, 430],[565, 228, 692, 422],[621, 053, 745, 228],[764, 314, 912, 516],[916, 079, 999, 337].') 2023-11-03 05:20:51.828 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:20:51.828 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "I know it shouldn't bother me, but sometimes I get jealous that my younger brother [477, 067, 994, 997] is more popular than me. I hate that it bothers me.") 2023-11-03 05:20:55.202 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:20:55.203 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect land vehicle and man in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'land vehicle:[000, 000, 999, 999];man:[144, 559, 376, 722],[028, 556, 209, 651],[037, 580, 159, 684].') 2023-11-03 05:20:57.334 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:20:57.334 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Cup:[688, 420, 704, 446];Bottle:[893, 387, 912, 468];Chair:[234, 368, 454, 873],[753, 541, 821, 704],[735, 440, 780, 628];Desk:[032, 591, 285, 981],[858, 454, 987, 626];shelf:[658, 440, 739, 583],[630, 525, 712, 632];Storage box:[607, 367, 647, 402],[043, 505, 277, 647];Potted Plant:[982, 256, 999, 325];Flag:[012, 001, 096, 238];Person:[301, 252, 529, 863],[089, 779, 262, 999],[754, 336, 873, 587];Book:[756, 469, 837, 539];Slippers:[387, 827, 463, 864],[423, 805, 461, 840];Bakset:[909, 415, 977, 483],[917, 328, 975, 400],[680, 538, 732, 570],[699, 558, 737, 591],[671, 466, 739, 523];Other Shoes:[808, 566, 828, 599];Blackboard:[001, 001, 671, 494].') 2023-11-03 05:21:12.778 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:21:12.779 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Sensory Issues and Public Restrooms [000, 005, 996, 995] just don't mix well! Here is one of those moments for our family and what we did to make it a little less stressful! #autism #spd #asd") 2023-11-03 05:21:33.460 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:21:33.460 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'gifts for the cyclist [451, 185, 800, 871]') 2023-11-03 05:21:56.474 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:21:56.475 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Dog:[339, 596, 493, 829],[594, 326, 622, 366];SUV:[000, 189, 086, 394],[097, 181, 262, 392],[311, 177, 467, 366],[503, 156, 688, 372];Car:[686, 216, 824, 358];Van:[757, 161, 999, 368];Wheelchair:[097, 498, 476, 999];Person:[770, 098, 914, 885],[549, 181, 673, 470],[347, 151, 504, 631],[147, 328, 494, 954];Leather Shoes:[411, 820, 495, 882],[378, 841, 444, 926];High Heels:[793, 833, 874, 879];Satchel:[778, 532, 826, 693];Desk:[343, 455, 994, 811];Picture:[720, 399, 767, 484].') 2023-11-03 05:22:12.671 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:22:12.672 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'person:[144, 357, 214, 693],[244, 281, 271, 386],[301, 271, 328, 368],[315, 248, 344, 364],[351, 266, 376, 364],[594, 740, 766, 997],[701, 382, 803, 661],[759, 300, 780, 386],[787, 310, 810, 386],[809, 386, 863, 594],[821, 298, 849, 443],[851, 426, 910, 653],[855, 335, 893, 432],[868, 561, 999, 921];wheel:[420, 455, 580, 723],[282, 615, 395, 731],[640, 447, 691, 580];tire:[417, 453, 585, 736],[288, 616, 396, 721],[641, 452, 689, 576];car:[254, 209, 701, 727],[138, 302, 211, 351],[226, 270, 305, 340],[335, 268, 418, 329],[375, 250, 451, 290],[490, 231, 568, 265].') 2023-11-03 05:22:16.885 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:22:16.886 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'An image that shows a wooden playset [005, 007, 939, 993] with a steel slide [597, 386, 950, 996] attached to it.') 2023-11-03 05:22:20.117 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:22:20.118 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Tie:[819, 507, 894, 948],[518, 488, 585, 845];Person:[002, 003, 325, 1000],[635, 240, 1000, 999],[377, 290, 732, 999],[300, 321, 507, 1000];Chair:[259, 483, 376, 872],[477, 532, 729, 1000].') 2023-11-03 05:22:47.985 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:22:47.985 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'chair:[787, 307, 954, 650],[000, 355, 111, 642],[000, 749, 267, 999],[078, 375, 255, 786],[234, 828, 331, 893],[435, 234, 532, 430],[519, 240, 597, 437],[553, 599, 791, 999],[645, 250, 704, 373],[676, 206, 744, 306],[698, 279, 811, 611],[708, 245, 790, 397],[721, 275, 807, 437],[776, 258, 835, 399],[821, 554, 999, 999],[853, 255, 939, 327],[956, 227, 999, 334];table:[127, 426, 835, 983],[000, 508, 133, 884],[470, 238, 584, 410],[684, 251, 934, 403],[828, 325, 999, 617];man:[191, 243, 498, 999],[429, 218, 789, 999];clothing:[486, 256, 796, 941],[145, 362, 484, 999];human face:[375, 277, 459, 426],[582, 245, 659, 381].') 2023-11-03 05:22:52.434 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 05:22:52.434 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Potted Plant:[197, 384, 356, 533],[002, 590, 248, 875];Vase:[761, 216, 841, 295],[765, 142, 806, 184],[589, 429, 624, 472];Flower:[588, 310, 846, 530];bucket:[396, 492, 499, 614],[636, 668, 865, 983];Faucet:[568, 346, 747, 447].') 2023-11-03 05:23:28.021 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:23:28.021 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[000, 328, 082, 926],[150, 259, 302, 979],[436, 142, 618, 999],[593, 268, 783, 1000];Flag:[764, 001, 994, 769];Hat:[037, 533, 071, 612];Boots:[030, 822, 080, 892],[013, 854, 058, 927],[118, 793, 156, 883],[162, 774, 200, 849],[212, 863, 249, 978],[239, 837, 299, 917],[308, 929, 351, 1000],[403, 933, 434, 971];Speaker:[411, 254, 458, 346];Flower:[690, 482, 803, 801];Chair:[416, 546, 443, 719].') 2023-11-03 05:24:04.614 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:24:04.614 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Stuffed Toy:[318, 443, 387, 535];Person:[546, 521, 621, 644],[436, 498, 524, 643];Book:[024, 132, 226, 342];Telephone:[678, 179, 1000, 406];Laptop:[002, 495, 192, 988];Moniter:[140, 248, 1000, 1000].') 2023-11-03 05:24:16.117 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 05:24:16.117 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'sunglasses:[566, 292, 783, 453].') 2023-11-03 05:24:16.948 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:24:16.949 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'man:[000, 625, 079, 837],[063, 275, 112, 390],[115, 645, 285, 997],[128, 319, 215, 495],[172, 789, 457, 999],[212, 335, 313, 604],[331, 197, 661, 999],[375, 249, 433, 329],[391, 695, 564, 999],[517, 385, 620, 510],[628, 361, 680, 521],[645, 534, 765, 839],[680, 275, 720, 368],[681, 778, 845, 999],[710, 336, 761, 463],[738, 657, 856, 961],[772, 304, 841, 500],[804, 416, 870, 537],[828, 725, 965, 999],[830, 452, 925, 657],[854, 630, 981, 762],[933, 518, 999, 815];clothing:[000, 250, 999, 999].') 2023-11-03 05:24:55.355 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:24:55.356 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[744, 424, 823, 500],carFrame1:[642, 481, 718, 573],carFrame1:[309, 463, 457, 693],carFrame1:[499, 474, 577, 589],carFrame1:[311, 394, 392, 512],carFrame1:[706, 474, 779, 575],carFrame1:[897, 535, 999, 956] in this video clip.Your reply should be in alignment with the classFrame t:[xmin,ymin,xmax,ymax] structure.', 'carFrame1:[744, 424, 823, 500];Frame2:[805, 406, 875, 489],carFrame1:[642, 481, 718, 573];Frame2:[680, 470, 763, 557],carFrame1:[309, 463, 457, 693];Frame2:[309, 461, 451, 686],carFrame1:[499, 474, 577, 589];Frame2:[513, 463, 605, 594],carFrame1:[311, 394, 392, 512];Frame2:[306, 388, 390, 596],carFrame1:[706, 474, 779, 575];Frame2:[751, 464, 830, 581],carFrame1[897, 535, 999, 956];Frame2:No object.') 2023-11-03 05:25:11.252 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:25:11.252 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Person:[323, 779, 346, 867],[301, 777, 317, 859],[286, 796, 303, 845];Paddle:[440, 880, 537, 922],[416, 843, 464, 869].') 2023-11-03 05:25:11.596 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 05:25:11.597 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Car:[892, 279, 998, 377],[030, 103, 945, 963],[002, 319, 080, 478].') 2023-11-03 05:25:26.034 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:25:26.034 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[211, 099, 769, 652] Young woman doing yoga exercises in the autumn city park lake. Fall with mirror in water') 2023-11-03 05:25:32.689 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:25:32.689 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'photo of Georgena Moran capturing video as she moves along [127, 499, 697, 998] a trail in [381, 559, 536, 831] her wheelchair') 2023-11-03 05:25:46.344 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:25:46.344 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Bench:[246, 784, 411, 994],[125, 638, 441, 699],[113, 584, 345, 636],[774, 933, 836, 999];Chair:[791, 513, 902, 767];Desk:[335, 648, 882, 1000],[311, 525, 495, 690],[832, 432, 901, 572],[488, 448, 592, 516];Guitar:[150, 541, 327, 620];Person:[137, 416, 327, 789],[188, 556, 308, 904],[357, 427, 442, 802],[495, 428, 645, 726],[564, 366, 607, 519],[613, 343, 649, 419],[690, 295, 735, 376],[605, 470, 690, 763],[657, 473, 811, 826],[881, 253, 994, 804];Book:[472, 725, 540, 784],[364, 655, 400, 703];Hat:[216, 555, 264, 630],[430, 609, 468, 656];Sneakers:[358, 753, 382, 801];Handbag:[829, 834, 943, 998];Cup:[422, 675, 448, 725],[459, 658, 484, 709];Watermelon:[663, 760, 736, 874].') 2023-11-03 05:26:18.594 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:26:18.594 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[638, 575, 882, 888] and track its trajectory.Your response should highlight trajectories using the established Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a reconnaissance vehicleFrame1:[638, 575, 882, 888];Frame2:[615, 556, 871, 883];Frame3:[580, 529, 853, 881];Frame4:[567, 522, 848, 884].') 2023-11-03 05:26:24.840 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:26:24.840 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[482, 400, 518, 511],personFrame1:[430, 301, 467, 393],personFrame1:[007, 368, 053, 465],personFrame1:[571, 341, 606, 441],personFrame1:[753, 352, 800, 451],personFrame1:[163, 297, 200, 411],personFrame1:[295, 495, 331, 623] in this video clip.Keep your response consistent with the classFrame t:[xmin,ymin,xmax,ymax] format.', 'personFrame1:[482, 400, 518, 511];Frame2:[404, 388, 439, 501],personFrame1:[430, 301, 467, 393];Frame2:[351, 291, 396, 377],personFrame1[007, 368, 053, 465];Frame2:No object,personFrame1[571, 341, 606, 441];Frame2:No object,personFrame1:[753, 352, 800, 451];Frame2:[678, 341, 717, 440],personFrame1[163, 297, 200, 411];Frame2:No object,personFrame1[295, 495, 331, 623];Frame2:No object.') 2023-11-03 05:26:30.541 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:26:30.541 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[759, 259, 820, 462],personFrame1:[558, 191, 610, 386],personFrame1:[317, 219, 355, 443],personFrame1:[563, 133, 602, 316],personFrame1:[704, 298, 758, 494],personFrame1:[808, 462, 890, 683],personFrame1:[696, 443, 818, 620] in this video clip.Use only the categoryFrame t:[xmin,ymin,xmax,ymax] format to structure your answer.', 'personFrame1:[759, 259, 820, 462];Frame2:[753, 251, 809, 441],personFrame1:[558, 191, 610, 386];Frame2:[549, 188, 602, 373],personFrame1:[317, 219, 355, 443];Frame2:[296, 209, 340, 431],personFrame1:[563, 133, 602, 316];Frame2:[556, 122, 595, 305],personFrame1:[704, 298, 758, 494];Frame2:[699, 300, 735, 479],personFrame1:[808, 462, 890, 683];Frame2:[802, 448, 877, 670],personFrame1:[696, 443, 818, 620];Frame2:[731, 458, 820, 622].') 2023-11-03 05:26:44.220 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:26:44.221 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given frame1: and frame2:,track carFrame1:[375, 451, 418, 507],carFrame1:[704, 459, 777, 558],carFrame1:[757, 424, 930, 594] in this video clip.It's imperative that you stick to the categoryFrame t:[xmin,ymin,xmax,ymax] format for your answer.", 'carFrame1:[375, 451, 418, 507];Frame2:[376, 451, 418, 507],carFrame1:[704, 459, 777, 558];Frame2:[704, 459, 778, 559],carFrame1:[757, 424, 930, 594];Frame2:[757, 424, 930, 593].') 2023-11-03 05:26:47.083 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:26:47.083 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[178, 239, 898, 789] and track its trajectory.Stick to the Frame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'This is a armadilloFrame1:[178, 239, 898, 789];Frame2:[282, 235, 997, 761];Frame3:[347, 231, 998, 754];Frame4:[252, 208, 798, 746].') 2023-11-03 05:27:14.764 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:27:14.764 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[352, 310, 421, 401],personFrame1:[801, 262, 847, 446],carFrame1:[150, 323, 233, 415],personFrame1:[714, 268, 750, 421],personFrame1:[844, 262, 875, 397] in this video clip.Use only the categoryFrame t:[xmin,ymin,xmax,ymax] format to structure your answer.', 'carFrame1:[352, 310, 421, 401];Frame2:[370, 319, 425, 393],personFrame1:[801, 262, 847, 446];Frame2:[808, 281, 851, 463],carFrame1:[150, 323, 233, 415];Frame2:[142, 337, 227, 433],personFrame1:[714, 268, 750, 421];Frame2:[703, 280, 741, 440],personFrame1[844, 262, 875, 397];Frame2:No object.') 2023-11-03 05:27:57.901 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:27:57.902 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Crane [004, 071, 901, 722] in Amsterdam Noord. This crane houses a cool hotel for those looking for an adventurous experience in the Netherlands as you can bungee off the top! #amsterdam #netherlands') 2023-11-03 05:28:12.128 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:28:12.129 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'person:[159, 309, 356, 623],[352, 435, 631, 571],[460, 498, 610, 530];human body:[147, 330, 374, 598],[352, 493, 643, 604],[453, 510, 553, 535];human head:[201, 325, 318, 399],[508, 435, 595, 515];mammal:[134, 326, 356, 593],[361, 429, 626, 572],[433, 502, 608, 536];human hair:[201, 317, 305, 390],[442, 428, 601, 512];human arm:[231, 493, 399, 578],[309, 495, 411, 528],[361, 499, 526, 540],[424, 504, 636, 567];human hand:[473, 501, 547, 527].') 2023-11-03 05:28:41.001 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 05:28:41.002 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Person:[001, 540, 086, 999],[165, 000, 999, 999];Necklace:[671, 255, 748, 313];Bracelet:[001, 873, 056, 990];Hat:[308, 000, 740, 150];Belt:[407, 851, 896, 907].') 2023-11-03 05:28:55.383 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:28:55.383 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'man:[628, 597, 986, 1000],[001, 000, 999, 998];hat:[000, 186, 371, 651],[687, 576, 774, 728];human face:[160, 274, 347, 655],[605, 568, 695, 758];microphone:[394, 418, 633, 549].') 2023-11-03 05:29:02.909 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:29:02.910 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Photograph of a longhorn beetle [175, 164, 848, 869] on flowers [000, 003, 998, 998]') 2023-11-03 05:30:07.862 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:30:07.863 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'man:[758, 138, 991, 1000],[088, 198, 196, 607],[211, 195, 276, 330],[477, 182, 521, 297],[702, 221, 790, 379];footwear:[218, 796, 300, 914],[505, 908, 563, 999],[568, 879, 607, 978];jeans:[783, 624, 981, 993],[210, 421, 253, 690],[460, 799, 606, 950];woman:[182, 239, 220, 533],[208, 223, 361, 403],[210, 367, 398, 714],[281, 234, 313, 312],[312, 245, 370, 350],[336, 230, 432, 498],[371, 441, 636, 999],[419, 218, 510, 540],[448, 221, 598, 829],[575, 276, 616, 470],[619, 245, 700, 610],[666, 237, 711, 620],[957, 257, 999, 451];girl:[221, 350, 371, 900],[321, 521, 405, 758],[641, 560, 770, 999],[720, 410, 783, 676].') 2023-11-03 05:31:25.841 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:31:25.842 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[258, 352, 464, 644] and track its trajectory.When detailing trajectories in your response, adhere to the Frame t:[xmin,ymin,xmax,ymax] format.', 'This is a dog colored in white and black playing in front of a mirrorFrame1:[258, 352, 464, 644];Frame2:[254, 347, 464, 650];Frame3:[256, 358, 464, 638];Frame4:[256, 363, 468, 644].') 2023-11-03 05:31:52.569 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:31:52.569 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video clip including frame1,frame2 and frame3, what is this objectframe:1:[201, 514, 871, 604];frame:2:[196, 514, 866, 602];frame:3:[170, 514, 835, 590] doing?Using the given image frames and trajectory coordinates, please provide a concise answer.Refer to the supplied image frames and trajectory coordinates for a short response to the question.Please give a succinct reply by referring to the image frames and trajectory coordinates provided.Make sure your response is brief and references the image frames and trajectory coordinates you've been given.Draw upon the provided image frames and trajectory coordinates for a concise answer to the question.Relying on the image frames and trajectory coordinates shared, please keep your answer brief.", 'This is a/an ferry stationary on water') 2023-11-03 05:31:53.418 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:31:53.418 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect man,sunglasses,suit and dress in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'man:[851, 199, 990, 670],[057, 079, 243, 678],[245, 118, 431, 839],[425, 076, 618, 804],[501, 393, 611, 804],[748, 203, 896, 753];sunglasses:[071, 573, 175, 619],[410, 711, 491, 773],[580, 723, 646, 781],[866, 719, 933, 773],[911, 601, 956, 687];suit:[424, 065, 628, 783],[000, 285, 071, 555],[065, 184, 248, 681],[216, 314, 279, 738],[247, 224, 438, 875],[608, 353, 641, 640],[634, 296, 756, 690],[746, 288, 902, 771],[874, 292, 975, 609];dress:[000, 651, 271, 999],[250, 769, 536, 997],[628, 293, 752, 688].') 2023-11-03 05:32:13.230 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:32:13.231 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Wild Bird:[339, 903, 392, 957],[053, 716, 161, 817],[069, 456, 117, 496],[649, 518, 687, 559],[561, 304, 626, 388],[411, 101, 527, 240],[157, 177, 209, 240].') 2023-11-03 05:32:23.643 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:32:23.643 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Everything you need to know about the third collaboration between Uniqlo and JW Anderson [431, 032, 598, 845]') 2023-11-03 05:32:37.641 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:32:37.641 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Chair:[916, 834, 999, 999];Potted Plant:[001, 206, 034, 378];Umbrella:[709, 413, 783, 497].') 2023-11-03 05:32:45.857 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:32:45.857 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[639, 152, 939, 924],[378, 074, 677, 910],[058, 068, 393, 944];Necklace:[695, 251, 777, 293];Glasses:[247, 108, 366, 134];Tie:[291, 207, 342, 335],[513, 201, 559, 325];Leather Shoes:[084, 893, 172, 947],[199, 866, 336, 917],[395, 858, 476, 911],[592, 861, 675, 910];High Heels:[675, 833, 797, 903],[743, 851, 901, 924].') 2023-11-03 05:32:52.197 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:32:52.197 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[398, 497, 432, 723],personFrame1:[593, 497, 640, 746],personFrame1:[468, 510, 534, 800],personFrame1:[323, 489, 391, 758],personFrame1:[355, 494, 409, 741],personFrame1:[521, 498, 604, 776],personFrame1:[499, 508, 539, 727] in this video clip.Use only the categoryFrame t:[xmin,ymin,xmax,ymax] format to structure your answer.', 'personFrame1:[398, 497, 432, 723];Frame2:[387, 512, 452, 722],personFrame1:[593, 497, 640, 746];Frame2:[594, 495, 634, 735],personFrame1:[468, 510, 534, 800];Frame2:[456, 557, 528, 800],personFrame1:[323, 489, 391, 758];Frame2:[354, 487, 408, 757],personFrame1:[355, 494, 409, 741];Frame2:[347, 477, 405, 737],personFrame1:[521, 498, 604, 776];Frame2:[521, 529, 593, 777],personFrame1:[499, 508, 539, 727];Frame2:[493, 513, 537, 744].') 2023-11-03 05:33:11.711 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 05:33:11.711 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Cup,Chopsticks and Bowl/Basin in this image.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Cup:[606, 125, 682, 312];Chopsticks:[304, 685, 434, 999],[464, 053, 515, 257];Bowl:[002, 770, 257, 999],[351, 199, 463, 266],[324, 247, 430, 305],[914, 340, 1000, 428],[895, 417, 998, 536],[564, 467, 999, 929],[889, 880, 1000, 997].') 2023-11-03 05:33:38.207 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:33:38.207 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Desk:[128, 801, 759, 1000],[785, 666, 999, 999],[000, 605, 330, 999],[010, 480, 253, 603],[516, 507, 765, 820],[370, 438, 544, 656],[270, 402, 406, 561],[000, 432, 163, 502],[000, 393, 118, 440];Napkin:[920, 647, 980, 709],[546, 793, 633, 903],[309, 777, 388, 891],[194, 594, 253, 661],[024, 609, 092, 665],[689, 507, 725, 552],[600, 488, 634, 526],[553, 509, 584, 552],[295, 398, 313, 418],[092, 434, 120, 461];shelf:[690, 389, 792, 665];Lamp:[144, 064, 212, 170],[289, 000, 409, 097],[436, 093, 461, 312],[485, 101, 508, 311],[533, 108, 551, 311];Cup:[966, 718, 989, 749],[937, 713, 966, 784],[904, 665, 930, 726],[457, 810, 489, 896],[309, 845, 340, 940],[390, 973, 443, 999],[136, 471, 148, 505],[591, 883, 625, 987],[592, 499, 604, 524];Fork:[965, 790, 990, 845],[982, 790, 1000, 830];Knife:[503, 829, 556, 877],[296, 846, 333, 905],[043, 713, 113, 755],[203, 679, 254, 698],[030, 667, 059, 689];Spoon:[509, 825, 543, 849],[492, 816, 527, 844],[047, 722, 098, 761].') 2023-11-03 05:33:47.755 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:33:47.755 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'plant:[000, 035, 029, 110],[000, 159, 050, 293],[000, 001, 082, 064],[000, 384, 101, 443],[034, 157, 082, 226],[057, 593, 185, 665],[142, 095, 237, 187],[203, 612, 401, 779],[421, 265, 484, 285],[462, 407, 556, 518],[475, 726, 582, 862],[509, 007, 681, 153],[571, 862, 787, 953],[673, 809, 765, 889],[695, 170, 756, 242],[826, 490, 907, 575],[857, 150, 942, 196],[857, 657, 998, 934],[964, 103, 998, 168].') 2023-11-03 05:33:50.796 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:33:50.796 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The path [074, 070, 922, 777] traced across the sky by 324 Bamberga around the time of opposition') 2023-11-03 05:33:59.930 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 05:33:59.930 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[285, 078, 988, 1000],[002, 530, 311, 1000];Backpack:[653, 308, 953, 649].') 2023-11-03 05:34:01.383 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:34:01.383 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[092, 316, 164, 550],personFrame1:[510, 437, 592, 672],personFrame1:[375, 300, 431, 506],personFrame1:[317, 362, 361, 604],personFrame1:[403, 426, 484, 675],personFrame1:[614, 302, 671, 529],personFrame1:[292, 245, 338, 452] in this video clip.To respond correctly, utilize the specified classFrame t:[xmin,ymin,xmax,ymax] format.', 'personFrame1:[092, 316, 164, 550];Frame2:[056, 318, 107, 568],personFrame1:[510, 437, 592, 672];Frame2:[497, 423, 571, 677],personFrame1:[375, 300, 431, 506];Frame2:[342, 298, 393, 530],personFrame1:[317, 362, 361, 604];Frame2:[250, 370, 303, 625],personFrame1:[403, 426, 484, 675];Frame2:[435, 431, 503, 641],personFrame1:[614, 302, 671, 529];Frame2:[579, 316, 634, 543],personFrame1:[292, 245, 338, 452];Frame2:[265, 245, 300, 450].') 2023-11-03 05:34:13.699 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:34:13.699 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[001, 002, 160, 1000],personFrame1:[414, 002, 835, 1000],personFrame1:[304, 314, 403, 645] in this video clip.To respond correctly, utilize the specified classFrame t:[xmin,ymin,xmax,ymax] format.', 'personFrame1[001, 002, 160, 1000];Frame2:No object,personFrame1:[414, 002, 835, 1000];Frame2:[353, 002, 754, 1000],personFrame1:[304, 314, 403, 645];Frame2:[190, 287, 301, 685].') 2023-11-03 05:34:15.444 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:34:15.445 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Clock:[087, 176, 116, 244];Person:[501, 060, 928, 1000],[446, 180, 619, 775],[317, 267, 510, 697],[284, 260, 439, 682];Glasses:[107, 861, 251, 950],[217, 655, 299, 699];Hat:[283, 259, 369, 357];Cup:[354, 871, 492, 998],[375, 692, 423, 838],[268, 600, 299, 677],[287, 336, 360, 431],[249, 405, 294, 517],[449, 248, 558, 439],[401, 300, 479, 445];Chopsticks:[519, 839, 635, 916];Bottle:[411, 841, 485, 998],[157, 588, 217, 831],[204, 568, 242, 692];Basin:[318, 823, 411, 889],[296, 682, 361, 737];Plate:[287, 739, 361, 782];Chair:[601, 504, 997, 997];Purse:[185, 818, 300, 900];Bakset:[002, 611, 114, 725];Desk:[100, 510, 857, 998].') 2023-11-03 05:34:29.995 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 05:34:29.996 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'flower:[722, 581, 835, 761],[368, 708, 455, 781],[453, 613, 567, 735],[485, 840, 563, 936],[643, 585, 716, 668],[655, 613, 747, 726],[666, 715, 733, 785],[727, 425, 783, 500],[760, 166, 812, 278],[801, 165, 865, 221],[870, 165, 912, 228].') 2023-11-03 05:34:59.907 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:34:59.908 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[533, 258, 837, 999],[361, 236, 591, 999],[077, 223, 430, 999];Hat:[222, 224, 367, 383],[366, 235, 494, 376],[581, 259, 699, 390];Tie:[287, 517, 379, 894];Desk:[546, 738, 957, 999];Chair:[002, 866, 105, 1000],[006, 732, 149, 999].') 2023-11-03 05:35:11.899 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:35:11.900 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Set of [057, 117, 937, 872] vertical banner templates with a coin of crypto currency, bitcoin, ethereum, ripple, lightcoin and eos.') 2023-11-03 05:35:27.952 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:35:27.952 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3,frame4 and frame5,please tell me the trajectory of the man wearing black jacket and blue jeans.When detailing trajectories in your response, adhere to the TrackiFrame t:[xmin,ymin,xmax,ymax] format.', 'There is one man wearing black jacket and blue jeans.Track1frame1:[338, 662, 733, 1000];frame2:[199, 692, 459, 1000].') 2023-11-03 05:36:30.567 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:36:30.568 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video clip including frame1,frame2 and frame3, what is this objectframe:1:[713, 012, 895, 366];frame:2:[526, 001, 759, 287];frame:3:[457, 001, 595, 303] doing?Using the given image frames and trajectory coordinates, please provide a concise answer.Refer to the supplied image frames and trajectory coordinates for a short response to the question.Please give a succinct reply by referring to the image frames and trajectory coordinates provided.Make sure your response is brief and references the image frames and trajectory coordinates you've been given.Draw upon the provided image frames and trajectory coordinates for a concise answer to the question.Relying on the image frames and trajectory coordinates shared, please keep your answer brief.", 'This is a/an lamb moving forward') 2023-11-03 05:37:11.566 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:37:11.567 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Couch:[011, 602, 429, 999];Lamp:[001, 655, 095, 1000],[296, 467, 367, 647],[418, 253, 453, 404],[329, 256, 350, 402];Mirror:[312, 183, 412, 632];Desk:[374, 525, 493, 672],[332, 523, 392, 630];Chair:[338, 551, 365, 631],[408, 532, 463, 679],[420, 506, 464, 636];Carpet:[373, 820, 491, 998];Cabinet:[535, 307, 575, 414],[573, 305, 634, 414],[638, 254, 703, 653],[617, 491, 641, 621],[536, 491, 620, 583],[720, 674, 1000, 998],[767, 018, 960, 803];Microwave:[596, 452, 627, 486];Refrigerator:[638, 413, 656, 651];TV:[873, 527, 1000, 941].') 2023-11-03 05:37:53.205 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:37:53.206 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[122, 547, 194, 712],personFrame1:[553, 516, 632, 719],personFrame1:[158, 525, 210, 654],personFrame1:[830, 554, 893, 793],personFrame1:[406, 500, 470, 622],personFrame1:[488, 551, 558, 789],personFrame1:[368, 466, 419, 585],personFrame1:[236, 526, 298, 654],personFrame1:[318, 490, 378, 620],personFrame1:[610, 393, 675, 579] in this video clip.Your reply should be in alignment with the classFrame t:[xmin,ymin,xmax,ymax] structure.', 'personFrame1:[122, 547, 194, 712];Frame2:[122, 575, 205, 712],personFrame1:[553, 516, 632, 719];Frame2:[552, 549, 634, 719],personFrame1:[158, 525, 210, 654];Frame2:[160, 527, 211, 654],personFrame1:[830, 554, 893, 793];Frame2:[830, 591, 898, 793],personFrame1:[406, 500, 470, 622];Frame2:[407, 500, 470, 622],personFrame1:[488, 551, 558, 789];Frame2:[478, 594, 557, 790],personFrame1:[368, 466, 419, 585];Frame2:[368, 467, 418, 585],personFrame1:[236, 526, 298, 654];Frame2:[234, 529, 294, 651],personFrame1:[318, 490, 378, 620];Frame2:[318, 491, 378, 620],personFrame1:[610, 393, 675, 579];Frame2:[613, 425, 670, 581].') 2023-11-03 05:38:46.697 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:38:46.697 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "The first piece of official promotional artwork for 'Shadow [026, 011, 741, 998] of the Tomb Raider'") 2023-11-03 05:38:51.555 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:38:51.555 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'person:[372, 666, 455, 924],[021, 577, 073, 784],[138, 649, 195, 818],[184, 612, 905, 891],[216, 641, 291, 850],[245, 662, 283, 801],[465, 683, 590, 931],[477, 583, 508, 652],[650, 685, 800, 931],[715, 714, 761, 787],[817, 674, 858, 775],[849, 625, 878, 712],[899, 628, 950, 735],[923, 621, 977, 731],[941, 621, 993, 739];footwear:[540, 848, 576, 930],[545, 818, 588, 894].') 2023-11-03 05:39:00.195 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:39:00.195 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect office building,person,office supplies,furniture and starfish in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'office building:[000, 042, 800, 1000];person:[513, 164, 841, 1000],[130, 356, 564, 899];office supplies:[035, 579, 071, 645],[050, 560, 114, 634],[165, 579, 348, 716],[181, 541, 328, 603],[217, 899, 408, 999],[258, 747, 401, 830],[285, 805, 349, 868],[376, 702, 584, 981];furniture:[000, 700, 633, 999].') 2023-11-03 05:39:02.835 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:39:02.835 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[407, 026, 625, 426] and track its trajectory.Use the specified Frame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'This is a gooseFrame1:[407, 026, 625, 426];Frame2:[146, 194, 448, 643];Frame3:[392, 377, 597, 616];Frame4:[427, 338, 529, 541].') 2023-11-03 05:39:10.832 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 05:39:10.833 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'fixed-wing aircraft:[440, 335, 538, 454].') 2023-11-03 05:39:13.497 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:39:13.498 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'woman:[838, 565, 995, 967],[080, 574, 230, 996],[131, 328, 180, 499],[159, 315, 201, 407],[310, 288, 342, 488],[342, 313, 391, 507],[345, 330, 461, 789],[483, 338, 539, 663],[578, 477, 849, 999],[629, 407, 666, 646],[670, 385, 773, 593],[676, 336, 733, 448],[797, 324, 901, 752];man:[527, 423, 659, 958],[000, 265, 999, 999];fashion accessory:[589, 425, 865, 890];girl:[791, 340, 874, 608];human face:[619, 468, 645, 559],[866, 625, 923, 739],[884, 503, 910, 582].') 2023-11-03 05:39:14.292 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:39:14.293 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Jake Smith [506, 086, 997, 850], associate consultant at BARR Advisory, shares his experience breaking into the cybersecurity workforce.') 2023-11-03 05:39:32.299 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:39:32.300 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Bottle:[026, 386, 059, 452],[675, 379, 709, 453],[903, 384, 938, 454];Volleyball:[320, 048, 416, 123];Person:[000, 107, 218, 469],[238, 130, 422, 460],[659, 096, 820, 471],[820, 127, 999, 459],[273, 296, 748, 790],[324, 245, 596, 952],[268, 520, 928, 1000];Pen:[163, 224, 180, 238];Sneakers:[826, 944, 925, 1000],[475, 850, 565, 923],[400, 905, 443, 949],[431, 984, 519, 1000],[667, 708, 749, 787],[962, 402, 999, 458],[827, 406, 868, 457],[729, 415, 814, 471],[275, 421, 318, 460];High Heels:[118, 428, 156, 467],[000, 355, 052, 403].') 2023-11-03 05:40:31.291 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:40:31.292 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A chisel [203, 639, 913, 847] is used to pry off old baseboard.') 2023-11-03 05:40:34.565 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:40:34.566 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'person:[198, 801, 293, 963],[230, 748, 304, 852],[276, 812, 346, 898],[280, 728, 335, 795],[314, 756, 378, 869],[369, 740, 405, 860],[388, 747, 447, 833],[388, 814, 451, 886],[451, 807, 517, 885],[456, 747, 515, 841],[525, 767, 576, 874],[551, 742, 597, 815],[551, 806, 614, 872],[593, 761, 644, 874],[633, 806, 720, 967],[650, 761, 703, 857],[707, 748, 768, 908],[779, 757, 853, 936];mammal:[182, 820, 299, 955],[244, 749, 299, 850],[280, 816, 350, 903],[286, 730, 333, 828],[318, 757, 373, 861],[366, 739, 411, 866],[392, 745, 449, 830],[398, 813, 445, 880],[455, 749, 514, 832],[455, 806, 521, 885],[531, 766, 580, 863],[552, 804, 630, 870],[556, 742, 594, 830],[597, 761, 643, 866],[630, 807, 719, 966],[652, 761, 698, 852],[715, 753, 780, 907],[785, 759, 850, 929];clothing:[191, 840, 278, 951],[235, 773, 303, 829],[273, 837, 343, 885],[307, 778, 379, 869],[461, 766, 518, 829],[531, 786, 563, 871],[599, 779, 647, 876],[637, 832, 717, 960],[708, 773, 776, 905],[779, 780, 852, 841],[799, 837, 841, 919].') 2023-11-03 05:40:42.123 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:40:42.124 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Helmet:[716, 471, 772, 573],[476, 327, 531, 429],[258, 346, 294, 424];Skating and Skiing shoes:[510, 696, 552, 751],[442, 699, 467, 749],[241, 632, 289, 680],[182, 606, 220, 678];Gloves:[423, 416, 458, 487],[422, 506, 473, 568],[274, 521, 298, 561];Hockey Stick:[278, 491, 401, 696],[587, 604, 842, 714],[008, 639, 078, 734];Person:[421, 327, 553, 752],[635, 473, 830, 754],[000, 260, 045, 741],[051, 269, 116, 576],[137, 320, 188, 419].') 2023-11-03 05:41:16.823 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 05:41:16.823 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Dog in this image.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Dog:[170, 394, 473, 714].') 2023-11-03 05:41:33.117 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:41:33.117 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Glasses:[846, 690, 951, 752];Person:[002, 686, 254, 1000],[162, 644, 479, 1000],[211, 633, 963, 1000],[416, 652, 720, 1000],[707, 696, 980, 1000],[830, 651, 1000, 1000].') 2023-11-03 05:42:02.021 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:42:02.022 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Guitar:[680, 500, 766, 612];Hat:[745, 765, 846, 882],[225, 704, 338, 841].') 2023-11-03 05:42:55.480 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:42:55.480 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect palm tree,street light,girl and woman in this image.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'palm tree:[076, 161, 203, 751],[300, 367, 359, 704],[380, 490, 468, 728],[725, 491, 761, 614],[783, 362, 833, 624],[861, 080, 992, 608];street light:[190, 480, 229, 729],[260, 594, 280, 728],[440, 608, 457, 695],[491, 633, 506, 709],[653, 633, 667, 707],[666, 606, 678, 707],[711, 462, 749, 753],[905, 519, 930, 641];girl:[591, 714, 670, 999],[747, 744, 808, 838],[868, 766, 985, 956];woman:[885, 776, 983, 968],[583, 709, 671, 999],[738, 740, 807, 842].') 2023-11-03 05:43:20.123 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:43:20.124 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[078, 521, 207, 718],[002, 537, 083, 792],[204, 513, 328, 754],[746, 682, 836, 1000],[795, 511, 986, 998],[721, 428, 854, 880],[686, 534, 759, 909],[584, 488, 702, 846],[481, 409, 685, 754],[567, 378, 638, 784],[613, 400, 698, 763];Hat:[465, 729, 579, 897].') 2023-11-03 05:44:09.482 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:44:09.483 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me what is thisFrame1:[306, 255, 474, 518] and track its trajectory.All trajectories in your reply should conform to the Frame t:[xmin,ymin,xmax,ymax] pattern.', 'This is a guard shipFrame1:[306, 255, 474, 518];Frame2:[307, 272, 497, 537];Frame3:[342, 250, 541, 525].') 2023-11-03 05:44:22.830 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:44:22.830 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Protestors [204, 516, 575, 662] [316, 418, 663, 554] [002, 746, 486, 998] [253, 554, 777, 701] [001, 476, 282, 631] [223, 663, 998, 909] lay on the ground outside of Zilber Hall, part of the 'Die-in' demonstration in August.") 2023-11-03 05:44:43.853 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:44:43.853 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Chair,Storage box,Sandals and Desk in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Chair:[018, 600, 224, 986];Storage box:[137, 459, 256, 638],[245, 480, 333, 690],[550, 491, 674, 719],[641, 453, 803, 738];Sandals:[141, 277, 819, 441].') 2023-11-03 05:44:44.686 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:44:44.686 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Lamp:[747, 001, 819, 186],[114, 119, 165, 294];Pillow:[702, 743, 833, 923],[770, 749, 937, 964],[261, 772, 458, 999],[112, 647, 228, 807],[055, 663, 220, 822],[151, 809, 214, 853],[028, 750, 121, 822],[001, 792, 095, 881],[068, 821, 334, 999];Chair:[206, 647, 361, 815],[002, 659, 311, 870],[587, 776, 998, 1000];Couch:[001, 813, 514, 999];Umbrella:[748, 383, 992, 608],[638, 431, 737, 516];Book:[461, 821, 607, 894];Plate:[419, 809, 517, 841];Basin:[439, 950, 616, 998];Desk:[001, 922, 151, 1000].') 2023-11-03 05:44:58.347 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:44:58.348 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[653, 244, 742, 797] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a giraffeFrame1:[653, 244, 742, 797];Frame2:[594, 251, 698, 762];Frame3:[522, 205, 657, 691];Frame4:[528, 243, 724, 712];Frame5:[425, 279, 664, 761].') 2023-11-03 05:45:07.408 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:45:07.408 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Glad young couple [457, 328, 673, 998] [207, 372, 518, 998] sitting on the couch [044, 482, 888, 926] surrounded by boxes [021, 591, 260, 735] [658, 790, 860, 980]') 2023-11-03 05:45:29.762 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:45:29.762 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[835, 549, 999, 822],carFrame1:[573, 489, 774, 813],carFrame1:[477, 463, 570, 614] in this video clip.When answering, maintain the format classFrame t:[xmin,ymin,xmax,ymax] without deviation.', 'carFrame1:[835, 549, 999, 822];Frame2:[799, 543, 989, 781],carFrame1:[573, 489, 774, 813];Frame2:[585, 496, 745, 749],carFrame1:[477, 463, 570, 614];Frame2:[490, 472, 570, 599].') 2023-11-03 05:45:48.855 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:45:48.856 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Perfectly peeled hard boiled eggs sit on [394, 599, 810, 781] a plate next to Negg, [208, 362, 414, 794] the easy hard boiled egg peeler') 2023-11-03 05:46:10.482 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:46:10.483 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Desk:[001, 672, 678, 1000],[253, 380, 343, 474],[914, 320, 971, 386];Chair:[907, 675, 998, 947],[522, 571, 571, 769];Potted Plant:[802, 076, 869, 225],[537, 113, 593, 220];Person:[001, 203, 088, 717],[094, 233, 280, 703],[340, 105, 656, 892],[548, 059, 911, 998];Belt:[651, 595, 891, 741];Cup:[260, 729, 310, 816].') 2023-11-03 05:46:45.253 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 05:46:45.253 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'woman:[002, 000, 317, 984];clothing:[025, 095, 311, 934],[371, 000, 656, 782];girl:[009, 000, 371, 998];kitchen appliance:[256, 265, 291, 378],[292, 895, 360, 998],[314, 806, 387, 922],[372, 854, 423, 928],[396, 899, 442, 998],[654, 482, 783, 902],[760, 556, 823, 824].') 2023-11-03 05:47:11.452 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:47:11.452 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Illustration of the rounded icons [664, 344, 981, 658] [340, 344, 655, 658] [013, 017, 333, 335] [339, 018, 658, 335] [662, 017, 983, 335] with pants [747, 731, 884, 905] [747, 409, 885, 583] [421, 731, 559, 906] [100, 731, 236, 905] [422, 086, 558, 259] [099, 409, 237, 582] [747, 085, 884, 259] [100, 086, 236, 259] [422, 410, 559, 582] on a white background') 2023-11-03 05:47:14.999 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:47:15.000 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'woman:[469, 295, 760, 803],[093, 287, 370, 813],[694, 292, 893, 834];human hair:[751, 291, 870, 382],[229, 298, 314, 337],[339, 221, 439, 277],[391, 274, 485, 328],[542, 296, 644, 354];footwear:[741, 760, 860, 831],[219, 781, 288, 805],[343, 774, 420, 805],[545, 774, 632, 821],[698, 776, 788, 816];suit:[281, 287, 493, 685],[323, 346, 548, 781];human body:[085, 368, 380, 821],[270, 288, 496, 715],[324, 349, 546, 805],[469, 308, 700, 720],[691, 354, 901, 831];human leg:[135, 553, 232, 818],[224, 555, 319, 806],[288, 574, 345, 714],[342, 559, 422, 805],[435, 562, 512, 808],[514, 536, 601, 820],[586, 531, 691, 822],[704, 520, 801, 816],[753, 518, 886, 827];human face:[221, 318, 293, 366],[393, 286, 460, 349],[742, 306, 806, 371].') 2023-11-03 05:47:17.101 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:47:17.102 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[220, 415, 292, 695],personFrame1:[501, 579, 557, 848],personFrame1:[362, 430, 453, 715] in this video clip.Please strictly follow the format categoryFrame t:[xmin,ymin,xmax,ymax] to answer the question.', 'personFrame1:[220, 415, 292, 695];Frame2:[200, 418, 289, 695],personFrame1:[501, 579, 557, 848];Frame2:[492, 572, 548, 847],personFrame1:[362, 430, 453, 715];Frame2:[359, 430, 412, 708].') 2023-11-03 05:47:39.927 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:47:39.927 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Person:[515, 376, 669, 999],[371, 425, 548, 1000].') 2023-11-03 05:47:52.856 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:47:52.857 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'house:[039, 000, 999, 999];window:[076, 524, 096, 643],[086, 330, 109, 456],[191, 220, 230, 377],[194, 472, 225, 647],[372, 603, 432, 869],[372, 264, 437, 520],[523, 141, 632, 457],[728, 000, 895, 432],[732, 560, 909, 930].') 2023-11-03 05:48:22.607 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:48:22.608 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3,frame4 and frame5,what is the track of the a man walking and carrying a bag?The TrackiFrame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'There is one a man walking and carrying a bag.Track1frame1:[442, 150, 914, 472];frame2:[441, 151, 909, 472].') 2023-11-03 05:48:52.216 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 05:48:52.216 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'man:[258, 235, 428, 644].') 2023-11-03 05:49:04.086 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:49:04.086 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[513, 240, 581, 527],personFrame1:[465, 348, 532, 644],personFrame1:[319, 454, 401, 692],personFrame1:[791, 412, 882, 624],personFrame1:[802, 341, 878, 533],personFrame1:[683, 410, 752, 591],personFrame1:[524, 191, 601, 461],personFrame1:[693, 320, 768, 484],personFrame1:[352, 396, 426, 585],personFrame1:[397, 325, 474, 496] in this video clip.Do not deviate from the established classFrame t:[xmin,ymin,xmax,ymax] format in your answer.', 'personFrame1:[513, 240, 581, 527];Frame2:[504, 227, 564, 522],personFrame1:[465, 348, 532, 644];Frame2:[426, 354, 523, 655],personFrame1:[319, 454, 401, 692];Frame2:[290, 482, 390, 694],personFrame1:[791, 412, 882, 624];Frame2:[796, 428, 896, 632],personFrame1:[802, 341, 878, 533];Frame2:[802, 351, 890, 533],personFrame1:[683, 410, 752, 591];Frame2:[684, 400, 763, 592],personFrame1:[524, 191, 601, 461];Frame2:[525, 177, 599, 461],personFrame1:[693, 320, 768, 484];Frame2:[698, 319, 778, 483],personFrame1:[352, 396, 426, 585];Frame2:[328, 408, 420, 593],personFrame1:[397, 325, 474, 496];Frame2:[378, 333, 472, 503].') 2023-11-03 05:49:42.764 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:49:42.764 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[221, 392, 788, 808] no bake pumpkin chocolate chip cookie dough in spoon') 2023-11-03 05:49:48.775 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:49:48.776 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'woman:[107, 315, 243, 903],[143, 275, 652, 904],[673, 283, 828, 899],[953, 373, 970, 440];balloon:[167, 482, 229, 568],[188, 568, 244, 655],[205, 380, 257, 465],[244, 675, 289, 785],[260, 453, 280, 519],[284, 678, 312, 782],[307, 462, 360, 547],[361, 478, 390, 550],[410, 377, 467, 514],[626, 227, 673, 308],[626, 290, 683, 358],[671, 218, 727, 322],[693, 428, 766, 519],[760, 288, 813, 395],[798, 306, 836, 395];tree:[000, 000, 136, 565],[130, 000, 246, 411],[193, 000, 435, 378],[383, 000, 553, 363],[511, 000, 793, 244],[685, 000, 999, 749],[930, 003, 999, 402];clothing:[000, 380, 031, 544],[106, 407, 243, 853],[146, 356, 248, 801],[238, 415, 286, 700],[270, 419, 305, 569],[297, 365, 408, 839],[385, 509, 481, 864],[546, 370, 636, 870],[686, 360, 815, 867].') 2023-11-03 05:50:00.678 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 05:50:00.678 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'tree:[503, 038, 965, 559],[000, 000, 501, 613],[423, 368, 651, 476].') 2023-11-03 05:50:19.498 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:50:19.499 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Picture:[002, 003, 194, 466],[328, 003, 658, 718];Tie:[685, 450, 732, 930],[100, 413, 158, 818];Person:[426, 208, 930, 998],[097, 274, 471, 999],[002, 178, 329, 999].') 2023-11-03 05:50:22.442 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:50:22.442 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'carnivore:[387, 032, 612, 999].') 2023-11-03 05:50:29.858 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 05:50:29.859 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Necklace:[391, 469, 592, 1000];Bracelet:[050, 897, 082, 990];Glasses:[412, 261, 601, 363],[101, 236, 117, 272].') 2023-11-03 05:50:43.966 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:50:43.966 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Figure 1 Representation of a K+ channel showing activation and inactivation 'gates' and possible binding sites for [751, 347, 833, 616] [666, 351, 749, 619] small molecules") 2023-11-03 05:51:14.762 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:51:14.762 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Cabinet/shelf in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'shelf:[266, 471, 368, 625].') 2023-11-03 05:51:30.724 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:51:30.724 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'the Vivoactive 3 screen [352, 468, 664, 632] on the music library') 2023-11-03 05:52:24.806 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:52:24.807 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the person walk around?Your trajectory details should strictly align with the TrackiFrame t:[xmin,ymin,xmax,ymax] format in your submission.', 'There is one person walk around.Track1frame:1:[750, 001, 962, 349];frame:2:[750, 001, 962, 349];frame:3:[628, 001, 888, 385].') 2023-11-03 05:54:20.009 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:54:20.009 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Baseball Bat:[871, 473, 999, 722];Person:[836, 658, 922, 891],[850, 399, 999, 894],[581, 191, 935, 931],[279, 197, 628, 950],[254, 410, 389, 862],[064, 346, 223, 857],[001, 556, 164, 851],[857, 087, 994, 223],[726, 078, 853, 219],[603, 075, 726, 218],[510, 058, 636, 214],[353, 056, 498, 213],[237, 046, 377, 211],[060, 001, 225, 205],[001, 001, 085, 201],[250, 001, 396, 143],[386, 010, 496, 170],[479, 001, 621, 213],[612, 027, 717, 157],[659, 001, 785, 168],[829, 001, 980, 211],[466, 001, 543, 166],[372, 001, 476, 099];Hat:[113, 347, 180, 410];Helmet:[375, 198, 442, 268],[792, 189, 854, 253];Gloves:[581, 445, 635, 512].') 2023-11-03 05:54:57.925 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:54:57.926 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[145, 006, 848, 946] A diamond necklace, [289, 056, 408, 385] [594, 047, 715, 374] earring, bracelet and ring suite (4) (partially illustrated') 2023-11-03 05:55:36.227 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:55:36.228 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect Gun,Person and Projector in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[050, 338, 307, 999],[001, 340, 084, 547],[040, 317, 100, 444],[180, 156, 281, 511],[271, 294, 588, 972],[433, 335, 539, 556],[550, 305, 734, 609],[689, 328, 812, 628],[753, 294, 892, 622],[579, 319, 616, 377];Projector:[480, 623, 629, 713].') 2023-11-03 05:55:52.936 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:55:52.937 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Person:[812, 125, 1000, 999],[855, 042, 1000, 520],[221, 022, 834, 999].') 2023-11-03 05:56:07.547 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:56:07.547 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is thisFrame1:[400, 412, 488, 518] and track its trajectory.Incorporate the Frame t:[xmin,ymin,xmax,ymax] format when presenting trajectories in your response.', 'This is a medusaFrame1:[400, 412, 488, 518];Frame2:[390, 432, 480, 541];Frame3:[398, 471, 495, 585].') 2023-11-03 05:57:01.690 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:57:01.691 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'plant:[093, 000, 999, 886].') 2023-11-03 05:57:14.260 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:57:14.260 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'BRSP distributed masks [462, 360, 518, 458] [338, 360, 418, 521] [731, 315, 827, 495], sanitizers, and gloves [459, 632, 548, 760] among the deployed forces [411, 282, 571, 997] [541, 327, 656, 933] [221, 219, 545, 994] [514, 187, 999, 995] to stop spreading the coronavirus') 2023-11-03 05:57:33.368 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 05:57:33.371 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'ant:[330, 347, 676, 665].') 2023-11-03 05:57:47.782 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:57:47.783 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[399, 101, 558, 937] and track its trajectory.Ensure that each trajectory within your response is articulated using the Frame t:[xmin,ymin,xmax,ymax] specification.', 'This is a personFrame1:[399, 101, 558, 937];Frame2:[397, 118, 556, 956];Frame3:[394, 112, 547, 901];Frame4:[406, 150, 549, 859];Frame5:[413, 183, 540, 890].') 2023-11-03 05:58:44.222 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:58:44.223 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "American track and field athlete Jordan Gray [148, 032, 949, 997] has launched a campaign for a women's decathlon event to be included at the Paris 2024 Olympic Games") 2023-11-03 05:58:56.235 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 05:58:56.236 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[090, 245, 650, 1000],[491, 089, 962, 1000];Stool:[326, 502, 798, 901];Backpack:[002, 685, 102, 999].') 2023-11-03 05:59:12.089 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:59:12.089 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[775, 351, 918, 990] About 20 armers with [179, 407, 251, 789] [471, 483, 573, 875] [902, 362, 989, 743] [347, 433, 431, 734] [782, 446, 888, 877] [088, 464, 181, 806] [648, 494, 724, 863] [249, 463, 327, 808] a green apron standing, posing as [548, 644, 680, 997] a group for the photo inside the supermarket') 2023-11-03 05:59:19.409 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:59:19.409 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[018, 182, 979, 856] President Trump has been promising to bring Christmas back to the White House for years and has now become the first POTUS in eight years to use [350, 799, 681, 997] the phrase "Merry Christmas!"') 2023-11-03 05:59:34.373 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:59:34.374 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[209, 246, 392, 970] [487, 196, 695, 979] Policemen wearing [271, 308, 321, 368] [560, 264, 613, 325] face masks patrol on a street in Moscow, Russia, on Jan. 15, 2021. (Xinhua/Evgeny Sinitsyn') 2023-11-03 05:59:48.044 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:59:48.044 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A group of [520, 208, 672, 530] [556, 249, 842, 996] [733, 171, 913, 997] [156, 088, 332, 998] [415, 209, 587, 998] [001, 005, 263, 994] [835, 144, 996, 993] [207, 221, 484, 998] diverse youth pose for in [759, 177, 920, 304] [532, 208, 673, 352] [015, 003, 266, 115] [415, 209, 560, 351] [189, 089, 323, 251] [877, 143, 997, 270] [643, 243, 797, 379] [298, 222, 457, 391] their cap and [208, 524, 478, 996] [561, 440, 855, 997] [835, 359, 996, 998] [001, 256, 190, 996] gown for a graduation photo') 2023-11-03 05:59:56.507 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 05:59:56.508 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Book:[686, 434, 890, 735],[000, 260, 157, 644];Person:[220, 120, 737, 997],[368, 634, 405, 743],[000, 251, 162, 999];Frame:[218, 395, 687, 973],[630, 001, 827, 251];Cabinet:[483, 574, 834, 1000].') 2023-11-03 05:59:57.398 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 05:59:57.399 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[124, 223, 580, 750],truckFrame1:[806, 190, 999, 488],carFrame1:[769, 386, 819, 465] in this video clip.Your reply should be in alignment with the classFrame t:[xmin,ymin,xmax,ymax] structure.', 'carFrame1:[124, 223, 580, 750];Frame2:[127, 223, 580, 750],truckFrame1:[806, 190, 999, 488];Frame2:[806, 192, 999, 488],carFrame1:[769, 386, 819, 465];Frame2:[764, 386, 814, 468].') 2023-11-03 06:00:05.488 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:00:05.488 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Person:[868, 240, 1000, 384];Bottle:[532, 253, 562, 319],[561, 264, 599, 319];Potted Plant:[370, 445, 486, 604],[210, 331, 261, 377],[108, 303, 165, 380];Side Table:[001, 543, 623, 916];Flower:[162, 312, 216, 360],[050, 308, 114, 361],[672, 270, 698, 318];Lamp:[004, 000, 064, 252],[151, 000, 202, 265],[263, 000, 309, 273],[941, 000, 1000, 241],[625, 273, 658, 323];Desk:[001, 373, 353, 484],[837, 335, 1000, 389],[521, 318, 725, 372];Stool:[112, 499, 201, 549];Couch:[199, 362, 999, 790];Pillow:[793, 390, 893, 463],[699, 384, 805, 470],[761, 380, 837, 405];Chair:[861, 338, 1000, 391];Vase:[561, 264, 600, 320],[532, 253, 561, 319],[162, 359, 216, 378],[052, 360, 105, 381].') 2023-11-03 06:00:48.821 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:00:48.821 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[446, 219, 635, 720] and track its trajectory.Any trajectory details should be arranged according to the Frame t:[xmin,ymin,xmax,ymax] format for clarity.', 'This is a personFrame1:[446, 219, 635, 720];Frame2:[478, 193, 762, 705];Frame3:[537, 176, 660, 668];Frame4:[589, 143, 754, 677].') 2023-11-03 06:01:02.225 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:01:02.225 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Chair in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Chair:[001, 537, 060, 607],[063, 524, 132, 599],[143, 515, 215, 595],[253, 520, 304, 588].') 2023-11-03 06:01:12.779 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:01:12.779 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'woman:[170, 212, 208, 308],[216, 208, 240, 283],[242, 258, 303, 430],[281, 233, 328, 338],[288, 407, 373, 949],[316, 219, 336, 296],[578, 283, 631, 523],[660, 326, 726, 709],[694, 258, 741, 342],[721, 263, 786, 424],[738, 600, 994, 997],[745, 342, 823, 725],[781, 269, 833, 362],[858, 288, 912, 500],[886, 281, 969, 602],[891, 274, 944, 378];tree:[000, 000, 176, 141],[176, 000, 338, 182];clothing:[000, 207, 018, 292],[011, 511, 125, 996],[019, 324, 096, 540],[032, 254, 095, 357],[059, 615, 243, 999],[078, 261, 124, 377],[085, 344, 163, 439],[114, 355, 224, 516],[115, 240, 182, 369],[164, 256, 197, 336],[182, 223, 208, 303],[182, 406, 291, 756],[208, 284, 250, 390],[245, 278, 295, 399],[252, 377, 316, 784],[279, 245, 300, 309],[285, 257, 315, 361],[290, 484, 373, 919],[306, 372, 393, 559],[314, 242, 333, 305],[329, 265, 385, 367],[361, 254, 406, 446],[398, 261, 425, 378],[428, 305, 469, 463],[464, 252, 493, 331],[468, 313, 526, 563],[505, 294, 547, 547],[541, 274, 602, 507],[583, 329, 612, 415],[585, 388, 671, 721],[625, 298, 693, 503],[651, 378, 719, 697],[670, 411, 789, 694],[679, 247, 716, 324],[703, 274, 750, 390],[716, 311, 770, 452],[729, 252, 766, 348],[748, 693, 999, 999],[760, 265, 785, 393],[763, 395, 833, 653],[767, 288, 828, 397],[776, 435, 888, 800],[813, 305, 883, 428],[848, 236, 897, 327],[873, 336, 900, 509],[884, 327, 970, 602],[902, 307, 926, 382],[921, 348, 999, 682].') 2023-11-03 06:01:26.171 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:01:26.172 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[606, 116, 656, 228] and track its trajectory.To maintain consistency, ensure that trajectories in your answer match the Frame t:[xmin,ymin,xmax,ymax] setup.', 'This is a faceFrame1:[606, 116, 656, 228];Frame2:[607, 127, 661, 241];Frame3:[582, 125, 635, 241];Frame4:[583, 075, 640, 195];Frame5:[556, 054, 612, 175].') 2023-11-03 06:01:34.901 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:01:34.902 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[012, 515, 113, 998],[099, 407, 277, 962],[253, 355, 475, 999],[465, 351, 637, 999],[626, 388, 824, 1000],[931, 704, 1000, 998],[783, 369, 946, 1000];Hat:[146, 407, 224, 482];Glasses:[705, 441, 759, 476];Satchel:[700, 682, 754, 754];Backpack:[298, 818, 438, 1000],[120, 867, 262, 999];Ship:[389, 152, 537, 217].') 2023-11-03 06:01:53.622 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:01:53.622 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'suit:[250, 195, 458, 903];woman:[415, 290, 634, 956],[000, 308, 050, 896],[011, 243, 115, 848],[018, 245, 260, 968];man:[722, 190, 969, 948],[235, 213, 494, 921],[239, 212, 358, 675],[386, 212, 492, 823],[545, 213, 600, 304],[552, 198, 747, 886],[673, 183, 822, 779],[906, 251, 993, 408];footwear:[018, 812, 093, 841],[480, 884, 527, 966],[542, 819, 580, 911].') 2023-11-03 06:01:55.293 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:01:55.294 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[150, 441, 195, 531],[354, 428, 401, 520],[480, 498, 556, 565],[502, 430, 559, 511],[693, 583, 999, 999],[707, 407, 758, 505],[796, 478, 877, 665],[913, 492, 999, 719];man:[026, 434, 152, 626],[200, 441, 256, 544],[295, 409, 341, 482],[411, 425, 476, 526],[580, 462, 748, 671],[799, 375, 925, 546],[891, 441, 966, 576];girl:[390, 776, 558, 999],[509, 429, 558, 504],[527, 482, 629, 656],[693, 580, 999, 999],[798, 477, 882, 662],[821, 488, 999, 725];tree:[332, 000, 999, 492];clothing:[025, 471, 168, 602],[095, 434, 140, 506],[197, 451, 255, 532],[354, 427, 398, 520],[418, 444, 467, 523],[485, 513, 548, 571],[536, 523, 611, 650],[538, 498, 601, 554],[603, 520, 705, 645],[710, 425, 766, 491],[778, 709, 984, 984],[791, 532, 886, 652],[800, 425, 885, 540],[880, 488, 950, 564],[915, 567, 999, 709].') 2023-11-03 06:02:09.238 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:02:09.238 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'boy:[481, 622, 721, 983];human hair:[189, 147, 287, 293],[083, 144, 152, 249],[463, 121, 546, 236],[523, 554, 615, 703],[623, 090, 711, 224],[656, 576, 838, 962],[753, 222, 816, 279],[871, 544, 996, 760];man:[011, 139, 159, 930],[170, 140, 359, 998],[378, 125, 562, 998],[497, 565, 711, 998],[564, 096, 776, 765],[720, 166, 889, 824],[853, 548, 999, 998];clothing:[488, 693, 705, 989],[005, 271, 145, 998],[176, 284, 357, 998],[378, 240, 563, 998],[567, 232, 725, 758],[711, 791, 856, 998],[717, 300, 888, 810],[850, 776, 999, 998];human arm:[018, 372, 081, 516],[112, 393, 162, 508],[160, 314, 272, 531],[298, 305, 370, 533],[373, 284, 522, 489],[527, 426, 568, 510],[543, 948, 638, 998],[562, 266, 685, 641],[622, 827, 843, 998],[701, 376, 783, 503],[715, 413, 793, 560],[750, 319, 887, 549];beard:[227, 245, 263, 314],[755, 259, 813, 339];woman:[611, 570, 855, 998].') 2023-11-03 06:02:17.290 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:02:17.290 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[475, 084, 669, 575];Book:[578, 521, 636, 617];bucket:[031, 788, 184, 995],[003, 723, 135, 893];Vase:[028, 789, 182, 998],[002, 721, 135, 892];Desk:[436, 432, 634, 605],[361, 565, 523, 967],[529, 567, 681, 878],[000, 462, 289, 808];shelf:[926, 174, 1000, 893].') 2023-11-03 06:02:19.302 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:02:19.302 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'woman:[384, 221, 606, 747],[166, 344, 296, 633],[236, 549, 331, 752],[274, 480, 364, 588],[315, 771, 518, 999],[326, 584, 390, 716],[328, 331, 369, 415],[411, 610, 560, 874],[603, 350, 674, 505],[765, 679, 995, 999],[866, 306, 910, 411];human face:[015, 746, 185, 988],[024, 069, 999, 999],[128, 543, 175, 625],[198, 720, 283, 835],[268, 567, 331, 690],[327, 612, 366, 707],[332, 512, 363, 594],[388, 813, 461, 968],[448, 231, 523, 350],[478, 661, 537, 751],[531, 292, 583, 401],[618, 565, 700, 685],[703, 705, 795, 870],[760, 288, 821, 384],[806, 774, 942, 971];land vehicle:[000, 078, 329, 421],[366, 078, 850, 517],[521, 000, 999, 267];man:[000, 668, 190, 999],[000, 175, 999, 999],[123, 505, 188, 677],[131, 317, 210, 507],[163, 617, 284, 957],[259, 332, 346, 499],[284, 690, 427, 917],[507, 652, 644, 978],[509, 271, 590, 426],[594, 529, 695, 693],[608, 462, 644, 531],[619, 295, 699, 472],[627, 619, 794, 937],[653, 281, 711, 407],[681, 271, 766, 503],[708, 267, 857, 584],[719, 175, 890, 448],[866, 201, 999, 570],[896, 574, 999, 887],[935, 705, 999, 999];girl:[431, 228, 613, 742];clothing:[000, 647, 027, 850],[000, 375, 138, 737],[124, 615, 187, 714],[142, 393, 221, 506],[148, 838, 318, 999],[170, 427, 301, 563],[270, 402, 335, 484],[314, 270, 405, 331],[353, 447, 393, 516],[378, 608, 413, 697],[411, 362, 600, 781],[429, 763, 540, 854],[440, 955, 533, 999],[521, 794, 676, 982],[579, 371, 603, 457],[600, 417, 674, 526],[611, 873, 775, 999],[615, 371, 698, 469],[666, 333, 701, 451],[689, 312, 746, 488],[715, 391, 878, 546],[756, 861, 993, 998],[801, 394, 910, 515],[886, 312, 999, 628],[974, 814, 999, 938].') 2023-11-03 06:02:21.103 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:02:21.104 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'tree:[270, 045, 854, 291];building:[856, 410, 999, 583].') 2023-11-03 06:02:26.293 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:02:26.294 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'flower:[261, 460, 561, 993],[335, 061, 829, 773],[650, 158, 999, 895].') 2023-11-03 06:02:59.275 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:02:59.275 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Our Rome with kids itinerary included some physical activities to keep the kids [387, 480, 448, 570] [423, 450, 673, 805] happy, such as the Janiculum Walk') 2023-11-03 06:03:03.911 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:03:03.911 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Flower:[091, 363, 193, 724],[318, 615, 515, 1000],[120, 658, 260, 945];Person:[001, 427, 063, 808],[029, 435, 197, 923],[197, 555, 274, 737],[265, 294, 357, 477],[118, 420, 449, 998],[373, 295, 427, 584],[412, 011, 691, 617],[364, 279, 807, 999],[597, 359, 735, 611],[954, 396, 999, 999],[190, 523, 267, 722];Necklace:[493, 486, 624, 895];Hat:[391, 295, 420, 360];Glasses:[453, 095, 524, 142];Satchel:[723, 477, 756, 556];Tie:[534, 604, 558, 648];Truck:[780, 549, 955, 691].') 2023-11-03 06:03:08.448 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:03:08.448 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Flower and Person in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Flower:[807, 424, 986, 533];Person:[288, 463, 382, 756],[384, 457, 450, 750],[596, 513, 681, 760],[697, 446, 790, 776],[332, 365, 384, 434].') 2023-11-03 06:03:12.876 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 06:03:12.876 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'man:[146, 186, 816, 944];footwear:[794, 586, 922, 676];human face:[397, 304, 562, 494].') 2023-11-03 06:03:18.900 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:03:18.900 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, what is this objectframe:1:[957, 156, 998, 333];frame:2:[951, 139, 998, 322];frame:3:[948, 139, 998, 318] doing?Ensure your answer is to the point, taking into account the provided image frames and trajectory coordinates.', 'This is a/an parked red car without shifting position') 2023-11-03 06:03:23.167 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:03:23.167 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'One colorful and one grey Star Wars bedding set, two throw-back pillows [319, 124, 551, 512] [737, 080, 947, 310] [047, 083, 261, 304] and an "only at Target" icon') 2023-11-03 06:03:29.898 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:03:29.899 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Serena Williams [169, 036, 762, 997] vowed to relish every moment of her quest for an elusive and record-equalling 24th Grand Slam singles title after edging her way into the second round of the US Open.') 2023-11-03 06:03:50.262 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:03:50.262 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'person:[631, 353, 723, 655],[078, 586, 223, 896],[226, 536, 323, 855],[363, 526, 468, 733],[422, 495, 490, 670],[758, 498, 908, 670];mammal:[626, 343, 711, 648],[078, 583, 223, 876],[220, 536, 332, 856],[365, 528, 477, 735],[422, 500, 518, 658],[752, 491, 908, 668];snowboard:[021, 671, 233, 801],[348, 581, 542, 643],[515, 643, 748, 698],[545, 598, 735, 636],[648, 748, 952, 865];footwear:[241, 758, 282, 826],[246, 798, 265, 863].') 2023-11-03 06:04:09.450 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:04:09.451 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Desk:[001, 077, 180, 275];Bench:[001, 111, 548, 214];Person:[001, 658, 106, 931],[476, 635, 694, 1000],[001, 409, 051, 576],[031, 392, 173, 513],[182, 439, 244, 565],[170, 336, 283, 503],[739, 419, 900, 534];Glasses:[596, 478, 643, 513];Slippers:[733, 234, 757, 264];Handbag:[272, 005, 326, 095];Bakset:[406, 081, 453, 122].') 2023-11-03 06:04:19.988 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:04:19.989 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Whiteboard,Bicycle,Chair and Radish in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Whiteboard:[528, 316, 755, 603];Bicycle:[268, 316, 379, 469];Chair:[917, 576, 999, 700].') 2023-11-03 06:04:22.197 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:04:22.198 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'bicycle:[661, 656, 763, 992],[483, 659, 557, 938];boy:[000, 415, 156, 963],[218, 420, 276, 874],[285, 436, 338, 832],[310, 462, 386, 846],[406, 476, 465, 886],[427, 452, 518, 862],[505, 532, 544, 751],[535, 463, 641, 999],[633, 520, 674, 783],[648, 494, 791, 986],[761, 490, 799, 730],[783, 456, 925, 999];clothing:[000, 518, 083, 842],[000, 487, 016, 637],[032, 489, 137, 971],[103, 508, 170, 817],[158, 518, 192, 705],[205, 469, 295, 856],[285, 493, 326, 813],[308, 520, 398, 879],[378, 495, 423, 854],[419, 516, 475, 893],[445, 506, 519, 875],[490, 551, 510, 682],[495, 561, 542, 703],[521, 514, 655, 985],[623, 536, 635, 627],[638, 565, 674, 764],[676, 549, 802, 987],[757, 559, 811, 678],[781, 536, 903, 999],[905, 582, 981, 848],[958, 664, 999, 987].') 2023-11-03 06:04:28.674 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:04:28.674 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect High Heels,Keyboard,Bus,Trash bin Can,Car,SUV and Van in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Bus:[001, 805, 104, 926];Trash bin Can:[663, 874, 688, 903];Car:[186, 846, 334, 915],[787, 877, 964, 927],[889, 912, 999, 999],[793, 854, 832, 870];SUV:[574, 847, 663, 887],[642, 841, 710, 874];Van:[181, 844, 239, 877],[711, 849, 750, 872].') 2023-11-03 06:04:33.236 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:04:33.237 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'desk:[646, 528, 902, 877];table:[188, 636, 668, 1000],[000, 679, 216, 999],[642, 572, 905, 855],[796, 476, 999, 730];person:[085, 231, 262, 690].') 2023-11-03 06:04:35.108 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:04:35.108 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'I enjoyed seeing [351, 483, 458, 737] [188, 407, 277, 541] [002, 493, 099, 747] [729, 323, 914, 772] [154, 512, 349, 993] the kids watching a slide show of the images I had taken the previous night.') 2023-11-03 06:04:46.795 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:04:46.795 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Chair:[001, 686, 058, 825],[115, 675, 177, 803],[159, 672, 215, 800],[283, 666, 342, 783],[328, 658, 381, 767];Desk:[035, 680, 148, 819],[183, 665, 340, 789];Trash bin Can:[503, 666, 567, 798].') 2023-11-03 06:04:54.992 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:04:54.993 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Flower:[526, 350, 668, 553];Desk:[459, 538, 754, 750];Picture:[001, 395, 063, 461];Chair:[001, 566, 136, 760],[101, 619, 293, 837],[179, 537, 311, 701],[304, 656, 590, 997],[656, 811, 997, 998],[791, 620, 998, 838],[946, 571, 998, 640],[495, 503, 577, 610];Carpet:[001, 583, 919, 998];Person:[728, 691, 834, 910],[635, 757, 975, 917],[193, 535, 294, 635],[008, 560, 123, 646],[089, 405, 142, 614],[196, 485, 254, 552],[664, 402, 734, 559],[849, 386, 902, 622],[914, 522, 956, 624],[951, 412, 1000, 577];Handbag:[854, 611, 902, 658];Stroller:[893, 496, 989, 650].') 2023-11-03 06:05:13.447 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:05:13.447 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Bed in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Bed:[002, 002, 1000, 1000].') 2023-11-03 06:05:26.394 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:05:26.395 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the bird flapping its wings flying forward?Maintain consistency in your answer by using the TrackiFrame t:[xmin,ymin,xmax,ymax] format for trajectories.', 'There are 4 bird flapping its wings flying forward.Track1frame:1:[332, 570, 421, 680];frame:2:[344, 568, 421, 620];frame:3:[350, 604, 439, 694],Track2frame:1:[338, 629, 570, 877];frame:2:[340, 641, 571, 990];frame:3:[343, 730, 563, 993],Track3frame:1:[472, 414, 630, 601];frame:2:[471, 482, 632, 612];frame:3:[478, 603, 632, 660],Track4frame:1:[439, 712, 484, 760];frame:2:[441, 734, 463, 765];frame:3:[386, 697, 496, 758].') 2023-11-03 06:05:27.125 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:05:27.125 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Row of [769, 603, 893, 998] IDF soldiers in [412, 605, 468, 791] [306, 579, 352, 733] [864, 657, 958, 944] [489, 632, 554, 845] [780, 656, 880, 956] [687, 628, 767, 915] [608, 623, 691, 880] [546, 629, 613, 868] green uniforms throwing [074, 170, 109, 217] [630, 003, 720, 104] [592, 319, 629, 369] [540, 269, 574, 324] [652, 310, 685, 360] [764, 287, 803, 345] [424, 057, 471, 130] their green berets into the air on top of Masada') 2023-11-03 06:05:44.099 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:05:44.100 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person in this image.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[003, 456, 194, 1000],[071, 019, 548, 984],[467, 405, 731, 895],[652, 150, 954, 680],[876, 190, 999, 601].') 2023-11-03 06:05:50.818 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:05:50.818 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Glasses:[237, 088, 413, 123];Sneakers:[550, 806, 714, 927],[158, 799, 386, 918];Stool:[210, 570, 780, 999];Person:[003, 079, 212, 736],[042, 020, 991, 930],[077, 036, 192, 197].') 2023-11-03 06:06:03.912 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:06:03.912 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the black car going stright?To ensure accuracy, apply the TrackiFrame t:[xmin,ymin,xmax,ymax] template for every trajectory in your response.', 'There is one black car going stright.Track1frame:1:[614, 215, 686, 301];frame:2:[675, 218, 716, 293].') 2023-11-03 06:06:38.272 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:06:38.272 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'If you desire information on some point of law, you are not likely to ponder over [086, 235, 921, 638] the Felix Adler Picture Quote') 2023-11-03 06:06:48.938 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:06:48.938 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Baby baboon from the Plateau Road troop sits in its mother's arms while playing with a leaf [094, 539, 550, 988], Cape Peninsula, South Africa") 2023-11-03 06:06:56.758 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:06:56.759 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Close up of [361, 364, 953, 813] a sander power tool for DIY on [000, 358, 999, 935] wooden parquet floor poster') 2023-11-03 06:07:53.296 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 06:07:53.297 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'insect:[283, 395, 729, 647].') 2023-11-03 06:08:09.105 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:08:09.106 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the walk forward to the right?The TrackiFrame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'There is one walk forward to the right.Track1frame:1:[306, 437, 488, 493];frame:2:[306, 437, 488, 493];frame:3:[342, 442, 510, 573].') 2023-11-03 06:08:32.577 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:08:32.578 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[054, 008, 995, 991] A black cast iron skillet holds sauteed onions and raw mushrooms. A green and white striped napkin is next to the skillet.') 2023-11-03 06:08:44.563 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:08:44.564 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'These Creamsicle-flavored Orange Halloween Cupcakes [008, 166, 990, 859] are the perfect pumpkin alternative! A tantalizing combination [320, 425, 858, 864] of smooth vanilla and juicy orange fruit.') 2023-11-03 06:09:24.922 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 06:09:24.922 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Person:[100, 011, 580, 999],[328, 176, 872, 978],[612, 828, 712, 938];Gloves:[352, 158, 454, 312],[400, 265, 482, 352],[701, 445, 798, 538];Boots:[815, 808, 876, 955],[657, 865, 763, 980],[450, 828, 581, 980],[098, 825, 220, 1000].') 2023-11-03 06:09:51.898 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:09:51.898 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the shifting motorcycle?To ensure accuracy, apply the TrackiFrame t:[xmin,ymin,xmax,ymax] template for every trajectory in your response.', 'There is one shifting motorcycle.Track1frame:1:[269, 516, 425, 730];frame:2:[269, 516, 425, 730];frame:3:[221, 576, 400, 812].') 2023-11-03 06:10:25.954 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:10:25.954 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is thisFrame1:[376, 388, 738, 754] and track its trajectory.Use the specified Frame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'This is a molochFrame1:[376, 388, 738, 754];Frame2:[375, 488, 733, 802];Frame3:[357, 440, 664, 845].') 2023-11-03 06:10:28.463 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:10:28.463 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Desk:[000, 001, 684, 999];Paint Brush:[305, 019, 340, 103],[036, 139, 093, 241];Cup:[203, 247, 273, 373],[323, 079, 390, 208],[436, 118, 497, 220],[451, 073, 503, 166],[012, 135, 074, 238],[000, 214, 056, 374];Bottle:[366, 294, 417, 397],[414, 275, 451, 392];Person:[355, 125, 894, 999],[580, 001, 851, 494].') 2023-11-03 06:10:39.257 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:10:39.258 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[414, 259, 598, 558] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a cockroachFrame1:[414, 259, 598, 558];Frame2:[402, 312, 612, 547];Frame3:[411, 320, 615, 564];Frame4:[392, 338, 573, 567];Frame5:[359, 355, 569, 598].') 2023-11-03 06:10:44.974 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:10:44.974 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect human nose,man,human eye,human hair and human mouth in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'human nose:[580, 487, 662, 604];man:[000, 103, 848, 999];human eye:[620, 395, 667, 488],[650, 560, 698, 681];human hair:[518, 286, 851, 717];human mouth:[515, 484, 572, 655].') 2023-11-03 06:11:08.509 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:11:08.510 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'sisters looking at each other in [435, 325, 723, 817] lace blue dresses. wedding guests best seller in may') 2023-11-03 06:11:39.707 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:11:39.708 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'tree:[032, 784, 118, 927],[000, 463, 068, 562],[000, 285, 090, 419],[065, 609, 170, 673],[109, 284, 158, 352],[138, 526, 376, 673],[170, 780, 258, 929],[203, 306, 266, 377],[239, 765, 320, 888],[265, 271, 355, 382],[271, 768, 376, 916],[307, 308, 499, 437],[346, 275, 427, 349],[396, 780, 483, 921],[437, 456, 992, 877],[441, 290, 545, 396],[494, 795, 545, 900],[539, 801, 598, 913],[567, 831, 633, 926],[586, 313, 657, 388],[640, 828, 715, 962],[651, 297, 687, 387],[691, 838, 768, 967],[727, 876, 803, 999],[796, 903, 853, 993],[860, 752, 999, 999],[884, 361, 999, 450];building:[000, 632, 420, 913],[057, 250, 321, 368],[728, 310, 975, 406].') 2023-11-03 06:11:56.159 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:11:56.160 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Chair:[001, 356, 206, 641],[049, 519, 319, 980],[168, 761, 407, 999],[028, 002, 279, 238],[502, 002, 686, 356],[640, 140, 858, 517],[821, 324, 999, 899];Towel:[168, 356, 321, 479],[142, 158, 253, 267],[302, 569, 466, 729],[427, 805, 607, 999],[708, 668, 873, 874],[655, 434, 758, 533],[341, 083, 456, 182];bucket:[339, 184, 437, 371],[494, 426, 616, 642];Hat:[072, 634, 233, 998],[001, 376, 132, 787],[606, 002, 740, 290],[753, 158, 922, 484],[948, 489, 999, 699];Bottle:[564, 578, 623, 765],[640, 532, 703, 726],[611, 431, 670, 617],[414, 368, 468, 546],[284, 190, 341, 355],[343, 078, 390, 196],[267, 119, 313, 271];Spoon:[335, 612, 453, 655],[741, 671, 819, 817],[476, 864, 602, 925],[193, 379, 300, 428];Fork:[346, 570, 433, 698],[719, 694, 846, 806],[471, 827, 588, 950],[195, 348, 278, 465];Plate:[583, 710, 814, 970],[597, 732, 792, 968],[690, 487, 880, 682],[697, 509, 853, 665],[377, 648, 590, 881],[387, 685, 554, 866],[512, 297, 689, 454],[512, 344, 676, 461],[216, 410, 412, 601],[220, 429, 408, 610],[199, 113, 344, 234],[393, 127, 508, 263],[212, 127, 346, 237];Cup:[458, 389, 499, 495].') 2023-11-03 06:11:58.986 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:11:58.987 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[375, 201, 545, 541] and track its trajectory.Stick to the Frame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'This is a dhowFrame1:[375, 201, 545, 541];Frame2:[367, 181, 556, 559];Frame3:[367, 181, 556, 565];Frame4:[367, 181, 556, 570];Frame5:[367, 181, 557, 576].') 2023-11-03 06:12:11.255 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:12:11.255 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A cartoon drawing of a crazed-looking wolf [300, 300, 828, 730] suspended in midair by a helium balloon [496, 022, 627, 205], with the phrase [187, 711, 860, 989] "Airwolf" written below in a 1980s heavy-metal font') 2023-11-03 06:12:47.993 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 06:12:47.994 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[608, 367, 866, 1000];Bed:[002, 796, 684, 1000];Pillow:[101, 729, 281, 861],[150, 753, 324, 900].') 2023-11-03 06:12:53.224 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 06:12:53.224 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Person:[322, 314, 413, 742],[402, 326, 474, 746].') 2023-11-03 06:13:23.752 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:13:23.752 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A shower transfer bench [173, 268, 553, 993] can be useful to help bathe an elderly person [477, 013, 972, 951]') 2023-11-03 06:13:35.846 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:13:35.847 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[016, 024, 994, 982] Breadth-First Search Visits all vertices adjacent to vertex before going forward. See Figure 20-10b.') 2023-11-03 06:13:41.583 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:13:41.584 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Person:[192, 511, 236, 574],[584, 525, 614, 573],[301, 520, 323, 561];Flag:[313, 435, 362, 459],[246, 271, 300, 293],[552, 330, 588, 349],[782, 431, 823, 452],[876, 369, 905, 389],[865, 436, 910, 465],[535, 291, 644, 324];Boat:[049, 069, 894, 562],[317, 517, 487, 550];Rickshaw:[183, 533, 218, 573].') 2023-11-03 06:14:02.024 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 06:14:02.024 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Street Lights:[128, 417, 209, 693],[804, 337, 920, 668],[863, 422, 943, 648];Car:[820, 615, 860, 632],[106, 626, 186, 650];Awning:[431, 533, 533, 578],[595, 538, 689, 579],[738, 543, 829, 584];Trash bin Can:[519, 637, 556, 679];Chair:[323, 623, 365, 645];Flower:[223, 646, 310, 688].') 2023-11-03 06:14:07.190 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:14:07.190 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'This Creamy Cauliflower Soup recipe [105, 086, 998, 468] is savory, cheesy, and filled with tender chunks of cauliflower plus all of the comforts of a home-cooked meal!') 2023-11-03 06:14:28.952 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:14:28.954 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'cooked bacon [412, 270, 549, 584] [219, 254, 419, 521] [393, 525, 481, 882] [631, 494, 819, 783] [663, 222, 764, 559] [526, 185, 630, 516] wrapped avocado fries in the basket of an air fryer [001, 004, 994, 996]') 2023-11-03 06:14:46.215 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:14:46.215 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[047, 715, 081, 867],[090, 719, 122, 805],[232, 738, 275, 912],[548, 745, 583, 905],[671, 752, 698, 911];Trash bin Can:[077, 800, 114, 922];Street Lights:[400, 204, 436, 891],[785, 138, 830, 923],[597, 521, 649, 803];Stroller:[494, 797, 548, 906].') 2023-11-03 06:14:52.169 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:14:52.170 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Whole Chicken [056, 150, 634, 802] cut into sections on a white plate [001, 105, 992, 877] with lemons [024, 303, 118, 447] [751, 275, 921, 381] [085, 240, 221, 394] [691, 218, 911, 316] [326, 384, 466, 560] [403, 366, 586, 518] [223, 691, 364, 799] and pomegranate seeds') 2023-11-03 06:14:58.621 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:14:58.622 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[101, 433, 235, 941],[343, 376, 385, 523],[459, 505, 570, 829],[772, 364, 835, 535];man:[106, 421, 235, 934],[380, 338, 454, 837];tree:[908, 021, 990, 295],[000, 000, 043, 203],[021, 000, 348, 437],[513, 256, 588, 304],[835, 224, 896, 290];clothing:[366, 354, 449, 767],[120, 493, 246, 933],[350, 404, 391, 521],[460, 577, 566, 833],[769, 398, 835, 537].') 2023-11-03 06:15:06.528 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:15:06.528 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Parishioners [729, 107, 995, 996] prepare for the 180th celebrations with the launch of a new book [153, 473, 241, 617].') 2023-11-03 06:15:35.053 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:15:35.053 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'house:[068, 086, 999, 825];tree:[766, 135, 999, 635];window:[271, 543, 288, 613],[323, 401, 365, 525],[326, 543, 345, 670],[330, 262, 367, 370],[340, 169, 376, 236],[350, 703, 390, 787],[350, 390, 400, 525],[356, 522, 390, 672],[362, 243, 406, 351],[386, 140, 415, 233],[440, 527, 499, 665],[445, 141, 481, 215],[445, 369, 493, 520],[446, 236, 492, 353],[455, 715, 500, 794],[509, 259, 558, 369],[511, 162, 545, 235],[516, 380, 562, 527],[518, 532, 570, 675],[562, 183, 601, 256],[565, 262, 620, 382],[571, 398, 621, 520],[572, 710, 629, 773],[572, 546, 642, 679],[643, 136, 673, 217],[645, 519, 688, 653],[685, 164, 713, 246],[696, 525, 744, 650],[735, 177, 765, 260],[745, 513, 785, 646],[778, 532, 828, 651],[816, 543, 875, 653].') 2023-11-03 06:15:52.506 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:15:52.507 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Street Lights in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Street Lights:[537, 642, 603, 709].') 2023-11-03 06:15:54.360 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:15:54.361 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Acting silly and having a fun time with [520, 414, 964, 997] [002, 199, 213, 997] co[617, 257, 733, 502] [766, 266, 959, 997] [452, 358, 691, 998] [357, 341, 529, 996] -workers at [000, 008, 994, 626] HOMESTAR's Annual Ball") 2023-11-03 06:16:25.677 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:16:25.677 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'shorts:[313, 553, 706, 831],[000, 518, 178, 697];man:[196, 000, 846, 935],[000, 191, 178, 858];sports uniform:[190, 009, 852, 899],[000, 281, 154, 681];footwear:[000, 803, 076, 860].') 2023-11-03 06:16:54.665 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:16:54.665 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'woman:[264, 326, 379, 914].') 2023-11-03 06:16:55.440 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:16:55.440 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Laptop:[370, 552, 578, 785],[069, 501, 230, 621];TV:[750, 177, 1000, 405];Cup:[586, 441, 643, 548];Person:[500, 237, 999, 999],[170, 338, 352, 842];Hat:[588, 237, 720, 367];Handbag:[345, 495, 460, 636].') 2023-11-03 06:17:04.279 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:17:04.279 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Plate:[003, 002, 995, 998].') 2023-11-03 06:17:15.094 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:17:15.094 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[746, 287, 784, 383] and track its trajectory.Stick to the Frame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'This is a camel headFrame1:[746, 287, 784, 383];Frame2:[761, 286, 798, 379];Frame3:[770, 300, 806, 386];Frame4:[803, 308, 842, 405];Frame5:[843, 304, 881, 421].') 2023-11-03 06:17:19.355 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:17:19.356 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[240, 698, 401, 883] [042, 710, 218, 898] [615, 277, 789, 462] cherry gummy bears sitting on [292, 209, 996, 840] a cupcake stand and on [005, 010, 996, 994] a table') 2023-11-03 06:17:33.039 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:17:33.040 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the fishes moving around?Your trajectory details should strictly align with the TrackiFrame t:[xmin,ymin,xmax,ymax] format in your submission.', 'There are 13 fishes moving around.Track1frame:1:[000, 000, 148, 221];frame:2:[000, 000, 156, 215];frame:3:[000, 000, 169, 207],Track2frame:1:[660, 000, 754, 139];frame:2:[669, 000, 757, 129];frame:3:[684, 000, 766, 114],Track3frame:1:[476, 162, 513, 230];frame:2:[472, 164, 517, 224];frame:3:[465, 166, 518, 194],Track4frame:1:[176, 261, 662, 743];frame:2:[216, 249, 708, 736];frame:3:[277, 209, 782, 709],Track5frame:1:[340, 675, 925, 998];frame:2:[346, 673, 956, 999];frame:3:[381, 654, 999, 998],Track6frame:1:[360, 231, 671, 562];frame:2:[335, 200, 648, 441];frame:3:[254, 142, 554, 350],Track7frame:1:[567, 133, 818, 318];frame:2:[562, 123, 807, 308];frame:3:[552, 106, 792, 296],Track8frame:1:[409, 712, 453, 737];frame:2:[425, 700, 458, 725];frame:3:[363, 713, 396, 768],Track9frame:1:[682, 454, 781, 631];frame:2:[680, 450, 777, 627],Track10frame:1:[182, 950, 256, 999];frame:2:[173, 942, 270, 999];frame:3:[161, 925, 284, 999],Track11frame:1:[963, 000, 999, 050];frame:2:[933, 000, 999, 053];frame:3:[888, 000, 988, 072],Track12frame:3:[383, 637, 505, 731],Track13frame:3:[836, 000, 905, 056].') 2023-11-03 06:17:42.720 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 06:17:42.720 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Person:[760, 312, 817, 576],[685, 251, 740, 588],[443, 027, 572, 714],[062, 162, 337, 821];Necklace:[485, 140, 522, 188];Bracelet:[226, 512, 248, 563];Barrel:[542, 585, 621, 735],[366, 781, 514, 953],[189, 770, 329, 998],[706, 562, 815, 793],[361, 653, 448, 797],[371, 584, 443, 688],[329, 612, 396, 796].') 2023-11-03 06:17:45.008 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:17:45.009 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3,frame4 and frame5,can you tell me the trajectory of the woman wearing a blue jean skirt with tan tshirt in this video clip?For the trajectories included in the answer, please use the format TrackiFrame t:[xmin,ymin,xmax,ymax].', 'There is one woman wearing a blue jean skirt with tan tshirt.Track1frame1:[597, 385, 1000, 1000];frame2:[614, 390, 1000, 1000].') 2023-11-03 06:17:47.824 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:17:47.825 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Crew members [255, 278, 333, 423] [309, 220, 410, 419] [154, 274, 263, 425] [545, 292, 649, 421] [449, 290, 529, 420] [647, 267, 760, 421] wearing face masks [465, 335, 501, 380] [688, 304, 725, 352] [339, 315, 378, 364] [583, 342, 619, 387] [202, 318, 241, 366] as a precaution against the spread of corona virus, after the ship docked. Cruise ship Regal Princess [005, 006, 993, 996] has docked in...') 2023-11-03 06:17:57.737 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:17:57.738 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the carrying the cat?Maintain consistency in your answer by using the TrackiFrame t:[xmin,ymin,xmax,ymax] format for trajectories.', 'There is one carrying the cat.Track1frame:1:[000, 001, 999, 998];frame:2:[000, 001, 999, 998];frame:3:[302, 001, 999, 998].') 2023-11-03 06:18:08.711 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:18:08.711 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Surfer with [002, 544, 999, 711] surf board in the evening light') 2023-11-03 06:18:17.136 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:18:17.137 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[001, 216, 245, 924] [074, 022, 833, 823] Two young girls taking afternoon tea on a China with kids itinerary') 2023-11-03 06:18:44.367 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:18:44.367 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'portrait of a young man with blue anorak [249, 360, 765, 990] in a studio, standing against blue background - man blue background stock pictures, royalty-free photos [597, 622, 801, 730] & images') 2023-11-03 06:19:25.526 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:19:25.527 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Scale:[645, 280, 796, 491],[205, 468, 244, 505];Lamp:[228, 234, 348, 320],[265, 335, 309, 365],[354, 321, 420, 364],[460, 315, 520, 361],[354, 360, 402, 390],[454, 365, 495, 396];Fan:[170, 357, 247, 395];Crab:[738, 154, 836, 224],[603, 016, 735, 093],[260, 743, 347, 795];Gloves:[480, 579, 574, 650],[628, 539, 704, 578];Handbag:[557, 547, 713, 668],[898, 446, 976, 553],[796, 523, 909, 621];Person:[787, 397, 872, 489],[445, 404, 718, 656],[610, 402, 684, 501],[589, 396, 635, 448],[361, 408, 440, 472],[001, 420, 174, 998],[127, 432, 181, 521];Basin:[150, 635, 226, 658];Plate:[202, 749, 344, 795].') 2023-11-03 06:19:43.413 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:19:43.414 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[339, 360, 601, 849] bride laying on bed, surrounding by [545, 112, 754, 744] [143, 002, 408, 734] her bridesmaids') 2023-11-03 06:20:06.678 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:20:06.678 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[513, 268, 682, 996] [821, 511, 972, 993] [197, 415, 472, 996] [002, 366, 156, 653] [002, 564, 193, 996] [460, 645, 646, 997] Shiite rebels known as Houthis hold up [441, 094, 481, 407] [521, 119, 562, 499] [142, 010, 387, 818] their weapons as they chant slogans during a protest to denounce the Saudi aggression in Sanaa, Yemen, Wednesday, April 22, 2015') 2023-11-03 06:20:14.020 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:20:14.020 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Bread,Person and Helmet in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Person:[014, 429, 056, 534],[334, 287, 704, 680],[497, 554, 944, 712],[554, 406, 598, 546],[632, 454, 664, 546],[660, 453, 700, 548],[699, 450, 733, 549],[744, 390, 775, 551],[864, 458, 899, 553],[904, 461, 932, 554];Helmet:[587, 556, 647, 625].') 2023-11-03 06:20:34.776 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:20:34.776 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2 and frame3, please tell me what this oneframe:1:[202, 570, 618, 995];frame:2:[189, 579, 592, 995];frame:3:[166, 583, 571, 995] is doing?Craft a concise reply using the image frames and trajectory specifics you have at hand.', 'This is a/an cake in front of boy') 2023-11-03 06:20:35.607 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:20:35.607 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The Explore section from Feedly with [683, 457, 838, 785] trending tech articles shown.') 2023-11-03 06:20:55.370 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:20:55.371 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'human head:[341, 101, 550, 491],[000, 099, 130, 460],[031, 000, 180, 253],[250, 066, 384, 383],[612, 120, 823, 485];human hand:[791, 453, 977, 681],[000, 498, 064, 646],[025, 746, 136, 921],[099, 761, 227, 999],[131, 481, 218, 607],[159, 821, 306, 999],[163, 650, 196, 748],[300, 900, 350, 963],[440, 763, 560, 915],[556, 823, 692, 994],[625, 802, 788, 950];man:[000, 757, 138, 999],[000, 104, 209, 999],[036, 000, 248, 603],[097, 761, 226, 999],[160, 822, 309, 999],[171, 126, 583, 999],[188, 069, 390, 617],[552, 824, 813, 999],[629, 576, 999, 999],[795, 474, 999, 681];human face:[000, 253, 125, 452],[051, 073, 155, 240],[261, 176, 353, 382],[401, 209, 529, 477],[624, 205, 776, 476];human arm:[000, 502, 066, 641],[000, 760, 143, 927],[100, 764, 226, 999],[108, 266, 215, 608],[167, 828, 310, 999],[167, 446, 584, 905],[572, 828, 825, 999],[625, 581, 999, 987],[795, 478, 999, 672];human nose:[491, 322, 522, 389],[641, 318, 694, 401].') 2023-11-03 06:21:16.629 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:21:16.633 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Jersey Shore's Snooki and [543, 240, 761, 998] [358, 264, 611, 994] [721, 200, 900, 997] her bridesmaids enjoying [744, 590, 804, 750] [639, 586, 699, 715] [484, 556, 544, 681] [161, 372, 233, 544] pre-wedding cocktails in [542, 384, 762, 998] [355, 408, 609, 994] Gatsby themed sweatsuits") 2023-11-03 06:21:23.116 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:21:23.116 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'chair:[006, 398, 234, 700];girl:[147, 346, 999, 999];clothing:[418, 605, 999, 999];human face:[651, 453, 916, 655].') 2023-11-03 06:21:38.419 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:21:38.420 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, can you tell me what thisframe:1:[269, 595, 362, 888];frame:2:[249, 571, 364, 864];frame:3:[260, 547, 383, 809] is?Ensure your answer is to the point, taking into account the provided image frames and trajectory coordinates.', 'This is a/an people riding motorized-bicycle') 2023-11-03 06:23:23.095 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:23:23.095 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Nepalese girl [313, 177, 610, 795] in traditional dress [323, 265, 583, 676] carries a large basket with a heavy sheaf of grass [212, 006, 994, 376] for her livestock. Nepalese child helps parents. Pokhara [005, 003, 989, 956], Nepal') 2023-11-03 06:23:33.268 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:23:33.268 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'human body:[426, 174, 550, 972],[105, 154, 947, 998];woman:[250, 272, 370, 973],[418, 198, 558, 938];human hair:[257, 240, 352, 432],[202, 177, 280, 260],[475, 202, 527, 294],[531, 164, 595, 251],[650, 150, 716, 226],[710, 153, 797, 244];mammal:[420, 164, 554, 950],[000, 230, 098, 998],[103, 177, 281, 998],[245, 254, 390, 998],[348, 235, 453, 957],[525, 162, 695, 960],[640, 148, 772, 998],[703, 140, 950, 997];man:[528, 165, 708, 940],[000, 232, 095, 998],[110, 183, 287, 998],[341, 232, 450, 911],[649, 158, 774, 998],[709, 155, 937, 998];footwear:[553, 867, 649, 956],[300, 940, 366, 998],[344, 807, 390, 875],[388, 833, 422, 908],[445, 725, 491, 906],[463, 731, 529, 931],[545, 840, 584, 908],[685, 958, 784, 998];jeans:[560, 482, 698, 913],[128, 705, 253, 998],[700, 586, 788, 996],[745, 674, 903, 998];dress:[267, 381, 383, 792];human face:[213, 213, 273, 323],[298, 277, 346, 366],[381, 248, 432, 342],[475, 223, 519, 303],[546, 187, 601, 285],[653, 170, 710, 280],[715, 193, 790, 334].') 2023-11-03 06:23:45.556 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:23:45.557 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'bicycle:[441, 488, 571, 823];bicycle wheel:[465, 636, 525, 810],[533, 572, 576, 721];bookcase:[020, 220, 266, 856];wheel:[465, 634, 525, 814],[534, 563, 578, 720];shelf:[015, 190, 254, 836];book:[016, 637, 173, 828],[021, 534, 154, 671],[141, 667, 168, 745],[155, 485, 260, 573],[165, 599, 252, 665],[165, 635, 259, 736].') 2023-11-03 06:24:08.698 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:24:08.699 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Family:\xa0Matt also seems to be getting on wonderfully with [174, 401, 640, 998] [423, 430, 880, 997] Tammy's children, Wolf and Saskia, (pictured) who she shares with her ex-partner [000, 183, 526, 989] Reece Hawkins") 2023-11-03 06:24:20.475 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 06:24:20.475 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Person:[143, 093, 712, 999],[025, 180, 112, 444];Necklace:[363, 311, 488, 436];Car:[554, 178, 999, 414];Cell Phone:[347, 230, 395, 322].') 2023-11-03 06:24:28.464 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:24:28.465 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Wild Bird in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Wild Bird:[837, 688, 971, 894],[576, 433, 803, 868],[091, 201, 185, 549],[162, 311, 242, 485].') 2023-11-03 06:24:59.265 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:24:59.266 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Boat:[537, 518, 825, 679];Person:[106, 212, 171, 452].') 2023-11-03 06:25:24.193 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:25:24.194 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '2021 Jeep Gladiator [069, 160, 626, 897] driving down the road [014, 050, 990, 991]') 2023-11-03 06:25:28.407 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:25:28.408 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[175, 167, 618, 976] and track its trajectory.For clarity, represent trajectories using the Frame t:[xmin,ymin,xmax,ymax] format in your response.', 'This is a personFrame1:[175, 167, 618, 976];Frame2:[000, 044, 505, 833];Frame3:[156, 055, 608, 803];Frame4:[054, 161, 552, 896];Frame5:[001, 172, 569, 951].') 2023-11-03 06:25:34.426 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:25:34.426 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'If you have seen the movie only once in the 1980s, you might be forgiven for thinking that it may not stand up as a classic today. Back to the Future The Musical | Manchester Opera House [036, 075, 962, 961] | Casting') 2023-11-03 06:25:36.973 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:25:36.973 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[491, 102, 785, 522] and track its trajectory.Ensure the trajectories in your answer follow the Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a hat worn by a man with facial hairFrame1:[491, 102, 785, 522];Frame2:[489, 094, 795, 527];Frame3:[491, 077, 800, 519];Frame4:[487, 066, 802, 519];Frame5:[487, 063, 802, 513].') 2023-11-03 06:25:55.651 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:25:55.651 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Chair:[120, 619, 340, 997],[288, 666, 535, 997],[536, 818, 750, 999];Desk:[001, 489, 999, 999];Person:[141, 449, 315, 803],[364, 362, 604, 895],[591, 417, 881, 999];TV:[081, 379, 187, 482];Keyboard:[125, 559, 240, 606],[004, 567, 084, 612].') 2023-11-03 06:25:56.711 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:25:56.711 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[396, 262, 662, 604] and track its trajectory.For clarity, represent trajectories using the Frame t:[xmin,ymin,xmax,ymax] format in your response.', 'This is a elkFrame1:[396, 262, 662, 604];Frame2:[362, 238, 565, 587];Frame3:[331, 250, 618, 575];Frame4:[314, 281, 573, 600].') 2023-11-03 06:26:01.339 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:26:01.339 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Teagasc and Musgrave [001, 037, 339, 997] join forces [689, 052, 997, 996] to fund a 4-year PhD to ensure safer food supply chains') 2023-11-03 06:26:45.693 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:26:45.693 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Truck:[761, 770, 830, 858],[198, 727, 340, 947],[810, 746, 879, 828];Street Lights:[655, 592, 690, 780];Bus:[172, 765, 215, 830];Luggage:[923, 894, 958, 950].') 2023-11-03 06:26:57.155 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:26:57.155 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[035, 028, 995, 904] A house in Ironwood has [068, 550, 710, 875] an old Hollywood video sign attached to an out building... Hollywood in Michigan!!') 2023-11-03 06:27:07.371 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:27:07.371 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Other Shoes,Hat,Person and Glasses in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Other Shoes:[720, 931, 781, 1000],[966, 728, 989, 805];Hat:[530, 195, 657, 405],[984, 153, 1000, 219];Person:[497, 196, 748, 1000],[654, 284, 791, 619],[699, 202, 968, 999],[945, 144, 1000, 805],[133, 122, 405, 999],[039, 266, 186, 931].') 2023-11-03 06:27:43.721 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:27:43.722 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'man:[015, 095, 176, 317],[259, 063, 349, 334],[312, 057, 402, 351];footwear:[604, 794, 684, 899],[283, 845, 368, 916];plant:[000, 397, 999, 625],[415, 000, 814, 109];sports equipment:[410, 503, 607, 601],[466, 252, 552, 347],[637, 783, 693, 893],[890, 549, 959, 657].') 2023-11-03 06:27:49.717 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:27:49.718 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Lamp:[108, 278, 178, 436],[383, 193, 445, 235];Flower:[119, 436, 187, 482],[420, 574, 479, 629],[441, 553, 482, 621],[291, 562, 319, 623];Vase:[127, 476, 174, 555];Picture:[652, 464, 765, 508];Potted Plant:[137, 620, 346, 861];shelf:[125, 824, 336, 999],[111, 527, 185, 629];Chair:[611, 856, 695, 998],[758, 754, 844, 848],[328, 641, 412, 839],[446, 644, 503, 839],[413, 619, 444, 797];Couch:[518, 681, 883, 1000];Pillow:[525, 797, 601, 896];Desk:[354, 655, 503, 839];Bottle:[696, 782, 722, 884],[816, 755, 851, 878];Wine Glass:[735, 817, 760, 886],[777, 797, 807, 873];Plate:[782, 871, 865, 912],[776, 883, 870, 923],[719, 818, 773, 863],[854, 795, 883, 828];TV:[124, 723, 268, 936];Bakset:[673, 873, 787, 940].') 2023-11-03 06:27:55.217 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:27:55.217 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[322, 296, 670, 655] Colorful Reggae Mammoth vector illustration isolated on a white background.') 2023-11-03 06:28:01.187 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:28:01.187 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Fire Truck:[219, 119, 895, 876];Street Lights:[174, 133, 219, 442];Helmet:[221, 401, 283, 459],[273, 399, 314, 443];Hat:[340, 426, 386, 476];Other Shoes:[399, 882, 442, 927];Gloves:[366, 319, 385, 352];Person:[310, 425, 444, 936],[365, 242, 403, 431],[587, 289, 652, 393].') 2023-11-03 06:28:04.024 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:28:04.025 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person,Bottle and Desk in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Person:[264, 679, 339, 807],[388, 699, 458, 882],[436, 716, 486, 853],[495, 705, 551, 845],[657, 714, 895, 1000],[781, 761, 831, 863],[688, 693, 842, 858],[874, 721, 914, 852],[868, 709, 974, 911],[938, 778, 1000, 998];Bottle:[894, 835, 924, 939];Desk:[207, 841, 421, 999],[264, 767, 601, 855],[583, 891, 691, 999].') 2023-11-03 06:28:24.342 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:28:24.342 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Charm bracelet brown, green, white from the Charm Club collection [149, 135, 856, 869] in the THOMAS SABO online store') 2023-11-03 06:28:51.599 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:28:51.599 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[404, 046, 551, 660];Flower:[065, 181, 292, 411],[710, 163, 969, 426],[000, 706, 144, 926],[883, 797, 999, 904];Lamp:[284, 745, 383, 914],[645, 758, 750, 929],[758, 806, 931, 962],[129, 784, 279, 974];Vase:[000, 866, 132, 956],[924, 890, 999, 947].') 2023-11-03 06:28:54.192 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:28:54.192 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[560, 269, 607, 458],personFrame1:[286, 202, 328, 413],personFrame1:[504, 313, 567, 519],personFrame1:[422, 497, 564, 773],personFrame1:[555, 551, 628, 872],personFrame1:[613, 308, 657, 516],personFrame1:[318, 288, 375, 519] in this video clip.Please strictly follow the format categoryFrame t:[xmin,ymin,xmax,ymax] to answer the question.', 'personFrame1:[560, 269, 607, 458];Frame2:[474, 279, 542, 473],personFrame1:[286, 202, 328, 413];Frame2:[288, 202, 330, 413],personFrame1:[504, 313, 567, 519];Frame2:[543, 311, 582, 513],personFrame1:[422, 497, 564, 773];Frame2:[432, 540, 574, 791],personFrame1:[555, 551, 628, 872];Frame2:[555, 551, 622, 872],personFrame1:[613, 308, 657, 516];Frame2:[614, 318, 662, 515],personFrame1:[318, 288, 375, 519];Frame2:[318, 288, 375, 518].') 2023-11-03 06:29:09.299 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:29:09.299 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[340, 039, 663, 994] Troy Tulowitzki, who hit two homers vs. the Rangers on May 5, leads the majors with a .369 batting average.') 2023-11-03 06:29:11.265 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:29:11.265 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'silver bracelet with [556, 581, 678, 699] [707, 710, 804, 813] heart-shaped pendants and hanging chain lowered on [004, 094, 987, 995] a book page') 2023-11-03 06:29:28.027 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:29:28.027 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Gloves,Belt and Glasses in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Gloves:[731, 343, 777, 446];Belt:[572, 619, 725, 690];Glasses:[512, 120, 585, 172].') 2023-11-03 06:29:39.990 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:29:39.991 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'human body:[098, 238, 302, 852],[633, 088, 999, 997],[944, 148, 996, 985];fireplace:[398, 508, 546, 728];man:[642, 090, 995, 992].') 2023-11-03 06:30:08.488 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:30:08.488 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[019, 280, 087, 805] [257, 250, 325, 815] [629, 276, 701, 836] [901, 243, 986, 838] [714, 263, 791, 841] The Australian Socceroos line up to sing the national anthem prior to the second leg of the 2006 FIFA World Cup qualifying match between [391, 393, 457, 844] Australia...') 2023-11-03 06:30:16.029 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:30:16.029 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect chair,woman,clothing and man in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'chair:[008, 578, 085, 602],[021, 620, 105, 675],[028, 759, 221, 999],[207, 609, 280, 762],[241, 954, 388, 999],[301, 710, 433, 791],[371, 516, 467, 663],[424, 568, 529, 749],[432, 891, 747, 999],[605, 615, 678, 733],[683, 595, 774, 810],[825, 726, 999, 999];woman:[753, 456, 972, 978],[146, 429, 220, 580],[179, 479, 273, 648],[340, 345, 390, 510],[396, 319, 450, 490],[485, 348, 531, 501],[596, 434, 636, 529],[639, 430, 690, 565],[656, 443, 766, 650];clothing:[435, 587, 717, 992],[000, 480, 100, 600],[009, 637, 218, 861],[139, 482, 225, 575],[171, 537, 258, 643],[223, 332, 336, 532],[239, 532, 388, 801],[340, 382, 395, 511],[382, 472, 430, 640],[392, 371, 450, 492],[427, 510, 541, 696],[483, 371, 538, 503],[513, 514, 663, 677],[572, 363, 616, 451],[635, 453, 692, 568],[655, 517, 756, 656],[735, 463, 794, 547],[749, 589, 970, 908],[756, 477, 866, 643],[965, 646, 999, 737];man:[430, 525, 726, 988],[002, 424, 099, 601],[006, 540, 216, 863],[215, 451, 395, 810],[222, 286, 353, 527],[431, 453, 545, 720],[535, 429, 598, 522],[553, 464, 661, 650],[571, 327, 620, 469],[737, 420, 798, 543],[758, 432, 871, 627].') 2023-11-03 06:30:33.208 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:30:33.209 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect nightstand,table and drawer in this image.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'nightstand:[805, 547, 948, 841];table:[000, 436, 171, 607],[800, 557, 966, 835];drawer:[820, 515, 948, 822].') 2023-11-03 06:30:38.813 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:30:38.814 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'mobile phone:[436, 237, 798, 1000].') 2023-11-03 06:30:39.688 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 06:30:39.689 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Lifesaver:[934, 003, 999, 102];Person:[487, 217, 600, 426],[624, 003, 783, 112],[944, 029, 999, 081].') 2023-11-03 06:30:49.468 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:30:49.469 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[814, 464, 989, 786],[639, 409, 815, 780],[477, 381, 700, 844],[256, 358, 421, 854],[032, 455, 346, 927],[002, 314, 184, 928];shelf:[002, 184, 127, 520],[129, 219, 383, 548],[376, 283, 557, 826],[551, 331, 680, 610],[713, 385, 839, 634];Desk:[002, 749, 1000, 1000];Tie:[670, 542, 735, 780].') 2023-11-03 06:31:05.393 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:31:05.393 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'harbor seal:[075, 259, 735, 615],[419, 000, 673, 093];sea lion:[078, 272, 731, 606],[422, 000, 671, 087].') 2023-11-03 06:31:17.970 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:31:17.970 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'man:[046, 186, 110, 352],[400, 187, 455, 310],[457, 168, 495, 270],[489, 180, 549, 376],[711, 207, 759, 339],[745, 174, 820, 370];clothing:[032, 329, 286, 965],[048, 296, 123, 450],[057, 230, 110, 351],[165, 426, 528, 999],[403, 436, 695, 999],[404, 232, 451, 307],[458, 188, 493, 267],[491, 197, 554, 370],[578, 378, 973, 999],[710, 238, 756, 338],[745, 211, 808, 349],[903, 201, 954, 395];woman:[000, 192, 296, 999],[086, 283, 823, 999],[379, 204, 969, 999],[404, 261, 705, 993];girl:[000, 195, 229, 998],[087, 286, 824, 996],[384, 203, 968, 996],[411, 264, 700, 988];human face:[133, 188, 200, 281],[181, 290, 278, 431],[341, 354, 433, 499],[447, 310, 540, 462],[632, 281, 729, 452].') 2023-11-03 06:31:19.579 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:31:19.580 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[781, 427, 856, 914],[672, 394, 904, 865],[439, 365, 548, 641],[456, 353, 550, 852].') 2023-11-03 06:31:25.755 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:31:25.755 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'human body:[693, 499, 987, 998],[000, 397, 999, 999],[125, 035, 329, 295];woman:[008, 355, 151, 621],[330, 332, 393, 459],[370, 318, 431, 474],[467, 348, 730, 809],[544, 340, 725, 639],[580, 330, 723, 580],[696, 524, 999, 999],[791, 382, 999, 655],[853, 377, 938, 536];human hair:[026, 358, 131, 565],[035, 576, 234, 888],[068, 501, 230, 630],[181, 485, 320, 653],[184, 353, 238, 423],[256, 309, 317, 403],[313, 439, 421, 590],[426, 363, 506, 474],[494, 351, 568, 465],[556, 336, 604, 443],[708, 328, 941, 811];clothing:[000, 873, 233, 999],[120, 430, 283, 519],[216, 899, 318, 999],[246, 395, 329, 576],[280, 584, 620, 999],[372, 390, 425, 476],[399, 476, 677, 932],[477, 461, 718, 805],[548, 428, 706, 611],[618, 411, 732, 576],[681, 765, 971, 904],[699, 681, 951, 996].') 2023-11-03 06:31:28.206 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:31:28.206 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect chest of drawers,woman,boy and man in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'woman:[327, 632, 544, 839],[527, 565, 586, 658];boy:[065, 688, 388, 999],[273, 856, 468, 999],[287, 625, 385, 808],[359, 637, 530, 817],[538, 697, 634, 789];man:[659, 649, 829, 825],[000, 664, 117, 999],[019, 610, 117, 722],[070, 673, 377, 999],[240, 551, 293, 640],[257, 619, 414, 828],[278, 548, 309, 628],[347, 541, 388, 618],[371, 536, 409, 616],[388, 108, 570, 358],[445, 534, 484, 618],[489, 539, 522, 616],[516, 744, 765, 999],[555, 451, 802, 483],[563, 572, 625, 694],[596, 531, 625, 594].') 2023-11-03 06:31:34.381 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:31:34.381 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect woman,footwear,girl and plant in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'woman:[432, 129, 601, 881],[113, 133, 245, 695],[310, 143, 456, 704],[930, 253, 999, 663];footwear:[478, 786, 555, 878];girl:[434, 091, 613, 883];plant:[000, 388, 988, 411],[000, 423, 999, 998].') 2023-11-03 06:31:36.184 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:31:36.184 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[534, 475, 722, 565] and track its trajectory.When detailing trajectories in your response, adhere to the Frame t:[xmin,ymin,xmax,ymax] format.', 'This is a skateboardFrame1:[534, 475, 722, 565];Frame2:[504, 540, 677, 706];Frame3:[497, 594, 640, 780];Frame4:[498, 626, 645, 754];Frame5:[628, 858, 684, 994].') 2023-11-03 06:31:44.728 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:31:44.729 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Moses Odubajo of Hull City is surrounded by team-mates [092, 167, 399, 996] [372, 136, 899, 997] as he celebrates his goal') 2023-11-03 06:31:52.129 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:31:52.129 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'person:[012, 286, 982, 973],[224, 818, 265, 969],[556, 562, 580, 648],[582, 564, 615, 653],[630, 559, 666, 663],[705, 439, 718, 506],[860, 826, 985, 978];clothing:[008, 559, 201, 639],[024, 757, 036, 820],[031, 649, 055, 718],[046, 772, 060, 841],[050, 516, 071, 585],[058, 777, 069, 870],[074, 777, 097, 891],[080, 663, 101, 726],[098, 793, 113, 915],[111, 500, 132, 570],[115, 801, 148, 904],[142, 820, 188, 937],[143, 819, 159, 904],[154, 621, 171, 684],[185, 829, 206, 938],[201, 844, 223, 946],[202, 484, 215, 553],[208, 431, 438, 618],[228, 839, 271, 955],[253, 806, 301, 914],[259, 602, 269, 668],[303, 580, 315, 644],[318, 637, 333, 706],[341, 644, 355, 711],[364, 648, 379, 725],[374, 680, 406, 772],[461, 670, 478, 739],[480, 668, 500, 741],[500, 677, 518, 766],[520, 680, 534, 748],[541, 692, 575, 761],[558, 579, 581, 644],[564, 657, 581, 753],[584, 399, 705, 439],[728, 327, 986, 453],[897, 851, 940, 932].') 2023-11-03 06:32:02.922 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:32:02.922 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the woman sitting on the floor and sorting stockings on her legs?For the trajectories included in the answer, please use the format TrackiFrame t:[xmin,ymin,xmax,ymax].', 'There is one woman sitting on the floor and sorting stockings on her legs.Track1frame:1:[064, 243, 223, 585];frame:2:[083, 227, 251, 575];frame:3:[100, 231, 269, 568].') 2023-11-03 06:32:05.905 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:32:05.905 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[203, 141, 892, 996] and track its trajectory.Stick to the Frame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'This is a tankFrame1:[203, 141, 892, 996];Frame2:[189, 131, 888, 995];Frame3:[195, 129, 846, 910];Frame4:[191, 143, 801, 875];Frame5:[183, 173, 797, 854].') 2023-11-03 06:32:18.973 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:32:18.973 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Photo of The Americans. [145, 004, 485, 993] Matthew Rhys (left) and Keri Russell (right) are standing in their kitchen very concerned.') 2023-11-03 06:32:25.682 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:32:25.682 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect human face,clothing,table and sports equipment in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'human face:[000, 000, 128, 229],[232, 254, 278, 345],[671, 360, 706, 439],[778, 406, 826, 506],[840, 568, 915, 695];clothing:[065, 187, 317, 958],[000, 375, 034, 481],[000, 166, 173, 466],[575, 187, 729, 258],[623, 406, 754, 627],[689, 466, 840, 720],[735, 583, 998, 931],[845, 172, 867, 235];table:[001, 443, 098, 487],[339, 750, 918, 997],[573, 450, 656, 554];sports equipment:[385, 506, 546, 587],[431, 597, 659, 712],[465, 762, 792, 968],[585, 327, 687, 462],[885, 254, 998, 493].') 2023-11-03 06:32:39.275 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:32:39.275 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect person,wheel and land vehicle in this image.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'person:[009, 329, 098, 721],[394, 429, 512, 505],[514, 424, 581, 485],[542, 036, 579, 139],[555, 372, 581, 471],[611, 363, 675, 520],[650, 369, 685, 516];wheel:[152, 510, 186, 618],[290, 584, 354, 730],[501, 658, 611, 864],[709, 678, 805, 809];land vehicle:[306, 351, 821, 864],[000, 338, 061, 445],[065, 324, 278, 617],[235, 384, 304, 430],[257, 391, 304, 467],[289, 322, 566, 467],[557, 328, 863, 580].') 2023-11-03 06:32:41.958 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:32:41.958 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the a baby seated in a baby chair?Ensure the trajectories in your answer follow the TrackiFrame t:[xmin,ymin,xmax,ymax] structure.', 'There is one a baby seated in a baby chair.Track1frame:1:[358, 268, 997, 998];frame:2:[337, 263, 997, 998];frame:3:[266, 278, 997, 998].') 2023-11-03 06:32:58.022 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 06:32:58.022 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'house:[065, 018, 738, 752],[704, 342, 999, 623];window:[022, 217, 155, 438],[280, 211, 347, 386],[398, 211, 469, 391],[519, 217, 600, 395];door:[230, 466, 293, 698],[454, 476, 544, 729].') 2023-11-03 06:33:05.690 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:33:05.690 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'chair:[543, 676, 721, 918],[000, 799, 066, 962],[431, 581, 538, 881],[627, 560, 753, 872],[759, 729, 999, 958];office building:[078, 089, 958, 984];man:[404, 263, 515, 575],[000, 636, 056, 834],[244, 414, 327, 577],[421, 412, 526, 799],[632, 403, 751, 616],[858, 484, 986, 729],[875, 450, 944, 591];clothing:[641, 446, 776, 659],[000, 699, 031, 961],[216, 600, 380, 909],[245, 496, 269, 586],[418, 303, 505, 534],[428, 498, 518, 771],[901, 611, 979, 730].') 2023-11-03 06:33:08.240 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:33:08.240 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Person:[026, 029, 138, 196],[277, 044, 320, 088],[300, 001, 429, 209],[102, 040, 284, 584],[210, 027, 346, 303],[330, 001, 602, 355],[328, 062, 596, 1000],[436, 259, 836, 999];Desk:[000, 191, 146, 295],[000, 285, 378, 538],[000, 578, 493, 997],[876, 216, 1000, 436];Chair:[003, 153, 029, 217];Storage box:[904, 194, 988, 240];Lamp:[926, 013, 955, 057];Bowl:[000, 534, 071, 667],[314, 285, 358, 316].') 2023-11-03 06:36:41.092 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:36:41.095 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A diverse group of [085, 342, 194, 749] [716, 312, 880, 888] [386, 321, 489, 801] [291, 309, 404, 787] [180, 278, 318, 768] [469, 315, 582, 822] [835, 308, 987, 900] [569, 263, 713, 830] heritage stakeholders participated in the consultative meeting in Cape Town.') 2023-11-03 06:36:52.541 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 06:36:52.541 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Person:[754, 001, 836, 110].') 2023-11-03 06:37:17.229 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:37:17.229 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Bench in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Bench:[838, 604, 999, 795],[329, 604, 678, 800],[002, 620, 095, 793].') 2023-11-03 06:37:32.109 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:37:32.109 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Samit Patel [236, 032, 748, 996] reacts after losing his wicket') 2023-11-03 06:37:37.169 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:37:37.169 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Bed:[081, 549, 832, 995].') 2023-11-03 06:38:01.391 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:38:01.392 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Cell Phone,Other Fish,Boat and Paddle in this image.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Other Fish:[306, 597, 342, 703];Boat:[110, 529, 894, 789],[117, 624, 903, 922];Paddle:[661, 554, 850, 655].') 2023-11-03 06:38:04.469 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:38:04.470 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Handbag:[382, 271, 420, 311];Chair:[608, 568, 658, 740];Bus:[040, 001, 999, 1000];Person:[627, 507, 791, 727],[960, 146, 1000, 293],[835, 160, 913, 290],[776, 186, 834, 282],[635, 111, 691, 274],[588, 218, 622, 283],[350, 178, 441, 320],[146, 588, 215, 723].') 2023-11-03 06:38:05.431 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:38:05.432 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Flutter By - arrangement of yellow lilies [289, 522, 614, 704], purple gerbera, and green Bells of Ireland [342, 091, 451, 232], accented with a butterfly [328, 306, 552, 408], and presented in a birch basket [332, 381, 767, 874].') 2023-11-03 06:38:37.025 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:38:37.026 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[373, 039, 969, 964] A Treatise concerning Heaven and Hell, containing a relation of many wonderful things therein, as...') 2023-11-03 06:38:44.678 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:38:44.679 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Antonio Tempesta (c.1655-1737) - Jupiter in the form [040, 030, 939, 867] of an eagle, carrying Ganymede - The Metamorphoses') 2023-11-03 06:38:59.947 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:38:59.947 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'man:[447, 155, 923, 918],[000, 844, 039, 938],[026, 774, 183, 999],[030, 855, 057, 932],[057, 816, 085, 927],[143, 764, 263, 999],[223, 866, 284, 936],[794, 390, 956, 999],[940, 584, 999, 999];microphone:[461, 305, 580, 397];woman:[270, 229, 574, 694];clothing:[033, 918, 147, 999],[050, 846, 080, 935],[150, 885, 270, 999],[495, 421, 571, 595],[506, 367, 959, 999],[858, 553, 960, 999],[936, 593, 999, 999],[938, 857, 994, 999];human face:[003, 842, 038, 938],[028, 866, 048, 940],[144, 816, 173, 932],[176, 774, 214, 890],[231, 870, 293, 944],[421, 274, 509, 458],[581, 173, 698, 411].') 2023-11-03 06:39:59.235 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:39:59.236 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Desk:[310, 541, 496, 662],[609, 576, 679, 645];Couch:[555, 542, 724, 629];Bench:[711, 548, 789, 660],[691, 806, 997, 998],[001, 711, 459, 1000];Pillow:[001, 705, 118, 878];Chair:[002, 551, 092, 713];Potted Plant:[135, 429, 269, 701],[366, 415, 450, 558],[905, 545, 994, 703].') 2023-11-03 06:40:42.517 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:40:42.518 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Georgia linebacker Roquan Smith [400, 009, 663, 963] runs the 40-yard dash at the NFL scouting combine in Indianapolis on March 4, 2018.') 2023-11-03 06:41:32.470 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:41:32.471 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Train:[038, 463, 653, 690].') 2023-11-03 06:41:49.157 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:41:49.157 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'human body:[000, 736, 185, 1000],[050, 636, 105, 746],[110, 642, 163, 779],[165, 679, 353, 999],[255, 688, 328, 837],[325, 662, 436, 897],[399, 810, 568, 997],[411, 654, 541, 901],[520, 671, 669, 883],[542, 681, 715, 999],[555, 509, 646, 666],[614, 644, 695, 862],[725, 725, 864, 999],[818, 835, 998, 999];human hair:[400, 816, 578, 1000],[000, 634, 325, 999],[468, 658, 525, 737],[541, 698, 554, 779],[553, 677, 611, 774],[583, 686, 668, 827],[608, 640, 672, 735],[786, 729, 864, 847],[813, 844, 999, 999];human head:[015, 773, 178, 993],[000, 646, 056, 748],[056, 638, 108, 729],[113, 642, 163, 735],[155, 679, 271, 847],[188, 621, 253, 688],[250, 642, 287, 717],[259, 686, 326, 835],[365, 662, 415, 746],[400, 794, 574, 999],[469, 656, 528, 750],[484, 642, 533, 721],[538, 702, 555, 785],[546, 677, 610, 776],[584, 690, 671, 847],[611, 646, 671, 731],[784, 729, 868, 870],[812, 837, 999, 999];mammal:[382, 795, 586, 1000],[002, 626, 334, 999],[005, 755, 189, 999],[327, 661, 435, 885],[410, 661, 538, 890],[494, 682, 671, 869],[539, 692, 715, 999],[549, 512, 644, 663],[606, 643, 701, 863],[727, 725, 871, 999],[811, 836, 999, 999];tree:[030, 000, 250, 653],[251, 524, 717, 619],[307, 345, 406, 633],[362, 585, 408, 692],[420, 588, 471, 700],[484, 561, 533, 704],[543, 594, 562, 710],[661, 619, 762, 771],[743, 484, 828, 715],[958, 438, 999, 603];clothing:[565, 565, 655, 687],[000, 729, 080, 868],[086, 706, 173, 806],[163, 829, 343, 999],[321, 737, 431, 893],[405, 727, 550, 883],[514, 760, 668, 866],[537, 804, 730, 999],[723, 841, 871, 999];woman:[167, 682, 349, 954],[405, 813, 582, 999],[608, 643, 696, 864];man:[000, 631, 077, 982],[014, 746, 181, 999],[042, 631, 118, 767],[103, 642, 171, 789],[192, 619, 256, 727],[251, 650, 283, 731],[271, 693, 348, 999],[321, 662, 437, 883],[415, 658, 550, 889],[510, 682, 669, 877],[558, 509, 639, 666],[611, 638, 700, 868],[726, 721, 874, 999],[812, 837, 999, 999].') 2023-11-03 06:41:51.061 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 06:41:51.062 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'vehicle:[731, 466, 795, 500].') 2023-11-03 06:41:55.926 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:41:55.926 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'girl:[469, 092, 959, 901];footwear:[480, 554, 570, 645],[641, 915, 716, 999];woman:[467, 084, 931, 999];plant:[000, 791, 999, 999];jeans:[530, 385, 834, 966].') 2023-11-03 06:42:07.752 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:42:07.752 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Cool Plant Stand Design Ideas for Indoor Houseplant 1') 2023-11-03 06:42:14.387 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:42:14.388 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Car:[378, 942, 520, 999].') 2023-11-03 06:42:50.272 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:42:50.272 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'clothing:[040, 479, 225, 923],[008, 410, 060, 554],[133, 425, 248, 772],[215, 397, 266, 632],[265, 419, 293, 584],[280, 423, 333, 673],[324, 412, 443, 807],[331, 408, 705, 998],[540, 367, 666, 971],[645, 509, 687, 589],[672, 444, 712, 549],[756, 514, 833, 611],[758, 403, 811, 514],[803, 429, 855, 606],[830, 403, 865, 483],[857, 393, 968, 707];man:[013, 391, 059, 544],[135, 372, 255, 778],[229, 375, 268, 632],[281, 391, 332, 680],[320, 226, 709, 998],[321, 359, 457, 834],[527, 296, 670, 996],[671, 438, 708, 615],[750, 378, 818, 606],[750, 485, 833, 727],[831, 389, 861, 498],[873, 364, 967, 716].') 2023-11-03 06:42:55.075 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:42:55.075 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'window:[029, 055, 094, 247],[000, 092, 035, 244],[000, 348, 041, 542],[054, 356, 099, 538],[200, 104, 224, 227],[265, 184, 295, 312],[271, 474, 459, 689],[272, 058, 467, 392],[298, 232, 322, 365],[340, 290, 357, 392],[348, 310, 367, 404],[365, 339, 380, 428],[380, 353, 392, 443],[721, 005, 811, 321],[762, 382, 775, 489],[785, 527, 804, 666],[791, 346, 805, 465],[880, 228, 900, 348],[937, 139, 971, 296],[952, 424, 976, 656];bus:[594, 465, 752, 833];building:[000, 000, 999, 870].') 2023-11-03 06:43:04.357 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:43:04.357 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'tree:[000, 000, 999, 999].') 2023-11-03 06:43:08.815 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:43:08.815 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Sneakers:[897, 789, 937, 912],[862, 756, 902, 884],[691, 680, 762, 824],[607, 832, 661, 975],[495, 702, 575, 816],[431, 718, 500, 814],[333, 720, 371, 805],[148, 723, 200, 900],[028, 776, 116, 928];Gloves:[539, 455, 626, 588],[711, 482, 779, 625],[789, 392, 840, 508],[221, 479, 281, 623],[045, 467, 107, 611];Helmet:[860, 078, 966, 243],[605, 062, 702, 241],[472, 193, 571, 332],[096, 035, 194, 209];Hockey Stick:[000, 244, 181, 857],[239, 622, 343, 840],[438, 462, 712, 769],[832, 465, 999, 531],[448, 677, 533, 837];Person:[787, 078, 1000, 914].') 2023-11-03 06:43:13.175 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:43:13.176 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[008, 047, 995, 996] Peter Sattmann as the devil performs during the dress rehearsal for the play "Jedermann" by author Hugo von Hoffmannsthal at the Berlin Cathedral on...') 2023-11-03 06:43:46.576 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 06:43:46.576 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Horse:[841, 297, 1000, 597];Helmet:[540, 289, 595, 328];Motorcycle:[344, 210, 887, 788];Bus:[671, 115, 911, 311];Person:[478, 287, 618, 657].') 2023-11-03 06:44:33.145 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:44:33.145 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Sneakers:[002, 886, 047, 968],[003, 674, 088, 794],[186, 794, 269, 942],[204, 711, 309, 853],[032, 397, 099, 494],[134, 385, 211, 546],[562, 807, 683, 910];Gloves:[843, 612, 915, 693],[709, 390, 811, 504],[529, 307, 587, 373],[389, 353, 477, 395];Helmet:[802, 325, 934, 536],[661, 264, 772, 402],[476, 116, 620, 318],[282, 007, 423, 144],[236, 002, 314, 090],[486, 804, 590, 960];Other Shoes:[380, 788, 489, 898];Person:[002, 010, 698, 960],[037, 002, 312, 493];Bracelet:[660, 378, 696, 440];American Football:[785, 536, 837, 595].') 2023-11-03 06:45:26.292 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:45:26.292 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'woman:[656, 407, 786, 975];man:[651, 440, 786, 972];tree:[443, 000, 998, 597].') 2023-11-03 06:45:44.511 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:45:44.512 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[612, 502, 703, 916] [459, 451, 630, 942] [093, 008, 476, 994] Commercial washing machines in a hospitality setting') 2023-11-03 06:46:28.063 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 06:46:28.063 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'food:[153, 039, 848, 906].') 2023-11-03 06:46:39.982 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:46:39.982 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'man:[462, 518, 693, 919],[197, 478, 237, 599],[424, 535, 538, 788],[890, 537, 999, 847];woman:[029, 410, 213, 816],[631, 527, 875, 999];clothing:[000, 637, 106, 913],[428, 605, 525, 718],[541, 619, 680, 935],[654, 734, 884, 999],[900, 627, 999, 850];human face:[013, 508, 120, 663],[108, 516, 181, 628],[444, 563, 478, 637],[520, 545, 585, 687],[633, 565, 709, 733].') 2023-11-03 06:46:43.484 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:46:43.485 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'bicycle:[209, 298, 236, 362],[302, 345, 351, 446],[762, 472, 980, 632];bicycle wheel:[319, 384, 338, 447],[763, 501, 845, 605],[880, 514, 976, 632];man:[208, 266, 237, 336],[304, 299, 348, 409],[808, 365, 945, 601].') 2023-11-03 06:46:53.827 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 06:46:53.827 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Person:[649, 863, 667, 919];Bench:[790, 895, 840, 959];Street Lights:[647, 004, 892, 1000],[618, 313, 738, 963];Traffic Light:[868, 490, 919, 696],[293, 260, 342, 403].') 2023-11-03 06:48:00.539 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:48:00.541 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Mirror:[001, 050, 118, 420],[567, 149, 647, 380];Chair:[001, 670, 152, 996],[215, 539, 525, 996],[530, 477, 755, 1000],[733, 435, 926, 887],[804, 425, 950, 789],[925, 405, 1000, 708];Stool:[947, 455, 1000, 642];Lamp:[156, 001, 197, 118];Surfboard:[202, 001, 525, 131];Person:[795, 352, 923, 730],[671, 287, 927, 765],[058, 203, 248, 515],[001, 209, 086, 420];Hat:[851, 280, 928, 361];Coffee Machine:[652, 306, 711, 397];Refrigerator:[718, 222, 789, 397];Moniter:[574, 008, 665, 151];Tissue:[176, 457, 246, 532].') 2023-11-03 06:48:21.763 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:48:21.764 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'man:[609, 291, 888, 705];tree:[000, 434, 622, 580],[875, 503, 999, 619];flower:[000, 567, 999, 999],[563, 780, 594, 853].') 2023-11-03 06:48:25.482 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:48:25.483 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Handbag:[851, 337, 905, 384],[886, 336, 917, 391];Lamp:[724, 002, 822, 091];shelf:[001, 774, 338, 999];Storage box:[562, 274, 608, 303],[562, 299, 605, 331];Person:[769, 338, 832, 470],[782, 280, 820, 336],[816, 236, 900, 421],[925, 239, 1000, 421],[063, 199, 998, 1000].') 2023-11-03 06:48:33.125 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:48:33.126 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'People [889, 394, 997, 996] [740, 346, 891, 995] [484, 372, 588, 995] [359, 382, 496, 995] [559, 350, 774, 994] [206, 341, 383, 995] wait outside of a currency exchange house [435, 014, 768, 940] in Buenos Aires, Argentina, on Thursday, Aug. 30, 2018.') 2023-11-03 06:48:36.221 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:48:36.221 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[407, 200, 661, 829] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a personFrame1:[407, 200, 661, 829];Frame2:[473, 192, 657, 872];Frame3:[476, 191, 664, 841];Frame4:[411, 180, 626, 878];Frame5:[354, 075, 686, 853].') 2023-11-03 06:48:40.904 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:48:40.904 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Illustration of scientists in [615, 193, 703, 332] [349, 494, 426, 662] [390, 292, 483, 423] [570, 553, 659, 741] lab coats building [189, 042, 812, 914] a heart shape out of [596, 402, 654, 489] [497, 294, 554, 374] bricks') 2023-11-03 06:48:48.431 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:48:48.432 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Person:[001, 220, 096, 398],[276, 267, 482, 783],[437, 137, 788, 956];Skating and Skiing shoes:[574, 755, 764, 862],[614, 775, 740, 957];Sneakers:[334, 716, 461, 784],[363, 587, 438, 656],[043, 378, 065, 398];Car:[809, 232, 880, 267];Street Lights:[743, 110, 849, 250].') 2023-11-03 06:48:51.568 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:48:51.568 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect woman and clothing in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'woman:[015, 772, 091, 997],[198, 802, 333, 997],[530, 750, 591, 997],[713, 800, 900, 997];clothing:[005, 805, 046, 997],[021, 800, 088, 997],[068, 790, 115, 997],[196, 760, 221, 827],[225, 935, 335, 997],[266, 745, 295, 815],[516, 782, 555, 997],[535, 797, 596, 997],[606, 780, 658, 997],[786, 920, 898, 997],[861, 880, 951, 997].') 2023-11-03 06:48:57.114 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:48:57.115 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Person:[025, 002, 999, 999],[738, 259, 1000, 957],[816, 066, 1000, 307],[852, 002, 998, 168],[714, 039, 833, 259],[206, 148, 329, 311];Boots:[648, 762, 812, 1000],[818, 778, 988, 917];Satchel:[805, 062, 879, 152],[485, 517, 573, 596];Other Shoes:[528, 512, 627, 555];Chair:[003, 069, 091, 239],[249, 087, 394, 213],[083, 205, 238, 312];Desk:[071, 083, 267, 209],[736, 023, 876, 114];Bench:[003, 237, 848, 420].') 2023-11-03 06:49:02.291 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:49:02.291 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Necklace:[431, 677, 527, 999];Person:[686, 142, 1000, 998],[002, 003, 263, 1000],[222, 166, 708, 1000],[157, 408, 239, 698].') 2023-11-03 06:49:06.391 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:49:06.392 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'tree:[470, 475, 965, 929],[000, 842, 443, 999].') 2023-11-03 06:49:08.183 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:49:08.183 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Screen capture from the PDF link above with the itinerary for Dr. Diana Lewis's visit, [770, 296, 971, 580] a headshot of Dr. Lewis, and a University of Guelph building photo") 2023-11-03 06:49:13.335 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:49:13.335 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Person:[820, 153, 933, 313],[564, 469, 650, 762],[547, 457, 598, 759],[327, 453, 441, 809],[341, 437, 403, 753],[181, 472, 254, 695],[428, 247, 494, 348],[129, 278, 176, 350];Helmet:[357, 439, 395, 493],[830, 162, 869, 219];Airplane:[913, 343, 999, 526],[487, 142, 1000, 692],[076, 189, 771, 685],[001, 259, 539, 651],[001, 275, 119, 621].') 2023-11-03 06:49:15.071 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:49:15.071 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect human arm,clothing and man in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'human arm:[484, 707, 646, 840];clothing:[346, 167, 391, 372],[403, 143, 442, 265],[418, 076, 804, 458],[450, 660, 631, 999],[470, 419, 518, 651],[803, 443, 884, 685];man:[471, 522, 673, 974],[353, 148, 406, 340],[410, 113, 445, 287],[443, 408, 517, 630],[445, 069, 477, 148],[471, 040, 792, 476],[809, 410, 893, 697],[936, 285, 959, 370].') 2023-11-03 06:49:43.022 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:49:43.022 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Faith itself is a horrible mechanism that stunts the growth of ideas. Joe Rogan Picture Quote [066, 247, 932, 698]') 2023-11-03 06:49:51.780 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:49:51.780 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'I stumbled upon these gold macarons [373, 415, 738, 697] and had to photograph them. Glam food at its finest! Also playing with monochromatic colours.') 2023-11-03 06:50:07.781 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:50:07.782 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[000, 380, 313, 951],carFrame1:[293, 443, 376, 636],carFrame1:[873, 479, 999, 587],carFrame1:[394, 499, 434, 557],carFrame1:[657, 465, 695, 579],personFrame1:[632, 450, 669, 677],carFrame1:[671, 462, 883, 708] in this video clip.To respond correctly, utilize the specified classFrame t:[xmin,ymin,xmax,ymax] format.', 'carFrame1:[000, 380, 313, 951];Frame2:[000, 387, 314, 952],carFrame1:[293, 443, 376, 636];Frame2:[292, 443, 376, 636],carFrame1:[873, 479, 999, 587];Frame2:[874, 479, 999, 586],carFrame1:[394, 499, 434, 557];Frame2:[394, 499, 434, 557],carFrame1:[657, 465, 695, 579];Frame2:[657, 465, 695, 579],personFrame1:[632, 450, 669, 677];Frame2:[632, 450, 669, 675],carFrame1:[671, 462, 883, 708];Frame2:[671, 462, 883, 708].') 2023-11-03 06:50:14.500 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 06:50:14.500 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect bicycle wheel,car and bicycle in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'bicycle wheel:[039, 205, 556, 999],[659, 526, 821, 999];car:[000, 000, 999, 999];bicycle:[039, 000, 818, 999].') 2023-11-03 06:50:38.053 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:50:38.053 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the the mounted police officer on patrol, engaging in conversation with a pedestrian while lowering their head.?For the trajectories included in the answer, please use the format TrackiFrame t:[xmin,ymin,xmax,ymax].', 'There is one the mounted police officer on patrol, engaging in conversation with a pedestrian while lowering their head..Track1frame:1:[516, 306, 564, 488];frame:2:[516, 308, 564, 490];frame:3:[516, 311, 563, 491].') 2023-11-03 06:50:50.981 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:50:50.981 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[001, 266, 549, 995] and track its trajectory.Use the specified Frame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'This is a blindworm blindwormFrame1:[001, 266, 549, 995];Frame2:[139, 235, 675, 997];Frame3:[063, 295, 661, 997];Frame4:[001, 306, 633, 972].') 2023-11-03 06:50:55.414 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:50:55.414 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Google co-[003, 031, 998, 998] founder Sergey Brin takes questions from the media following presentations at a media preview of Google's prototype autonomous vehicles in Moutain View, California") 2023-11-03 06:51:15.572 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:51:15.573 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'tower:[078, 257, 197, 520],[021, 415, 069, 551],[209, 362, 252, 491],[287, 368, 313, 435],[335, 362, 388, 476],[499, 293, 556, 478],[520, 331, 573, 483],[631, 337, 705, 657];skyscraper:[000, 344, 061, 562],[007, 410, 075, 567],[029, 543, 101, 666],[057, 313, 126, 604],[073, 662, 126, 758],[080, 566, 131, 670],[101, 289, 200, 538],[109, 608, 159, 699],[135, 530, 175, 626],[150, 551, 197, 681],[174, 580, 246, 698],[174, 381, 251, 433],[211, 635, 288, 767],[265, 507, 537, 729],[282, 366, 311, 432],[336, 366, 383, 480],[402, 621, 534, 714],[491, 275, 567, 502],[512, 337, 575, 528],[516, 612, 608, 725],[573, 571, 641, 672],[611, 396, 631, 472],[627, 353, 705, 646],[701, 478, 844, 602],[733, 349, 798, 457],[840, 509, 896, 602],[865, 333, 943, 418],[922, 496, 994, 571].') 2023-11-03 06:52:02.846 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:52:02.847 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'person:[000, 339, 087, 697],[004, 677, 035, 812],[007, 621, 090, 890],[069, 369, 153, 743],[176, 044, 285, 243],[179, 342, 262, 731],[237, 696, 335, 895],[270, 302, 375, 662],[271, 510, 356, 769],[314, 660, 407, 893],[364, 385, 451, 748],[416, 742, 505, 890],[424, 380, 537, 768],[440, 200, 497, 304],[508, 622, 580, 813],[511, 802, 612, 899],[543, 473, 593, 582],[577, 528, 650, 736],[678, 500, 833, 858],[737, 115, 775, 210],[801, 329, 881, 444];sculpture:[000, 589, 110, 877],[001, 365, 086, 690],[073, 324, 168, 746],[088, 804, 128, 890],[125, 800, 167, 882],[166, 813, 217, 877],[177, 345, 259, 708],[178, 032, 289, 287],[240, 704, 333, 880],[276, 308, 375, 662],[306, 662, 413, 890],[375, 394, 433, 674],[413, 750, 498, 897],[441, 386, 536, 744],[451, 212, 486, 292],[504, 628, 589, 825],[515, 807, 616, 877],[691, 480, 846, 815],[747, 115, 775, 188],[799, 308, 904, 496],[803, 764, 999, 891],[859, 442, 999, 778];human face:[024, 391, 054, 455],[049, 627, 093, 691],[204, 359, 232, 428],[243, 731, 310, 837],[285, 541, 344, 640],[310, 338, 344, 403],[390, 428, 428, 498],[416, 752, 500, 856],[432, 423, 515, 563],[739, 553, 789, 654],[808, 351, 876, 442].') 2023-11-03 06:52:27.374 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:52:27.375 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'woman:[776, 316, 961, 862],[246, 428, 478, 999],[523, 535, 628, 971],[602, 423, 722, 961];human leg:[000, 774, 067, 951],[150, 925, 176, 999],[315, 822, 359, 943],[357, 712, 397, 999],[389, 710, 448, 999],[585, 887, 601, 957],[588, 843, 628, 957],[623, 667, 661, 944],[659, 650, 718, 932],[778, 581, 848, 803],[825, 551, 920, 794],[949, 747, 999, 870];human head:[087, 616, 130, 693],[373, 430, 431, 526],[548, 545, 612, 628],[608, 436, 659, 501],[871, 341, 926, 407];man:[082, 606, 123, 682],[238, 626, 295, 997];girl:[000, 580, 193, 981],[245, 426, 472, 999],[531, 543, 633, 960],[578, 409, 733, 943],[780, 307, 985, 804];clothing:[293, 522, 478, 834],[530, 630, 611, 709],[611, 494, 735, 807],[707, 582, 786, 889],[872, 403, 999, 569];human face:[872, 352, 917, 420];human arm:[013, 678, 070, 743],[133, 379, 439, 639],[250, 592, 365, 709],[432, 542, 475, 819],[575, 606, 620, 690],[682, 507, 718, 687],[744, 587, 785, 660],[918, 417, 978, 580].') 2023-11-03 06:52:27.789 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:52:27.789 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'tree:[051, 019, 428, 432],[000, 000, 999, 552];fashion accessory:[494, 820, 604, 900],[530, 933, 622, 999],[618, 277, 747, 405],[752, 919, 817, 999];girl:[486, 273, 818, 999];footwear:[520, 939, 619, 999],[756, 921, 816, 999];flower:[153, 353, 227, 467],[404, 469, 458, 551],[408, 382, 483, 463],[436, 448, 501, 543],[709, 000, 999, 320];human face:[643, 362, 745, 490].') 2023-11-03 06:52:32.596 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:52:32.597 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Glasses:[443, 112, 477, 177],[609, 002, 645, 157],[822, 002, 864, 137];Lamp:[446, 592, 526, 833];Cup:[869, 543, 918, 726];Cabinet:[035, 468, 088, 685],[822, 445, 933, 515];Car:[211, 676, 254, 751];Tie:[121, 220, 216, 333];Person:[791, 523, 1000, 999],[682, 464, 825, 999],[670, 454, 774, 999],[613, 475, 679, 898],[484, 460, 655, 1000],[423, 690, 666, 1000],[211, 549, 472, 999],[124, 507, 247, 1000],[124, 412, 238, 902],[349, 503, 465, 893],[464, 471, 539, 837],[431, 462, 492, 573],[371, 477, 429, 581],[250, 467, 324, 715],[221, 445, 262, 541],[144, 415, 248, 655].') 2023-11-03 06:52:54.673 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:52:54.673 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'goat:[340, 694, 416, 865],[000, 661, 440, 999],[458, 635, 495, 751],[476, 571, 615, 695].') 2023-11-03 06:54:05.135 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:54:05.135 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[546, 450, 601, 633],personFrame1:[617, 436, 672, 687],personFrame1:[322, 461, 362, 660],personFrame1:[554, 376, 597, 568],personFrame1:[651, 519, 686, 695],personFrame1:[253, 504, 292, 712],personFrame1:[434, 424, 490, 607],personFrame1:[577, 451, 633, 670],personFrame1:[377, 390, 423, 531],personFrame1:[407, 514, 469, 717] in this video clip.To respond correctly, utilize the specified classFrame t:[xmin,ymin,xmax,ymax] format.', 'personFrame1:[546, 450, 601, 633];Frame2:[552, 454, 607, 636],personFrame1:[617, 436, 672, 687];Frame2:[647, 451, 713, 675],personFrame1:[322, 461, 362, 660];Frame2:[282, 470, 334, 664],personFrame1:[554, 376, 597, 568];Frame2:[559, 376, 601, 554],personFrame1:[651, 519, 686, 695];Frame2:[658, 519, 707, 668],personFrame1:[253, 504, 292, 712];Frame2:[205, 523, 261, 712],personFrame1:[434, 424, 490, 607];Frame2:[435, 426, 490, 607],personFrame1:[577, 451, 633, 670];Frame2:[600, 473, 656, 674],personFrame1:[377, 390, 423, 531];Frame2:[344, 393, 397, 537],personFrame1:[407, 514, 469, 717];Frame2:[421, 518, 471, 716].') 2023-11-03 06:54:11.389 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:54:11.389 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Weebly is a top membership website builder [129, 321, 333, 954] because of its eCommerce capabilities.') 2023-11-03 06:54:22.216 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:54:22.217 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The Prince [610, 076, 918, 988] of Wales buries two sweaters [672, 151, 916, 660], one made from wool and another from a synthetic fibre, to test their durability') 2023-11-03 06:55:07.861 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:55:07.862 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'View all your Alerts [046, 314, 953, 670] on a site') 2023-11-03 06:56:02.221 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:56:02.221 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[027, 178, 351, 489].') 2023-11-03 06:57:14.859 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:57:14.859 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect tree,woman,girl and dress in this image.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'tree:[000, 000, 999, 998];woman:[047, 592, 264, 1000];girl:[000, 615, 278, 998];dress:[010, 722, 228, 979].') 2023-11-03 06:57:54.559 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:57:54.559 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[142, 156, 748, 998] and track its trajectory.Stick to the Frame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'This is a antelopeFrame1:[142, 156, 748, 998];Frame2:[166, 102, 810, 999];Frame3:[267, 001, 882, 854];Frame4:[338, 283, 943, 999].') 2023-11-03 06:58:00.563 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:58:00.563 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Backpack,Moniter,Printer,Person and Computer Box in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Backpack:[165, 775, 232, 940];TV:[007, 554, 112, 769],[696, 455, 767, 580],[850, 562, 998, 833];Printer:[623, 488, 698, 591];Person:[633, 193, 880, 1000],[460, 416, 643, 914],[440, 254, 517, 666],[284, 415, 360, 610],[155, 121, 364, 940];Computer Box:[731, 901, 868, 999].') 2023-11-03 06:58:22.603 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:58:22.603 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[903, 559, 946, 744],personFrame1:[150, 531, 193, 713],personFrame1:[795, 611, 850, 815],personFrame1:[475, 495, 513, 661],personFrame1:[467, 579, 512, 779],personFrame1:[800, 487, 848, 662],personFrame1:[639, 415, 695, 583] in this video clip.To respond correctly, utilize the specified classFrame t:[xmin,ymin,xmax,ymax] format.', 'personFrame1:[903, 559, 946, 744];Frame2:[914, 556, 962, 743],personFrame1:[150, 531, 193, 713];Frame2:[184, 537, 236, 709],personFrame1:[795, 611, 850, 815];Frame2:[749, 590, 849, 777],personFrame1:[475, 495, 513, 661];Frame2:[508, 488, 546, 658],personFrame1[467, 579, 512, 779];Frame2:No object,personFrame1:[800, 487, 848, 662];Frame2:[793, 480, 862, 659],personFrame1:[639, 415, 695, 583];Frame2:[667, 404, 700, 576].') 2023-11-03 06:58:47.585 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:58:47.585 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[321, 440, 368, 498],carFrame1:[756, 428, 861, 485],carFrame1:[000, 439, 033, 540],personFrame1:[033, 399, 064, 524],carFrame1:[515, 432, 608, 496] in this video clip.Please strictly follow the format categoryFrame t:[xmin,ymin,xmax,ymax] to answer the question.', 'carFrame1[321, 440, 368, 498];Frame2:No object,carFrame1:[756, 428, 861, 485];Frame2:[778, 431, 900, 499],carFrame1[000, 439, 033, 540];Frame2:No object,personFrame1[033, 399, 064, 524];Frame2:No object,carFrame1:[515, 432, 608, 496];Frame2:[494, 434, 600, 505].') 2023-11-03 06:58:59.105 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:58:59.105 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the a black goose is looking for food in the lake in this video clip?Maintain consistency in your answer by using the TrackiFrame t:[xmin,ymin,xmax,ymax] format for trajectories.', 'There is one a black goose is looking for food in the lake.Track1frame:1:[300, 697, 689, 981];frame:2:[300, 697, 689, 981];frame:3:[412, 533, 588, 877].') 2023-11-03 06:59:22.599 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:59:22.599 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'January 8, 2012 [109, 059, 481, 836] - Panda with [562, 271, 955, 994] candle (which smells great') 2023-11-03 06:59:45.367 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:59:45.367 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[488, 397, 585, 623] and track its trajectory.Any trajectory details should be arranged according to the Frame t:[xmin,ymin,xmax,ymax] format for clarity.', 'This is a personFrame1:[488, 397, 585, 623];Frame2:[484, 387, 584, 620];Frame3:[530, 494, 667, 631];Frame4:[559, 458, 639, 622];Frame5:[576, 370, 649, 600].') 2023-11-03 06:59:51.472 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 06:59:51.473 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Satchel,Ice cream and Bowl/Basin in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Satchel:[808, 273, 940, 473];Ice cream:[702, 449, 783, 512];Bowl:[701, 474, 784, 513].') 2023-11-03 07:00:01.902 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:00:01.903 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[175, 242, 882, 679] and track its trajectory.Any trajectory details should be arranged according to the Frame t:[xmin,ymin,xmax,ymax] format for clarity.', 'This is a airplaneFrame1:[175, 242, 882, 679];Frame2:[185, 260, 908, 704];Frame3:[173, 235, 921, 705];Frame4:[153, 209, 910, 690].') 2023-11-03 07:00:21.851 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 07:00:21.851 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Moniter:[119, 328, 560, 881],[460, 876, 729, 999];Head Phone:[219, 408, 270, 480],[422, 411, 471, 479],[179, 569, 205, 602],[402, 584, 419, 607],[643, 319, 723, 430],[924, 277, 999, 390],[017, 294, 055, 430];Remote:[635, 798, 741, 866];modem:[555, 798, 687, 875];Converter:[000, 823, 044, 926];Person:[583, 312, 770, 591],[852, 278, 1000, 622],[841, 584, 1000, 702],[408, 294, 479, 336],[220, 169, 284, 333],[000, 288, 116, 618];Watch:[920, 638, 948, 686];Necklace:[929, 441, 973, 579];Cup:[806, 598, 844, 656],[746, 565, 780, 623],[572, 550, 603, 598],[572, 599, 600, 643];Desk:[009, 578, 998, 997].') 2023-11-03 07:00:29.308 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:00:29.308 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect boat,human face,fashion accessory and dress in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'human face:[834, 143, 988, 545],[577, 008, 726, 380];fashion accessory:[044, 487, 122, 902],[383, 396, 458, 788],[436, 375, 498, 495],[623, 379, 683, 540];dress:[496, 253, 795, 859],[723, 587, 999, 998].') 2023-11-03 07:00:48.310 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:00:48.310 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Desk:[503, 698, 719, 787];Person:[692, 472, 772, 700],[729, 409, 786, 539],[409, 435, 482, 760],[462, 401, 515, 736],[214, 444, 286, 769];Guitar:[477, 490, 573, 602],[695, 610, 789, 715];Piano:[149, 572, 316, 730];Drum:[627, 572, 700, 702],[623, 556, 675, 635];Speaker:[849, 554, 944, 699],[777, 562, 846, 645],[532, 549, 593, 702],[471, 549, 531, 738],[099, 622, 175, 726],[177, 637, 220, 726].') 2023-11-03 07:00:52.067 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:00:52.067 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Trolley:[475, 741, 622, 1000],[713, 730, 796, 1000];Satchel:[191, 752, 248, 866];Storage box:[684, 624, 840, 704],[685, 686, 839, 779],[703, 751, 766, 796],[679, 772, 762, 836],[704, 817, 765, 861],[677, 828, 763, 902],[688, 866, 833, 1000],[766, 718, 886, 953],[814, 715, 933, 948],[935, 835, 988, 955],[898, 945, 984, 997],[760, 897, 920, 999],[875, 047, 970, 184];Lamp:[031, 068, 089, 103],[133, 040, 196, 072];Chair:[894, 310, 974, 440];Cabinet:[629, 449, 713, 575],[517, 488, 568, 601],[248, 263, 396, 563],[000, 121, 266, 684];Person:[215, 560, 314, 895],[050, 607, 078, 645].') 2023-11-03 07:00:53.051 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:00:53.052 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Street Lights:[178, 117, 316, 306];Stroller:[732, 367, 780, 413];Person:[192, 132, 650, 999];Hat:[350, 138, 491, 235];Sneakers:[295, 929, 381, 993].') 2023-11-03 07:01:30.616 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:01:30.616 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'man:[053, 625, 212, 980],[000, 067, 049, 250],[000, 311, 051, 446],[000, 789, 060, 932],[000, 389, 055, 514],[033, 473, 105, 625],[063, 393, 128, 516],[097, 458, 173, 651],[127, 382, 190, 507],[159, 066, 227, 240],[163, 007, 222, 106],[187, 383, 231, 528],[197, 000, 246, 063],[289, 000, 340, 091],[331, 066, 403, 235],[364, 466, 450, 637],[398, 020, 451, 197],[412, 318, 476, 514],[478, 305, 532, 501],[509, 384, 576, 554],[559, 480, 595, 578],[580, 066, 643, 200],[646, 004, 706, 117],[703, 452, 761, 640],[766, 008, 825, 151],[834, 302, 903, 429],[917, 589, 999, 746],[966, 020, 999, 185];shorts:[221, 703, 299, 827],[278, 703, 374, 859],[575, 643, 671, 774],[788, 966, 914, 998];volleyball:[453, 081, 516, 174];woman:[036, 641, 115, 825],[052, 067, 120, 247],[115, 073, 155, 216],[171, 321, 229, 432],[198, 235, 394, 998],[230, 019, 283, 168],[274, 203, 378, 998],[392, 587, 690, 998],[484, 138, 786, 811],[643, 076, 702, 196],[704, 550, 972, 998],[776, 401, 834, 532],[835, 389, 882, 545],[886, 389, 949, 547];girl:[195, 224, 391, 998],[262, 200, 375, 998],[391, 595, 689, 998],[482, 141, 788, 817],[698, 550, 969, 998],[775, 393, 838, 569],[833, 386, 890, 570].') 2023-11-03 07:01:47.295 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:01:47.296 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Man [369, 142, 693, 823] and his cute baby [232, 549, 453, 758] in stroller [151, 434, 604, 893] outdoors') 2023-11-03 07:01:53.097 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:01:53.097 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[080, 097, 656, 995] A man climbing up [391, 105, 691, 985] the ladder on the back of [048, 012, 990, 999] an Antero Adventure Van') 2023-11-03 07:02:07.608 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:02:07.608 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Hat:[436, 304, 527, 432];Boat:[149, 373, 192, 424],[424, 379, 468, 428];Person:[396, 305, 576, 1000].') 2023-11-03 07:02:15.206 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:02:15.206 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Person:[838, 352, 990, 999],[067, 216, 514, 999];Hat:[214, 217, 421, 406];Bus:[434, 213, 881, 774].') 2023-11-03 07:02:28.755 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:02:28.756 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Get [105, 201, 580, 983] [547, 234, 890, 998] the kids out with this spider tree swing. What a great gift idea.') 2023-11-03 07:02:46.199 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:02:46.199 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Person:[361, 282, 400, 374],[308, 213, 359, 268],[424, 209, 442, 271],[557, 213, 582, 287],[591, 204, 668, 375],[466, 368, 660, 999],[652, 611, 744, 813],[826, 380, 1000, 1000],[829, 425, 894, 807],[933, 683, 1000, 1000];Helmet:[612, 203, 659, 244];Traffic cone:[436, 089, 541, 339];Car:[021, 511, 073, 563].') 2023-11-03 07:03:38.312 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:03:38.313 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'man [512, 214, 689, 624] standing in front of a white garage [004, 140, 994, 577] holding a cane [578, 433, 786, 751] out in front of him with a red arrow around the tip of the cane') 2023-11-03 07:03:57.164 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:03:57.164 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'plant:[638, 273, 999, 882];building:[000, 000, 999, 999].') 2023-11-03 07:04:09.960 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:04:09.960 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Group of dance students [745, 205, 996, 993] [587, 185, 779, 997] [116, 207, 381, 994] [359, 072, 651, 996] wearing masks [879, 192, 941, 272] [738, 238, 791, 310] [050, 324, 115, 389] [649, 252, 712, 345] [446, 165, 545, 278] [226, 277, 314, 381] as preventive measures against Covid19 clap their hands [316, 304, 369, 439] [724, 312, 786, 418] [125, 261, 202, 415] [574, 221, 656, 404] [367, 199, 448, 388] during a protest in Plaza de Sant. Teachers,...') 2023-11-03 07:04:13.242 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:04:13.243 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Why You Should Take The First Step (Even if You Don't Know What it is)") 2023-11-03 07:04:16.758 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:04:16.758 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Storage box:[001, 829, 075, 911];Person:[894, 824, 999, 999],[239, 440, 436, 999],[406, 586, 476, 884];bucket:[728, 883, 773, 980],[839, 802, 882, 860].') 2023-11-03 07:05:02.315 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:05:02.315 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Person:[500, 199, 588, 923],[575, 203, 698, 862],[647, 228, 729, 800],[717, 222, 810, 797].') 2023-11-03 07:05:03.237 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:05:03.238 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Portrait drawing of a young girl [060, 006, 977, 991] in straw hat [068, 001, 852, 617] in tinted charcoal - detail') 2023-11-03 07:05:56.168 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:05:56.168 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Person:[416, 258, 997, 1000],[350, 297, 629, 892],[001, 333, 408, 1000],[251, 620, 350, 745];Glasses:[749, 371, 848, 422];Handbag:[596, 456, 696, 698];Bottle:[686, 522, 720, 669];Basin:[297, 628, 380, 698],[388, 602, 461, 663],[425, 878, 493, 947],[183, 936, 324, 998],[325, 952, 459, 998];Cup:[348, 636, 416, 762],[409, 669, 453, 773],[293, 801, 340, 882];Chopsticks:[395, 889, 438, 966].') 2023-11-03 07:06:19.738 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:06:19.739 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Recording artists Diplo [004, 034, 547, 998] and Justin Beiber [470, 035, 998, 995] winners of the award [144, 417, 426, 799] for 'Dance Song of the Year' for 'Where are u now' backstage at the iHeartRadio Music...") 2023-11-03 07:06:39.718 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:06:39.719 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Lamp:[323, 043, 379, 088],[963, 267, 999, 402];Cabinet:[231, 374, 327, 473];Chair:[001, 411, 146, 577],[022, 402, 146, 511],[315, 608, 432, 1000],[475, 487, 644, 762],[965, 503, 999, 721];Bench:[001, 543, 062, 760];Desk:[452, 671, 997, 997];Piano:[896, 402, 982, 671];Moniter:[243, 304, 310, 384].') 2023-11-03 07:06:50.024 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:06:50.024 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[362, 428, 451, 772] [486, 454, 554, 676] [595, 357, 771, 994] [116, 457, 181, 676] [213, 458, 333, 958] Six blurred construction workers on the go in busy construction site') 2023-11-03 07:06:54.873 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:06:54.873 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect clothing,man,woman and building in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'clothing:[000, 350, 079, 456],[004, 558, 123, 750],[070, 412, 278, 520],[071, 370, 167, 460],[218, 385, 317, 564],[257, 200, 403, 516],[260, 491, 479, 741],[340, 568, 756, 925],[481, 329, 582, 552],[671, 493, 914, 737],[673, 360, 707, 452],[835, 289, 876, 362],[842, 381, 982, 483],[865, 270, 920, 343],[965, 520, 998, 643];man:[379, 356, 765, 912],[273, 154, 410, 572],[562, 245, 621, 370];woman:[693, 314, 931, 806],[000, 325, 082, 464],[089, 304, 176, 468],[203, 339, 492, 827],[487, 279, 584, 493],[618, 287, 710, 452],[685, 289, 748, 439],[848, 243, 879, 366],[895, 285, 981, 481];building:[000, 000, 998, 997].') 2023-11-03 07:07:26.508 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:07:26.509 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Students [513, 272, 664, 585] [611, 178, 772, 587] [716, 176, 867, 960] [343, 298, 549, 752] filming an evening war scene with key light') 2023-11-03 07:07:36.126 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:07:36.127 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '4 south sea pearl strands [005, 079, 948, 986] on a checkerboard backdrop.') 2023-11-03 07:07:47.644 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:07:47.644 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Person:[032, 169, 409, 1000],[463, 157, 718, 1000],[382, 130, 716, 999],[387, 153, 534, 711];Desk:[733, 264, 821, 379];Flower:[002, 003, 302, 498].') 2023-11-03 07:08:20.641 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:08:20.643 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[082, 002, 421, 998] A woman in [097, 320, 323, 996] athletic wear and [221, 149, 335, 325] a medical mask runs on a treadmill.') 2023-11-03 07:09:27.409 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:09:27.410 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'High angle shot of a cup of [215, 095, 612, 654] hot chocolate with [360, 113, 421, 195] [509, 164, 579, 241] [391, 160, 446, 252] [442, 168, 507, 243] marshmallows on [196, 264, 991, 998] a wooden surface Free Photo') 2023-11-03 07:09:28.867 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:09:28.867 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The sanctuary [416, 004, 995, 995] of B’nai Israel in Sacramento, before (left) and after (right) (Photo/HCLA') 2023-11-03 07:09:29.854 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:09:29.854 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A half length oil portrait [144, 078, 764, 995] of a man, naked, with a pigeon [378, 608, 706, 997] perched on his outstretched finger. Behind him, deep green foliage.') 2023-11-03 07:10:11.165 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:10:11.165 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Chair:[764, 468, 954, 669];Sneakers:[643, 823, 779, 892];Handbag:[596, 707, 882, 810];Person:[125, 179, 423, 879],[458, 192, 795, 894];Whiteboard:[246, 152, 849, 457].') 2023-11-03 07:10:16.169 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:10:16.169 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'man:[726, 118, 953, 972],[053, 346, 237, 999],[092, 156, 251, 478],[182, 239, 418, 999],[303, 342, 569, 999],[306, 135, 426, 429],[437, 128, 595, 998],[478, 397, 733, 999],[600, 084, 785, 999],[900, 235, 925, 316];boy:[051, 135, 593, 999];tree:[000, 000, 261, 584],[389, 080, 460, 341],[443, 062, 513, 283],[586, 016, 695, 174],[713, 074, 793, 293],[787, 000, 849, 107],[850, 070, 959, 285],[916, 058, 999, 268];trousers:[760, 562, 883, 999].') 2023-11-03 07:10:30.537 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:10:30.538 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Storage box:[849, 716, 977, 890],[769, 685, 853, 855];Cabinet:[000, 234, 066, 405];Desk:[000, 483, 237, 895];Stool:[427, 514, 488, 647],[227, 627, 338, 808];Bowl:[000, 575, 083, 627],[066, 576, 158, 627],[101, 525, 183, 566],[035, 524, 117, 566],[000, 524, 049, 568];Bakset:[224, 592, 343, 666],[545, 768, 700, 869];Satchel:[076, 709, 206, 909];Person:[885, 267, 999, 409],[285, 276, 406, 770],[028, 312, 107, 406].') 2023-11-03 07:10:41.033 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:10:41.033 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[602, 426, 723, 568] in this video clip.For clarity in your response, always adopt the classFrame t:[xmin,ymin,xmax,ymax] format.', 'carFrame1:[602, 426, 723, 568];Frame2:[601, 424, 726, 573].') 2023-11-03 07:10:51.448 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:10:51.448 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'girl:[322, 000, 982, 999];clothing:[322, 029, 641, 576].') 2023-11-03 07:10:55.671 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:10:55.671 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'woman:[147, 502, 335, 749],[228, 307, 353, 653],[336, 323, 451, 658],[356, 216, 409, 379],[368, 012, 661, 804],[488, 283, 579, 435],[600, 262, 701, 449],[766, 281, 860, 464],[869, 218, 950, 367],[899, 259, 965, 414];man:[039, 239, 255, 644],[246, 000, 383, 264],[293, 201, 372, 486],[443, 137, 571, 349],[555, 053, 603, 201],[558, 142, 666, 402],[783, 239, 858, 293];girl:[248, 304, 360, 699],[340, 316, 485, 621],[365, 011, 688, 963],[491, 288, 563, 418],[765, 266, 856, 439];clothing:[659, 604, 851, 999];human face:[146, 318, 181, 434],[187, 536, 295, 685],[245, 227, 275, 318],[269, 330, 331, 458],[280, 115, 325, 184],[311, 216, 373, 345],[369, 247, 418, 340],[385, 356, 447, 472],[452, 184, 516, 271],[474, 401, 565, 551],[498, 296, 558, 410],[569, 200, 603, 283],[600, 174, 650, 285],[626, 268, 679, 378],[775, 294, 831, 406],[789, 230, 851, 306],[893, 227, 955, 302].') 2023-11-03 07:11:29.251 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:11:29.252 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect mammal,human hair and human hand in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'mammal:[000, 697, 150, 945],[012, 127, 093, 258],[096, 560, 169, 727],[110, 575, 192, 738],[181, 586, 261, 726],[206, 584, 365, 969],[211, 553, 251, 645],[301, 570, 320, 637],[303, 499, 348, 570],[315, 564, 387, 709],[353, 557, 397, 630],[376, 663, 607, 999],[386, 503, 430, 566],[413, 545, 471, 634],[443, 564, 520, 696],[453, 572, 586, 804],[463, 710, 786, 999],[477, 198, 512, 371],[581, 545, 623, 636],[591, 564, 669, 722],[671, 577, 719, 665],[705, 544, 739, 632],[733, 545, 766, 608],[750, 555, 800, 622],[766, 582, 836, 760],[766, 699, 999, 993],[817, 588, 960, 758],[820, 577, 913, 724],[823, 549, 861, 641],[863, 562, 893, 626],[945, 586, 999, 707];human hair:[516, 678, 594, 825],[145, 579, 183, 656],[252, 583, 313, 671],[320, 568, 360, 631],[600, 712, 721, 922],[783, 577, 823, 656],[826, 550, 861, 621],[898, 587, 955, 671],[923, 695, 999, 856];human hand:[513, 794, 608, 945],[573, 772, 622, 897].') 2023-11-03 07:11:43.372 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:11:43.372 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Uganda elections: [370, 071, 837, 996] Bobi Wine ends interview with Kenyan radio abruptly as military raids his home') 2023-11-03 07:12:12.311 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:12:12.311 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Chair:[335, 546, 405, 593];Potted Plant:[471, 906, 524, 999],[643, 834, 689, 910],[513, 870, 555, 924],[527, 730, 630, 915];Desk:[776, 780, 819, 830];Couch:[672, 794, 744, 844].') 2023-11-03 07:12:25.762 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:12:25.763 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect tree,human face,man and clothing in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'tree:[000, 000, 999, 999];human face:[395, 128, 473, 258];man:[325, 052, 636, 999];clothing:[321, 230, 637, 999].') 2023-11-03 07:12:46.120 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:12:46.120 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given frame1: and frame2:,track personFrame1:[731, 471, 781, 710],carFrame1:[037, 429, 104, 501],carFrame1:[000, 420, 066, 510] in this video clip.It's imperative that you stick to the categoryFrame t:[xmin,ymin,xmax,ymax] format for your answer.", 'personFrame1:[731, 471, 781, 710];Frame2:[827, 437, 908, 790],carFrame1:[037, 429, 104, 501];Frame2:[000, 427, 041, 496],carFrame1[000, 420, 066, 510];Frame2:No object.') 2023-11-03 07:13:00.664 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:13:00.665 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'human hair:[052, 313, 214, 655],[052, 280, 084, 350],[291, 051, 471, 319],[441, 271, 693, 696],[783, 308, 841, 421],[810, 315, 931, 533],[978, 345, 999, 435];man:[146, 108, 505, 923],[001, 332, 271, 998],[396, 282, 848, 998],[673, 310, 753, 544],[721, 310, 835, 668],[725, 317, 999, 998],[950, 322, 999, 611];clothing:[739, 432, 976, 973],[000, 426, 056, 708],[000, 586, 230, 998],[001, 365, 060, 515],[145, 191, 455, 961],[386, 657, 846, 998],[676, 359, 754, 507],[722, 416, 814, 636],[920, 782, 999, 998],[958, 457, 999, 614];woman:[000, 279, 080, 583];human face:[019, 327, 054, 406],[168, 436, 229, 676],[328, 176, 445, 390],[793, 373, 879, 545];human arm:[000, 406, 044, 495],[000, 423, 049, 618],[033, 382, 060, 460],[061, 642, 276, 998],[155, 257, 410, 659],[393, 809, 503, 998],[395, 297, 460, 738],[725, 506, 841, 810],[790, 628, 971, 956],[924, 795, 999, 998],[953, 493, 997, 616].') 2023-11-03 07:13:01.533 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:13:01.533 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect human face,person and toilet in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'human face:[638, 415, 698, 484];person:[322, 000, 688, 987].') 2023-11-03 07:13:11.547 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 07:13:11.548 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[148, 110, 549, 999],[492, 113, 886, 954];Boots:[289, 800, 396, 920],[184, 878, 340, 1000];High Heels:[516, 831, 635, 934],[582, 847, 717, 956].') 2023-11-03 07:13:20.672 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:13:20.672 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Storage box:[000, 513, 471, 917],[197, 437, 612, 724],[331, 369, 679, 542],[432, 332, 703, 507],[508, 296, 701, 375];Desk:[009, 568, 612, 1000],[505, 435, 743, 921],[926, 141, 999, 216],[014, 574, 606, 999];Other Fish:[527, 382, 637, 422];Other Shoes:[677, 821, 716, 916];Hat:[621, 049, 665, 106];Satchel:[614, 237, 680, 289];Truck:[000, 001, 609, 592];Trolley:[954, 457, 1000, 721];Person:[607, 049, 724, 285],[678, 001, 979, 1000],[883, 048, 931, 155].') 2023-11-03 07:13:26.877 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:13:26.877 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the black fish moving around?As you detail the trajectories in your reply, the TrackiFrame t:[xmin,ymin,xmax,ymax] format is the preferred structure.', 'There is no black fish moving around.') 2023-11-03 07:14:00.159 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:14:00.160 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, what is this objectframe:1:[857, 172, 998, 839];frame:2:[860, 177, 998, 843];frame:3:[773, 241, 998, 912] doing?Craft a concise reply using the image frames and trajectory specifics you have at hand.', 'This is a/an man staying and shifting by stepping backs') 2023-11-03 07:14:10.588 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:14:10.588 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A group of [001, 587, 121, 867] [192, 498, 288, 689] [319, 495, 389, 661] people outdoors receiving instruction from [845, 536, 992, 867] a personal trainer.') 2023-11-03 07:14:11.505 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:14:11.505 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'cat:[000, 136, 996, 820].') 2023-11-03 07:14:38.629 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:14:38.629 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the man sweeping leaves and talking to lady?For the trajectories included in the answer, please use the format TrackiFrame t:[xmin,ymin,xmax,ymax].', 'There is one man sweeping leaves and talking to lady.Track1frame:1:[053, 217, 209, 999];frame:2:[050, 217, 209, 999];frame:3:[058, 226, 197, 999].') 2023-11-03 07:15:11.574 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 07:15:11.574 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'woman:[585, 318, 1000, 1000],[462, 452, 566, 646];human hair:[582, 207, 955, 833];human head:[562, 224, 921, 605];human arm:[672, 688, 964, 987];human face:[595, 302, 742, 551].') 2023-11-03 07:15:12.585 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:15:12.586 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'fish:[061, 105, 947, 868].') 2023-11-03 07:15:44.388 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 07:15:44.388 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person and Helmet in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Person:[044, 540, 195, 1000],[001, 852, 030, 1000],[354, 235, 524, 914],[459, 163, 622, 809],[615, 001, 802, 601];Helmet:[096, 541, 162, 607],[388, 234, 453, 299],[496, 165, 557, 240].') 2023-11-03 07:16:32.209 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:16:32.209 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[897, 475, 964, 682],[841, 482, 909, 675],[735, 459, 772, 607],[683, 453, 728, 601],[473, 471, 518, 682],[407, 478, 466, 683],[384, 429, 424, 545],[146, 446, 196, 593],[092, 442, 141, 596].') 2023-11-03 07:16:39.373 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:16:39.373 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "SAPU Mr and Mrs Couples Mug Set,Valentine's Day gift,Unique Wedding Gift for Bride and Groom, His and Hers Anniversary Pre...") 2023-11-03 07:17:02.830 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:17:02.830 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Cabinet:[615, 407, 961, 865];Storage box:[342, 361, 430, 531],[426, 270, 503, 423],[051, 419, 134, 660],[456, 846, 620, 999];Desk:[000, 564, 703, 1000];Person:[134, 208, 313, 505],[747, 196, 931, 919];Umbrella:[203, 012, 367, 120];Bakset:[454, 416, 526, 472],[451, 520, 588, 623];Scale:[124, 491, 252, 613];Shovel:[298, 461, 346, 522],[408, 852, 470, 999].') 2023-11-03 07:17:08.527 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:17:08.527 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'human body:[743, 135, 940, 808],[000, 186, 123, 701],[111, 055, 145, 118],[135, 156, 223, 680],[151, 036, 177, 120],[191, 195, 438, 816],[217, 466, 445, 920],[260, 071, 276, 200],[262, 048, 295, 223],[306, 033, 385, 205],[360, 176, 547, 660],[417, 056, 446, 183],[443, 021, 478, 183],[541, 176, 721, 675];human hair:[201, 216, 278, 328],[366, 525, 446, 651],[823, 150, 892, 240];mammal:[737, 170, 927, 776],[000, 178, 133, 713],[113, 041, 137, 121],[135, 153, 236, 688],[143, 041, 177, 116],[191, 195, 442, 805],[215, 453, 451, 931],[258, 070, 273, 158],[261, 048, 300, 190],[307, 043, 332, 178],[321, 053, 340, 200],[341, 058, 361, 198],[355, 076, 385, 201],[356, 178, 553, 703],[413, 048, 446, 170],[446, 025, 480, 183],[511, 170, 723, 676];man:[527, 160, 721, 696],[000, 178, 133, 705],[136, 151, 238, 701],[182, 190, 436, 821],[208, 460, 448, 905],[250, 021, 482, 228],[353, 170, 555, 713],[746, 153, 933, 831];footwear:[373, 840, 436, 896].') 2023-11-03 07:17:21.186 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:17:21.187 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[054, 441, 387, 901],[364, 297, 596, 565],[414, 174, 845, 968];Sneakers:[233, 862, 307, 901],[063, 838, 131, 890],[430, 468, 520, 502],[424, 523, 534, 563];Hat:[894, 315, 964, 353],[884, 244, 948, 280];Handbag:[862, 409, 948, 451];Backpack:[774, 273, 851, 319];Car:[322, 272, 674, 369];Street Lights:[192, 004, 330, 447],[348, 019, 472, 343].') 2023-11-03 07:18:09.766 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:18:09.768 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'human body:[209, 231, 426, 1000],[503, 583, 756, 999];man:[207, 239, 445, 1000];clothing:[201, 300, 429, 931],[500, 756, 748, 999];human face:[293, 222, 445, 415],[379, 295, 437, 432],[648, 642, 728, 785];human hand:[278, 812, 382, 975];human leg:[207, 786, 405, 999];woman:[491, 581, 750, 999];human hair:[316, 216, 470, 334],[571, 577, 725, 755];human head:[321, 217, 456, 434],[578, 584, 728, 779];flower:[777, 380, 818, 474],[779, 280, 818, 375];human arm:[220, 385, 368, 958],[496, 805, 566, 999].') 2023-11-03 07:18:14.221 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:18:14.222 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'toy:[308, 124, 999, 969];calculator:[000, 410, 595, 999].') 2023-11-03 07:18:15.956 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:18:15.957 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'man:[006, 034, 207, 790],[673, 619, 711, 743],[732, 632, 765, 751];skateboard:[000, 708, 100, 794];tree:[000, 000, 999, 756];footwear:[018, 716, 087, 764].') 2023-11-03 07:18:18.671 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 07:18:18.671 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Traffic cone in this image.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Traffic cone:[964, 806, 999, 884],[822, 794, 852, 864],[712, 785, 740, 846],[623, 775, 649, 831],[513, 766, 535, 815],[433, 759, 453, 803].') 2023-11-03 07:18:26.148 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:18:26.149 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Dinning Table:[002, 413, 1000, 1000],[262, 269, 597, 467],[587, 208, 680, 308];Napkin:[244, 819, 430, 943],[359, 442, 438, 487],[905, 396, 965, 440];Green beans:[754, 662, 815, 725];Glasses:[117, 224, 207, 286];Plate:[451, 544, 721, 699],[411, 946, 593, 1000],[224, 932, 353, 999],[388, 465, 588, 545],[538, 391, 718, 462],[757, 402, 914, 454],[901, 489, 999, 588];Bowl:[724, 656, 844, 744],[878, 611, 995, 693],[164, 589, 244, 664],[358, 489, 401, 533];Cup:[816, 805, 974, 1000];Spoon:[275, 854, 331, 999],[636, 909, 703, 999];Chopsticks:[095, 493, 241, 557];Wine Glass:[220, 428, 313, 674],[674, 294, 719, 405];Person:[002, 106, 255, 754],[684, 076, 962, 414].') 2023-11-03 07:19:01.348 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 07:19:01.348 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Scooter:[900, 526, 999, 677],[002, 508, 115, 656];Tricycle:[584, 374, 931, 651];Person:[049, 370, 160, 655].') 2023-11-03 07:19:21.678 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:19:21.678 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Frisbee:[398, 284, 443, 326];Person:[392, 318, 627, 817],[171, 398, 196, 459],[196, 402, 231, 460],[235, 324, 269, 452],[366, 366, 394, 427],[517, 326, 548, 383],[559, 240, 591, 433],[596, 241, 633, 417],[624, 251, 657, 413],[648, 237, 685, 359],[626, 337, 681, 435],[799, 314, 911, 410],[809, 225, 849, 356],[860, 217, 905, 441],[843, 189, 872, 353],[890, 203, 923, 387],[919, 210, 948, 326],[911, 294, 967, 385],[946, 297, 999, 402],[972, 164, 999, 322],[272, 392, 298, 443].') 2023-11-03 07:19:35.052 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:19:35.053 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '"Common Kestrel [531, 341, 718, 675] (Falco tinnunculus) in soft winter light, perched on a fence..."') 2023-11-03 07:19:42.368 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 07:19:42.368 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect Desk and Picture/Frame in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Desk:[037, 607, 478, 997];Frame:[451, 768, 511, 940].') 2023-11-03 07:19:43.188 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:19:43.188 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'person:[216, 123, 415, 560],[000, 000, 029, 196],[000, 119, 326, 866],[057, 000, 184, 463],[238, 000, 365, 284],[344, 000, 415, 245],[446, 000, 561, 373],[465, 000, 505, 209],[514, 214, 670, 413],[670, 109, 891, 470],[761, 000, 841, 190],[838, 000, 909, 388];human body:[241, 125, 418, 486],[000, 047, 026, 205],[000, 330, 332, 868],[020, 000, 181, 501],[125, 038, 175, 249],[239, 034, 365, 291],[339, 001, 418, 240],[445, 000, 558, 370],[461, 057, 481, 163],[663, 207, 890, 469],[768, 029, 842, 201],[810, 003, 906, 234];human hair:[515, 193, 677, 395],[002, 121, 153, 298],[229, 121, 446, 421],[265, 000, 329, 058],[749, 106, 823, 192],[775, 000, 825, 076];table:[237, 375, 998, 998];clothing:[690, 174, 892, 449],[000, 051, 038, 163],[000, 336, 309, 886],[238, 038, 354, 271],[238, 265, 435, 490],[336, 000, 401, 238],[473, 000, 560, 373],[763, 019, 845, 173],[832, 000, 904, 245];human arm:[454, 041, 535, 151],[000, 429, 298, 714],[113, 447, 330, 577],[165, 083, 181, 258],[239, 087, 263, 291],[246, 284, 425, 474],[270, 066, 362, 241],[344, 014, 421, 183],[394, 276, 441, 438],[672, 240, 726, 371],[758, 230, 889, 470],[765, 039, 783, 120],[815, 026, 885, 123],[822, 033, 846, 212].') 2023-11-03 07:19:54.766 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:19:54.766 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[219, 433, 255, 559] and track its trajectory.When detailing trajectories in your response, adhere to the Frame t:[xmin,ymin,xmax,ymax] format.', 'This is a personFrame1:[219, 433, 255, 559];Frame2:[299, 441, 335, 550];Frame3:[361, 447, 404, 558];Frame4:[364, 452, 397, 566].') 2023-11-03 07:19:57.596 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:19:57.596 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[647, 318, 1000, 999];Desk:[464, 864, 644, 1000];Laptop:[584, 830, 694, 1000].') 2023-11-03 07:20:06.606 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:20:06.606 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[084, 443, 247, 911] [471, 432, 652, 936] [731, 440, 833, 916] [251, 456, 393, 916] [348, 425, 497, 931] [827, 443, 955, 907] ballet dancers in [826, 580, 936, 831] [086, 586, 217, 848] [628, 548, 745, 836] [354, 581, 460, 839] long white tutu on a winter forest background with snow falling') 2023-11-03 07:20:44.923 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:20:44.923 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'human leg:[729, 740, 837, 993],[000, 716, 091, 999],[154, 806, 335, 995],[378, 808, 433, 995],[430, 785, 565, 945],[597, 755, 643, 999],[890, 737, 930, 949];man:[717, 260, 971, 665],[000, 210, 373, 999],[286, 279, 649, 995],[410, 177, 833, 995];sunglasses:[111, 271, 189, 330],[358, 345, 436, 370],[814, 340, 881, 375];human body:[000, 204, 380, 999],[281, 403, 652, 999],[411, 169, 835, 999],[457, 343, 528, 482],[525, 336, 575, 412],[728, 362, 953, 992];human hair:[111, 210, 201, 273],[564, 345, 631, 376],[803, 275, 885, 361];table:[031, 593, 999, 999];human arm:[000, 405, 246, 685],[188, 440, 380, 657],[290, 441, 553, 662],[410, 171, 541, 475],[446, 438, 651, 590],[644, 176, 833, 467],[741, 370, 898, 543],[765, 421, 946, 615];human hand:[156, 620, 250, 695],[295, 445, 377, 570],[411, 170, 483, 259],[476, 591, 554, 666],[578, 438, 652, 525],[762, 522, 838, 615],[785, 180, 835, 269],[818, 364, 896, 469].') 2023-11-03 07:21:11.046 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:21:11.046 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[104, 116, 294, 849] [304, 116, 485, 851] [699, 117, 886, 850] [504, 116, 685, 850] 4 pillars of a solid sales process') 2023-11-03 07:21:21.664 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:21:21.664 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Trash bin Can:[790, 739, 855, 785];Stool:[225, 608, 428, 791];Lamp:[715, 527, 764, 582].') 2023-11-03 07:21:43.234 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:21:43.235 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Hat,Person and Microphone in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Hat:[316, 034, 706, 209],[002, 189, 178, 297];Person:[002, 034, 893, 1000],[002, 191, 226, 1000];Microphone:[263, 231, 497, 346].') 2023-11-03 07:22:42.436 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:22:42.436 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[323, 080, 451, 359] and track its trajectory.All trajectories in your reply should conform to the Frame t:[xmin,ymin,xmax,ymax] pattern.', 'This is a eyeFrame1:[323, 080, 451, 359];Frame2:[357, 082, 484, 355];Frame3:[405, 084, 533, 357];Frame4:[412, 084, 540, 357];Frame5:[424, 086, 552, 368].') 2023-11-03 07:23:00.169 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:23:00.170 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[701, 106, 924, 910] Top 7 mobile pokies to play this spring') 2023-11-03 07:23:01.155 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:23:01.156 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[498, 194, 795, 631] [201, 180, 498, 635] Lever-back earrings with [225, 368, 385, 617] a tropical pink flamingo theme in pinks and greens') 2023-11-03 07:23:33.515 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:23:33.516 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Hamburger,Speaker,Bench and Plate in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Speaker:[594, 125, 637, 413],[632, 245, 692, 412],[243, 139, 291, 449];Bench:[543, 645, 827, 965];Plate:[640, 660, 700, 694].') 2023-11-03 07:23:53.293 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:23:53.293 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'pouring [120, 048, 826, 736] batter from [027, 024, 916, 743] the blender into [022, 395, 989, 977] the muffin pan') 2023-11-03 07:24:09.800 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:24:09.801 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, what is this objectframe:1:[469, 241, 602, 662];frame:2:[444, 236, 605, 651];frame:3:[437, 234, 605, 641] doing?Craft a concise reply using the image frames and trajectory specifics you have at hand.', 'This is a/an the tiger that is walking away from us and facing in the opposite direction, going into the distance.') 2023-11-03 07:24:18.414 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:24:18.415 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[049, 299, 153, 756] Street vendor who set himself on fire after [381, 302, 465, 667] municipal police seized [185, 445, 455, 910] his stall dies in Turkey') 2023-11-03 07:24:19.039 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:24:19.040 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'collage [010, 002, 991, 992] of simple illustrations of the liberty bell') 2023-11-03 07:24:35.531 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:24:35.531 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Sneakers:[966, 659, 1000, 710],[002, 559, 058, 668],[024, 548, 072, 630];Backpack:[862, 098, 924, 237];Desk:[256, 475, 768, 999];Chair:[335, 898, 714, 999],[863, 518, 963, 890],[066, 521, 339, 999];Person:[687, 223, 958, 1000],[102, 231, 516, 909],[410, 136, 606, 480];Flask:[608, 450, 644, 549];Cup:[331, 592, 385, 661],[590, 444, 631, 490];Camera:[508, 425, 548, 476].') 2023-11-03 07:24:54.135 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:24:54.136 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Guitar,Chair,Other Shoes and Person in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Guitar:[003, 053, 097, 623];Chair:[823, 237, 1000, 654];Other Shoes:[553, 760, 590, 810];Person:[417, 088, 703, 820],[058, 771, 300, 1000],[392, 821, 546, 1000],[641, 734, 845, 999],[884, 722, 1000, 1000].') 2023-11-03 07:24:54.487 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 07:24:54.488 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'man:[678, 427, 825, 798],[006, 360, 986, 872],[406, 633, 765, 993];bicycle:[006, 622, 026, 769],[025, 595, 118, 949],[048, 607, 073, 683],[125, 566, 200, 813],[195, 553, 225, 683],[251, 576, 286, 846],[293, 566, 335, 712],[330, 545, 378, 758],[361, 561, 413, 817],[433, 559, 493, 781],[441, 561, 453, 683],[563, 551, 615, 677],[635, 542, 681, 742],[666, 566, 718, 719],[673, 631, 801, 941],[796, 612, 935, 911],[898, 593, 961, 761],[938, 551, 978, 723];footwear:[016, 566, 986, 901],[088, 729, 113, 815].') 2023-11-03 07:25:01.332 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:25:01.333 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'tree:[426, 417, 663, 877],[000, 175, 249, 607],[158, 000, 999, 817],[235, 276, 318, 375].') 2023-11-03 07:25:04.496 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:25:04.497 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the pigeon moving around on sand?Stick to the TrackiFrame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'There is one pigeon moving around on sand.Track1frame:1:[500, 408, 575, 577];frame:2:[466, 426, 560, 570];frame:3:[455, 419, 553, 570].') 2023-11-03 07:25:22.684 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:25:22.684 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'giraffe:[223, 363, 452, 885],[036, 265, 220, 931],[520, 271, 666, 876];person:[335, 391, 953, 998];clothing:[492, 470, 947, 998];human face:[368, 451, 513, 613].') 2023-11-03 07:26:10.064 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:26:10.064 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'cartoon of Jeff Bezos [206, 394, 492, 794] standing atop a cloud [726, 561, 997, 806] with the label [566, 060, 969, 574]') 2023-11-03 07:26:12.001 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:26:12.001 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'helicopter:[018, 295, 927, 591].') 2023-11-03 07:27:00.105 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:27:00.105 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A tank moves into position as Turkish people [461, 553, 568, 809] [250, 553, 346, 944] [359, 507, 476, 845] [004, 480, 119, 950] [606, 473, 739, 743] [750, 483, 878, 773] [117, 474, 261, 828] attempt to stop it early Saturday in Ankara, Turkey.') 2023-11-03 07:27:07.942 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:27:07.942 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect clothing,woman and human face in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'clothing:[072, 555, 586, 1000],[417, 000, 601, 122],[440, 546, 975, 999],[741, 104, 999, 866];woman:[392, 048, 921, 966];human face:[293, 399, 458, 739],[436, 032, 669, 313],[578, 335, 721, 621].') 2023-11-03 07:27:23.848 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:27:23.849 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "How to make mini s'mores cups with graham crackers [370, 791, 610, 996] [692, 796, 926, 996], press graham cracker crust down with a shot glass [367, 006, 562, 626]") 2023-11-03 07:27:51.997 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:27:51.997 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[814, 793, 900, 881] Custom DNS server settings on a TP-link router admin page') 2023-11-03 07:28:31.964 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:28:31.964 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'woman:[395, 059, 508, 319],[000, 046, 078, 305],[055, 048, 173, 688],[142, 000, 321, 415],[265, 000, 427, 883],[551, 092, 675, 323],[600, 283, 698, 644],[649, 101, 738, 578],[721, 094, 807, 574];snack:[608, 876, 651, 957],[713, 824, 771, 919];table:[463, 837, 999, 998],[552, 520, 999, 915];man:[344, 078, 399, 186],[380, 260, 634, 816],[753, 048, 921, 535];girl:[000, 053, 053, 268],[060, 036, 160, 250],[146, 000, 304, 390],[275, 013, 422, 812],[391, 049, 496, 324],[547, 094, 675, 315],[595, 275, 698, 650];human face:[004, 062, 052, 152],[094, 060, 135, 156],[213, 050, 265, 166],[249, 595, 311, 772],[434, 071, 477, 147],[491, 287, 548, 421],[541, 321, 626, 499],[606, 118, 644, 196],[629, 315, 684, 413],[837, 078, 873, 143],[885, 283, 947, 355].') 2023-11-03 07:28:47.781 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:28:47.782 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Hat in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Hat:[431, 098, 793, 287].') 2023-11-03 07:29:07.293 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:29:07.294 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Lamp:[222, 350, 333, 440];Backpack:[384, 578, 428, 629];Satchel:[267, 619, 289, 649];Person:[366, 551, 449, 733],[201, 554, 279, 715],[256, 541, 314, 706].') 2023-11-03 07:29:09.977 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:29:09.978 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Storage box:[002, 004, 1000, 890];Canned:[790, 369, 849, 530],[880, 259, 952, 337],[845, 353, 900, 458],[908, 420, 972, 543],[863, 581, 949, 659].') 2023-11-03 07:29:18.605 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:29:18.605 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[274, 093, 433, 850] and track its trajectory.Ensure the trajectories in your answer follow the Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a personFrame1:[274, 093, 433, 850];Frame2:[328, 087, 478, 879];Frame3:[307, 044, 527, 951];Frame4:[271, 012, 445, 986];Frame5:[171, 018, 393, 973].') 2023-11-03 07:29:26.092 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:29:26.093 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the rabbit running around.As you detail the trajectories in your reply, the TrackiFrame t:[xmin,ymin,xmax,ymax] format is the preferred structure.', 'There are 6 rabbit running around.Track1frame:1:[135, 702, 240, 833];frame:2:[159, 748, 284, 867];frame:3:[198, 742, 261, 859],Track2frame:1:[239, 295, 309, 491];frame:2:[240, 300, 310, 492];frame:3:[239, 300, 310, 495],Track3frame:1:[347, 247, 482, 393];frame:2:[328, 239, 482, 391];frame:3:[313, 188, 469, 382],Track4frame:1:[152, 420, 238, 612];frame:2:[127, 458, 202, 657];frame:3:[116, 513, 208, 720],Track5frame:1:[709, 390, 873, 622];frame:2:[773, 375, 872, 636];frame:3:[736, 322, 879, 575],Track6frame:1:[786, 490, 888, 681];frame:2:[679, 318, 801, 585];frame:3:[685, 391, 834, 554].') 2023-11-03 07:29:34.272 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:29:34.273 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, can you tell me what thisframe:1:[000, 244, 428, 505];frame:2:[000, 254, 528, 520];frame:3:[097, 275, 622, 458] is?Ensure your answer is to the point, taking into account the provided image frames and trajectory coordinates.', 'This is a/an black horses shifting to left-side') 2023-11-03 07:29:52.084 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 07:29:52.085 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Plate:[000, 123, 999, 999];Cup:[918, 001, 999, 378],[687, 044, 802, 209],[267, 001, 390, 132];Wine Glass:[785, 001, 933, 154];Bread:[450, 420, 942, 919],[004, 324, 461, 819].') 2023-11-03 07:30:13.875 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 07:30:13.875 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Person:[072, 109, 376, 632].') 2023-11-03 07:30:31.313 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:30:31.313 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Lamp and Mirror in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Lamp:[772, 222, 832, 266];Mirror:[003, 292, 167, 866],[223, 321, 370, 775],[764, 304, 972, 559].') 2023-11-03 07:30:56.366 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:30:56.367 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Symbol of house: stack of books with a black laptop [327, 146, 962, 512] on top') 2023-11-03 07:31:17.247 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:31:17.247 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[005, 038, 988, 955] A realm is provisioned with a primary instance group and a secondary instance group.') 2023-11-03 07:31:33.601 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:31:33.602 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Wild Rice Salad with Roasted Butternut Squash photographed from a three quarter angle on a grey background with loads of [123, 165, 834, 736] wild rice, roasted butternut squash and [554, 921, 612, 967] [509, 001, 579, 049] [181, 188, 270, 284] cilantro strewn around') 2023-11-03 07:31:35.212 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 07:31:35.212 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Armen Living LCAMBEWHBCH Amanda Bench in White and Chrome Finish') 2023-11-03 07:31:42.679 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:31:42.679 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'suit:[158, 367, 853, 973];human hair:[312, 083, 631, 345];human head:[275, 084, 615, 379];man:[101, 055, 833, 966];human mouth:[319, 305, 394, 359];human ear:[524, 211, 580, 291];human face:[296, 139, 505, 416];human arm:[109, 683, 243, 999],[370, 472, 814, 999];glasses:[271, 189, 525, 303];human nose:[296, 236, 358, 311].') 2023-11-03 07:31:47.953 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:31:47.953 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Power outlet,Traffic Sign,Banana,Person,Cup,Lamp,Gloves and Handbag/Satchel in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[045, 066, 465, 968],[102, 197, 485, 999],[397, 215, 871, 999],[592, 116, 825, 550],[793, 264, 999, 889];Cup:[057, 335, 132, 457];Lamp:[002, 076, 092, 258],[783, 100, 869, 246],[163, 042, 207, 113];Gloves:[108, 561, 231, 950];Handbag:[709, 646, 799, 796].') 2023-11-03 07:31:52.016 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:31:52.016 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Bottle:[510, 246, 580, 509],[403, 251, 474, 513],[314, 254, 382, 512],[202, 262, 269, 515];Street Lights:[605, 190, 696, 507],[639, 394, 679, 513],[326, 212, 415, 293];Traffic Light:[586, 385, 596, 422],[583, 428, 596, 446],[165, 013, 222, 190],[043, 138, 078, 235],[008, 148, 037, 225].') 2023-11-03 07:31:52.778 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:31:52.778 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Pear:[822, 918, 874, 978],[870, 909, 909, 972];Orange:[868, 881, 909, 926];Apple:[812, 889, 857, 930];Storage box:[554, 174, 710, 473],[693, 169, 831, 474],[403, 145, 528, 293];shelf:[782, 488, 995, 999];Desk:[092, 727, 847, 1000];Cup:[157, 461, 256, 594],[379, 604, 423, 694];Coffee Machine:[090, 284, 378, 795];Bottle:[814, 307, 922, 547].') 2023-11-03 07:31:57.209 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:31:57.210 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[242, 032, 689, 996] Halle Berry, the Hollywood actress, had a baby at 47') 2023-11-03 07:31:59.770 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:31:59.771 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[627, 297, 648, 339] and track its trajectory.If you're including trajectory details in your reply, the Frame t:[xmin,ymin,xmax,ymax] format is imperative.", 'This is a handFrame1:[627, 297, 648, 339];Frame2:[580, 460, 601, 487];Frame3:[488, 522, 508, 556];Frame4:[493, 518, 511, 552];Frame5:[498, 516, 516, 550].') 2023-11-03 07:33:26.273 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:33:26.274 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[856, 529, 996, 995] [596, 571, 960, 998] [002, 551, 256, 994] [797, 353, 996, 692] [595, 236, 827, 751] [055, 269, 294, 721] Bridesmaids surrounding [193, 279, 650, 939] the bride as she dresses') 2023-11-03 07:33:39.489 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:33:39.489 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect clothing and human hand in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'clothing:[312, 264, 831, 977];human hand:[850, 501, 988, 791],[313, 749, 518, 913],[863, 511, 935, 654].') 2023-11-03 07:33:44.292 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:33:44.292 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'air fried candied bacon in a container [040, 428, 772, 997]') 2023-11-03 07:33:51.872 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:33:51.872 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Car:[048, 366, 953, 925].') 2023-11-03 07:34:10.252 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:34:10.252 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Half Dome in the background with one of [004, 047, 696, 995] Yosemite/s iconic weathered tree roots standing proudly in the forefront.') 2023-11-03 07:34:16.673 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:34:16.673 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[255, 289, 613, 544] and track its trajectory.Use the specified Frame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'This is a gliderFrame1:[255, 289, 613, 544];Frame2:[272, 367, 633, 564];Frame3:[280, 392, 654, 520];Frame4:[353, 389, 718, 546];Frame5:[270, 385, 702, 626].') 2023-11-03 07:34:39.905 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:34:39.905 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[016, 005, 995, 995] The cover of Kindred: The ground-breaking masterpiece') 2023-11-03 07:34:47.082 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:34:47.082 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Inside the Spar at Coleham where [494, 292, 541, 388] [181, 252, 250, 410] staff have become used to clearing up floodwater') 2023-11-03 07:35:37.890 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:35:37.890 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'toy:[302, 484, 417, 573],[035, 782, 096, 857],[045, 635, 102, 709],[181, 637, 244, 701],[183, 787, 240, 862],[185, 531, 248, 596],[311, 948, 377, 998],[338, 737, 396, 801],[473, 946, 538, 998],[670, 942, 736, 998],[820, 939, 893, 998],[858, 528, 920, 596],[858, 642, 922, 698],[914, 775, 977, 845];person:[000, 234, 154, 546],[000, 526, 997, 998],[217, 198, 544, 423],[311, 475, 417, 582],[421, 484, 519, 590],[503, 484, 584, 581],[576, 268, 643, 387],[603, 435, 937, 637],[665, 281, 772, 393],[789, 275, 860, 376],[874, 284, 941, 396];clothing:[020, 296, 171, 465],[229, 251, 557, 421],[425, 704, 475, 784],[603, 568, 665, 595],[674, 979, 736, 998],[682, 620, 753, 648],[684, 532, 759, 579],[862, 565, 933, 592],[866, 665, 929, 687].') 2023-11-03 07:35:40.468 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:35:40.468 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the a girl doing somersault.Ensure the trajectories in your answer follow the TrackiFrame t:[xmin,ymin,xmax,ymax] structure.', 'There is one a girl doing somersault.Track1frame:1:[370, 406, 582, 741];frame:2:[370, 408, 584, 747];frame:3:[371, 408, 584, 739].') 2023-11-03 07:35:56.988 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 07:35:56.988 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Person:[319, 522, 994, 1000],[435, 327, 857, 822].') 2023-11-03 07:36:03.683 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:36:03.683 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'bicycle wheel:[856, 649, 903, 810],[081, 726, 093, 917],[090, 703, 110, 840],[120, 735, 143, 836],[141, 756, 159, 846],[169, 748, 188, 838],[251, 686, 285, 836],[296, 679, 338, 821],[438, 699, 473, 810],[496, 703, 533, 814],[554, 681, 593, 803],[618, 677, 659, 799],[642, 709, 667, 789],[688, 711, 711, 787],[756, 726, 783, 801],[792, 658, 831, 793],[808, 727, 830, 801];man:[017, 469, 099, 932],[255, 425, 318, 818],[531, 429, 568, 787];wheel:[852, 649, 903, 812],[081, 726, 091, 906],[085, 752, 096, 883],[090, 697, 112, 851],[139, 752, 158, 851],[171, 752, 189, 829],[250, 682, 286, 836],[296, 682, 341, 814],[313, 641, 322, 733],[343, 636, 360, 756],[439, 709, 473, 806],[501, 709, 531, 806],[554, 682, 593, 799],[618, 682, 659, 799],[643, 694, 668, 774],[689, 718, 713, 786],[755, 718, 782, 810],[788, 667, 828, 782],[809, 733, 828, 806],[845, 709, 861, 789];bicycle:[068, 619, 102, 909],[091, 649, 118, 833],[139, 656, 190, 846],[250, 607, 338, 829],[438, 622, 532, 810],[551, 611, 659, 803],[639, 634, 710, 795],[753, 652, 836, 795],[790, 572, 901, 791];woman:[068, 497, 107, 801],[381, 465, 435, 795],[416, 493, 446, 729],[462, 469, 500, 795],[551, 472, 591, 761],[820, 461, 865, 787];tree:[000, 001, 144, 495],[066, 000, 257, 525],[208, 000, 343, 575],[343, 003, 466, 544],[368, 405, 418, 517],[412, 320, 485, 544],[435, 000, 555, 649],[472, 110, 554, 641],[521, 000, 612, 690],[631, 000, 715, 272],[635, 283, 693, 467],[660, 000, 793, 529],[749, 000, 838, 617],[794, 000, 946, 744],[850, 000, 999, 649];clothing:[016, 592, 043, 879],[036, 491, 114, 879],[069, 581, 105, 784],[121, 649, 145, 803],[258, 470, 300, 806],[386, 514, 424, 759],[415, 514, 446, 741],[452, 568, 470, 714],[463, 517, 493, 682],[467, 602, 493, 756],[536, 476, 563, 675],[555, 517, 579, 737],[580, 557, 611, 752],[651, 598, 684, 759],[690, 545, 786, 909],[771, 536, 790, 703],[781, 613, 799, 744],[816, 506, 854, 733],[957, 484, 965, 560].') 2023-11-03 07:36:04.469 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 07:36:04.469 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'pastry:[000, 790, 201, 980],[035, 710, 248, 822],[036, 386, 388, 700],[113, 920, 385, 995],[198, 807, 603, 936],[218, 617, 755, 877],[275, 265, 456, 420],[361, 925, 586, 998],[478, 281, 853, 393],[575, 788, 911, 998],[596, 361, 998, 726].') 2023-11-03 07:36:09.482 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:36:09.482 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[577, 491, 666, 548],[059, 657, 251, 702],[222, 649, 341, 695],[670, 664, 872, 749],[876, 654, 998, 717];TV:[531, 480, 679, 565],[438, 573, 514, 616].') 2023-11-03 07:37:03.134 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:37:03.134 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'woman:[581, 350, 838, 949],[045, 412, 170, 984];human arm:[602, 540, 681, 682],[000, 815, 088, 933],[042, 625, 073, 759],[052, 886, 104, 975],[063, 511, 185, 721],[597, 589, 832, 880];human body:[000, 803, 090, 982],[043, 409, 159, 971],[583, 377, 827, 995],[765, 927, 895, 997];human leg:[098, 675, 156, 984];human hair:[078, 410, 135, 492],[661, 384, 777, 554];human head:[078, 410, 136, 498],[655, 381, 771, 560],[803, 774, 999, 953];mammal:[000, 811, 098, 999],[039, 411, 176, 981],[592, 385, 827, 999],[799, 775, 998, 960],[874, 905, 999, 995];man:[590, 390, 814, 999],[806, 783, 999, 945];girl:[045, 410, 168, 982];clothing:[031, 495, 170, 978],[636, 530, 825, 995];human face:[082, 432, 127, 497],[658, 456, 698, 557];human hand:[065, 913, 100, 979],[592, 514, 632, 589],[596, 608, 813, 860],[871, 903, 999, 999].') 2023-11-03 07:37:06.831 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:37:06.832 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect Dog in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Dog:[426, 381, 860, 1000],[065, 208, 875, 998],[002, 196, 411, 997].') 2023-11-03 07:37:18.674 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:37:18.675 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'building:[000, 000, 326, 999],[055, 000, 999, 999];window:[012, 004, 117, 165],[190, 568, 249, 633],[196, 513, 263, 563],[208, 281, 270, 388],[279, 568, 365, 640],[283, 508, 360, 560],[292, 276, 370, 392],[427, 791, 511, 913],[435, 334, 486, 426],[552, 343, 652, 436],[561, 549, 659, 599],[565, 804, 673, 915],[568, 602, 659, 684],[709, 363, 800, 458],[727, 612, 814, 703],[730, 563, 805, 607],[752, 807, 839, 916],[864, 376, 960, 465],[882, 625, 969, 705],[885, 575, 962, 616],[910, 802, 999, 913].') 2023-11-03 07:37:29.058 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:37:29.058 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[368, 054, 989, 991] yvette nicole brown at the 2020 film independent spirit awards on february 08, 2020 in santa monica, california. [006, 006, 994, 993] - film independent spirit awards stock videos & royalty-free footage') 2023-11-03 07:37:33.429 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:37:33.430 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Desk:[410, 899, 999, 1000];Couch:[198, 603, 1000, 943];Potted Plant:[146, 389, 243, 466];Person:[002, 680, 085, 914],[237, 271, 681, 1000],[840, 367, 999, 695];Cup:[909, 491, 961, 544],[644, 824, 724, 929].') 2023-11-03 07:37:43.709 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:37:43.710 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Flag:[635, 620, 725, 743],[646, 562, 772, 647],[431, 533, 550, 652],[514, 341, 675, 462],[507, 242, 679, 322],[257, 440, 497, 569],[410, 454, 539, 542],[321, 376, 526, 473],[205, 340, 380, 468],[312, 231, 449, 350],[347, 330, 470, 429],[413, 242, 492, 350],[114, 231, 311, 354],[050, 126, 178, 213];Bench:[509, 510, 906, 559],[175, 115, 341, 186],[001, 121, 084, 186],[784, 117, 846, 147];Storage box:[663, 538, 745, 598];Boat:[001, 481, 198, 737],[006, 098, 964, 993];Barrel:[774, 563, 848, 615],[740, 549, 774, 604].') 2023-11-03 07:37:51.106 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:37:51.107 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'sweet potato hash [023, 262, 917, 718] in a skillet [006, 245, 990, 795] with a spoon [024, 086, 602, 549]') 2023-11-03 07:38:00.424 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 07:38:00.425 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Person:[325, 372, 392, 792],[517, 447, 569, 739],[650, 481, 695, 705],[753, 498, 791, 687],[824, 513, 857, 682],[886, 526, 911, 673];Hat:[011, 252, 056, 292],[341, 379, 376, 416];Sneakers:[011, 786, 036, 851],[526, 706, 557, 738].') 2023-11-03 07:38:15.870 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 07:38:15.871 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Potted Plant in this image.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Potted Plant:[340, 699, 420, 859].') 2023-11-03 07:38:20.157 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:38:20.157 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'shelf:[113, 001, 892, 836],[001, 623, 116, 998];Picture:[917, 094, 1000, 191];bucket:[385, 091, 536, 217];Luggage:[243, 661, 565, 999];Trash bin Can:[100, 793, 293, 999];Storage box:[141, 201, 227, 265],[227, 222, 337, 275],[385, 244, 656, 347],[132, 587, 284, 690],[278, 588, 448, 688],[425, 587, 557, 680],[538, 552, 854, 995].') 2023-11-03 07:38:27.053 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 07:38:27.053 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Backpack:[216, 889, 311, 1000];Person:[251, 791, 391, 1000],[059, 789, 107, 959],[147, 786, 204, 956].') 2023-11-03 07:38:37.846 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:38:37.847 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Bicycle:[641, 589, 730, 701];Blackboard:[214, 644, 280, 788];Trash bin Can:[806, 700, 877, 827];Desk:[010, 784, 101, 1000],[326, 614, 361, 699];Lamp:[258, 041, 307, 130],[305, 140, 338, 203],[667, 035, 715, 130],[635, 140, 669, 204];Cabinet:[065, 487, 119, 565];Person:[401, 549, 447, 709];Flag:[532, 425, 569, 494],[547, 377, 596, 470],[582, 275, 668, 428],[388, 385, 436, 474],[289, 232, 387, 403].') 2023-11-03 07:38:47.909 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:38:47.909 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[061, 219, 357, 670] and track its trajectory.When detailing trajectories in your response, adhere to the Frame t:[xmin,ymin,xmax,ymax] format.', 'This is a electric fan swinging on the groundFrame1:[061, 219, 357, 670];Frame2:[049, 219, 351, 662];Frame3:[042, 219, 345, 662];Frame4:[030, 219, 332, 662];Frame5:[035, 219, 320, 662].') 2023-11-03 07:38:50.925 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:38:50.925 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Handbag:[605, 621, 790, 827];Other Shoes:[363, 676, 415, 768],[256, 637, 319, 718];Bench:[149, 124, 612, 660];Desk:[098, 466, 328, 792];Cabinet:[002, 101, 194, 415],[112, 002, 287, 128];Person:[153, 080, 363, 645],[256, 091, 536, 769],[574, 338, 993, 1000].') 2023-11-03 07:39:01.705 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:39:01.705 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Carpet:[275, 677, 994, 998];Potted Plant:[001, 661, 137, 998],[065, 495, 140, 590];Cabinet:[001, 573, 171, 998];Vase:[024, 505, 057, 585];Bed:[475, 522, 931, 883];Lamp:[644, 524, 670, 586],[934, 536, 997, 662];Nightstand:[897, 645, 995, 820];Book:[001, 584, 089, 627];Moniter:[001, 182, 060, 429].') 2023-11-03 07:39:11.997 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:39:11.998 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'elements [227, 301, 756, 687] of gluten free teriyaki sauce in a measuring cup [128, 109, 882, 908]') 2023-11-03 07:39:31.694 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:39:31.694 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2 and frame3, please tell me what this oneframe:1:[253, 434, 298, 711];frame:2:[253, 434, 298, 711];frame:3:[272, 435, 314, 691] is doing?Please briefly answer the question based on the provided image frames and trajectory coordinates.', 'This is a/an bike shifting forward') 2023-11-03 07:39:53.087 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:39:53.087 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video cluo including frame1,frame2 and frame3, please tell me what this oneframe:1:[001, 596, 073, 644];frame:2:[001, 596, 073, 644];frame:3:[001, 590, 082, 635] is doing?Using the given image frames and trajectory coordinates, please provide a concise answer.Refer to the supplied image frames and trajectory coordinates for a short response to the question.Please give a succinct reply by referring to the image frames and trajectory coordinates provided.Make sure your response is brief and references the image frames and trajectory coordinates you've been given.Draw upon the provided image frames and trajectory coordinates for a concise answer to the question.Relying on the image frames and trajectory coordinates shared, please keep your answer brief.", 'This is a/an ferry does not change position') 2023-11-03 07:40:15.184 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:40:15.185 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Assess the image and offer an answer to my question 'QUESTION: Why is [394, 295, 469, 595] on [398, 383, 467, 782] and [279, 299, 367, 595] on [293, 379, 368, 772] and [539, 263, 636, 645] on [363, 419, 396, 681] ? \nANSWER: [394, 295, 469, 595] and [279, 299, 367, 595] and [293, 379, 368, 772] and the others are playing polo . You should explain the reason for the above answer.'. Ensure to explain the reasoning process and highlight the spatial location of objects with square brackets, like [xmin,ymin,xmax,ymax]", 'Polo is played by a group of people on horseback wearing helmets and brandishing implements .') 2023-11-03 07:40:26.852 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:40:26.853 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "[293, 009, 777, 999] a candy corn stand with sweets for kids and [416, 008, 642, 140] faux blackbirds is great to treat your neighbors' kids") 2023-11-03 07:41:52.483 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:41:52.484 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[433, 494, 495, 808] and track its trajectory.Incorporate the Frame t:[xmin,ymin,xmax,ymax] format when presenting trajectories in your response.', 'This is a personFrame1:[433, 494, 495, 808];Frame2:[566, 465, 611, 759];Frame3:[578, 427, 638, 756];Frame4:[555, 465, 643, 868];Frame5:[519, 376, 596, 784].') 2023-11-03 07:42:07.049 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:42:07.050 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Clog in the pipe vector icon in golden circle, cartoon style isolated on [001, 008, 989, 995] white background') 2023-11-03 07:42:19.670 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:42:19.671 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[024, 015, 981, 1000];clothing:[213, 819, 891, 998];human face:[258, 214, 599, 763].') 2023-11-03 07:42:34.122 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:42:34.123 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A foil packet recipe for the grill [005, 024, 989, 987], filled with seasoned pork chops [074, 267, 990, 756], baby potatoes [003, 393, 411, 601], and finished with a robust Chimichurri sauce [004, 002, 609, 236]! Perfect for weeknight and summer cooking!') 2023-11-03 07:42:42.367 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:42:42.367 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[442, 404, 503, 548] and track its trajectory.All trajectories in your reply should conform to the Frame t:[xmin,ymin,xmax,ymax] pattern.', 'This is a duckFrame1:[442, 404, 503, 548];Frame2:[438, 470, 506, 593];Frame3:[488, 522, 606, 656];Frame4:[614, 550, 668, 669];Frame5:[592, 558, 680, 672].') 2023-11-03 07:42:46.081 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:42:46.081 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Person:[417, 218, 533, 420],[001, 787, 108, 999],[038, 690, 209, 931],[103, 948, 205, 997],[173, 751, 431, 999],[150, 771, 271, 963],[198, 685, 377, 881],[387, 792, 557, 999],[527, 777, 865, 999],[487, 754, 597, 965],[604, 761, 722, 872],[902, 704, 998, 999],[933, 842, 998, 1000],[701, 458, 827, 854];Book:[323, 913, 368, 947],[101, 933, 159, 966];Moniter:[450, 700, 586, 860].') 2023-11-03 07:42:47.206 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:42:47.206 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[197, 644, 457, 869] and track its trajectory.Any trajectory details should be arranged according to the Frame t:[xmin,ymin,xmax,ymax] format for clarity.', 'This is a cheetahFrame1:[197, 644, 457, 869];Frame2:[219, 644, 471, 868];Frame3:[247, 648, 481, 850];Frame4:[270, 644, 503, 845].') 2023-11-03 07:43:03.306 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:43:03.307 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'skyscraper:[196, 540, 227, 613],[336, 496, 382, 616],[383, 499, 416, 613],[431, 381, 465, 607],[497, 512, 536, 576],[543, 471, 574, 603],[570, 486, 605, 594],[581, 507, 605, 586],[623, 501, 645, 605],[653, 479, 690, 567],[696, 477, 714, 554],[720, 486, 742, 592],[755, 479, 776, 592],[841, 490, 887, 590],[866, 501, 891, 598],[890, 490, 925, 592].') 2023-11-03 07:43:35.709 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 07:43:35.709 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Camera,Satchel,Couch and Moniter/TV in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Handbag:[284, 142, 371, 227];Couch:[001, 578, 303, 995];Moniter:[598, 069, 867, 351].') 2023-11-03 07:43:54.302 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:43:54.302 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Pie,Napkin,Spoon,Bakset and Dinning Table in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Pie:[585, 389, 732, 508];Napkin:[002, 285, 381, 599],[921, 336, 999, 452];Spoon:[002, 423, 311, 498];Bakset:[441, 170, 888, 674];Dinning Table:[002, 060, 1000, 999].') 2023-11-03 07:44:02.381 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:44:02.382 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Desk:[515, 636, 999, 829],[626, 581, 948, 657];Person:[077, 403, 251, 1000].') 2023-11-03 07:44:13.514 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:44:13.514 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[563, 067, 956, 995] Paul Linwood signed for Grimsby Town in the summer of 2009, but left the club after just one season following relegation.') 2023-11-03 07:44:20.561 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:44:20.561 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Person:[517, 330, 708, 729],[091, 305, 230, 553];Sneakers:[604, 699, 659, 729],[527, 599, 567, 662].') 2023-11-03 07:44:23.755 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:44:23.755 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'chicken:[505, 131, 728, 300],[035, 185, 168, 325],[043, 732, 185, 880],[068, 000, 208, 092],[073, 565, 215, 671],[116, 416, 218, 496],[128, 850, 203, 897],[218, 070, 355, 186],[218, 300, 381, 498],[226, 687, 361, 805],[285, 915, 393, 998],[411, 262, 495, 302],[420, 893, 498, 927],[431, 113, 518, 176],[440, 743, 523, 807],[478, 530, 548, 596],[506, 767, 726, 921],[586, 560, 770, 678],[658, 408, 775, 496],[755, 880, 891, 998],[768, 312, 891, 501],[773, 085, 878, 203],[778, 675, 885, 780],[941, 201, 998, 337],[953, 715, 998, 840].') 2023-11-03 07:44:44.012 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:44:44.012 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[431, 477, 576, 763] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a alaskan brown bearFrame1:[431, 477, 576, 763];Frame2:[435, 431, 575, 706];Frame3:[435, 451, 576, 713];Frame4:[430, 466, 576, 702].') 2023-11-03 07:44:45.785 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:44:45.785 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'woman:[508, 215, 775, 572];girl:[213, 222, 767, 985];footwear:[423, 909, 601, 999];human face:[036, 329, 103, 385],[383, 306, 532, 418],[590, 268, 748, 398].') 2023-11-03 07:44:52.255 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:44:52.255 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2 and frame3, please tell me what this oneframe:1:[305, 519, 415, 701];frame:2:[305, 519, 415, 701];frame:3:[304, 517, 416, 697] is doing?Briefly articulate your response by taking cues from the provided image frames and trajectory coordinates.', 'This is a/an horse laying down') 2023-11-03 07:45:38.781 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:45:38.782 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Bus:[594, 168, 1000, 680],[000, 265, 608, 554];Person:[000, 295, 194, 1000],[179, 383, 252, 452],[276, 373, 418, 1000],[355, 318, 591, 999],[506, 371, 629, 982],[641, 312, 778, 993],[802, 231, 984, 1000],[981, 457, 1000, 712];Necklace:[882, 343, 908, 394],[562, 454, 598, 498];Flag:[911, 001, 989, 170];Glasses:[426, 334, 482, 367],[082, 345, 138, 384];Other Shoes:[319, 965, 355, 1000],[381, 963, 416, 999];Sandals:[506, 935, 526, 979],[648, 894, 694, 949],[745, 923, 777, 996];Sneakers:[827, 885, 875, 964],[910, 910, 969, 999];Satchel:[749, 594, 786, 689],[162, 796, 200, 941];Hat:[080, 294, 145, 359];Microphone:[450, 429, 483, 545],[846, 430, 881, 560];Lamp:[302, 242, 318, 266].') 2023-11-03 07:45:46.536 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:45:46.537 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[068, 799, 161, 880],carFrame1:[523, 749, 593, 854],carFrame1:[160, 795, 250, 860],carFrame1:[261, 788, 318, 862],carFrame1:[581, 746, 739, 962] in this video clip.When answering, maintain the format classFrame t:[xmin,ymin,xmax,ymax] without deviation.', 'carFrame1:[068, 799, 161, 880];Frame2:[000, 810, 087, 908],carFrame1:[523, 749, 593, 854];Frame2:[528, 748, 621, 886],carFrame1:[160, 795, 250, 860];Frame2:[091, 806, 208, 886],carFrame1:[261, 788, 318, 862];Frame2:[210, 796, 282, 886],carFrame1:[581, 746, 739, 962];Frame2:[641, 715, 997, 995].') 2023-11-03 07:46:00.102 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:46:00.103 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[329, 840, 363, 954] and track its trajectory.To maintain consistency, ensure that trajectories in your answer match the Frame t:[xmin,ymin,xmax,ymax] setup.', 'This is a person headFrame1:[329, 840, 363, 954];Frame2:[323, 873, 353, 952];Frame3:[471, 879, 506, 950];Frame4:[442, 876, 478, 950].') 2023-11-03 07:46:01.222 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:46:01.222 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'man:[776, 206, 996, 987],[000, 358, 366, 999],[030, 000, 136, 190],[266, 000, 411, 283],[275, 269, 589, 999];clothing:[027, 360, 360, 1000],[029, 000, 133, 115],[262, 326, 582, 888],[265, 000, 396, 222],[769, 341, 999, 966];human face:[269, 506, 318, 600],[402, 372, 479, 469],[825, 295, 894, 445].') 2023-11-03 07:46:33.170 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:46:33.170 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'woman:[000, 715, 253, 1000],[405, 637, 665, 999],[421, 447, 487, 580],[735, 639, 999, 999],[898, 455, 966, 646];man:[021, 258, 078, 797],[188, 546, 427, 999],[241, 272, 346, 655],[333, 456, 513, 793],[353, 437, 427, 594],[452, 519, 588, 815],[480, 435, 551, 606],[553, 362, 625, 575],[573, 468, 669, 718],[639, 492, 812, 911],[655, 440, 777, 703],[716, 371, 790, 533],[776, 399, 843, 584],[777, 646, 999, 995],[820, 521, 999, 834],[831, 333, 901, 545],[868, 348, 981, 573],[905, 450, 968, 541];clothing:[026, 332, 073, 737],[188, 555, 999, 999],[248, 352, 340, 645],[356, 487, 425, 593],[484, 491, 551, 603],[557, 414, 628, 566],[718, 422, 785, 537],[781, 443, 839, 545],[835, 407, 894, 526],[888, 407, 972, 545];human face:[817, 490, 840, 556],[898, 560, 920, 648].') 2023-11-03 07:46:44.895 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:46:44.895 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Ship:[136, 334, 414, 384];Boat:[374, 428, 419, 479],[353, 387, 392, 421],[437, 358, 519, 430],[769, 469, 833, 505],[797, 439, 847, 465];Person:[289, 711, 380, 816].') 2023-11-03 07:46:49.003 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 07:46:49.004 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'person:[571, 393, 604, 566],[593, 433, 637, 543].') 2023-11-03 07:46:53.703 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:46:53.703 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'An aerial map of the property: click here to view and/or download a printable high-definition PDF version [001, 012, 336, 990] of the file.') 2023-11-03 07:46:54.485 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:46:54.485 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Obstruction [056, 129, 986, 863] of the bile ducts and blood vessels connected to the liver can cause it to become enlarged.') 2023-11-03 07:47:11.475 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:47:11.475 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'window:[181, 361, 385, 645],[475, 402, 623, 676],[690, 446, 861, 701],[941, 507, 999, 629].') 2023-11-03 07:47:24.941 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:47:24.942 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Hat,Bench,Person,Baseball Bat and Bracelet in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Hat:[100, 194, 178, 242],[554, 162, 633, 205],[739, 175, 812, 214];Bench:[092, 389, 993, 584];Person:[127, 058, 771, 883],[001, 192, 198, 694],[231, 256, 384, 691],[496, 162, 665, 530],[675, 171, 905, 686],[791, 195, 999, 682];Baseball Bat:[646, 056, 723, 379];Bracelet:[630, 299, 665, 330].') 2023-11-03 07:47:26.620 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:47:26.620 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Digital painting of [000, 007, 996, 998] a street in downtown Reykjavik, with colorful houses, shops and bars and in the background the famous Hallgrímskirkja') 2023-11-03 07:47:29.421 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:47:29.421 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'dress:[515, 417, 661, 847],[888, 478, 982, 621];human body:[530, 317, 680, 909],[000, 449, 276, 777],[058, 900, 135, 999],[147, 912, 479, 999],[303, 495, 347, 623],[353, 279, 507, 912],[627, 400, 670, 471],[649, 491, 704, 708],[710, 366, 771, 689],[776, 431, 999, 704],[841, 756, 999, 999];woman:[523, 345, 691, 929],[080, 419, 143, 527],[192, 411, 223, 481],[213, 415, 259, 494],[298, 452, 356, 626],[308, 411, 348, 513],[333, 423, 363, 523],[623, 448, 705, 728],[762, 415, 797, 675],[861, 428, 988, 707];human hair:[553, 335, 616, 490],[126, 685, 308, 899],[378, 310, 425, 383],[926, 595, 999, 777];man:[712, 314, 793, 695],[000, 280, 999, 999];human leg:[350, 633, 412, 921],[411, 621, 491, 890],[721, 507, 751, 681],[745, 517, 771, 692],[860, 584, 920, 684];human arm:[026, 506, 081, 592],[101, 508, 151, 599],[151, 526, 184, 623],[302, 516, 341, 600],[351, 450, 476, 584],[435, 284, 495, 428],[494, 285, 545, 476],[621, 446, 667, 621],[627, 409, 649, 478],[649, 408, 668, 472],[665, 509, 703, 617],[710, 369, 740, 454],[741, 357, 776, 438].') 2023-11-03 07:47:34.340 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:47:34.340 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'On [004, 302, 994, 992] a tabletop surrounded by [068, 002, 674, 352] [758, 005, 995, 660] students, [263, 133, 570, 753] two framework structures made of [303, 523, 345, 720] toothpicks and [466, 136, 509, 189] mini marshmallows jiggle on moving platforms; one structure has collapsed.') 2023-11-03 07:47:43.648 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:47:43.649 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'football:[503, 769, 569, 861];person:[000, 000, 999, 920];plant:[000, 538, 999, 999].') 2023-11-03 07:47:45.890 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:47:45.890 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Chair:[060, 121, 109, 211];Person:[579, 142, 658, 250],[586, 150, 623, 203],[513, 119, 566, 220],[451, 125, 528, 212],[342, 105, 392, 157],[317, 066, 364, 148],[303, 076, 330, 115],[215, 008, 304, 155],[147, 002, 188, 087];SUV:[055, 160, 1000, 895];Street Lights:[758, 044, 811, 221];Pickup Truck:[656, 208, 792, 292];Car:[924, 359, 1000, 451].') 2023-11-03 07:48:17.260 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:48:17.261 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'boy:[516, 135, 772, 1000];woman:[250, 114, 585, 1000],[000, 126, 290, 999],[234, 246, 352, 594],[646, 273, 768, 999],[784, 176, 999, 999];girl:[258, 080, 570, 999],[000, 132, 106, 998],[016, 120, 276, 999],[235, 234, 340, 580],[657, 253, 772, 999],[778, 155, 999, 999];man:[526, 122, 729, 999];clothing:[040, 417, 282, 997],[266, 454, 562, 997],[544, 419, 725, 993],[790, 386, 998, 991];human face:[004, 237, 056, 353],[290, 311, 327, 388],[414, 191, 543, 457],[568, 184, 685, 415],[742, 268, 788, 384].') 2023-11-03 07:48:48.014 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:48:48.015 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Person:[001, 102, 956, 867];Motorcycle:[001, 291, 1000, 989];Sneakers:[001, 739, 097, 865];Hat:[462, 101, 769, 206];Handbag:[871, 592, 998, 827].') 2023-11-03 07:49:06.872 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:49:06.872 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Couch:[001, 625, 027, 773],[018, 606, 097, 743],[108, 597, 153, 709],[265, 600, 328, 720],[321, 578, 368, 639],[571, 565, 606, 617],[600, 607, 649, 657],[668, 602, 736, 736],[820, 601, 870, 679],[879, 612, 972, 769],[970, 638, 1000, 798],[793, 678, 941, 879],[553, 653, 778, 864],[429, 634, 489, 837],[228, 673, 303, 738],[190, 727, 443, 954],[001, 730, 200, 950],[769, 551, 831, 696],[747, 539, 797, 636],[149, 542, 183, 610],[201, 539, 257, 618],[053, 635, 159, 748];Stool:[731, 597, 776, 648],[031, 375, 328, 758],[001, 617, 1000, 966];Pillow:[545, 599, 676, 678],[053, 733, 105, 771],[094, 738, 153, 775],[153, 736, 206, 793],[229, 728, 307, 788],[299, 734, 370, 774],[349, 729, 412, 779],[591, 655, 629, 687],[625, 688, 692, 736];Flower:[574, 488, 676, 588];Potted Plant:[039, 083, 189, 214],[061, 322, 196, 488],[378, 441, 570, 721],[675, 353, 749, 421],[675, 462, 739, 552],[576, 494, 653, 611],[835, 387, 1000, 928],[680, 205, 728, 299];Lamp:[563, 477, 621, 586];Desk:[149, 612, 185, 646],[190, 597, 235, 648],[312, 578, 467, 651].') 2023-11-03 07:49:25.149 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:49:25.150 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'giraffe:[403, 208, 650, 750];tree:[000, 132, 999, 560],[341, 361, 517, 625],[645, 285, 832, 554].') 2023-11-03 07:50:04.436 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:50:04.436 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'a bold Christmas wedding bouquet of red and white blooms [133, 174, 930, 916], leaves and some rhinestones [486, 326, 557, 390] [678, 497, 744, 558] [432, 555, 503, 621] to sparkle') 2023-11-03 07:50:06.986 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:50:06.986 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[026, 425, 348, 795],personFrame1:[650, 465, 685, 603],personFrame1:[687, 469, 725, 596] in this video clip.Please strictly follow the format categoryFrame t:[xmin,ymin,xmax,ymax] to answer the question.', 'carFrame1:[026, 425, 348, 795];Frame2:[228, 460, 398, 683],personFrame1:[650, 465, 685, 603];Frame2:[651, 461, 687, 615],personFrame1:[687, 469, 725, 596];Frame2:[696, 472, 729, 600].') 2023-11-03 07:50:37.337 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:50:37.337 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'an IKEA Vittsjo shelving unit turned into a chic home bar with copper spray paint, [001, 010, 888, 944] the piece features much storage space') 2023-11-03 07:51:07.659 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:51:07.659 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Clock:[478, 001, 619, 073];Person:[359, 098, 527, 280],[502, 089, 757, 292],[839, 081, 1000, 425],[767, 141, 1000, 416],[840, 326, 998, 669];Hat:[591, 088, 701, 142];Refrigerator:[002, 097, 191, 425];Knife:[770, 714, 914, 909];chopping Board:[002, 571, 286, 724],[595, 534, 986, 684],[704, 700, 1000, 916],[002, 491, 228, 578],[720, 197, 849, 306];Bowl:[002, 842, 219, 999],[002, 757, 093, 848],[298, 580, 451, 655],[453, 600, 591, 651],[543, 368, 690, 421],[494, 542, 619, 615];Plate:[109, 652, 733, 942],[193, 494, 344, 570],[339, 478, 605, 584];Pizza:[004, 603, 257, 705];Desk:[270, 278, 734, 367].') 2023-11-03 07:51:23.057 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:51:23.057 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Noriaki Iwashima [142, 165, 643, 996] gestures as he lies in a coffin [076, 073, 742, 998] to try it out during an end-of-life seminar held by Japan's largest retailer Aeon Co in Tokyo October 24, 2014. (Photo by Toru Hanai/Reuters") 2023-11-03 07:52:06.940 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:52:06.940 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'James Patrick Bulger Twitter profile is reported by members [061, 744, 150, 802] [060, 454, 150, 511] [061, 586, 149, 643] [060, 349, 150, 406] of the public [035, 940, 154, 998] who are concerned by the claims made by the owner of the account,.') 2023-11-03 07:52:24.544 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:52:24.544 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'plant:[000, 370, 998, 998].') 2023-11-03 07:52:31.757 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:52:31.758 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'man:[610, 331, 718, 623],[000, 583, 051, 666],[000, 538, 063, 617],[003, 678, 108, 805],[033, 424, 066, 495],[051, 515, 101, 578],[058, 581, 116, 694],[061, 485, 101, 548],[070, 431, 099, 504],[113, 702, 250, 827],[116, 571, 205, 694],[157, 444, 191, 515],[169, 611, 313, 730],[191, 446, 210, 511],[210, 450, 260, 560],[229, 463, 304, 641],[277, 494, 403, 668],[289, 438, 362, 526],[292, 611, 393, 721],[315, 471, 366, 546],[331, 453, 438, 675],[400, 460, 458, 660],[441, 511, 557, 815],[499, 405, 562, 444],[644, 704, 769, 820],[716, 439, 851, 751],[767, 315, 830, 378],[776, 487, 863, 769],[790, 704, 999, 820],[844, 471, 923, 767],[870, 376, 940, 450],[893, 460, 989, 600],[899, 301, 926, 383],[905, 552, 999, 777],[928, 301, 970, 383];clothing:[243, 628, 427, 875],[000, 720, 090, 800],[051, 631, 113, 695],[114, 753, 229, 824],[116, 619, 178, 698],[176, 653, 290, 713],[216, 490, 285, 619],[285, 530, 353, 643],[364, 508, 408, 602],[415, 484, 476, 578],[441, 535, 550, 710],[507, 716, 536, 812],[623, 381, 688, 462],[630, 483, 691, 599],[637, 751, 789, 817],[731, 495, 837, 568],[746, 568, 779, 672],[779, 542, 863, 710],[793, 747, 999, 823],[866, 515, 914, 667],[870, 665, 922, 754],[871, 411, 946, 440],[902, 501, 985, 585],[921, 592, 999, 758];furniture:[000, 775, 279, 894],[000, 646, 440, 891],[065, 513, 500, 893],[486, 458, 646, 609],[588, 812, 999, 903].') 2023-11-03 07:52:55.503 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:52:55.503 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[290, 518, 427, 855] A cow sitting on [006, 512, 995, 993] the beach looking wistfully at [007, 431, 995, 655] the ocean next to [029, 333, 211, 619] a colored umbrella.') 2023-11-03 07:53:07.333 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:53:07.334 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'flower:[604, 512, 765, 605],[002, 539, 144, 623],[120, 506, 574, 669],[516, 609, 667, 693];tree:[000, 626, 130, 999],[000, 001, 998, 786].') 2023-11-03 07:53:19.263 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:53:19.264 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'person:[394, 214, 482, 536].') 2023-11-03 07:53:21.150 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:53:21.151 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me the trajectory of the cars in left in this video clip?Stick to the TrackiFrame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'There are 2 cars in left.Track1frame1:[000, 610, 083, 1000],Track2frame3:[000, 584, 156, 1000].') 2023-11-03 07:53:23.960 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:53:23.960 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the the fish in movement. in this video clip?The TrackiFrame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'There are 6 the fish in movement..Track1frame:1:[464, 380, 634, 981];frame:2:[514, 343, 649, 998];frame:3:[541, 315, 659, 974],Track2frame:1:[180, 411, 501, 975];frame:2:[152, 368, 448, 903];frame:3:[127, 334, 451, 884],Track3frame:1:[260, 236, 438, 766];frame:2:[264, 244, 449, 792];frame:3:[255, 242, 468, 784],Track4frame:1:[287, 095, 382, 364];frame:2:[272, 106, 423, 317];frame:3:[232, 182, 391, 382],Track5frame:1:[230, 000, 521, 291];frame:2:[235, 000, 566, 228];frame:3:[266, 000, 631, 214],Track6frame:1:[354, 153, 537, 655];frame:2:[328, 133, 519, 658];frame:3:[360, 148, 480, 614].') 2023-11-03 07:53:27.410 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 07:53:27.411 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "It's not by power nor by might - Groupe Psaumes 150 | MFCI Church Culte du 02 Avril 2017") 2023-11-03 07:53:48.562 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:53:48.563 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Pillow:[132, 002, 363, 369];Bed:[083, 002, 998, 854];Bottle:[191, 367, 350, 490],[652, 578, 863, 766];Person:[002, 002, 999, 998],[317, 252, 629, 998],[633, 117, 999, 849].') 2023-11-03 07:53:55.971 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:53:55.972 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Matt Stairs [091, 029, 928, 998] celebrates 2-run, eighth-inning homer that gave the Phillies the lead en route to a 7-5 win over the Dodgers in Game 4 of the 2008 NLCS.') 2023-11-03 07:53:57.627 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:53:57.628 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect clothing,human face and man in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'clothing:[000, 343, 998, 999];human face:[001, 378, 026, 602],[063, 306, 111, 400],[090, 459, 127, 571],[218, 281, 270, 393],[305, 308, 343, 409],[380, 302, 417, 385],[454, 302, 489, 378],[638, 223, 706, 354],[775, 317, 883, 654],[881, 184, 932, 295],[956, 339, 999, 555];man:[277, 236, 328, 300],[480, 275, 595, 875],[537, 171, 710, 991],[542, 241, 999, 999],[809, 300, 999, 999],[831, 169, 930, 403].') 2023-11-03 07:54:12.383 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:54:12.384 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'human hair:[565, 413, 927, 869],[000, 340, 352, 635],[661, 121, 938, 251];human head:[592, 402, 904, 719],[000, 117, 998, 619];mammal:[350, 360, 975, 977],[000, 204, 406, 587],[000, 337, 474, 999],[000, 120, 641, 754],[578, 271, 682, 414],[641, 309, 778, 425],[651, 125, 998, 766];man:[024, 181, 609, 1000],[000, 251, 042, 514],[036, 200, 187, 371],[192, 233, 294, 340],[590, 269, 682, 373],[636, 321, 748, 426],[647, 124, 998, 747],[925, 293, 998, 409];glasses:[698, 240, 953, 310];human nose:[454, 266, 556, 333],[674, 584, 748, 634],[769, 279, 850, 334];human eye:[403, 264, 482, 292],[641, 559, 714, 590],[750, 585, 819, 609];sunglasses:[117, 344, 347, 461],[692, 430, 913, 507],[692, 244, 938, 332];human mouth:[647, 633, 758, 682];human body:[000, 399, 683, 747],[047, 600, 425, 999],[425, 645, 980, 999];woman:[000, 330, 478, 999],[409, 430, 958, 999];human face:[000, 244, 998, 417],[172, 380, 441, 602],[193, 247, 251, 318],[378, 197, 606, 423],[611, 504, 859, 718],[646, 339, 709, 416],[695, 203, 935, 430],[940, 323, 998, 404].') 2023-11-03 07:54:28.395 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:54:28.395 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Lamp:[172, 001, 395, 207],[431, 001, 629, 174],[547, 177, 638, 291],[672, 166, 748, 283];Mirror:[383, 153, 818, 490];Stool:[051, 773, 173, 999],[167, 790, 292, 999],[330, 830, 455, 1000],[488, 837, 614, 999];shelf:[815, 225, 894, 483];Blackboard:[018, 129, 210, 386],[620, 283, 820, 412];Cup:[554, 500, 605, 565],[662, 506, 684, 564];Bottle:[604, 500, 638, 564],[760, 615, 802, 688];Microwave:[170, 480, 240, 551];Coffee Machine:[402, 426, 480, 559],[708, 413, 793, 574].') 2023-11-03 07:54:45.408 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:54:45.408 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'canary:[513, 708, 873, 939],[164, 233, 319, 451],[226, 653, 540, 868],[348, 048, 639, 238],[534, 393, 712, 663],[715, 106, 966, 342],[789, 378, 998, 665];sparrow:[238, 656, 548, 872],[161, 241, 316, 446],[341, 055, 624, 235],[522, 715, 844, 928],[528, 395, 709, 658],[720, 112, 957, 343],[797, 381, 991, 665].') 2023-11-03 07:54:51.200 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:54:51.200 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'David Allan [231, 184, 675, 891] : William inglis, surgeon and captain of the honourable company of edinburgh golfers') 2023-11-03 07:55:04.196 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:55:04.196 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'dress:[243, 270, 510, 976];woman:[234, 118, 490, 999],[446, 124, 745, 999].') 2023-11-03 07:55:05.181 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:55:05.182 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[554, 854, 629, 968] and track its trajectory.Ensure that each trajectory within your response is articulated using the Frame t:[xmin,ymin,xmax,ymax] specification.', 'This is a eyeFrame1:[554, 854, 629, 968];Frame2:[492, 741, 598, 887];Frame3:[505, 843, 604, 961];Frame4:[497, 819, 596, 938];Frame5:[496, 804, 587, 941].') 2023-11-03 07:55:49.448 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:55:49.448 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Carl's Jr. Breakfast Burger with seared beef patty, crispy hash browns, scrambled eggs, cheese and glorious bacon. [007, 009, 996, 996] All the flavor, no drive through.") 2023-11-03 07:56:14.402 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:56:14.402 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A group of [374, 184, 527, 946] [491, 179, 626, 979] [088, 174, 257, 876] [217, 187, 386, 896] [590, 216, 755, 874] [709, 176, 866, 845] [001, 185, 153, 826] [834, 218, 998, 968] ladies post-eye surgery') 2023-11-03 07:56:52.376 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:56:52.376 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[115, 501, 318, 998] [588, 184, 924, 996] The band actually performed in the Digital Ferret record store in Philadelphia, 2009') 2023-11-03 07:57:31.107 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:57:31.107 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[356, 427, 521, 1000] and track its trajectory.When detailing trajectories in your response, adhere to the Frame t:[xmin,ymin,xmax,ymax] format.', 'This is a man in black playing basketballFrame1:[356, 427, 521, 1000];Frame2:[467, 290, 600, 955];Frame3:[430, 266, 549, 873];Frame4:[403, 331, 529, 905].') 2023-11-03 07:57:36.125 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:57:36.126 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A relic, is places in front of the framed photo of Mother Teresa of Calcutta, before the start of a mass celebrating the canonization of the sister, at 825 N. Carpenter St., in Chicago, on Sunday Sep. 4, 2016.') 2023-11-03 07:58:03.705 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:58:03.706 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[318, 208, 948, 624];Couch:[155, 128, 1000, 990];Pillow:[300, 070, 633, 300];Chair:[123, 127, 269, 431];Frame:[153, 002, 238, 062],[253, 002, 325, 042],[083, 035, 134, 095];Flower:[205, 065, 290, 143];Dinning Table:[056, 205, 316, 496];Cabinet:[002, 002, 063, 448].') 2023-11-03 07:59:06.720 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:59:06.721 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[359, 450, 456, 600],carFrame1:[562, 477, 698, 569],truckFrame1:[741, 470, 795, 544] in this video clip.Keep your response consistent with the classFrame t:[xmin,ymin,xmax,ymax] format.', 'carFrame1:[359, 450, 456, 600];Frame2:[353, 467, 453, 617],carFrame1:[562, 477, 698, 569];Frame2:[601, 493, 783, 610],truckFrame1:[741, 470, 795, 544];Frame2:[792, 484, 847, 570].') 2023-11-03 07:59:32.227 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 07:59:32.227 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Person:[332, 313, 813, 1000];Book:[821, 516, 1000, 558],[893, 525, 999, 573];Telephone:[807, 447, 919, 504];Laptop:[550, 376, 704, 490];Bottle:[043, 754, 088, 799];shelf:[002, 214, 297, 441],[002, 526, 212, 781],[882, 658, 1000, 1000],[255, 123, 850, 292];Desk:[002, 430, 1000, 710];Chair:[287, 416, 701, 884];Backpack:[030, 656, 277, 873];Satchel:[207, 641, 379, 735];Sneakers:[672, 776, 733, 844],[565, 911, 738, 999].') 2023-11-03 07:59:50.300 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:59:50.301 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'nightstand:[844, 423, 978, 723],[393, 357, 470, 414];bed:[096, 242, 845, 961];drawer:[823, 426, 975, 705];table:[392, 345, 487, 423],[825, 443, 994, 709];chest of drawers:[390, 360, 468, 420],[844, 443, 989, 707];window:[000, 000, 182, 426].') 2023-11-03 07:59:59.112 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 07:59:59.112 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[001, 567, 116, 1000],[117, 614, 218, 780],[315, 643, 361, 827],[384, 656, 447, 810],[510, 624, 566, 776],[644, 588, 769, 1000],[781, 599, 832, 1000],[777, 594, 956, 1000],[972, 597, 999, 771];Bottle:[193, 810, 216, 931],[165, 816, 189, 937];Cup:[626, 404, 657, 461];Plate:[869, 245, 901, 295];Refrigerator:[237, 525, 336, 767];Pear:[052, 087, 093, 181],[086, 111, 116, 189];Onion:[139, 132, 206, 210].') 2023-11-03 08:00:27.375 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:00:27.375 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[190, 218, 871, 818] a strange white clouds traces on [001, 000, 997, 954] blue sky poster') 2023-11-03 08:00:28.223 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:00:28.224 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[230, 015, 790, 995] portrait of an indian man smiling in the school library - malaysia stock videos & royalty-free footage') 2023-11-03 08:00:59.245 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:00:59.245 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[319, 730, 345, 792] Landmannalaugar in Iceland has some of the most incredible views on [004, 113, 995, 996] the island.') 2023-11-03 08:01:05.879 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:01:05.880 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect person,chair,table and clothing in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'person:[020, 415, 998, 997],[030, 425, 116, 572],[071, 495, 148, 637],[091, 455, 265, 957],[211, 457, 285, 700],[285, 495, 351, 598],[315, 453, 348, 521],[350, 560, 431, 633],[450, 595, 595, 934],[498, 425, 998, 978],[531, 703, 688, 997];chair:[278, 799, 456, 997],[000, 565, 078, 644],[000, 635, 125, 836],[278, 623, 355, 719],[286, 570, 340, 640],[360, 628, 445, 710],[413, 567, 453, 644],[466, 747, 588, 990],[480, 570, 553, 644],[501, 614, 560, 696],[606, 551, 666, 616],[626, 588, 711, 651],[628, 721, 718, 939],[708, 588, 780, 630],[763, 829, 985, 997],[960, 658, 998, 775];table:[000, 724, 073, 990],[246, 635, 760, 962],[283, 492, 503, 572],[526, 485, 620, 551];clothing:[038, 434, 108, 570],[068, 492, 146, 764],[100, 497, 250, 925],[235, 469, 296, 691],[308, 551, 361, 637],[360, 584, 438, 672],[481, 647, 598, 862],[530, 794, 698, 997],[556, 572, 635, 689],[666, 502, 696, 581],[731, 600, 848, 936],[768, 469, 795, 542],[775, 778, 918, 992],[801, 464, 856, 563],[823, 595, 871, 719],[890, 507, 970, 859].') 2023-11-03 08:01:30.385 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:01:30.386 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[563, 544, 607, 637],carFrame1:[192, 531, 309, 676],carFrame1:[551, 538, 594, 616],carFrame1:[538, 531, 586, 599],carFrame1:[578, 545, 675, 676] in this video clip.Do not deviate from the established classFrame t:[xmin,ymin,xmax,ymax] format in your answer.', 'carFrame1:[563, 544, 607, 637];Frame2:[577, 480, 642, 586],carFrame1:[192, 531, 309, 676];Frame2:[102, 459, 265, 647],carFrame1:[551, 538, 594, 616];Frame2:[556, 469, 616, 554],carFrame1:[538, 531, 586, 599];Frame2:[547, 455, 601, 538],carFrame1:[578, 545, 675, 676];Frame2:[599, 480, 755, 659].') 2023-11-03 08:04:01.311 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:04:01.311 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[471, 277, 571, 900],personFrame1:[378, 397, 479, 850],personFrame1:[310, 283, 479, 894],personFrame1:[216, 340, 359, 925],personFrame1:[511, 333, 634, 936] in this video clip.Keep your response consistent with the classFrame t:[xmin,ymin,xmax,ymax] format.', 'personFrame1:[471, 277, 571, 900];Frame2:[473, 261, 555, 897],personFrame1:[378, 397, 479, 850];Frame2:[326, 581, 487, 851],personFrame1:[310, 283, 479, 894];Frame2:[253, 152, 489, 893],personFrame1:[216, 340, 359, 925];Frame2:[113, 576, 278, 940],personFrame1:[511, 333, 634, 936];Frame2:[412, 530, 596, 940].') 2023-11-03 08:04:06.591 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:04:06.591 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'man:[048, 250, 293, 889],[203, 338, 357, 999],[223, 261, 534, 999],[513, 283, 867, 999];drink:[000, 365, 105, 417],[000, 415, 105, 477],[455, 358, 565, 417],[464, 422, 535, 477],[518, 652, 555, 739];woman:[103, 173, 150, 286],[358, 165, 399, 286],[526, 453, 574, 602],[678, 308, 911, 999],[783, 416, 873, 550],[893, 396, 999, 999];clothing:[033, 404, 999, 999];human face:[154, 284, 244, 450],[280, 363, 350, 488],[371, 290, 432, 424],[560, 305, 625, 442],[711, 334, 793, 481],[960, 406, 999, 483].') 2023-11-03 08:04:40.931 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:04:40.931 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'broken appliances [276, 493, 430, 775] [591, 406, 759, 797] [420, 442, 573, 789] [104, 486, 257, 777] on the curb [001, 697, 866, 860] for bulk pickup') 2023-11-03 08:04:46.664 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 08:04:46.664 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Helmet:[358, 144, 503, 331],[489, 182, 583, 335];Boat:[016, 396, 999, 883].') 2023-11-03 08:04:55.317 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:04:55.317 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'duck:[181, 420, 613, 809];goose:[350, 421, 607, 821].') 2023-11-03 08:05:27.415 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:05:27.415 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[000, 316, 578, 788] and track its trajectory.If you're including trajectory details in your reply, the Frame t:[xmin,ymin,xmax,ymax] format is imperative.", 'This is a tankFrame1:[000, 316, 578, 788];Frame2:[086, 320, 586, 689];Frame3:[111, 325, 594, 681];Frame4:[111, 341, 575, 675].') 2023-11-03 08:05:53.464 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:05:53.464 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Person:[254, 470, 378, 734],[032, 450, 225, 1000],[049, 491, 114, 617].') 2023-11-03 08:06:28.021 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:06:28.021 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'members [689, 246, 922, 794] [534, 389, 733, 695] [054, 050, 234, 998] [231, 257, 388, 974] queue up to sign the declaration [365, 576, 719, 817]') 2023-11-03 08:06:42.222 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:06:42.223 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Person:[060, 077, 933, 1000],[198, 082, 970, 940];Car:[001, 000, 1000, 1000].') 2023-11-03 08:06:57.284 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:06:57.284 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Bus:[000, 084, 894, 752];Stop Sign:[579, 343, 614, 433];Car:[865, 424, 941, 517];Street Lights:[953, 131, 985, 473].') 2023-11-03 08:07:00.025 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:07:00.025 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'human arm:[000, 036, 059, 200],[025, 089, 148, 198],[092, 308, 168, 417],[148, 061, 287, 219],[188, 131, 251, 321],[200, 328, 354, 643],[212, 126, 359, 316],[338, 110, 394, 212],[366, 118, 492, 208],[418, 028, 462, 130],[433, 000, 560, 048],[490, 253, 587, 301],[512, 303, 651, 615],[533, 000, 592, 089],[607, 073, 666, 164],[625, 000, 679, 075],[625, 147, 685, 313],[679, 156, 794, 315],[718, 157, 833, 333],[755, 056, 808, 157],[782, 000, 888, 092],[824, 149, 990, 348],[972, 200, 999, 306],[984, 058, 999, 140];sports uniform:[000, 279, 164, 633],[225, 280, 571, 953];tennis racket:[544, 540, 747, 686].') 2023-11-03 08:07:03.158 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:07:03.158 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'hat:[315, 264, 433, 385];man:[560, 049, 999, 999];tree:[000, 000, 999, 325];human face:[000, 455, 133, 720],[114, 498, 173, 653],[339, 346, 426, 469],[620, 120, 781, 408].') 2023-11-03 08:07:15.006 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:07:15.007 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect mobile phone and human hand in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'mobile phone:[676, 371, 843, 634],[082, 309, 201, 475];human hand:[076, 387, 251, 620],[229, 300, 285, 378],[376, 548, 429, 621],[693, 504, 853, 751].') 2023-11-03 08:07:38.004 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 08:07:38.004 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Handbag/Satchel in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Satchel:[711, 745, 745, 964].') 2023-11-03 08:07:40.626 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:07:40.627 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the grey cat drop from the sofa?When you incorporate trajectories into your answer, adhere to the TrackiFrame t:[xmin,ymin,xmax,ymax] format.,Present the trajectories in your response using the designated TrackiFrame t:[xmin,ymin,xmax,ymax] layout.', 'There is one grey cat drop from the sofa.Track1frame:1:[519, 000, 694, 712];frame:2:[501, 054, 704, 711];frame:3:[529, 056, 693, 775].') 2023-11-03 08:07:46.407 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 08:07:46.408 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'weapon:[006, 064, 914, 954];vehicle:[011, 029, 989, 937].') 2023-11-03 08:07:47.932 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:07:47.932 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[552, 575, 672, 881] and track its trajectory.All trajectories in your reply should conform to the Frame t:[xmin,ymin,xmax,ymax] pattern.', 'This is a mealwormFrame1:[552, 575, 672, 881];Frame2:[539, 585, 668, 895];Frame3:[522, 627, 665, 914];Frame4:[501, 637, 656, 897].') 2023-11-03 08:07:59.710 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:07:59.710 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[421, 285, 722, 998] [097, 250, 561, 998] A couple flash [485, 448, 559, 630] the victory sign in front of [196, 204, 840, 749] a Moroccan flag.') 2023-11-03 08:08:04.207 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:08:04.208 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Lemon:[807, 034, 947, 230];Cup:[172, 134, 851, 836];Plate:[106, 002, 545, 146].') 2023-11-03 08:08:23.052 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:08:23.053 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'tree:[008, 000, 112, 344],[120, 000, 170, 359],[196, 000, 245, 389],[246, 000, 307, 363],[320, 000, 415, 352],[399, 000, 483, 359],[508, 000, 635, 328],[625, 000, 725, 334],[698, 000, 788, 336],[781, 000, 944, 359],[900, 000, 999, 322].') 2023-11-03 08:08:31.198 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:08:31.199 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[453, 418, 496, 633] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a personFrame1:[453, 418, 496, 633];Frame2:[448, 421, 488, 636];Frame3:[450, 418, 487, 633];Frame4:[461, 412, 501, 628];Frame5:[473, 407, 511, 615].') 2023-11-03 08:08:47.392 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 08:08:47.392 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Fishing Rod:[579, 607, 671, 699],[694, 429, 764, 697];Person:[437, 617, 485, 799],[548, 603, 597, 801],[756, 607, 798, 805],[888, 695, 943, 809].') 2023-11-03 08:09:06.949 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:09:06.949 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[597, 461, 650, 622],personFrame1:[479, 526, 549, 716],personFrame1:[342, 404, 382, 573],personFrame1:[456, 480, 489, 662],personFrame1:[676, 527, 723, 729],personFrame1:[117, 515, 154, 693],personFrame1:[424, 437, 475, 598] in this video clip.Use only the categoryFrame t:[xmin,ymin,xmax,ymax] format to structure your answer.', 'personFrame1:[597, 461, 650, 622];Frame2:[588, 487, 653, 647],personFrame1:[479, 526, 549, 716];Frame2:[542, 544, 583, 737],personFrame1[342, 404, 382, 573];Frame2:No object,personFrame1:[456, 480, 489, 662];Frame2:[474, 509, 518, 697],personFrame1:[676, 527, 723, 729];Frame2:[677, 551, 731, 755],personFrame1:[117, 515, 154, 693];Frame2:[134, 512, 170, 690],personFrame1:[424, 437, 475, 598];Frame2:[453, 445, 517, 612].') 2023-11-03 08:09:09.344 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:09:09.344 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the the lizard that is fleeing rapidly.?Your trajectory details should strictly align with the TrackiFrame t:[xmin,ymin,xmax,ymax] format in your submission.', 'There is one the lizard that is fleeing rapidly..Track1frame:1:[001, 279, 591, 585];frame:2:[001, 313, 751, 623];frame:3:[001, 355, 580, 675].') 2023-11-03 08:09:48.849 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:09:48.849 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[539, 437, 767, 700] and track its trajectory.Your response should highlight trajectories using the established Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a antelopeFrame1:[539, 437, 767, 700];Frame2:[490, 643, 723, 954];Frame3:[410, 664, 620, 947];Frame4:[309, 647, 500, 937].') 2023-11-03 08:10:23.010 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:10:23.010 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'desk:[177, 640, 928, 998];office building:[000, 000, 999, 998];corded phone:[196, 665, 323, 763],[772, 660, 934, 927].') 2023-11-03 08:10:23.894 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:10:23.894 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[356, 128, 522, 965] A veteran and [601, 420, 750, 840] [001, 007, 160, 991] [638, 428, 951, 995] [210, 096, 350, 994] his family received a Christmas surprise this evening.') 2023-11-03 08:10:35.994 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:10:35.994 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Lamp:[061, 001, 167, 048],[147, 043, 252, 131],[213, 104, 302, 139];Frame:[211, 109, 346, 261];Chair:[185, 457, 323, 739],[285, 427, 355, 572];Clock:[755, 088, 830, 144];Person:[897, 220, 999, 374],[895, 362, 999, 469],[040, 087, 967, 1000];Necklace:[464, 314, 510, 355];Hat:[404, 082, 582, 196];Bottle:[777, 303, 815, 340].') 2023-11-03 08:10:56.801 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:10:56.802 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Trash bin Can:[285, 669, 307, 715],[346, 670, 367, 709];Van:[654, 629, 865, 775],[665, 617, 767, 686];Car:[766, 644, 906, 748],[884, 637, 1000, 722];Scooter:[384, 682, 452, 756],[319, 685, 395, 768],[042, 667, 086, 757];bucket:[285, 669, 307, 714],[346, 669, 369, 709];Chair:[747, 658, 762, 684],[761, 657, 776, 683];Storage box:[327, 672, 344, 698];Stool:[106, 702, 126, 726];Person:[290, 627, 332, 681],[170, 641, 190, 684],[193, 339, 222, 393],[436, 549, 453, 606].') 2023-11-03 08:11:07.987 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:11:07.988 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[310, 601, 410, 883] and track its trajectory.Use the specified Frame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'This is a transparent cup being placed on the deskFrame1:[310, 601, 410, 883];Frame2:[310, 601, 410, 883];Frame3:[310, 601, 410, 883];Frame4:[310, 601, 410, 883].') 2023-11-03 08:11:26.013 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:11:26.014 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Lamp:[028, 001, 067, 053];shelf:[001, 665, 059, 998];Person:[318, 421, 544, 745],[083, 435, 204, 925];Barrel:[270, 571, 313, 662],[194, 784, 217, 900],[287, 592, 331, 670],[027, 873, 067, 975];Sneakers:[121, 848, 171, 889],[088, 873, 114, 922];Pot:[587, 745, 656, 834],[502, 656, 547, 758];Cup:[035, 878, 068, 972].') 2023-11-03 08:11:59.312 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:11:59.312 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Chair:[556, 728, 599, 851],[397, 727, 437, 842],[708, 767, 775, 938],[110, 814, 209, 1000],[141, 807, 232, 999],[227, 766, 302, 932],[778, 828, 863, 998],[800, 816, 914, 1000];Desk:[450, 735, 543, 861],[726, 779, 865, 986],[140, 778, 276, 980],[001, 834, 172, 999],[844, 820, 998, 997];Person:[001, 365, 033, 627],[064, 421, 105, 632],[152, 463, 181, 626],[817, 465, 845, 627],[892, 425, 937, 623];Lantern:[172, 096, 330, 448],[682, 117, 830, 450],[462, 462, 534, 584];Picture:[001, 320, 056, 720],[044, 354, 122, 711],[141, 418, 195, 698],[188, 449, 224, 690],[237, 483, 266, 680],[731, 477, 762, 683],[771, 446, 810, 689],[805, 422, 858, 695],[876, 365, 961, 708],[946, 324, 1000, 710].') 2023-11-03 08:12:14.561 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:12:14.562 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, can you tell me what thisframe:1:[610, 362, 853, 823];frame:2:[350, 559, 648, 817];frame:3:[399, 595, 638, 938] is?Ensure your answer is to the point, taking into account the provided image frames and trajectory coordinates.', 'This is a/an the cat that initiated the attack.') 2023-11-03 08:13:14.718 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:13:14.719 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'clothing:[861, 388, 987, 754],[000, 408, 038, 685],[228, 389, 546, 499],[590, 402, 605, 465],[752, 461, 801, 583],[860, 437, 906, 537];person:[000, 370, 045, 707],[221, 382, 679, 502],[753, 406, 813, 699],[856, 402, 908, 534],[866, 402, 990, 788];vehicle:[125, 423, 179, 485],[342, 431, 381, 495];building:[000, 000, 082, 476],[076, 017, 317, 416],[316, 235, 357, 400],[398, 010, 705, 413],[698, 000, 999, 515].') 2023-11-03 08:13:15.474 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:13:15.475 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Onion,Chair,shelf and Keyboard in this image.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Chair:[309, 605, 443, 998],[285, 514, 377, 822],[256, 478, 363, 715],[521, 562, 749, 999],[482, 415, 574, 645],[847, 562, 1000, 993],[685, 492, 839, 833],[657, 467, 796, 774],[714, 433, 788, 595];shelf:[348, 471, 460, 688];Keyboard:[155, 723, 220, 827].') 2023-11-03 08:13:20.734 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:13:20.734 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'man:[548, 571, 615, 819],[025, 783, 124, 970],[085, 570, 141, 769],[163, 592, 227, 920],[319, 733, 377, 898],[379, 738, 434, 881],[383, 554, 419, 703],[442, 848, 536, 998],[444, 576, 510, 921],[532, 817, 668, 998],[589, 768, 707, 998],[687, 739, 771, 976],[720, 541, 792, 812],[789, 561, 834, 794],[809, 726, 907, 929],[895, 566, 999, 998];girl:[338, 906, 414, 998],[418, 557, 440, 647];clothing:[000, 838, 030, 964],[000, 562, 049, 677],[011, 595, 044, 680],[037, 822, 116, 955],[052, 614, 082, 743],[087, 597, 135, 752],[130, 580, 158, 700],[131, 527, 222, 595],[163, 639, 222, 897],[291, 561, 318, 641],[302, 585, 322, 661],[321, 772, 384, 884],[328, 527, 345, 592],[386, 563, 419, 662],[389, 763, 426, 858],[404, 647, 453, 888],[408, 523, 538, 592],[420, 576, 446, 644],[430, 914, 534, 998],[438, 626, 503, 897],[491, 598, 551, 768],[500, 695, 568, 816],[550, 861, 666, 998],[563, 588, 597, 759],[581, 536, 718, 625],[595, 559, 616, 647],[611, 570, 627, 641],[613, 824, 706, 998],[640, 585, 663, 657],[656, 580, 679, 657],[675, 591, 690, 658],[683, 803, 761, 976],[688, 597, 719, 763],[726, 587, 788, 764],[770, 821, 822, 937],[787, 606, 827, 799],[791, 540, 842, 606],[821, 783, 895, 917],[865, 608, 913, 772],[891, 647, 999, 998];human face:[000, 490, 999, 997].') 2023-11-03 08:13:23.617 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:13:23.617 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[449, 270, 536, 731] and track its trajectory.When detailing trajectories in your response, adhere to the Frame t:[xmin,ymin,xmax,ymax] format.', 'This is a personFrame1:[449, 270, 536, 731];Frame2:[442, 300, 539, 747];Frame3:[441, 300, 540, 793];Frame4:[418, 310, 512, 772];Frame5:[421, 297, 509, 752].') 2023-11-03 08:13:53.188 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:13:53.189 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'squirrel:[373, 408, 861, 707].') 2023-11-03 08:14:32.183 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:14:32.184 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'duck:[128, 870, 208, 960],[000, 668, 048, 740],[000, 726, 052, 819],[069, 692, 133, 796],[097, 747, 166, 870],[112, 479, 185, 537],[126, 657, 229, 731],[131, 631, 211, 685],[199, 548, 291, 597],[270, 710, 343, 777],[272, 539, 336, 601],[281, 766, 354, 870],[291, 645, 381, 733],[298, 787, 380, 835],[324, 590, 434, 657],[326, 504, 355, 571],[355, 506, 439, 587],[394, 620, 440, 696],[453, 659, 513, 761],[456, 486, 517, 553],[512, 625, 552, 708],[546, 500, 631, 550];person:[791, 094, 998, 581];tree:[000, 000, 998, 212].') 2023-11-03 08:15:04.613 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:15:04.613 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The only use of knowledge of the past is to equip us for the present [170, 254, 827, 645] Picture Quote #1') 2023-11-03 08:15:27.367 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:15:27.368 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Pizza:[468, 703, 591, 785];Knife:[445, 789, 593, 885];Fork:[561, 718, 683, 773],[537, 682, 665, 719],[646, 583, 715, 632],[511, 599, 643, 646],[415, 456, 496, 519];Plate:[461, 699, 636, 814],[655, 544, 764, 601],[570, 567, 715, 658],[475, 598, 569, 653],[430, 515, 531, 562],[194, 617, 293, 700];Cup:[529, 559, 587, 699],[553, 449, 592, 544],[282, 610, 348, 774];Bottle:[538, 514, 561, 596],[530, 496, 548, 570];Basin:[316, 577, 473, 696],[318, 193, 342, 214];Dinning Table:[082, 485, 764, 1000];Napkin:[509, 657, 600, 707],[369, 555, 463, 589],[523, 487, 557, 509],[264, 724, 385, 792];Bench:[000, 384, 360, 1000];Lamp:[495, 017, 668, 174];Person:[523, 317, 1000, 1000],[631, 303, 917, 883],[000, 169, 406, 999],[289, 301, 464, 603],[104, 196, 183, 342],[000, 382, 048, 492],[123, 106, 196, 210],[264, 137, 287, 178];Glasses:[181, 248, 282, 299];Other Shoes:[420, 939, 482, 999].') 2023-11-03 08:15:46.647 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:15:46.648 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Lamp:[717, 282, 781, 323];Person:[402, 369, 615, 939];Gloves:[510, 455, 571, 545],[526, 654, 600, 704];Blender:[498, 685, 614, 942],[156, 632, 287, 960],[001, 557, 125, 972];TV:[001, 093, 152, 585].') 2023-11-03 08:16:02.687 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:16:02.687 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Person:[389, 221, 580, 794],[002, 356, 156, 759].') 2023-11-03 08:16:10.253 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 08:16:10.253 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Cup:[571, 630, 769, 836];Plate:[506, 636, 829, 932],[000, 344, 191, 585];Storage box:[161, 483, 311, 631],[283, 390, 415, 529],[387, 477, 552, 647];Chair:[290, 552, 452, 770];Desk:[000, 223, 977, 999].') 2023-11-03 08:17:08.700 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:17:08.701 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '10 steps you need to know before starting a plant [010, 022, 984, 996] based') 2023-11-03 08:17:28.186 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:17:28.186 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect jeans and footwear in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'jeans:[417, 600, 589, 914],[895, 453, 999, 664];footwear:[162, 825, 256, 943],[346, 940, 459, 999],[410, 911, 497, 967],[415, 895, 551, 965],[602, 895, 735, 970],[641, 909, 763, 981].') 2023-11-03 08:17:50.557 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:17:50.557 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect Truck,Glasses,Sneakers,Swing,Monkey and Couch in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Truck:[233, 224, 308, 382];Glasses:[135, 139, 168, 185];Sneakers:[011, 787, 073, 848],[051, 781, 128, 832],[194, 751, 233, 799].') 2023-11-03 08:18:02.913 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:18:02.913 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'dusk colored clouds [003, 002, 997, 540] in the background, highway overpass curved approach bridge - light natural phenomenon stock pictures [004, 006, 999, 996], royalty-free photos [596, 619, 798, 729] & images') 2023-11-03 08:18:37.259 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:18:37.260 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Penguin:[264, 306, 412, 519],[414, 250, 615, 597],[801, 273, 924, 465],[866, 174, 977, 442],[546, 810, 765, 966].') 2023-11-03 08:18:47.996 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:18:47.997 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Chair:[130, 570, 412, 993],[380, 591, 662, 966],[717, 616, 943, 939];shelf:[907, 650, 999, 932];Person:[002, 064, 186, 1000],[166, 458, 552, 993],[409, 477, 677, 974],[711, 532, 905, 846];Leather Shoes:[414, 933, 511, 994],[460, 778, 552, 863],[550, 926, 588, 975],[586, 914, 660, 971];High Heels:[126, 907, 192, 1000];Microphone:[119, 211, 149, 372];TV:[759, 822, 907, 1000],[891, 816, 999, 1000].') 2023-11-03 08:19:01.994 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:19:01.994 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '6 mannequins [701, 474, 866, 880] [409, 477, 512, 839] [229, 483, 340, 862] [093, 489, 224, 881] [589, 484, 674, 853] in gowns [700, 552, 871, 888] [413, 536, 512, 835] [379, 551, 616, 888] stand in a blue room with a portrait [388, 184, 589, 523] in the background on a wall [087, 172, 840, 674].') 2023-11-03 08:19:11.853 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:19:11.853 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Chair:[002, 050, 034, 151],[043, 043, 131, 159];Storage box:[803, 330, 870, 428];Person:[263, 385, 449, 780],[475, 526, 657, 767],[598, 427, 810, 697],[922, 663, 999, 921],[915, 604, 999, 762],[952, 498, 999, 613];Tent:[521, 002, 999, 494].') 2023-11-03 08:19:18.664 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:19:18.664 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'mammal:[407, 217, 775, 571];person:[403, 224, 742, 558].') 2023-11-03 08:19:34.810 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:19:34.811 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Backpack in this image.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Backpack:[701, 618, 805, 659].') 2023-11-03 08:19:45.077 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:19:45.077 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Chair,Car,Street Lights,Person and Other Shoes in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Chair:[291, 363, 325, 387];Car:[001, 355, 365, 542],[124, 346, 405, 480],[286, 340, 390, 378],[606, 339, 691, 393];Street Lights:[744, 262, 774, 294],[809, 087, 880, 295];Person:[765, 306, 801, 435],[793, 291, 884, 450],[700, 293, 801, 451],[677, 308, 736, 420],[641, 311, 687, 420],[363, 201, 657, 820],[101, 246, 323, 711],[389, 348, 467, 565];Other Shoes:[399, 534, 442, 561].') 2023-11-03 08:19:50.355 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:19:50.355 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect plant,sun hat,man and jeans in this image.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'plant:[298, 418, 384, 667];sun hat:[075, 111, 348, 286];man:[348, 096, 711, 977];jeans:[400, 746, 676, 977].') 2023-11-03 08:20:25.454 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:20:25.454 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[350, 476, 542, 638] and track its trajectory.Ensure that each trajectory within your response is articulated using the Frame t:[xmin,ymin,xmax,ymax] specification.', 'This is a bumper carFrame1:[350, 476, 542, 638];Frame2:[517, 447, 665, 593];Frame3:[622, 458, 732, 619];Frame4:[646, 452, 799, 654];Frame5:[603, 443, 773, 637].') 2023-11-03 08:20:45.489 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:20:45.490 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A rectangle white baking dish [009, 002, 990, 999] filled with roasted vegetables like broccoli [695, 789, 860, 909] [157, 693, 355, 833] [268, 696, 470, 808] [184, 001, 379, 100] [659, 655, 848, 794] [263, 219, 474, 345] [110, 448, 330, 605] [097, 365, 285, 485] [660, 244, 872, 379], mushrooms [384, 338, 529, 401] and bell peppers [135, 078, 287, 194]') 2023-11-03 08:20:53.715 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:20:53.715 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A lesson in cool from The Dean: [002, 003, 878, 996] Providence’s hippest boutique hotel isn’t just for visitors') 2023-11-03 08:21:02.173 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:21:02.173 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[390, 207, 488, 392],[209, 212, 313, 392],[818, 500, 890, 662],[585, 532, 651, 878],[400, 498, 486, 793],[336, 702, 408, 992],[227, 646, 290, 993],[162, 582, 237, 998],[872, 682, 949, 856];Backpack:[230, 738, 284, 867];Trash bin Can:[464, 823, 544, 999].') 2023-11-03 08:21:38.010 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:21:38.010 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[010, 010, 995, 983] Connectionless Service • Messages (or packets) are independent of each other – analogous to') 2023-11-03 08:22:21.952 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:22:21.952 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'person:[040, 445, 060, 595],[040, 299, 120, 700],[124, 442, 162, 595],[162, 445, 202, 595],[206, 445, 242, 592],[246, 445, 284, 595],[286, 445, 326, 595],[330, 445, 366, 595],[370, 449, 408, 599],[576, 041, 624, 226],[624, 048, 692, 226],[686, 038, 722, 226],[778, 101, 818, 317],[818, 174, 832, 250],[836, 421, 902, 703],[840, 181, 854, 243],[864, 177, 878, 243],[888, 181, 900, 247],[908, 177, 924, 247],[932, 177, 948, 247],[956, 181, 970, 247];envelope:[002, 000, 998, 996].') 2023-11-03 08:23:12.612 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:23:12.613 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Person:[240, 016, 624, 999];Hat:[394, 016, 556, 251];Tie:[422, 367, 495, 909];Microphone:[609, 313, 710, 427],[651, 417, 728, 545].') 2023-11-03 08:23:13.800 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:23:13.800 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track truckFrame1:[184, 490, 238, 613] in this video clip.The prescribed format for your answer is classFrame t:[xmin,ymin,xmax,ymax]. Please follow it closely.', 'truckFrame1:[184, 490, 238, 613];Frame2:[182, 460, 242, 602].') 2023-11-03 08:23:45.032 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:23:45.032 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Christina Boyce [456, 594, 576, 708] finds her "church" among the redwood forests [003, 005, 995, 996]') 2023-11-03 08:23:51.849 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 08:23:51.849 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Speaker:[001, 654, 055, 686].') 2023-11-03 08:25:28.280 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:25:28.281 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Street Lights:[658, 526, 693, 653],[735, 540, 761, 623];Car:[880, 880, 999, 999],[430, 671, 828, 943],[361, 677, 613, 824],[299, 656, 526, 788],[409, 637, 466, 672];Sports Car:[583, 706, 1000, 1000];Lamp:[000, 001, 085, 194],[053, 429, 091, 501],[064, 498, 090, 559];Satchel:[256, 699, 281, 738],[083, 813, 136, 910].') 2023-11-03 08:25:30.151 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:25:30.151 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The very first kiss as man [296, 460, 551, 997] and wife [386, 493, 619, 996], the ceremony was taking place at the monastery [005, 007, 997, 997] located close to Minthis Hills') 2023-11-03 08:25:56.526 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:25:56.526 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect hiking equipment and clothing in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'hiking equipment:[000, 153, 081, 529],[016, 399, 045, 908],[025, 309, 068, 374],[098, 793, 128, 866],[098, 437, 130, 884],[106, 236, 184, 446],[131, 449, 146, 905],[183, 393, 228, 918],[184, 819, 215, 893],[203, 393, 229, 467],[207, 203, 284, 389],[228, 813, 250, 884],[258, 399, 286, 843],[288, 324, 335, 899],[294, 796, 329, 875],[310, 162, 383, 411],[333, 801, 368, 881],[345, 303, 358, 890],[399, 849, 430, 921],[416, 371, 453, 476],[416, 128, 486, 256],[425, 473, 445, 921],[435, 312, 481, 393],[443, 470, 450, 931],[478, 790, 500, 858],[491, 383, 512, 872],[496, 389, 512, 846],[503, 194, 553, 295],[504, 784, 535, 858],[561, 312, 578, 383],[563, 352, 590, 872],[567, 298, 594, 374],[567, 212, 649, 473],[576, 790, 603, 866],[577, 345, 600, 893],[605, 796, 639, 881],[625, 414, 638, 928],[645, 247, 717, 547],[689, 858, 723, 955],[696, 427, 704, 952],[706, 165, 763, 449],[717, 440, 733, 831],[782, 106, 895, 461],[794, 884, 832, 959],[845, 890, 886, 955],[855, 934, 932, 998],[925, 182, 999, 312],[936, 411, 944, 998],[951, 423, 960, 998];clothing:[775, 056, 910, 895],[000, 138, 095, 819],[094, 209, 189, 810],[180, 168, 299, 816],[286, 125, 394, 819],[390, 094, 511, 872],[477, 179, 553, 796],[561, 203, 633, 819],[625, 221, 740, 893],[696, 144, 771, 793],[882, 116, 999, 996].') 2023-11-03 08:25:58.275 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:25:58.276 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[714, 246, 828, 414] [372, 343, 468, 498] [271, 668, 401, 876] [576, 589, 705, 799] [334, 533, 459, 689] [506, 489, 627, 646] [141, 641, 263, 854] bananas, [000, 603, 058, 847] [049, 512, 159, 795] [711, 517, 824, 775] [767, 602, 934, 779] figs, [185, 447, 317, 566] maple syrup and granola on [121, 460, 707, 945] a toasted baguette') 2023-11-03 08:26:14.723 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:26:14.723 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[002, 021, 996, 995] vibrant orange flower and [179, 364, 662, 786] a bee against green background - symbiotic relationship stock pictures, [532, 624, 864, 717] royalty-free photos & images') 2023-11-03 08:26:31.283 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:26:31.283 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[578, 343, 710, 709] businesswoman speaking to [029, 456, 309, 997] [265, 460, 572, 997] [667, 460, 912, 996] the audience - public speaker stock pictures, [597, 626, 793, 722] royalty-free photos & images') 2023-11-03 08:26:54.867 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:26:54.868 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'man:[605, 537, 728, 933],[021, 493, 068, 660],[093, 658, 266, 890],[161, 491, 212, 656],[231, 472, 343, 845],[233, 600, 414, 965],[355, 475, 418, 652],[370, 493, 471, 808],[409, 491, 533, 877],[541, 469, 590, 668],[588, 473, 625, 656],[709, 467, 765, 744],[745, 460, 790, 689],[838, 458, 903, 689],[910, 453, 985, 646];clothing:[030, 521, 078, 673],[091, 638, 245, 892],[160, 505, 200, 643],[234, 495, 348, 851],[242, 601, 408, 969],[370, 533, 436, 803],[376, 481, 411, 626],[402, 527, 506, 875],[553, 504, 602, 670],[587, 500, 647, 639],[594, 582, 729, 891],[710, 472, 762, 698],[746, 487, 790, 684],[840, 491, 901, 684],[916, 486, 973, 647],[973, 510, 999, 595].') 2023-11-03 08:27:20.323 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:27:20.324 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[175, 404, 238, 466],carFrame1:[348, 449, 410, 523],carFrame1:[039, 395, 146, 466] in this video clip.Use only the categoryFrame t:[xmin,ymin,xmax,ymax] format to structure your answer.', 'carFrame1:[175, 404, 238, 466];Frame2:[059, 396, 152, 469],carFrame1:[348, 449, 410, 523];Frame2:[365, 446, 424, 521],carFrame1[039, 395, 146, 466];Frame2:No object.') 2023-11-03 08:27:36.607 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:27:36.608 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'suit:[463, 231, 718, 997],[096, 347, 464, 999],[671, 365, 711, 527],[703, 355, 743, 532];man:[096, 157, 474, 999],[431, 213, 698, 999],[466, 330, 530, 434],[675, 337, 708, 531],[698, 327, 741, 568],[781, 323, 833, 540],[838, 348, 865, 439],[875, 323, 970, 611],[882, 321, 962, 540],[938, 325, 999, 579];car:[000, 320, 067, 382],[000, 353, 401, 733],[089, 322, 210, 382],[423, 330, 508, 376].') 2023-11-03 08:28:04.946 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:28:04.946 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect tree,person,skull and poster in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'tree:[318, 139, 397, 172],[685, 065, 958, 331],[721, 179, 862, 345],[881, 148, 946, 239];person:[249, 274, 313, 339],[273, 201, 345, 276];skull:[402, 345, 664, 554];poster:[000, 000, 998, 998].') 2023-11-03 08:28:08.820 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:28:08.820 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'teddy bear:[486, 120, 648, 181],[605, 051, 691, 144],[648, 072, 921, 191].') 2023-11-03 08:28:33.729 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:28:33.729 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Bicycle:[117, 344, 214, 523];Hat:[817, 249, 908, 330];Other Shoes:[096, 760, 133, 821],[403, 694, 440, 744],[451, 720, 504, 776];Person:[941, 849, 999, 1000],[753, 249, 1000, 1000],[731, 222, 877, 822],[596, 320, 798, 1000],[174, 294, 441, 1000],[325, 283, 503, 779],[025, 324, 155, 826],[088, 256, 174, 537],[742, 230, 818, 622],[668, 241, 742, 473],[597, 236, 672, 602],[546, 247, 619, 594],[482, 241, 563, 607],[422, 250, 493, 591],[456, 220, 521, 577].') 2023-11-03 08:28:38.808 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:28:38.809 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "[265, 145, 626, 995] Tanner Wells (51) celebrates with the team after coming across home plate during UNCW's matchup with Marshall on Feb. 22, 2020 at Brooks Field.") 2023-11-03 08:28:49.002 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:28:49.003 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect medical equipment,table,clothing and person in this image.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'medical equipment:[244, 000, 459, 292],[483, 440, 616, 536];table:[202, 461, 479, 803],[221, 379, 343, 495],[838, 459, 893, 635];clothing:[426, 303, 510, 508],[500, 300, 559, 374],[547, 257, 651, 383],[611, 309, 696, 440],[746, 283, 863, 579];person:[435, 262, 545, 510],[500, 225, 566, 369],[561, 210, 653, 383],[606, 264, 704, 424],[755, 229, 853, 593].') 2023-11-03 08:29:23.035 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 08:29:23.035 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Bed,Car,Hat and Helmet in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Car:[083, 321, 284, 517],[287, 328, 351, 401];Hat:[467, 268, 532, 321];Helmet:[618, 175, 707, 267],[547, 219, 596, 271],[464, 232, 524, 283].') 2023-11-03 08:29:28.970 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:29:28.971 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "groom's mum putting a button hole on one of [232, 185, 470, 997] the groomsmen") 2023-11-03 08:29:37.556 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:29:37.556 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'man:[471, 201, 999, 987],[000, 651, 181, 999],[232, 095, 595, 999],[698, 000, 999, 211];clothing:[000, 635, 193, 999],[159, 000, 628, 442],[196, 525, 593, 999],[426, 664, 999, 999],[688, 000, 999, 644];human face:[225, 198, 404, 548],[603, 318, 860, 753].') 2023-11-03 08:29:39.290 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:29:39.291 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Four judges [302, 704, 394, 827] sit within their enclosed booths at the edge [003, 807, 991, 923] of the pool with spectators [000, 286, 993, 826] behind as American driver Bruce Harlan [519, 156, 762, 407] makes his final dive of...') 2023-11-03 08:30:10.116 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:30:10.116 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[380, 354, 640, 627] and track its trajectory.Your response should highlight trajectories using the established Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a roadsterFrame1:[380, 354, 640, 627];Frame2:[392, 368, 661, 654];Frame3:[371, 389, 665, 687];Frame4:[348, 368, 656, 687];Frame5:[327, 400, 656, 735].') 2023-11-03 08:30:11.025 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 08:30:11.025 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'sleepy man wearing [199, 107, 883, 893] pajamas holding a cup of [543, 302, 703, 356] tea - pajamas stock pictures, royalty-free photos & images') 2023-11-03 08:30:13.652 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:30:13.653 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Handbag,Other Shoes and Person in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Satchel:[180, 843, 226, 914],[530, 679, 556, 768],[725, 761, 751, 842],[786, 759, 810, 792];Other Shoes:[176, 966, 209, 999],[455, 952, 481, 978],[527, 858, 544, 892],[591, 968, 610, 994];Person:[115, 627, 209, 999],[252, 638, 352, 1000],[423, 652, 513, 975],[505, 642, 559, 897],[547, 623, 655, 994],[661, 678, 716, 856],[694, 647, 716, 725],[730, 650, 792, 861],[780, 649, 801, 755],[812, 651, 844, 778].') 2023-11-03 08:30:24.644 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:30:24.645 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[767, 347, 991, 851],[513, 367, 785, 943],[002, 012, 375, 999],[261, 459, 689, 999],[313, 622, 585, 1000],[829, 704, 1000, 998];Mirror:[556, 003, 770, 596];Cabinet:[563, 101, 756, 470];Desk:[559, 846, 866, 999];Bottle:[760, 721, 801, 950];Tie:[620, 603, 667, 868].') 2023-11-03 08:30:26.982 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:30:26.984 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'human head:[263, 123, 858, 785];human eye:[273, 321, 477, 434],[534, 280, 776, 404];poster:[000, 000, 997, 998];woman:[171, 034, 940, 998];mammal:[146, 037, 937, 998];human face:[271, 186, 798, 679].') 2023-11-03 08:30:41.310 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:30:41.310 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[580, 284, 680, 618] and track its trajectory.Ensure the trajectories in your answer follow the Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a mountain zebraFrame1:[580, 284, 680, 618];Frame2:[345, 333, 497, 690];Frame3:[293, 312, 609, 650];Frame4:[198, 270, 568, 680].') 2023-11-03 08:31:41.991 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:31:41.992 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Belt:[429, 735, 538, 794];Tie:[600, 647, 656, 999],[821, 602, 849, 688];Person:[000, 001, 435, 1000],[359, 001, 706, 1000].') 2023-11-03 08:31:52.288 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:31:52.288 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Nehawu members [002, 508, 106, 997] [251, 480, 399, 996] [474, 470, 665, 995] [778, 392, 918, 995] [164, 498, 269, 996] [622, 434, 752, 998] gathered outside the Union Buildings in Pretoria on 3 September 2020 to hand over their list [267, 478, 373, 661] of demands on the union's national day of action. Picture: Kgomotso Modise/EWN") 2023-11-03 08:31:55.735 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:31:55.736 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2 and frame3, please tell me what this oneframe:1:[164, 498, 177, 513];frame:2:[196, 488, 212, 501] is doing?Ensure your answer is to the point, taking into account the provided image frames and trajectory coordinates.', 'This is a/an the bike going rightward and subsequently making a turn.') 2023-11-03 08:31:59.295 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:31:59.295 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[250, 187, 799, 885] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a tankFrame1:[250, 187, 799, 885];Frame2:[256, 172, 795, 901];Frame3:[253, 157, 788, 900];Frame4:[245, 144, 765, 887];Frame5:[266, 157, 745, 912].') 2023-11-03 08:32:13.816 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 08:32:13.816 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Picture and Bakset in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Frame:[249, 001, 493, 050];Bakset:[055, 736, 150, 890].') 2023-11-03 08:32:19.946 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:32:19.946 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[568, 366, 602, 472],personFrame1:[364, 534, 403, 684],personFrame1:[398, 373, 431, 490],personFrame1:[303, 501, 337, 648],personFrame1:[762, 319, 812, 418] in this video clip.Keep your response consistent with the classFrame t:[xmin,ymin,xmax,ymax] format.', 'personFrame1[568, 366, 602, 472];Frame2:No object,personFrame1:[364, 534, 403, 684];Frame2:[382, 536, 417, 688],personFrame1[398, 373, 431, 490];Frame2:No object,personFrame1:[303, 501, 337, 648];Frame2:[304, 505, 341, 647],personFrame1[762, 319, 812, 418];Frame2:No object.') 2023-11-03 08:32:25.648 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 08:32:25.648 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[382, 334, 576, 829],[202, 239, 312, 563];Hat:[406, 332, 452, 373],[240, 240, 264, 265];Belt:[464, 547, 526, 597];Sneakers:[521, 775, 576, 827],[386, 791, 454, 831],[251, 535, 270, 562],[208, 514, 228, 561];Baseball Glove:[285, 387, 314, 444],[382, 385, 434, 482].') 2023-11-03 08:32:35.966 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:32:35.967 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Person:[012, 236, 627, 795],[356, 250, 901, 1000];Chair:[096, 594, 155, 802];Hat:[425, 250, 580, 341].') 2023-11-03 08:32:36.897 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:32:36.898 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'house:[060, 014, 961, 835];plant:[358, 691, 515, 819],[001, 728, 061, 819],[010, 765, 928, 952],[056, 644, 145, 748],[076, 483, 189, 665],[186, 636, 328, 764],[210, 718, 255, 784],[325, 455, 461, 724],[513, 483, 616, 800],[528, 738, 588, 803],[591, 744, 685, 817],[611, 677, 780, 782],[721, 720, 785, 817],[774, 733, 815, 800],[788, 764, 919, 820],[845, 613, 976, 772],[911, 718, 993, 828];window:[178, 529, 313, 667],[215, 297, 298, 420],[465, 301, 504, 406],[600, 537, 759, 679],[673, 308, 750, 422].') 2023-11-03 08:32:44.886 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:32:44.886 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[366, 359, 554, 995] Diana Vega, a registered respiratory therapist, peeks through a curtain during setup at a temporary coronavirus testing site Monday, Aug. 3, 2020, in Las Vegas. (AP Photo/John Locher') 2023-11-03 08:32:53.937 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:32:53.937 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'person:[020, 005, 137, 149];human face:[058, 023, 112, 111].') 2023-11-03 08:33:13.078 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:33:13.078 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Create professional videos with this amazing editor [000, 684, 697, 997]') 2023-11-03 08:33:42.161 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:33:42.162 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[225, 191, 879, 858] and track its trajectory.Any trajectory details should be arranged according to the Frame t:[xmin,ymin,xmax,ymax] format for clarity.', 'This is a plodderFrame1:[225, 191, 879, 858];Frame2:[227, 184, 899, 851];Frame3:[225, 180, 917, 854];Frame4:[226, 165, 933, 841].') 2023-11-03 08:34:05.900 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:34:05.901 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Gun:[874, 593, 969, 698],[785, 578, 866, 702],[837, 473, 877, 569],[723, 541, 768, 677],[514, 547, 565, 683],[424, 542, 502, 651],[362, 536, 411, 647],[283, 538, 342, 649],[780, 420, 826, 538],[705, 429, 742, 557],[585, 429, 642, 513],[521, 408, 568, 510],[389, 406, 447, 515],[327, 419, 372, 514];Truck:[900, 197, 1000, 364];Heavy Truck:[356, 166, 604, 225];Person:[864, 539, 952, 755],[790, 529, 877, 744],[695, 485, 798, 715],[615, 478, 709, 701],[495, 506, 600, 723],[567, 468, 632, 677],[425, 499, 507, 688],[495, 488, 529, 655],[349, 479, 426, 642],[438, 482, 502, 557],[274, 474, 347, 650],[201, 486, 274, 653],[106, 483, 199, 664],[068, 367, 137, 651],[044, 346, 095, 603],[137, 371, 213, 624],[773, 357, 849, 642],[832, 393, 913, 590],[700, 364, 775, 669].') 2023-11-03 08:34:39.178 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:34:39.178 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[128, 475, 268, 673],carFrame1:[392, 488, 426, 560],carFrame1:[000, 374, 138, 742],carFrame1:[265, 500, 341, 621],carFrame1:[628, 467, 691, 596] in this video clip.Please strictly follow the format categoryFrame t:[xmin,ymin,xmax,ymax] to answer the question.', 'carFrame1:[128, 475, 268, 673];Frame2:[000, 464, 062, 784],carFrame1:[392, 488, 426, 560];Frame2:[354, 490, 405, 579],carFrame1[000, 374, 138, 742];Frame2:No object,carFrame1:[265, 500, 341, 621];Frame2:[066, 516, 248, 698],carFrame1:[628, 467, 691, 596];Frame2:[674, 454, 808, 650].') 2023-11-03 08:34:46.850 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:34:46.850 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Lamp:[590, 001, 716, 357],[001, 396, 133, 647];Chair:[553, 527, 753, 901],[629, 509, 838, 842],[749, 502, 873, 792],[778, 465, 856, 544],[427, 485, 566, 828],[533, 472, 608, 563],[615, 467, 655, 546];Flower:[627, 463, 710, 527];Couch:[001, 492, 125, 791];Side Table:[001, 620, 150, 856].') 2023-11-03 08:34:47.920 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:34:47.920 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[424, 529, 500, 802],personFrame1:[275, 250, 353, 487],personFrame1:[235, 165, 271, 368],personFrame1:[288, 518, 385, 702],personFrame1:[545, 365, 640, 631],personFrame1:[914, 204, 961, 412],personFrame1:[487, 368, 574, 622] in this video clip.Please strictly follow the format categoryFrame t:[xmin,ymin,xmax,ymax] to answer the question.', 'personFrame1:[424, 529, 500, 802];Frame2:[434, 531, 499, 786],personFrame1:[275, 250, 353, 487];Frame2:[292, 247, 382, 484],personFrame1:[235, 165, 271, 368];Frame2:[262, 168, 301, 362],personFrame1:[288, 518, 385, 702];Frame2:[310, 518, 403, 698],personFrame1:[545, 365, 640, 631];Frame2:[551, 369, 613, 631],personFrame1:[914, 204, 961, 412];Frame2:[885, 204, 971, 405],personFrame1:[487, 368, 574, 622];Frame2:[511, 383, 577, 630].') 2023-11-03 08:34:53.260 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:34:53.260 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Person:[407, 146, 467, 245],[575, 079, 647, 257];Car:[032, 076, 980, 939],[691, 246, 999, 723],[755, 190, 999, 315];Fire Truck:[648, 056, 984, 239],[002, 042, 344, 267].') 2023-11-03 08:35:16.547 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:35:16.548 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Cup:[511, 681, 559, 802],[326, 568, 360, 638],[184, 527, 214, 579];Jug:[376, 570, 503, 828];Microphone:[349, 417, 492, 774],[218, 390, 328, 660],[144, 383, 237, 593],[123, 377, 167, 472],[060, 379, 123, 526],[062, 372, 093, 429];Flower:[089, 457, 206, 556],[001, 414, 074, 482];Vase:[118, 532, 174, 583];Desk:[000, 462, 781, 1000];Tie:[663, 459, 737, 683],[424, 405, 491, 557],[228, 384, 251, 441];Person:[462, 233, 879, 962],[501, 268, 706, 642],[672, 353, 720, 435],[535, 254, 607, 403],[364, 233, 561, 631],[456, 319, 483, 391],[413, 321, 465, 470],[294, 290, 429, 582],[175, 268, 327, 545],[063, 310, 146, 466];Necklace:[592, 433, 644, 494];Watch:[650, 696, 682, 758].') 2023-11-03 08:35:25.208 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:35:25.209 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[002, 002, 002, 002] and track its trajectory.Ensure the trajectories in your answer follow the Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a gray monkey playing with a catFrame1:[002, 002, 002, 002];Frame2:[002, 002, 002, 002];Frame3:[002, 002, 002, 002];Frame4:[002, 002, 002, 002].') 2023-11-03 08:35:29.952 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:35:29.953 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Person:[040, 540, 104, 670],[911, 507, 951, 635];Head Phone:[002, 136, 173, 444];Moniter:[215, 394, 388, 599],[373, 217, 702, 611],[686, 287, 987, 618];Keyboard:[444, 609, 729, 660];Mouse:[354, 616, 410, 669];Cup:[134, 519, 229, 725];Lamp:[101, 156, 378, 256],[002, 098, 075, 181];Frame:[002, 531, 061, 682].') 2023-11-03 08:35:34.850 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:35:34.850 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Desk,Person and Bracelet in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Desk:[001, 520, 1000, 999];Person:[001, 193, 725, 802],[219, 260, 750, 688],[544, 248, 939, 561];Bracelet:[396, 493, 468, 527],[539, 469, 595, 493].') 2023-11-03 08:35:42.118 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:35:42.119 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Handbag:[742, 651, 836, 699];Leather Shoes:[498, 655, 548, 694];shelf:[012, 620, 161, 759],[026, 748, 246, 999];Couch:[189, 574, 373, 807],[383, 584, 524, 687];Chair:[504, 527, 604, 696];Lamp:[566, 454, 622, 542],[745, 415, 796, 453],[631, 456, 660, 525];Mirror:[620, 308, 723, 538];Desk:[541, 532, 730, 685];Picture:[907, 267, 999, 435];Bed:[551, 486, 1000, 998];Pillow:[838, 579, 999, 660];Carpet:[237, 723, 790, 996];TV:[022, 486, 155, 635];Person:[371, 494, 552, 688],[683, 455, 872, 685],[685, 469, 712, 508];Book:[419, 547, 496, 611].') 2023-11-03 08:36:02.877 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:36:02.877 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[382, 198, 681, 752] and track its trajectory.Ensure the trajectories in your answer follow the Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a gliderFrame1:[382, 198, 681, 752];Frame2:[368, 287, 680, 747];Frame3:[318, 301, 642, 715];Frame4:[313, 404, 621, 556];Frame5:[260, 331, 545, 675].') 2023-11-03 08:36:15.330 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:36:15.330 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[681, 251, 754, 529] [528, 324, 649, 867] [650, 516, 788, 996] [436, 477, 580, 996] [336, 521, 469, 995] [250, 518, 373, 997] [755, 521, 919, 996] [127, 557, 278, 997] [554, 567, 698, 996] participants and staff from the Summer Sisters Exchange program from the US and Pakistan sit on [017, 155, 972, 991] the stairs') 2023-11-03 08:36:52.284 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:36:52.285 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Car:[035, 316, 921, 872],[489, 232, 818, 424],[010, 244, 235, 415],[041, 758, 923, 999].') 2023-11-03 08:36:59.785 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:36:59.786 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'woman:[479, 179, 585, 425],[940, 214, 992, 344];wheelchair:[408, 418, 622, 902];suit:[126, 197, 268, 834],[248, 209, 351, 772],[408, 439, 626, 668],[471, 242, 569, 395],[561, 230, 630, 380],[658, 212, 731, 368],[728, 221, 926, 824],[951, 255, 999, 376];man:[120, 115, 275, 873],[245, 121, 353, 821],[401, 334, 618, 772],[556, 174, 638, 395],[654, 170, 735, 417],[735, 106, 957, 877].') 2023-11-03 08:37:08.804 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:37:08.805 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'famous filmmaker mrinal sen [102, 004, 975, 998] died at the age of 95') 2023-11-03 08:38:04.240 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:38:04.241 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[830, 707, 915, 774],carFrame1:[000, 699, 283, 1000],carFrame1:[234, 700, 366, 836],carFrame1:[914, 657, 999, 803],trainFrame1:[312, 442, 450, 625] in this video clip.The prescribed format for your answer is classFrame t:[xmin,ymin,xmax,ymax]. Please follow it closely.', 'carFrame1:[830, 707, 915, 774];Frame2:[831, 709, 916, 776],carFrame1:[000, 699, 283, 1000];Frame2:[000, 700, 284, 999],carFrame1:[234, 700, 366, 836];Frame2:[235, 704, 369, 839],carFrame1:[914, 657, 999, 803];Frame2:[913, 660, 999, 804],trainFrame1:[312, 442, 450, 625];Frame2:[321, 457, 449, 623].') 2023-11-03 08:38:13.692 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:38:13.692 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[550, 185, 794, 710] [432, 287, 584, 726] [319, 286, 452, 743] [170, 264, 335, 775] [660, 279, 803, 697] Five Harker athletes to continue their sports in college') 2023-11-03 08:38:40.590 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:38:40.590 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[021, 229, 766, 882] The American kestrel is a small raptor only slightly larger than a robin. Image credit: Chris Swarth') 2023-11-03 08:38:59.557 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:38:59.557 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[132, 031, 600, 996] Reince Priebus, chairman of the Republican National Committee, delivers a speech on Thursday. "We are the party of new ideas in a changing and faster world than ever before," he said.') 2023-11-03 08:39:01.181 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 08:39:01.181 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Couch:[002, 266, 958, 861];Pillow:[675, 489, 927, 825],[034, 716, 689, 998];Person:[313, 098, 999, 1000],[840, 384, 998, 893].') 2023-11-03 08:39:46.059 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:39:46.060 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Backcountry camp on Red Face Mountain, Mount Challenger and Whatcom Peak seen in the distance. North Cascades National Park Washington') 2023-11-03 08:39:54.054 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:39:54.054 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Car:[046, 146, 937, 791],[336, 152, 482, 223],[064, 147, 178, 196];SUV:[800, 140, 867, 188];Bench:[590, 235, 731, 282].') 2023-11-03 08:40:06.996 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:40:06.997 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "A school bus drop [250, 009, 998, 997] off students [002, 498, 270, 648] in Nashville, Tenn., where the city's school district has experienced one of the worst shortages of drivers in the country.") 2023-11-03 08:40:39.032 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:40:39.032 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Storage box:[299, 246, 542, 343];Desk:[001, 283, 1000, 950];Person:[001, 085, 403, 999],[618, 077, 997, 678];Necklace:[100, 185, 190, 408];Book:[171, 608, 598, 822],[663, 318, 784, 385];Sneakers:[142, 933, 348, 997];Glasses:[182, 209, 315, 266];Pliers:[463, 716, 710, 817],[427, 407, 481, 439];Pencil:[245, 479, 317, 507];Bottle:[444, 289, 498, 407].') 2023-11-03 08:40:48.633 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:40:48.634 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Whiteboard and Cabinet/shelf in this image.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Blackboard:[003, 059, 1000, 718];Cabinet:[724, 825, 1000, 1000].') 2023-11-03 08:41:14.021 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:41:14.021 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect drum and tree in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'drum:[000, 701, 070, 905],[038, 752, 240, 960],[089, 672, 167, 775],[111, 622, 203, 743];tree:[000, 000, 999, 691].') 2023-11-03 08:41:16.486 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:41:16.487 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[433, 404, 599, 594] and track its trajectory.For clarity, represent trajectories using the Frame t:[xmin,ymin,xmax,ymax] format in your response.', 'This is a white boat floating on the riverFrame1:[433, 404, 599, 594];Frame2:[423, 412, 589, 619];Frame3:[411, 415, 577, 622];Frame4:[404, 419, 570, 626];Frame5:[399, 427, 564, 634].') 2023-11-03 08:41:17.326 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 08:41:17.327 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Sneakers,Side Table and Basketball in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Sneakers:[316, 681, 352, 709],[390, 757, 429, 784];Side Table:[023, 540, 178, 638];Basketball:[166, 531, 228, 577],[420, 502, 459, 533].') 2023-11-03 08:41:35.832 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:41:35.832 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Rolls-Royce Wraith facelift prototypes [041, 325, 911, 915] caught out in the open') 2023-11-03 08:41:39.312 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:41:39.312 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect shelf,Chair and Handbag/Satchel in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'shelf:[811, 381, 999, 854],[002, 001, 123, 540],[066, 570, 218, 761];Chair:[348, 502, 821, 941];Satchel:[223, 739, 372, 793],[272, 812, 576, 961],[203, 785, 444, 883],[137, 838, 272, 951].') 2023-11-03 08:41:51.482 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 08:41:51.483 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person and Towel in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[720, 284, 835, 657],[470, 654, 597, 1000],[495, 706, 748, 998],[556, 554, 649, 801],[615, 525, 721, 726],[654, 524, 743, 686],[737, 513, 896, 764],[825, 670, 976, 855],[889, 573, 999, 862],[962, 549, 999, 670],[519, 307, 677, 557],[581, 321, 641, 431],[643, 326, 715, 548];Towel:[886, 840, 964, 893].') 2023-11-03 08:42:48.160 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:42:48.160 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'dress:[689, 344, 945, 980],[441, 287, 698, 999];human hair:[181, 644, 541, 1000],[043, 057, 160, 253],[451, 065, 603, 349],[648, 093, 838, 368];mammal:[006, 057, 287, 905],[000, 636, 105, 999],[000, 000, 136, 219],[203, 653, 550, 999],[413, 065, 741, 999],[642, 088, 999, 999];man:[570, 100, 960, 983],[000, 051, 335, 999];human arm:[818, 476, 968, 931],[000, 392, 262, 813],[223, 480, 345, 676],[403, 298, 494, 759],[582, 291, 749, 772];human hand:[238, 550, 336, 662],[153, 696, 251, 811],[441, 591, 487, 753],[580, 606, 653, 772],[775, 915, 841, 999];human mouth:[487, 256, 535, 320];human ear:[057, 196, 095, 263],[395, 926, 432, 999],[746, 234, 771, 309];woman:[210, 661, 546, 999],[411, 073, 745, 992],[645, 098, 999, 999];human head:[042, 047, 190, 338],[208, 670, 493, 999],[456, 061, 593, 342],[642, 091, 812, 408];human face:[095, 119, 190, 327],[214, 820, 395, 999],[476, 121, 558, 333],[648, 173, 745, 399].') 2023-11-03 08:43:19.437 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 08:43:19.437 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect SUV in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'SUV:[463, 620, 546, 703],[402, 613, 473, 682].') 2023-11-03 08:43:25.471 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:43:25.471 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[341, 337, 449, 850],[786, 374, 854, 554],[825, 313, 999, 725],[597, 752, 796, 999],[156, 382, 254, 746],[100, 522, 181, 646];Speaker:[726, 719, 942, 881],[480, 724, 692, 885],[139, 743, 369, 878],[428, 758, 495, 860],[002, 754, 135, 889],[248, 689, 344, 744];Tripod:[741, 506, 820, 701];Drum:[630, 563, 781, 707],[699, 463, 780, 562],[760, 527, 824, 593];Guitar:[826, 545, 999, 665].') 2023-11-03 08:43:28.073 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:43:28.073 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'elementary teacher [531, 013, 970, 508] explains how to read a clock [558, 098, 829, 505] as her young students [044, 004, 447, 502] [385, 532, 502, 995] [674, 496, 902, 995] [089, 497, 400, 995] take notes via video call. (audio) - learning stock videos & royalty-free footage') 2023-11-03 08:43:33.830 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:43:33.830 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'flower:[236, 072, 901, 936].') 2023-11-03 08:43:40.817 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:43:40.817 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'auto part:[000, 430, 053, 525],[000, 186, 129, 307],[000, 307, 158, 458],[000, 390, 662, 865],[035, 222, 183, 317],[117, 214, 258, 319],[166, 272, 347, 434],[227, 231, 288, 303],[230, 086, 426, 279],[288, 164, 690, 425],[452, 152, 537, 201],[523, 131, 612, 201],[545, 201, 869, 805],[648, 617, 795, 861],[657, 159, 748, 185],[743, 200, 826, 395],[783, 190, 903, 348],[815, 303, 940, 572],[899, 293, 999, 369],[931, 379, 997, 546];car:[000, 086, 999, 892].') 2023-11-03 08:43:44.272 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:43:44.272 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Frame:[365, 259, 458, 362],[464, 242, 538, 360],[546, 261, 639, 362],[549, 134, 639, 217],[467, 135, 545, 221],[363, 137, 456, 215];Chair:[714, 647, 958, 992],[104, 655, 359, 996],[469, 534, 597, 811],[599, 504, 760, 770],[883, 555, 998, 857],[255, 464, 374, 717],[309, 502, 464, 773],[658, 464, 783, 701],[811, 405, 868, 568],[883, 450, 998, 602],[024, 446, 136, 629],[163, 418, 253, 597],[135, 404, 198, 596];Desk:[350, 458, 695, 735],[843, 662, 997, 962],[798, 434, 956, 599],[019, 438, 215, 604];Flower:[078, 501, 130, 633].') 2023-11-03 08:43:58.079 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:43:58.079 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[852, 401, 969, 898] [002, 315, 115, 928] [411, 354, 491, 828] [479, 359, 565, 830] [332, 409, 412, 814] [612, 366, 703, 836] [556, 365, 639, 828] [258, 351, 343, 834] [096, 336, 200, 853] 12 officials exercise [406, 568, 454, 794] [622, 478, 657, 832] [766, 460, 796, 819] [269, 508, 315, 839] [689, 492, 718, 838] [835, 550, 963, 890] [334, 615, 398, 802] [474, 458, 503, 835] [190, 511, 251, 843] [026, 455, 137, 866] [107, 481, 169, 857] their shovel arms in this posed photo for the groundbreaking for the Downtown Redmond Link Extension. (Sound Transit') 2023-11-03 08:44:35.965 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:44:35.965 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Chair:[592, 527, 635, 587],[500, 523, 538, 568],[388, 514, 418, 561],[213, 521, 249, 572],[123, 521, 176, 587],[032, 520, 067, 571],[019, 564, 110, 737],[091, 600, 190, 747],[166, 564, 256, 754],[257, 612, 348, 788],[386, 566, 461, 730],[436, 600, 514, 749],[498, 567, 568, 749],[615, 565, 695, 753],[644, 601, 736, 785],[771, 563, 869, 743],[823, 594, 929, 787],[772, 729, 922, 786],[376, 729, 521, 787],[057, 729, 206, 787],[001, 676, 026, 784],[693, 525, 743, 590];Power outlet:[183, 139, 432, 232];Lamp:[642, 131, 901, 231];Person:[257, 493, 379, 788],[383, 455, 455, 580],[664, 459, 723, 584];Backpack:[375, 692, 430, 781];Trash bin Can:[001, 562, 038, 672];Desk:[001, 584, 257, 787],[294, 591, 593, 787],[587, 582, 978, 789],[657, 781, 999, 999],[001, 789, 732, 999].') 2023-11-03 08:44:42.682 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:44:42.683 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'woman:[000, 750, 121, 999],[285, 665, 355, 858],[310, 743, 400, 999],[426, 665, 465, 800],[478, 672, 525, 785],[541, 710, 637, 916],[570, 665, 623, 755],[702, 669, 740, 745],[906, 651, 921, 715];man:[029, 683, 140, 893],[141, 671, 282, 999],[363, 633, 450, 783],[378, 710, 472, 936],[578, 650, 774, 999],[761, 666, 856, 952];girl:[000, 748, 114, 995],[207, 825, 351, 999],[286, 667, 356, 839],[314, 746, 401, 998];clothing:[000, 814, 121, 999],[050, 744, 146, 900],[149, 757, 281, 999],[285, 700, 350, 865],[325, 820, 411, 998],[355, 665, 387, 740],[366, 694, 446, 760],[395, 789, 471, 997],[430, 687, 461, 805],[455, 840, 542, 999],[473, 700, 520, 770],[474, 682, 493, 765],[520, 689, 554, 779],[542, 773, 636, 999],[551, 662, 573, 778],[571, 694, 618, 765],[620, 738, 775, 999],[766, 710, 843, 886],[793, 840, 943, 996],[906, 781, 999, 999];building:[000, 000, 495, 832],[451, 515, 646, 665],[477, 446, 534, 519],[663, 359, 999, 722];human face:[616, 700, 670, 800].') 2023-11-03 08:44:46.283 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 08:44:46.283 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'insect:[485, 453, 536, 525].') 2023-11-03 08:45:09.760 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:45:09.761 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[000, 287, 046, 839],carFrame1:[050, 488, 108, 602],personFrame1:[015, 363, 121, 821],busFrame1:[345, 374, 434, 524],personFrame1:[190, 326, 293, 775] in this video clip.Your response should conform strictly to the categoryFrame t:[xmin,ymin,xmax,ymax] format.', 'personFrame1[000, 287, 046, 839];Frame2:No object,carFrame1:[050, 488, 108, 602];Frame2:[000, 431, 221, 614],personFrame1[015, 363, 121, 821];Frame2:No object,busFrame1:[345, 374, 434, 524];Frame2:[345, 374, 433, 523],personFrame1:[190, 326, 293, 775];Frame2:[081, 317, 219, 814].') 2023-11-03 08:45:12.659 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:45:12.660 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Tuk-tuk driver [278, 055, 810, 996] wears a face mask [412, 170, 537, 341] as a precaution against the spread of Coronavirus in Kochi. With 298 confirmed coronavirus cases, India has already...') 2023-11-03 08:45:20.153 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 08:45:20.154 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'human body:[372, 252, 499, 607],[222, 284, 374, 712],[550, 289, 657, 710];footwear:[277, 653, 301, 719].') 2023-11-03 08:45:28.883 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:45:28.883 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A Libyan family [694, 419, 994, 932] [057, 239, 497, 862] shows support for the rebel forces as they drive in a safe district in Tripoli.') 2023-11-03 08:45:32.268 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:45:32.268 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me the trajectory of the woman wearing white shirt and black jacket.When detailing trajectories in your response, adhere to the TrackiFrame t:[xmin,ymin,xmax,ymax] format.', 'There is one woman wearing white shirt and black jacket.Track1frame1:[756, 621, 1000, 1000];frame2:[668, 625, 1000, 1000];frame3:[583, 629, 1000, 1000];frame4:[494, 638, 1000, 1000].') 2023-11-03 08:47:03.885 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:47:03.887 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'cabinetry:[032, 506, 485, 960];door:[480, 390, 556, 621].') 2023-11-03 08:47:22.643 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:47:22.644 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[258, 219, 910, 926] and track its trajectory.For clarity, represent trajectories using the Frame t:[xmin,ymin,xmax,ymax] format in your response.', 'This is a mountain goatFrame1:[258, 219, 910, 926];Frame2:[242, 205, 821, 898];Frame3:[216, 191, 775, 844];Frame4:[160, 198, 763, 811].') 2023-11-03 08:47:30.991 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:47:30.992 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[126, 003, 995, 998] a bowl of oven roasted corn with bacon') 2023-11-03 08:48:02.888 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:48:02.889 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'An ad [003, 010, 994, 987] in Popular Mechanics from 1932 seeking out "hustlers" to sell grape bricks.') 2023-11-03 08:48:34.828 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:48:34.829 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Desk:[278, 497, 914, 1000];Storage box:[483, 483, 602, 540];Cell Phone:[761, 747, 836, 783];Person:[669, 194, 999, 1000],[545, 248, 808, 558],[001, 186, 340, 972],[581, 649, 664, 674],[653, 731, 734, 755],[646, 695, 711, 719];Necklace:[151, 284, 221, 407];Ballon:[573, 077, 651, 164],[651, 067, 729, 154],[629, 076, 674, 168];Bowl:[443, 593, 561, 655];Hat:[154, 188, 278, 282];Backpack:[001, 565, 345, 959];Sneakers:[014, 880, 097, 969].') 2023-11-03 08:48:46.387 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:48:46.387 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[044, 459, 107, 531],carFrame1:[941, 309, 1000, 383],carFrame1:[469, 397, 610, 559],carFrame1:[655, 388, 999, 926],carFrame1:[443, 313, 514, 381],carFrame1:[017, 324, 101, 385],carFrame1:[897, 314, 953, 377] in this video clip.For clarity in your response, always adopt the classFrame t:[xmin,ymin,xmax,ymax] format.', 'carFrame1:[044, 459, 107, 531];Frame2:[000, 475, 032, 555],carFrame1[941, 309, 1000, 383];Frame2:No object,carFrame1:[469, 397, 610, 559];Frame2:[529, 392, 821, 681],carFrame1[655, 388, 999, 926];Frame2:No object,carFrame1:[443, 313, 514, 381];Frame2:[453, 301, 544, 378],carFrame1[017, 324, 101, 385];Frame2:No object,carFrame1[897, 314, 953, 377];Frame2:No object.') 2023-11-03 08:49:13.877 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:49:13.878 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Necklace,Ring and Person in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Necklace:[474, 480, 510, 533];Ring:[598, 593, 618, 632],[781, 636, 803, 674];Person:[000, 000, 578, 999],[253, 073, 832, 1000],[965, 455, 1000, 800],[890, 260, 1000, 1000].') 2023-11-03 08:50:20.511 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:50:20.512 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A unique Summer hostess gift idea [156, 131, 647, 742] comprised of a galvanized tub [155, 374, 420, 730], decorative throw blanket [224, 399, 634, 971], sparkling frech soda, marble cutting board, fresh grapes [415, 393, 534, 524], specialty cheese and water crackers') 2023-11-03 08:50:41.650 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:50:41.650 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect clothing,table and woman in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'clothing:[200, 664, 545, 999],[461, 307, 730, 591],[665, 230, 711, 342],[800, 253, 987, 589];table:[112, 572, 999, 999],[167, 297, 315, 628],[714, 409, 888, 575];woman:[185, 409, 631, 990],[501, 148, 719, 999],[651, 125, 770, 581],[800, 148, 999, 589],[827, 186, 931, 359].') 2023-11-03 08:50:42.554 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:50:42.554 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Glasses:[896, 013, 975, 090];Hat:[796, 201, 893, 284];Gloves:[741, 255, 768, 317];Handbag:[705, 253, 744, 326],[544, 259, 593, 358],[582, 257, 605, 359];Sneakers:[396, 729, 429, 763],[095, 703, 128, 740],[062, 692, 112, 731],[739, 871, 795, 984];Chair:[302, 458, 370, 600];Cell Phone:[692, 423, 732, 482];Dog:[718, 693, 745, 775];Person:[026, 275, 086, 562],[062, 330, 131, 742],[056, 345, 084, 563],[107, 317, 149, 552],[027, 305, 448, 1000],[258, 468, 290, 538],[303, 330, 365, 552],[383, 334, 431, 673],[359, 359, 449, 762],[429, 386, 462, 653],[440, 335, 537, 718],[736, 208, 926, 1000],[690, 200, 938, 983],[760, 001, 1000, 1000];Umbrella:[068, 138, 279, 262],[189, 071, 427, 255],[289, 035, 631, 269],[688, 077, 981, 274].') 2023-11-03 08:50:51.461 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:50:51.462 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[448, 140, 631, 683];Horse:[305, 225, 801, 927];Bench:[919, 692, 999, 788];Stool:[916, 673, 939, 698],[868, 688, 894, 762],[839, 681, 867, 765],[791, 686, 817, 771],[770, 696, 790, 774],[726, 697, 750, 777];Desk:[886, 661, 933, 759],[814, 667, 861, 770],[557, 716, 598, 791];Lamp:[901, 531, 919, 566],[927, 524, 947, 560],[956, 517, 977, 555];Chair:[331, 728, 372, 825].') 2023-11-03 08:50:54.983 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:50:54.984 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[468, 560, 520, 801],personFrame1:[214, 465, 258, 709],personFrame1:[730, 568, 777, 705] in this video clip.Do not deviate from the established classFrame t:[xmin,ymin,xmax,ymax] format in your answer.', 'personFrame1:[468, 560, 520, 801];Frame2:[424, 586, 523, 801],personFrame1:[214, 465, 258, 709];Frame2:[204, 544, 322, 707],personFrame1:[730, 568, 777, 705];Frame2:[721, 451, 760, 704].') 2023-11-03 08:51:00.434 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:51:00.434 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Guitar:[226, 583, 1000, 793];Speaker:[002, 549, 199, 927];Person:[196, 145, 898, 1000].') 2023-11-03 08:51:08.949 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:51:08.949 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[267, 031, 776, 969];Other Shoes:[399, 853, 584, 923],[332, 908, 537, 964].') 2023-11-03 08:51:13.940 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 08:51:13.941 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'flower:[276, 101, 603, 632],[292, 469, 613, 662],[643, 120, 854, 630];tree:[000, 000, 999, 998].') 2023-11-03 08:51:34.770 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:51:34.770 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[402, 465, 435, 674],personFrame1:[628, 458, 690, 672],personFrame1:[309, 432, 368, 637],personFrame1:[347, 578, 393, 797],personFrame1:[773, 368, 811, 564],personFrame1:[635, 197, 667, 347],personFrame1:[561, 505, 617, 711],personFrame1:[665, 542, 731, 750],personFrame1:[450, 585, 494, 797],personFrame1:[000, 415, 047, 640] in this video clip.Your response should conform strictly to the categoryFrame t:[xmin,ymin,xmax,ymax] format.', 'personFrame1:[402, 465, 435, 674];Frame2:[393, 462, 426, 670],personFrame1:[628, 458, 690, 672];Frame2:[621, 455, 678, 668],personFrame1:[309, 432, 368, 637];Frame2:[296, 423, 357, 629],personFrame1:[347, 578, 393, 797];Frame2:[335, 574, 386, 789],personFrame1:[773, 368, 811, 564];Frame2:[760, 355, 799, 558],personFrame1[635, 197, 667, 347];Frame2:No object,personFrame1:[561, 505, 617, 711];Frame2:[570, 503, 616, 707],personFrame1:[665, 542, 731, 750];Frame2:[650, 537, 729, 750],personFrame1:[450, 585, 494, 797];Frame2:[444, 582, 481, 789],personFrame1:[000, 415, 047, 640];Frame2:[000, 408, 039, 635].') 2023-11-03 08:51:53.615 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:51:53.615 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'human body:[383, 698, 732, 984],[184, 660, 248, 723],[207, 708, 402, 999],[420, 758, 528, 944],[705, 684, 822, 868],[758, 775, 999, 999],[766, 746, 936, 973],[859, 696, 930, 770];woman:[377, 675, 727, 1000],[416, 709, 565, 841],[765, 690, 950, 979],[845, 660, 945, 738];van:[000, 643, 133, 807];human hand:[786, 782, 987, 991],[546, 778, 629, 833],[555, 817, 654, 887];human leg:[751, 753, 812, 870];human hair:[275, 635, 375, 694],[503, 679, 672, 788],[772, 694, 882, 755];mammal:[187, 647, 245, 722],[220, 636, 394, 999],[363, 678, 751, 999],[423, 707, 541, 941],[509, 650, 550, 715],[710, 653, 825, 863],[758, 770, 999, 999],[863, 665, 924, 771];man:[185, 643, 249, 721],[203, 636, 394, 999],[360, 679, 757, 999],[429, 700, 552, 952],[508, 651, 555, 718],[709, 648, 999, 999];girl:[363, 681, 754, 999];building:[000, 000, 999, 759];human arm:[245, 738, 409, 948],[356, 780, 646, 932],[535, 776, 766, 945],[710, 686, 766, 773],[751, 763, 999, 998],[754, 783, 881, 981].') 2023-11-03 08:52:06.479 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:52:06.481 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'human hair:[435, 229, 629, 634],[021, 458, 085, 563],[230, 338, 283, 445],[435, 372, 470, 541],[651, 295, 688, 365],[701, 183, 999, 387];clothing:[184, 370, 353, 641],[032, 544, 145, 705],[146, 134, 206, 325],[373, 550, 755, 998],[378, 516, 487, 740],[625, 355, 689, 425],[702, 366, 910, 697],[728, 331, 780, 438],[868, 333, 952, 500],[963, 277, 999, 363];human arm:[379, 465, 434, 650],[151, 161, 198, 245],[201, 166, 221, 234],[635, 840, 999, 998],[681, 514, 758, 586],[685, 418, 830, 691],[701, 346, 727, 453];person:[000, 187, 999, 998],[148, 096, 217, 331];human hand:[396, 473, 430, 588],[631, 837, 777, 998],[678, 511, 704, 603],[685, 513, 719, 611],[982, 462, 999, 529].') 2023-11-03 08:52:10.992 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 08:52:10.993 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Cell Phone:[371, 200, 494, 286];Person:[347, 104, 561, 292];Wine Glass:[185, 000, 746, 999].') 2023-11-03 08:52:11.811 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:52:11.811 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Photo of A south indian bride [078, 557, 372, 998] in a red saree [084, 676, 368, 997] poses for her bridal portrait.') 2023-11-03 08:52:53.490 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:52:53.490 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[224, 481, 335, 738],personFrame1:[592, 530, 703, 780],personFrame1:[351, 511, 485, 786],personFrame1:[541, 576, 665, 847],personFrame1:[228, 443, 317, 694] in this video clip.The prescribed format for your answer is classFrame t:[xmin,ymin,xmax,ymax]. Please follow it closely.', 'personFrame1:[224, 481, 335, 738];Frame2:[224, 481, 335, 738],personFrame1:[592, 530, 703, 780];Frame2:[592, 530, 703, 779],personFrame1:[351, 511, 485, 786];Frame2:[352, 509, 485, 783],personFrame1:[541, 576, 665, 847];Frame2:[542, 573, 664, 845],personFrame1:[228, 443, 317, 694];Frame2:[228, 443, 317, 694].') 2023-11-03 08:53:25.199 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:53:25.199 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Person:[682, 334, 858, 762],[585, 406, 693, 786],[506, 471, 626, 767],[445, 425, 620, 828],[389, 410, 527, 828],[279, 349, 516, 906],[082, 345, 298, 825],[423, 132, 555, 507];Chair:[781, 420, 925, 731].') 2023-11-03 08:53:38.109 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:53:38.109 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[042, 004, 917, 878] nursery boy eating [285, 649, 421, 801] corn, chunks of [372, 773, 515, 980] mango and [275, 830, 374, 997] strawberry and salad, [042, 625, 563, 997] a healthy food at a Preschool & Daycare Serving Hampton Roads, VA') 2023-11-03 08:53:42.614 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:53:42.614 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Chair,Other Shoes and Cabinet/shelf in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Chair:[030, 256, 075, 376],[037, 375, 214, 681];Other Shoes:[472, 542, 537, 594],[855, 578, 899, 625];shelf:[022, 194, 287, 413].') 2023-11-03 08:53:46.478 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:53:46.478 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[499, 035, 606, 223] A Mexican flag is seen next to a section of [289, 618, 954, 998] the wall separating Mexico and the United States, in Tijuana, Mexico, October 6, 2016 (photo credit: REUTERS') 2023-11-03 08:53:58.704 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:53:58.704 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[012, 046, 084, 118],[367, 844, 495, 989],[105, 830, 274, 988],[011, 803, 130, 988],[011, 652, 170, 824],[171, 697, 416, 881],[101, 587, 236, 826].') 2023-11-03 08:54:15.197 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:54:15.197 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A tray [049, 062, 958, 930] holds four fluffy yellow layers of cheese [157, 087, 813, 697] in concentric containers.') 2023-11-03 08:55:56.839 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:55:56.840 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[652, 000, 857, 887] and track its trajectory.If you're including trajectory details in your reply, the Frame t:[xmin,ymin,xmax,ymax] format is imperative.", 'This is a european hareFrame1:[652, 000, 857, 887];Frame2:[612, 237, 882, 911];Frame3:[634, 240, 781, 909];Frame4:[439, 515, 589, 939].') 2023-11-03 08:56:13.517 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:56:13.517 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'An illustration of a man [139, 015, 888, 978] in a top hat [370, 013, 621, 296] holding a beer and surrounded by bags of money [160, 568, 300, 879] [817, 537, 961, 880] [759, 117, 886, 300] [030, 573, 204, 915] [678, 563, 850, 894].') 2023-11-03 08:56:18.818 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:56:18.818 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, can you tell me what thisframe:1:[125, 388, 241, 611];frame:2:[123, 401, 239, 610];frame:3:[150, 420, 242, 602] is?Briefly articulate your response by taking cues from the provided image frames and trajectory coordinates.', 'This is a/an four little white bunnies in motion.') 2023-11-03 08:57:19.041 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:57:19.042 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the the elephant that lifted the person onto its back using its trunk..As you detail the trajectories in your reply, the TrackiFrame t:[xmin,ymin,xmax,ymax] format is the preferred structure.', 'There is one the elephant that lifted the person onto its back using its trunk..Track1frame:1:[122, 592, 302, 819];frame:2:[119, 591, 302, 820];frame:3:[111, 594, 303, 822].') 2023-11-03 08:57:59.294 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 08:57:59.295 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[355, 145, 732, 825];Bicycle:[359, 413, 689, 897];Sneakers:[610, 625, 682, 695],[467, 754, 517, 821];Handbag:[491, 341, 654, 417];Glasses:[480, 203, 579, 224];Helmet:[465, 142, 605, 206].') 2023-11-03 08:58:14.508 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:58:14.509 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[321, 475, 723, 996] and track its trajectory.To maintain consistency, ensure that trajectories in your answer match the Frame t:[xmin,ymin,xmax,ymax] setup.', 'This is a cheetahFrame1:[321, 475, 723, 996];Frame2:[250, 421, 710, 987];Frame3:[195, 348, 618, 926];Frame4:[171, 426, 596, 996].') 2023-11-03 08:58:41.952 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 08:58:41.952 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Bus:[968, 629, 1000, 689];SUV:[089, 752, 166, 807],[077, 617, 101, 651];Flag:[065, 204, 184, 429],[232, 584, 296, 747],[375, 580, 404, 731],[489, 560, 551, 683].') 2023-11-03 08:59:42.903 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:59:42.903 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Take this water filtration system [218, 116, 772, 741] anywhere') 2023-11-03 08:59:50.847 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 08:59:50.847 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[034, 007, 936, 996] A salad anyone could get behind, packed with [531, 598, 717, 881] [304, 343, 419, 546] [183, 286, 365, 529] [734, 302, 878, 492] [594, 120, 716, 311] [112, 145, 256, 306] prawns, avocado, bacon and [260, 024, 688, 994] [356, 003, 850, 843] asparagus') 2023-11-03 09:00:05.099 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:00:05.100 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[316, 412, 417, 656],personFrame1:[383, 445, 437, 755],personFrame1:[388, 443, 485, 724] in this video clip.To respond correctly, utilize the specified classFrame t:[xmin,ymin,xmax,ymax] format.', 'personFrame1:[316, 412, 417, 656];Frame2:[268, 583, 421, 677],personFrame1:[383, 445, 437, 755];Frame2:[382, 460, 448, 758],personFrame1:[388, 443, 485, 724];Frame2:[337, 473, 458, 775].') 2023-11-03 09:00:20.066 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:00:20.067 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me what is thisFrame1:[156, 440, 783, 902] and track its trajectory.Use the specified Frame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'This is a boat moving on the riverFrame1:[156, 440, 783, 902];Frame2:[145, 409, 767, 870];Frame3:[164, 433, 780, 888].') 2023-11-03 09:01:01.951 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:01:01.952 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The patent [005, 001, 992, 990] shows the system used with a variety of different lens types.') 2023-11-03 09:01:29.499 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:01:29.500 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Pen,Watch,Keyboard and Bracelet in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Pen:[608, 513, 629, 621];Watch:[766, 611, 790, 657];Keyboard:[433, 591, 667, 736];Bracelet:[291, 492, 325, 539].') 2023-11-03 09:01:37.639 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:01:37.639 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Latisha Thompson [004, 100, 895, 989], 31, of Stamford, was charged by police with failing to store two guns she was carrying in her car in Stamford on Tuesday night.') 2023-11-03 09:02:01.521 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:02:01.522 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect human head,human mouth,human hair and human face in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'human head:[416, 329, 594, 670],[187, 349, 404, 796];human mouth:[447, 576, 511, 610];human hair:[385, 295, 603, 734];human face:[195, 547, 289, 796],[417, 407, 559, 670].') 2023-11-03 09:02:11.410 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:02:11.410 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'elephant:[225, 381, 765, 965].') 2023-11-03 09:02:51.401 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:02:51.401 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Cabinet:[000, 000, 224, 590];Desk:[225, 153, 999, 868];Spoon:[920, 267, 964, 309],[845, 188, 870, 210];Knife:[900, 270, 937, 306];Wine Glass:[412, 085, 450, 195],[473, 117, 516, 236],[605, 097, 645, 190],[677, 128, 729, 259],[819, 105, 861, 213],[916, 139, 962, 273];Fork:[556, 170, 603, 189],[773, 185, 809, 206].') 2023-11-03 09:03:05.921 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:03:05.922 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect human arm,human hair and human leg in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'human arm:[529, 211, 674, 440],[237, 255, 366, 411];human hair:[264, 107, 516, 312];human leg:[327, 533, 479, 832],[431, 545, 587, 803].') 2023-11-03 09:03:35.735 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:03:35.736 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Traffic Light:[884, 089, 933, 239],[886, 291, 954, 372];Street Lights:[882, 398, 914, 586],[668, 341, 722, 555];Car:[640, 554, 798, 645],[099, 515, 362, 674];Crane:[002, 108, 220, 313].') 2023-11-03 09:03:40.460 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:03:40.461 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Handbag:[162, 508, 282, 630];Leather Shoes:[082, 787, 121, 837],[069, 845, 131, 897],[131, 913, 183, 970];Other Shoes:[026, 888, 050, 943];Hat:[255, 244, 333, 353];Person:[471, 336, 929, 999],[172, 303, 646, 999],[160, 255, 484, 904],[061, 245, 354, 972],[000, 271, 050, 946];Bottle:[000, 267, 026, 319].') 2023-11-03 09:04:18.391 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:04:18.392 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'person:[188, 376, 308, 673],[000, 292, 045, 623],[188, 316, 998, 895];tree:[000, 121, 690, 373],[625, 261, 863, 310].') 2023-11-03 09:04:23.166 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:04:23.167 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Satchel:[386, 513, 474, 603];Person:[080, 368, 299, 766],[399, 303, 531, 830].') 2023-11-03 09:04:33.228 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:04:33.229 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'human body:[086, 120, 991, 946];human hair:[409, 053, 829, 382];human head:[273, 067, 717, 449];clothing:[269, 330, 965, 869];human arm:[140, 579, 465, 816],[000, 000, 467, 531],[120, 457, 943, 999];human hand:[095, 551, 234, 690],[132, 836, 367, 998],[355, 440, 466, 516];boy:[089, 028, 999, 999];human ear:[591, 256, 703, 383];mammal:[000, 000, 497, 999];human face:[322, 130, 614, 461];human nose:[318, 239, 392, 325].') 2023-11-03 09:04:44.198 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:04:44.198 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[176, 081, 670, 1000],[443, 004, 756, 687];Bicycle:[004, 237, 227, 410];Traffic cone:[809, 500, 994, 684];Hat:[287, 083, 486, 155];Sneakers:[308, 787, 435, 926].') 2023-11-03 09:04:48.547 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:04:48.547 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'clothing:[697, 504, 763, 747],[588, 525, 644, 684];person:[581, 486, 631, 721],[700, 500, 772, 732];building:[000, 000, 999, 999].') 2023-11-03 09:04:50.623 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:04:50.624 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'person:[000, 306, 780, 718];cat:[160, 218, 999, 993];clothing:[466, 309, 706, 424];furniture:[000, 000, 999, 470],[000, 345, 999, 999].') 2023-11-03 09:05:03.385 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:05:03.385 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'In only one month you can do it – perfect body. [004, 004, 993, 990] Photoshop courses – best tool to change your body') 2023-11-03 09:05:13.745 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:05:13.746 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'suit:[025, 121, 387, 992],[000, 508, 045, 999],[000, 308, 115, 634],[311, 335, 404, 901],[398, 332, 567, 999],[408, 324, 541, 370];man:[000, 191, 060, 999],[000, 199, 106, 999],[035, 110, 472, 999],[070, 257, 096, 327],[103, 257, 161, 341],[136, 169, 204, 333],[265, 299, 377, 999],[306, 251, 420, 920],[393, 232, 475, 869],[397, 227, 563, 999],[409, 221, 544, 894],[451, 252, 728, 999],[498, 241, 556, 382],[721, 281, 956, 968],[758, 257, 948, 999],[778, 374, 999, 999],[925, 207, 999, 656].') 2023-11-03 09:06:46.829 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:06:46.830 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Train:[166, 263, 904, 823];Car:[000, 764, 176, 997];Broom:[763, 673, 813, 783];Trash bin Can:[786, 701, 824, 779];Ladder:[502, 263, 615, 840].') 2023-11-03 09:07:08.928 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:07:08.928 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'tree:[465, 105, 959, 754];building:[000, 074, 999, 798];car:[000, 763, 160, 916],[096, 768, 192, 841],[183, 746, 552, 963],[677, 761, 780, 819].') 2023-11-03 09:07:42.852 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:07:42.852 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect man,human face and clothing in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'man:[000, 628, 051, 999],[000, 603, 065, 919],[069, 642, 194, 947],[266, 138, 494, 999],[652, 261, 834, 999],[795, 654, 882, 982],[823, 662, 985, 999],[932, 616, 999, 925];human face:[000, 635, 029, 778],[006, 616, 045, 707],[333, 149, 400, 282],[515, 270, 572, 367],[704, 278, 759, 386],[869, 670, 914, 764];clothing:[064, 685, 194, 948],[000, 785, 039, 999],[015, 717, 072, 999],[257, 279, 492, 999],[460, 351, 657, 999],[608, 657, 999, 999],[645, 368, 833, 999].') 2023-11-03 09:07:46.178 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:07:46.178 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Cabinet:[396, 199, 641, 486],[560, 340, 850, 492],[006, 115, 095, 281],[039, 304, 325, 504];Storage box:[419, 150, 600, 212];Hanger:[165, 004, 262, 052];Chair:[109, 515, 218, 665],[520, 848, 853, 997],[906, 601, 996, 791],[779, 434, 854, 529];Desk:[001, 478, 892, 998];Person:[002, 438, 182, 736],[187, 418, 405, 631],[614, 318, 859, 543],[621, 522, 999, 883];Cup:[524, 597, 610, 683],[248, 590, 328, 673],[137, 633, 230, 724];Gas stove:[554, 330, 852, 380];Oven:[553, 354, 777, 502];Bakset:[635, 290, 771, 351].') 2023-11-03 09:07:50.841 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:07:50.842 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Dinning Table:[069, 844, 1000, 1000];Napkin:[076, 910, 311, 968],[450, 897, 862, 952],[940, 909, 999, 992];Cup:[175, 651, 292, 830],[761, 624, 874, 742];Plate:[618, 854, 968, 905];Wine Glass:[740, 734, 898, 929];Bottle:[965, 819, 1000, 987];Person:[001, 075, 346, 811],[001, 781, 297, 999],[094, 112, 988, 864].') 2023-11-03 09:08:19.411 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:08:19.411 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Glasses:[421, 430, 629, 475];Person:[135, 421, 363, 683],[260, 283, 999, 1000],[002, 565, 493, 1000],[693, 454, 893, 579];Bracelet:[781, 777, 837, 839];Cell Phone:[055, 286, 427, 826];Car:[534, 424, 998, 588].') 2023-11-03 09:08:25.556 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:08:25.556 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me the trajectory of the left cars in light color.Use the specified TrackiFrame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'There is one left cars in light color.Track1frame2:[000, 560, 073, 674];frame3:[000, 560, 061, 677];frame4:[000, 562, 048, 680].') 2023-11-03 09:08:28.024 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:08:28.024 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'iPhone 7 Case Planets [253, 024, 747, 978] and stars [400, 666, 456, 722] [653, 249, 707, 305] [407, 305, 464, 364] in the sky') 2023-11-03 09:08:45.327 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:08:45.327 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[805, 250, 914, 422],[667, 249, 772, 404],[593, 245, 750, 434],[469, 267, 575, 435],[370, 270, 508, 461],[237, 277, 370, 486],[090, 328, 249, 579],[001, 435, 056, 601],[001, 555, 183, 756],[001, 666, 256, 957];Bench:[001, 125, 202, 340],[341, 105, 539, 296],[655, 108, 808, 276],[345, 001, 519, 087].') 2023-11-03 09:08:47.883 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:08:47.883 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[140, 447, 295, 997] and track its trajectory.For clarity, represent trajectories using the Frame t:[xmin,ymin,xmax,ymax] format in your response.', 'This is a deer walking on the ground among other deerFrame1:[140, 447, 295, 997];Frame2:[145, 491, 309, 997];Frame3:[153, 500, 270, 997];Frame4:[159, 522, 265, 997];Frame5:[143, 552, 270, 997].') 2023-11-03 09:08:57.241 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:08:57.241 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'person:[785, 625, 873, 897],[000, 432, 058, 859],[033, 622, 285, 921],[056, 529, 112, 745],[106, 549, 160, 728],[199, 519, 266, 673],[283, 632, 349, 780],[368, 611, 456, 829],[448, 635, 530, 855],[494, 611, 670, 992];clothing:[000, 505, 030, 849],[016, 560, 036, 628],[035, 560, 065, 745],[051, 580, 102, 734],[077, 710, 236, 898],[084, 590, 107, 692],[105, 590, 118, 701],[115, 580, 151, 719],[163, 614, 180, 698],[178, 612, 201, 721],[195, 565, 221, 644],[210, 561, 264, 691],[264, 575, 297, 679],[287, 679, 338, 798],[341, 672, 369, 751],[361, 617, 376, 681],[371, 659, 451, 817],[429, 665, 460, 734],[456, 679, 497, 811],[487, 754, 660, 994],[608, 643, 785, 714],[653, 620, 683, 733],[767, 693, 870, 832],[880, 683, 999, 848],[881, 632, 898, 717],[895, 630, 905, 699],[905, 635, 925, 699],[972, 621, 999, 689];building:[000, 000, 999, 716].') 2023-11-03 09:09:11.196 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:09:11.196 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[383, 399, 474, 524],truckFrame1:[878, 263, 999, 507],carFrame1:[238, 400, 313, 466] in this video clip.When answering, maintain the format classFrame t:[xmin,ymin,xmax,ymax] without deviation.', 'carFrame1:[383, 399, 474, 524];Frame2:[389, 406, 474, 524],truckFrame1:[878, 263, 999, 507];Frame2:[946, 294, 999, 489],carFrame1:[238, 400, 313, 466];Frame2:[242, 402, 314, 462].') 2023-11-03 09:09:30.986 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:09:30.987 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[372, 438, 551, 686] and track its trajectory.When detailing trajectories in your response, adhere to the Frame t:[xmin,ymin,xmax,ymax] format.', 'This is a pungFrame1:[372, 438, 551, 686];Frame2:[455, 423, 663, 693];Frame3:[475, 425, 686, 693];Frame4:[498, 418, 711, 693];Frame5:[520, 409, 742, 686].') 2023-11-03 09:09:52.598 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:09:52.598 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[141, 838, 263, 975] and track its trajectory.Use the specified Frame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'This is a gametarget of a person kicking the enemiesFrame1:[141, 838, 263, 975];Frame2:[171, 802, 391, 972];Frame3:[231, 633, 386, 972];Frame4:[250, 619, 388, 938].') 2023-11-03 09:10:09.516 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:10:09.516 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'barge:[102, 101, 978, 787],[427, 657, 500, 794].') 2023-11-03 09:11:13.446 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:11:13.447 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given frame1: and frame2:,track personFrame1:[939, 248, 987, 330],personFrame1:[680, 548, 720, 665],personFrame1:[007, 638, 046, 790],personFrame1:[104, 427, 142, 544],personFrame1:[936, 333, 972, 447] in this video clip.It's imperative that you stick to the categoryFrame t:[xmin,ymin,xmax,ymax] format for your answer.", 'personFrame1[939, 248, 987, 330];Frame2:No object,personFrame1:[680, 548, 720, 665];Frame2:[706, 541, 750, 669],personFrame1:[007, 638, 046, 790];Frame2:[056, 636, 090, 786],personFrame1:[104, 427, 142, 544];Frame2:[122, 425, 155, 541],personFrame1[936, 333, 972, 447];Frame2:No object.') 2023-11-03 09:11:16.184 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 09:11:16.184 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Bottle:[283, 655, 333, 823];Cup:[670, 730, 776, 928];Cell Phone:[426, 601, 482, 660];Laptop:[460, 689, 692, 807];Person:[002, 002, 394, 998],[336, 163, 666, 725],[659, 448, 999, 1000];Glasses:[238, 150, 303, 219],[472, 262, 591, 306];Handbag:[648, 611, 836, 807];Desk:[108, 646, 913, 1000];Lamp:[457, 002, 518, 066],[585, 002, 644, 089].') 2023-11-03 09:11:17.914 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:11:17.914 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[257, 483, 297, 617],carFrame1:[230, 566, 463, 897],carFrame1:[468, 425, 623, 670],personFrame1:[912, 478, 948, 567],busFrame1:[000, 001, 268, 1000] in this video clip.The prescribed format for your answer is classFrame t:[xmin,ymin,xmax,ymax]. Please follow it closely.', 'carFrame1:[257, 483, 297, 617];Frame2:[213, 478, 262, 634],carFrame1:[230, 566, 463, 897];Frame2:[188, 570, 420, 897],carFrame1:[468, 425, 623, 670];Frame2:[457, 399, 648, 716],personFrame1[912, 478, 948, 567];Frame2:No object,busFrame1:[000, 001, 268, 1000];Frame2:[000, 001, 228, 1000].') 2023-11-03 09:11:29.593 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:11:29.593 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect tree,man,human face and beard in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'tree:[651, 117, 981, 596],[041, 521, 098, 592],[125, 307, 345, 579],[164, 000, 488, 431],[712, 523, 995, 994];man:[074, 070, 762, 984];human face:[415, 113, 658, 611];beard:[408, 344, 645, 637].') 2023-11-03 09:11:55.933 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:11:55.933 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "[502, 258, 728, 998] Actor Lee Seung-gi, left, and [250, 268, 457, 997] Lee Hee-joon pose for pictures during the online press conference for tvN's new thriller series 'Mouse.' Courtesy of tvN") 2023-11-03 09:12:01.079 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:12:01.079 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[359, 374, 455, 911] and track its trajectory.Ensure the trajectories in your answer follow the Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a personFrame1:[359, 374, 455, 911];Frame2:[341, 374, 430, 933];Frame3:[290, 376, 386, 947];Frame4:[278, 382, 373, 969].') 2023-11-03 09:12:03.651 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:12:03.652 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[002, 244, 993, 992] The new fleet of Rossignol skis for 2017.') 2023-11-03 09:12:05.506 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:12:05.506 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the the rabbit that emerged from the underground. in this video clip?When detailing trajectories in your response, adhere to the TrackiFrame t:[xmin,ymin,xmax,ymax] format.', 'There is no the rabbit that emerged from the underground..') 2023-11-03 09:12:39.929 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:12:39.929 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[425, 479, 506, 663] and track its trajectory.When detailing trajectories in your response, adhere to the Frame t:[xmin,ymin,xmax,ymax] format.', 'This is a personFrame1:[425, 479, 506, 663];Frame2:[376, 452, 451, 676];Frame3:[421, 433, 484, 691];Frame4:[567, 400, 648, 643].') 2023-11-03 09:12:48.756 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 09:12:48.757 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Violin:[165, 206, 676, 322];Guitar:[367, 438, 523, 809];Speaker:[003, 872, 285, 1000];Person:[271, 104, 909, 928].') 2023-11-03 09:13:17.946 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:13:17.947 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'goose:[057, 140, 082, 214],[110, 140, 156, 206],[134, 330, 178, 401],[270, 436, 778, 765],[338, 362, 726, 685],[388, 117, 999, 496];swan:[261, 433, 778, 769],[333, 361, 738, 680].') 2023-11-03 09:13:27.148 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:13:27.148 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'land vehicle:[480, 582, 560, 750],[780, 460, 945, 815].') 2023-11-03 09:13:32.354 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:13:32.355 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[555, 490, 646, 743],personFrame1:[419, 294, 451, 443],personFrame1:[502, 377, 565, 588],personFrame1:[604, 355, 646, 530],personFrame1:[558, 606, 606, 852],personFrame1:[509, 468, 563, 705],personFrame1:[670, 687, 708, 947] in this video clip.For clarity in your response, always adopt the classFrame t:[xmin,ymin,xmax,ymax] format.', 'personFrame1:[555, 490, 646, 743];Frame2:[536, 500, 584, 748],personFrame1[419, 294, 451, 443];Frame2:No object,personFrame1:[502, 377, 565, 588];Frame2:[485, 376, 575, 588],personFrame1:[604, 355, 646, 530];Frame2:[612, 350, 652, 523],personFrame1:[558, 606, 606, 852];Frame2:[535, 612, 592, 855],personFrame1:[509, 468, 563, 705];Frame2:[471, 480, 560, 704],personFrame1:[670, 687, 708, 947];Frame2:[661, 688, 711, 950].') 2023-11-03 09:14:08.914 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:14:08.914 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect whisk,chair,man and clothing in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'chair:[490, 712, 688, 1000],[000, 531, 016, 698],[000, 779, 210, 999],[103, 431, 157, 506],[108, 535, 256, 797],[210, 782, 425, 999],[226, 431, 290, 621],[256, 545, 388, 700],[281, 376, 349, 420],[357, 427, 413, 685],[371, 379, 420, 484],[383, 535, 522, 755],[422, 441, 517, 515],[518, 385, 606, 574],[548, 425, 632, 628],[609, 381, 681, 516],[666, 431, 753, 591],[683, 722, 920, 999],[685, 531, 846, 779],[758, 433, 844, 506],[844, 433, 927, 564],[877, 751, 999, 999],[881, 528, 990, 735],[972, 436, 999, 623];man:[000, 498, 185, 965],[000, 372, 104, 555],[108, 318, 172, 503],[121, 363, 265, 724],[217, 306, 310, 533],[372, 306, 427, 472],[481, 264, 532, 353],[532, 424, 711, 678],[664, 326, 748, 551],[691, 388, 840, 774];clothing:[000, 473, 105, 627],[000, 644, 254, 999],[103, 386, 160, 519],[120, 423, 293, 730],[216, 377, 318, 534],[231, 694, 433, 880],[253, 492, 390, 669],[299, 329, 346, 408],[335, 406, 401, 519],[364, 327, 432, 467],[398, 461, 545, 717],[406, 246, 463, 360],[438, 333, 530, 415],[468, 622, 686, 961],[485, 394, 515, 479],[548, 380, 618, 461],[614, 371, 687, 433],[614, 488, 714, 673],[620, 307, 687, 371],[670, 394, 754, 544],[687, 696, 903, 973],[704, 471, 836, 734],[743, 344, 771, 452],[848, 404, 933, 575],[852, 369, 896, 483],[861, 479, 993, 697],[890, 686, 999, 981].') 2023-11-03 09:14:20.802 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:14:20.803 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Heart shaped cookies, [423, 463, 604, 729] cup of coffee on [153, 082, 884, 980] a purple placemat with flowers high angle view on a white wooden board background') 2023-11-03 09:15:01.282 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:15:01.282 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the bird flying around?To ensure accuracy, apply the TrackiFrame t:[xmin,ymin,xmax,ymax] template for every trajectory in your response.', 'There are 3 bird flying around.Track1frame:1:[298, 255, 726, 998];frame:2:[600, 368, 756, 631];frame:3:[460, 352, 604, 618],Track2frame:1:[000, 376, 220, 768];frame:2:[000, 719, 142, 780];frame:3:[026, 431, 170, 798],Track3frame:1:[148, 188, 496, 620];frame:2:[000, 348, 025, 431];frame:3:[054, 231, 171, 611].') 2023-11-03 09:15:10.314 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:15:10.314 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[294, 275, 626, 630] and track its trajectory.Any trajectory details should be arranged according to the Frame t:[xmin,ymin,xmax,ymax] format for clarity.', 'This is a stealth fighterFrame1:[294, 275, 626, 630];Frame2:[322, 309, 663, 516];Frame3:[340, 294, 684, 477];Frame4:[349, 294, 695, 477];Frame5:[369, 328, 712, 501].') 2023-11-03 09:15:11.780 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:15:11.781 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect girl and human face in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'girl:[444, 281, 674, 997];human face:[076, 233, 206, 458],[344, 297, 438, 471],[478, 343, 562, 477],[642, 249, 730, 399],[758, 364, 846, 506],[870, 139, 986, 332].') 2023-11-03 09:15:40.758 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:15:40.759 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'cat:[119, 000, 639, 671],[218, 445, 718, 948],[597, 000, 789, 452].') 2023-11-03 09:15:58.719 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:15:58.720 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Chair:[168, 398, 411, 904],[315, 382, 466, 721];Person:[196, 320, 481, 872],[360, 292, 508, 775];Car:[001, 302, 050, 484].') 2023-11-03 09:16:44.102 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:16:44.103 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given an image,please detect person in this image.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'person:[490, 291, 682, 942],[403, 318, 536, 756],[091, 375, 222, 855],[160, 336, 234, 722],[001, 295, 131, 908],[080, 294, 173, 776],[100, 301, 181, 728],[162, 306, 275, 698],[203, 285, 286, 624],[165, 277, 274, 660],[229, 288, 288, 587],[232, 261, 298, 566],[246, 247, 299, 527],[274, 250, 332, 516],[313, 241, 368, 506],[341, 247, 403, 489],[379, 220, 422, 459],[404, 231, 457, 507],[438, 243, 505, 510],[491, 223, 551, 527],[533, 220, 603, 549],[592, 207, 659, 578],[633, 286, 698, 630],[671, 237, 765, 666],[688, 270, 837, 741],[701, 334, 834, 794],[703, 310, 811, 684],[765, 345, 893, 846],[729, 459, 996, 1000],[777, 352, 953, 1000],[820, 412, 957, 1000],[809, 202, 923, 708],[734, 166, 862, 684],[057, 250, 128, 681],[120, 196, 189, 512],[015, 208, 087, 549],[221, 187, 282, 472],[189, 208, 246, 472],[152, 174, 213, 498],[274, 225, 325, 466],[324, 205, 386, 465],[381, 175, 440, 445],[427, 193, 482, 460],[467, 219, 510, 454],[460, 163, 509, 429],[536, 178, 593, 424],[587, 181, 654, 506],[635, 189, 706, 533],[675, 211, 756, 665],[656, 198, 723, 524],[053, 226, 149, 551],[310, 211, 363, 468],[403, 226, 442, 463],[431, 223, 474, 463],[920, 301, 1000, 911],[001, 427, 091, 1000],[752, 204, 822, 521],[187, 253, 255, 522],[001, 773, 848, 1000],[223, 163, 326, 228],[914, 385, 1000, 689].') 2023-11-03 09:17:05.681 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:17:05.682 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[228, 004, 627, 663] A grey-haired woman pouring salt in [250, 288, 683, 862] a spiral pattern on [002, 005, 995, 998] a dark wooded floor.') 2023-11-03 09:17:49.295 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:17:49.295 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[483, 310, 538, 399],truckFrame1:[408, 267, 460, 355],carFrame1:[225, 336, 326, 430],carFrame1:[944, 371, 999, 473],carFrame1:[898, 351, 972, 413],carFrame1:[314, 326, 399, 440],carFrame1:[408, 352, 525, 516] in this video clip.Ensure you use the exact format categoryFrame t:[xmin,ymin,xmax,ymax] in your response.', 'personFrame1[483, 310, 538, 399];Frame2:No object,truckFrame1:[408, 267, 460, 355];Frame2:[340, 267, 401, 357],carFrame1:[225, 336, 326, 430];Frame2:[122, 337, 242, 442],carFrame1:[944, 371, 999, 473];Frame2:[960, 360, 999, 421],carFrame1[898, 351, 972, 413];Frame2:No object,carFrame1:[314, 326, 399, 440];Frame2:[216, 323, 325, 459],carFrame1:[408, 352, 525, 516];Frame2:[336, 356, 465, 530].') 2023-11-03 09:18:10.504 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:18:10.505 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'man:[468, 201, 614, 785],[053, 080, 260, 673],[270, 272, 420, 756],[397, 322, 471, 761],[590, 199, 786, 754],[875, 252, 962, 470],[889, 034, 999, 461];clothing:[046, 147, 275, 786],[252, 381, 284, 553],[285, 391, 425, 758],[401, 418, 465, 768],[453, 293, 604, 766],[588, 328, 793, 696],[881, 322, 976, 458],[940, 132, 999, 433].') 2023-11-03 09:18:21.586 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:18:21.587 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Person:[002, 002, 1000, 999],[002, 390, 999, 964];Glasses:[457, 097, 828, 182].') 2023-11-03 09:18:45.658 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:18:45.658 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'tree:[350, 792, 416, 830],[494, 773, 624, 845],[690, 765, 718, 876].') 2023-11-03 09:19:48.762 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:19:48.763 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[345, 484, 663, 663] and track its trajectory.All trajectories in your reply should conform to the Frame t:[xmin,ymin,xmax,ymax] pattern.', 'This is a pheasantFrame1:[345, 484, 663, 663];Frame2:[382, 528, 696, 706];Frame3:[348, 494, 648, 669];Frame4:[279, 495, 571, 679];Frame5:[249, 499, 519, 686].') 2023-11-03 09:20:04.943 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:20:04.943 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[047, 498, 128, 766],personFrame1:[386, 399, 518, 614],personFrame1:[255, 461, 384, 740],personFrame1:[289, 375, 342, 592],personFrame1:[139, 468, 268, 748],personFrame1:[358, 462, 422, 745],personFrame1:[232, 475, 290, 757] in this video clip.When answering, maintain the format classFrame t:[xmin,ymin,xmax,ymax] without deviation.', 'personFrame1:[047, 498, 128, 766];Frame2:[048, 481, 118, 773],personFrame1:[386, 399, 518, 614];Frame2:[383, 434, 513, 616],personFrame1:[255, 461, 384, 740];Frame2:[254, 470, 387, 745],personFrame1[289, 375, 342, 592];Frame2:No object,personFrame1:[139, 468, 268, 748];Frame2:[126, 490, 268, 664],personFrame1:[358, 462, 422, 745];Frame2:[357, 430, 421, 753],personFrame1:[232, 475, 290, 757];Frame2:[230, 486, 284, 762].') 2023-11-03 09:20:18.109 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:20:18.109 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Microphone:[612, 252, 711, 300],[577, 448, 687, 506],[466, 341, 563, 402],[368, 447, 424, 511],[062, 442, 118, 531],[369, 654, 489, 711];Speaker:[242, 857, 574, 970],[020, 900, 243, 970];Glasses:[571, 290, 610, 312];Belt:[769, 575, 875, 627];Piano:[023, 741, 365, 841];Guitar:[379, 564, 472, 810],[722, 193, 787, 703];Violin:[116, 510, 185, 614];Person:[699, 128, 900, 968],[495, 239, 719, 968],[353, 347, 526, 869],[023, 395, 223, 914].') 2023-11-03 09:20:34.136 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:20:34.137 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Couch:[126, 687, 812, 1000],[253, 448, 552, 763];Chair:[001, 546, 330, 947],[718, 568, 914, 960],[776, 525, 928, 865],[648, 438, 734, 674],[211, 398, 293, 563],[116, 392, 209, 546];shelf:[755, 562, 856, 682];Desk:[142, 410, 377, 549];Lamp:[960, 022, 991, 101];Radiator:[899, 629, 955, 924];Backpack:[133, 538, 248, 718];Moniter:[775, 445, 885, 600].') 2023-11-03 09:21:22.030 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:21:22.030 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[498, 506, 553, 575],carFrame1:[567, 503, 711, 654],carFrame1:[123, 466, 244, 622],carFrame1:[223, 501, 280, 589],carFrame1:[000, 404, 160, 681] in this video clip.Ensure you use the exact format categoryFrame t:[xmin,ymin,xmax,ymax] in your response.', 'carFrame1:[498, 506, 553, 575];Frame2:[509, 503, 573, 588],carFrame1:[567, 503, 711, 654];Frame2:[621, 496, 864, 728],carFrame1:[123, 466, 244, 622];Frame2:[000, 443, 184, 657],carFrame1:[223, 501, 280, 589];Frame2:[162, 494, 242, 609],carFrame1[000, 404, 160, 681];Frame2:No object.') 2023-11-03 09:21:30.899 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:21:30.899 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'tree:[000, 000, 999, 750].') 2023-11-03 09:21:45.804 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:21:45.804 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A mother [334, 080, 417, 367] and her daughter [396, 443, 529, 996] take a group photo with the students [800, 275, 975, 997] [199, 387, 320, 996] [152, 292, 260, 993] [617, 035, 757, 408] [015, 352, 209, 996] [481, 319, 592, 996] [258, 362, 409, 996] [726, 369, 882, 995] [542, 326, 731, 997] they worked with in Fiji') 2023-11-03 09:22:03.467 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:22:03.468 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect human arm,wheelchair,clothing,person and lynx in this image.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'human arm:[046, 309, 111, 508],[111, 337, 156, 532],[138, 288, 165, 361],[147, 337, 166, 508],[170, 512, 202, 727],[242, 305, 315, 487],[269, 508, 343, 690],[292, 388, 319, 491],[293, 534, 318, 646],[329, 317, 403, 477],[355, 536, 393, 709],[403, 357, 428, 471],[428, 347, 465, 479],[445, 572, 493, 735],[461, 382, 500, 479],[496, 595, 556, 723],[505, 349, 547, 516],[545, 390, 583, 520],[553, 622, 620, 748],[588, 347, 607, 526],[601, 638, 646, 764],[635, 547, 657, 731],[663, 369, 686, 508],[719, 357, 738, 485],[723, 538, 747, 750],[736, 382, 761, 518],[778, 345, 800, 520],[801, 530, 903, 679],[841, 353, 912, 427],[874, 297, 958, 439];wheelchair:[728, 550, 943, 934];clothing:[043, 265, 146, 777],[139, 255, 958, 877];person:[048, 171, 149, 893],[141, 186, 956, 803],[168, 393, 348, 892],[291, 436, 376, 836],[371, 478, 461, 826],[430, 422, 543, 795],[538, 524, 653, 797],[635, 410, 761, 830],[735, 428, 907, 822].') 2023-11-03 09:22:39.903 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:22:39.903 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Flower:[028, 712, 072, 776],[580, 841, 628, 920];Chair:[190, 703, 244, 843],[150, 367, 213, 445],[177, 400, 223, 473],[748, 798, 798, 877];Person:[348, 839, 418, 972],[418, 839, 469, 960],[698, 896, 763, 975],[794, 760, 856, 907],[660, 794, 735, 896],[341, 731, 401, 880],[381, 697, 484, 805],[425, 604, 515, 797],[592, 726, 651, 819],[649, 640, 714, 719],[661, 711, 719, 796],[125, 784, 208, 877],[874, 688, 927, 792],[841, 569, 897, 657],[766, 517, 810, 582],[902, 655, 952, 777],[880, 527, 955, 651],[832, 454, 875, 560],[632, 480, 697, 551],[518, 435, 562, 505],[318, 538, 371, 623],[302, 590, 352, 658],[327, 625, 388, 714],[195, 535, 245, 621],[082, 595, 132, 668],[073, 501, 126, 577],[302, 468, 347, 534],[168, 697, 225, 803],[864, 384, 910, 447],[806, 279, 854, 355],[383, 396, 426, 475],[042, 353, 092, 419],[093, 102, 139, 259],[246, 465, 301, 576];Bed:[005, 660, 196, 924].') 2023-11-03 09:23:02.569 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:23:02.569 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'woman:[408, 722, 538, 997],[000, 678, 047, 905],[000, 650, 055, 833],[158, 650, 194, 768],[163, 718, 247, 999],[185, 655, 221, 725],[207, 678, 248, 786],[313, 674, 334, 800],[324, 669, 363, 855],[366, 660, 391, 772],[386, 656, 405, 748];man:[038, 640, 125, 850],[142, 617, 174, 689],[221, 620, 246, 715],[273, 644, 322, 792],[537, 676, 591, 835];clothing:[002, 709, 053, 881],[034, 775, 225, 999],[178, 770, 252, 995],[231, 730, 327, 995],[323, 704, 365, 855],[414, 788, 521, 999],[503, 712, 530, 830],[539, 699, 587, 849];building:[300, 037, 863, 673];car:[572, 674, 999, 999],[576, 665, 682, 734],[633, 695, 798, 851],[652, 680, 756, 711],[756, 734, 999, 999],[930, 923, 999, 992].') 2023-11-03 09:23:12.594 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 09:23:12.595 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[000, 305, 526, 999];man:[377, 233, 932, 999].') 2023-11-03 09:23:32.372 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:23:32.372 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'woman:[117, 274, 390, 990],[000, 746, 074, 999],[000, 576, 113, 983],[051, 265, 124, 653],[083, 639, 112, 873],[156, 157, 226, 507],[229, 187, 290, 432];human hair:[368, 293, 443, 401],[000, 742, 061, 999],[086, 319, 118, 386],[287, 302, 350, 411],[576, 288, 641, 380],[747, 395, 823, 515],[807, 461, 881, 580],[953, 725, 999, 815];human head:[577, 279, 663, 409],[000, 222, 008, 306],[000, 582, 021, 740],[000, 747, 066, 924],[267, 301, 365, 416],[373, 289, 441, 422],[732, 399, 825, 531],[785, 458, 881, 614],[937, 724, 999, 866];man:[753, 439, 951, 1000],[322, 284, 498, 999],[523, 289, 701, 999],[665, 399, 822, 999],[855, 720, 999, 999];human arm:[248, 416, 316, 609],[051, 261, 085, 388],[071, 892, 118, 981],[085, 699, 110, 765],[088, 399, 124, 553],[320, 446, 356, 774],[456, 461, 496, 792],[522, 459, 594, 617],[571, 450, 701, 598],[667, 576, 734, 750],[672, 604, 808, 750],[735, 626, 933, 831],[745, 602, 793, 774],[854, 906, 915, 999];dress:[121, 399, 375, 999];suit:[322, 409, 498, 999],[523, 397, 706, 999],[673, 531, 817, 999],[898, 866, 999, 999];human body:[000, 573, 120, 986],[002, 743, 078, 999],[116, 296, 380, 999],[317, 290, 500, 999],[523, 286, 705, 996],[665, 397, 825, 999],[858, 719, 999, 998];girl:[000, 750, 075, 999],[000, 565, 113, 983],[121, 303, 376, 999];flower:[000, 476, 111, 891],[019, 372, 110, 445],[316, 460, 408, 570];human face:[324, 322, 361, 406],[378, 317, 420, 421],[584, 322, 626, 411],[735, 426, 780, 521],[796, 486, 834, 596].') 2023-11-03 09:23:45.036 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:23:45.036 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the the final one to enter among the gray ducks walking into the door.?Stick to the TrackiFrame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'There is one the final one to enter among the gray ducks walking into the door..Track1frame:1:[318, 365, 447, 825];frame:2:[316, 352, 449, 826];frame:3:[335, 300, 454, 800].') 2023-11-03 09:24:10.265 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:24:10.266 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[619, 020, 999, 999],[218, 131, 665, 999];Necklace:[737, 866, 851, 999].') 2023-11-03 09:24:32.179 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:24:32.179 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'window:[365, 268, 642, 673].') 2023-11-03 09:26:13.004 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 09:26:13.004 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'SUV:[683, 397, 998, 725];Car:[508, 487, 712, 688].') 2023-11-03 09:27:16.960 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:27:16.960 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "San Marcos has installed [000, 049, 800, 999] 10 mermaid statues in a nod to the city's history and an attempt to promote tourism. This is Blooming Brightly.") 2023-11-03 09:27:20.280 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:27:20.280 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Still from Roger and Hammersteins 1997 "Cinderella". shows the two stars, Brandy and Whitney Houston, Brandy is wearing [266, 532, 525, 996] a pal blue ball gown and [330, 087, 443, 181] tiara and Whitney Houston is wearing [435, 382, 779, 998] a gold gown.') 2023-11-03 09:27:35.813 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:27:35.814 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Lamp:[394, 196, 453, 303],[936, 072, 1000, 256],[347, 085, 389, 137];Plate:[169, 852, 277, 917],[117, 795, 198, 843];Cup:[634, 648, 659, 695],[733, 629, 760, 674],[845, 608, 870, 650];Basin:[125, 839, 174, 877];Moniter:[377, 286, 445, 361];Chair:[852, 627, 1000, 999],[681, 646, 934, 999],[497, 590, 694, 972],[703, 560, 813, 768],[614, 572, 801, 884],[325, 567, 483, 857],[221, 837, 505, 999],[001, 932, 193, 999],[001, 672, 193, 840],[231, 537, 342, 780],[330, 527, 413, 593],[413, 523, 490, 579],[956, 518, 999, 588],[902, 530, 968, 628],[146, 523, 254, 707],[092, 505, 174, 685],[043, 494, 127, 632],[001, 625, 092, 697];Desk:[525, 625, 997, 1000],[002, 788, 373, 1000],[222, 560, 489, 822].') 2023-11-03 09:28:19.283 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:28:19.284 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'woman:[005, 775, 071, 999],[036, 642, 101, 908],[065, 507, 130, 701],[141, 456, 182, 585],[146, 593, 227, 781],[165, 749, 235, 994],[182, 397, 215, 507],[333, 397, 370, 589],[377, 330, 406, 413],[510, 579, 578, 837],[521, 369, 544, 530],[551, 516, 601, 675],[605, 275, 650, 456],[607, 514, 658, 702],[674, 921, 746, 999],[690, 399, 733, 514],[700, 323, 737, 424],[850, 761, 917, 916].') 2023-11-03 09:28:27.734 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 09:28:27.734 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'man:[595, 025, 804, 530],[395, 300, 685, 998];woman:[000, 524, 175, 997],[026, 371, 169, 580],[084, 426, 523, 998];fashion accessory:[246, 801, 420, 876],[659, 141, 722, 388];human face:[054, 000, 136, 297],[095, 594, 170, 760],[568, 390, 614, 522],[675, 030, 726, 136].') 2023-11-03 09:28:35.512 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:28:35.512 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'bridesmaids [584, 132, 796, 998] [164, 120, 415, 997] [399, 196, 600, 998] wearing blue dresses [165, 324, 413, 998] [587, 337, 786, 998] [403, 399, 592, 998] outside the church [078, 004, 994, 808]') 2023-11-03 09:28:55.278 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:28:55.279 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'table:[046, 668, 115, 808],[046, 822, 205, 936],[128, 642, 202, 734],[202, 661, 318, 796],[269, 628, 342, 656],[324, 639, 378, 739],[693, 696, 794, 734],[870, 810, 952, 936];tree:[043, 059, 960, 912].') 2023-11-03 09:28:59.111 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:28:59.112 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'human body:[531, 360, 718, 970];woman:[027, 551, 188, 953],[385, 453, 551, 900],[544, 431, 703, 986],[666, 379, 778, 622];human hair:[010, 544, 146, 863],[122, 383, 168, 460],[153, 348, 195, 418],[198, 356, 226, 421],[239, 364, 286, 437],[252, 471, 320, 592],[278, 354, 322, 441],[300, 508, 396, 612],[349, 421, 415, 510],[360, 346, 395, 418],[396, 344, 445, 433],[410, 445, 495, 620],[520, 392, 567, 476],[580, 433, 646, 546],[700, 374, 750, 506],[778, 360, 827, 429],[971, 370, 999, 436];man:[202, 537, 529, 978],[000, 540, 171, 999],[031, 343, 063, 407],[160, 347, 255, 548],[193, 360, 245, 472],[290, 362, 345, 436],[351, 349, 441, 524],[478, 366, 548, 518],[526, 392, 597, 647],[555, 368, 590, 443],[565, 352, 675, 574],[686, 350, 703, 418],[873, 283, 935, 408],[902, 370, 999, 701];human arm:[006, 665, 078, 863],[118, 500, 148, 579],[168, 468, 255, 545],[187, 703, 380, 918],[333, 694, 518, 921],[525, 540, 564, 652],[532, 587, 647, 768],[762, 441, 796, 513],[878, 293, 924, 364],[917, 453, 950, 553].') 2023-11-03 09:29:25.833 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:29:25.833 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Stool:[381, 612, 558, 890];Person:[163, 233, 310, 947],[303, 233, 419, 913],[405, 221, 542, 941],[542, 182, 684, 961],[684, 159, 833, 1000].') 2023-11-03 09:29:40.216 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:29:40.217 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A French flag [406, 078, 487, 441] flutters from the back [002, 528, 946, 994] of the boat with the stone bridge of Libourne [000, 280, 996, 421] behind') 2023-11-03 09:29:44.423 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:29:44.423 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Our adventure book [173, 566, 894, 962] this would be so cute as a DIY cover') 2023-11-03 09:30:00.952 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:30:00.953 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'person:[395, 568, 457, 759],[000, 551, 558, 998],[657, 647, 777, 979],[792, 714, 916, 988],[823, 744, 998, 998];clothing:[037, 840, 110, 911],[056, 756, 112, 827],[116, 789, 175, 864],[136, 763, 233, 842],[162, 924, 206, 998],[262, 838, 327, 904],[323, 872, 437, 998],[400, 607, 465, 748],[403, 774, 502, 878],[456, 682, 556, 782],[457, 718, 521, 821],[652, 699, 772, 919],[780, 838, 936, 973],[831, 913, 998, 998];building:[000, 000, 998, 998].') 2023-11-03 09:30:06.309 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:30:06.309 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Desk:[016, 209, 999, 999].') 2023-11-03 09:30:12.967 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:30:12.967 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Three men [325, 314, 523, 624] [107, 319, 511, 996] [426, 173, 710, 572] are gathered around a desk [289, 512, 985, 996] looking over a copy [585, 551, 795, 667] of The Teller, published by the First National Bank of Wisconsin.') 2023-11-03 09:30:33.682 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:30:33.682 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'pharmacist [459, 002, 996, 622] holding out a packaged prescription stock photo [487, 319, 734, 779] - prescription stock pictures, royalty-free photos [591, 613, 801, 728] & images') 2023-11-03 09:30:49.036 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 09:30:49.036 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'human arm:[431, 015, 734, 161],[278, 393, 697, 546],[458, 540, 631, 580],[479, 693, 663, 856],[490, 839, 689, 999],[498, 600, 655, 882],[581, 834, 729, 999],[833, 376, 923, 444];human hand:[275, 456, 380, 550],[443, 567, 473, 630],[480, 768, 554, 889];human body:[000, 000, 915, 539],[170, 325, 901, 999],[303, 000, 622, 116];human leg:[000, 000, 084, 162],[000, 333, 377, 520],[051, 076, 383, 254],[063, 000, 416, 388],[215, 578, 486, 695],[231, 445, 300, 583],[234, 530, 461, 645],[301, 669, 539, 892],[306, 901, 541, 999],[463, 599, 508, 726];woman:[000, 000, 913, 524],[245, 476, 745, 776],[809, 773, 850, 860];man:[069, 000, 785, 380],[192, 382, 838, 744],[264, 602, 762, 895],[399, 830, 813, 999],[518, 549, 830, 999],[574, 395, 824, 654],[600, 619, 830, 957],[725, 437, 820, 560],[773, 310, 952, 463],[791, 801, 875, 924],[818, 924, 944, 999];footwear:[309, 903, 354, 999].') 2023-11-03 09:31:03.463 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:31:03.463 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the white rabbit jump around in this video clip?For the trajectories included in the answer, please use the format TrackiFrame t:[xmin,ymin,xmax,ymax].', 'There are 3 white rabbit jump around.Track1frame:1:[122, 462, 253, 686];frame:2:[242, 406, 347, 591];frame:3:[267, 341, 349, 453],Track2frame:1:[261, 525, 491, 725];frame:2:[277, 556, 395, 712];frame:3:[283, 533, 417, 700],Track3frame:3:[418, 322, 492, 393].') 2023-11-03 09:31:06.583 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:31:06.583 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Towel:[561, 757, 733, 960];Person:[287, 034, 999, 997];Hat:[574, 032, 934, 336];Pot:[113, 816, 565, 938];Plate:[248, 737, 494, 803];Basin:[001, 653, 089, 788];Bottle:[309, 462, 522, 698],[194, 253, 275, 452],[270, 260, 349, 450],[432, 245, 493, 458],[355, 247, 434, 449];Gas stove:[006, 789, 597, 1000];Cup:[354, 001, 477, 062],[480, 001, 590, 077];Oven:[178, 079, 633, 676].') 2023-11-03 09:31:07.667 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:31:07.667 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[203, 020, 700, 992] A sled dog sits up with [068, 379, 849, 995] a blanket wrapped around him outside.') 2023-11-03 09:31:12.115 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:31:12.116 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect Person in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[049, 895, 076, 992],[075, 896, 111, 988].') 2023-11-03 09:31:23.863 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:31:23.864 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3,frame4 and frame5,can you point out the trajectory of the a man looking at the store?When detailing trajectories in your response, adhere to the TrackiFrame t:[xmin,ymin,xmax,ymax] format.', 'There is one a man looking at the store.Track1frame1:[280, 299, 655, 1000];frame2:[299, 302, 692, 1000].') 2023-11-03 09:31:33.410 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:31:33.410 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Person:[513, 074, 845, 789],[000, 762, 440, 1000],[232, 306, 461, 707],[175, 163, 312, 688],[304, 312, 461, 645],[373, 337, 398, 407],[420, 177, 498, 363],[293, 185, 382, 390];Book:[361, 330, 452, 409];Lamp:[376, 001, 474, 100];Chair:[228, 417, 373, 701],[326, 408, 449, 651],[000, 521, 125, 834];Desk:[352, 452, 515, 702];Frame:[806, 092, 845, 164];Clock:[902, 030, 984, 143];Bench:[435, 715, 875, 1000];Piano:[637, 136, 999, 702];Cup:[277, 344, 306, 381].') 2023-11-03 09:31:55.934 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:31:55.934 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Street Lights in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Street Lights:[799, 002, 930, 737],[603, 386, 690, 996].') 2023-11-03 09:31:57.748 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:31:57.748 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the the dog that keeps moving around. in this video clip?When detailing trajectories in your response, adhere to the TrackiFrame t:[xmin,ymin,xmax,ymax] format.', 'There is one the dog that keeps moving around..Track1frame:1:[175, 483, 384, 655];frame:2:[182, 419, 412, 510];frame:3:[146, 332, 410, 493].') 2023-11-03 09:32:26.203 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:32:26.203 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Donald Sutherland [218, 022, 799, 996] smiles and gestures, standing on a stage in spotlights [708, 003, 996, 569] while holding an award [723, 876, 779, 997].') 2023-11-03 09:32:31.544 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:32:31.545 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'human hair:[015, 347, 240, 808],[296, 606, 326, 689],[309, 205, 571, 641],[607, 631, 748, 829],[939, 442, 998, 731];human head:[267, 221, 623, 776],[000, 337, 254, 790],[621, 643, 734, 815],[950, 456, 998, 710];clothing:[129, 627, 651, 997],[000, 748, 190, 993],[587, 771, 835, 997],[857, 710, 990, 997];woman:[000, 344, 307, 997],[864, 459, 998, 997];mammal:[000, 340, 304, 997],[153, 216, 695, 997],[585, 643, 825, 997],[860, 452, 998, 997],[960, 482, 998, 988];man:[000, 328, 310, 997],[151, 191, 698, 997],[589, 650, 834, 997];human face:[118, 447, 251, 792],[500, 410, 601, 750];human arm:[507, 547, 696, 857];human nose:[581, 489, 606, 566];human hand:[214, 589, 310, 780],[529, 557, 700, 820],[967, 480, 998, 650].') 2023-11-03 09:33:04.233 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:33:04.233 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "John Green about intelligence (“Looking for Alaska”, 2005) - They love their hair [056, 077, 326, 516] because they aren't smart enough to love something more interesting.") 2023-11-03 09:33:24.644 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:33:24.644 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Person:[433, 254, 805, 1000];Camera:[499, 561, 598, 657];Satchel:[415, 660, 543, 765].') 2023-11-03 09:33:41.723 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:33:41.724 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Montgomery County health and emergency officials [004, 339, 788, 998] were administering thousands of second COVID-19 vaccination doses [656, 062, 882, 869] Tuesday, but the event caused traffic jams in both north and south county.') 2023-11-03 09:33:44.329 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 09:33:44.329 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Tripod,Tie and Piano in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Tripod:[273, 310, 406, 903];Tie:[461, 307, 496, 468];Piano:[152, 237, 622, 818].') 2023-11-03 09:33:50.496 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 09:33:50.496 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'truck:[058, 137, 990, 984];car:[092, 123, 977, 984];tire:[059, 551, 141, 705],[481, 663, 631, 981],[848, 516, 920, 719].') 2023-11-03 09:34:03.353 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:34:03.353 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[555, 491, 612, 559],carFrame1:[390, 448, 449, 555],personFrame1:[711, 476, 750, 565],carFrame1:[520, 474, 557, 530],carFrame1:[037, 438, 213, 565] in this video clip.Use only the categoryFrame t:[xmin,ymin,xmax,ymax] format to structure your answer.', 'carFrame1:[555, 491, 612, 559];Frame2:[559, 477, 615, 543],carFrame1:[390, 448, 449, 555];Frame2:[392, 437, 452, 542],personFrame1:[711, 476, 750, 565];Frame2:[879, 470, 932, 587],carFrame1[520, 474, 557, 530];Frame2:No object,carFrame1:[037, 438, 213, 565];Frame2:[000, 414, 126, 568].') 2023-11-03 09:35:13.316 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:35:13.316 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3,frame4 and frame5,can you point out the trajectory of the a woman walking down the hallway?For clarity, represent trajectories using the TrackiFrame t:[xmin,ymin,xmax,ymax] format in your response.', 'There is one a woman walking down the hallway.Track1frame1:[221, 365, 517, 1000];frame2:[202, 357, 480, 1000].') 2023-11-03 09:35:25.219 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:35:25.220 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Towel:[515, 887, 588, 1000],[315, 845, 419, 928],[452, 740, 502, 817],[584, 783, 677, 867];Chair:[687, 741, 797, 954],[152, 834, 221, 1000],[182, 668, 310, 920],[066, 609, 195, 907],[048, 545, 118, 753],[020, 537, 063, 754],[154, 521, 200, 601],[129, 511, 173, 602],[301, 470, 349, 583],[341, 480, 413, 626],[360, 599, 417, 783],[400, 686, 524, 789],[391, 525, 473, 700],[474, 540, 559, 707],[589, 588, 689, 854],[740, 656, 852, 943],[828, 594, 920, 892],[842, 537, 907, 606],[631, 468, 686, 608];Lamp:[603, 101, 696, 230],[160, 001, 358, 117];Frame:[001, 157, 496, 413];Knife:[577, 915, 621, 999];Cup:[550, 756, 595, 915],[435, 774, 483, 942].') 2023-11-03 09:35:43.350 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:35:43.351 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Hockey Stick:[680, 362, 995, 689],[404, 511, 448, 995];Helmet:[570, 089, 625, 130],[476, 199, 563, 327];Gloves:[590, 338, 662, 415],[674, 312, 719, 415],[412, 420, 479, 582],[394, 578, 495, 740];Sneakers:[463, 861, 558, 999];Person:[542, 093, 717, 410],[394, 199, 666, 1000],[000, 001, 383, 999].') 2023-11-03 09:35:59.795 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:35:59.796 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A construction meeting takes place on Zoom, where seasoned Robotics members [077, 825, 178, 996] [266, 446, 390, 678] [048, 099, 185, 335] [594, 077, 750, 339] [439, 446, 573, 678] [816, 371, 995, 677] [641, 419, 786, 676] [202, 038, 369, 338] [015, 378, 182, 676] [397, 036, 540, 337] brainstorm, discuss issues with one another, and ask and answer questions about their robot design.') 2023-11-03 09:36:40.010 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:36:40.011 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Potted Plant:[772, 639, 868, 806];shelf:[037, 622, 118, 960];Oven:[247, 637, 390, 887];Refrigerator:[564, 344, 722, 764];Dishwasher:[114, 615, 241, 906];Person:[451, 296, 599, 657].') 2023-11-03 09:36:41.544 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:36:41.544 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Cup and Hat in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Cup:[931, 494, 987, 610];Hat:[401, 355, 733, 851].') 2023-11-03 09:36:46.139 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:36:46.140 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[174, 417, 405, 997],[023, 245, 075, 322],[023, 313, 139, 867],[216, 317, 268, 536],[498, 354, 560, 618],[560, 383, 623, 681],[617, 325, 680, 528],[751, 438, 821, 582],[925, 410, 999, 652];man:[674, 304, 980, 988],[154, 214, 238, 632],[351, 253, 511, 998],[696, 153, 806, 322];footwear:[023, 786, 106, 868],[370, 940, 435, 999];girl:[024, 238, 081, 317],[185, 484, 403, 999],[222, 383, 400, 999],[407, 314, 535, 999],[496, 344, 555, 544],[553, 377, 630, 673],[921, 397, 999, 637];human face:[077, 334, 104, 399],[315, 445, 373, 540],[421, 300, 456, 373],[469, 369, 501, 442],[757, 155, 795, 240].') 2023-11-03 09:36:56.571 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:36:56.571 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[483, 350, 571, 762] and track its trajectory.When detailing trajectories in your response, adhere to the Frame t:[xmin,ymin,xmax,ymax] format.', 'This is a personFrame1:[483, 350, 571, 762];Frame2:[475, 345, 573, 762];Frame3:[436, 341, 539, 784];Frame4:[449, 341, 549, 759];Frame5:[460, 366, 530, 750].') 2023-11-03 09:37:03.298 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:37:03.298 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Lamp:[020, 118, 277, 244],[379, 001, 401, 390],[578, 001, 726, 084];Chair:[153, 618, 260, 840],[198, 636, 331, 896],[265, 657, 385, 970],[417, 656, 555, 980],[196, 583, 282, 783],[771, 549, 865, 646];Frame:[392, 371, 494, 560];Couch:[436, 566, 774, 705];Pillow:[471, 538, 515, 574],[521, 542, 571, 581],[579, 550, 639, 589],[647, 556, 715, 600];Flower:[639, 532, 690, 568];Carpet:[548, 566, 874, 731];Bench:[329, 678, 522, 821];Basin:[309, 572, 395, 634].') 2023-11-03 09:37:07.579 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:37:07.580 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'tower:[012, 137, 135, 537],[112, 348, 183, 536],[156, 151, 231, 346],[195, 245, 244, 565],[240, 265, 362, 549],[295, 442, 373, 576],[350, 210, 438, 452],[379, 353, 489, 569],[485, 346, 558, 567],[543, 406, 604, 596],[546, 336, 606, 406],[590, 404, 655, 584],[639, 448, 696, 582],[694, 300, 791, 607],[765, 199, 815, 333],[799, 290, 945, 615],[817, 034, 905, 284],[944, 450, 971, 560],[964, 444, 999, 573];skyscraper:[000, 178, 130, 535],[106, 335, 189, 543],[117, 286, 163, 353],[153, 108, 237, 350],[185, 255, 254, 547],[236, 226, 356, 553],[289, 430, 356, 560],[348, 439, 377, 557],[353, 201, 443, 399],[362, 378, 408, 568],[395, 339, 496, 577],[480, 347, 550, 559],[538, 407, 581, 574],[550, 308, 606, 420],[570, 405, 613, 585],[589, 268, 644, 424],[590, 412, 654, 591],[631, 414, 663, 493],[639, 500, 717, 599],[666, 438, 703, 522],[699, 285, 931, 613],[773, 189, 814, 312],[810, 030, 855, 308],[854, 022, 901, 299],[936, 435, 980, 534],[969, 433, 999, 556];vehicle:[743, 795, 885, 946].') 2023-11-03 09:37:14.250 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:37:14.251 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'window:[022, 059, 106, 140],[022, 564, 099, 644],[024, 443, 104, 521],[037, 196, 108, 278],[040, 323, 108, 405],[157, 574, 183, 644],[166, 330, 206, 398],[167, 446, 196, 526],[178, 210, 244, 280],[183, 000, 250, 040],[185, 082, 250, 171],[214, 951, 278, 999],[237, 455, 288, 530],[237, 583, 288, 648],[244, 335, 288, 412],[286, 000, 360, 058],[288, 098, 349, 185],[288, 219, 349, 292],[317, 586, 348, 655],[327, 465, 358, 533],[327, 338, 360, 415],[339, 904, 401, 999],[411, 227, 484, 315],[414, 355, 475, 420],[416, 111, 481, 190],[422, 000, 491, 068],[503, 772, 542, 981],[512, 239, 580, 318],[516, 361, 583, 424],[516, 130, 586, 193],[516, 000, 589, 075],[565, 748, 606, 948],[635, 713, 678, 925],[639, 135, 712, 210],[647, 260, 714, 331],[650, 373, 714, 455],[657, 014, 716, 089],[719, 692, 771, 895],[744, 383, 814, 458],[750, 010, 814, 096],[753, 265, 814, 341],[753, 142, 821, 219],[796, 649, 832, 868],[876, 159, 935, 234],[876, 273, 939, 343],[878, 034, 945, 110],[907, 610, 999, 813],[975, 038, 999, 123],[981, 164, 999, 239];skyscraper:[000, 000, 999, 848].') 2023-11-03 09:37:16.242 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:37:16.243 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A still from [623, 237, 937, 995] the film version of Watership Down.') 2023-11-03 09:37:24.602 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:37:24.602 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[361, 313, 404, 512],personFrame1:[723, 412, 777, 653],personFrame1:[151, 327, 214, 525] in this video clip.Keep your response consistent with the classFrame t:[xmin,ymin,xmax,ymax] format.', 'personFrame1:[361, 313, 404, 512];Frame2:[360, 312, 404, 513],personFrame1:[723, 412, 777, 653];Frame2:[708, 417, 837, 656],personFrame1:[151, 327, 214, 525];Frame2:[152, 323, 213, 525].') 2023-11-03 09:37:39.955 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:37:39.956 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Flower:[186, 305, 313, 381],[001, 235, 169, 345],[079, 700, 262, 861],[001, 589, 261, 720];Vase:[001, 294, 100, 351],[024, 693, 208, 761],[112, 799, 236, 865];Potted Plant:[001, 713, 119, 868],[736, 523, 790, 557];Carpet:[195, 621, 751, 789];Chair:[106, 516, 411, 773],[572, 516, 894, 795];Pillow:[696, 568, 835, 685];Stool:[372, 579, 472, 684],[494, 584, 597, 695];Couch:[263, 461, 717, 620];Lamp:[941, 032, 999, 198].') 2023-11-03 09:37:41.032 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:37:41.032 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[166, 500, 568, 998] and track its trajectory.Your response should highlight trajectories using the established Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a bumper carFrame1:[166, 500, 568, 998];Frame2:[226, 400, 651, 998];Frame3:[280, 354, 662, 978];Frame4:[298, 384, 608, 878].') 2023-11-03 09:38:04.330 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:38:04.331 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Person:[001, 183, 092, 705],[064, 207, 132, 486],[111, 202, 187, 487],[149, 246, 345, 694],[120, 099, 184, 255],[091, 044, 158, 217],[213, 002, 279, 158],[291, 040, 344, 161],[341, 041, 398, 126],[341, 002, 421, 125],[400, 002, 490, 138],[466, 047, 531, 162],[451, 128, 501, 266],[482, 115, 549, 279],[526, 002, 590, 075],[578, 002, 646, 111],[615, 033, 691, 179],[661, 002, 738, 180],[706, 029, 781, 184],[739, 002, 804, 084],[767, 112, 855, 335],[833, 125, 970, 325],[922, 035, 999, 187],[907, 003, 967, 103],[839, 054, 903, 174],[903, 045, 957, 192],[522, 248, 599, 531],[568, 214, 658, 536],[531, 132, 605, 283].') 2023-11-03 09:38:10.338 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:38:10.339 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Horizontal overhead image of [081, 002, 923, 995] a tart in a ceramic pan that's ready to bake, with ricotta filling and [658, 144, 772, 280] [335, 687, 492, 910] [439, 338, 557, 489] [301, 088, 412, 227] [520, 218, 645, 385] [541, 049, 655, 204] [475, 597, 614, 828] [194, 561, 328, 746] [419, 093, 545, 262] [311, 235, 451, 405] [532, 419, 669, 600] [620, 593, 784, 801] [342, 474, 493, 683] [647, 289, 802, 467] [214, 380, 360, 567] [171, 227, 313, 401] sliced fresh figs on top, on a mottled gray background.") 2023-11-03 09:38:38.274 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:38:38.274 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[680, 497, 717, 676],personFrame1:[447, 301, 488, 583],personFrame1:[391, 416, 446, 572],personFrame1:[525, 538, 582, 765],personFrame1:[528, 486, 565, 684],personFrame1:[465, 570, 514, 747],personFrame1:[817, 576, 862, 744] in this video clip.Your reply should be in alignment with the classFrame t:[xmin,ymin,xmax,ymax] structure.', 'personFrame1:[680, 497, 717, 676];Frame2:[641, 476, 680, 648],personFrame1:[447, 301, 488, 583];Frame2:[417, 366, 466, 602],personFrame1:[391, 416, 446, 572];Frame2:[377, 348, 414, 570],personFrame1:[525, 538, 582, 765];Frame2:[489, 550, 528, 766],personFrame1[528, 486, 565, 684];Frame2:No object,personFrame1:[465, 570, 514, 747];Frame2:[425, 570, 470, 768],personFrame1:[817, 576, 862, 744];Frame2:[782, 536, 821, 716].') 2023-11-03 09:38:43.731 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:38:43.732 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect footwear,person,book and shelf in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'footwear:[357, 826, 420, 877],[402, 807, 481, 856];person:[322, 093, 456, 872],[520, 459, 621, 822];book:[000, 772, 028, 900],[000, 760, 103, 843],[012, 346, 117, 417],[012, 558, 120, 630],[066, 425, 143, 505],[067, 648, 181, 723],[134, 754, 218, 823],[190, 432, 265, 494],[193, 643, 239, 710],[201, 945, 290, 999],[206, 311, 231, 377],[244, 509, 357, 589],[261, 743, 351, 816],[318, 624, 362, 695],[384, 735, 404, 802],[431, 718, 513, 793],[445, 602, 541, 668],[496, 251, 622, 324],[506, 474, 549, 555],[518, 712, 553, 783],[565, 355, 654, 428],[608, 701, 660, 774],[632, 238, 700, 299],[645, 461, 668, 524],[658, 348, 701, 412],[659, 666, 737, 764];shelf:[000, 241, 778, 928].') 2023-11-03 09:39:55.671 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:39:55.672 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[277, 503, 712, 808] and track its trajectory.Any trajectory details should be arranged according to the Frame t:[xmin,ymin,xmax,ymax] format for clarity.', 'This is a airplaneFrame1:[277, 503, 712, 808];Frame2:[382, 317, 826, 649];Frame3:[340, 275, 768, 600];Frame4:[298, 212, 696, 522].') 2023-11-03 09:40:18.480 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:40:18.481 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Person:[604, 502, 745, 999],[686, 493, 853, 999],[775, 535, 991, 998],[918, 610, 998, 996].') 2023-11-03 09:40:48.942 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:40:48.942 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'suit:[680, 252, 983, 998];man:[671, 206, 982, 998],[008, 191, 360, 999];woman:[003, 342, 063, 993],[223, 373, 386, 999],[428, 340, 736, 999];girl:[261, 395, 418, 999],[428, 313, 568, 999];human face:[000, 406, 037, 516],[147, 232, 252, 434],[231, 388, 286, 470],[287, 450, 355, 560],[466, 384, 536, 505],[592, 378, 685, 545],[734, 242, 844, 461].') 2023-11-03 09:40:50.627 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:40:50.628 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'girl:[096, 090, 344, 940],[152, 392, 351, 999],[286, 220, 438, 542],[340, 342, 591, 999],[435, 237, 535, 486],[460, 406, 750, 999],[526, 175, 716, 469],[693, 189, 836, 638],[744, 351, 956, 999];tree:[000, 000, 027, 324],[011, 000, 077, 296],[084, 000, 168, 346],[250, 000, 347, 245],[295, 000, 443, 330],[428, 017, 478, 260],[463, 000, 576, 279],[572, 000, 615, 175],[636, 000, 668, 210],[692, 005, 726, 279],[728, 009, 762, 198],[777, 000, 836, 317],[866, 000, 947, 307];footwear:[125, 845, 178, 937];human face:[195, 119, 253, 230],[238, 432, 303, 553],[341, 245, 397, 333],[431, 369, 510, 529],[459, 248, 521, 350],[586, 202, 647, 307],[619, 438, 691, 572],[736, 209, 801, 322],[799, 394, 869, 515].') 2023-11-03 09:40:57.865 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:40:57.865 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Deadly Shame - [063, 146, 550, 855] a series of animated shorts exploring nurses' experiences on the front lines of the pandemic and more") 2023-11-03 09:40:59.397 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:40:59.397 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[543, 479, 568, 520] and track its trajectory.Stick to the Frame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'This is a handFrame1:[543, 479, 568, 520];Frame2:[425, 363, 450, 405];Frame3:[450, 350, 471, 397];Frame4:[589, 466, 615, 501].') 2023-11-03 09:41:05.415 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:41:05.415 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Lamp in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Lamp:[277, 339, 347, 441],[652, 339, 725, 441],[301, 000, 772, 219],[410, 221, 554, 377].') 2023-11-03 09:41:15.299 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:41:15.300 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[026, 088, 967, 646] reindeer with [049, 504, 934, 806] a world map') 2023-11-03 09:41:40.077 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:41:40.078 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Car in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Car:[896, 222, 999, 312],[423, 198, 868, 358],[134, 193, 455, 389].') 2023-11-03 09:41:58.765 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:41:58.765 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Hat:[859, 256, 965, 365];Desk:[018, 226, 190, 345],[198, 311, 1000, 997],[631, 304, 999, 473],[328, 231, 477, 357],[713, 224, 847, 291];Chair:[051, 638, 445, 999],[014, 545, 262, 761],[001, 392, 096, 555],[611, 395, 757, 486],[800, 380, 876, 475],[664, 338, 783, 493],[560, 315, 665, 427],[272, 251, 345, 323];shelf:[771, 061, 932, 289];Lamp:[253, 154, 609, 483],[715, 165, 999, 329],[049, 146, 096, 253];Laptop:[414, 407, 607, 643],[211, 266, 261, 354],[643, 246, 711, 318],[695, 244, 753, 309];Person:[869, 226, 971, 318],[736, 207, 819, 310],[789, 257, 1000, 586],[541, 175, 674, 447],[182, 300, 455, 708],[052, 306, 639, 993],[023, 217, 170, 525],[017, 253, 100, 395],[146, 178, 197, 264],[211, 181, 258, 272],[283, 223, 393, 341];Book:[731, 490, 876, 557].') 2023-11-03 09:42:00.494 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:42:00.494 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'tower:[473, 000, 728, 817],[361, 324, 395, 517],[395, 160, 466, 578];tree:[008, 393, 408, 634],[015, 604, 136, 735],[458, 501, 519, 580],[464, 408, 501, 515],[608, 284, 999, 529],[900, 554, 955, 662].') 2023-11-03 09:42:24.292 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:42:24.293 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given an image,please detect person in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'person:[663, 831, 869, 1000],[938, 773, 1000, 1000],[772, 772, 932, 1000],[594, 769, 740, 1000],[098, 888, 243, 1000],[000, 736, 074, 1000],[081, 794, 226, 1000],[128, 788, 254, 1000],[071, 730, 201, 1000],[088, 726, 216, 1000],[498, 723, 647, 1000],[421, 663, 577, 1000],[602, 668, 735, 1000],[731, 696, 840, 1000],[879, 660, 999, 1000],[893, 617, 990, 970],[793, 635, 885, 968],[660, 629, 772, 950],[553, 616, 657, 956],[423, 644, 554, 965],[917, 566, 1000, 860],[847, 577, 943, 809],[774, 500, 845, 730],[724, 505, 795, 757],[564, 388, 731, 793],[704, 480, 764, 677],[677, 491, 750, 677],[508, 508, 582, 690],[495, 510, 552, 683],[484, 489, 549, 671],[424, 494, 495, 796],[389, 467, 458, 711],[375, 505, 441, 741],[310, 508, 390, 830],[299, 480, 370, 773],[245, 500, 309, 755],[234, 593, 357, 964],[230, 569, 354, 909],[032, 593, 187, 924],[083, 511, 173, 879],[066, 483, 157, 806],[149, 488, 220, 831],[160, 489, 239, 796],[863, 827, 968, 882].') 2023-11-03 09:42:36.265 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:42:36.265 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[318, 053, 953, 990] Pablo Pineau stands in jeans and a dark sweater next to bookshelves overflowing with books and journals. [000, 005, 987, 990] The picture is in black and white.') 2023-11-03 09:42:37.193 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:42:37.193 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Umbrella:[017, 255, 558, 712],[000, 353, 222, 687];Desk:[156, 586, 448, 788],[000, 592, 057, 703],[866, 597, 1000, 832];Chair:[697, 548, 956, 820].') 2023-11-03 09:43:01.174 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:43:01.174 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[493, 277, 626, 434] and track its trajectory.Ensure that each trajectory within your response is articulated using the Frame t:[xmin,ymin,xmax,ymax] specification.', 'This is a gooseFrame1:[493, 277, 626, 434];Frame2:[402, 258, 580, 500];Frame3:[394, 113, 603, 463];Frame4:[353, 313, 729, 512];Frame5:[525, 202, 848, 530].') 2023-11-03 09:43:03.694 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:43:03.695 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Chair:[027, 233, 226, 502],[516, 204, 675, 407];Chopsticks:[149, 762, 198, 953],[003, 610, 100, 687],[621, 831, 692, 997];Plate:[112, 797, 220, 918],[146, 889, 428, 1000],[001, 748, 117, 903],[001, 631, 114, 725],[002, 522, 132, 621],[173, 483, 382, 598],[215, 395, 378, 481],[392, 372, 549, 445],[536, 410, 708, 484],[733, 444, 888, 524],[556, 517, 712, 581],[343, 533, 577, 671],[542, 899, 716, 1000],[720, 796, 973, 992],[896, 575, 1000, 695];Cup:[001, 705, 117, 999],[093, 548, 185, 794],[101, 417, 164, 549],[248, 335, 297, 415],[692, 359, 749, 530],[439, 540, 497, 608],[627, 620, 714, 896];Basin:[569, 461, 714, 568];Fork:[655, 892, 749, 996];Knife:[001, 849, 117, 940];Bottle:[357, 360, 417, 552].') 2023-11-03 09:43:26.214 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:43:26.214 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'man:[166, 297, 272, 683],[071, 380, 118, 576],[325, 437, 350, 528],[406, 416, 430, 554],[585, 384, 654, 616];tree:[026, 001, 280, 458],[295, 206, 327, 537],[300, 000, 678, 673],[579, 000, 999, 516],[627, 297, 656, 561],[880, 294, 930, 413];clothing:[000, 417, 030, 498],[029, 414, 071, 498],[071, 398, 114, 563],[120, 430, 155, 545],[176, 343, 283, 561],[255, 491, 298, 556],[325, 448, 349, 517],[408, 432, 433, 518],[445, 451, 462, 544],[498, 445, 508, 521],[510, 458, 526, 542],[519, 430, 563, 532],[537, 437, 566, 560],[585, 408, 648, 610],[702, 473, 838, 685].') 2023-11-03 09:43:52.568 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:43:52.569 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'boy:[370, 206, 640, 1000],[697, 401, 849, 999];human hair:[412, 245, 541, 408],[000, 442, 043, 688],[243, 621, 333, 680],[610, 485, 658, 581],[706, 429, 736, 499],[738, 396, 816, 482],[913, 396, 999, 502];man:[375, 196, 661, 1000],[206, 520, 430, 999],[675, 427, 745, 997],[693, 393, 856, 999],[865, 391, 999, 999];glasses:[405, 342, 546, 409],[915, 469, 980, 494];shirt:[371, 456, 686, 999],[893, 507, 997, 861];human head:[000, 441, 010, 591],[245, 530, 343, 705],[411, 237, 544, 484],[605, 482, 662, 572],[701, 429, 741, 518],[737, 401, 809, 537],[906, 399, 999, 557];human face:[320, 621, 348, 705],[426, 298, 537, 485],[608, 500, 659, 567],[722, 443, 738, 524],[760, 436, 808, 559],[904, 450, 964, 560];human arm:[071, 850, 179, 999],[236, 741, 377, 945],[609, 524, 686, 999],[722, 493, 832, 726],[765, 630, 859, 743],[871, 559, 996, 753];human hand:[335, 555, 403, 767],[798, 492, 830, 555],[868, 684, 913, 754].') 2023-11-03 09:43:59.188 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:43:59.188 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'To remedy the matter, GM will notify owners, and dealers will reprogram the ECM software, at no cost to owners. [004, 003, 996, 991] - Photo courtesy of General Motors.') 2023-11-03 09:44:04.327 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:44:04.328 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'human hair:[731, 193, 843, 388],[019, 156, 066, 227],[068, 179, 131, 252],[120, 198, 223, 348],[293, 190, 363, 269],[368, 212, 486, 341],[478, 209, 553, 312],[576, 176, 703, 421],[815, 272, 878, 369],[866, 296, 999, 459];table:[307, 536, 900, 956],[220, 463, 490, 598];human head:[731, 190, 848, 365],[016, 166, 071, 257],[058, 168, 138, 343],[120, 201, 231, 419],[160, 137, 223, 222],[289, 197, 366, 340],[366, 220, 487, 357],[481, 230, 545, 360],[611, 175, 696, 349],[800, 276, 895, 463],[853, 307, 999, 500];drink:[271, 367, 308, 499];human body:[058, 201, 575, 999];human leg:[007, 623, 094, 999],[257, 694, 591, 999],[567, 694, 841, 999];girl:[000, 153, 066, 312],[271, 176, 999, 999];clothing:[000, 213, 999, 999],[055, 377, 526, 970];human face:[033, 187, 061, 260],[073, 219, 126, 344],[151, 269, 229, 414],[307, 231, 357, 343],[400, 256, 451, 354],[488, 245, 539, 352],[622, 223, 686, 341],[743, 221, 809, 344],[814, 328, 868, 454],[855, 329, 940, 502];human arm:[000, 252, 045, 364],[000, 368, 053, 653],[138, 446, 322, 756],[185, 397, 250, 535],[267, 291, 393, 383],[468, 313, 577, 390],[468, 329, 588, 533],[573, 348, 689, 551],[778, 558, 991, 813],[922, 453, 998, 557].') 2023-11-03 09:44:17.126 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:44:17.126 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Wine Glass in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Wine Glass:[302, 002, 533, 128].') 2023-11-03 09:44:57.918 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:44:57.918 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Gloves:[243, 617, 310, 680],[290, 505, 348, 564],[707, 503, 754, 607],[699, 257, 731, 309],[835, 531, 877, 600],[945, 344, 986, 412],[977, 303, 1000, 382],[874, 538, 1000, 762];Hat:[169, 274, 238, 372],[586, 135, 669, 266],[555, 228, 590, 301],[982, 172, 1000, 209];Glasses:[978, 202, 1000, 246];Sneakers:[888, 872, 924, 971],[952, 872, 998, 954];Skiboard:[093, 320, 411, 998],[649, 026, 807, 1000],[830, 125, 887, 1000];Cup:[918, 545, 987, 708];Person:[115, 273, 348, 1000],[349, 320, 370, 364].') 2023-11-03 09:45:28.249 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:45:28.250 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'dessert:[037, 506, 480, 1000],[000, 281, 073, 543],[000, 127, 117, 347],[164, 335, 520, 710],[177, 043, 465, 281],[189, 164, 486, 408],[470, 606, 818, 995],[562, 187, 865, 545],[569, 058, 831, 254],[921, 037, 997, 277];snack:[000, 283, 074, 541],[000, 104, 126, 343],[055, 525, 480, 987],[167, 318, 518, 729],[174, 033, 481, 281],[189, 170, 504, 450],[455, 600, 815, 997],[558, 041, 864, 289],[574, 168, 872, 543],[903, 000, 998, 214],[933, 054, 998, 337],[979, 279, 998, 377];baked goods:[000, 285, 070, 539],[000, 114, 116, 356],[034, 529, 469, 997],[173, 318, 515, 739],[180, 033, 477, 275],[182, 139, 491, 431],[447, 600, 818, 997],[560, 177, 875, 531],[565, 029, 844, 258],[901, 006, 998, 187],[929, 079, 998, 329],[983, 297, 998, 393].') 2023-11-03 09:45:31.280 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:45:31.281 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A sliced open acorn squash [197, 051, 992, 995] has been filled with red dried cranberries and a spoonful of brown sugar.') 2023-11-03 09:45:35.502 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:45:35.503 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Baseball Glove:[600, 362, 660, 496],[910, 877, 988, 970];Baseball:[915, 905, 948, 948];Sneakers:[244, 365, 343, 533],[013, 786, 121, 887];Hat:[701, 675, 793, 772];Other Shoes:[794, 686, 848, 791],[748, 769, 792, 813];Belt:[485, 843, 529, 979];Person:[004, 369, 954, 982],[599, 001, 860, 810],[242, 001, 351, 262],[000, 001, 105, 817].') 2023-11-03 09:45:41.184 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:45:41.185 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Picture,Chair and Rice in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Picture:[761, 404, 889, 518],[961, 396, 1000, 518];Chair:[852, 628, 1000, 972],[037, 715, 168, 964],[911, 559, 945, 602].') 2023-11-03 09:46:56.224 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:46:56.224 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Chair:[580, 666, 876, 999],[161, 415, 516, 1000],[000, 307, 184, 815],[520, 304, 789, 703],[328, 245, 534, 630],[166, 196, 339, 465],[055, 169, 203, 403],[000, 145, 123, 345],[182, 127, 288, 302],[280, 143, 391, 365],[753, 259, 955, 634],[527, 197, 701, 462],[395, 164, 526, 393],[337, 114, 399, 233],[437, 132, 541, 325],[551, 147, 665, 316],[679, 176, 820, 426],[880, 213, 1000, 391],[778, 158, 901, 366],[654, 137, 765, 323],[550, 116, 637, 268],[452, 104, 532, 211];Desk:[147, 595, 492, 877],[000, 427, 165, 505],[578, 406, 837, 602],[325, 323, 520, 474],[169, 257, 319, 374],[045, 222, 155, 311],[000, 189, 063, 270],[173, 158, 261, 207],[276, 185, 379, 254],[391, 209, 518, 290],[562, 258, 724, 313],[829, 339, 999, 481],[908, 272, 1000, 344],[818, 197, 936, 280],[719, 225, 855, 325],[678, 171, 782, 241],[557, 187, 673, 264],[446, 166, 536, 213],[336, 142, 416, 188].') 2023-11-03 09:47:07.983 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:47:07.983 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[464, 349, 751, 679] and track its trajectory.Use the specified Frame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'This is a horse cartFrame1:[464, 349, 751, 679];Frame2:[447, 363, 718, 677];Frame3:[443, 387, 698, 682];Frame4:[477, 347, 718, 628];Frame5:[516, 343, 758, 636].') 2023-11-03 09:47:18.992 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:47:18.992 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[504, 277, 764, 862],[211, 284, 659, 841],[931, 358, 998, 823];human hair:[344, 304, 455, 422],[456, 394, 494, 462],[625, 255, 713, 375],[688, 375, 736, 437],[783, 276, 872, 409],[973, 362, 998, 451];swimwear:[230, 693, 416, 838],[290, 509, 431, 629],[569, 719, 709, 825],[573, 405, 683, 600],[931, 402, 998, 656];human body:[147, 290, 171, 362],[231, 287, 669, 847],[438, 393, 491, 686],[621, 276, 943, 861],[689, 369, 751, 512],[747, 372, 761, 466],[933, 352, 998, 829];human leg:[438, 529, 463, 675];mammal:[219, 277, 678, 830],[394, 218, 407, 288],[437, 395, 503, 677],[504, 255, 729, 868],[630, 270, 950, 865],[688, 381, 758, 554],[740, 375, 765, 455],[929, 343, 998, 823];man:[627, 273, 952, 858];girl:[221, 287, 675, 848],[427, 400, 496, 670],[501, 261, 732, 855],[745, 372, 760, 461],[931, 369, 998, 829];tree:[000, 000, 162, 181],[001, 004, 127, 231],[064, 000, 164, 229],[143, 031, 226, 175],[193, 006, 278, 141],[272, 095, 396, 158],[285, 000, 396, 073],[352, 000, 540, 281],[541, 000, 812, 286],[771, 000, 998, 379];human arm:[408, 548, 469, 613],[421, 411, 671, 606],[500, 437, 598, 834],[623, 547, 775, 662],[687, 450, 739, 865],[716, 443, 848, 840],[939, 418, 981, 650].') 2023-11-03 09:47:28.533 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:47:28.534 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Microphone:[446, 598, 490, 860];Person:[120, 062, 313, 999],[387, 003, 722, 999],[498, 003, 802, 1000].') 2023-11-03 09:47:36.362 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:47:36.363 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the the bus on the go.?As you detail the trajectories in your reply, the TrackiFrame t:[xmin,ymin,xmax,ymax] format is the preferred structure.', 'There is no the bus on the go..') 2023-11-03 09:47:41.451 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:47:41.451 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Person:[110, 053, 391, 992],[705, 362, 1000, 1000],[665, 169, 987, 722],[951, 399, 1000, 665],[942, 292, 995, 365];Helmet:[192, 051, 267, 198];Gloves:[157, 415, 199, 479];Glasses:[801, 284, 865, 342];Hat:[783, 168, 910, 296],[806, 362, 974, 580];Boots:[312, 636, 393, 947],[112, 723, 170, 995];Horse:[156, 191, 404, 1000].') 2023-11-03 09:47:52.019 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:47:52.019 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'boat:[042, 489, 941, 702];paddle:[000, 504, 144, 560],[230, 560, 999, 720],[461, 533, 999, 631],[649, 540, 999, 608],[860, 522, 999, 557];boy:[517, 275, 728, 590];woman:[116, 241, 354, 637],[348, 275, 541, 628],[523, 282, 732, 594],[731, 300, 905, 596].') 2023-11-03 09:47:59.817 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:47:59.817 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Police officers [113, 347, 209, 594] in a patrol car [104, 471, 830, 927] keep people [037, 428, 128, 849] [352, 262, 426, 504] [746, 460, 860, 733] [466, 277, 532, 526] [576, 375, 657, 639] [843, 465, 961, 853] moving in Greenwich Park, London') 2023-11-03 09:49:00.967 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:49:00.968 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "The UMass Dartmouth women's tennis team [235, 138, 984, 993] is shown here at its Juvenile Myositis awareness event. Women's tennis was one of eight sports dropped from UMD intercollegiate athletics this week.") 2023-11-03 09:49:34.512 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:49:34.512 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'In The Manger 1000 Piece Jigsaw Puzzle') 2023-11-03 09:49:55.553 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:49:55.553 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[434, 464, 512, 588],carFrame1:[310, 382, 408, 579],carFrame1:[112, 503, 210, 621] in this video clip.Use only the categoryFrame t:[xmin,ymin,xmax,ymax] format to structure your answer.', 'carFrame1:[434, 464, 512, 588];Frame2:[428, 454, 511, 588],carFrame1:[310, 382, 408, 579];Frame2:[313, 377, 408, 574],carFrame1:[112, 503, 210, 621];Frame2:[163, 500, 237, 591].') 2023-11-03 09:49:56.834 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:49:56.834 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The new design element ethnic Chinese wind hit [392, 395, 833, 877] the color three-dimensional flowers, [093, 068, 898, 903] handmade handbag women shoulder bag') 2023-11-03 09:50:19.719 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:50:19.719 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect building,human face and man in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'building:[000, 000, 360, 945],[356, 000, 999, 897];human face:[638, 793, 697, 911],[048, 806, 853, 999];man:[127, 822, 210, 999],[296, 811, 331, 906],[640, 796, 695, 895].') 2023-11-03 09:50:30.369 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:50:30.369 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Machinery Vehicle:[249, 001, 934, 818],[676, 867, 713, 1000],[711, 732, 729, 825];Street Lights:[251, 015, 314, 272],[057, 525, 090, 611],[902, 289, 931, 336];Bicycle:[000, 614, 058, 721];Train:[722, 535, 757, 664];Other Shoes:[043, 703, 052, 715],[030, 701, 041, 713],[203, 741, 224, 758],[167, 764, 191, 777],[218, 708, 238, 725],[224, 689, 248, 704],[182, 722, 196, 739],[235, 667, 245, 689],[255, 692, 264, 700],[287, 679, 304, 692],[268, 667, 278, 678],[308, 665, 319, 676],[982, 506, 991, 517],[958, 510, 970, 522],[918, 560, 935, 573],[896, 555, 905, 575],[933, 559, 947, 569],[949, 558, 960, 568];Handbag:[306, 592, 318, 620];Flag:[371, 270, 478, 560];Person:[212, 304, 545, 1000],[949, 384, 1000, 520],[165, 474, 239, 735],[118, 477, 224, 775],[000, 559, 057, 714].') 2023-11-03 09:50:41.922 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 09:50:41.922 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'door:[433, 539, 523, 748];house:[000, 000, 998, 998];tree:[791, 000, 998, 264];clothing:[018, 046, 193, 171],[413, 453, 522, 535];human face:[083, 000, 137, 089],[423, 339, 500, 472].') 2023-11-03 09:52:34.512 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:52:34.512 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect human eye,human mouth,human nose,human hair,mammal and man in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'human eye:[360, 053, 443, 088],[520, 051, 596, 073];human mouth:[425, 168, 565, 226];human nose:[436, 065, 531, 148];human hair:[225, 000, 711, 302];mammal:[026, 000, 998, 998];man:[038, 000, 998, 998].') 2023-11-03 09:52:42.008 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:52:42.009 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[000, 055, 998, 747] A Keck Graduate Institute building with [189, 649, 419, 809] a sigh in from of it displaying [754, 175, 852, 251] the KGI name and logo.') 2023-11-03 09:52:43.620 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:52:43.620 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[281, 172, 457, 995] and track its trajectory.Stick to the Frame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'This is a personFrame1:[281, 172, 457, 995];Frame2:[377, 297, 556, 993];Frame3:[358, 260, 531, 995];Frame4:[331, 272, 511, 995];Frame5:[368, 306, 550, 993].') 2023-11-03 09:52:58.245 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:52:58.245 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'WFSE union members [441, 199, 696, 643] [051, 527, 345, 997] [677, 322, 997, 997] in green rain ponchos [440, 329, 698, 650] hold protest signs [433, 416, 605, 650] [409, 316, 510, 457] [600, 472, 798, 727] [077, 383, 215, 568] [714, 624, 996, 978] [042, 621, 157, 853] about protecting private info during a rally on the Olympia Capitol steps.') 2023-11-03 09:53:18.828 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:53:18.828 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Chair:[766, 344, 887, 468];Person:[084, 276, 123, 390];Car:[152, 283, 291, 467],[248, 167, 901, 797].') 2023-11-03 09:53:21.812 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:53:21.813 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Frame:[653, 280, 857, 795],[868, 387, 957, 711],[514, 379, 646, 765],[463, 504, 523, 687],[376, 535, 455, 681],[334, 601, 376, 680],[374, 681, 434, 740];Chair:[521, 748, 584, 880];Cabinet:[585, 718, 676, 911],[244, 622, 329, 755];Couch:[284, 730, 328, 840],[170, 721, 299, 873];Desk:[318, 732, 386, 822];Carpet:[296, 826, 448, 970];Person:[903, 466, 945, 677],[737, 420, 809, 639],[669, 364, 742, 654],[559, 541, 632, 687],[516, 643, 592, 878],[538, 522, 579, 659],[478, 571, 515, 666],[373, 684, 425, 824],[218, 528, 253, 689],[137, 523, 208, 688];Book:[761, 908, 885, 971],[736, 874, 778, 933],[799, 810, 862, 916].') 2023-11-03 09:53:35.929 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:53:35.930 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Person:[136, 097, 999, 999];Plate:[003, 908, 372, 1000];Spoon:[003, 921, 203, 1000].') 2023-11-03 09:53:39.835 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:53:39.836 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'squirrel:[160, 107, 597, 923],[835, 000, 970, 400].') 2023-11-03 09:53:44.808 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:53:44.809 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Teach students how to write great biographies and autobiographies using [008, 008, 984, 979] picture books as your lessons.') 2023-11-03 09:53:51.711 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:53:51.712 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Remote:[893, 653, 962, 676];Bottle:[000, 938, 076, 1000],[061, 897, 147, 999];Cup:[000, 790, 066, 948],[000, 047, 035, 203],[040, 121, 146, 207];Plate:[050, 197, 166, 230];Frame:[388, 238, 450, 368];Lamp:[331, 400, 384, 486],[119, 001, 561, 143];Flower:[861, 464, 922, 545];Vase:[876, 526, 909, 557];Desk:[824, 539, 960, 652],[789, 616, 999, 999];Chair:[852, 513, 999, 636],[811, 595, 999, 738];Pillow:[474, 510, 571, 557],[477, 450, 547, 523],[394, 480, 490, 548],[291, 506, 401, 566],[144, 545, 239, 611],[220, 460, 361, 562],[094, 465, 209, 560],[196, 439, 341, 552],[093, 441, 184, 485];Bed:[093, 430, 654, 998].') 2023-11-03 09:54:32.339 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:54:32.339 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the dog in yellow and white shift around in this video clip?For clarity, represent trajectories using the TrackiFrame t:[xmin,ymin,xmax,ymax] format in your response.', 'There are 3 dog in yellow and white shift around.Track1frame:1:[171, 269, 579, 963];frame:2:[156, 158, 608, 963];frame:3:[247, 251, 659, 966],Track2frame:1:[463, 375, 652, 772];frame:2:[516, 382, 685, 854];frame:3:[530, 418, 708, 860],Track3frame:1:[230, 605, 520, 907];frame:2:[116, 484, 468, 910];frame:3:[000, 481, 456, 954].') 2023-11-03 09:54:44.859 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:54:44.859 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'man:[550, 513, 790, 995],[008, 515, 090, 688],[064, 507, 103, 583],[085, 547, 334, 908],[105, 530, 299, 733],[113, 552, 330, 825],[115, 486, 265, 649],[119, 468, 241, 647],[127, 499, 280, 688],[175, 444, 244, 494],[181, 439, 250, 493],[260, 416, 301, 487],[304, 334, 997, 538],[311, 454, 357, 549],[321, 457, 391, 528],[334, 464, 391, 551],[341, 457, 430, 567],[418, 489, 567, 785],[558, 444, 635, 560],[605, 454, 688, 591],[669, 478, 807, 646],[763, 487, 887, 754],[857, 423, 919, 515],[862, 428, 959, 589];woman:[080, 614, 387, 998],[258, 348, 999, 842],[464, 528, 675, 885];clothing:[000, 314, 999, 998],[566, 612, 765, 914].') 2023-11-03 09:55:07.741 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:55:07.742 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[585, 230, 675, 383] and track its trajectory.Your response should highlight trajectories using the established Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a faceFrame1:[585, 230, 675, 383];Frame2:[580, 265, 669, 423];Frame3:[565, 259, 656, 418];Frame4:[570, 215, 657, 366].') 2023-11-03 09:55:57.327 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:55:57.328 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'UC grad Ben Paulus jumps in the air wearing [373, 193, 666, 712] his graduation gown in front of [025, 007, 962, 983] McMicken Hall.') 2023-11-03 09:55:59.114 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:55:59.114 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the wheelchair shifting slightly?Stick to the TrackiFrame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'There is one wheelchair shifting slightly.Track1frame:1:[195, 475, 997, 998];frame:2:[210, 489, 997, 998];frame:3:[216, 481, 997, 998].') 2023-11-03 09:56:35.405 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:56:35.406 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'PHOTO: Boston Marathon bomber Dzhokhar Tsarnaev stopped in a Massachusetts Whole Foods to buy milk less than a half hour after the deadly attack.') 2023-11-03 09:56:54.918 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:56:54.918 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Listening in:\xa0Pete's name could be seen on the iPhone screen [311, 137, 607, 612] as one of the real estate agents [254, 245, 998, 875] held the device up during the proceedings") 2023-11-03 09:57:27.678 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:57:27.678 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Person:[594, 412, 701, 600],[542, 438, 640, 656],[454, 388, 560, 582],[002, 415, 135, 657],[196, 001, 266, 153];Helmet:[082, 414, 116, 462],[527, 387, 553, 438],[573, 439, 597, 474],[619, 411, 642, 461];Sneakers:[501, 548, 526, 582],[653, 568, 675, 599].') 2023-11-03 09:58:01.209 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:58:01.209 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Person:[002, 106, 440, 999],[291, 157, 855, 999],[539, 003, 740, 268];Book:[325, 855, 573, 956];Flag:[730, 003, 874, 293];Chair:[285, 255, 1000, 1000],[002, 182, 511, 999];Desk:[002, 298, 104, 469],[132, 024, 270, 389];Tie:[169, 368, 294, 754].') 2023-11-03 09:58:12.154 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:58:12.154 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '`Either` and [023, 098, 970, 892] a note on its injection functions.') 2023-11-03 09:58:35.518 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:58:35.518 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'bicycle:[000, 105, 602, 848],[000, 000, 624, 998],[000, 000, 673, 610],[363, 000, 745, 518],[510, 000, 799, 403],[626, 000, 999, 998],[749, 089, 999, 386],[767, 055, 999, 569],[821, 033, 999, 509];bicycle wheel:[000, 144, 999, 619],[048, 057, 631, 998],[627, 000, 999, 998];wheel:[000, 000, 999, 998];tire:[000, 282, 056, 611],[049, 000, 999, 984].') 2023-11-03 09:58:58.082 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 09:58:58.083 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Whiteboard,Desk,Chair and earphone in this image.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Whiteboard:[735, 292, 930, 555];Desk:[910, 866, 999, 999],[002, 524, 707, 1000];Chair:[962, 735, 1000, 877].') 2023-11-03 09:59:32.798 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 09:59:32.798 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[181, 361, 391, 841];table:[000, 747, 999, 998];man:[000, 272, 233, 772],[285, 300, 608, 855],[479, 321, 860, 904],[783, 602, 999, 977].') 2023-11-03 10:00:07.708 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:00:07.708 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Guitar:[424, 540, 521, 872],[139, 751, 427, 999],[802, 324, 959, 837];Person:[816, 044, 1000, 1000],[384, 127, 686, 1000],[161, 415, 424, 999],[001, 355, 194, 999];Glasses:[484, 228, 569, 272],[920, 177, 994, 225];Hat:[903, 044, 999, 181];Microphone:[684, 184, 815, 227],[263, 252, 379, 354],[040, 489, 136, 545].') 2023-11-03 10:00:12.700 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:00:12.700 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect glove,footwear,woman and human head in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'glove:[481, 463, 586, 511],[486, 192, 607, 265];footwear:[756, 306, 880, 468],[756, 363, 930, 452];woman:[225, 193, 926, 437];human head:[222, 238, 311, 351],[000, 322, 091, 502].') 2023-11-03 10:00:23.050 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:00:23.050 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Bringing a child into the world is the greatest act of hope there is. [108, 366, 892, 577] Louise Hart Picture Quote') 2023-11-03 10:00:30.267 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:00:30.268 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[494, 301, 739, 785],[770, 343, 896, 598],[267, 447, 331, 564],[133, 414, 273, 766];Sneakers:[676, 643, 742, 780],[229, 656, 270, 762],[134, 662, 173, 774].') 2023-11-03 10:00:43.154 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:00:43.155 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Economic development and commercial real estate business growth with building cranes and [708, 637, 923, 948] [076, 783, 261, 947] [246, 294, 738, 946] condominium buildings under construction as a 3D illustration. [002, 008, 992, 983] poster') 2023-11-03 10:01:38.516 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 4 samples! 2023-11-03 10:01:38.516 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Bulletproof stainless steel Wire Mesh for anti window and door') 2023-11-03 10:03:18.146 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:03:18.148 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[504, 460, 673, 751],[442, 515, 508, 609],[716, 521, 788, 606],[337, 509, 394, 557],[903, 498, 966, 615],[680, 838, 1000, 1000],[001, 624, 036, 781];Glasses:[577, 484, 624, 499];Gloves:[586, 559, 630, 582],[534, 603, 565, 623];Other Shoes:[592, 686, 631, 716],[001, 758, 037, 779];Satchel:[411, 725, 635, 790];Hat:[479, 516, 504, 537];Saxophone:[544, 505, 624, 644].') 2023-11-03 10:03:22.601 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:03:22.602 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect clothing,human face,microphone and man in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'clothing:[000, 438, 354, 1000],[255, 257, 396, 458],[310, 497, 816, 984],[641, 488, 899, 885];human face:[101, 257, 267, 614],[228, 070, 320, 246],[437, 103, 523, 268],[499, 000, 545, 096],[541, 264, 699, 579],[719, 292, 833, 539];microphone:[674, 544, 775, 700];man:[662, 224, 923, 933],[354, 039, 526, 431].') 2023-11-03 10:04:03.185 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:04:03.186 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Bed:[000, 453, 918, 997];Pillow:[309, 588, 413, 693],[317, 515, 417, 594],[192, 532, 327, 649],[111, 569, 248, 735],[185, 626, 337, 735];Frame:[000, 070, 358, 482];Cabinet:[417, 111, 1000, 919];Cup:[589, 131, 620, 166],[614, 090, 643, 124],[612, 126, 642, 160],[632, 117, 661, 156],[652, 075, 685, 114],[653, 115, 687, 155],[716, 092, 754, 136],[764, 083, 799, 124],[787, 076, 819, 118],[805, 063, 848, 112],[839, 060, 875, 104],[864, 049, 903, 099],[889, 001, 939, 047],[889, 037, 934, 093],[921, 031, 967, 088],[946, 019, 999, 082].') 2023-11-03 10:04:09.932 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 10:04:09.933 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect human arm,car,human body,chair,person and loveseat in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'human arm:[246, 183, 288, 280];car:[075, 137, 768, 941],[000, 068, 199, 560],[381, 000, 455, 057],[439, 028, 527, 106],[888, 275, 999, 735];human body:[212, 023, 286, 363];chair:[675, 304, 781, 530],[780, 323, 903, 558];person:[196, 032, 306, 348],[923, 126, 973, 247].') 2023-11-03 10:04:15.109 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:04:15.109 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect traffic sign,footwear,man and woman in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'footwear:[438, 866, 506, 918],[652, 886, 794, 956],[761, 803, 864, 853];man:[361, 549, 760, 933];woman:[610, 528, 828, 888].') 2023-11-03 10:05:18.087 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:05:18.088 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2 and frame3, please tell me what this oneframe:1:[538, 254, 767, 881];frame:2:[378, 280, 741, 781];frame:3:[588, 283, 764, 763] is doing?Briefly articulate your response by taking cues from the provided image frames and trajectory coordinates.', 'This is a/an a womam carrying a mug comes in the room and sit down and listens') 2023-11-03 10:05:26.952 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:05:26.952 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Airplane:[010, 199, 992, 691];Street Lights:[020, 267, 037, 428],[211, 169, 231, 423],[223, 269, 239, 424],[533, 266, 551, 462],[707, 255, 723, 447],[777, 271, 794, 440],[866, 286, 887, 479];Car:[331, 439, 417, 464],[473, 430, 531, 462],[436, 412, 506, 449],[730, 437, 806, 465];Van:[347, 430, 399, 454],[917, 416, 999, 468].') 2023-11-03 10:05:48.345 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:05:48.345 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[594, 360, 720, 593];Handbag:[549, 454, 623, 510],[678, 454, 723, 509].') 2023-11-03 10:05:52.951 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:05:52.951 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Microwave,Person and Sneakers in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[112, 347, 158, 425],[080, 340, 120, 364],[193, 299, 381, 580],[338, 288, 557, 665],[519, 288, 677, 700],[730, 407, 843, 568];Sneakers:[551, 670, 634, 695],[521, 677, 631, 702],[468, 642, 558, 664].') 2023-11-03 10:06:19.183 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:06:19.184 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[409, 423, 899, 812] and track its trajectory.Any trajectory details should be arranged according to the Frame t:[xmin,ymin,xmax,ymax] format for clarity.', 'This is a mountain beaverFrame1:[409, 423, 899, 812];Frame2:[404, 352, 698, 812];Frame3:[535, 129, 658, 564];Frame4:[456, 356, 558, 824];Frame5:[447, 367, 550, 835].') 2023-11-03 10:06:22.999 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:06:23.000 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'person:[071, 112, 797, 999];clothing:[073, 756, 751, 999],[683, 505, 930, 999];human face:[315, 207, 612, 816].') 2023-11-03 10:07:02.572 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:07:02.573 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Chair:[558, 775, 625, 999],[077, 751, 271, 999],[316, 728, 443, 999],[154, 690, 245, 736];Flower:[207, 662, 247, 701].') 2023-11-03 10:07:11.432 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:07:11.433 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Enchiladas [004, 634, 732, 838] are a staple meal in our dinnertime rotation and homemade red enchilada sauce [556, 051, 974, 274] takes them over the top!') 2023-11-03 10:07:53.276 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:07:53.276 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Lamp:[473, 001, 675, 065];Chair:[106, 502, 557, 999];Bench:[635, 402, 999, 871];Pillow:[795, 514, 999, 672],[631, 491, 830, 581];Towel:[637, 586, 753, 620],[405, 570, 512, 601];Candle:[396, 521, 436, 576];Plate:[696, 592, 764, 619];Spoon:[504, 631, 578, 665];Knife:[514, 633, 612, 671];Bottle:[348, 467, 388, 592].') 2023-11-03 10:07:57.341 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:07:57.341 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track bicycleFrame1:[950, 432, 999, 554],carFrame1:[000, 328, 282, 725],personFrame1:[788, 366, 824, 477],carFrame1:[825, 359, 935, 439],carFrame1:[411, 319, 525, 510],carFrame1:[255, 390, 369, 552],carFrame1:[338, 394, 516, 671] in this video clip.Your response should conform strictly to the categoryFrame t:[xmin,ymin,xmax,ymax] format.', 'bicycleFrame1:[950, 432, 999, 554];Frame2:[957, 441, 999, 554],carFrame1:[000, 328, 282, 725];Frame2:[000, 345, 305, 684],personFrame1[788, 366, 824, 477];Frame2:No object,carFrame1:[825, 359, 935, 439];Frame2:[837, 362, 940, 439],carFrame1:[411, 319, 525, 510];Frame2:[426, 325, 540, 528],carFrame1:[255, 390, 369, 552];Frame2:[279, 393, 375, 526],carFrame1:[338, 394, 516, 671];Frame2:[345, 399, 514, 656].') 2023-11-03 10:08:05.534 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:08:05.534 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'woman:[044, 054, 853, 1000];glasses:[374, 352, 677, 482],[364, 137, 647, 222],[368, 108, 628, 208];sunglasses:[397, 362, 680, 470];human face:[349, 243, 644, 640].') 2023-11-03 10:08:46.544 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:08:46.545 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'bicycle wheel:[144, 380, 372, 705],[710, 628, 951, 891],[108, 000, 213, 072],[226, 105, 399, 329],[231, 169, 386, 400],[334, 085, 455, 239],[462, 240, 701, 590],[492, 077, 643, 219],[523, 122, 685, 345],[570, 145, 690, 370],[643, 117, 708, 368];human body:[000, 036, 126, 585];human hair:[000, 082, 061, 183];human head:[000, 081, 060, 181];wheel:[137, 316, 426, 749],[104, 000, 210, 070],[229, 072, 998, 594],[717, 613, 926, 880],[835, 313, 999, 700];tire:[128, 367, 383, 725],[714, 609, 933, 881],[230, 169, 383, 401],[230, 096, 409, 325],[306, 059, 452, 235],[463, 248, 701, 595],[503, 088, 671, 241],[511, 104, 686, 343],[628, 295, 805, 608],[743, 193, 807, 406],[802, 157, 880, 427],[835, 303, 999, 710],[905, 170, 997, 453];bicycle:[107, 000, 999, 705];girl:[000, 080, 120, 568];human face:[000, 081, 051, 185];human arm:[000, 217, 060, 293],[049, 167, 117, 273].') 2023-11-03 10:09:08.769 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:09:08.770 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[411, 489, 472, 656] and track its trajectory.If you're including trajectory details in your reply, the Frame t:[xmin,ymin,xmax,ymax] format is imperative.", 'This is a eyeFrame1:[411, 489, 472, 656];Frame2:[400, 523, 454, 677];Frame3:[448, 465, 506, 650];Frame4:[396, 549, 461, 746].') 2023-11-03 10:09:16.288 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:09:16.288 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the the zebra that is grazing with its head down..When detailing trajectories in your response, adhere to the TrackiFrame t:[xmin,ymin,xmax,ymax] format.', 'There is one the zebra that is grazing with its head down..Track1frame:1:[031, 152, 185, 465];frame:2:[019, 158, 189, 466];frame:3:[027, 156, 188, 465].') 2023-11-03 10:09:26.165 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:09:26.166 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[463, 737, 495, 793] and track its trajectory.Stick to the Frame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'This is a footballFrame1:[463, 737, 495, 793];Frame2:[571, 750, 603, 806];Frame3:[429, 731, 461, 787];Frame4:[483, 731, 515, 787];Frame5:[572, 733, 604, 789].') 2023-11-03 10:09:32.885 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 10:09:32.885 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'man:[129, 209, 588, 965];land vehicle:[000, 500, 999, 971];woman:[591, 213, 869, 498];plant:[089, 422, 178, 485],[122, 161, 370, 239],[125, 248, 255, 357],[586, 273, 667, 345],[721, 171, 999, 255],[788, 243, 999, 455];clothing:[133, 258, 591, 909],[377, 245, 619, 492],[591, 287, 866, 504],[898, 338, 999, 468].') 2023-11-03 10:09:33.709 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:09:33.710 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'tree:[761, 066, 986, 610],[000, 053, 999, 654].') 2023-11-03 10:10:21.637 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:10:21.638 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Collage [393, 130, 893, 661] of crochet scarf on a boy [232, 005, 897, 658], rolled up, close up and on the ground') 2023-11-03 10:10:23.581 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:10:23.581 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video clip including frame1,frame2 and frame3, can you tell me what thisframe:1:[607, 700, 689, 821];frame:2:[587, 715, 712, 835];frame:3:[544, 503, 826, 999] is?Using the given image frames and trajectory coordinates, please provide a concise answer.Refer to the supplied image frames and trajectory coordinates for a short response to the question.Please give a succinct reply by referring to the image frames and trajectory coordinates provided.Make sure your response is brief and references the image frames and trajectory coordinates you've been given.Draw upon the provided image frames and trajectory coordinates for a concise answer to the question.Relying on the image frames and trajectory coordinates shared, please keep your answer brief.", 'This is a/an bird biting another bird and moving around') 2023-11-03 10:10:30.375 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:10:30.375 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[002, 108, 715, 1000],[330, 275, 758, 998],[921, 337, 999, 499];Hat:[493, 276, 557, 367];Jug:[502, 516, 684, 776];Cup:[661, 544, 782, 696],[870, 657, 987, 912];Desk:[457, 518, 1000, 999],[627, 499, 690, 546];Air Conditioner:[649, 385, 691, 505];Frame:[103, 406, 150, 496];Chair:[122, 476, 162, 523];Cabinet:[002, 494, 073, 745].') 2023-11-03 10:10:32.216 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:10:32.217 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Bottle,Satchel and Picture/Frame in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Bottle:[159, 501, 196, 615],[075, 541, 165, 608];Satchel:[628, 459, 722, 615],[413, 555, 572, 625],[091, 425, 213, 675];Frame:[642, 001, 808, 126].') 2023-11-03 10:10:43.771 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:10:43.772 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'swan:[000, 139, 558, 807],[231, 338, 999, 882].') 2023-11-03 10:11:04.140 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:11:04.141 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'man:[214, 425, 375, 730],[410, 475, 615, 728];footwear:[237, 695, 310, 721].') 2023-11-03 10:12:03.354 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:12:03.354 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'couple playing footsie under the table - footsie under table stock pictures, royalty-free photos [603, 619, 801, 725] & images') 2023-11-03 10:13:16.666 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 10:13:16.666 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'mobile phone:[708, 137, 983, 914],[000, 298, 429, 999];person:[000, 000, 999, 999].') 2023-11-03 10:13:23.000 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:13:23.001 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Machinery Vehicle:[157, 121, 931, 884],[007, 345, 269, 642],[171, 348, 369, 592].') 2023-11-03 10:13:36.519 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:13:36.519 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'car:[776, 725, 993, 918],[000, 704, 161, 981],[136, 690, 200, 790],[154, 645, 228, 752],[220, 640, 240, 704],[233, 604, 411, 681],[623, 693, 762, 812];building:[000, 402, 149, 786],[349, 545, 381, 629],[378, 495, 520, 654],[516, 238, 832, 779],[803, 000, 999, 733].') 2023-11-03 10:13:39.569 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:13:39.570 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Governor Greg Abbott [237, 346, 630, 949] outside of Wally's Burger Express in Austin is surrounded by state legislators [863, 131, 997, 612] [720, 376, 997, 961] [524, 108, 813, 949] [006, 148, 254, 930] as he signs a bill that limits property tax growth, Wednesday, June 12, 2019.") 2023-11-03 10:13:41.454 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:13:41.454 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[696, 600, 761, 843],personFrame1:[307, 637, 402, 833],personFrame1:[101, 657, 197, 830],personFrame1:[506, 636, 581, 829],personFrame1:[252, 645, 336, 828],personFrame1:[875, 623, 999, 864],personFrame1:[605, 628, 693, 847] in this video clip.Please strictly follow the format categoryFrame t:[xmin,ymin,xmax,ymax] to answer the question.', 'personFrame1:[696, 600, 761, 843];Frame2:[679, 554, 755, 827],personFrame1:[307, 637, 402, 833];Frame2:[320, 581, 407, 823],personFrame1:[101, 657, 197, 830];Frame2:[052, 623, 128, 838],personFrame1:[506, 636, 581, 829];Frame2:[497, 613, 575, 825],personFrame1:[252, 645, 336, 828];Frame2:[225, 579, 318, 831],personFrame1:[875, 623, 999, 864];Frame2:[827, 596, 990, 864],personFrame1:[605, 628, 693, 847];Frame2:[581, 599, 699, 846].') 2023-11-03 10:14:15.407 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:14:15.407 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Picture/Frame in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Picture:[133, 002, 443, 551].') 2023-11-03 10:14:27.405 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:14:27.406 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[000, 399, 696, 917] unrecognizable man in [001, 006, 999, 925] a car using [248, 360, 423, 747] alcohol gel to disinfect [311, 171, 904, 920] steering wheel during pandemic coronavirus covid-19 disinfecting get automobile transport wipe job') 2023-11-03 10:14:37.598 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:14:37.598 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Golf Ball:[230, 292, 271, 357];Person:[050, 052, 598, 999],[449, 224, 967, 999],[001, 159, 081, 633],[198, 160, 258, 642];Helmet:[640, 221, 751, 352],[248, 046, 357, 157],[011, 159, 061, 221],[220, 161, 250, 201];Hockey Stick:[038, 113, 258, 398],[094, 755, 250, 896].') 2023-11-03 10:14:40.718 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:14:40.718 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Workers [703, 204, 886, 995] [829, 239, 991, 995] [505, 357, 666, 997] speak at the tribunal in India.') 2023-11-03 10:14:59.858 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:14:59.859 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'shelf:[165, 001, 433, 429];Frame:[615, 042, 809, 496];Gun:[478, 694, 533, 732],[522, 739, 556, 773],[557, 733, 594, 786],[558, 776, 613, 862],[604, 768, 643, 828],[603, 743, 626, 775],[648, 741, 673, 798],[656, 791, 695, 828],[654, 832, 691, 870],[759, 872, 829, 922];Hat:[951, 164, 981, 197],[384, 873, 438, 968],[338, 829, 405, 898],[535, 218, 559, 251],[305, 454, 332, 485];Handbag:[359, 352, 426, 485];Cup:[338, 552, 385, 616],[276, 538, 319, 598],[626, 573, 664, 631],[898, 604, 945, 660];Bottle:[713, 401, 759, 549],[455, 483, 487, 590];Horse:[665, 153, 749, 319].') 2023-11-03 10:15:06.690 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:15:06.690 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Wooden Butterflies [589, 443, 828, 670] [228, 423, 404, 772] [394, 620, 652, 924] [352, 219, 571, 508] in a wooden bowl [182, 238, 819, 971] filled crystals [839, 790, 888, 855] [078, 829, 133, 896] [095, 304, 136, 367] [792, 900, 870, 955]') 2023-11-03 10:15:53.831 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:15:53.832 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Justin Bieber [006, 006, 351, 993]]'s rich, right? Grammys [410, 164, 940, 571] are for music and not money. He's making a lot of money. He should be happy with that. - Patrick Carney") 2023-11-03 10:15:54.738 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:15:54.738 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'food:[071, 833, 255, 972],[243, 725, 379, 851],[266, 951, 336, 998],[287, 785, 477, 920],[329, 946, 419, 998];woman:[759, 328, 941, 985];man:[026, 311, 174, 855],[194, 272, 386, 780],[422, 447, 548, 831],[532, 257, 736, 998],[555, 420, 661, 691];clothing:[022, 412, 181, 836],[207, 382, 383, 790],[381, 375, 439, 452],[414, 518, 553, 821],[548, 410, 679, 590],[565, 431, 731, 998],[791, 486, 925, 998].') 2023-11-03 10:16:07.108 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:16:07.108 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Wild Bird:[086, 453, 224, 589],[291, 501, 426, 588],[459, 206, 594, 350],[455, 501, 615, 637],[588, 331, 739, 467],[705, 365, 861, 527].') 2023-11-03 10:16:17.501 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:16:17.501 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Glasses,Person,Cell Phone and Elephant in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Glasses:[520, 291, 555, 358];Person:[477, 149, 741, 999],[000, 001, 128, 975],[184, 022, 363, 439],[628, 001, 709, 394],[853, 007, 970, 399];Cell Phone:[024, 203, 063, 260].') 2023-11-03 10:16:26.343 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:16:26.344 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[126, 030, 861, 964] black long sleeve unisex tee with [341, 184, 627, 573] a camo american spartan helmet front chest logo') 2023-11-03 10:16:39.823 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:16:39.824 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'suit:[610, 393, 985, 973],[072, 628, 613, 998];man:[073, 350, 611, 998],[612, 160, 998, 998];building:[002, 000, 305, 616],[055, 078, 698, 830];human face:[322, 411, 475, 683],[646, 195, 843, 546];glasses:[638, 291, 830, 405].') 2023-11-03 10:16:41.915 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:16:41.915 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the the college's bus that is going rightward, ahead of the white vehicle..To ensure accuracy, apply the TrackiFrame t:[xmin,ymin,xmax,ymax] template for every trajectory in your response.", "There is one the college's bus that is going rightward, ahead of the white vehicle..Track1frame:1:[413, 487, 609, 682];frame:2:[423, 488, 609, 682];frame:3:[432, 490, 608, 682].") 2023-11-03 10:16:49.276 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:16:49.276 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[214, 346, 609, 791] and track its trajectory.All trajectories in your reply should conform to the Frame t:[xmin,ymin,xmax,ymax] pattern.', 'This is a sloth bearFrame1:[214, 346, 609, 791];Frame2:[229, 352, 562, 789];Frame3:[280, 363, 534, 800];Frame4:[377, 439, 683, 855].') 2023-11-03 10:16:56.825 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:16:56.825 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[112, 147, 886, 911] Best all natural night creams') 2023-11-03 10:17:10.801 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:17:10.802 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'clothing:[536, 374, 738, 680];man:[506, 345, 753, 663];car:[000, 347, 198, 564].') 2023-11-03 10:17:22.596 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:17:22.596 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[218, 515, 388, 687],carFrame1:[301, 408, 336, 490],carFrame1:[442, 483, 508, 566],carFrame1:[656, 306, 688, 434],carFrame1:[342, 446, 398, 513],carFrame1:[337, 354, 369, 441],truckFrame1:[943, 449, 999, 592] in this video clip.When answering, maintain the format classFrame t:[xmin,ymin,xmax,ymax] without deviation.', 'carFrame1:[218, 515, 388, 687];Frame2:[243, 501, 396, 656],carFrame1[301, 408, 336, 490];Frame2:No object,carFrame1:[442, 483, 508, 566];Frame2:[449, 467, 516, 550],carFrame1:[656, 306, 688, 434];Frame2:[646, 293, 679, 424],carFrame1:[342, 446, 398, 513];Frame2:[247, 430, 328, 521],carFrame1:[337, 354, 369, 441];Frame2:[314, 333, 347, 435],truckFrame1[943, 449, 999, 592];Frame2:No object.') 2023-11-03 10:18:41.197 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:18:41.198 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'suit:[714, 398, 875, 956],[000, 258, 111, 757],[000, 098, 130, 331],[105, 312, 266, 776],[170, 655, 325, 999],[173, 214, 384, 761],[310, 489, 450, 660],[316, 000, 436, 420],[318, 676, 451, 999],[356, 206, 458, 445],[413, 333, 560, 653],[453, 717, 581, 999],[453, 561, 706, 996],[500, 223, 616, 495],[558, 063, 690, 416],[578, 489, 676, 996],[656, 014, 731, 283];woman:[156, 000, 233, 197],[188, 042, 296, 415],[266, 026, 345, 476],[750, 004, 868, 390],[834, 000, 961, 427];man:[000, 151, 137, 809],[000, 015, 142, 570],[071, 000, 181, 212],[104, 220, 266, 823],[148, 878, 285, 999],[170, 305, 308, 718],[178, 545, 329, 997],[306, 392, 451, 991],[317, 595, 453, 999],[321, 000, 431, 226],[351, 151, 466, 453],[381, 031, 505, 267],[414, 253, 552, 687],[452, 624, 589, 999],[452, 507, 732, 803],[501, 143, 615, 574],[523, 000, 613, 230],[563, 000, 686, 407],[572, 402, 681, 991],[652, 000, 736, 293],[731, 339, 962, 999],[928, 135, 999, 683],[941, 000, 999, 093],[960, 046, 998, 194];footwear:[074, 699, 143, 744].') 2023-11-03 10:19:58.101 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:19:58.101 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Blender:[042, 525, 138, 822];Basin:[158, 783, 201, 821],[216, 778, 260, 818],[741, 776, 788, 789],[844, 796, 903, 846];Cup:[229, 789, 260, 835],[190, 790, 221, 836],[588, 817, 620, 844],[610, 760, 652, 796],[685, 808, 708, 835];Tong:[292, 793, 367, 837];Pot:[407, 719, 543, 783];Gas stove:[392, 755, 567, 850];Jug:[666, 770, 714, 811];Plate:[786, 776, 903, 804],[915, 792, 1000, 824];Spoon:[889, 826, 923, 843];Bottle:[923, 705, 946, 830];Person:[416, 369, 617, 788];Telephone:[130, 429, 165, 521];Board Eraser:[255, 633, 295, 648];Whiteboard:[242, 156, 991, 684];Speaker:[652, 020, 823, 183];Clock:[434, 007, 559, 167].') 2023-11-03 10:20:01.823 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:20:01.823 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'clothing:[473, 667, 646, 975],[000, 344, 159, 723],[075, 257, 216, 404],[162, 358, 296, 512],[189, 470, 494, 998],[248, 280, 318, 356],[318, 302, 384, 363],[375, 401, 616, 980],[470, 385, 815, 897],[599, 354, 854, 823],[674, 318, 921, 731],[761, 304, 945, 648],[846, 294, 998, 593],[934, 223, 998, 344];woman:[000, 237, 191, 593],[174, 325, 495, 998],[356, 283, 629, 998],[461, 254, 807, 944],[588, 247, 871, 899],[652, 219, 947, 788],[755, 197, 963, 709],[814, 209, 988, 690],[871, 207, 998, 650];man:[005, 233, 191, 790],[087, 174, 228, 425],[162, 287, 284, 536],[257, 219, 322, 387],[313, 197, 396, 404],[580, 183, 639, 365],[651, 179, 689, 269],[723, 192, 766, 333],[763, 169, 786, 292],[801, 176, 828, 242],[897, 181, 932, 250];human face:[018, 287, 075, 368],[117, 214, 153, 287],[227, 309, 265, 391],[312, 375, 370, 515],[339, 243, 381, 323],[403, 313, 462, 425],[506, 287, 546, 384],[636, 271, 668, 366],[692, 252, 723, 332].') 2023-11-03 10:21:15.773 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:21:15.773 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[673, 289, 893, 499] [543, 511, 817, 635] [279, 128, 379, 344] [262, 534, 469, 749] [784, 635, 940, 855] [019, 583, 299, 816] the doves take off') 2023-11-03 10:21:51.556 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:21:51.557 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[051, 042, 998, 999];Helmet:[296, 035, 569, 240];Gloves:[310, 430, 478, 533],[001, 477, 081, 638];Sneakers:[260, 873, 526, 999].') 2023-11-03 10:23:03.314 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:23:03.315 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Dozens of [104, 071, 473, 779] [423, 089, 987, 991] vendors offer a variety of snacks, food, [166, 862, 273, 997] [134, 648, 213, 774] drinks and [302, 766, 599, 907] [167, 724, 453, 863] other refreshments throughout the Fourth on Broadway festival in Lubbock.') 2023-11-03 10:24:04.605 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:24:04.606 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video clip including frame1,frame2 and frame3,please tell me what is thisFrame1:[373, 194, 777, 706] and track its trajectory.If you're including trajectory details in your reply, the Frame t:[xmin,ymin,xmax,ymax] format is imperative.", 'This is a raccoonFrame1:[373, 194, 777, 706];Frame2:[512, 130, 824, 706];Frame3:[656, 027, 917, 541].') 2023-11-03 10:24:13.377 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:24:13.378 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'wheel:[922, 233, 983, 344],[063, 642, 088, 705],[763, 301, 786, 370];toy:[000, 753, 143, 970],[000, 465, 158, 730],[000, 175, 203, 374],[047, 069, 211, 185],[075, 469, 272, 744],[094, 763, 259, 968],[105, 401, 186, 498],[126, 164, 326, 385],[158, 104, 291, 198],[184, 400, 283, 506],[216, 766, 351, 987],[237, 429, 421, 744],[268, 406, 375, 508],[282, 098, 380, 210],[294, 155, 467, 374],[340, 768, 443, 980],[352, 104, 490, 193],[374, 415, 476, 522],[406, 493, 547, 724],[436, 780, 561, 982],[443, 143, 606, 370],[475, 417, 558, 510],[476, 095, 583, 172],[537, 479, 688, 753],[548, 765, 679, 999],[564, 078, 679, 209],[566, 365, 656, 479],[598, 159, 760, 364],[643, 379, 759, 479],[650, 450, 826, 739],[662, 755, 804, 984],[667, 074, 763, 177],[719, 104, 940, 362],[729, 396, 838, 501],[768, 477, 968, 730],[770, 070, 897, 150],[770, 721, 918, 999],[888, 060, 999, 353];car:[000, 729, 929, 999],[000, 348, 951, 751],[000, 064, 999, 364].') 2023-11-03 10:24:33.872 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 10:24:33.873 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'bird:[313, 000, 832, 950].') 2023-11-03 10:24:53.304 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:24:53.304 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Person:[015, 037, 170, 543],[859, 002, 985, 541];Other Shoes:[096, 455, 134, 514],[068, 508, 092, 544];Slippers:[897, 486, 929, 519],[908, 506, 956, 541];Chicken:[311, 406, 459, 572],[399, 467, 671, 783].') 2023-11-03 10:25:07.368 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:25:07.369 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'boot:[741, 619, 924, 893],[151, 797, 212, 962],[261, 800, 344, 980],[550, 660, 758, 937];woman:[070, 137, 339, 944],[011, 075, 088, 678],[032, 022, 155, 220],[352, 124, 485, 820],[424, 016, 490, 151],[508, 124, 607, 302],[508, 044, 900, 488];high heels:[555, 665, 769, 950],[138, 802, 204, 959],[271, 803, 343, 975],[740, 601, 934, 874];sunglasses:[200, 089, 284, 137],[674, 233, 833, 323];jeans:[000, 385, 084, 627],[210, 532, 276, 838],[297, 460, 419, 836],[401, 537, 488, 691].') 2023-11-03 10:25:13.916 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:25:13.916 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Ethan Hawke plays [232, 027, 800, 996] a military chaplain in the film,') 2023-11-03 10:26:23.266 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:26:23.266 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect SUV,Bottle,Desk,Bracelet,Plate and Steak in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Bottle:[936, 310, 999, 454],[735, 112, 896, 303],[627, 002, 774, 315],[015, 060, 241, 371];Desk:[002, 067, 1000, 999];Bracelet:[245, 073, 300, 116];Plate:[275, 276, 755, 387],[915, 434, 1000, 533],[856, 276, 957, 334],[818, 208, 976, 259];Steak:[747, 384, 971, 496].') 2023-11-03 10:26:30.668 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:26:30.668 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Boat:[806, 638, 890, 692],[881, 634, 956, 682].') 2023-11-03 10:26:34.701 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:26:34.701 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[732, 385, 834, 500],carFrame1:[802, 378, 947, 492],truckFrame1:[439, 311, 484, 410],carFrame1:[380, 352, 491, 531],carFrame1:[000, 410, 082, 576] in this video clip.Please strictly follow the format categoryFrame t:[xmin,ymin,xmax,ymax] to answer the question.', 'carFrame1:[732, 385, 834, 500];Frame2:[801, 385, 961, 510],carFrame1:[802, 378, 947, 492];Frame2:[910, 391, 1000, 540],truckFrame1:[439, 311, 484, 410];Frame2:[443, 308, 499, 423],carFrame1:[380, 352, 491, 531];Frame2:[377, 349, 499, 552],carFrame1[000, 410, 082, 576];Frame2:No object.') 2023-11-03 10:26:53.129 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:26:53.130 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Towel:[575, 283, 634, 359],[802, 326, 855, 497],[676, 428, 767, 569],[403, 461, 498, 598],[220, 384, 280, 532],[317, 298, 374, 373],[452, 276, 505, 354];Flower:[437, 002, 601, 385],[929, 001, 999, 236],[148, 032, 240, 250];Chair:[198, 584, 542, 1000],[609, 554, 999, 996],[879, 302, 1000, 566],[804, 258, 918, 390],[842, 231, 949, 479],[599, 240, 711, 366],[705, 209, 810, 383],[816, 162, 880, 233],[403, 244, 510, 337],[366, 215, 431, 315],[190, 270, 302, 417],[174, 259, 282, 445],[022, 321, 183, 643],[001, 614, 256, 1000],[015, 236, 187, 490];Person:[585, 001, 727, 260];Fork:[368, 526, 416, 593];Cup:[516, 397, 566, 547],[619, 299, 680, 418].') 2023-11-03 10:26:55.965 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:26:55.965 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[583, 505, 974, 892],carFrame1:[515, 566, 627, 740],carFrame1:[176, 549, 254, 635],carFrame1:[222, 541, 274, 628],carFrame1:[840, 662, 999, 1000],carFrame1:[420, 547, 455, 606],carFrame1:[019, 564, 163, 713] in this video clip.To respond correctly, utilize the specified classFrame t:[xmin,ymin,xmax,ymax] format.', 'carFrame1:[583, 505, 974, 892];Frame2:[858, 717, 1000, 999],carFrame1:[515, 566, 627, 740];Frame2:[601, 581, 971, 937],carFrame1:[176, 549, 254, 635];Frame2:[098, 549, 212, 683],carFrame1:[222, 541, 274, 628];Frame2:[163, 541, 244, 653],carFrame1[840, 662, 999, 1000];Frame2:No object,carFrame1:[420, 547, 455, 606];Frame2:[422, 551, 463, 614],carFrame1:[019, 564, 163, 713];Frame2:[000, 621, 045, 766].') 2023-11-03 10:27:04.192 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:27:04.193 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[440, 658, 514, 997] [678, 705, 736, 988] [503, 744, 611, 983] [298, 703, 356, 997] [205, 636, 336, 996] [802, 672, 903, 998] [363, 648, 499, 996] [071, 687, 159, 998] [161, 714, 232, 997] [895, 679, 982, 998] [703, 657, 829, 996] Passengers marvel at the sheer scale of Point Wild, an important Antarctic historic site.') 2023-11-03 10:27:31.410 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:27:31.410 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A group of youngsters [338, 306, 466, 794] [466, 003, 592, 796] [597, 084, 697, 790] [179, 214, 363, 825] roast hotdogs with a camp wranglers [684, 022, 895, 822] during their Colorado summer vacation.') 2023-11-03 10:27:32.322 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:27:32.322 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[507, 006, 660, 596] kindergarten teacher dressed as the Cat in [513, 002, 630, 158] the hat with [268, 230, 375, 633] [714, 245, 826, 655] [534, 527, 668, 834] [361, 287, 455, 577] [182, 515, 324, 788] [608, 271, 724, 594] [439, 283, 554, 580] [652, 484, 778, 803] [442, 478, 561, 748] [138, 270, 249, 668] [301, 483, 448, 790] [782, 519, 930, 858] [031, 507, 187, 835] her students holding up their cat in the hat poems') 2023-11-03 10:28:57.851 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:28:57.852 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2 and frame3, please tell me what this oneframe:1:[207, 341, 500, 662];frame:2:[126, 481, 351, 784];frame:3:[141, 626, 240, 818] is doing?Given the image frames and trajectory coordinates, please respond succinctly to the question.Considering the image frames and trajectory coordinates provided, offer a brief answer.', 'This is a/an the fish darting back and forth, creating ripples in the water with their movements.') 2023-11-03 10:29:10.394 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:29:10.395 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[496, 254, 522, 314] and track its trajectory.All trajectories in your reply should conform to the Frame t:[xmin,ymin,xmax,ymax] pattern.', 'This is a faceFrame1:[496, 254, 522, 314];Frame2:[447, 223, 473, 280];Frame3:[417, 215, 444, 273];Frame4:[527, 238, 546, 290];Frame5:[530, 257, 550, 309].') 2023-11-03 10:29:21.588 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:29:21.589 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Car:[117, 684, 165, 730],[624, 780, 746, 863];Van:[470, 778, 617, 886],[745, 720, 885, 840],[069, 623, 121, 667];Bicycle:[520, 857, 553, 946],[625, 857, 660, 936].') 2023-11-03 10:29:22.418 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:29:22.419 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Glasses:[530, 322, 680, 410];Tie:[579, 491, 651, 999];Chair:[702, 765, 866, 1000];Person:[002, 227, 579, 999];Cup:[601, 786, 706, 999];Cell Phone:[517, 823, 612, 903].') 2023-11-03 10:29:25.178 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:29:25.179 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("I kindly request that you examine the image and provide a solution for my query 'QUESTION: Why are [763, 837, 873, 992] and [404, 555, 454, 644] , and [013, 122, 227, 947] wearing hooded capes ?'. Don't forget to add an explanation and share the spatial location of objects using [x1,y1,x2,y2].", '[763, 837, 873, 992] and [404, 555, 454, 644] , and [013, 122, 227, 947] are all in a circle around [396, 040, 755, 962] and [360, 637, 468, 975] . one is talking to them all and they are doing nothing to stop [396, 040, 755, 962] . They are servants of [396, 040, 755, 962] .') 2023-11-03 10:29:54.766 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:29:54.766 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Australian fishing history - Two prawn fishermen [018, 214, 220, 596] [367, 318, 523, 607] hauling prawning nets [134, 374, 996, 677] from their boat [096, 554, 641, 736], NSW ca 1930') 2023-11-03 10:30:27.189 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:30:27.189 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Helmet:[393, 050, 480, 197],[532, 230, 629, 382];Gloves:[208, 756, 296, 939],[403, 785, 472, 905],[570, 807, 629, 911];Person:[002, 002, 425, 998],[294, 046, 545, 998],[469, 232, 701, 1000].') 2023-11-03 10:30:41.499 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:30:41.499 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[588, 439, 655, 553],carFrame1:[756, 473, 857, 615],carFrame1:[806, 474, 999, 711],carFrame1:[673, 475, 782, 572],carFrame1:[923, 456, 999, 775] in this video clip.The prescribed format for your answer is classFrame t:[xmin,ymin,xmax,ymax]. Please follow it closely.', 'carFrame1:[588, 439, 655, 553];Frame2:[588, 439, 656, 548],carFrame1:[756, 473, 857, 615];Frame2:[844, 460, 999, 772],carFrame1[806, 474, 999, 711];Frame2:No object,carFrame1[673, 475, 782, 572];Frame2:No object,carFrame1[923, 456, 999, 775];Frame2:No object.') 2023-11-03 10:31:17.863 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:31:17.864 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'car:[038, 142, 970, 941],[614, 211, 999, 657],[925, 265, 999, 330];wheel:[068, 490, 122, 670],[280, 580, 438, 941],[339, 299, 430, 342],[925, 468, 999, 655].') 2023-11-03 10:31:48.732 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:31:48.732 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[028, 277, 876, 810] and track its trajectory.All trajectories in your reply should conform to the Frame t:[xmin,ymin,xmax,ymax] pattern.', 'This is a platypusFrame1:[028, 277, 876, 810];Frame2:[028, 275, 860, 810];Frame3:[023, 272, 854, 829];Frame4:[025, 262, 895, 875].') 2023-11-03 10:32:07.722 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:32:07.722 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "When I travel, I draw and paint sketches which is great fun. And as long as you are fully aware that it has nothing to do with actual art, I think that's all right. - Arne Jacobsen [003, 044, 350, 994]") 2023-11-03 10:32:28.929 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 10:32:28.930 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Flower:[542, 883, 575, 922],[437, 942, 462, 992],[547, 927, 575, 976];Person:[484, 700, 536, 777];Train:[559, 376, 707, 465];Boat:[412, 747, 595, 998],[275, 509, 375, 684].') 2023-11-03 10:32:48.134 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:32:48.134 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[006, 012, 993, 984] Another view of the new artroom.') 2023-11-03 10:32:51.484 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:32:51.485 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3,frame4 and frame5,can you point out the trajectory of the women in the left?Use the specified TrackiFrame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'There is one women in the left.Track1frame1:[155, 488, 184, 746];frame2:[129, 482, 164, 757];frame3:[103, 485, 140, 781];frame4:[071, 490, 111, 810];frame5:[035, 496, 081, 842].') 2023-11-03 10:32:57.854 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:32:57.854 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Airplane:[001, 099, 988, 792].') 2023-11-03 10:33:08.154 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 10:33:08.154 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Person:[352, 059, 901, 994],[167, 134, 597, 1000],[001, 237, 317, 1000],[001, 308, 133, 652],[579, 362, 652, 480],[873, 612, 968, 958];Hat:[418, 139, 581, 312].') 2023-11-03 10:33:45.187 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:33:45.187 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'building:[000, 000, 999, 999].') 2023-11-03 10:33:58.718 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:33:58.719 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Cabinet:[099, 638, 538, 709];Desk:[020, 765, 730, 999];Person:[680, 522, 1000, 997],[002, 562, 185, 999];Moniter:[037, 469, 341, 679];Laptop:[389, 686, 686, 869].') 2023-11-03 10:34:04.023 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:34:04.024 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the white mini truck move in front of us?To ensure accuracy, apply the TrackiFrame t:[xmin,ymin,xmax,ymax] template for every trajectory in your response.', 'There is one white mini truck move in front of us.Track1frame:1:[853, 484, 999, 625];frame:2:[713, 520, 754, 575];frame:3:[463, 476, 503, 529].') 2023-11-03 10:34:17.174 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:34:17.174 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'arrowverse banner image with a collection of superheroes [089, 292, 189, 918] [309, 315, 417, 894] [018, 290, 128, 914] [164, 283, 285, 920] [375, 279, 475, 837] and text that says The Arrowverse Timeline') 2023-11-03 10:35:13.729 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:35:13.729 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[573, 431, 657, 883],personFrame1:[668, 411, 815, 877],personFrame1:[433, 316, 714, 637] in this video clip.Use only the categoryFrame t:[xmin,ymin,xmax,ymax] format to structure your answer.', 'personFrame1:[573, 431, 657, 883];Frame2:[540, 431, 628, 887],personFrame1:[668, 411, 815, 877];Frame2:[629, 402, 810, 888],personFrame1:[433, 316, 714, 637];Frame2:[441, 213, 647, 622].') 2023-11-03 10:35:29.254 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:35:29.255 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the the woman who walked over to the right.?To ensure accuracy, apply the TrackiFrame t:[xmin,ymin,xmax,ymax] template for every trajectory in your response.', 'There is one the woman who walked over to the right..Track1frame:1:[853, 267, 999, 938];frame:2:[850, 282, 999, 950];frame:3:[843, 277, 987, 953].') 2023-11-03 10:35:45.184 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 10:35:45.184 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Tent:[795, 672, 983, 874],[863, 614, 980, 756],[710, 567, 790, 676],[621, 593, 716, 725],[524, 643, 666, 834],[532, 554, 612, 648],[528, 524, 582, 594],[465, 538, 531, 624],[449, 507, 494, 573],[382, 567, 486, 690],[388, 515, 448, 588],[310, 531, 384, 618],[329, 498, 387, 569],[358, 490, 389, 539];Person:[745, 585, 768, 674],[724, 583, 744, 672],[714, 591, 737, 672],[559, 582, 582, 671].') 2023-11-03 10:36:13.995 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:36:13.996 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Glasses:[322, 608, 499, 716];Person:[131, 488, 526, 999],[182, 219, 1000, 1000].') 2023-11-03 10:36:37.956 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:36:37.956 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect jeans,mammal,girl and man in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'jeans:[818, 649, 910, 975],[112, 677, 174, 999],[635, 740, 696, 901];mammal:[541, 474, 654, 1000],[099, 353, 253, 999],[145, 443, 291, 999],[253, 415, 357, 963],[323, 391, 415, 505],[334, 429, 458, 999],[427, 393, 493, 847],[472, 441, 557, 995],[483, 369, 553, 503],[525, 415, 625, 640],[569, 382, 641, 572],[599, 369, 643, 485],[621, 348, 736, 508],[630, 474, 714, 935],[646, 390, 712, 601],[700, 414, 799, 930],[746, 327, 830, 837],[790, 393, 920, 935];girl:[330, 430, 461, 999],[703, 415, 792, 928];man:[086, 351, 247, 999],[420, 384, 488, 885],[605, 335, 755, 508],[743, 320, 841, 827].') 2023-11-03 10:37:31.367 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:37:31.368 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Person:[001, 217, 102, 634],[110, 248, 354, 811],[306, 337, 474, 739],[476, 384, 745, 768],[592, 303, 760, 761];Helmet:[235, 249, 293, 327],[420, 362, 473, 464],[592, 302, 643, 375];Gloves:[474, 433, 523, 517];Boots:[110, 716, 166, 794],[276, 722, 326, 811],[308, 652, 368, 738];Hockey Stick:[133, 495, 544, 589],[364, 512, 526, 757],[466, 437, 670, 729].') 2023-11-03 10:37:57.112 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:37:57.112 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person,Horse and Bench in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Person:[027, 441, 094, 758],[552, 393, 661, 585],[682, 445, 741, 543],[738, 450, 788, 542];Horse:[195, 432, 546, 822],[180, 432, 476, 807];Bench:[617, 443, 689, 524].') 2023-11-03 10:37:58.056 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:37:58.056 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Hat:[244, 172, 346, 255],[518, 164, 603, 233];Belt:[582, 653, 693, 726];Satchel:[744, 595, 797, 786];Backpack:[616, 305, 816, 625];Person:[071, 171, 472, 1000],[455, 164, 824, 1000].') 2023-11-03 10:38:17.442 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:38:17.443 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Motorcycle:[438, 692, 780, 1000];Person:[000, 200, 088, 675],[029, 253, 160, 793],[248, 230, 315, 401],[284, 293, 426, 963],[333, 258, 419, 395],[644, 260, 769, 892],[671, 244, 753, 374],[751, 249, 819, 413],[768, 275, 929, 974];Hat:[696, 259, 732, 349];Glasses:[761, 345, 785, 387];Belt:[050, 446, 087, 503];Sandals:[352, 920, 402, 963];Slippers:[871, 930, 910, 975];Satchel:[244, 395, 330, 611];TV:[178, 320, 208, 396];shelf:[651, 421, 875, 803],[389, 416, 652, 821],[105, 406, 354, 797],[799, 202, 848, 404],[677, 198, 799, 364],[633, 197, 684, 416],[508, 227, 629, 413],[387, 197, 511, 415],[331, 220, 382, 383],[274, 229, 331, 385],[175, 199, 273, 411];Storage box:[055, 546, 112, 675],[066, 674, 111, 808],[000, 697, 020, 807],[004, 668, 048, 748],[851, 200, 901, 296].') 2023-11-03 10:38:30.198 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:38:30.198 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Kappa Delta Rho brothers [055, 486, 150, 834] [391, 435, 471, 678] [818, 440, 910, 748] [176, 457, 287, 734] play catch with a football [179, 455, 218, 506] in front of their lodge [004, 010, 996, 590].') 2023-11-03 10:38:35.340 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 10:38:35.340 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Chair:[095, 605, 208, 782];Lamp:[100, 001, 487, 258];Person:[262, 455, 646, 998];Hat:[323, 455, 517, 537].') 2023-11-03 10:38:59.080 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 10:38:59.080 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'tree:[000, 000, 283, 294].') 2023-11-03 10:39:10.106 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:39:10.106 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[808, 187, 954, 619] and track its trajectory.For the trajectories included in the answer, please use the format Frame t:[xmin,ymin,xmax,ymax].', 'This is a pole horseFrame1:[808, 187, 954, 619];Frame2:[629, 259, 813, 600];Frame3:[513, 256, 785, 556];Frame4:[279, 341, 406, 636].') 2023-11-03 10:39:14.555 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:39:14.555 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Person:[382, 333, 602, 963];Bracelet:[575, 632, 599, 649];Hat:[456, 333, 527, 372];Handbag:[408, 572, 430, 629];Other Shoes:[410, 905, 484, 961],[511, 899, 579, 956];Camera:[456, 538, 510, 601];Wild Bird:[924, 034, 951, 065].') 2023-11-03 10:39:17.697 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:39:17.698 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Just Right launches a new approach to dog food with a website that allows [094, 086, 832, 995] pet owners to create a blend customized to meet their dogs' specific tastes and nutritional needs.") 2023-11-03 10:39:31.664 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:39:31.664 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'human body:[168, 670, 276, 959],[119, 327, 225, 640],[419, 672, 505, 999],[423, 126, 505, 317],[560, 080, 632, 260],[591, 845, 650, 979],[655, 844, 717, 971],[698, 839, 753, 944],[764, 356, 797, 425],[787, 075, 913, 301],[796, 338, 999, 624],[945, 930, 999, 999];mammal:[786, 086, 917, 278],[118, 330, 224, 635],[170, 661, 269, 950],[311, 179, 421, 306],[376, 364, 459, 498],[421, 677, 505, 989],[426, 124, 503, 310],[557, 083, 635, 253],[581, 845, 659, 971],[654, 847, 717, 966],[701, 840, 757, 942],[762, 353, 796, 420],[796, 334, 995, 624],[950, 934, 999, 999];clothing:[167, 688, 276, 925],[853, 452, 999, 616];human leg:[123, 510, 175, 644],[170, 851, 245, 955],[172, 511, 201, 634],[236, 829, 283, 945],[430, 255, 506, 310],[434, 863, 486, 990],[786, 182, 826, 290],[792, 547, 863, 600],[828, 201, 905, 303],[843, 468, 870, 545];woman:[116, 330, 226, 628],[421, 671, 506, 999],[552, 084, 631, 258],[786, 072, 914, 306],[793, 334, 999, 618];human hair:[202, 656, 258, 730],[423, 676, 493, 770],[593, 852, 643, 930],[659, 849, 707, 926],[700, 833, 745, 899],[838, 335, 891, 423];human head:[197, 660, 256, 745],[432, 676, 483, 752],[586, 847, 644, 943],[651, 850, 706, 930],[698, 838, 743, 910],[840, 338, 890, 415];human arm:[141, 425, 172, 502],[209, 415, 225, 504].') 2023-11-03 10:39:33.268 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 10:39:33.269 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Truck in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Truck:[275, 248, 795, 1000],[156, 545, 362, 926].') 2023-11-03 10:40:20.685 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:40:20.686 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[001, 316, 227, 850],[066, 011, 283, 1000],[209, 401, 642, 998],[266, 244, 569, 977],[449, 236, 808, 996],[514, 307, 767, 867],[554, 510, 929, 995],[581, 628, 912, 995],[633, 767, 872, 995],[762, 140, 951, 995],[058, 001, 150, 055],[151, 001, 233, 045];Bracelet:[914, 628, 944, 686];Hat:[713, 773, 867, 948],[647, 626, 761, 761],[691, 513, 801, 649],[299, 403, 427, 537],[266, 243, 381, 373];Helmet:[132, 292, 218, 455],[490, 233, 614, 415],[633, 303, 766, 480],[782, 147, 879, 275];Gloves:[665, 466, 776, 581],[483, 550, 585, 660],[113, 705, 230, 839],[039, 318, 130, 517],[082, 015, 145, 206];Belt:[359, 949, 406, 984].') 2023-11-03 10:41:06.620 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:41:06.621 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the vehicles moving to left?Use the specified TrackiFrame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'There are 2 vehicles moving to left.Track1frame:1:[038, 710, 200, 873];frame:2:[170, 696, 290, 711];frame:3:[198, 688, 347, 767],Track2frame:1:[016, 692, 662, 999];frame:2:[000, 708, 494, 999];frame:3:[000, 707, 428, 999].') 2023-11-03 10:42:00.021 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:42:00.021 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Chair:[884, 487, 999, 703];Baseball Glove:[291, 634, 361, 695];Baseball Bat:[709, 231, 759, 366],[351, 159, 401, 339];Person:[001, 620, 043, 899],[063, 484, 365, 909],[360, 226, 667, 922],[657, 263, 775, 732],[864, 097, 943, 319],[836, 015, 999, 302],[781, 001, 843, 123];Helmet:[444, 226, 505, 314],[166, 488, 251, 642],[001, 330, 044, 467];Sneakers:[079, 856, 132, 912],[138, 845, 213, 898],[360, 866, 407, 920],[590, 848, 666, 918].') 2023-11-03 10:42:52.671 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:42:52.672 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Sneakers:[322, 603, 352, 636],[354, 581, 403, 606],[450, 542, 467, 572],[596, 539, 616, 562],[645, 543, 664, 567],[700, 562, 738, 586],[717, 579, 756, 611];Swing:[136, 001, 405, 366],[381, 001, 712, 352],[714, 001, 1000, 372];Bench:[071, 262, 109, 284];Person:[700, 244, 787, 610],[596, 350, 705, 570],[450, 275, 552, 570],[283, 259, 402, 635].') 2023-11-03 10:42:57.146 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:42:57.146 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Guitar and Person in this image.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Guitar:[329, 579, 467, 707],[451, 610, 539, 680];Person:[270, 524, 349, 831],[339, 505, 441, 868],[358, 479, 440, 582],[452, 496, 540, 836],[512, 523, 594, 795],[574, 494, 657, 843],[639, 510, 714, 851].') 2023-11-03 10:43:01.127 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:43:01.127 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'High tech baby gadgets can make life with a newborn so much easier. Here are the top baby technology items [769, 603, 973, 966] to help you.') 2023-11-03 10:43:02.666 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:43:02.667 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Article image for Man lucky to be alive after tree crushes [294, 318, 657, 747] his car') 2023-11-03 10:43:21.123 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:43:21.123 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'People [002, 531, 994, 995] protest in the snow outdoors. One sign [205, 463, 426, 703] is visible and reads "Stand together against the racist G O P. All lives matter.') 2023-11-03 10:43:31.609 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:43:31.609 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[304, 047, 809, 994] Landon Donovan, shown here playing with the LA Galaxy, coaches the San Diego Loyal. His team forfeited a USL match Wednesday night after [741, 832, 833, 997] a homophobic slur was allegedly directed at a player.') 2023-11-03 10:44:08.484 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:44:08.484 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Bench:[308, 261, 362, 301],[722, 333, 753, 379].') 2023-11-03 10:44:14.076 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:44:14.077 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'boy:[596, 532, 832, 945],[398, 134, 506, 412],[471, 205, 625, 757];clothing:[570, 645, 822, 971],[208, 538, 396, 893],[311, 165, 399, 324],[348, 373, 454, 617],[361, 569, 548, 991],[411, 095, 494, 159],[440, 271, 501, 419],[484, 367, 583, 710],[563, 145, 662, 310],[566, 348, 678, 590],[620, 503, 703, 754];girl:[068, 411, 401, 998],[306, 038, 422, 335],[351, 249, 467, 602],[371, 385, 548, 998],[405, 000, 516, 164],[546, 355, 768, 883],[555, 000, 660, 314],[558, 221, 696, 611];human face:[288, 438, 368, 604],[348, 074, 397, 185],[421, 181, 475, 290],[429, 000, 478, 117],[434, 477, 520, 640],[493, 259, 558, 402],[584, 090, 635, 199],[619, 259, 678, 370],[666, 388, 739, 570],[697, 625, 798, 805].') 2023-11-03 10:44:41.897 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:44:41.898 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Top view [003, 004, 998, 995] from the stepwell side') 2023-11-03 10:44:59.819 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:44:59.820 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect human face,clothing,man and woman in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'human face:[039, 072, 103, 190],[076, 136, 153, 279],[146, 105, 210, 265],[211, 277, 296, 438],[292, 391, 377, 531],[360, 377, 468, 585],[385, 064, 446, 147],[441, 064, 500, 196],[468, 324, 583, 581],[500, 093, 553, 205],[553, 099, 608, 211],[573, 389, 698, 645],[610, 107, 664, 224],[668, 125, 719, 241],[746, 130, 800, 243],[748, 502, 876, 726],[838, 160, 930, 350],[845, 000, 911, 172],[892, 525, 999, 799];clothing:[000, 088, 999, 999],[251, 574, 584, 937];man:[000, 032, 139, 364],[001, 093, 268, 544],[020, 150, 361, 615],[027, 225, 341, 765],[244, 256, 650, 974],[280, 000, 333, 160],[362, 051, 492, 312],[380, 312, 785, 999],[739, 000, 944, 421],[821, 100, 965, 417];woman:[058, 308, 433, 837],[181, 276, 547, 925],[602, 347, 965, 999].') 2023-11-03 10:45:07.168 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 10:45:07.168 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'human arm:[575, 097, 999, 735];human hand:[573, 614, 760, 691],[578, 518, 811, 692].') 2023-11-03 10:45:41.361 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 10:45:41.362 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[002, 233, 342, 999],[321, 221, 771, 1000],[731, 266, 999, 999];Hat:[456, 222, 700, 486],[772, 264, 917, 481];Glasses:[547, 459, 685, 562].') 2023-11-03 10:45:48.856 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:45:48.856 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'boy:[416, 385, 536, 955];woman:[516, 309, 613, 976],[381, 344, 490, 960];man:[266, 326, 406, 979],[615, 318, 753, 979];footwear:[472, 925, 500, 991],[496, 915, 528, 987];human face:[323, 357, 368, 433],[428, 374, 460, 440],[525, 329, 563, 405],[653, 345, 695, 423].') 2023-11-03 10:46:00.202 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:46:00.202 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[001, 538, 120, 824] [527, 394, 658, 640] [054, 561, 181, 841] [788, 639, 980, 848] [322, 551, 417, 807] [396, 547, 514, 846] [723, 442, 878, 735] [209, 614, 341, 912] A quantity of vintage glass ware to include smokey glass part dressing set, green glasses and art') 2023-11-03 10:46:07.639 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:46:07.640 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Dinning Table:[066, 525, 481, 791];Chair:[038, 531, 132, 745],[124, 555, 282, 793],[288, 550, 442, 591];Storage box:[334, 570, 510, 658];Desk:[147, 614, 999, 1000];Frame:[336, 313, 453, 396];shelf:[594, 199, 854, 371],[600, 556, 839, 642];Lamp:[130, 141, 241, 198];bucket:[142, 748, 272, 999];Bottle:[278, 576, 348, 676];Cup:[616, 611, 716, 684];Refrigerator:[880, 325, 929, 645];Trash bin Can:[142, 746, 270, 1000].') 2023-11-03 10:46:37.206 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:46:37.207 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Desk:[046, 002, 143, 210],[191, 259, 270, 381],[006, 430, 101, 560],[157, 599, 230, 707],[192, 862, 286, 980],[410, 232, 485, 395],[407, 416, 473, 516],[409, 602, 482, 734],[515, 232, 571, 325],[526, 417, 580, 500],[517, 602, 580, 716],[723, 272, 787, 385],[885, 444, 966, 544],[759, 612, 820, 700],[713, 857, 789, 975],[820, 002, 918, 246];Chair:[137, 317, 200, 395],[121, 552, 161, 594],[223, 388, 259, 453],[239, 550, 271, 598];Person:[140, 141, 189, 254],[026, 002, 067, 113],[313, 199, 365, 302],[301, 278, 347, 381],[293, 428, 335, 523],[216, 387, 262, 452],[406, 484, 453, 588],[381, 390, 423, 467],[394, 291, 430, 389],[448, 289, 488, 395].') 2023-11-03 10:46:38.868 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 10:46:38.868 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Other Shoes:[420, 920, 534, 953];Handbag:[848, 672, 958, 784],[949, 639, 1000, 688];Person:[419, 419, 583, 953],[820, 513, 999, 871],[830, 422, 995, 741],[825, 475, 872, 613],[707, 526, 759, 617].') 2023-11-03 10:47:13.455 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:47:13.455 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Zendaya and [561, 016, 964, 998] John David Washington share a tense moment as they sit on their modern patio as seen in the black-and-white film Malcolm & Marie.') 2023-11-03 10:47:29.801 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:47:29.801 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[350, 403, 503, 519] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a destroyer escortFrame1:[350, 403, 503, 519];Frame2:[321, 373, 488, 487];Frame3:[323, 366, 488, 481];Frame4:[318, 356, 487, 472].') 2023-11-03 10:47:49.196 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 10:47:49.196 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'tree:[000, 000, 998, 999].') 2023-11-03 10:47:51.201 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:47:51.202 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Nightstand,Chair,Lamp,Bed,shelf,Meat ball,Piano and Board Eraser in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Nightstand:[001, 532, 093, 708];Chair:[833, 783, 999, 971];Lamp:[001, 292, 072, 537],[971, 067, 1000, 144];Bed:[157, 112, 852, 812];Cabinet:[917, 240, 999, 672].') 2023-11-03 10:47:58.837 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:47:58.837 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'palm tree:[440, 000, 999, 790].') 2023-11-03 10:48:06.459 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 10:48:06.460 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'bee:[000, 545, 068, 651],[000, 000, 999, 999],[003, 800, 076, 970],[006, 445, 058, 519],[009, 300, 096, 430],[018, 115, 121, 304],[031, 000, 136, 155],[044, 774, 128, 887],[046, 703, 145, 808],[058, 240, 143, 330],[065, 650, 145, 719],[081, 391, 154, 560],[095, 152, 161, 250],[116, 302, 206, 396],[118, 372, 199, 459],[120, 832, 198, 953],[121, 446, 181, 627],[142, 071, 241, 225],[159, 018, 266, 110],[160, 530, 288, 643],[182, 722, 318, 858],[187, 446, 241, 543],[205, 188, 284, 362],[227, 338, 323, 479],[271, 005, 388, 112],[271, 627, 395, 729],[275, 519, 380, 605],[279, 199, 338, 352],[285, 811, 375, 911],[311, 488, 412, 591],[318, 579, 407, 669],[330, 409, 443, 543],[332, 719, 435, 795],[335, 150, 457, 276],[345, 805, 466, 920],[364, 081, 445, 173],[368, 856, 504, 999],[378, 560, 495, 670],[396, 309, 492, 448],[422, 446, 511, 595],[425, 002, 533, 140],[439, 231, 545, 324],[440, 579, 586, 737],[451, 743, 531, 875],[461, 925, 550, 975],[464, 293, 572, 396],[466, 110, 530, 204],[504, 870, 596, 940],[519, 126, 659, 212],[525, 078, 607, 145],[531, 386, 603, 530],[531, 519, 628, 634],[535, 000, 626, 076],[556, 772, 615, 913],[572, 357, 642, 430],[577, 152, 710, 276],[594, 720, 716, 820],[601, 932, 655, 998],[603, 265, 704, 374],[606, 874, 677, 972],[611, 433, 718, 525],[629, 510, 781, 645],[633, 000, 750, 169],[637, 617, 765, 719],[638, 359, 727, 488],[638, 770, 761, 835],[648, 829, 781, 911],[666, 861, 800, 958],[681, 147, 767, 269],[681, 942, 783, 999],[686, 000, 815, 078],[692, 335, 816, 438],[706, 188, 861, 295],[731, 290, 875, 465],[735, 128, 815, 207],[753, 665, 847, 825],[758, 412, 885, 533],[771, 531, 895, 653],[781, 793, 904, 920],[803, 640, 896, 798],[803, 065, 926, 186],[820, 143, 975, 275],[861, 834, 990, 929],[870, 556, 925, 765],[871, 000, 943, 060],[880, 378, 963, 556],[894, 748, 999, 856],[896, 265, 999, 409],[911, 524, 962, 629],[929, 000, 999, 115],[950, 490, 999, 655].') 2023-11-03 10:48:18.440 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:48:18.440 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Guitar,Person and Cup in this image.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Guitar:[076, 311, 968, 866];Person:[001, 002, 779, 996];Cup:[508, 397, 578, 449].') 2023-11-03 10:48:55.064 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:48:55.064 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'woman:[678, 070, 831, 825];footwear:[638, 735, 710, 763],[706, 762, 783, 830],[758, 744, 812, 824];human face:[417, 156, 456, 227].') 2023-11-03 10:50:00.801 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:50:00.801 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'We looked at [268, 526, 881, 922] the prayer mat.') 2023-11-03 10:50:05.195 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:50:05.196 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[131, 361, 722, 672] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a hippo swimming in the waterFrame1:[131, 361, 722, 672];Frame2:[208, 272, 779, 563];Frame3:[237, 266, 789, 558];Frame4:[222, 266, 766, 555];Frame5:[168, 255, 716, 550].') 2023-11-03 10:51:15.541 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:51:15.541 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[663, 531, 750, 687] and track its trajectory.All trajectories in your reply should conform to the Frame t:[xmin,ymin,xmax,ymax] pattern.', 'This is a dogsledFrame1:[663, 531, 750, 687];Frame2:[638, 531, 718, 688];Frame3:[625, 531, 715, 687];Frame4:[615, 537, 703, 695];Frame5:[609, 540, 697, 700].') 2023-11-03 10:51:18.908 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:51:18.908 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[431, 512, 469, 719],personFrame1:[844, 547, 882, 748],personFrame1:[752, 497, 809, 650],personFrame1:[216, 486, 272, 637],personFrame1:[151, 472, 187, 615] in this video clip.When answering, maintain the format classFrame t:[xmin,ymin,xmax,ymax] without deviation.', 'personFrame1:[431, 512, 469, 719];Frame2:[523, 515, 559, 722],personFrame1:[844, 547, 882, 748];Frame2:[950, 555, 992, 772],personFrame1:[752, 497, 809, 650];Frame2:[808, 513, 887, 666],personFrame1:[216, 486, 272, 637];Frame2:[311, 484, 372, 622],personFrame1:[151, 472, 187, 615];Frame2:[256, 447, 292, 595].') 2023-11-03 10:51:53.895 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:51:53.895 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Other Shoes:[273, 900, 324, 962],[239, 764, 281, 833];SUV:[001, 001, 332, 689];Potted Plant:[839, 001, 999, 567],[410, 028, 584, 542];Person:[255, 548, 480, 963].') 2023-11-03 10:52:07.975 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:52:07.975 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Seattle Seahawks free safety Earl Thomas [343, 084, 851, 995] (29) returns a fumble for a touchdown in the first half of an NFL football game against the New Orleans Saints [224, 201, 442, 854] [024, 192, 273, 814] in New Orleans, Sunday, Oct. 30, 2016.') 2023-11-03 10:52:20.075 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:52:20.076 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me what is thisFrame1:[459, 337, 549, 466] and track its trajectory.Use the specified Frame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'This is a chameleon walking on the sandFrame1:[459, 337, 549, 466];Frame2:[456, 343, 546, 469];Frame3:[461, 340, 550, 468].') 2023-11-03 10:52:50.569 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:52:50.570 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the cat crawling under the steps?Maintain consistency in your answer by using the TrackiFrame t:[xmin,ymin,xmax,ymax] format for trajectories.', 'There is one cat crawling under the steps.Track1frame:1:[397, 458, 895, 998];frame:2:[377, 414, 673, 782];frame:3:[377, 440, 682, 799].') 2023-11-03 10:52:51.260 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 10:52:51.261 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'tree:[000, 474, 078, 532],[014, 117, 769, 194],[027, 807, 281, 860],[112, 797, 200, 861],[204, 457, 345, 530],[244, 824, 311, 867],[266, 112, 347, 182],[330, 817, 407, 865],[334, 433, 456, 523],[355, 120, 430, 180],[415, 409, 577, 563],[445, 083, 518, 183],[447, 826, 558, 894],[507, 361, 874, 563],[522, 780, 755, 918],[522, 045, 774, 204],[667, 795, 947, 925],[710, 703, 998, 998],[729, 095, 895, 215],[821, 090, 987, 209],[821, 399, 998, 568],[951, 390, 998, 500];vehicle:[000, 570, 216, 656],[000, 516, 519, 597],[009, 884, 135, 942],[016, 913, 120, 961],[135, 211, 323, 253],[138, 858, 412, 901],[152, 171, 560, 225],[201, 575, 615, 660],[301, 215, 445, 258],[441, 213, 604, 263],[556, 891, 856, 958],[652, 224, 788, 260],[848, 225, 967, 266].') 2023-11-03 10:55:51.863 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:55:51.864 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[478, 054, 556, 419],sledFrame1:[472, 254, 510, 387],personFrame1:[868, 001, 1000, 473] in this video clip.Ensure you use the exact format categoryFrame t:[xmin,ymin,xmax,ymax] in your response.', 'personFrame1:[478, 054, 556, 419];Frame2:[437, 109, 513, 468],sledFrame1:[472, 254, 510, 387];Frame2:[423, 301, 472, 436],personFrame1:[868, 001, 1000, 473];Frame2:[667, 045, 806, 577].') 2023-11-03 10:57:17.450 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:57:17.451 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[518, 640, 564, 724],personFrame1:[173, 508, 207, 695],personFrame1:[554, 694, 613, 801],personFrame1:[615, 625, 658, 731],personFrame1:[795, 512, 829, 689],personFrame1:[510, 675, 573, 778],personFrame1:[401, 625, 454, 729],personFrame1:[464, 656, 506, 751],personFrame1:[727, 793, 862, 856],personFrame1:[438, 514, 471, 693] in this video clip.For clarity in your response, always adopt the classFrame t:[xmin,ymin,xmax,ymax] format.', 'personFrame1:[518, 640, 564, 724];Frame2:[511, 645, 565, 725],personFrame1:[173, 508, 207, 695];Frame2:[173, 507, 207, 696],personFrame1:[554, 694, 613, 801];Frame2:[541, 680, 588, 814],personFrame1:[615, 625, 658, 731];Frame2:[595, 621, 641, 727],personFrame1:[795, 512, 829, 689];Frame2:[794, 512, 829, 689],personFrame1:[510, 675, 573, 778];Frame2:[503, 673, 563, 787],personFrame1:[401, 625, 454, 729];Frame2:[385, 623, 456, 732],personFrame1:[464, 656, 506, 751];Frame2:[439, 652, 503, 762],personFrame1:[727, 793, 862, 856];Frame2:[728, 795, 863, 857],personFrame1:[438, 514, 471, 693];Frame2:[438, 515, 471, 694].') 2023-11-03 10:57:38.749 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 10:57:38.749 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'suit:[327, 295, 484, 922];curtain:[424, 000, 980, 947];person:[326, 253, 480, 938].') 2023-11-03 10:57:43.979 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:57:43.979 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect table,tableware and human face in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'table:[000, 731, 301, 920],[000, 839, 494, 999],[183, 686, 505, 850],[312, 787, 999, 999];tableware:[727, 750, 861, 970],[000, 709, 063, 798],[330, 557, 373, 730],[401, 676, 470, 749],[498, 674, 538, 803],[680, 688, 735, 870];human face:[110, 460, 165, 555],[408, 454, 460, 549],[508, 399, 566, 499],[626, 387, 706, 553],[901, 344, 999, 545].') 2023-11-03 10:57:49.694 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:57:49.695 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Mirror,Sink,Chair and Blackboard/Whiteboard in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Mirror:[033, 291, 183, 539],[338, 360, 423, 531];Sink:[290, 568, 381, 608],[154, 579, 269, 628];Chair:[617, 596, 995, 930].') 2023-11-03 10:58:20.943 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:58:20.944 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[216, 062, 373, 810] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a personFrame1:[216, 062, 373, 810];Frame2:[224, 075, 352, 850];Frame3:[240, 114, 389, 818];Frame4:[235, 089, 384, 868];Frame5:[276, 083, 409, 795].') 2023-11-03 10:58:57.018 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:58:57.019 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The cast [002, 326, 178, 806] [712, 402, 781, 747] [839, 440, 886, 733] [875, 440, 935, 722] [463, 398, 552, 756] [767, 449, 833, 756] [148, 377, 255, 716] [374, 336, 480, 802] [600, 389, 682, 788] [539, 409, 612, 796] [660, 395, 740, 780] [224, 383, 395, 807] takes a bow onstage at the premiere of the revival of Angels in America at Neil Simon Theater on March 25 2018...') 2023-11-03 10:59:35.648 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 10:59:35.649 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[002, 002, 998, 996] An aerial view of an evergreen forest') 2023-11-03 11:00:26.468 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:00:26.468 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect footwear,human leg and hat in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'footwear:[141, 872, 229, 939],[250, 871, 332, 950],[435, 872, 517, 975],[535, 834, 616, 975],[748, 846, 831, 942],[863, 815, 936, 939];human leg:[803, 518, 923, 963],[145, 531, 230, 943],[216, 563, 334, 952],[434, 488, 521, 957],[495, 501, 615, 981],[750, 511, 832, 940];hat:[130, 096, 235, 209],[423, 103, 545, 246],[728, 060, 850, 194].') 2023-11-03 11:01:38.938 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:01:38.938 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Lifesaver:[123, 705, 148, 737];Street Lights:[634, 445, 751, 699];Boat:[038, 860, 185, 910],[001, 573, 101, 622],[816, 471, 1000, 587],[843, 585, 955, 668],[583, 753, 838, 849],[167, 560, 209, 603],[939, 542, 1000, 661];Sailboat:[338, 327, 470, 599].') 2023-11-03 11:02:08.666 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 11:02:08.666 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect glasses,man and dog in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'glasses:[210, 065, 330, 393],[434, 796, 542, 963];man:[043, 000, 998, 438];dog:[325, 241, 998, 998].') 2023-11-03 11:03:14.230 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:03:14.231 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[414, 598, 434, 705];Truck:[289, 569, 391, 685];Hotair ballon:[100, 438, 128, 491],[137, 060, 264, 318],[001, 263, 098, 496],[666, 005, 851, 293];Van:[516, 581, 655, 647].') 2023-11-03 11:03:16.456 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:03:16.457 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[596, 269, 645, 358] and track its trajectory.When detailing trajectories in your response, adhere to the Frame t:[xmin,ymin,xmax,ymax] format.', 'This is a faceFrame1:[596, 269, 645, 358];Frame2:[583, 287, 632, 377];Frame3:[571, 286, 617, 371];Frame4:[577, 248, 612, 311].') 2023-11-03 11:03:26.046 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:03:26.046 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Person:[270, 639, 320, 830],[221, 627, 270, 838].') 2023-11-03 11:03:43.176 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:03:43.177 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'person:[043, 079, 551, 998],[457, 342, 745, 995],[668, 718, 802, 991];drum:[000, 741, 045, 901],[314, 867, 350, 957],[329, 874, 456, 998],[421, 702, 470, 787],[430, 835, 510, 930],[456, 711, 491, 790],[674, 893, 810, 998],[810, 907, 924, 998];guitar:[058, 344, 627, 911],[481, 515, 901, 939];tree:[000, 000, 999, 755];clothing:[050, 252, 360, 998],[469, 480, 714, 998],[676, 764, 795, 998].') 2023-11-03 11:03:47.648 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 11:03:47.648 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Street Lights:[117, 001, 212, 875],[172, 311, 270, 523];Person:[141, 882, 235, 999],[310, 760, 398, 960].') 2023-11-03 11:04:00.094 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:04:00.095 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Even if you were able to grapple [200, 401, 808, 989] a wheelchair through [248, 002, 706, 584] the main door, the cubicle itself will halt your progress.') 2023-11-03 11:04:20.241 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:04:20.241 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[203, 329, 496, 933] SERVPRO employee pack outs a homes items') 2023-11-03 11:04:26.852 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:04:26.852 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'woman:[651, 381, 831, 985];clothing:[453, 368, 670, 979],[205, 375, 371, 999],[372, 364, 511, 945],[593, 395, 716, 929],[675, 424, 813, 995],[849, 515, 910, 584],[897, 570, 999, 774],[964, 565, 999, 630];man:[253, 266, 365, 999],[376, 305, 513, 995],[428, 340, 659, 938],[604, 327, 712, 994],[667, 322, 814, 999],[850, 492, 906, 597],[887, 545, 999, 799].') 2023-11-03 11:04:47.427 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:04:47.427 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect tree and bus in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'tree:[041, 000, 576, 450],[692, 000, 999, 219],[906, 348, 999, 552];bus:[136, 099, 913, 779].') 2023-11-03 11:04:49.629 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:04:49.630 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'sunglasses:[891, 484, 978, 530];swimwear:[299, 465, 388, 565],[323, 584, 426, 736],[583, 448, 683, 613],[700, 545, 809, 688];woman:[241, 337, 589, 656],[379, 199, 694, 988],[657, 302, 878, 897],[707, 375, 999, 999];girl:[250, 337, 426, 684],[373, 197, 676, 990],[649, 280, 860, 978],[715, 361, 999, 999];human face:[170, 422, 220, 533],[279, 359, 321, 458],[581, 306, 646, 445],[695, 330, 777, 499],[898, 464, 978, 601].') 2023-11-03 11:04:50.472 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:04:50.472 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Fan:[606, 248, 646, 309];Person:[872, 001, 1000, 995],[786, 124, 927, 853],[671, 050, 880, 932],[649, 154, 761, 822],[633, 200, 729, 783],[575, 212, 680, 755],[018, 424, 152, 997],[028, 377, 198, 1000],[250, 438, 414, 949];Watch:[935, 267, 958, 313];Backpack:[001, 539, 075, 809],[552, 504, 603, 610],[193, 580, 239, 715];Satchel:[174, 629, 207, 734];Sneakers:[634, 721, 667, 754],[634, 728, 687, 783],[676, 751, 744, 812],[712, 780, 792, 833],[682, 793, 789, 866],[787, 755, 844, 810],[865, 776, 928, 850],[819, 835, 883, 929],[890, 884, 978, 978],[966, 924, 998, 997].') 2023-11-03 11:04:55.975 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:04:55.975 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Chair:[891, 972, 1000, 1000],[679, 640, 909, 771],[542, 580, 708, 682],[155, 548, 368, 594],[004, 626, 125, 1000],[068, 715, 211, 1000];shelf:[864, 546, 997, 810],[581, 250, 930, 741],[001, 511, 277, 676];Lamp:[512, 161, 593, 193];Plate:[291, 835, 581, 946],[705, 726, 746, 748],[364, 612, 526, 649],[165, 655, 242, 677],[188, 590, 308, 623];Cup:[651, 689, 715, 789],[273, 614, 326, 692],[178, 630, 229, 670],[812, 202, 872, 257],[652, 230, 689, 263];Bread:[321, 800, 557, 924],[371, 604, 520, 647].') 2023-11-03 11:05:13.991 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:05:13.992 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A woman [396, 006, 935, 997] smiling and looking off into the distance while holding a financial newspaper [465, 543, 949, 997].') 2023-11-03 11:05:32.727 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:05:32.727 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[528, 391, 654, 693] and track its trajectory.To maintain consistency, ensure that trajectories in your answer match the Frame t:[xmin,ymin,xmax,ymax] setup.', 'This is a lesser pandaFrame1:[528, 391, 654, 693];Frame2:[546, 400, 611, 595];Frame3:[550, 412, 617, 616];Frame4:[517, 427, 617, 664];Frame5:[511, 422, 614, 691].') 2023-11-03 11:05:44.274 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:05:44.275 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[786, 414, 821, 501],carFrame1:[521, 525, 574, 617],carFrame1:[475, 523, 569, 652],carFrame1:[896, 352, 952, 506],carFrame1:[800, 404, 858, 515],carFrame1:[922, 504, 999, 678],carFrame1:[847, 371, 928, 505] in this video clip.Do not deviate from the established classFrame t:[xmin,ymin,xmax,ymax] format in your answer.', 'carFrame1:[786, 414, 821, 501];Frame2:[855, 367, 906, 491],carFrame1:[521, 525, 574, 617];Frame2:[504, 499, 581, 643],carFrame1:[475, 523, 569, 652];Frame2:[425, 512, 563, 705],carFrame1[896, 352, 952, 506];Frame2:No object,carFrame1:[800, 404, 858, 515];Frame2:[879, 345, 970, 510],carFrame1[922, 504, 999, 678];Frame2:No object,carFrame1:[847, 371, 928, 505];Frame2:[966, 309, 999, 491].') 2023-11-03 11:05:50.800 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:05:50.801 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Mature couple [107, 004, 425, 627] [621, 004, 949, 694] sitting around a camp stove [407, 355, 569, 783] at campsite') 2023-11-03 11:05:51.288 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:05:51.289 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'human body:[343, 348, 951, 983];jacket:[293, 506, 989, 1000];human head:[392, 198, 702, 654];man:[329, 302, 982, 996];human face:[419, 327, 679, 666];glasses:[381, 436, 701, 565];human ear:[627, 390, 683, 494];human hair:[383, 205, 677, 424];human nose:[479, 494, 534, 576].') 2023-11-03 11:06:13.100 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:06:13.101 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Person:[624, 115, 1000, 1000],[036, 276, 408, 997],[000, 650, 088, 771],[074, 001, 969, 1000],[083, 001, 140, 070],[000, 001, 094, 069];Necklace:[741, 435, 793, 549],[456, 268, 586, 403];Ring:[694, 876, 729, 925];Bracelet:[830, 856, 902, 973];Belt:[403, 821, 674, 965];Satchel:[705, 549, 999, 999];Bottle:[035, 256, 081, 339],[679, 644, 773, 997].') 2023-11-03 11:06:28.687 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:06:28.688 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the the three stationary cars..Stick to the TrackiFrame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'There are 3 the three stationary cars..Track1frame:1:[219, 955, 759, 999];frame:2:[223, 962, 722, 999];frame:3:[222, 932, 866, 999],Track2frame:1:[531, 151, 978, 634];frame:2:[525, 157, 975, 662];frame:3:[570, 125, 999, 636],Track3frame:1:[958, 236, 999, 504];frame:2:[957, 245, 999, 513].') 2023-11-03 11:06:31.322 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:06:31.322 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[001, 000, 299, 251],[707, 201, 1000, 410];Helmet:[308, 070, 488, 185];Handbag:[173, 082, 316, 182];Desk:[001, 248, 1000, 999],[275, 012, 980, 304],[840, 034, 1000, 104],[154, 006, 472, 055];Bench:[165, 165, 731, 338],[551, 094, 996, 213],[001, 147, 167, 265];Cup:[230, 182, 427, 405],[001, 346, 289, 680];Chopsticks:[759, 226, 808, 475];Plate:[134, 589, 1000, 968].') 2023-11-03 11:06:35.288 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:06:35.289 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Top-down image of graffiti on a sidewalk reading "ABOLISH POLICE" in pink spray-painted handwriting [000, 213, 996, 852]; two people [304, 003, 724, 215], visible from the waist down, are walking past') 2023-11-03 11:06:47.721 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:06:47.721 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "[065, 321, 950, 748] A 'maria flap' gateleg table by bruno mathsson for karl mathsson, dated 1968.") 2023-11-03 11:07:17.392 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:07:17.393 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the the child who pushed the bike and made it go. in this video clip?As you detail the trajectories in your reply, the TrackiFrame t:[xmin,ymin,xmax,ymax] format is the preferred structure.', 'There is one the child who pushed the bike and made it go..Track1frame:1:[053, 119, 332, 999];frame:2:[062, 131, 325, 998];frame:3:[078, 109, 309, 999].') 2023-11-03 11:07:46.630 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:07:46.630 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The cast [764, 291, 901, 719] [460, 006, 555, 266] [674, 387, 799, 720] [377, 022, 473, 277] [297, 315, 448, 612] [641, 101, 748, 298] [079, 305, 213, 601] [687, 205, 804, 408] [527, 056, 643, 271] [209, 498, 377, 861] [388, 549, 561, 858] [515, 433, 694, 838] sit around a large table with their scripts [786, 479, 842, 552] [407, 349, 499, 405] [178, 401, 273, 516] [617, 362, 719, 423] [517, 213, 609, 300] [331, 270, 412, 325] [426, 292, 473, 350] [197, 307, 281, 384] [607, 405, 721, 525] [291, 568, 412, 724]') 2023-11-03 11:08:26.888 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:08:26.888 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'tree:[000, 413, 056, 690],[000, 561, 999, 999],[959, 234, 998, 685].') 2023-11-03 11:08:34.645 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:08:34.646 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Person:[929, 418, 974, 734],[854, 421, 954, 750],[658, 878, 718, 1000],[639, 517, 734, 898],[551, 539, 658, 1000],[536, 506, 596, 908],[581, 451, 629, 625],[364, 429, 444, 752],[245, 397, 347, 635],[100, 371, 148, 646],[024, 318, 074, 442],[135, 372, 162, 442],[025, 247, 062, 359],[069, 249, 107, 438],[124, 250, 157, 394],[180, 237, 224, 438],[221, 239, 269, 442],[267, 241, 312, 389],[286, 345, 331, 435],[333, 337, 379, 448],[318, 311, 347, 392],[359, 305, 394, 445],[411, 263, 451, 442],[446, 250, 482, 442],[203, 001, 244, 183],[114, 179, 151, 267],[156, 047, 191, 198],[124, 047, 162, 198],[095, 078, 128, 268],[119, 001, 157, 081],[073, 001, 116, 180],[022, 001, 060, 170],[001, 059, 034, 286],[574, 263, 622, 446],[667, 264, 703, 449],[703, 271, 743, 449],[730, 269, 794, 447],[792, 275, 841, 452],[822, 343, 887, 455],[887, 318, 928, 450],[939, 270, 989, 460],[320, 020, 360, 197],[352, 019, 380, 196],[381, 022, 418, 191],[387, 164, 420, 278],[431, 162, 471, 279],[471, 178, 505, 349],[454, 094, 490, 215],[451, 016, 487, 170],[413, 002, 451, 189],[505, 082, 546, 267],[500, 178, 533, 291],[490, 011, 530, 171],[542, 152, 579, 246],[527, 131, 565, 218],[576, 174, 612, 243],[594, 131, 631, 249],[629, 176, 669, 254],[573, 096, 608, 198],[609, 095, 649, 194],[629, 069, 659, 157],[662, 089, 705, 248],[700, 061, 742, 249],[699, 013, 736, 105],[665, 058, 703, 128],[663, 022, 691, 078],[652, 001, 683, 056],[621, 019, 654, 101],[547, 045, 587, 165],[947, 130, 991, 254],[975, 126, 999, 254],[952, 022, 995, 132],[963, 001, 999, 067],[906, 121, 947, 254],[907, 111, 935, 203],[864, 067, 904, 253],[895, 069, 931, 219],[904, 001, 936, 094],[867, 001, 901, 071],[954, 097, 985, 146].') 2023-11-03 11:08:52.622 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:08:52.623 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Person:[000, 272, 148, 1000],[409, 347, 496, 774],[901, 514, 999, 956];Other Shoes:[747, 806, 793, 847];Handbag:[683, 719, 730, 785];Backpack:[572, 467, 605, 543];Hat:[632, 337, 684, 421];Bottle:[520, 623, 556, 692];Lamp:[246, 031, 365, 119],[160, 076, 190, 157],[045, 136, 068, 202],[397, 163, 477, 217],[576, 119, 639, 298],[669, 115, 769, 182],[726, 002, 763, 104],[123, 171, 179, 260];Frame:[678, 254, 752, 375];Carpet:[133, 750, 226, 916].') 2023-11-03 11:09:32.912 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:09:32.912 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'woman:[380, 600, 524, 1000],[018, 561, 060, 789],[182, 566, 228, 740],[240, 566, 266, 710],[320, 550, 343, 670],[352, 571, 391, 711],[611, 618, 708, 780],[861, 644, 999, 866];man:[685, 580, 861, 995],[000, 535, 037, 777],[053, 550, 100, 746],[256, 561, 295, 740],[375, 552, 428, 752],[493, 569, 542, 768],[580, 561, 638, 766],[626, 567, 662, 729],[660, 561, 715, 637],[690, 620, 726, 695],[809, 637, 900, 885],[891, 629, 954, 779],[896, 567, 933, 669],[927, 572, 971, 701],[951, 567, 995, 706];clothing:[688, 610, 854, 984],[000, 677, 019, 765],[000, 565, 024, 677],[002, 563, 037, 756],[013, 585, 061, 703],[020, 687, 051, 760],[064, 563, 105, 732],[066, 646, 093, 740],[075, 564, 094, 653],[183, 585, 210, 722],[221, 556, 999, 990],[265, 590, 294, 658],[385, 622, 420, 739],[399, 684, 519, 874],[403, 870, 500, 999],[439, 579, 476, 646],[503, 646, 526, 746],[542, 667, 577, 765],[542, 584, 582, 663],[585, 656, 629, 750],[598, 580, 631, 661],[670, 591, 716, 719],[853, 667, 904, 765],[930, 669, 962, 756],[933, 596, 958, 700];tree:[000, 000, 374, 435],[325, 005, 644, 637],[398, 000, 999, 747],[741, 200, 966, 616],[867, 084, 987, 712].') 2023-11-03 11:09:58.042 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:09:58.043 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '220W Four Column Foam Cutter Machine avec du ce (HG-A30T)') 2023-11-03 11:10:05.164 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:10:05.165 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Desk,Moniter and Laptop in this image.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Desk:[001, 493, 997, 997],[717, 352, 997, 515];TV:[001, 162, 027, 380],[245, 096, 505, 492],[755, 129, 925, 350],[686, 155, 725, 206];Laptop:[001, 312, 420, 764].') 2023-11-03 11:10:13.083 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:10:13.083 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Bus:[000, 001, 877, 513];Sneakers:[138, 605, 170, 663],[172, 623, 213, 668];Handbag:[461, 627, 679, 914],[132, 457, 191, 494];Glasses:[664, 408, 743, 435];Backpack:[525, 534, 672, 752];Person:[374, 410, 812, 1000],[232, 331, 344, 470],[315, 335, 422, 564],[128, 383, 232, 665],[851, 321, 894, 409];Umbrella:[111, 055, 217, 471];Bracelet:[545, 910, 569, 975];Watch:[571, 640, 601, 676];Bottle:[280, 521, 348, 677],[214, 440, 229, 478],[103, 414, 145, 491];Cup:[173, 432, 190, 473];Chair:[592, 678, 912, 1000],[362, 421, 451, 583],[305, 460, 358, 551],[186, 464, 421, 984];Desk:[276, 579, 600, 933],[000, 459, 234, 554];Candle:[295, 601, 336, 662],[116, 459, 141, 486];Potted Plant:[358, 604, 418, 682],[976, 222, 1000, 246];Flower:[350, 473, 412, 584],[208, 390, 228, 439],[918, 222, 951, 247],[856, 354, 911, 385];Vase:[347, 564, 408, 645],[205, 437, 233, 474];Bench:[000, 536, 169, 707];Whiteboard:[861, 299, 896, 357].') 2023-11-03 11:10:21.762 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:10:21.762 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect human face,fashion accessory,dress and woman in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'human face:[212, 349, 257, 420],[781, 352, 827, 431];fashion accessory:[122, 271, 292, 503],[186, 024, 323, 187],[368, 382, 532, 549],[394, 010, 667, 360],[453, 002, 613, 155],[682, 047, 896, 299],[693, 302, 872, 479],[719, 051, 855, 196];dress:[007, 444, 376, 947],[153, 119, 383, 277],[261, 490, 784, 952],[388, 132, 669, 365],[647, 434, 996, 947],[657, 153, 927, 297];woman:[054, 255, 362, 881],[150, 021, 379, 295],[262, 362, 773, 953],[654, 289, 998, 946].') 2023-11-03 11:10:31.277 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:10:31.278 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Helmet:[030, 300, 119, 403],[159, 143, 289, 347],[275, 173, 385, 294],[496, 330, 570, 444],[562, 341, 678, 444],[699, 245, 802, 365],[793, 323, 849, 369],[848, 293, 928, 445],[228, 333, 289, 397];Glasses:[168, 255, 259, 374],[278, 289, 365, 336],[523, 432, 555, 464],[605, 444, 664, 479],[732, 340, 784, 403];Hat:[283, 229, 368, 315],[710, 364, 748, 404],[899, 455, 936, 542];Gloves:[913, 709, 985, 852],[702, 761, 748, 879],[617, 814, 696, 913],[683, 664, 719, 745],[272, 782, 341, 901],[125, 770, 206, 844],[587, 681, 625, 741].') 2023-11-03 11:11:02.865 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:11:02.865 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Pot in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Pot:[116, 138, 776, 961].') 2023-11-03 11:11:09.646 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:11:09.646 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[128, 662, 291, 831],carFrame1:[276, 666, 343, 755],carFrame1:[339, 670, 387, 727] in this video clip.Use only the categoryFrame t:[xmin,ymin,xmax,ymax] format to structure your answer.', 'carFrame1:[128, 662, 291, 831];Frame2:[000, 660, 109, 932],carFrame1:[276, 666, 343, 755];Frame2:[137, 666, 287, 830],carFrame1:[339, 670, 387, 727];Frame2:[284, 672, 361, 762].') 2023-11-03 11:11:18.933 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:11:18.933 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'FILE - In this Dec. 17, 2019, file photo, Amazon packages move along [002, 274, 517, 996] a conveyor at an Amazon warehouse facility in Goodyear, Ariz. Amazon') 2023-11-03 11:11:22.624 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:11:22.625 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The time delay between the peak rainfall and peak discharge is due to the fact that most rain will fall onto the land rather than into [000, 016, 997, 993] a river and so these two events will not be simultaneous.') 2023-11-03 11:11:58.853 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:11:58.853 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[401, 329, 522, 836] and track its trajectory.Incorporate the Frame t:[xmin,ymin,xmax,ymax] format when presenting trajectories in your response.', 'This is a boneshakerFrame1:[401, 329, 522, 836];Frame2:[376, 329, 496, 829];Frame3:[363, 315, 479, 812];Frame4:[334, 321, 446, 796].') 2023-11-03 11:12:04.568 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:12:04.568 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Hockey Stick,Baseball,Desk,Bracelet and Sneakers in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Desk:[847, 405, 907, 437],[951, 414, 980, 443],[502, 402, 543, 427],[427, 395, 467, 426],[332, 392, 398, 417];Bracelet:[265, 547, 310, 566];Sneakers:[046, 685, 205, 728],[584, 524, 656, 569],[777, 520, 855, 557],[756, 464, 786, 476],[705, 447, 723, 469],[258, 444, 271, 453].') 2023-11-03 11:12:23.715 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 11:12:23.715 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Chair in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Chair:[027, 612, 248, 841],[743, 616, 997, 864],[656, 604, 862, 782].') 2023-11-03 11:12:27.286 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:12:27.287 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[601, 443, 906, 987] [258, 378, 495, 914] [054, 354, 302, 743] 3 students giving their best Peaky Blinders looks while sitting on [002, 339, 998, 998] an outcrop') 2023-11-03 11:12:58.603 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:12:58.603 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[619, 078, 675, 340],personFrame1:[849, 102, 910, 353],personFrame1:[573, 120, 614, 306],personFrame1:[718, 099, 755, 319],personFrame1:[820, 095, 862, 220],busFrame1:[323, 095, 378, 222],personFrame1:[960, 053, 999, 235] in this video clip.Please strictly follow the format categoryFrame t:[xmin,ymin,xmax,ymax] to answer the question.', 'personFrame1:[619, 078, 675, 340];Frame2:[675, 101, 745, 437],personFrame1[849, 102, 910, 353];Frame2:No object,personFrame1:[573, 120, 614, 306];Frame2:[605, 150, 657, 371],personFrame1:[718, 099, 755, 319];Frame2:[851, 114, 914, 448],personFrame1:[820, 095, 862, 220];Frame2:[940, 125, 991, 401],busFrame1:[323, 095, 378, 222];Frame2:[314, 125, 371, 257],personFrame1[960, 053, 999, 235];Frame2:No object.') 2023-11-03 11:12:59.478 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:12:59.479 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'boy:[178, 524, 536, 999];man:[000, 824, 124, 999],[179, 521, 530, 999],[477, 725, 743, 999];human face:[261, 709, 405, 960].') 2023-11-03 11:13:04.218 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:13:04.219 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Chair,Microphone,Person and Chips in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Chair:[315, 371, 441, 617],[638, 407, 734, 664],[158, 309, 224, 439];Microphone:[678, 198, 710, 412];Person:[652, 002, 935, 999],[353, 045, 592, 817],[002, 218, 081, 538].') 2023-11-03 11:13:21.236 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:13:21.236 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Volunteers carry [393, 399, 652, 903] an injured person rescued from the collapsed apartment.— AP') 2023-11-03 11:13:29.053 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:13:29.053 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'be a flamingo [500, 249, 584, 438] in flock of pigeon') 2023-11-03 11:13:33.493 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:13:33.493 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, can you tell me what thisframe:1:[187, 311, 582, 798];frame:2:[232, 311, 594, 819];frame:3:[214, 297, 665, 811] is?Tailor your answer briefly, using the image frames and trajectory coordinates given.', 'This is a/an the cow that was initially standing still and then walked to the right.') 2023-11-03 11:13:44.387 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:13:44.387 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'a group of women [699, 227, 996, 766] [184, 132, 431, 777] [383, 258, 998, 807] [006, 237, 372, 725] [055, 714, 994, 989] pose in colourful swimsuits [203, 245, 409, 477] [241, 762, 670, 962] [862, 362, 997, 688] [034, 362, 263, 664] [456, 352, 706, 631], some sit on chairs [736, 471, 998, 815] [444, 453, 712, 767] [003, 455, 337, 885].') 2023-11-03 11:14:28.024 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:14:28.024 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[268, 587, 687, 799] [257, 382, 538, 490] [268, 472, 568, 618] [248, 303, 506, 429] Four local characters marked the new pedestrian crossing in Bridge Street, Skibbereen with [205, 247, 699, 799] a humorous reenactment of the famous Beatles album cover Abbey Road.') 2023-11-03 11:14:40.704 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:14:40.705 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Camera:[186, 488, 209, 518];Volleyball:[549, 000, 596, 048];Person:[400, 002, 589, 758],[009, 343, 167, 871],[008, 361, 066, 519],[157, 471, 249, 699],[349, 348, 393, 523],[400, 366, 451, 555],[653, 396, 702, 524],[689, 476, 729, 701],[703, 001, 814, 810],[825, 487, 965, 799];Bracelet:[902, 614, 908, 630];Watch:[364, 503, 372, 512],[443, 486, 449, 494];Tent:[246, 001, 719, 523];Pickup Truck:[609, 400, 871, 530];Hat:[029, 362, 050, 400],[084, 345, 144, 415],[181, 469, 203, 495],[370, 348, 389, 367],[676, 396, 689, 414],[877, 486, 905, 519];Glasses:[031, 376, 049, 385],[112, 365, 136, 381],[423, 383, 438, 391],[760, 189, 781, 207],[878, 505, 904, 519];Backpack:[330, 395, 392, 498];Sneakers:[689, 676, 707, 699],[722, 682, 729, 697],[229, 678, 248, 694],[157, 684, 171, 698].') 2023-11-03 11:14:48.592 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:14:48.592 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Gushing over [004, 259, 997, 997] this floral-inspired utterly romantic Alessandro Angelozzi gown!') 2023-11-03 11:15:07.270 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:15:07.270 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'mobile phone:[192, 319, 519, 703];person:[365, 038, 999, 780];fashion accessory:[697, 293, 871, 546].') 2023-11-03 11:15:16.818 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:15:16.819 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'HHT induces apoptosis of NSCLC cells.(A): A549 and H1975 cells were treated with Gefitinib (3\u2009mM) or HHT at indicated concentrations for 24\u2009h and stained with Hoechst 33258 assay. (B): A549 and H1975 cells were treated with HHT, lysed and the protein samples were analysed by western blot with indicated antibodies. (C): A549 and H1975 cells were treated with HHT at indicated concentration and the mitochondrial transmembrane potential (ΔΨ) was tested by confocal microscopy (Olympus Fluoview FV-1000, Tokyo, Japan). (D): Ca2+(i) was measured using Ca2+ indicator FLUO-4 (Invitrogen) by flow cytometry assay. (E): H1975 cells were treated with HHT for 24 h, lysed and analysed by western blot with indicated antibodies. (F): H1975 cells were pretreated with Z-VAD-FMK (20 mM) for 1 h and then treated with HHT at 2 mM for 24 h, and the inhibition rate was determined by MTT assay. The mean±SD of three independent experiments is shown. ***, P\u2009<\u20090.01. (G): A549 and H1975 cells were treated with HHT for 24 h, lysed and analysed by western blot with indicated antibodies. The blots shown are derived from multiple gels. Membrane was cut based on the molecular weight, probed with antibody of interest and band of interest is indicated with an arrow. All the full-length blots are presented in Supplementary Figure 2.') 2023-11-03 11:15:47.656 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:15:47.656 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The three Febreze Air Effects Sprays [405, 491, 856, 884] set in front of a wheat floral display [000, 001, 993, 644] with a burlap bow off to the side.') 2023-11-03 11:15:51.413 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:15:51.413 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[102, 039, 863, 784] Mike Lookinland is wearing [342, 194, 588, 274] spectacles while leaning slightly on [001, 719, 997, 997] a table which consists of pictures of [769, 729, 997, 840] [430, 824, 847, 997] [010, 826, 428, 997] The Brady Bunch cast') 2023-11-03 11:15:54.539 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:15:54.539 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Person:[031, 122, 183, 412],[298, 148, 706, 999],[509, 107, 741, 581];Sneakers:[426, 926, 532, 1000];Desk:[646, 414, 999, 737];Chair:[739, 332, 973, 569],[492, 475, 717, 738],[002, 474, 187, 999];Toilet Paper:[744, 355, 824, 440];Spoon:[539, 246, 651, 294].') 2023-11-03 11:16:05.504 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:16:05.505 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[316, 001, 558, 819];Bracelet:[521, 257, 551, 297];Handbag:[434, 039, 519, 273],[320, 035, 370, 340];High Heels:[476, 739, 542, 812],[315, 721, 379, 816];Van:[795, 001, 999, 433];Pickup Truck:[637, 001, 809, 138];Rickshaw:[462, 001, 578, 034].') 2023-11-03 11:16:41.422 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:16:41.422 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect cattle,bull,clothing and human face in this image.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'cattle:[812, 811, 923, 954],[068, 816, 124, 953],[071, 275, 133, 422],[195, 551, 327, 707],[201, 037, 334, 176],[596, 030, 730, 174],[598, 545, 733, 700],[785, 284, 883, 429];bull:[070, 803, 130, 955],[071, 271, 137, 419],[187, 551, 321, 697],[192, 036, 331, 168],[597, 023, 728, 163],[602, 538, 729, 705],[784, 280, 888, 432],[789, 796, 917, 961];clothing:[286, 406, 608, 999];human face:[429, 217, 515, 399].') 2023-11-03 11:16:58.014 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:16:58.015 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "I'm a simple person [002, 000, 893, 996] - Funny Knitting coffee lover dogs cats owner Crochet joke coffee mug - crocheti") 2023-11-03 11:17:03.546 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:17:03.546 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Desk,Person and Lamp in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Desk:[115, 751, 365, 854];Person:[638, 416, 838, 999],[359, 547, 519, 813],[279, 539, 404, 749],[355, 529, 410, 660],[156, 540, 334, 775],[109, 550, 231, 756],[277, 548, 308, 632],[203, 538, 239, 629],[136, 553, 184, 747],[070, 537, 131, 681],[025, 532, 115, 726],[002, 537, 118, 765],[002, 675, 095, 907];Lamp:[210, 002, 374, 195],[176, 002, 285, 296],[050, 286, 104, 387],[707, 279, 777, 347],[879, 157, 976, 292],[953, 324, 1000, 383].') 2023-11-03 11:17:20.454 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:17:20.454 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '"contactless delivery" icon of delivery scooter [081, 180, 429, 652] and dinner service on its seat, and romford dragon logo [569, 131, 952, 609]') 2023-11-03 11:18:11.157 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:18:11.158 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[045, 017, 878, 998] a woman dressed up like [000, 379, 771, 756] a chipotle salad for halloween') 2023-11-03 11:19:20.013 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:19:20.014 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Person:[067, 120, 142, 429],[158, 003, 467, 1000],[387, 164, 607, 969],[505, 033, 631, 857],[589, 036, 830, 999];Picture:[099, 381, 181, 608].') 2023-11-03 11:19:33.115 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:19:33.115 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Paddle:[000, 559, 136, 715],[264, 520, 785, 605],[805, 758, 999, 867],[236, 618, 461, 698];Boat:[000, 623, 999, 840];Person:[504, 247, 970, 716],[000, 225, 322, 692].') 2023-11-03 11:19:41.216 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:19:41.217 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Street Lights:[837, 329, 999, 526],[188, 676, 336, 910],[110, 900, 154, 979],[286, 929, 325, 987],[816, 776, 879, 947];Bus:[002, 928, 333, 1000].') 2023-11-03 11:19:47.575 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:19:47.575 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Hand-drawn sketch. Costume design [411, 056, 944, 783] depicting a man and a woman with beets for heads. Emerald green and chartreuse fabric swatches attached.') 2023-11-03 11:20:06.627 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:20:06.627 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect brassiere,vehicle and person in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'vehicle:[248, 505, 585, 696];person:[112, 356, 194, 743],[583, 366, 653, 670].') 2023-11-03 11:20:38.064 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:20:38.064 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Couch:[002, 336, 255, 768],[586, 253, 748, 406];Stool:[621, 395, 773, 574];Desk:[707, 378, 901, 516],[112, 649, 999, 999],[002, 240, 181, 503];Potted Plant:[616, 078, 813, 371];Chair:[795, 410, 999, 666];Pillow:[064, 392, 125, 526],[116, 440, 178, 586];Bakset:[132, 748, 409, 945],[394, 724, 748, 1000];Hat:[470, 035, 639, 176];Carrot:[551, 674, 586, 834];Canned:[758, 591, 813, 767];Spoon:[149, 804, 384, 872],[245, 775, 371, 874];Bottle:[912, 705, 999, 829];Cup:[748, 711, 798, 797];Microwave:[151, 049, 226, 112],[131, 145, 211, 201];Person:[028, 063, 139, 235],[315, 030, 643, 902].') 2023-11-03 11:20:44.719 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:20:44.719 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Necklace in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Necklace:[159, 383, 246, 500].') 2023-11-03 11:20:45.858 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 11:20:45.858 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect human face,suit and motorcycle in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'human face:[039, 129, 134, 336],[095, 274, 148, 353],[164, 257, 189, 324],[210, 156, 296, 321],[339, 320, 378, 383],[482, 255, 545, 376],[611, 310, 660, 391],[717, 376, 756, 447];suit:[000, 351, 249, 998],[082, 277, 355, 998];motorcycle:[778, 540, 992, 801].') 2023-11-03 11:21:22.915 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:21:22.915 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'clothing:[008, 712, 151, 990],[000, 353, 999, 999];girl:[000, 491, 061, 757],[000, 621, 193, 999],[098, 414, 223, 599],[195, 556, 441, 999],[254, 404, 306, 546],[276, 447, 436, 832],[313, 333, 389, 546],[371, 348, 405, 430],[385, 418, 446, 615],[395, 353, 513, 604],[431, 331, 468, 450],[460, 333, 563, 607],[511, 325, 578, 515],[548, 303, 591, 404],[581, 350, 632, 465],[620, 345, 732, 577],[668, 333, 723, 471].') 2023-11-03 11:21:27.907 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:21:27.908 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect paddle,man and musical instrument in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'man:[072, 167, 125, 270],[179, 741, 411, 999],[187, 374, 376, 746],[288, 140, 339, 237],[365, 365, 521, 770],[391, 333, 510, 605],[405, 167, 508, 250],[485, 287, 585, 616],[521, 381, 684, 775],[668, 392, 860, 818],[670, 115, 783, 198],[728, 333, 828, 508],[801, 702, 999, 999];musical instrument:[070, 175, 095, 270],[144, 251, 443, 724],[309, 450, 338, 600],[321, 127, 343, 240],[357, 417, 424, 575],[483, 162, 499, 256],[538, 264, 632, 684],[575, 435, 631, 517],[660, 515, 736, 632],[664, 441, 764, 485],[680, 395, 844, 685],[723, 138, 783, 205],[841, 417, 913, 474].') 2023-11-03 11:21:31.271 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:21:31.271 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Red Panda Yarn Tail Set by KiRAWRa') 2023-11-03 11:22:02.442 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:22:02.442 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Wine Glass,Person,Vase and Chair in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Wine Glass:[019, 616, 070, 789],[337, 666, 402, 874],[431, 656, 494, 823],[924, 765, 999, 998];Person:[540, 323, 797, 801],[283, 308, 572, 684];Vase:[131, 625, 180, 720];Chair:[774, 537, 967, 842].') 2023-11-03 11:22:17.500 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:22:17.500 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Flower:[038, 002, 168, 540],[466, 242, 864, 633];Vase:[001, 165, 037, 396],[001, 734, 043, 973],[071, 625, 175, 858],[084, 928, 190, 997],[093, 002, 155, 066],[222, 002, 292, 072],[208, 137, 316, 384],[207, 416, 313, 658],[205, 750, 319, 990],[357, 741, 475, 996],[374, 410, 452, 663],[359, 112, 442, 357],[365, 002, 437, 066],[527, 002, 748, 083],[533, 152, 764, 316],[530, 502, 776, 674],[541, 769, 773, 938],[825, 890, 929, 1000],[814, 404, 940, 838],[795, 002, 923, 350],[958, 399, 998, 609],[086, 334, 158, 576].') 2023-11-03 11:22:27.662 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:22:27.663 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Person:[117, 311, 274, 529],[049, 600, 190, 928],[130, 568, 199, 730],[181, 531, 422, 997],[001, 803, 183, 999],[346, 448, 465, 791],[474, 453, 580, 810],[579, 469, 689, 815],[678, 451, 781, 828],[630, 520, 858, 998],[714, 607, 999, 998],[929, 550, 999, 867];Book:[656, 711, 695, 785];Leather Shoes:[687, 781, 729, 828];Speaker:[246, 164, 280, 222],[824, 141, 865, 208].') 2023-11-03 11:22:38.019 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:22:38.020 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[080, 325, 297, 914] [723, 165, 956, 725] Two men wearing waders are standing in one of the Great Lakes with a net stretched between them that they are dragging in the water.') 2023-11-03 11:22:50.210 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:22:50.211 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect clothing,table,desk,chair and person in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'clothing:[000, 386, 218, 978],[690, 357, 922, 472];table:[570, 410, 968, 848],[000, 554, 594, 917],[716, 685, 998, 997];desk:[006, 538, 588, 997];chair:[240, 522, 512, 994],[398, 373, 658, 882],[444, 458, 688, 997],[702, 354, 796, 813],[980, 504, 998, 685];person:[002, 253, 212, 997],[686, 274, 918, 482].') 2023-11-03 11:22:56.303 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 11:22:56.303 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[142, 109, 987, 999],[002, 118, 481, 999];Helmet:[544, 112, 727, 438],[099, 192, 263, 464];Gloves:[239, 123, 401, 328],[140, 531, 307, 709],[463, 726, 658, 916].') 2023-11-03 11:23:30.364 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:23:30.365 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2 and frame3, please tell me what this oneframe:3:[470, 358, 493, 399] is doing?Please briefly answer the question based on the provided image frames and trajectory coordinates.', 'This is a/an rabbit running around') 2023-11-03 11:24:06.089 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:24:06.089 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'house:[003, 007, 997, 996].') 2023-11-03 11:24:29.366 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:24:29.366 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'bicycle:[056, 410, 146, 526],[070, 423, 145, 531],[123, 402, 242, 499],[416, 521, 484, 670];bicycle wheel:[114, 458, 146, 530],[421, 556, 442, 658],[447, 561, 473, 677];person:[000, 365, 059, 498],[036, 369, 077, 505],[076, 327, 120, 525],[143, 340, 186, 522],[415, 351, 497, 630];wheel:[050, 550, 091, 645],[114, 456, 146, 530],[334, 529, 383, 593],[421, 543, 448, 663],[451, 556, 473, 679],[516, 534, 568, 600];clothing:[000, 398, 046, 480],[048, 390, 067, 469],[063, 359, 121, 482],[115, 367, 126, 451],[147, 356, 190, 474],[271, 388, 285, 474],[391, 384, 505, 551];building:[000, 025, 264, 388],[083, 000, 998, 519];car:[000, 324, 090, 645],[308, 340, 585, 596].') 2023-11-03 11:25:44.769 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:25:44.769 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Chair:[772, 581, 850, 711];Person:[494, 499, 836, 999],[338, 567, 531, 1000],[002, 582, 141, 999],[191, 361, 315, 886];Satchel:[471, 699, 598, 999].') 2023-11-03 11:26:05.873 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:26:05.873 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Lamp:[467, 015, 835, 136],[001, 001, 169, 122],[155, 068, 381, 236];Person:[775, 448, 850, 777],[751, 453, 818, 723],[662, 493, 743, 732],[430, 446, 584, 1000],[384, 484, 448, 736],[296, 567, 390, 816].') 2023-11-03 11:26:59.169 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:26:59.169 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Baseball Glove:[315, 384, 348, 432],[769, 560, 808, 626];Sneakers:[938, 833, 998, 899];Helmet:[955, 387, 999, 459];Hat:[864, 351, 919, 429];Person:[243, 238, 349, 701],[401, 315, 489, 732],[367, 479, 422, 606],[531, 420, 613, 602],[608, 394, 692, 601],[758, 352, 999, 901],[790, 324, 863, 696],[951, 388, 999, 739],[249, 512, 326, 612];Chair:[631, 488, 691, 595],[557, 459, 615, 600].') 2023-11-03 11:27:04.698 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:27:04.699 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'jacket:[074, 295, 199, 608],[138, 381, 384, 999],[483, 447, 671, 976],[646, 234, 790, 659];woman:[481, 249, 681, 999];man:[074, 191, 215, 999],[638, 238, 789, 999];tree:[000, 000, 171, 462],[145, 000, 215, 268],[181, 000, 368, 249],[196, 000, 263, 266],[351, 000, 411, 497],[403, 000, 478, 328],[434, 000, 644, 314],[471, 000, 532, 436],[766, 000, 999, 420],[896, 000, 960, 499],[953, 268, 999, 599].') 2023-11-03 11:27:38.112 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:27:38.112 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Desk:[050, 134, 214, 285],[386, 381, 982, 771];Person:[001, 474, 109, 785],[109, 001, 172, 138],[135, 052, 188, 197],[132, 319, 262, 846],[362, 150, 574, 619],[381, 371, 574, 1000],[599, 538, 674, 725],[709, 255, 904, 617];bucket:[749, 513, 849, 681],[098, 730, 166, 794];Hat:[440, 372, 543, 482].') 2023-11-03 11:27:57.883 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:27:57.883 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Glasses,Necklace and Lamp in this image.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Glasses:[388, 170, 539, 260];Necklace:[430, 428, 552, 581];Lamp:[871, 001, 966, 123].') 2023-11-03 11:28:03.729 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:28:03.730 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect fashion accessory and helmet in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'fashion accessory:[464, 334, 565, 363],[686, 763, 951, 816];helmet:[087, 199, 303, 390].') 2023-11-03 11:28:25.946 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:28:25.947 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The staff [512, 217, 581, 605] [579, 201, 652, 609] [439, 011, 515, 303] [536, 016, 611, 304] [374, 005, 453, 304] and volunteers [464, 413, 538, 738] [656, 772, 734, 997] in Reynosa after pouring a roof for a house for a family in Mexico') 2023-11-03 11:28:50.111 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:28:50.111 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Aerial view of a new autonomous house with solar panels [390, 088, 450, 214] [311, 353, 376, 488] [681, 361, 783, 474] [528, 114, 598, 256] [469, 212, 559, 318] [664, 272, 764, 382] [582, 078, 711, 265] [364, 323, 471, 478] and water heating radiators [001, 003, 340, 204] on the roof [185, 012, 961, 747] and green yard [002, 005, 987, 998] with blue swimming pool [290, 702, 566, 982].') 2023-11-03 11:29:17.442 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:29:17.442 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Person:[580, 600, 663, 756];Bicycle:[101, 666, 253, 883];Bench:[184, 717, 364, 916],[263, 682, 302, 730],[273, 678, 311, 702].') 2023-11-03 11:29:32.641 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:29:32.642 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Car:[209, 360, 242, 395],[464, 482, 551, 565],[649, 604, 999, 827];Heavy Truck:[286, 332, 425, 459];Street Lights:[303, 289, 336, 335],[374, 268, 418, 353],[263, 301, 287, 391],[651, 188, 737, 610];SUV:[390, 428, 461, 505],[335, 426, 455, 553];Truck:[693, 445, 998, 745];Handbag:[027, 375, 058, 426];Person:[000, 256, 054, 441];Awning:[544, 454, 645, 491].') 2023-11-03 11:29:36.147 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:29:36.147 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'sheep:[172, 296, 710, 831].') 2023-11-03 11:30:04.690 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:30:04.690 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Bright yellow tents [800, 719, 884, 801] [601, 723, 678, 802] [670, 751, 746, 832] [206, 710, 260, 789] [116, 710, 176, 783] [431, 714, 497, 797] [311, 715, 373, 792] under a large blue sky [001, 003, 996, 740] on the very flat and snow covered Thwaites Glacier on Antarctica [002, 717, 993, 997].') 2023-11-03 11:30:14.557 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 11:30:14.557 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Street Lights:[789, 281, 845, 503];Person:[376, 560, 494, 1000],[480, 543, 590, 999].') 2023-11-03 11:30:38.281 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:30:38.281 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Work it:\xa0Wearing [163, 164, 520, 514] a teal dinner jacket and [193, 402, 508, 938] straight cut suit pants Hana kept it chic while flashing her black bra in a sheer roll neck top (pictured with\xa0[459, 032, 828, 973] Lady Amelia Windsor') 2023-11-03 11:30:45.123 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:30:45.124 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Lamp:[635, 303, 753, 401],[387, 450, 432, 493],[430, 508, 470, 541];Flower:[874, 633, 999, 810],[698, 633, 817, 716];Person:[605, 610, 704, 839],[509, 592, 620, 836];Motorcycle:[048, 650, 248, 856],[325, 632, 395, 704],[718, 655, 930, 824].') 2023-11-03 11:30:59.927 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:30:59.927 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'View over caravan park with [224, 309, 297, 392] [509, 284, 574, 361] [081, 355, 204, 435] [621, 268, 718, 347] [681, 652, 876, 864] [001, 635, 178, 826] [045, 418, 182, 516] [008, 501, 181, 634] caravans on the pitches') 2023-11-03 11:31:12.601 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:31:12.602 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[042, 404, 148, 470],carFrame1:[134, 342, 335, 598],carFrame1:[324, 413, 384, 503] in this video clip.When answering, maintain the format classFrame t:[xmin,ymin,xmax,ymax] without deviation.', 'carFrame1[042, 404, 148, 470];Frame2:No object,carFrame1:[134, 342, 335, 598];Frame2:[172, 356, 344, 579],carFrame1:[324, 413, 384, 503];Frame2:[329, 415, 389, 495].') 2023-11-03 11:31:19.241 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:31:19.241 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Potted Plant:[001, 539, 358, 1000],[700, 678, 924, 1000],[754, 405, 998, 931],[579, 415, 796, 688];Flower:[700, 329, 798, 395],[001, 177, 299, 418];Vase:[689, 388, 821, 460],[022, 372, 282, 519];Lamp:[708, 129, 841, 296];Carpet:[377, 905, 644, 1000];shelf:[398, 590, 477, 824].') 2023-11-03 11:31:33.247 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:31:33.248 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Congress vice president Rahul Gandhi during a meeting with [612, 351, 795, 997] [472, 334, 633, 997] [704, 350, 995, 995] [301, 356, 492, 996] [135, 391, 341, 997] home-buyers at party headquarter in New Delhi on May 2, 2015.') 2023-11-03 11:31:35.746 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:31:35.746 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given an image,please detect person in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'person:[867, 819, 927, 1000],[909, 810, 956, 1000],[785, 833, 840, 1000],[834, 797, 892, 988],[883, 756, 922, 944],[737, 893, 779, 1000],[679, 908, 742, 1000],[513, 899, 580, 1000],[444, 875, 519, 1000],[368, 903, 422, 1000],[261, 926, 318, 1000],[227, 896, 277, 1000],[214, 834, 270, 1000],[123, 869, 177, 1000],[388, 854, 434, 1000],[473, 761, 521, 955],[513, 753, 557, 935],[520, 778, 563, 944],[543, 787, 593, 961],[553, 767, 601, 938],[673, 692, 709, 863],[642, 662, 680, 852],[668, 651, 699, 795],[723, 650, 758, 801],[745, 720, 787, 886],[770, 695, 818, 872],[808, 712, 859, 889],[835, 695, 885, 876],[827, 660, 868, 824],[856, 687, 897, 842],[900, 688, 951, 867],[970, 714, 1000, 875],[736, 526, 768, 635],[770, 526, 802, 637],[752, 541, 795, 650],[196, 468, 229, 575],[224, 469, 256, 560],[252, 460, 287, 569],[353, 465, 388, 560],[380, 475, 412, 572],[448, 471, 485, 560],[965, 688, 998, 811],[961, 958, 1000, 1000],[085, 402, 246, 464],[564, 411, 638, 458],[560, 366, 623, 411],[550, 326, 589, 367],[010, 452, 043, 518].') 2023-11-03 11:31:51.102 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:31:51.102 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'man:[001, 503, 144, 703];woman:[019, 564, 299, 999],[301, 545, 636, 999],[693, 579, 923, 999],[862, 580, 998, 999];fashion accessory:[316, 260, 584, 954];human face:[071, 521, 129, 593],[784, 593, 834, 660],[907, 603, 974, 658].') 2023-11-03 11:31:52.108 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:31:52.109 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Image of [326, 515, 577, 996] woman in [387, 510, 474, 568] a white beanie, [375, 646, 478, 728] blue mittens and [323, 575, 548, 875] a winter coat sitting in front of [018, 005, 993, 953] a store next to a photoshopped image of [499, 526, 700, 969] Bernie Sanders from the 2021 presidential inauguration') 2023-11-03 11:31:54.045 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:31:54.045 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[387, 443, 417, 525] and track its trajectory.For the trajectories included in the answer, please use the format Frame t:[xmin,ymin,xmax,ymax].', 'This is a personFrame1:[387, 443, 417, 525];Frame2:[361, 466, 392, 554];Frame3:[435, 465, 481, 552];Frame4:[489, 338, 541, 444];Frame5:[442, 343, 526, 491].') 2023-11-03 11:32:00.739 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:32:00.740 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Cup,Hat and Sneakers in this image.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Cup:[024, 464, 102, 606],[046, 784, 120, 891],[137, 787, 204, 881],[218, 565, 272, 665];Hat:[369, 000, 590, 215];Sneakers:[817, 198, 982, 296].') 2023-11-03 11:32:02.764 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:32:02.764 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'tree:[051, 072, 805, 543],[000, 379, 771, 505],[742, 399, 881, 502],[873, 341, 998, 512].') 2023-11-03 11:32:43.490 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:32:43.491 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Cymbal,Guitar,Person,Glasses and Speaker in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Cymbal:[001, 571, 136, 628];Guitar:[167, 380, 938, 636],[656, 560, 999, 711];Person:[135, 170, 697, 999],[618, 392, 1000, 998],[175, 779, 297, 1000];Glasses:[778, 427, 871, 450];Speaker:[372, 661, 653, 871],[355, 858, 679, 1000],[662, 872, 915, 1000].') 2023-11-03 11:32:55.462 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:32:55.462 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Bench:[101, 446, 926, 834];Hat:[405, 241, 461, 316];Other Shoes:[595, 774, 652, 848],[407, 706, 465, 793],[381, 728, 428, 847];Sneakers:[052, 502, 123, 577];Handbag:[919, 668, 999, 843];Person:[052, 254, 331, 835],[334, 239, 509, 850],[498, 245, 714, 852],[692, 260, 958, 846].') 2023-11-03 11:33:04.201 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:33:04.201 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[173, 050, 821, 843] Ed Mullins during an appearance on Fox News Friday. A coffee mug with the logo of QAnon sits above his left shoulder in the bottom right of the photo.') 2023-11-03 11:33:40.930 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:33:40.930 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[009, 006, 993, 968] Vanilla cupcakes with red, white and blue frosting and a cherry on top.') 2023-11-03 11:33:53.064 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:33:53.065 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Horse,Necklace and Stool in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Necklace:[374, 310, 418, 395];Stool:[209, 413, 335, 907].') 2023-11-03 11:34:11.485 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:34:11.485 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A strawberry galette [091, 623, 924, 930] with vanilla bean and chia seeds in progress with copy space') 2023-11-03 11:34:25.194 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 11:34:25.194 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'tree:[000, 599, 806, 999],[773, 617, 999, 999].') 2023-11-03 11:34:35.208 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 11:34:35.208 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'car:[557, 393, 873, 727],[000, 462, 079, 670],[823, 389, 998, 629];wheel:[031, 552, 076, 668],[551, 529, 581, 614],[576, 610, 601, 712],[781, 670, 856, 708],[914, 514, 985, 633].') 2023-11-03 11:35:30.815 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:35:30.815 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[311, 298, 432, 741] [443, 309, 554, 749] Two golfers and [209, 363, 316, 694] their caddies walk down [090, 466, 859, 976] the fairway at the Carnoustie Open Championship final rounds, 1953.') 2023-11-03 11:35:31.786 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:35:31.786 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Person:[459, 022, 958, 999],[062, 059, 582, 999],[375, 362, 534, 784],[001, 000, 317, 197];Umbrella:[708, 251, 998, 525];Hat:[662, 187, 810, 262],[387, 365, 458, 404];Glasses:[677, 239, 790, 271],[291, 192, 327, 222];Other Shoes:[492, 779, 524, 795].') 2023-11-03 11:35:35.566 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:35:35.567 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Car and Sneakers in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Car:[000, 060, 165, 934],[007, 197, 224, 537],[303, 000, 415, 131],[317, 024, 925, 938],[644, 198, 1000, 532],[676, 027, 826, 158],[884, 028, 1000, 167],[805, 603, 1000, 1000];Sneakers:[227, 069, 236, 079].') 2023-11-03 11:35:54.722 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:35:54.722 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[400, 443, 460, 577] and track its trajectory.For the trajectories included in the answer, please use the format Frame t:[xmin,ymin,xmax,ymax].', 'This is a airplane colored in blue and white taking off and flying in the airFrame1:[400, 443, 460, 577];Frame2:[486, 456, 583, 580];Frame3:[531, 501, 632, 620];Frame4:[564, 537, 657, 650].') 2023-11-03 11:36:17.665 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:36:17.665 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Close-up of a woman [123, 132, 969, 993] holding a chronometer [721, 153, 904, 531] to measure performance against swimmer [188, 114, 749, 995] happy in the pool') 2023-11-03 11:36:44.109 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:36:44.109 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "crowds [490, 007, 902, 991] are jamming the aisles of a riyadh coffee and chocolate exhibition this week as saudis [286, 020, 435, 775] indulge the country's sweet tooth and craze for caffeine - caffeine stock videos [462, 748, 717, 996] & royalty-free footage [648, 710, 719, 872]") 2023-11-03 11:36:55.938 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 11:36:55.939 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'suit:[051, 026, 356, 919],[000, 354, 061, 529],[318, 454, 683, 867],[586, 235, 885, 862],[828, 266, 993, 999];woman:[314, 354, 643, 882];man:[821, 125, 987, 936],[000, 265, 068, 532],[048, 011, 368, 855],[585, 078, 882, 876];human face:[162, 065, 238, 221],[401, 496, 486, 601],[698, 125, 775, 281],[845, 157, 920, 321].') 2023-11-03 11:36:59.749 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:36:59.749 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2 and frame3, please tell me what this oneframe:1:[362, 779, 999, 999];frame:2:[362, 779, 999, 999];frame:3:[235, 742, 999, 999] is doing?Craft a concise reply using the image frames and trajectory specifics you have at hand.', 'This is a/an person looking at tiger') 2023-11-03 11:37:07.452 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:37:07.453 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Person:[016, 116, 421, 999],[082, 071, 588, 1000],[726, 067, 999, 999],[579, 074, 756, 965];Other Shoes:[397, 832, 466, 896];High Heels:[609, 858, 676, 961].') 2023-11-03 11:37:17.866 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:37:17.866 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'woman:[751, 586, 853, 790],[111, 643, 239, 921],[117, 561, 186, 666],[182, 425, 221, 524],[205, 579, 306, 700],[770, 801, 999, 999];man:[368, 349, 644, 999];girl:[000, 577, 151, 810],[124, 558, 190, 666],[179, 431, 221, 535],[772, 779, 999, 999];clothing:[041, 645, 146, 739],[045, 684, 152, 806],[118, 564, 176, 672],[120, 732, 197, 892],[158, 825, 348, 999],[205, 641, 306, 715],[244, 645, 340, 770],[329, 705, 452, 790],[336, 740, 452, 977],[456, 551, 652, 999],[630, 658, 663, 739],[661, 650, 753, 801],[759, 625, 851, 761],[823, 641, 921, 761],[946, 658, 999, 766];human face:[069, 643, 123, 714],[158, 669, 207, 755],[208, 722, 280, 836],[361, 650, 415, 744],[467, 386, 543, 536],[857, 800, 948, 953].') 2023-11-03 11:37:29.736 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:37:29.737 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Folks [134, 423, 430, 990] [489, 443, 702, 979] [138, 658, 599, 997] [713, 371, 980, 996] posing with their fish [300, 323, 358, 442] [198, 317, 260, 463] [718, 230, 771, 333] [186, 185, 278, 306] [517, 220, 584, 344] [618, 227, 685, 351] [304, 206, 367, 335] [418, 210, 473, 324] [803, 234, 867, 362] [657, 312, 748, 592] [386, 290, 481, 616] [476, 305, 574, 553] after their deep sea charter fishing trip') 2023-11-03 11:37:38.115 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:37:38.115 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the elephant walk right?When you incorporate trajectories into your answer, adhere to the TrackiFrame t:[xmin,ymin,xmax,ymax] format.,Present the trajectories in your response using the designated TrackiFrame t:[xmin,ymin,xmax,ymax] layout.', 'There is one elephant walk right.Track1frame:1:[789, 445, 871, 551];frame:2:[770, 442, 850, 544];frame:3:[750, 440, 831, 542].') 2023-11-03 11:37:44.049 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:37:44.049 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Chelsea FC Cufflinks - Stainless Steel 1') 2023-11-03 11:38:05.328 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:38:05.328 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Person:[001, 235, 362, 897],[343, 143, 905, 963];Tie:[479, 463, 613, 930];Glasses:[406, 247, 562, 331];Wine Glass:[500, 668, 607, 998],[887, 756, 996, 1000],[053, 657, 172, 1000];Cup:[436, 760, 521, 996],[817, 864, 891, 999];Bottle:[689, 730, 783, 999];Radiator:[001, 891, 824, 997].') 2023-11-03 11:38:15.408 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:38:15.409 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Traffic Sign:[153, 283, 470, 409],[458, 273, 797, 406],[504, 430, 849, 585].') 2023-11-03 11:38:16.359 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:38:16.360 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'house:[000, 000, 096, 653],[034, 379, 230, 560],[488, 000, 999, 663];tree:[603, 000, 690, 104];window:[000, 292, 066, 421],[000, 123, 076, 265],[001, 000, 074, 110],[511, 432, 544, 499],[566, 416, 608, 495],[628, 409, 680, 489],[706, 403, 776, 489],[817, 382, 886, 540],[954, 092, 999, 178].') 2023-11-03 11:38:36.615 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:38:36.615 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[428, 259, 492, 513] and track its trajectory.To maintain consistency, ensure that trajectories in your answer match the Frame t:[xmin,ymin,xmax,ymax] setup.', 'This is a leopardFrame1:[428, 259, 492, 513];Frame2:[408, 261, 478, 515];Frame3:[363, 229, 475, 498];Frame4:[288, 229, 455, 504];Frame5:[185, 219, 421, 508].') 2023-11-03 11:39:16.684 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:39:16.684 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Person:[018, 100, 303, 532],[341, 463, 692, 1000];Ladder:[002, 111, 426, 1000];Desk:[819, 800, 1000, 885],[002, 828, 161, 909];Hat:[540, 597, 630, 673].') 2023-11-03 11:40:31.931 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 11:40:31.932 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Motorcycle:[736, 334, 914, 763],[528, 351, 697, 752],[270, 364, 496, 743],[084, 374, 320, 734];Car:[000, 387, 103, 536];Van:[000, 339, 172, 430];Bus:[106, 309, 294, 431];Person:[801, 327, 823, 385],[452, 323, 505, 522],[219, 383, 239, 442].') 2023-11-03 11:40:49.411 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:40:49.411 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Inspiration for [002, 003, 997, 860] a cottage white two-story exterior home remodel in Charlotte with [509, 171, 843, 300] a shingle roof') 2023-11-03 11:41:17.022 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 11:41:17.023 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'building:[000, 000, 910, 993].') 2023-11-03 11:41:59.616 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:41:59.616 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[800, 408, 935, 872],personFrame1:[617, 354, 707, 719],personFrame1:[855, 694, 970, 1000],personFrame1:[717, 720, 865, 1000],personFrame1:[342, 409, 479, 872] in this video clip.Please strictly follow the format categoryFrame t:[xmin,ymin,xmax,ymax] to answer the question.', 'personFrame1:[800, 408, 935, 872];Frame2:[807, 445, 948, 915],personFrame1:[617, 354, 707, 719];Frame2:[612, 379, 716, 741],personFrame1:[855, 694, 970, 1000];Frame2:[860, 719, 971, 1000],personFrame1:[717, 720, 865, 1000];Frame2:[721, 744, 865, 1000],personFrame1:[342, 409, 479, 872];Frame2:[346, 425, 476, 850].') 2023-11-03 11:42:28.476 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:42:28.477 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "[640, 519, 707, 664] [825, 574, 895, 808] [769, 670, 856, 975] [518, 571, 645, 998] Members of the UCSB community gather at a park for [080, 615, 123, 679] [377, 723, 414, 811] [061, 693, 095, 772] a candlelight vigil to honor the victims of Friday night's mass shooting on May 24, 2014, in Isla Vista, Calif.") 2023-11-03 11:42:31.910 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:42:31.911 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Bottle in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Bottle:[815, 001, 999, 1000],[650, 001, 918, 1000],[544, 001, 743, 1000],[446, 001, 643, 1000],[372, 001, 544, 1000],[324, 001, 470, 980],[249, 001, 419, 957],[196, 001, 337, 929],[146, 001, 282, 893],[120, 001, 230, 859],[086, 001, 195, 831].') 2023-11-03 11:42:33.298 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:42:33.299 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'human body:[501, 154, 998, 999],[015, 120, 540, 999],[071, 263, 265, 633];woman:[501, 154, 998, 999],[007, 122, 543, 999],[066, 259, 267, 657];human hair:[528, 158, 791, 610],[000, 337, 048, 525],[109, 257, 271, 530],[231, 120, 453, 370],[390, 373, 506, 558];human head:[204, 116, 453, 530],[130, 268, 242, 440],[576, 155, 758, 577];mammal:[000, 178, 577, 952],[000, 342, 057, 997],[060, 269, 267, 669],[391, 383, 501, 576],[517, 156, 999, 999];glasses:[244, 308, 430, 401];human mouth:[313, 423, 376, 464],[648, 472, 717, 514];human ear:[241, 314, 260, 400],[420, 330, 439, 404];human face:[261, 254, 422, 534],[599, 244, 758, 573];human nose:[323, 331, 371, 410],[651, 372, 688, 471].') 2023-11-03 11:42:35.848 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:42:35.848 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'man:[283, 000, 783, 999];human face:[314, 222, 766, 999].') 2023-11-03 11:43:15.594 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:43:15.594 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[032, 319, 995, 995] child with [039, 784, 224, 973] hands above [221, 318, 634, 830] his head lying on park bench') 2023-11-03 11:43:17.507 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:43:17.507 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The city of Austin held a public open house June 7, 2017 to gather feedback on plans to build a new bridge [003, 350, 395, 994] to replace the nearly 70-year-old Redbud Trail bridge.') 2023-11-03 11:43:52.528 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:43:52.529 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect Chair and Person in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Chair:[894, 572, 999, 899];Person:[499, 259, 849, 998],[932, 472, 999, 833].') 2023-11-03 11:44:28.575 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:44:28.575 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[785, 296, 922, 603],personFrame1:[653, 325, 793, 640],personFrame1:[233, 325, 369, 648],personFrame1:[322, 262, 436, 596],personFrame1:[436, 324, 559, 644] in this video clip.Your response should conform strictly to the categoryFrame t:[xmin,ymin,xmax,ymax] format.', 'personFrame1:[785, 296, 922, 603];Frame2:[789, 292, 919, 612],personFrame1:[653, 325, 793, 640];Frame2:[647, 335, 790, 647],personFrame1:[233, 325, 369, 648];Frame2:[229, 340, 366, 653],personFrame1:[322, 262, 436, 596];Frame2:[319, 274, 435, 603],personFrame1:[436, 324, 559, 644];Frame2:[433, 333, 557, 651].') 2023-11-03 11:44:42.312 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 11:44:42.312 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Picture:[589, 243, 826, 612],[080, 124, 396, 687];Stool:[082, 812, 179, 927];Cabinet:[192, 717, 350, 881];Desk:[276, 727, 616, 999];Chair:[590, 656, 676, 832],[716, 642, 814, 838],[857, 616, 961, 841].') 2023-11-03 11:44:47.619 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:44:47.619 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Satchel:[697, 753, 907, 999];Person:[210, 033, 771, 1000];Camera:[406, 173, 550, 279];Cup:[277, 929, 459, 1000].') 2023-11-03 11:44:51.787 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:44:51.787 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Frame,Watch and Couch in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Frame:[002, 002, 176, 179],[762, 002, 999, 166];Watch:[839, 791, 879, 843];Couch:[002, 545, 426, 999].') 2023-11-03 11:45:02.202 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:45:02.202 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect mango,broccoli,mammal,human hand,human hair,human body,toy and person in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'mammal:[020, 117, 419, 847],[288, 000, 445, 370],[323, 193, 747, 999],[635, 159, 999, 830];human hand:[757, 290, 826, 386];human hair:[392, 372, 466, 441],[858, 183, 926, 261];human body:[323, 231, 695, 932],[293, 000, 452, 370],[633, 167, 999, 842];toy:[000, 060, 401, 889],[035, 000, 208, 230],[286, 000, 443, 388],[314, 186, 748, 999],[625, 167, 999, 843],[951, 000, 999, 522];person:[632, 114, 1000, 894],[297, 000, 446, 374],[324, 195, 746, 999].') 2023-11-03 11:45:07.827 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:45:07.827 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Hamimelon,Fire Hydrant,Mango,Motorcycle,Bus,Person,SUV and Car in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Motorcycle:[089, 696, 430, 995],[340, 573, 537, 998],[478, 543, 632, 933],[575, 426, 649, 609],[459, 423, 507, 550],[346, 391, 392, 515],[309, 379, 346, 505],[271, 453, 324, 594],[177, 478, 227, 633];Bus:[475, 219, 609, 368];Person:[754, 368, 928, 999],[892, 578, 1000, 999],[943, 474, 999, 681],[918, 439, 988, 687],[891, 368, 933, 461],[772, 402, 824, 519],[664, 417, 790, 999],[765, 344, 802, 426],[734, 337, 768, 400],[684, 355, 718, 448],[640, 361, 689, 577],[111, 468, 392, 1000],[145, 857, 415, 1000],[495, 452, 624, 815],[503, 435, 620, 629],[582, 368, 646, 585],[595, 347, 636, 413],[560, 354, 603, 442],[500, 372, 579, 487],[455, 368, 507, 543],[019, 405, 106, 744],[081, 422, 131, 667],[179, 393, 245, 615],[168, 391, 206, 475],[258, 391, 320, 583],[249, 359, 287, 439],[202, 353, 237, 405],[041, 373, 083, 465],[078, 374, 116, 467],[101, 370, 137, 549],[126, 370, 166, 529],[346, 349, 394, 501],[312, 350, 346, 475];SUV:[031, 320, 263, 439];Car:[432, 364, 536, 447].') 2023-11-03 11:45:10.621 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:45:10.621 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[348, 541, 617, 961] and track its trajectory.Ensure that each trajectory within your response is articulated using the Frame t:[xmin,ymin,xmax,ymax] specification.', "This is a przewalski's horseFrame1:[348, 541, 617, 961];Frame2:[383, 498, 615, 906];Frame3:[418, 406, 624, 830];Frame4:[476, 394, 632, 797].") 2023-11-03 11:45:58.850 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:45:58.850 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[567, 430, 967, 806] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a bird colored in yellow and green walking on the top of a birdcageFrame1:[567, 430, 967, 806];Frame2:[590, 438, 978, 812];Frame3:[589, 409, 971, 802];Frame4:[597, 419, 975, 812];Frame5:[581, 444, 942, 813].') 2023-11-03 11:46:19.374 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:46:19.375 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'clothing:[394, 318, 617, 987],[000, 528, 033, 751],[041, 503, 068, 649],[060, 585, 100, 777],[085, 700, 128, 968],[108, 503, 135, 624],[120, 554, 147, 700],[135, 643, 209, 840],[137, 828, 183, 891],[195, 464, 227, 541],[200, 554, 235, 738],[232, 535, 259, 732],[259, 585, 295, 738],[277, 764, 347, 974],[292, 554, 316, 675],[332, 585, 349, 770],[347, 535, 364, 738];woman:[204, 738, 282, 993];building:[000, 006, 367, 993],[372, 000, 642, 993],[645, 000, 998, 993];furniture:[000, 751, 036, 993],[020, 643, 041, 923],[020, 452, 125, 573],[025, 649, 065, 949],[143, 885, 204, 993],[173, 764, 195, 923],[190, 770, 225, 968],[638, 821, 752, 993].') 2023-11-03 11:46:26.832 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:46:26.833 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'human head:[272, 248, 398, 429];mammal:[375, 199, 767, 815],[055, 107, 599, 999],[696, 276, 745, 416],[833, 339, 871, 538],[871, 326, 932, 578];human body:[056, 107, 601, 999],[694, 275, 746, 417],[828, 338, 872, 430],[875, 320, 933, 443];human leg:[282, 660, 378, 999],[387, 650, 485, 999];woman:[046, 119, 566, 999];human hair:[290, 251, 379, 353];clothing:[263, 397, 486, 984],[700, 315, 752, 406],[868, 338, 933, 450];human arm:[048, 111, 277, 445],[375, 125, 600, 458],[716, 320, 747, 405];human hand:[053, 107, 122, 200],[528, 136, 598, 223].') 2023-11-03 11:46:37.157 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:46:37.157 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'American Football:[541, 203, 590, 268];Person:[001, 349, 064, 625],[001, 110, 379, 676],[128, 123, 465, 953],[193, 361, 543, 589],[092, 248, 415, 488],[476, 043, 656, 594];Sneakers:[243, 866, 300, 955],[001, 593, 044, 676],[089, 366, 146, 450];Gloves:[418, 449, 469, 544],[374, 249, 417, 318],[519, 181, 563, 245],[539, 202, 589, 284];Helmet:[573, 044, 640, 162],[235, 121, 316, 228],[296, 106, 375, 213],[434, 359, 515, 490].') 2023-11-03 11:46:39.700 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:46:39.701 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect door handle,chair,person,clothing and mammal in this image.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'chair:[393, 691, 583, 998],[000, 561, 038, 772],[015, 540, 069, 720],[120, 547, 205, 737],[193, 558, 295, 750],[290, 580, 315, 778],[302, 622, 438, 971],[463, 530, 477, 598],[465, 525, 498, 598],[562, 552, 625, 592],[625, 647, 793, 999],[759, 616, 854, 926],[838, 566, 859, 762],[849, 567, 965, 775],[946, 592, 999, 766];person:[341, 478, 450, 847],[155, 476, 211, 630],[217, 496, 291, 653],[288, 476, 354, 603],[322, 373, 349, 488],[466, 494, 641, 960],[567, 444, 586, 519],[623, 498, 688, 603],[632, 509, 758, 828],[759, 469, 787, 540],[761, 472, 845, 779],[854, 472, 903, 576],[858, 501, 966, 675],[956, 511, 988, 646],[960, 470, 999, 636];clothing:[000, 523, 031, 642],[162, 509, 210, 622],[224, 514, 285, 646],[289, 525, 362, 603],[316, 406, 346, 486],[360, 530, 485, 814],[478, 566, 636, 857],[649, 569, 756, 830],[721, 501, 762, 565],[751, 547, 854, 770],[866, 486, 900, 553],[872, 547, 964, 657],[956, 525, 995, 655],[967, 500, 999, 636];mammal:[345, 482, 465, 795],[000, 523, 033, 630],[156, 474, 208, 622],[220, 511, 292, 653],[285, 481, 356, 609],[315, 386, 351, 481],[338, 505, 397, 651],[478, 501, 630, 964],[569, 448, 602, 521],[592, 438, 623, 529],[612, 442, 623, 514],[625, 511, 761, 826],[755, 488, 857, 764],[771, 469, 863, 605],[856, 430, 999, 653].') 2023-11-03 11:46:42.248 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:46:42.249 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Ruth Beitia [006, 049, 996, 858] of Spain competes in the women's high jump final at the National stadium as part of the 2008 Beijing Olympic Games on August 23, 2008.") 2023-11-03 11:46:59.171 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:46:59.172 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[756, 253, 996, 460] [002, 129, 263, 270] [125, 614, 654, 942] [002, 252, 277, 481] [381, 380, 788, 645] [326, 167, 651, 385] mini pigs in a blanket on [005, 002, 996, 992] a pan') 2023-11-03 11:47:02.703 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:47:02.703 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Per Mertesacker of [269, 101, 809, 997] Bremen celebrates with his team mates after scoring his team's opening goal during the Bundesliga match between [002, 053, 324, 995] Werder Bremen and...") 2023-11-03 11:47:38.684 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 11:47:38.685 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'woman:[288, 087, 717, 979];glasses:[384, 243, 565, 343];harbor seal:[506, 206, 970, 998];human face:[410, 140, 559, 508];sea lion:[498, 201, 956, 998].') 2023-11-03 11:47:52.841 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:47:52.842 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'man:[128, 348, 306, 785];human face:[250, 381, 272, 471].') 2023-11-03 11:47:55.126 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 11:47:55.126 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'shelf:[840, 225, 999, 442];Person:[237, 012, 907, 1000];TV:[002, 002, 425, 309].') 2023-11-03 11:48:16.964 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:48:16.965 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'pasta salad, a roll [804, 442, 948, 656] and mixed greens DOES NOT constitute "gourmet cuisine" for a $1,000 ticket.') 2023-11-03 11:48:38.903 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:48:38.904 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'DCCT / EDIC – [001, 004, 998, 993] incidence of all predefined cardiovascular outcome') 2023-11-03 11:48:48.718 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:48:48.718 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '17th January 1964; A woman [117, 240, 837, 997] smiles for the camera at a dance in the Ashe Hall in Tralee. Music at the dance was provided by the Donie Collins Band.') 2023-11-03 11:49:27.008 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:49:27.008 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The souvenir ticket [003, 003, 993, 990] from the Ponderosa when it was operational.') 2023-11-03 11:49:35.569 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:49:35.569 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Poster advertising French railways of Orleans and of the East, promoting the city of Martigny') 2023-11-03 11:50:07.316 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:50:07.316 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[395, 400, 685, 899] and track its trajectory.Ensure the trajectories in your answer follow the Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a personFrame1:[395, 400, 685, 899];Frame2:[363, 418, 666, 937];Frame3:[411, 374, 657, 982];Frame4:[206, 219, 594, 940].') 2023-11-03 11:50:15.809 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:50:15.809 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "MEXICAN GLASSWARE / 'Cobalt Blue Rim' large margarita glasses (set of 6) / For the margarita lover, these enjoyable large sized margarita glasses feature a cheerful cobalt blue rim [008, 011, 993, 230].") 2023-11-03 11:50:23.204 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:50:23.204 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "For all the fun we have together, for you care and love you show me, you deserve the very best surprise today! Happy Father's Day from Your Daughter [457, 405, 881, 993]!") 2023-11-03 11:50:28.044 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:50:28.045 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person in this image.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[374, 388, 594, 479],[124, 455, 155, 479],[167, 442, 211, 480],[195, 437, 228, 479],[208, 391, 239, 421],[299, 440, 338, 483],[535, 382, 567, 414],[624, 430, 677, 493],[671, 438, 728, 494],[884, 424, 941, 502],[921, 424, 987, 506],[909, 082, 982, 229].') 2023-11-03 11:51:18.240 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:51:18.240 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[337, 392, 768, 818] and track its trajectory.If you're including trajectory details in your reply, the Frame t:[xmin,ymin,xmax,ymax] format is imperative.", 'This is a coupeFrame1:[337, 392, 768, 818];Frame2:[376, 405, 738, 762];Frame3:[386, 424, 683, 732];Frame4:[363, 454, 610, 709].') 2023-11-03 11:51:24.497 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:51:24.498 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Hat,Helmet and Person in this image.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Hat:[182, 041, 235, 085];Helmet:[263, 191, 370, 308];Person:[264, 192, 738, 908],[143, 042, 286, 558],[271, 339, 328, 554].') 2023-11-03 11:51:47.296 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:51:47.296 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[854, 523, 998, 685],bicycleFrame1:[796, 560, 833, 617],carFrame1:[500, 535, 770, 699] in this video clip.Your response should conform strictly to the categoryFrame t:[xmin,ymin,xmax,ymax] format.', 'carFrame1:[854, 523, 998, 685];Frame2:[854, 523, 998, 685],bicycleFrame1:[796, 560, 833, 617];Frame2:[796, 560, 833, 617],carFrame1:[500, 535, 770, 699];Frame2:[500, 535, 770, 699].') 2023-11-03 11:51:59.403 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:51:59.404 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Traffic Light in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Traffic Light:[761, 023, 875, 355],[807, 599, 948, 997],[288, 596, 430, 994],[247, 009, 363, 350].') 2023-11-03 11:52:20.413 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:52:20.413 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is the track of the a woman in a black dress?Use the specified TrackiFrame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'There is one a woman in a black dress.Track1frame1:[637, 433, 1000, 1000];frame2:[648, 432, 1000, 1000];frame3:[659, 431, 1000, 1000];frame4:[670, 431, 1000, 1000].') 2023-11-03 11:53:11.234 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:53:11.235 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Desk:[140, 943, 455, 1000];Cabinet:[696, 202, 844, 343];Bottle:[211, 615, 277, 991];Person:[000, 338, 391, 998],[144, 303, 651, 1000];Faucet:[748, 566, 831, 804];Sink:[722, 802, 897, 973];Cleaning Products:[713, 266, 751, 381].') 2023-11-03 11:53:34.336 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:53:34.336 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'woman:[655, 117, 906, 956],[000, 203, 159, 500],[113, 173, 251, 413],[161, 380, 383, 692],[213, 093, 259, 234],[226, 086, 252, 175],[374, 613, 610, 999],[400, 168, 503, 419],[431, 173, 607, 509],[564, 000, 625, 188],[623, 000, 712, 157],[711, 000, 758, 119];food:[328, 425, 406, 502],[343, 440, 455, 558],[418, 506, 535, 631];man:[000, 110, 157, 356],[000, 222, 235, 726],[118, 091, 213, 209],[157, 120, 291, 418],[235, 045, 372, 414],[529, 152, 694, 606];girl:[133, 381, 377, 670],[167, 191, 233, 383],[206, 091, 260, 237],[368, 628, 599, 999],[412, 181, 511, 394],[440, 176, 593, 499],[568, 000, 630, 212],[623, 000, 711, 181],[638, 106, 898, 964],[868, 033, 978, 352];footwear:[760, 888, 809, 965].') 2023-11-03 11:53:46.021 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:53:46.022 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[214, 460, 307, 570],carFrame1:[435, 432, 520, 567],carFrame1:[078, 474, 255, 631] in this video clip.Ensure you use the exact format categoryFrame t:[xmin,ymin,xmax,ymax] in your response.', 'carFrame1:[214, 460, 307, 570];Frame2:[085, 457, 248, 610],carFrame1:[435, 432, 520, 567];Frame2:[444, 424, 522, 558],carFrame1:[078, 474, 255, 631];Frame2:[000, 492, 143, 728].') 2023-11-03 11:54:04.311 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:54:04.312 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Desk:[809, 697, 895, 792];Chair:[736, 689, 765, 794];Person:[001, 285, 075, 640],[001, 246, 378, 811],[204, 290, 360, 563],[061, 420, 111, 542],[349, 447, 400, 650],[382, 421, 425, 623],[345, 080, 744, 874],[561, 348, 629, 406],[685, 334, 814, 799],[853, 196, 1000, 788];Umbrella:[310, 343, 578, 451];Hat:[561, 345, 623, 397],[703, 338, 750, 398];Backpack:[001, 354, 076, 644];Bottle:[786, 694, 812, 741];Hot dog:[813, 801, 1000, 997],[644, 786, 909, 999],[255, 786, 475, 932],[191, 798, 293, 928],[111, 808, 191, 913],[013, 795, 190, 921],[001, 789, 078, 925],[172, 706, 256, 805],[256, 691, 346, 803];Cake:[832, 613, 877, 703].') 2023-11-03 11:54:32.769 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:54:32.769 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'woman:[385, 467, 580, 858],[044, 360, 126, 414],[087, 805, 335, 999],[263, 373, 333, 423],[440, 394, 504, 428],[600, 682, 741, 998],[702, 751, 999, 999],[844, 638, 999, 959];man:[643, 462, 810, 722],[000, 450, 093, 999],[044, 438, 281, 898],[485, 339, 549, 435],[522, 450, 640, 809],[768, 462, 873, 686],[859, 458, 999, 756];clothing:[652, 486, 806, 701],[000, 763, 075, 975],[000, 558, 102, 751],[056, 380, 120, 421],[097, 631, 225, 840],[118, 905, 323, 996],[211, 650, 298, 819],[252, 397, 332, 425],[327, 407, 407, 430],[330, 635, 407, 780],[341, 534, 441, 635],[387, 684, 569, 812],[419, 539, 550, 716],[532, 515, 621, 619],[546, 648, 635, 797],[600, 802, 740, 999],[749, 923, 999, 993],[776, 500, 867, 612],[801, 618, 856, 716],[854, 508, 999, 667],[935, 740, 988, 949];human face:[710, 824, 842, 949].') 2023-11-03 11:54:41.986 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 11:54:41.987 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Chair in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Chair:[611, 448, 742, 620],[250, 459, 278, 501],[773, 443, 790, 483].') 2023-11-03 11:55:27.257 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:55:27.258 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'tree:[000, 090, 976, 987];flower:[063, 252, 718, 857],[628, 002, 907, 254].') 2023-11-03 11:55:29.096 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:55:29.096 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Stock photo: [114, 155, 531, 997] [337, 073, 931, 995] Happy parents feeding [318, 513, 794, 995] their baby boy [503, 427, 644, 646] a bottle') 2023-11-03 11:55:47.930 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:55:47.930 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'American Football:[436, 262, 533, 454];Person:[154, 225, 773, 986],[324, 019, 948, 1000],[633, 314, 999, 915],[149, 015, 510, 938];Bracelet:[439, 349, 482, 402];Helmet:[274, 019, 385, 224],[378, 022, 512, 240],[744, 316, 880, 542],[606, 225, 688, 419];Gloves:[239, 502, 290, 621],[523, 438, 684, 531],[727, 545, 772, 654];Sneakers:[202, 701, 258, 783],[160, 823, 245, 995],[617, 868, 693, 998],[599, 801, 695, 891].') 2023-11-03 11:56:14.346 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:56:14.347 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the second driver and his shifting motorized-bicycle in this video clip?When you incorporate trajectories into your answer, adhere to the TrackiFrame t:[xmin,ymin,xmax,ymax] format.,Present the trajectories in your response using the designated TrackiFrame t:[xmin,ymin,xmax,ymax] layout.', 'There are 2 second driver and his shifting motorized-bicycle.Track1frame:1:[265, 177, 328, 413];frame:2:[341, 199, 397, 409];frame:3:[382, 206, 432, 387],Track2frame:1:[245, 225, 339, 453];frame:2:[322, 236, 403, 446];frame:3:[369, 252, 436, 428].') 2023-11-03 11:56:18.881 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:56:18.881 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'man:[037, 278, 121, 605],[111, 282, 244, 628],[201, 282, 248, 579],[250, 286, 364, 607],[763, 334, 848, 672],[849, 339, 942, 691];clothing:[000, 329, 049, 617],[043, 329, 106, 592],[100, 328, 175, 549],[113, 328, 232, 613],[163, 317, 215, 472],[188, 324, 256, 549],[250, 320, 354, 590],[364, 359, 425, 534],[430, 341, 472, 582],[455, 338, 510, 587],[496, 362, 541, 558],[523, 353, 616, 565],[727, 376, 787, 626],[776, 361, 860, 654],[862, 388, 934, 671],[956, 412, 999, 695].') 2023-11-03 11:56:25.625 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:56:25.626 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[002, 199, 175, 998],[053, 616, 221, 694],[180, 266, 524, 1000],[214, 147, 414, 722],[348, 002, 798, 999],[690, 247, 932, 1000];Watch:[539, 883, 581, 984];Canned:[158, 718, 206, 854],[278, 733, 327, 868];Trash bin Can:[836, 345, 869, 409];Ladder:[174, 082, 211, 424];bucket:[046, 449, 156, 563],[180, 439, 325, 671],[836, 342, 870, 410],[246, 279, 283, 355];Desk:[009, 550, 334, 934];Glasses:[312, 384, 449, 461],[411, 137, 563, 252];Hat:[260, 142, 412, 242],[396, 002, 563, 202].') 2023-11-03 11:56:45.557 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:56:45.558 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'People [766, 503, 850, 838] [082, 495, 167, 851] [667, 521, 757, 865] [477, 558, 538, 856] [618, 484, 684, 850] [555, 504, 627, 826] [150, 524, 247, 974] [356, 526, 462, 971] flock to Seoul Grand Park, an amusement park [004, 005, 995, 632] and zoo in Gwacheon, Gyeonggi on Sunday, as temperatures hover over 15 degrees Celsius (59 degrees Fahrenheit), unusually high for February. [NEWS1') 2023-11-03 11:56:47.283 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:56:47.283 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Bicycle:[000, 156, 220, 490];Person:[731, 001, 999, 395],[768, 039, 806, 099],[343, 289, 593, 961],[214, 207, 487, 855];Storage box:[640, 299, 841, 423],[514, 268, 695, 377],[500, 235, 563, 295],[332, 211, 392, 295];Chair:[378, 313, 636, 920],[239, 362, 445, 810];Desk:[558, 209, 749, 304],[689, 172, 937, 290],[617, 368, 886, 724];Sneakers:[435, 856, 490, 960],[343, 835, 417, 923],[261, 767, 330, 854],[213, 737, 272, 824].') 2023-11-03 11:57:09.965 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:57:09.966 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Other Shoes:[391, 839, 491, 892],[508, 827, 594, 886];Person:[288, 135, 737, 892].') 2023-11-03 11:57:13.840 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:57:13.840 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[559, 350, 693, 975],[000, 025, 058, 747],[107, 000, 208, 148],[130, 051, 222, 290],[498, 151, 595, 768],[710, 294, 838, 818],[881, 402, 999, 998];man:[038, 192, 186, 954],[225, 078, 314, 791],[283, 161, 464, 923],[291, 031, 386, 231],[374, 070, 478, 771],[455, 048, 534, 302],[613, 222, 713, 948],[671, 506, 867, 998].') 2023-11-03 11:57:46.317 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:57:46.318 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'low maintenance houseplants [501, 002, 993, 386] to brighten your home') 2023-11-03 11:57:58.695 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 11:57:58.696 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Helmet,Glasses and Bicycle in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Helmet:[525, 178, 714, 299];Glasses:[539, 268, 669, 299];Bicycle:[257, 480, 805, 999].') 2023-11-03 11:58:49.073 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:58:49.074 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Car:[838, 328, 956, 384],[810, 329, 872, 375],[764, 336, 812, 372],[326, 301, 451, 379];Motorcycle:[006, 001, 848, 837];Van:[648, 252, 761, 370].') 2023-11-03 11:59:30.649 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:59:30.650 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[414, 445, 467, 812] and track its trajectory.If you're including trajectory details in your reply, the Frame t:[xmin,ymin,xmax,ymax] format is imperative.", 'This is a personFrame1:[414, 445, 467, 812];Frame2:[393, 331, 505, 861];Frame3:[471, 225, 613, 998];Frame4:[391, 204, 604, 998];Frame5:[050, 200, 328, 998].') 2023-11-03 11:59:35.566 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:59:35.566 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'person:[328, 364, 423, 853],[430, 388, 509, 793],[503, 403, 605, 832],[614, 403, 716, 851];clothing:[315, 418, 442, 830],[443, 432, 535, 746],[501, 467, 596, 746],[624, 455, 705, 724].') 2023-11-03 11:59:40.385 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:59:40.385 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Drawing of a silver table lamp [210, 149, 851, 849] with a twisted fluted body [261, 524, 743, 850] in rococo style, House of Carl Faberge') 2023-11-03 11:59:49.283 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:59:49.283 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'human leg:[255, 489, 588, 993],[272, 508, 500, 964];human hair:[732, 232, 903, 339];girl:[234, 241, 888, 981];tree:[000, 000, 999, 481];footwear:[388, 875, 485, 962];human face:[706, 258, 849, 337].') 2023-11-03 11:59:54.646 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:59:54.646 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me what is thisFrame1:[230, 215, 536, 825] and track its trajectory.Ensure that each trajectory within your response is articulated using the Frame t:[xmin,ymin,xmax,ymax] specification.', 'This is a push-bikeFrame1:[230, 215, 536, 825];Frame2:[273, 288, 548, 818];Frame3:[328, 255, 561, 705].') 2023-11-03 11:59:56.975 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 11:59:56.976 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Boat:[406, 360, 950, 641];Lamp:[626, 404, 649, 429],[586, 416, 611, 440];Flower:[502, 375, 524, 416],[447, 430, 461, 473],[451, 525, 468, 570],[666, 449, 674, 463],[728, 435, 735, 449],[772, 444, 783, 462],[394, 547, 414, 586],[285, 503, 299, 535];Person:[184, 429, 245, 578],[272, 458, 325, 613],[406, 511, 451, 615],[423, 479, 493, 597],[456, 495, 475, 580],[453, 441, 535, 524],[484, 398, 529, 577],[501, 406, 552, 517],[642, 413, 686, 530],[708, 396, 759, 508],[757, 385, 820, 520],[772, 423, 784, 492],[709, 485, 730, 499];Glasses:[653, 424, 669, 434],[778, 453, 785, 470];Tie:[664, 449, 674, 490],[516, 452, 534, 492],[772, 428, 784, 489],[726, 430, 737, 483];Hat:[207, 429, 224, 448].') 2023-11-03 12:00:00.204 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:00:00.204 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'High Heels:[212, 638, 273, 702],[146, 637, 218, 703],[099, 636, 156, 702],[262, 551, 326, 631],[203, 548, 270, 633],[142, 549, 212, 631],[096, 549, 162, 632],[262, 476, 329, 543],[204, 473, 279, 543],[150, 472, 208, 542],[091, 472, 160, 542],[258, 393, 326, 469],[191, 391, 273, 467],[123, 391, 193, 465],[259, 304, 323, 374],[199, 304, 263, 374],[132, 299, 206, 372],[081, 299, 149, 372],[267, 230, 324, 294],[208, 222, 274, 292],[113, 219, 181, 292],[396, 205, 463, 307],[338, 208, 402, 305],[415, 105, 479, 188],[358, 105, 419, 186],[638, 032, 711, 138],[525, 046, 588, 134],[455, 045, 529, 140],[212, 134, 275, 193],[133, 123, 202, 190],[080, 118, 151, 192],[264, 008, 324, 105],[187, 007, 266, 105],[121, 043, 184, 104];shelf:[328, 038, 793, 775],[066, 037, 330, 754].') 2023-11-03 12:00:08.845 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:00:08.845 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Person:[002, 323, 265, 1000],[002, 001, 492, 940],[174, 001, 494, 592],[566, 239, 864, 1000],[536, 001, 872, 405],[846, 058, 999, 1000].') 2023-11-03 12:00:26.794 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:00:26.794 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Person:[643, 001, 997, 617],[338, 553, 389, 726],[208, 563, 251, 715],[142, 555, 190, 697].') 2023-11-03 12:00:37.409 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:00:37.409 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Frame:[646, 181, 726, 317],[397, 174, 457, 293],[279, 172, 325, 284];Lamp:[141, 016, 210, 054];Guitar:[213, 261, 390, 502],[422, 368, 597, 512],[374, 441, 431, 696];Piano:[634, 472, 850, 586],[617, 474, 879, 603];Drum:[686, 454, 729, 506],[633, 454, 679, 490],[603, 508, 654, 566];Person:[210, 187, 350, 766],[405, 159, 561, 820],[670, 372, 761, 509],[732, 241, 906, 874],[197, 292, 233, 408],[137, 288, 249, 571],[124, 277, 176, 377],[360, 345, 411, 429];Leather Shoes:[241, 717, 282, 765],[303, 714, 332, 761],[448, 741, 471, 780],[489, 759, 516, 818],[750, 760, 799, 792],[809, 809, 847, 872];Speaker:[323, 158, 400, 304],[790, 062, 857, 262],[179, 084, 206, 226],[531, 532, 648, 675],[889, 001, 1000, 541];Microphone:[737, 323, 784, 357];Tripod:[232, 262, 426, 977],[594, 357, 759, 919],[638, 626, 776, 1000].') 2023-11-03 12:00:47.882 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:00:47.882 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Wild Bird:[030, 249, 083, 361],[241, 254, 289, 362],[317, 247, 360, 331],[386, 245, 435, 342],[217, 599, 292, 715],[296, 593, 362, 704],[382, 524, 446, 626],[463, 500, 528, 613],[494, 409, 561, 533],[652, 475, 720, 595],[717, 422, 766, 575],[766, 531, 823, 613],[796, 402, 857, 577],[322, 448, 392, 580],[203, 484, 272, 601],[378, 404, 448, 537],[448, 302, 497, 374].') 2023-11-03 12:02:31.259 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:02:31.260 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'man:[055, 740, 106, 808],[087, 754, 124, 835],[118, 770, 152, 843],[119, 756, 147, 836],[152, 764, 315, 999],[196, 694, 288, 805],[203, 625, 258, 702],[223, 738, 272, 863],[252, 756, 391, 999],[298, 630, 325, 712],[335, 615, 360, 704],[350, 759, 390, 868],[386, 663, 551, 999],[536, 702, 568, 784],[601, 691, 633, 763],[603, 630, 631, 695],[614, 743, 643, 823],[626, 684, 673, 820],[667, 684, 740, 882],[668, 742, 868, 999],[683, 613, 750, 738],[700, 623, 743, 768],[743, 705, 803, 771],[804, 681, 921, 903],[873, 666, 918, 798],[896, 648, 952, 717],[911, 697, 999, 999];clothing:[125, 801, 224, 938],[139, 860, 323, 999],[255, 837, 386, 982],[328, 712, 365, 777],[346, 627, 370, 694],[585, 892, 645, 990],[616, 743, 701, 815],[616, 858, 706, 994],[674, 772, 741, 865],[675, 876, 935, 999],[809, 746, 929, 907],[878, 723, 929, 832],[976, 847, 999, 999].') 2023-11-03 12:02:35.962 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:02:35.963 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Paddle in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Paddle:[247, 298, 673, 620].') 2023-11-03 12:02:39.909 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:02:39.909 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[004, 192, 996, 990] The Duesenberg when bought from dealer in 1941 for $450') 2023-11-03 12:02:50.140 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:02:50.140 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[459, 477, 754, 798] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a turtleFrame1:[459, 477, 754, 798];Frame2:[434, 485, 726, 812];Frame3:[434, 481, 702, 806];Frame4:[447, 439, 680, 751];Frame5:[464, 381, 659, 721].') 2023-11-03 12:02:54.933 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:02:54.933 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Helmet:[467, 351, 506, 384],[437, 613, 476, 647],[803, 283, 870, 319],[875, 503, 906, 526];Person:[400, 793, 501, 875],[425, 758, 513, 803],[410, 610, 503, 756],[447, 350, 511, 489],[302, 333, 340, 394],[489, 220, 554, 339],[814, 660, 913, 733],[842, 765, 909, 846];Bottle:[208, 719, 307, 845].') 2023-11-03 12:03:15.997 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:03:15.997 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Plate,Cup,Carpet,Side Table and Pillow in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Plate:[077, 131, 165, 263];Cup:[634, 169, 660, 216];Carpet:[081, 536, 967, 999];Side Table:[001, 811, 115, 999],[948, 459, 999, 655];Pillow:[787, 329, 975, 544],[440, 235, 565, 386],[368, 247, 467, 335].') 2023-11-03 12:03:17.858 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:03:17.858 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'shelf:[613, 639, 988, 990],[004, 006, 301, 315],[644, 005, 990, 374],[010, 293, 138, 892];Storage box:[013, 885, 115, 985];Towel:[106, 602, 228, 736];Flower:[303, 371, 351, 425];Vase:[311, 395, 370, 533];Sink:[810, 553, 990, 700];Microwave:[285, 060, 651, 348];Tea pot:[311, 415, 368, 537];Gas stove:[313, 489, 655, 698];Oven:[319, 651, 624, 946];Cup:[753, 506, 793, 583],[686, 561, 733, 617],[721, 393, 762, 467];Plate:[670, 590, 742, 634],[206, 464, 301, 513];Bowl:[824, 594, 886, 662];Spoon:[257, 431, 297, 568];Rice Cooker:[102, 400, 195, 529];Pot:[339, 460, 449, 555],[385, 433, 521, 495],[504, 476, 618, 576];Pomegranate:[223, 512, 256, 554];Bakset:[884, 630, 989, 695].') 2023-11-03 12:03:31.318 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 12:03:31.318 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Camera and Backpack in this image.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Camera:[362, 458, 441, 614];Backpack:[664, 668, 1000, 1000].') 2023-11-03 12:04:13.380 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:04:13.381 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[504, 700, 637, 936] [879, 680, 997, 913] [214, 484, 361, 957] [070, 432, 231, 936] People on a demo in America holding [348, 762, 506, 908] [557, 045, 751, 499] [889, 427, 997, 678] [500, 397, 631, 705] [671, 451, 870, 838] [216, 044, 425, 545] placards supporting science') 2023-11-03 12:04:20.218 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:04:20.219 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Chair,Person and Street Lights in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Chair:[602, 331, 917, 621],[722, 311, 950, 593],[926, 515, 1000, 629],[085, 287, 256, 481],[001, 356, 098, 465];Person:[242, 195, 640, 625],[081, 195, 194, 444],[193, 173, 328, 415],[563, 261, 695, 437],[879, 289, 1000, 457];Street Lights:[730, 170, 787, 245],[507, 083, 666, 212].') 2023-11-03 12:04:25.091 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:04:25.091 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[000, 345, 035, 404],truckFrame1:[028, 273, 127, 403],carFrame1:[102, 267, 171, 343] in this video clip.For clarity in your response, always adopt the classFrame t:[xmin,ymin,xmax,ymax] format.', 'carFrame1:[000, 345, 035, 404];Frame2:[000, 345, 035, 404],truckFrame1:[028, 273, 127, 403];Frame2:[027, 275, 128, 401],carFrame1:[102, 267, 171, 343];Frame2:[104, 268, 171, 346].') 2023-11-03 12:05:01.474 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:05:01.474 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'suit:[661, 232, 830, 864],[122, 445, 300, 542],[125, 476, 458, 998],[292, 478, 455, 993],[398, 385, 475, 580];woman:[451, 243, 685, 972];man:[000, 050, 197, 998],[123, 312, 456, 998],[130, 258, 305, 542],[296, 337, 456, 998],[366, 283, 475, 563],[527, 228, 743, 898],[667, 227, 999, 998];human face:[046, 076, 196, 406],[170, 375, 206, 455],[173, 381, 192, 455],[190, 306, 255, 431],[200, 342, 285, 511],[337, 375, 418, 519],[364, 303, 420, 371],[511, 299, 585, 447],[705, 254, 760, 367].') 2023-11-03 12:05:24.283 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:05:24.284 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[220, 537, 339, 921] [678, 565, 780, 904] [752, 510, 920, 988] [468, 507, 623, 983] [325, 553, 450, 939] Women of Amga prepare for the opening ceremony of Ysyakh, holding burners of cow dung so ...') 2023-11-03 12:05:45.071 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:05:45.071 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[102, 473, 316, 890] and track its trajectory.For clarity, represent trajectories using the Frame t:[xmin,ymin,xmax,ymax] format in your response.', 'This is a mountain zebraFrame1:[102, 473, 316, 890];Frame2:[144, 533, 269, 929];Frame3:[160, 530, 267, 962];Frame4:[219, 555, 371, 998].') 2023-11-03 12:05:50.506 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 12:05:50.507 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Person:[002, 531, 102, 998],[029, 357, 172, 985],[130, 145, 429, 999],[316, 388, 427, 996],[396, 381, 688, 1000],[564, 375, 778, 965],[719, 335, 958, 983],[927, 396, 1000, 976].') 2023-11-03 12:05:59.492 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:05:59.492 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'mammal:[000, 003, 669, 931].') 2023-11-03 12:06:51.226 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:06:51.226 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'desk:[000, 505, 999, 999];office building:[000, 000, 999, 994].') 2023-11-03 12:06:54.817 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:06:54.817 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Halloween black and white clipart images free vector royalty free stock Haunted House | [004, 005, 992, 985] Free Stock Photo | Illustration of bats flying by a ... vector royalty free stock') 2023-11-03 12:07:09.438 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:07:09.438 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Garlic papers [881, 581, 951, 631] [426, 561, 536, 633] [566, 390, 665, 481] [503, 432, 599, 518] [368, 404, 479, 493] [490, 725, 571, 784] [096, 700, 205, 845] strewn across the porch [001, 300, 997, 983]') 2023-11-03 12:07:30.439 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:07:30.439 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Desk in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Desk:[693, 378, 1000, 584],[173, 525, 664, 786].') 2023-11-03 12:07:32.948 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:07:32.948 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect table and tree in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'table:[031, 652, 215, 930],[673, 492, 805, 639];tree:[000, 291, 041, 458],[000, 188, 045, 319],[015, 185, 096, 343],[019, 075, 444, 436],[045, 320, 091, 436],[329, 154, 443, 415],[480, 162, 572, 272],[485, 251, 678, 429],[546, 131, 704, 390],[691, 120, 898, 380],[850, 109, 914, 382],[963, 219, 999, 385].') 2023-11-03 12:07:40.197 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 12:07:40.197 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Person:[301, 814, 435, 1000],[428, 790, 491, 981],[471, 810, 547, 996],[519, 796, 609, 1000].') 2023-11-03 12:08:17.478 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:08:17.478 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[577, 001, 1000, 343];Basin:[000, 393, 164, 615],[564, 161, 717, 347],[514, 269, 674, 476],[421, 376, 591, 582],[343, 485, 525, 730],[261, 634, 454, 866],[192, 798, 399, 1000];Plate:[000, 125, 1000, 1000].') 2023-11-03 12:09:14.043 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:09:14.043 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'screen shot of [003, 004, 996, 990] web page with photo of a classroom and descriptions of classroom equipment') 2023-11-03 12:10:11.152 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:10:11.152 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Towel:[913, 285, 968, 419],[697, 289, 743, 410],[341, 288, 371, 385];Flower:[344, 207, 366, 301];Plate:[053, 670, 543, 960];Wine Glass:[758, 291, 786, 420],[860, 298, 887, 408];Cup:[887, 356, 918, 430];Bowl:[809, 350, 841, 401];Chair:[511, 348, 639, 727].') 2023-11-03 12:10:21.258 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:10:21.258 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Lamp:[044, 001, 189, 078];Potted Plant:[058, 009, 534, 998];shelf:[001, 303, 397, 847];Person:[306, 001, 927, 999];Watch:[809, 479, 896, 575];Belt:[453, 922, 671, 1000];Bottle:[659, 214, 859, 507],[286, 885, 333, 998].') 2023-11-03 12:10:59.796 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:10:59.796 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Bitcoin buttons [345, 432, 684, 976] are displayed on a table [005, 006, 994, 993] at the Inside Bitcoins conference in Berlin on Wednesday, Feb. 12, 2014.') 2023-11-03 12:11:03.484 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 12:11:03.485 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'street light:[228, 281, 246, 477],[291, 202, 315, 332],[400, 059, 438, 437];building:[000, 000, 999, 650].') 2023-11-03 12:11:20.651 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:11:20.651 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Person:[174, 034, 915, 1000];Necklace:[447, 793, 638, 1000];Pickup Truck:[003, 200, 998, 992];Glasses:[345, 361, 658, 516].') 2023-11-03 12:11:29.279 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:11:29.280 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Sheep [282, 613, 443, 835] climb the hillside [000, 457, 996, 996] as flames from a moor fire are seen on Marsden moor [003, 414, 997, 995], near Huddersfield in northern England on April 25, 2021.') 2023-11-03 12:11:55.030 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:11:55.030 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[107, 401, 468, 813],carFrame1:[493, 354, 749, 647],carFrame1:[385, 403, 451, 500] in this video clip.When answering, maintain the format classFrame t:[xmin,ymin,xmax,ymax] without deviation.', 'carFrame1:[107, 401, 468, 813];Frame2:[112, 401, 465, 810],carFrame1:[493, 354, 749, 647];Frame2:[458, 357, 655, 592],carFrame1:[385, 403, 451, 500];Frame2:[385, 401, 450, 496].') 2023-11-03 12:11:57.692 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:11:57.692 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2 and frame3, please tell me what this oneframe:1:[233, 837, 274, 985];frame:2:[233, 837, 274, 985];frame:3:[228, 862, 278, 997] is doing?Please briefly answer the question based on the provided image frames and trajectory coordinates.', 'This is a/an moving basketballs') 2023-11-03 12:11:58.501 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:11:58.501 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'building:[000, 040, 999, 998].') 2023-11-03 12:13:16.298 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 12:13:16.299 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[089, 003, 954, 998];Bracelet:[780, 003, 863, 132];Satchel:[243, 380, 848, 1000],[736, 224, 1000, 999].') 2023-11-03 12:13:55.432 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:13:55.432 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me what is thisFrame1:[373, 335, 507, 832] and track its trajectory.Incorporate the Frame t:[xmin,ymin,xmax,ymax] format when presenting trajectories in your response.', 'This is a personFrame1:[373, 335, 507, 832];Frame2:[432, 325, 509, 789];Frame3:[380, 212, 553, 784].') 2023-11-03 12:14:01.459 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:14:01.459 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'person:[250, 477, 560, 688],[257, 709, 593, 920],[267, 007, 709, 226],[277, 252, 583, 460];human face:[356, 718, 505, 775],[378, 263, 484, 321],[383, 503, 477, 565],[388, 024, 517, 092].') 2023-11-03 12:14:24.633 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:14:24.633 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Babs Foster's fashion design class recently sewed their own Christmas stockings [228, 713, 330, 901] [250, 345, 322, 516] [532, 366, 616, 539] [672, 354, 746, 546] [140, 728, 223, 923] [362, 687, 450, 883] [499, 721, 578, 887] [031, 463, 116, 639].") 2023-11-03 12:14:28.645 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:14:28.645 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[237, 368, 402, 445] [597, 312, 747, 384] [371, 215, 501, 265] [528, 219, 657, 273] [336, 257, 474, 313] [625, 373, 790, 458] [286, 301, 438, 377] [640, 449, 820, 551] [196, 441, 380, 539] coins laying on top of [027, 152, 997, 671] a race car made from popsicle sticks, straws, and [202, 155, 325, 335] [664, 158, 798, 337] bottle caps') 2023-11-03 12:14:31.431 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 12:14:31.432 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Lamp:[231, 105, 269, 178],[191, 225, 214, 268],[292, 300, 318, 326],[347, 237, 386, 276],[440, 129, 511, 198],[628, 000, 690, 032];Person:[109, 481, 210, 1000],[008, 523, 048, 789].') 2023-11-03 12:14:38.032 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:14:38.032 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video clip including frame1,frame2 and frame3, can you tell me what thisframe:1:[010, 242, 806, 782];frame:2:[016, 255, 821, 808];frame:3:[075, 290, 885, 827] is?Using the given image frames and trajectory coordinates, please provide a concise answer.Refer to the supplied image frames and trajectory coordinates for a short response to the question.Please give a succinct reply by referring to the image frames and trajectory coordinates provided.Make sure your response is brief and references the image frames and trajectory coordinates you've been given.Draw upon the provided image frames and trajectory coordinates for a concise answer to the question.Relying on the image frames and trajectory coordinates shared, please keep your answer brief.", 'This is a/an the horse in the lead, speeding through the countryside, leaving the other behind.') 2023-11-03 12:14:38.977 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:14:38.978 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Backpack:[792, 261, 850, 368];Satchel:[842, 282, 896, 369];Coffee Table:[876, 367, 1000, 546];Bench:[684, 229, 949, 509];Chair:[528, 235, 642, 497],[103, 175, 779, 1000],[002, 159, 080, 1000];Desk:[042, 283, 158, 621];Person:[002, 266, 101, 1000],[273, 119, 712, 999],[864, 153, 989, 378],[904, 188, 999, 371],[299, 126, 345, 198].') 2023-11-03 12:14:57.066 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:14:57.067 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'boy:[758, 269, 958, 989];woman:[107, 123, 345, 562],[365, 028, 748, 999];girl:[073, 286, 396, 992],[106, 120, 350, 577],[226, 410, 364, 992],[316, 254, 515, 990],[433, 348, 626, 988],[575, 187, 748, 540],[588, 325, 805, 996];clothing:[068, 501, 241, 999],[110, 324, 335, 607],[234, 609, 339, 999],[335, 450, 525, 959],[363, 219, 584, 476],[502, 536, 636, 999],[578, 361, 741, 544],[598, 527, 796, 999],[779, 500, 962, 999];human face:[148, 350, 248, 530],[229, 158, 315, 328],[274, 460, 343, 606],[380, 306, 466, 471],[458, 074, 535, 243],[495, 397, 571, 560],[588, 212, 661, 364],[633, 368, 713, 543],[795, 364, 884, 508].') 2023-11-03 12:15:11.178 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:15:11.179 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[215, 231, 833, 731];Handbag:[029, 556, 239, 665];Chair:[630, 616, 746, 768];Desk:[001, 602, 959, 999].') 2023-11-03 12:15:40.685 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:15:40.685 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Helmet in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Helmet:[663, 428, 713, 522],[581, 416, 608, 487],[469, 467, 497, 529],[400, 229, 433, 287],[262, 614, 297, 677].') 2023-11-03 12:15:48.623 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:15:48.623 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Lamp and Person in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Lamp:[053, 002, 209, 381],[909, 002, 1000, 369];Person:[317, 074, 892, 1000].') 2023-11-03 12:15:51.663 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 12:15:51.664 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Person:[001, 571, 035, 728],[177, 568, 216, 749];Airplane:[020, 155, 819, 870];Helicopter:[711, 271, 998, 723].') 2023-11-03 12:16:10.369 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:16:10.369 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Mzvee: Ghanaian singer [333, 083, 654, 930] shares beautiful photo; flaunts her smooth back') 2023-11-03 12:16:37.088 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:16:37.088 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[638, 456, 972, 833] a woman points to [052, 322, 490, 758] a screen with [108, 380, 264, 439] the Spanish word "concreto" and tips for writing [105, 376, 272, 667] Spanish museum texts') 2023-11-03 12:17:08.357 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:17:08.358 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[430, 541, 508, 671],carFrame1:[084, 531, 152, 606],carFrame1:[409, 527, 475, 628],carFrame1:[212, 536, 281, 628],carFrame1:[040, 545, 147, 627] in this video clip.Your reply should be in alignment with the classFrame t:[xmin,ymin,xmax,ymax] structure.', 'carFrame1:[430, 541, 508, 671];Frame2:[440, 542, 583, 724],carFrame1:[084, 531, 152, 606];Frame2:[000, 527, 091, 623],carFrame1:[409, 527, 475, 628];Frame2:[405, 523, 492, 648],carFrame1:[212, 536, 281, 628];Frame2:[202, 535, 261, 610],carFrame1:[040, 545, 147, 627];Frame2:[000, 557, 081, 648].') 2023-11-03 12:17:11.633 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:17:11.634 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'motorcycle:[186, 639, 462, 873],[000, 592, 253, 848],[707, 532, 965, 858];man:[217, 480, 447, 890],[788, 473, 975, 849];clothing:[225, 535, 427, 861];car:[412, 503, 847, 699],[919, 510, 997, 687].') 2023-11-03 12:17:54.222 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:17:54.222 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[298, 025, 773, 993] Unhealthy young woman wearing [337, 024, 561, 374] hat with [338, 294, 679, 995] scarf around her neck feeling unwell holding digital thermometer suffering from flu and throat sore touching her neck standing over [002, 002, 999, 996] green wall') 2023-11-03 12:17:57.663 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:17:57.664 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'An overhead view of the stuffed turkey garnished with [262, 133, 425, 230] [686, 390, 875, 523] [614, 601, 831, 805] [077, 289, 346, 517] parsley, [540, 742, 754, 878] cut apples, and dried cranberries all around the base of the turkey.') 2023-11-03 12:18:22.382 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:18:22.383 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect teapot,human face,human arm,human head and person in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'human face:[269, 282, 292, 348],[390, 288, 418, 355],[516, 466, 580, 577],[619, 048, 665, 150];human arm:[212, 039, 234, 105],[234, 350, 317, 470],[333, 385, 392, 470],[352, 373, 456, 491],[389, 052, 433, 147],[416, 052, 475, 157],[458, 563, 581, 727],[513, 147, 592, 367];human head:[511, 407, 618, 576],[268, 266, 317, 342],[390, 252, 446, 358],[518, 029, 592, 135],[600, 022, 667, 151];person:[474, 354, 702, 857],[056, 038, 111, 233],[076, 176, 190, 366],[101, 014, 163, 186],[162, 257, 368, 561],[166, 211, 303, 522],[218, 247, 476, 751],[391, 000, 470, 360],[451, 000, 516, 241],[522, 008, 705, 385],[839, 000, 954, 492].') 2023-11-03 12:18:44.374 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:18:44.374 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'person:[000, 589, 999, 999],[090, 634, 181, 810],[151, 620, 284, 832],[268, 589, 399, 835],[410, 598, 528, 795],[495, 572, 538, 685],[581, 580, 680, 761].') 2023-11-03 12:18:45.267 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 12:18:45.267 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Cutting/chopping Board in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'chopping Board:[000, 255, 260, 923].') 2023-11-03 12:18:56.886 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:18:56.886 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'bicycle wheel:[358, 673, 421, 812],[000, 617, 019, 700],[302, 650, 346, 755],[625, 660, 671, 731];bicycle:[000, 550, 026, 693],[297, 619, 434, 793],[485, 481, 672, 727];person:[000, 494, 364, 555],[323, 487, 422, 754],[427, 497, 453, 599],[521, 554, 608, 699],[566, 385, 646, 602],[650, 511, 681, 637],[751, 499, 999, 659];wheel:[000, 605, 018, 700],[300, 650, 353, 754],[360, 670, 424, 788],[490, 639, 534, 709],[625, 655, 671, 736].') 2023-11-03 12:19:10.844 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:19:10.844 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect Sheep in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Sheep:[163, 636, 468, 927],[076, 336, 185, 405],[196, 317, 314, 423],[300, 284, 354, 336],[278, 245, 317, 314],[062, 189, 128, 250],[102, 126, 172, 178],[136, 121, 176, 171],[288, 156, 366, 200],[330, 188, 377, 246],[367, 188, 429, 247],[403, 165, 438, 222],[511, 344, 620, 396],[656, 279, 710, 335],[629, 176, 682, 229],[677, 171, 738, 226],[844, 309, 937, 371],[948, 086, 999, 131].') 2023-11-03 12:19:35.195 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:19:35.196 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'chair:[289, 415, 665, 1000],[006, 570, 354, 998],[024, 290, 148, 436],[238, 356, 341, 488],[450, 354, 496, 447],[521, 506, 933, 936],[560, 795, 998, 998],[594, 272, 655, 433],[630, 321, 777, 624],[751, 303, 918, 545],[754, 446, 998, 681],[887, 232, 975, 447];man:[006, 179, 392, 1000],[000, 147, 169, 423],[058, 148, 374, 485],[341, 246, 661, 937],[344, 166, 450, 372],[366, 176, 565, 457],[537, 158, 647, 299],[655, 150, 893, 602];table:[663, 256, 998, 354];clothing:[000, 228, 669, 897],[666, 277, 905, 588].') 2023-11-03 12:19:35.953 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 12:19:35.953 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'flower:[206, 238, 800, 854].') 2023-11-03 12:19:47.252 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:19:47.253 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Person:[634, 180, 822, 915],[428, 327, 603, 882],[061, 178, 263, 896],[002, 737, 072, 1000],[609, 750, 670, 927],[584, 751, 618, 1000],[554, 095, 639, 333],[400, 059, 529, 340],[219, 805, 257, 879];Wild Bird:[466, 725, 565, 887];TV:[297, 726, 354, 788].') 2023-11-03 12:19:53.619 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:19:53.620 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'human eye:[336, 368, 415, 482],[278, 487, 330, 568];human head:[220, 238, 496, 781];human arm:[518, 301, 726, 792],[334, 799, 466, 917];human nose:[338, 488, 415, 593];human hand:[526, 323, 719, 599],[343, 804, 463, 943];person:[270, 279, 721, 947];human mouth:[388, 553, 443, 652];human hair:[366, 281, 436, 405];human face:[266, 285, 477, 712].') 2023-11-03 12:20:10.818 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:20:10.819 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'clothing:[755, 577, 881, 853],[107, 333, 999, 909],[880, 593, 999, 840];human body:[011, 328, 936, 822],[118, 507, 318, 848],[201, 226, 999, 543],[257, 528, 456, 835],[375, 511, 618, 845],[551, 511, 739, 859],[675, 511, 882, 841],[830, 497, 999, 845];woman:[000, 367, 252, 857],[106, 498, 343, 859],[180, 327, 273, 590],[205, 348, 380, 673],[279, 359, 528, 648],[291, 238, 480, 476],[298, 302, 368, 628],[308, 509, 460, 871],[359, 500, 618, 859],[441, 353, 671, 774],[458, 213, 581, 604],[578, 320, 781, 750],[591, 515, 750, 871],[673, 313, 941, 849],[700, 247, 812, 463],[731, 247, 933, 650],[816, 496, 999, 861],[888, 280, 999, 604];girl:[015, 379, 257, 848],[256, 524, 455, 883],[445, 374, 700, 791],[471, 222, 581, 711],[554, 520, 737, 874],[671, 510, 881, 881],[698, 266, 799, 464],[743, 262, 949, 791],[827, 499, 999, 847];plant:[000, 810, 999, 999].') 2023-11-03 12:20:48.952 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:20:48.952 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Cup,Gas stove,Pot,Bottle,Slide,Donkey and Fire Extinguisher in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Cup:[601, 086, 674, 177],[661, 062, 716, 145],[693, 118, 746, 226],[725, 131, 788, 261];Gas stove:[091, 512, 547, 977];Pot:[356, 396, 626, 622],[509, 097, 636, 283],[479, 180, 663, 336];Bottle:[352, 049, 418, 275].') 2023-11-03 12:21:09.168 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:21:09.168 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "george clooney [251, 191, 598, 981] at the 'solaris' premiere at the cinerama dome at arclight cinemas [001, 004, 993, 996] in hollywood, california on november 19, 2002. - solaris 2002 film stock videos & royalty-free footage") 2023-11-03 12:21:34.255 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:21:34.255 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Sneakers:[299, 860, 398, 939];Person:[621, 186, 767, 964],[452, 163, 600, 916],[284, 120, 429, 940].') 2023-11-03 12:21:35.184 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:21:35.184 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Boat in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Boat:[002, 416, 998, 1000].') 2023-11-03 12:21:50.058 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:21:50.059 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[544, 612, 664, 927],personFrame1:[480, 489, 604, 819],personFrame1:[292, 542, 455, 900] in this video clip.For clarity in your response, always adopt the classFrame t:[xmin,ymin,xmax,ymax] format.', 'personFrame1:[544, 612, 664, 927];Frame2:[567, 599, 665, 930],personFrame1:[480, 489, 604, 819];Frame2:[490, 476, 622, 823],personFrame1:[292, 542, 455, 900];Frame2:[289, 530, 468, 908].') 2023-11-03 12:22:11.149 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:22:11.149 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Relatives [612, 017, 996, 995] [186, 122, 603, 995] of servicemen killed in the Iraq war had a two-hour private read through of the Chilcot report [718, 279, 906, 659] ahead of publication') 2023-11-03 12:22:32.746 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:22:32.747 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'boat:[172, 375, 762, 818],[305, 154, 468, 194],[911, 145, 999, 175];paddle:[161, 369, 311, 772],[222, 352, 370, 725],[449, 299, 514, 378],[500, 450, 566, 772],[630, 455, 787, 766],[700, 465, 814, 731];person:[233, 340, 358, 636],[327, 111, 448, 177],[333, 337, 421, 583],[353, 439, 499, 645],[389, 236, 496, 452],[399, 322, 526, 592],[463, 162, 596, 464],[493, 390, 635, 675],[615, 355, 716, 630],[623, 306, 753, 547],[918, 116, 999, 155];personal flotation device:[233, 421, 340, 612],[318, 375, 415, 548],[325, 120, 448, 182],[389, 358, 481, 504],[400, 479, 483, 646],[404, 278, 512, 467],[489, 206, 594, 383],[544, 449, 649, 642],[617, 412, 722, 568],[654, 340, 735, 499],[932, 112, 999, 157];clothing:[237, 476, 355, 673],[261, 385, 310, 456],[310, 340, 357, 423],[320, 414, 413, 566],[350, 509, 498, 653],[390, 373, 492, 526],[466, 226, 593, 445],[514, 449, 665, 630],[593, 425, 711, 592],[648, 370, 758, 553].') 2023-11-03 12:22:53.399 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:22:53.399 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Pickup Truck in this image.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Pickup Truck:[139, 630, 261, 705].') 2023-11-03 12:23:11.400 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:23:11.401 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'clothing:[325, 453, 735, 993],[000, 205, 046, 520],[170, 167, 282, 443],[248, 126, 375, 372],[521, 210, 621, 561],[623, 343, 894, 998],[762, 082, 817, 212],[805, 218, 999, 766],[906, 102, 989, 231];woman:[055, 193, 238, 630],[341, 120, 470, 571],[346, 266, 733, 998],[576, 140, 691, 668],[632, 093, 885, 998],[796, 043, 999, 945];man:[000, 136, 081, 631],[071, 065, 119, 197],[173, 038, 347, 536],[242, 005, 370, 300],[525, 068, 628, 511],[746, 000, 828, 128],[884, 000, 990, 241];human face:[040, 212, 076, 295],[090, 078, 118, 158],[116, 237, 184, 407],[211, 068, 253, 190],[258, 036, 296, 130],[404, 197, 455, 327],[476, 376, 548, 617],[582, 132, 625, 253],[658, 200, 680, 343],[682, 146, 766, 368],[783, 041, 813, 122],[846, 105, 888, 227],[899, 000, 937, 117].') 2023-11-03 12:23:31.678 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:23:31.679 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[396, 258, 523, 847] and track its trajectory.Incorporate the Frame t:[xmin,ymin,xmax,ymax] format when presenting trajectories in your response.', 'This is a raccoonFrame1:[396, 258, 523, 847];Frame2:[346, 215, 471, 826];Frame3:[363, 483, 489, 997];Frame4:[346, 425, 469, 977];Frame5:[392, 472, 519, 938].') 2023-11-03 12:23:44.445 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:23:44.446 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'person:[005, 000, 083, 287],[202, 357, 284, 602],[206, 040, 284, 275],[228, 726, 886, 999],[303, 032, 386, 275],[408, 022, 495, 266],[515, 030, 622, 254],[654, 012, 750, 244];sculpture:[661, 033, 770, 249],[000, 000, 090, 283],[083, 359, 299, 707],[211, 727, 887, 999],[213, 038, 290, 285],[236, 513, 343, 640],[305, 033, 389, 276],[315, 507, 465, 603],[404, 018, 495, 264],[493, 328, 656, 612],[516, 033, 622, 260],[683, 490, 794, 613],[768, 472, 887, 606],[785, 000, 977, 241].') 2023-11-03 12:24:01.291 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:24:01.291 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect footwear,man and plant in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'footwear:[105, 734, 171, 846],[881, 586, 935, 740],[949, 820, 999, 909];man:[835, 051, 993, 811],[000, 128, 311, 850],[207, 284, 271, 681],[450, 289, 666, 764];plant:[000, 491, 999, 999].') 2023-11-03 12:24:44.444 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:24:44.445 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[008, 006, 988, 978] This chart display the student staff ratio of Roane State Community College and community college average student staff ratio of TN by year, with the latest 2015-16 school year data.') 2023-11-03 12:24:58.307 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:24:58.308 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[197, 148, 804, 931] The product is available in several different flavors and all flavors are reported as having the same caffeine amount. Starbucks Iced Coffee Cup Sizes') 2023-11-03 12:25:05.522 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:25:05.522 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[146, 248, 340, 833],[199, 194, 248, 256],[355, 139, 513, 725],[458, 223, 656, 845],[776, 031, 865, 405],[877, 019, 979, 438];Desk:[000, 687, 223, 947];Leather Shoes:[564, 797, 593, 839],[458, 812, 506, 845],[248, 793, 274, 829],[407, 697, 436, 729];Hat:[526, 223, 593, 294],[384, 141, 441, 199];Glasses:[390, 193, 435, 222];Backpack:[894, 069, 954, 223];Sneakers:[918, 415, 944, 438],[834, 350, 862, 383],[786, 385, 806, 403];Tie:[423, 263, 456, 413];Violin:[238, 305, 356, 425];Trombone:[408, 169, 483, 478];Drum:[276, 433, 331, 521],[274, 538, 341, 710];Cymbal:[336, 368, 375, 395],[356, 390, 384, 421];Cup:[062, 667, 086, 732].') 2023-11-03 12:25:08.446 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:25:08.446 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'woman:[341, 673, 476, 998],[044, 511, 126, 684],[088, 650, 303, 999],[137, 143, 167, 210],[208, 203, 257, 296],[411, 479, 488, 603],[428, 193, 488, 277],[501, 162, 585, 224],[557, 260, 628, 355],[570, 448, 635, 724],[606, 159, 672, 240],[613, 446, 685, 737],[672, 382, 712, 505],[700, 374, 738, 479],[725, 172, 790, 358],[867, 463, 931, 589],[880, 792, 999, 999],[919, 300, 985, 396],[920, 482, 982, 596],[929, 314, 999, 414],[953, 465, 992, 570];man:[006, 506, 120, 735],[106, 188, 157, 265],[422, 458, 466, 585],[475, 379, 549, 710],[541, 380, 588, 511],[675, 475, 725, 558],[711, 385, 786, 643],[765, 464, 818, 591],[802, 298, 836, 393],[807, 445, 863, 570],[883, 295, 933, 398];plant:[000, 070, 999, 999].') 2023-11-03 12:25:15.852 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:25:15.852 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "This is an image of boy's smart watch with tracker for [080, 315, 396, 485] heart rate in green and black colors") 2023-11-03 12:25:20.733 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:25:20.733 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[623, 355, 719, 831],personFrame1:[455, 271, 607, 999],personFrame1:[303, 325, 469, 908] in this video clip.Please strictly follow the format categoryFrame t:[xmin,ymin,xmax,ymax] to answer the question.', 'personFrame1:[623, 355, 719, 831];Frame2:[560, 360, 744, 871],personFrame1:[455, 271, 607, 999];Frame2:[442, 241, 667, 999],personFrame1:[303, 325, 469, 908];Frame2:[211, 325, 460, 958].') 2023-11-03 12:25:32.383 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:25:32.384 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[132, 260, 383, 521],carFrame1:[516, 249, 648, 449],carFrame1:[475, 289, 519, 350] in this video clip.Your reply should be in alignment with the classFrame t:[xmin,ymin,xmax,ymax] structure.', 'carFrame1:[132, 260, 383, 521];Frame2:[164, 264, 407, 526],carFrame1:[516, 249, 648, 449];Frame2:[530, 252, 671, 465],carFrame1:[475, 289, 519, 350];Frame2:[487, 290, 532, 355].') 2023-11-03 12:25:39.172 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:25:39.173 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[328, 237, 998, 762] and track its trajectory.Stick to the Frame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'This is a armadilloFrame1:[328, 237, 998, 762];Frame2:[262, 252, 997, 777];Frame3:[269, 247, 997, 789];Frame4:[212, 237, 951, 775].') 2023-11-03 12:25:41.099 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:25:41.099 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the the fish does not move position in this video clip?Ensure the trajectories in your answer follow the TrackiFrame t:[xmin,ymin,xmax,ymax] structure.', 'There is no the fish does not move position.') 2023-11-03 12:25:46.414 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:25:46.415 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Bench:[146, 752, 551, 999],[451, 780, 892, 999];Person:[418, 369, 594, 999],[002, 487, 285, 999],[002, 369, 138, 761].') 2023-11-03 12:25:47.846 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:25:47.847 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The Sewing Bee [615, 293, 754, 480], a purple Steampunk top hat [174, 160, 975, 915] with a range [172, 046, 977, 982] of seamstress accoutrements') 2023-11-03 12:26:18.161 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:26:18.161 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Hyperion, a supercluster of galaxies in [006, 011, 988, 989] the early universe') 2023-11-03 12:26:58.817 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:26:58.818 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'person:[000, 030, 1000, 1000];clothing:[048, 124, 889, 1000];human mouth:[496, 482, 563, 520];human hair:[236, 002, 777, 322];human head:[239, 006, 763, 550];mammal:[000, 000, 990, 998];human face:[381, 174, 712, 564];human nose:[558, 400, 632, 492].') 2023-11-03 12:27:05.695 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:27:05.695 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "[024, 285, 969, 833] A pair of carved 'Yaozhou' bowls, Northern Song dynasty (960-1127") 2023-11-03 12:27:14.004 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:27:14.004 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Glasses:[015, 821, 127, 860],[218, 918, 374, 980],[735, 897, 766, 929],[495, 875, 594, 927],[748, 874, 777, 909],[852, 754, 976, 796],[789, 740, 823, 769];Hat:[155, 707, 275, 836];Surveillance Camera:[183, 124, 219, 176];Whiteboard:[296, 037, 580, 891].') 2023-11-03 12:27:37.341 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:27:37.342 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "story about [096, 281, 298, 996] a rat stealing a cat's GPS tracker and sending the owner on a wild chase thumbnail includes [496, 006, 998, 998] two pictures including [068, 005, 380, 997] a man holding a cat and [522, 194, 919, 996] a map with lines drawn over it") 2023-11-03 12:27:56.906 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:27:56.907 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2 and frame3, please tell me what this oneframe:1:[001, 002, 998, 997];frame:2:[001, 002, 998, 997];frame:3:[001, 002, 998, 997] is doing?Ensure your answer is to the point, taking into account the provided image frames and trajectory coordinates.', "This is a/an a person cutting cat's nails") 2023-11-03 12:28:43.576 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:28:43.577 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Miss Australia 1967, [099, 044, 994, 996] Margaret Rohan holding [002, 520, 358, 900] the sceptre, wearing [256, 041, 835, 394] the crown, [102, 565, 994, 998] robe and white gloves - click to view larger image') 2023-11-03 12:28:57.691 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 12:28:57.692 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Person:[575, 489, 706, 690],[474, 443, 584, 680],[381, 506, 496, 695],[361, 291, 507, 682];Flower:[959, 109, 999, 153];Vase:[957, 140, 999, 179];Carpet:[811, 839, 988, 930],[718, 929, 973, 999];Cup:[914, 316, 930, 346],[884, 318, 900, 348].') 2023-11-03 12:29:10.533 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:29:10.533 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[546, 377, 620, 573] and track its trajectory.Incorporate the Frame t:[xmin,ymin,xmax,ymax] format when presenting trajectories in your response.', 'This is a personFrame1:[546, 377, 620, 573];Frame2:[605, 343, 670, 548];Frame3:[638, 334, 695, 538];Frame4:[533, 397, 596, 575].') 2023-11-03 12:29:12.359 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:29:12.360 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Clock:[023, 082, 082, 161];Bench:[026, 579, 059, 650];Person:[020, 447, 051, 499],[010, 474, 064, 641],[027, 514, 055, 647],[030, 509, 076, 663],[069, 461, 102, 535],[097, 476, 117, 511],[101, 477, 141, 525],[131, 466, 154, 514],[151, 480, 173, 513],[935, 485, 957, 564],[935, 479, 978, 639],[968, 467, 1000, 640],[274, 445, 343, 515];Umbrella:[089, 408, 181, 480];Hat:[077, 462, 094, 483];Traffic Light:[568, 070, 607, 189],[808, 094, 853, 208],[154, 332, 188, 371],[158, 229, 190, 325];Traffic Sign:[686, 123, 721, 189],[162, 378, 207, 448];Ambulance:[053, 271, 972, 761];Street Lights:[205, 230, 233, 463],[220, 291, 241, 389],[230, 327, 246, 392];SUV:[273, 466, 332, 507].') 2023-11-03 12:29:15.843 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:29:15.843 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'tree:[010, 161, 203, 482],[089, 312, 323, 537],[156, 273, 218, 326],[182, 229, 529, 558],[345, 333, 640, 563],[648, 465, 781, 542];clothing:[704, 050, 904, 445];house:[000, 000, 998, 853].') 2023-11-03 12:29:59.703 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:29:59.703 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The Shorecrest Boy’s Swim and Dive team [200, 600, 325, 881] [468, 558, 548, 836] [312, 568, 403, 854] [784, 549, 876, 858] [393, 566, 476, 844] poses for their team photo, after an unusually successful year.') 2023-11-03 12:30:08.205 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:30:08.205 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'tree:[101, 609, 303, 938],[000, 540, 100, 998],[000, 290, 779, 800],[231, 579, 282, 700],[521, 816, 556, 942],[592, 759, 673, 922],[766, 568, 823, 694],[780, 655, 831, 892],[834, 627, 933, 913],[876, 739, 933, 894],[926, 698, 999, 855].') 2023-11-03 12:30:10.823 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:30:10.824 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Train:[110, 111, 755, 640];Car:[838, 423, 926, 503],[928, 435, 1000, 513];Person:[075, 393, 132, 775].') 2023-11-03 12:30:16.705 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:30:16.705 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'jacket:[076, 139, 389, 618],[523, 259, 981, 789];jeans:[350, 204, 504, 512];person:[028, 020, 168, 517],[089, 035, 370, 975],[151, 014, 225, 071],[360, 012, 465, 512],[514, 020, 646, 843],[517, 026, 975, 979],[643, 014, 715, 092];fashion accessory:[039, 485, 117, 521],[195, 790, 237, 890],[223, 906, 309, 976],[376, 476, 467, 548];footwear:[029, 481, 117, 526],[195, 795, 234, 890],[223, 906, 304, 978],[375, 468, 467, 554].') 2023-11-03 12:31:09.101 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:31:09.102 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the the motorbike moving towards the far left..The TrackiFrame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'There is one the motorbike moving towards the far left..Track1frame:1:[148, 433, 199, 500];frame:2:[148, 433, 199, 500];frame:3:[147, 433, 198, 500].') 2023-11-03 12:32:01.568 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:32:01.568 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Scooter:[233, 361, 807, 987];Traffic cone:[120, 428, 157, 512],[318, 447, 356, 535],[489, 458, 510, 521],[696, 447, 735, 531];Street Lights:[790, 054, 842, 276],[771, 161, 794, 260],[667, 132, 695, 257];Van:[697, 255, 812, 354],[108, 227, 353, 351],[338, 241, 377, 303],[061, 230, 163, 278];Car:[344, 264, 366, 308];Helmet:[539, 123, 623, 213];Glasses:[551, 199, 610, 229];Sneakers:[436, 823, 540, 926],[414, 748, 526, 826];Person:[413, 122, 697, 926],[000, 226, 025, 367];Watch:[479, 516, 500, 564].') 2023-11-03 12:32:59.986 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 12:32:59.986 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'SUV:[175, 448, 294, 610],[289, 455, 405, 594];Truck:[537, 436, 641, 489];Person:[645, 422, 673, 493],[569, 470, 611, 551].') 2023-11-03 12:33:04.784 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:33:04.785 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect clothing,human face,human head and man in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'clothing:[000, 376, 216, 772],[109, 513, 335, 805],[230, 383, 461, 521],[321, 482, 632, 661],[329, 644, 691, 848],[509, 415, 686, 571],[681, 408, 928, 765];human face:[099, 343, 163, 391],[196, 466, 267, 520],[314, 362, 397, 401],[439, 453, 509, 499],[461, 593, 533, 653],[541, 373, 608, 424],[697, 456, 772, 513];human head:[440, 576, 553, 657],[061, 313, 180, 391],[174, 436, 298, 525],[290, 323, 431, 403],[400, 422, 542, 497],[507, 343, 620, 423],[654, 425, 781, 511],[711, 343, 826, 413];man:[359, 581, 698, 911],[000, 315, 233, 820],[096, 438, 335, 854],[236, 320, 464, 546],[324, 415, 638, 673],[512, 347, 673, 564],[619, 417, 917, 841],[708, 338, 939, 586].') 2023-11-03 12:33:06.644 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:33:06.645 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Person:[828, 233, 989, 712],[594, 342, 702, 584],[718, 404, 749, 496],[494, 455, 567, 518],[384, 328, 438, 441],[317, 376, 342, 429],[065, 307, 179, 606];Camera:[833, 360, 855, 416];Briefcase:[253, 618, 419, 831].') 2023-11-03 12:33:07.614 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:33:07.614 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Sneakers:[705, 706, 894, 758],[429, 931, 554, 988],[180, 930, 401, 990],[001, 930, 075, 974];Gloves:[132, 507, 234, 569];American Football:[580, 789, 770, 864],[584, 764, 647, 796];Person:[662, 069, 937, 761],[165, 030, 775, 990],[001, 128, 287, 975].') 2023-11-03 12:33:14.091 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:33:14.091 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Aerial shot showing [003, 007, 996, 994] the rugged mountains and unusual landscape of the Deccan Traps in Maharashtra, India') 2023-11-03 12:33:21.188 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:33:21.189 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'person:[443, 122, 711, 782];human face:[320, 298, 376, 429],[474, 184, 551, 330].') 2023-11-03 12:33:30.971 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:33:30.971 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect doll,footwear,human face and girl in this image.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'doll:[399, 113, 892, 928],[000, 079, 513, 900];footwear:[001, 863, 089, 900],[888, 896, 999, 957];human face:[332, 150, 429, 213],[498, 240, 611, 325];girl:[000, 075, 515, 899],[421, 140, 967, 974].') 2023-11-03 12:34:22.370 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:34:22.371 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Officer Daryl Pierson [405, 477, 614, 836] walks with his niece [326, 637, 441, 821], Madison, 3, and son [579, 602, 737, 827], Christian, in Arrowhead Park, Inlet, NY in the Adirondacks on Aug. 30, four days before he died.') 2023-11-03 12:34:33.270 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:34:33.271 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person,Street Lights and Traffic Light in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[184, 585, 224, 740];Street Lights:[507, 147, 585, 376],[622, 233, 681, 393];Traffic Light:[193, 197, 213, 268].') 2023-11-03 12:34:35.789 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:34:35.790 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[479, 347, 996, 983] [014, 315, 633, 998] 2 girls with [361, 326, 640, 512] [727, 378, 941, 551] snorkeling masks on [364, 330, 625, 690] their heads standing in [000, 024, 995, 996] water and holding up [864, 706, 997, 864] peace signs') 2023-11-03 12:34:41.177 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:34:41.177 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'a red jumpsuit [302, 317, 665, 895] with a plunging neckline, no sleeves, mini pockets [317, 491, 399, 553], white booties [308, 880, 437, 957] and a necklace [387, 328, 468, 374]') 2023-11-03 12:35:05.379 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:35:05.379 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect orange,clothing,woman and human face in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'clothing:[853, 543, 1000, 963],[000, 204, 081, 315],[000, 514, 155, 999],[033, 429, 120, 574],[043, 188, 130, 291],[055, 405, 230, 553],[064, 060, 096, 138],[097, 070, 175, 181],[101, 188, 173, 272],[139, 346, 315, 540],[140, 180, 216, 264],[231, 152, 300, 243],[295, 327, 393, 450],[378, 577, 659, 999],[386, 269, 501, 386],[442, 420, 716, 845],[445, 338, 623, 464],[449, 207, 536, 320],[620, 286, 689, 448],[633, 238, 726, 355],[711, 230, 800, 341],[735, 312, 933, 556],[761, 224, 825, 274],[881, 386, 999, 629],[883, 243, 954, 315];woman:[379, 387, 658, 983],[151, 224, 316, 531],[157, 140, 207, 313],[200, 148, 242, 244],[388, 185, 495, 392],[501, 175, 571, 297],[541, 161, 606, 292],[735, 227, 933, 557],[770, 168, 835, 294],[862, 144, 914, 258],[876, 177, 959, 349],[915, 198, 999, 390];human face:[104, 332, 155, 434],[202, 284, 254, 360],[489, 482, 568, 615],[517, 297, 556, 376],[550, 306, 620, 440],[973, 320, 999, 402],[987, 450, 999, 594].') 2023-11-03 12:35:17.891 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:35:17.891 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'person:[025, 586, 066, 651],[090, 583, 156, 620],[205, 580, 276, 600],[276, 581, 371, 596],[581, 573, 633, 651],[726, 574, 769, 670],[751, 578, 792, 661],[804, 583, 859, 646],[869, 580, 905, 643].') 2023-11-03 12:35:22.478 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:35:22.478 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'clothing:[441, 754, 689, 991],[028, 448, 110, 571],[030, 575, 058, 717],[068, 668, 226, 996],[247, 651, 356, 846],[259, 525, 305, 632],[323, 709, 491, 983],[342, 604, 410, 719],[430, 606, 490, 699],[447, 621, 562, 798],[534, 606, 598, 688],[571, 632, 630, 738],[579, 701, 717, 842],[596, 274, 683, 436],[720, 682, 816, 892],[733, 855, 861, 998],[894, 630, 970, 836],[944, 834, 999, 998];woman:[260, 480, 306, 673],[743, 698, 961, 999],[938, 555, 999, 665];table:[000, 732, 085, 915],[006, 756, 248, 999],[315, 840, 972, 999],[673, 663, 768, 774];man:[026, 397, 111, 707],[070, 606, 229, 984],[247, 602, 363, 847],[318, 659, 505, 963],[345, 566, 397, 714],[350, 499, 402, 620],[428, 661, 694, 999],[439, 564, 500, 693],[448, 582, 560, 741],[535, 560, 602, 695],[586, 630, 727, 834],[589, 215, 686, 438],[702, 537, 753, 666],[726, 598, 823, 823],[803, 594, 843, 666],[844, 551, 899, 648],[878, 587, 969, 807],[946, 666, 999, 999],[973, 539, 999, 626];human face:[630, 217, 653, 283],[796, 745, 850, 842].') 2023-11-03 12:35:33.235 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:35:33.236 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Helmet:[680, 515, 707, 568],[337, 698, 362, 734],[380, 586, 403, 622],[151, 436, 174, 468],[105, 600, 130, 646],[001, 657, 025, 695];Gloves:[864, 913, 885, 947],[715, 601, 753, 646],[628, 581, 650, 645],[400, 679, 432, 720];Person:[001, 656, 041, 945],[050, 597, 133, 865],[093, 433, 177, 671],[335, 589, 433, 839],[288, 694, 384, 978],[807, 811, 891, 998],[623, 514, 765, 701],[001, 001, 043, 078],[043, 020, 090, 082],[049, 001, 083, 045],[090, 006, 141, 087],[135, 004, 186, 095],[184, 031, 240, 108],[235, 029, 283, 113],[241, 001, 299, 069],[286, 063, 327, 126],[306, 001, 364, 072],[349, 001, 399, 095],[381, 073, 428, 155],[401, 004, 457, 111],[423, 088, 470, 170],[446, 001, 478, 048],[468, 094, 513, 187],[502, 112, 564, 203],[548, 063, 588, 155],[540, 001, 593, 064],[581, 001, 641, 080],[584, 069, 643, 174],[634, 001, 672, 164],[663, 001, 707, 097],[731, 001, 786, 122],[676, 203, 726, 282],[719, 214, 770, 304],[763, 235, 812, 324],[804, 248, 856, 345],[844, 263, 898, 364],[884, 298, 927, 387],[737, 056, 795, 190],[766, 140, 820, 253],[809, 165, 854, 264],[834, 167, 893, 281],[795, 077, 843, 164],[837, 088, 885, 195],[876, 110, 918, 213],[935, 318, 977, 405],[968, 325, 999, 416],[969, 230, 999, 343],[946, 209, 988, 323],[952, 142, 1000, 260],[832, 004, 885, 095],[876, 024, 917, 118],[917, 025, 960, 155],[952, 041, 999, 157],[918, 001, 981, 054],[460, 001, 519, 117];Hockey Stick:[584, 529, 704, 705],[388, 685, 443, 720],[258, 762, 307, 819],[138, 546, 207, 640],[035, 791, 170, 896].') 2023-11-03 12:35:44.631 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:35:44.631 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'human body:[291, 196, 456, 927],[466, 400, 730, 999];human hair:[620, 293, 705, 402],[355, 233, 395, 307];man:[280, 176, 451, 932],[407, 266, 522, 723],[504, 286, 740, 999],[820, 329, 920, 553];wheel:[813, 546, 954, 726];human arm:[008, 405, 126, 734],[290, 284, 342, 446],[396, 354, 479, 558],[560, 444, 653, 649],[578, 476, 735, 641];guitar:[565, 520, 625, 700];human leg:[312, 560, 362, 853],[375, 579, 412, 933],[508, 635, 617, 999],[600, 715, 709, 993];human head:[345, 228, 395, 338],[618, 291, 698, 434],[854, 321, 895, 394];clothing:[313, 299, 463, 898],[492, 428, 740, 985],[842, 383, 900, 498];human face:[343, 240, 386, 330],[620, 330, 675, 437];human hand:[020, 406, 083, 468],[309, 281, 349, 345],[399, 494, 432, 563].') 2023-11-03 12:35:55.495 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:35:55.495 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Wine Glass:[505, 314, 542, 446],[437, 433, 493, 599],[416, 520, 451, 652],[120, 800, 197, 998],[576, 581, 624, 727],[650, 645, 701, 828],[483, 760, 562, 999],[762, 603, 818, 792],[539, 562, 581, 679],[591, 656, 645, 808],[579, 783, 646, 1000];Bottle:[375, 464, 420, 651],[292, 692, 369, 999];Plate:[288, 843, 385, 897],[496, 675, 532, 716];Cup:[205, 831, 279, 999],[505, 765, 563, 862],[648, 605, 697, 650];Dinning Table:[040, 545, 832, 1000];Napkin:[342, 557, 414, 612],[728, 848, 803, 938];Flower:[265, 645, 500, 864];Person:[000, 150, 351, 1000],[503, 538, 1000, 1000],[769, 311, 1000, 771],[453, 135, 989, 692],[478, 161, 754, 630],[239, 180, 531, 588];Glasses:[097, 308, 148, 396].') 2023-11-03 12:36:06.736 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:36:06.736 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Students [265, 340, 326, 539] [085, 306, 151, 590] [336, 346, 399, 537] [193, 324, 263, 605] [506, 309, 614, 636] touring MU Health Care. This is a picutre of students learning about the ambulatory services as they stand next to the MU Health helicopter.') 2023-11-03 12:36:08.998 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:36:08.998 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Slippers:[272, 032, 431, 205],[091, 068, 285, 246],[015, 075, 203, 250],[001, 075, 122, 245],[228, 205, 393, 434],[389, 042, 557, 305],[505, 028, 698, 326],[634, 031, 798, 303],[686, 309, 889, 618],[526, 303, 710, 573],[347, 289, 526, 526],[497, 500, 693, 635],[296, 520, 487, 761],[457, 626, 632, 815],[580, 620, 799, 824],[526, 780, 803, 921],[315, 826, 541, 935],[305, 861, 614, 1000],[144, 364, 311, 523],[055, 322, 239, 514],[001, 331, 166, 508],[001, 321, 084, 390],[924, 066, 999, 128],[691, 053, 925, 214],[761, 055, 1000, 211],[001, 547, 056, 623],[001, 559, 150, 753],[061, 568, 249, 773],[138, 583, 345, 789].') 2023-11-03 12:37:25.806 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:37:25.807 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect chair,clothing,human face and person in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'chair:[000, 546, 102, 761],[000, 733, 113, 996],[091, 621, 347, 990],[135, 308, 202, 428],[175, 278, 235, 453],[226, 428, 392, 673],[352, 620, 635, 993],[382, 405, 552, 583],[526, 353, 608, 471],[728, 385, 907, 751],[791, 321, 958, 631],[865, 198, 981, 500];clothing:[656, 020, 793, 448],[063, 448, 266, 668],[211, 310, 368, 601],[302, 368, 696, 830],[373, 315, 512, 576],[456, 136, 583, 286],[477, 275, 620, 388];human face:[511, 078, 540, 143],[512, 506, 592, 626];person:[052, 368, 277, 735],[220, 255, 378, 601],[276, 366, 698, 883],[358, 221, 526, 593],[448, 066, 588, 286],[485, 191, 622, 403],[655, 000, 782, 533].') 2023-11-03 12:37:36.231 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:37:36.231 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'An illustration of [292, 093, 726, 635] a person sitting in [263, 195, 736, 882] a chair reading, with [336, 098, 504, 365] a cat sleeping on the back of the chair') 2023-11-03 12:37:40.432 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:37:40.432 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'LSU eyeing 10-win season with 42-10 victory over Rice') 2023-11-03 12:38:42.386 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:38:42.386 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect vehicle,table,man and clothing in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'vehicle:[000, 063, 998, 374],[785, 209, 998, 380];table:[215, 607, 635, 838],[000, 594, 065, 717],[316, 617, 537, 681],[460, 529, 640, 586],[821, 669, 879, 748];man:[022, 334, 217, 846];clothing:[060, 434, 174, 826],[424, 478, 521, 555],[629, 490, 709, 555],[769, 531, 875, 591],[849, 637, 998, 833],[854, 575, 911, 673].') 2023-11-03 12:39:20.520 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 12:39:20.521 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Game of Thrones : combien de saisons avant la fin ? Le boss de HBO répond') 2023-11-03 12:40:09.686 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:40:09.686 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[222, 098, 798, 789] Carol Cole, "The Dissection of ANI", from the series "ANI (Anti-Nothingness Image)", 1993. Clay, embroidery thread, linen, silk, and satin; 10 5/8 x 13 x 4 in. Courtesy of the artist.') 2023-11-03 12:40:33.024 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:40:33.024 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, can you tell me what thisframe:1:[123, 404, 375, 599];frame:2:[117, 291, 377, 601];frame:3:[100, 248, 392, 609] is?Ensure your answer is to the point, taking into account the provided image frames and trajectory coordinates.', 'This is a/an two dogs having fun together.') 2023-11-03 12:40:44.478 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:40:44.479 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Tent:[886, 945, 938, 974].') 2023-11-03 12:40:51.209 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:40:51.210 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Desk in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Desk:[001, 630, 481, 999],[284, 454, 449, 652].') 2023-11-03 12:40:51.857 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:40:51.858 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given an image,please detect person in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'person:[025, 490, 103, 857],[037, 677, 123, 1000],[096, 567, 175, 1000],[078, 337, 113, 550],[078, 300, 123, 497],[141, 280, 176, 470],[225, 295, 270, 450],[285, 452, 323, 650],[458, 532, 505, 790],[526, 582, 581, 907],[562, 532, 610, 840],[570, 490, 613, 785],[511, 485, 560, 735],[513, 360, 547, 570],[586, 377, 630, 627],[632, 357, 674, 557],[666, 390, 705, 620],[677, 315, 720, 515],[691, 422, 745, 687],[652, 557, 721, 922],[595, 635, 658, 985],[778, 367, 814, 552],[818, 390, 867, 577],[848, 392, 896, 622],[901, 450, 948, 695],[915, 455, 967, 725],[923, 572, 1000, 897],[962, 640, 1000, 987],[843, 832, 973, 1000],[602, 325, 635, 480],[666, 217, 698, 367],[755, 337, 794, 537],[746, 425, 778, 562],[776, 342, 811, 532],[913, 390, 948, 522],[405, 080, 570, 245],[404, 250, 451, 370],[244, 232, 295, 330],[613, 105, 676, 317],[728, 235, 835, 352],[675, 127, 784, 237],[835, 327, 897, 440].') 2023-11-03 12:41:11.135 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:41:11.136 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the people pulling horse left in this video clip?When you incorporate trajectories into your answer, adhere to the TrackiFrame t:[xmin,ymin,xmax,ymax] format.,Present the trajectories in your response using the designated TrackiFrame t:[xmin,ymin,xmax,ymax] layout.', 'There are 3 people pulling horse left.Track1frame:1:[864, 441, 999, 669];frame:2:[759, 378, 999, 666];frame:3:[469, 368, 737, 673],Track2frame:1:[512, 390, 539, 408];frame:2:[489, 390, 524, 409];frame:3:[458, 381, 502, 403],Track3frame:1:[124, 394, 224, 516];frame:2:[109, 391, 190, 511];frame:3:[071, 399, 180, 499].') 2023-11-03 12:41:24.070 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:41:24.071 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'vehicle:[308, 677, 432, 807].') 2023-11-03 12:42:05.096 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:42:05.097 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'man:[645, 263, 958, 788],[241, 461, 506, 852],[458, 350, 583, 616],[512, 394, 729, 644],[766, 308, 877, 594];jeans:[643, 430, 958, 775],[108, 727, 181, 836];woman:[106, 561, 308, 833],[312, 500, 397, 638];tree:[000, 522, 327, 930];footwear:[235, 780, 300, 852],[270, 772, 354, 838],[597, 691, 733, 786],[687, 658, 837, 780].') 2023-11-03 12:42:11.962 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:42:11.962 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[518, 465, 589, 599] and track its trajectory.Stick to the Frame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'This is a personFrame1:[518, 465, 589, 599];Frame2:[516, 508, 563, 660];Frame3:[403, 474, 440, 598];Frame4:[346, 454, 367, 557].') 2023-11-03 12:42:28.301 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:42:28.302 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Desk:[002, 751, 909, 1000],[681, 261, 806, 544];Plate:[002, 361, 122, 446];Satchel:[853, 296, 999, 706];Bakset:[007, 669, 179, 830];Person:[111, 002, 512, 523],[080, 101, 156, 356],[328, 002, 518, 369],[875, 309, 1000, 1000].') 2023-11-03 12:42:32.019 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:42:32.020 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect bucket and Chair in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Barrel:[808, 672, 871, 801];Chair:[870, 847, 999, 1000].') 2023-11-03 12:43:09.879 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:43:09.879 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Person:[463, 230, 844, 929].') 2023-11-03 12:43:10.644 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:43:10.644 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'human hand:[756, 502, 1000, 938];laptop:[714, 053, 996, 994];computer monitor:[151, 240, 746, 823],[724, 061, 999, 571].') 2023-11-03 12:43:15.542 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:43:15.542 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'suit:[868, 281, 980, 895];human body:[048, 268, 336, 617],[282, 174, 384, 592],[515, 287, 791, 990],[863, 235, 964, 873];woman:[563, 255, 811, 970],[028, 287, 310, 639];man:[860, 206, 981, 890],[276, 175, 385, 605];dress:[021, 342, 294, 643],[510, 330, 801, 987];coat:[280, 228, 369, 380],[875, 310, 963, 596];human hair:[653, 282, 706, 366],[897, 234, 960, 320];fashion accessory:[865, 842, 936, 885];footwear:[860, 838, 937, 881].') 2023-11-03 12:43:51.715 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:43:51.716 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[112, 840, 165, 914],carFrame1:[216, 820, 281, 903],carFrame1:[420, 803, 468, 861] in this video clip.For clarity in your response, always adopt the classFrame t:[xmin,ymin,xmax,ymax] format.', 'carFrame1:[112, 840, 165, 914];Frame2:[112, 840, 165, 914],carFrame1:[216, 820, 281, 903];Frame2:[211, 820, 273, 906],carFrame1:[420, 803, 468, 861];Frame2:[421, 801, 465, 860].') 2023-11-03 12:43:56.201 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:43:56.202 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[244, 092, 787, 992] Woman tourist in [362, 088, 698, 416] a hat uses a camera, takes a picture. isolate on [003, 004, 996, 996] gray background.') 2023-11-03 12:44:29.901 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:44:29.902 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Person:[109, 317, 249, 567],[064, 303, 253, 548],[287, 124, 371, 411],[287, 410, 375, 676],[566, 399, 778, 649],[744, 295, 932, 586],[677, 254, 794, 416],[574, 641, 769, 863],[742, 585, 934, 870],[682, 475, 794, 582];Hat:[212, 313, 253, 405],[708, 398, 776, 478].') 2023-11-03 12:44:34.570 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:44:34.570 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the turn over then run away from us in this video clip?Use the specified TrackiFrame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'There are 4 turn over then run away from us.Track1frame:1:[372, 234, 542, 641];frame:2:[368, 243, 536, 678];frame:3:[392, 249, 534, 657],Track2frame:1:[254, 207, 402, 789];frame:2:[276, 334, 398, 788];frame:3:[328, 287, 482, 790],Track3frame:1:[496, 089, 678, 923];frame:2:[489, 089, 673, 920];frame:3:[489, 086, 670, 918],Track4frame:1:[744, 281, 999, 866];frame:2:[739, 281, 999, 860];frame:3:[739, 276, 999, 857].') 2023-11-03 12:44:51.837 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:44:51.838 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[557, 562, 665, 694],carFrame1:[515, 539, 573, 623],carFrame1:[190, 545, 300, 669],carFrame1:[000, 532, 199, 789],carFrame1:[362, 479, 482, 696] in this video clip.The prescribed format for your answer is classFrame t:[xmin,ymin,xmax,ymax]. Please follow it closely.', 'carFrame1:[557, 562, 665, 694];Frame2:[573, 576, 708, 729],carFrame1:[515, 539, 573, 623];Frame2:[524, 546, 583, 639],carFrame1:[190, 545, 300, 669];Frame2:[148, 552, 281, 699],carFrame1:[000, 532, 199, 789];Frame2:[000, 543, 137, 882],carFrame1:[362, 479, 482, 696];Frame2:[379, 496, 484, 683].') 2023-11-03 12:45:07.994 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:45:07.994 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[207, 040, 614, 944] Richmond Virginia blogger, Lauren Dix, tries on [289, 436, 596, 834] seven styles of Everlane jeans and compares each style in her Everlane jeans review!') 2023-11-03 12:45:24.073 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:45:24.073 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'woman:[498, 268, 753, 985];man:[683, 345, 976, 991],[000, 119, 245, 998],[000, 035, 495, 998],[510, 318, 533, 504],[684, 249, 765, 715],[820, 349, 866, 596],[862, 318, 910, 504],[868, 329, 999, 713],[905, 286, 974, 492];fedora:[252, 039, 566, 260];human face:[083, 269, 223, 538],[363, 236, 493, 427],[608, 399, 693, 624],[691, 306, 751, 412],[714, 384, 836, 683],[832, 388, 860, 460],[870, 345, 908, 433],[974, 358, 999, 486].') 2023-11-03 12:45:28.148 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:45:28.148 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[105, 551, 624, 880] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a crocodile sitting on the groundFrame1:[105, 551, 624, 880];Frame2:[105, 551, 624, 880];Frame3:[105, 551, 624, 884];Frame4:[105, 551, 621, 884].') 2023-11-03 12:45:29.994 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:45:29.994 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'human body:[151, 112, 609, 918];woman:[144, 125, 648, 977];human hair:[309, 153, 524, 282];footwear:[216, 882, 342, 963];human leg:[202, 613, 450, 967],[394, 586, 679, 999];girl:[210, 151, 686, 997];handbag:[166, 608, 387, 800].') 2023-11-03 12:45:48.956 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:45:48.956 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Don't use the sharpness of your speech on [489, 152, 827, 435] the mother who ...") 2023-11-03 12:46:25.125 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:46:25.125 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '2000 Pocket Pillow Top Mattress Divan 2L') 2023-11-03 12:46:34.260 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:46:34.261 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[330, 028, 517, 998] [538, 107, 799, 726] [001, 114, 205, 995] [844, 351, 998, 995] [135, 128, 448, 995] Demonstrators joining Young Socialists hold up [452, 305, 752, 718] [875, 534, 997, 736] [136, 571, 407, 926] placards outside the Department for Education, London as they protest the handling of exam results, university provision and bleak employment prospects.') 2023-11-03 12:47:14.537 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 12:47:14.538 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Bench:[288, 684, 433, 811];Person:[588, 485, 711, 818],[299, 633, 378, 810],[378, 618, 449, 803],[152, 178, 196, 252],[565, 274, 611, 470],[502, 080, 935, 669],[684, 144, 1000, 666];Street Lights:[473, 001, 517, 252];Dolphin:[245, 377, 740, 756],[531, 743, 679, 855].') 2023-11-03 12:47:27.882 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:47:27.883 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '2019 Porsche Macan S review by Practical Motoring') 2023-11-03 12:48:36.909 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:48:36.910 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'An 81-pound octahedrite meteorite [321, 087, 742, 477] for sale in the gift shop [002, 004, 991, 992] for $12,000') 2023-11-03 12:49:12.290 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:49:12.290 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Baseball Bat:[081, 697, 182, 868];Baseball Glove:[421, 541, 457, 623],[896, 535, 949, 597];Person:[262, 408, 458, 853],[431, 275, 582, 858],[703, 441, 950, 823],[316, 395, 376, 823];Sneakers:[262, 782, 311, 844],[323, 704, 368, 780],[703, 732, 736, 821],[730, 756, 781, 820],[514, 804, 536, 859];Belt:[331, 545, 370, 578];Hat:[526, 270, 580, 334],[767, 441, 802, 486],[160, 517, 199, 549],[054, 065, 091, 103],[235, 149, 264, 188],[229, 251, 259, 285],[435, 191, 460, 232],[502, 200, 533, 238],[702, 370, 728, 405];Gloves:[427, 515, 454, 580],[537, 467, 561, 514].') 2023-11-03 12:49:27.704 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:49:27.704 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[002, 503, 334, 777];Dog:[193, 133, 625, 1000].') 2023-11-03 12:49:31.846 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:49:31.846 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'plant:[000, 424, 038, 585],[503, 091, 786, 355],[770, 400, 879, 523],[863, 447, 906, 516],[921, 450, 996, 526];train:[091, 119, 911, 939];wheel:[138, 654, 166, 737],[194, 649, 240, 764],[291, 674, 346, 810],[371, 662, 458, 860],[467, 694, 576, 916],[655, 789, 717, 884].') 2023-11-03 12:49:36.274 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:49:36.274 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[332, 387, 529, 812] and track its trajectory.Any trajectory details should be arranged according to the Frame t:[xmin,ymin,xmax,ymax] format for clarity.', 'This is a binturongFrame1:[332, 387, 529, 812];Frame2:[282, 406, 492, 795];Frame3:[242, 408, 515, 762];Frame4:[228, 397, 519, 750];Frame5:[216, 410, 519, 745].') 2023-11-03 12:49:48.769 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:49:48.770 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'man:[076, 590, 423, 989],[000, 344, 271, 659],[264, 323, 445, 675],[334, 310, 688, 999],[415, 105, 552, 299],[491, 156, 795, 471],[530, 061, 645, 206],[580, 127, 678, 271],[717, 295, 999, 999],[734, 095, 832, 261],[805, 051, 942, 224];boy:[000, 343, 268, 658],[074, 321, 445, 740],[336, 309, 689, 999],[490, 160, 792, 473],[707, 296, 999, 999];woman:[360, 225, 473, 405],[418, 209, 595, 424],[732, 095, 819, 290];clothing:[000, 818, 385, 999],[000, 109, 965, 730],[338, 468, 698, 999],[700, 446, 999, 995];human face:[143, 372, 195, 467],[170, 653, 274, 871],[333, 353, 396, 458],[403, 236, 440, 309],[480, 119, 510, 185],[520, 220, 558, 293],[539, 331, 608, 455],[689, 185, 735, 267],[752, 102, 787, 169],[811, 076, 836, 146],[856, 327, 942, 475].') 2023-11-03 12:50:00.503 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:50:00.504 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'woman:[605, 317, 836, 965],[000, 233, 093, 999],[000, 252, 164, 996],[120, 219, 364, 999],[274, 354, 536, 999],[315, 250, 398, 360],[387, 290, 638, 666],[402, 285, 638, 695],[476, 276, 654, 972],[588, 298, 631, 405],[683, 313, 737, 385],[745, 261, 999, 999],[747, 264, 811, 385],[769, 264, 868, 475],[921, 283, 999, 441];girl:[000, 220, 110, 999],[286, 351, 537, 999],[398, 293, 658, 999],[506, 268, 642, 908],[551, 351, 832, 999];clothing:[000, 834, 035, 945],[000, 490, 057, 908],[000, 354, 151, 860],[163, 336, 350, 779],[317, 486, 532, 892],[325, 402, 587, 823],[482, 381, 666, 729],[580, 338, 629, 414],[600, 441, 805, 927],[742, 388, 999, 870],[772, 359, 862, 421],[930, 343, 999, 439],[973, 424, 999, 800].') 2023-11-03 12:50:06.493 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:50:06.493 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[079, 327, 166, 546] [747, 199, 798, 347] [923, 564, 984, 745] [098, 633, 221, 897] [175, 296, 248, 522] [351, 392, 433, 585] Croc skin nails are trending and you should get it at your next manicure') 2023-11-03 12:50:23.753 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:50:23.754 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[354, 186, 665, 512] and track its trajectory.For the trajectories included in the answer, please use the format Frame t:[xmin,ymin,xmax,ymax].', 'This is a icebreakerFrame1:[354, 186, 665, 512];Frame2:[331, 171, 649, 512];Frame3:[323, 164, 645, 515];Frame4:[320, 156, 651, 511];Frame5:[319, 152, 657, 507].') 2023-11-03 12:50:29.045 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:50:29.046 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Oven,Plate and Extractor in this image.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Oven:[579, 441, 723, 526];Plate:[224, 641, 376, 718],[230, 576, 378, 642],[390, 585, 436, 629],[523, 672, 652, 770];Extractor:[577, 320, 728, 426].') 2023-11-03 12:50:49.995 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:50:49.995 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[422, 458, 448, 502] and track its trajectory.To maintain consistency, ensure that trajectories in your answer match the Frame t:[xmin,ymin,xmax,ymax] setup.', 'This is a handFrame1:[422, 458, 448, 502];Frame2:[423, 437, 454, 478];Frame3:[257, 343, 285, 384];Frame4:[253, 310, 283, 351].') 2023-11-03 12:51:01.420 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 12:51:01.421 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[282, 507, 303, 562],[300, 506, 325, 563],[365, 402, 478, 637];Paddle:[409, 566, 971, 726],[000, 560, 406, 641],[302, 539, 367, 568],[197, 544, 283, 568];Boat:[104, 642, 608, 721].') 2023-11-03 12:51:06.986 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:51:06.987 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[204, 572, 237, 661],personFrame1:[455, 502, 489, 590],personFrame1:[334, 405, 368, 487] in this video clip.The prescribed format for your answer is classFrame t:[xmin,ymin,xmax,ymax]. Please follow it closely.', 'personFrame1[204, 572, 237, 661];Frame2:No object,personFrame1:[455, 502, 489, 590];Frame2:[429, 491, 463, 575],personFrame1:[334, 405, 368, 487];Frame2:[309, 401, 342, 479].') 2023-11-03 12:51:34.836 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:51:34.836 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Cabinet:[001, 107, 136, 426],[131, 126, 250, 414],[246, 116, 326, 411],[327, 087, 435, 418],[430, 037, 641, 242],[639, 001, 917, 450],[003, 596, 226, 958],[226, 606, 292, 932],[293, 620, 339, 972],[565, 645, 895, 1000];Power outlet:[142, 436, 179, 485];Bottle:[774, 453, 809, 533],[078, 167, 110, 232];Gas stove:[333, 587, 657, 699];Oven:[332, 641, 561, 1000];Tea pot:[442, 489, 500, 599];Microwave:[406, 221, 640, 426];Basin:[689, 175, 776, 218],[749, 084, 811, 144],[001, 899, 037, 950];Blender:[222, 433, 277, 557];Rice Cooker:[016, 451, 096, 586];Kettle:[055, 143, 086, 229];Toaster:[657, 548, 735, 650];Bakset:[538, 134, 627, 182].') 2023-11-03 12:52:48.305 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:52:48.306 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'boat:[286, 066, 938, 929],[006, 759, 076, 894],[103, 586, 189, 818],[216, 792, 286, 813],[334, 825, 409, 880],[433, 794, 506, 841],[482, 848, 561, 900],[563, 807, 701, 874],[618, 874, 790, 956],[863, 811, 961, 872].') 2023-11-03 12:53:14.377 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:53:14.377 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'How to become a Millionaire [409, 296, 607, 637] on the Stock Exchange - Practical step-by-step [004, 630, 998, 999] guide') 2023-11-03 12:53:48.138 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:53:48.138 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A behind [002, 098, 307, 618] the head shot of Steven looking at [479, 094, 825, 375] a laptop screen of code and his daughter, [237, 493, 504, 963] Kitty, sitting on his lap looking at a smaller screen of Peppa pig') 2023-11-03 12:54:15.267 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:54:15.267 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[398, 215, 535, 514] Wealthy adults and [596, 389, 708, 512] a child wearing [602, 433, 671, 494] sunglasses who smiles in the back of an open-top car remaining stationary at traffic lights in Sloane Square, on 14th May 2017, in London, England.') 2023-11-03 12:54:19.836 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:54:19.836 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect person and desk in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'person:[051, 307, 168, 495],[135, 264, 220, 415],[161, 335, 292, 511],[256, 255, 346, 405],[308, 279, 460, 825],[405, 238, 485, 412],[442, 310, 606, 790],[463, 225, 525, 343],[494, 260, 579, 385],[562, 225, 638, 356],[606, 220, 643, 298],[724, 251, 833, 540],[739, 327, 999, 863],[798, 236, 853, 422],[835, 136, 928, 362],[880, 243, 992, 417];desk:[000, 492, 400, 928].') 2023-11-03 12:54:22.569 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:54:22.569 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Chair:[103, 574, 231, 854];Frame:[200, 412, 275, 511];Desk:[661, 618, 920, 999],[199, 632, 275, 810];Couch:[600, 581, 742, 757];Cabinet:[001, 163, 087, 703];Storage box:[272, 228, 493, 999];Person:[462, 284, 609, 999];Pot:[814, 156, 880, 249].') 2023-11-03 12:55:06.054 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 12:55:06.054 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'angled front shot of sliced corned beef and cabbage [001, 152, 210, 437] [581, 097, 994, 336] on a white serving platter [000, 108, 979, 839]') 2023-11-03 12:55:07.981 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:55:07.982 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Boat:[686, 767, 1000, 858],[358, 749, 714, 807],[252, 720, 549, 772];Street Lights:[642, 217, 687, 376],[325, 202, 359, 423],[145, 324, 185, 401],[664, 324, 712, 462];Crane:[363, 145, 435, 308];Paddle:[963, 805, 999, 855];Person:[434, 671, 524, 766],[614, 678, 690, 785].') 2023-11-03 12:55:18.384 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:55:18.385 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[775, 544, 899, 790],personFrame1:[723, 548, 863, 750],personFrame1:[204, 548, 306, 770] in this video clip.The prescribed format for your answer is classFrame t:[xmin,ymin,xmax,ymax]. Please follow it closely.', 'personFrame1:[775, 544, 899, 790];Frame2:[760, 533, 898, 791],personFrame1:[723, 548, 863, 750];Frame2:[734, 600, 871, 751],personFrame1:[204, 548, 306, 770];Frame2:[194, 570, 331, 769].') 2023-11-03 12:55:31.703 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:55:31.703 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'tree:[000, 316, 967, 760];flower:[000, 316, 971, 689].') 2023-11-03 12:55:59.179 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:55:59.179 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Flag and Person in this image.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Flag:[279, 480, 303, 569];Person:[000, 370, 116, 999],[073, 421, 209, 999],[283, 612, 311, 655],[335, 478, 402, 702],[422, 594, 450, 640],[516, 416, 613, 758],[625, 442, 680, 739].') 2023-11-03 12:56:10.279 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:56:10.280 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Seven Redhawk orchestra students [742, 348, 921, 600] [906, 381, 997, 594] [618, 521, 928, 995] [461, 401, 631, 860] are among the highest-ranking musicians across the state and have qualified to perform in a Texas All-State ensemble after auditioning through an online format.') 2023-11-03 12:56:21.163 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:56:21.163 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[481, 740, 552, 812],[446, 729, 503, 818],[349, 763, 410, 857],[236, 786, 353, 863],[263, 811, 338, 930],[184, 817, 243, 929];Backpack:[334, 838, 396, 937].') 2023-11-03 12:57:21.334 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:57:21.334 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "[260, 052, 586, 995] Bowe has urged [664, 548, 872, 996] his Ulster team-mates to turn things around and salvage the province's season.") 2023-11-03 12:57:22.170 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 12:57:22.171 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Glasses:[330, 062, 538, 112];Microphone:[234, 113, 366, 192];Speaker:[709, 728, 1000, 1000],[672, 492, 1000, 729];Guitar:[040, 379, 999, 900];Person:[015, 017, 900, 999].') 2023-11-03 12:57:37.297 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:57:37.297 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'tree:[000, 000, 999, 431].') 2023-11-03 12:58:09.723 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 12:58:09.724 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[007, 417, 636, 997] The trail to Landscape Arch with several groups of [583, 392, 614, 467] [351, 496, 384, 581] [411, 465, 445, 546] [179, 767, 228, 913] [238, 795, 299, 986] [302, 797, 364, 988] people') 2023-11-03 13:00:18.903 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:00:18.904 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A side view of a cloud egg [284, 355, 769, 581] on toast [223, 474, 778, 667], the popped yolk [390, 399, 594, 556] is dribbling down the side.') 2023-11-03 13:00:22.208 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:00:22.208 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the cows runnting away?When detailing trajectories in your response, adhere to the TrackiFrame t:[xmin,ymin,xmax,ymax] format.', 'There are 8 cows runnting away.Track1frame:1:[650, 104, 794, 288];frame:2:[652, 098, 784, 288];frame:3:[674, 095, 751, 158],Track2frame:1:[233, 212, 279, 339];frame:2:[229, 241, 248, 352],Track3frame:1:[413, 123, 602, 349];frame:2:[422, 155, 634, 374];frame:3:[610, 114, 792, 269],Track4frame:1:[117, 187, 246, 365];frame:2:[139, 187, 253, 365];frame:3:[222, 193, 323, 361],Track5frame:1:[058, 184, 136, 368];frame:2:[044, 187, 159, 365];frame:3:[154, 212, 222, 365],Track6frame:1:[001, 155, 056, 288];frame:2:[001, 161, 066, 393];frame:3:[016, 200, 189, 415],Track7frame:1:[402, 165, 448, 269];frame:2:[356, 177, 430, 298];frame:3:[321, 187, 409, 295],Track8frame:1:[878, 142, 998, 438];frame:2:[803, 142, 955, 438];frame:3:[610, 184, 711, 434].') 2023-11-03 13:00:32.095 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:00:32.096 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[537, 357, 600, 448],carFrame1:[571, 370, 622, 472],carFrame1:[631, 319, 778, 565],carFrame1:[715, 188, 1000, 814],carFrame1:[598, 369, 659, 507],carFrame1:[367, 345, 419, 454],carFrame1:[227, 355, 344, 528] in this video clip.Your reply should be in alignment with the classFrame t:[xmin,ymin,xmax,ymax] structure.', 'carFrame1:[537, 357, 600, 448];Frame2:[546, 363, 618, 474],carFrame1:[571, 370, 622, 472];Frame2:[589, 378, 677, 505],carFrame1:[631, 319, 778, 565];Frame2:[716, 271, 1000, 740],carFrame1[715, 188, 1000, 814];Frame2:No object,carFrame1:[598, 369, 659, 507];Frame2:[636, 377, 783, 574],carFrame1:[367, 345, 419, 454];Frame2:[315, 351, 402, 494],carFrame1:[227, 355, 344, 528];Frame2:[000, 331, 255, 696].') 2023-11-03 13:00:34.919 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:00:34.919 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the giraffe staying on a field next to another giraffe?Stick to the TrackiFrame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'There is one giraffe staying on a field next to another giraffe.Track1frame:1:[477, 202, 688, 998];frame:2:[415, 220, 610, 952];frame:3:[426, 279, 571, 836].') 2023-11-03 13:00:39.029 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:00:39.029 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Cabinet:[087, 001, 439, 299],[433, 052, 606, 466],[603, 134, 699, 463],[699, 185, 767, 462];Towel:[588, 798, 653, 971],[260, 920, 455, 999];Sink:[502, 614, 810, 658];Faucet:[549, 585, 634, 634];Microwave:[089, 246, 464, 487];Gas stove:[094, 588, 634, 849];Oven:[472, 780, 645, 999].') 2023-11-03 13:01:09.636 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:01:09.637 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Person:[849, 001, 931, 169],[711, 001, 777, 174],[658, 001, 719, 180],[176, 001, 309, 309].') 2023-11-03 13:02:33.585 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:02:33.586 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me what is thisFrame1:[452, 504, 582, 657] and track its trajectory.When detailing trajectories in your response, adhere to the Frame t:[xmin,ymin,xmax,ymax] format.', 'This is a border collieFrame1:[452, 504, 582, 657];Frame2:[435, 480, 563, 637];Frame3:[393, 477, 534, 618].') 2023-11-03 13:03:01.668 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:03:01.668 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "[840, 556, 997, 798] [443, 552, 650, 749] dining room tables and [810, 553, 940, 775] [527, 554, 647, 759] [389, 551, 518, 746] chairs at the Breakthrough Men's Center") 2023-11-03 13:03:27.431 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:03:27.431 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[191, 187, 619, 968] and track its trajectory.Use the specified Frame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'This is a coin being placed on a planeFrame1:[191, 187, 619, 968];Frame2:[188, 187, 621, 968];Frame3:[188, 190, 619, 965];Frame4:[188, 184, 616, 962];Frame5:[190, 187, 617, 965].') 2023-11-03 13:04:12.216 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:04:12.216 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Person:[146, 543, 480, 998],[579, 191, 835, 995],[877, 715, 999, 999];Necklace:[678, 464, 757, 517];Glasses:[228, 635, 332, 685];Whiteboard:[001, 090, 999, 994];Bottle:[648, 812, 705, 988];Cup:[452, 944, 499, 999];Laptop:[700, 845, 877, 971].') 2023-11-03 13:04:20.852 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:04:20.852 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person and Van in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Person:[966, 125, 1000, 396],[718, 144, 733, 163],[743, 157, 771, 192],[410, 100, 421, 125],[362, 093, 375, 121],[095, 078, 139, 152];Van:[000, 000, 471, 451],[498, 099, 624, 137],[054, 073, 140, 120].') 2023-11-03 13:04:25.705 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:04:25.705 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Lamp:[491, 314, 681, 825];Desk:[384, 778, 628, 1000];Person:[389, 226, 452, 335],[274, 278, 305, 423],[292, 414, 333, 550],[498, 143, 537, 311],[507, 020, 535, 105],[460, 167, 494, 252],[432, 414, 463, 476],[608, 256, 820, 1000],[098, 219, 400, 999],[258, 463, 279, 507];Leather Shoes:[221, 240, 356, 287];Cup:[465, 759, 495, 814];TV:[328, 471, 474, 695];Keyboard:[379, 803, 488, 869];Head Phone:[535, 815, 615, 894];Mouse:[510, 827, 534, 867].') 2023-11-03 13:04:31.352 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:04:31.352 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'pastry:[023, 248, 195, 426],[071, 515, 207, 728],[185, 243, 328, 430],[196, 526, 342, 723],[335, 315, 433, 428],[356, 526, 451, 733],[471, 515, 582, 718],[471, 250, 591, 436],[590, 496, 715, 703],[591, 240, 731, 421],[687, 490, 783, 706],[703, 241, 883, 416],[742, 490, 835, 695],[847, 230, 996, 411].') 2023-11-03 13:05:19.651 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:05:19.652 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Person:[114, 062, 424, 903];Sneakers:[119, 813, 179, 899];Horse:[077, 084, 906, 1000].') 2023-11-03 13:05:48.866 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:05:48.866 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'person:[010, 539, 225, 1000],[000, 030, 196, 474],[235, 619, 373, 999],[235, 175, 376, 570],[245, 000, 378, 155],[381, 000, 498, 230],[394, 259, 504, 623],[417, 670, 535, 999],[518, 352, 625, 681],[527, 723, 648, 999],[556, 054, 655, 356],[604, 424, 701, 743],[635, 758, 728, 999],[670, 191, 756, 456],[693, 484, 776, 765],[717, 789, 803, 999],[754, 268, 809, 508],[773, 533, 831, 809],[788, 829, 853, 999],[808, 341, 851, 537],[813, 585, 871, 835],[831, 842, 896, 999],[835, 407, 875, 575],[854, 621, 901, 840],[861, 445, 908, 621],[875, 862, 922, 999],[894, 654, 928, 864],[901, 529, 945, 677],[909, 897, 944, 999],[928, 716, 948, 881],[931, 570, 961, 696],[942, 923, 959, 999],[942, 727, 963, 882],[964, 802, 976, 888],[964, 928, 985, 999],[970, 800, 996, 911];sculpture:[005, 044, 189, 465],[015, 549, 188, 999],[236, 161, 380, 560],[248, 620, 382, 999],[262, 000, 372, 123],[381, 000, 498, 235],[391, 251, 516, 607],[423, 676, 530, 999],[516, 348, 623, 682],[520, 712, 641, 999],[556, 048, 650, 359],[615, 419, 705, 722],[644, 757, 727, 999],[670, 192, 753, 445],[696, 488, 780, 757],[719, 781, 805, 999],[749, 258, 810, 474],[765, 527, 832, 764],[787, 820, 853, 999],[795, 324, 851, 520],[820, 574, 875, 803],[838, 425, 876, 573],[838, 838, 890, 999],[855, 633, 910, 825],[864, 450, 908, 599],[875, 874, 918, 999],[891, 662, 923, 845],[892, 501, 927, 632],[910, 885, 941, 999],[919, 574, 946, 677],[924, 703, 943, 858],[936, 724, 960, 862],[938, 911, 964, 999],[941, 628, 963, 695],[954, 735, 969, 866],[958, 926, 995, 999],[969, 801, 973, 883],[975, 810, 992, 898],[990, 825, 999, 893];human face:[001, 000, 998, 999].') 2023-11-03 13:05:55.274 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:05:55.274 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Person:[526, 485, 566, 518],[247, 476, 298, 526];Cow:[002, 562, 063, 634],[002, 576, 221, 737],[040, 664, 302, 862],[287, 535, 911, 942],[916, 457, 999, 536],[796, 466, 862, 534],[666, 497, 806, 532],[391, 509, 503, 543],[002, 497, 081, 523],[019, 500, 164, 526],[047, 562, 179, 612],[808, 471, 915, 533];TV:[741, 348, 844, 418],[490, 374, 570, 430],[158, 404, 206, 450],[043, 416, 081, 455].') 2023-11-03 13:07:28.854 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:07:28.855 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'table:[856, 680, 993, 996];person:[034, 545, 133, 668],[038, 397, 970, 925],[117, 496, 212, 719],[184, 457, 244, 590],[198, 419, 253, 490],[204, 494, 289, 721],[263, 413, 311, 494],[268, 461, 332, 636],[300, 478, 368, 727],[308, 419, 349, 500],[338, 501, 431, 921],[351, 411, 391, 476],[386, 463, 437, 878],[426, 483, 514, 915],[439, 401, 493, 529],[490, 411, 535, 481],[493, 444, 543, 549],[496, 481, 585, 915],[532, 428, 573, 545],[563, 490, 620, 890],[609, 494, 682, 906],[627, 428, 675, 509],[659, 399, 691, 469],[661, 474, 749, 921],[696, 421, 731, 525],[711, 404, 738, 470],[718, 470, 803, 915],[755, 423, 787, 490],[782, 411, 815, 500],[788, 496, 888, 913],[830, 397, 858, 488],[854, 397, 909, 490],[855, 461, 953, 653],[911, 428, 960, 529];cabinetry:[087, 809, 315, 977];desk:[000, 502, 215, 575],[000, 966, 935, 999],[001, 575, 131, 727],[003, 735, 348, 974],[859, 714, 999, 993],[944, 564, 999, 639].') 2023-11-03 13:07:29.761 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:07:29.761 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[487, 577, 553, 737],personFrame1:[269, 534, 312, 693],personFrame1:[663, 427, 714, 569],personFrame1:[342, 595, 403, 743],personFrame1:[644, 486, 695, 648],personFrame1:[631, 404, 666, 537],personFrame1:[792, 475, 830, 650] in this video clip.Use only the categoryFrame t:[xmin,ymin,xmax,ymax] format to structure your answer.', 'personFrame1:[487, 577, 553, 737];Frame2:[464, 575, 512, 741],personFrame1:[269, 534, 312, 693];Frame2:[220, 538, 267, 701],personFrame1:[663, 427, 714, 569];Frame2:[617, 440, 667, 579],personFrame1:[342, 595, 403, 743];Frame2:[301, 594, 357, 751],personFrame1:[644, 486, 695, 648];Frame2:[613, 476, 653, 634],personFrame1:[631, 404, 666, 537];Frame2:[582, 412, 629, 531],personFrame1:[792, 475, 830, 650];Frame2:[743, 475, 782, 647].') 2023-11-03 13:07:56.849 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:07:56.850 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Sneakers and SUV in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Sneakers:[375, 636, 414, 672],[341, 576, 372, 625];SUV:[083, 295, 208, 364].') 2023-11-03 13:08:00.497 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:08:00.497 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'DEE TOTE: Easy elegance and a simple design offer a big impact with this large tote in soft camel colored vegan leather with gold accents and flap closure. See rewards section [578, 147, 803, 241] for more details.') 2023-11-03 13:08:15.950 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:08:15.951 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track motorcycleFrame1:[062, 446, 170, 533],carFrame1:[490, 473, 894, 856],carFrame1:[746, 427, 999, 703] in this video clip.Please strictly follow the format categoryFrame t:[xmin,ymin,xmax,ymax] to answer the question.', 'motorcycleFrame1:[062, 446, 170, 533];Frame2:[034, 441, 151, 530],carFrame1:[490, 473, 894, 856];Frame2:[482, 467, 855, 856],carFrame1:[746, 427, 999, 703];Frame2:[777, 414, 999, 748].') 2023-11-03 13:08:42.941 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 13:08:42.941 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'man:[102, 070, 826, 969];microphone:[528, 450, 807, 756];clothing:[109, 507, 721, 999];human face:[460, 222, 645, 595].') 2023-11-03 13:08:55.299 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:08:55.299 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Person:[387, 114, 772, 999];Chair:[340, 891, 561, 999].') 2023-11-03 13:09:11.322 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:09:11.323 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'flower:[000, 770, 196, 999],[131, 709, 222, 837],[170, 837, 312, 999],[205, 597, 296, 739],[240, 445, 323, 595],[302, 405, 453, 675].') 2023-11-03 13:09:38.264 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:09:38.264 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Turn a shed [006, 115, 992, 751] into a play house') 2023-11-03 13:10:43.971 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:10:43.971 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '2008 Acura MDX Sport Package with Rear DVD System') 2023-11-03 13:11:06.521 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:11:06.521 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Potted Plant:[446, 229, 531, 391],[554, 242, 740, 391],[770, 243, 926, 403];Plate:[073, 276, 228, 362];Person:[001, 453, 305, 764],[001, 665, 324, 999],[001, 453, 305, 764],[001, 665, 324, 999],[375, 551, 640, 996],[610, 531, 798, 996],[791, 481, 922, 802],[334, 450, 496, 719],[485, 437, 620, 623];Belt:[248, 657, 308, 730];Car:[244, 729, 297, 850];Tie:[051, 073, 158, 208];Frame:[446, 229, 531, 391],[554, 242, 740, 391],[770, 243, 926, 403];Cabinet:[073, 276, 228, 362];Desk:[037, 490, 265, 565],[885, 540, 986, 603];Chair:[244, 488, 314, 622],[315, 493, 392, 606],[881, 547, 971, 642],[718, 532, 782, 623],[814, 607, 964, 982],[926, 655, 997, 1000].') 2023-11-03 13:11:10.995 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:11:10.995 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Street Lights:[706, 032, 774, 591],[649, 196, 699, 607];SUV:[797, 569, 890, 632];Traffic cone:[641, 641, 676, 758],[712, 668, 771, 803];Truck:[454, 539, 488, 579],[564, 538, 599, 575],[290, 502, 319, 546];Car:[000, 601, 129, 1000];Bus:[160, 520, 236, 581];Chair:[475, 688, 533, 732],[337, 676, 383, 730],[573, 674, 598, 710].') 2023-11-03 13:11:27.231 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:11:27.231 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A close up of a seed starting kit with herb seeds, planting trays, soil, and plant markers. To the bottom right of the frame is a black circular logo [713, 689, 993, 990] and text.') 2023-11-03 13:11:31.776 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 13:11:31.776 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect woman,girl,footwear and turtle in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'woman:[132, 356, 310, 813];girl:[140, 366, 325, 834];footwear:[160, 761, 197, 830].') 2023-11-03 13:11:46.462 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:11:46.462 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'flying over [304, 208, 948, 998] a traditional windmill on a farm with [004, 005, 986, 997] dry red dirt field, australia, aerial view - [729, 284, 946, 492] turbine stock videos & [744, 350, 908, 474] royalty-free footage') 2023-11-03 13:11:51.583 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:11:51.583 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'woman:[106, 216, 853, 816];dress:[000, 376, 825, 807];girl:[001, 240, 995, 818];footwear:[505, 751, 618, 816],[811, 739, 999, 828].') 2023-11-03 13:12:00.662 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:12:00.663 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Peter Dutton [141, 095, 927, 991] - who was tipped to potentially be one of the biggest Coalition scalps of the election - praised Mr Morrison’s “amazing leadership” as he claimed victory. Picture: AAP') 2023-11-03 13:12:09.511 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:12:09.511 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Bed,Towel,Plate,Person and Potted Plant in this image.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Bed:[025, 701, 444, 973];Towel:[461, 481, 610, 564],[616, 374, 753, 443],[232, 386, 408, 454],[359, 318, 418, 353],[629, 292, 707, 324],[390, 214, 467, 280],[617, 257, 688, 290],[318, 910, 422, 956],[341, 852, 418, 880],[173, 767, 225, 800];Plate:[579, 423, 712, 485],[223, 871, 286, 910],[519, 346, 636, 414],[615, 208, 707, 262],[251, 441, 388, 505],[313, 376, 383, 403],[074, 845, 142, 890];Person:[514, 606, 874, 974],[589, 778, 974, 976],[437, 041, 509, 179],[388, 032, 463, 216],[318, 044, 447, 308],[209, 080, 359, 376],[151, 106, 304, 467],[024, 170, 239, 585],[635, 039, 700, 164],[679, 078, 744, 228],[708, 062, 779, 274],[711, 067, 814, 326],[713, 108, 856, 414],[746, 154, 973, 586],[936, 178, 973, 554];Potted Plant:[027, 126, 080, 218],[489, 212, 562, 305],[139, 841, 232, 953],[313, 730, 352, 795].') 2023-11-03 13:12:45.412 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:12:45.412 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Person:[545, 002, 1000, 1000],[519, 358, 732, 619],[520, 335, 597, 402],[568, 313, 624, 403],[095, 312, 266, 476],[002, 324, 059, 473],[386, 295, 529, 442];Chair:[119, 541, 344, 863],[002, 627, 190, 999];Desk:[002, 559, 324, 905].') 2023-11-03 13:12:51.637 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:12:51.637 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[523, 369, 711, 627] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a raccoonFrame1:[523, 369, 711, 627];Frame2:[514, 330, 700, 615];Frame3:[504, 304, 685, 594];Frame4:[478, 281, 670, 570].') 2023-11-03 13:12:53.416 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:12:53.417 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[644, 387, 753, 558],carFrame1:[302, 384, 627, 828],carFrame1:[616, 398, 673, 531] in this video clip.Adhere strictly to the format categoryFrame t:[xmin,ymin,xmax,ymax] when providing your answer.', 'carFrame1:[644, 387, 753, 558];Frame2:[644, 390, 753, 558],carFrame1:[302, 384, 627, 828];Frame2:[302, 384, 627, 828],carFrame1:[616, 398, 673, 531];Frame2:[616, 394, 676, 531].') 2023-11-03 13:13:51.138 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:13:51.139 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'a hand [192, 296, 620, 998] holding up a ball of pink yarn [353, 192, 599, 563]') 2023-11-03 13:14:25.295 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:14:25.295 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Tent and Lamp in this image.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Tent:[002, 406, 342, 790],[283, 226, 885, 962];Lamp:[158, 569, 189, 634].') 2023-11-03 13:14:33.074 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:14:33.074 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Soon-to-be parents [155, 205, 396, 532] [537, 344, 818, 937] look into crib [002, 394, 291, 996] with their dog [785, 478, 972, 900] [159, 448, 396, 966] during a maternity session in Pittsburgh, PA for a 2020 photography review') 2023-11-03 13:14:54.223 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:14:54.224 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Lamp:[931, 464, 961, 547],[784, 464, 810, 553],[619, 466, 647, 545],[437, 475, 467, 559],[241, 462, 274, 563],[032, 462, 066, 567].') 2023-11-03 13:15:15.297 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:15:15.298 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Person:[763, 085, 928, 1000],[661, 217, 817, 952],[559, 249, 709, 922],[459, 218, 594, 896],[295, 246, 451, 875],[042, 196, 250, 972];Book:[191, 360, 300, 497];Glasses:[117, 236, 171, 263];Tie:[140, 319, 197, 462];Leather Shoes:[833, 954, 922, 1000],[761, 922, 869, 984],[649, 868, 708, 902],[603, 885, 633, 925],[541, 853, 574, 900],[474, 846, 531, 888],[404, 837, 437, 880],[297, 833, 367, 870];High Heels:[682, 903, 762, 952],[757, 884, 795, 932];Other Shoes:[195, 925, 250, 958],[107, 942, 154, 975];Cup:[037, 606, 057, 639];Flower:[736, 275, 822, 362],[667, 374, 684, 405],[533, 354, 566, 390],[382, 386, 404, 419];Potted Plant:[392, 316, 498, 843];Desk:[000, 628, 170, 951];Air Conditioner:[000, 093, 029, 224].') 2023-11-03 13:15:24.546 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:15:24.547 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Van:[266, 218, 976, 832];Person:[396, 289, 553, 969],[164, 261, 316, 909].') 2023-11-03 13:15:26.651 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:15:26.652 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[474, 239, 925, 453] and track its trajectory.All trajectories in your reply should conform to the Frame t:[xmin,ymin,xmax,ymax] pattern.', 'This is a manateeFrame1:[474, 239, 925, 453];Frame2:[467, 242, 925, 490];Frame3:[494, 247, 961, 486];Frame4:[506, 251, 986, 482];Frame5:[502, 245, 984, 481].') 2023-11-03 13:15:35.429 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:15:35.429 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the the pair of tigers engaged in a fight. in this video clip?The TrackiFrame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'There are 2 the pair of tigers engaged in a fight..Track1frame:1:[625, 302, 715, 516];frame:2:[581, 411, 671, 561];frame:3:[548, 422, 625, 566],Track2frame:1:[584, 352, 701, 555];frame:2:[575, 466, 667, 575];frame:3:[531, 483, 614, 591].') 2023-11-03 13:15:46.496 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:15:46.496 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Frame:[920, 197, 1000, 365];Person:[336, 329, 421, 515],[278, 330, 352, 524];Bicycle:[362, 390, 630, 535];SUV:[069, 371, 258, 517],[607, 328, 891, 465];Car:[083, 340, 269, 388].') 2023-11-03 13:16:01.299 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:16:01.300 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[267, 329, 656, 675] and track its trajectory.For the trajectories included in the answer, please use the format Frame t:[xmin,ymin,xmax,ymax].', 'This is a molochFrame1:[267, 329, 656, 675];Frame2:[276, 325, 664, 658];Frame3:[257, 335, 656, 681];Frame4:[284, 331, 710, 625];Frame5:[254, 362, 684, 662].') 2023-11-03 13:17:34.787 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:17:34.787 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[283, 519, 330, 576],truckFrame1:[944, 516, 1000, 739],carFrame1:[002, 508, 112, 616] in this video clip.Adhere strictly to the format categoryFrame t:[xmin,ymin,xmax,ymax] when providing your answer.', 'carFrame1:[283, 519, 330, 576];Frame2:[044, 510, 103, 573],truckFrame1:[944, 516, 1000, 739];Frame2:[782, 294, 1000, 739],carFrame1[002, 508, 112, 616];Frame2:No object.') 2023-11-03 13:17:50.073 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:17:50.073 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'People [295, 678, 369, 822] [507, 687, 582, 821] [676, 694, 747, 822] [144, 681, 206, 803] [228, 681, 307, 835] [769, 692, 862, 853] moving along the platform [002, 740, 998, 989] to board a train [009, 527, 997, 810] at the modern station in the Ville Nouvelle, the new town, in Fes, Morocco') 2023-11-03 13:17:56.934 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:17:56.935 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Meet [248, 012, 655, 995] the president saints row download characters') 2023-11-03 13:18:18.976 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:18:18.977 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A statue [116, 066, 678, 996] of a cook wearing a face mask [194, 296, 324, 435] is pictured outside a restaurant in Vigo, on May 10, 2020 during the national lockdown to prevent the spread of the COVID-19 disease. (AFP Photo') 2023-11-03 13:18:23.568 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:18:23.568 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[201, 080, 771, 875] and track its trajectory.If you're including trajectory details in your reply, the Frame t:[xmin,ymin,xmax,ymax] format is imperative.", 'This is a pintoFrame1:[201, 080, 771, 875];Frame2:[092, 047, 725, 787];Frame3:[221, 105, 757, 754];Frame4:[621, 179, 775, 643].') 2023-11-03 13:18:27.379 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:18:27.379 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Boat:[214, 856, 368, 949],[342, 769, 501, 833],[384, 715, 560, 806],[414, 616, 562, 703],[301, 581, 430, 683],[289, 860, 399, 925];Car:[583, 930, 670, 999];Street Lights:[631, 794, 692, 955],[825, 337, 907, 700].') 2023-11-03 13:18:33.322 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:18:33.322 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[222, 235, 586, 712] [640, 333, 984, 756] [445, 292, 789, 706] [015, 238, 378, 689] Chinese little girls perform dancing at break during day six of the 16th Asian Games Guangzhou 2010 at Dongguan Gymnasium on November 18, 2010 in...') 2023-11-03 13:18:58.038 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:18:58.039 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect jeans and desk in this image.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'jeans:[499, 604, 612, 912];desk:[750, 703, 995, 890].') 2023-11-03 13:19:01.740 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:19:01.741 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Truck:[288, 333, 999, 869];Person:[548, 869, 712, 1000],[285, 906, 479, 1000],[203, 767, 402, 1000],[354, 811, 483, 943],[474, 855, 563, 1000],[551, 795, 655, 908],[615, 771, 746, 1000],[715, 807, 830, 999],[777, 597, 870, 732],[620, 447, 699, 543],[417, 277, 515, 595],[377, 309, 442, 566];Hat:[259, 768, 406, 947];Glasses:[280, 892, 479, 1000].') 2023-11-03 13:19:54.995 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:19:54.996 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Lifesaver:[775, 482, 786, 496],[795, 450, 800, 465],[317, 449, 326, 454],[283, 499, 296, 513];Boat:[090, 335, 953, 630],[047, 587, 152, 624];Street Lights:[436, 302, 459, 355],[485, 295, 510, 378],[556, 284, 585, 395],[728, 156, 752, 315];SUV:[429, 387, 449, 403];Wild Bird:[949, 944, 1000, 966],[837, 871, 850, 884],[765, 892, 784, 905],[837, 768, 846, 777],[817, 747, 831, 754],[822, 776, 831, 786],[783, 754, 790, 761],[744, 782, 754, 790],[709, 745, 719, 752],[703, 765, 715, 774],[685, 835, 700, 847],[639, 854, 657, 868],[659, 769, 666, 775],[652, 752, 658, 758],[598, 755, 605, 763],[582, 790, 595, 798],[536, 756, 552, 763],[540, 790, 547, 799],[466, 771, 479, 780],[439, 769, 456, 778],[046, 933, 067, 946],[098, 909, 118, 921],[181, 826, 199, 836],[169, 803, 183, 812],[106, 798, 117, 805],[060, 776, 077, 786],[320, 796, 338, 805],[278, 779, 293, 787],[315, 617, 329, 626];Flag:[192, 422, 207, 446],[813, 284, 828, 298],[982, 288, 985, 310],[767, 307, 775, 320],[774, 291, 783, 307].') 2023-11-03 13:20:10.581 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:20:10.581 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'palm tree:[220, 443, 275, 546],[240, 371, 295, 538],[278, 358, 334, 546],[286, 366, 455, 542],[303, 250, 363, 538],[343, 266, 394, 534],[353, 305, 408, 534],[384, 256, 434, 534],[440, 234, 563, 546],[536, 366, 630, 555],[589, 093, 999, 555],[660, 373, 773, 546].') 2023-11-03 13:20:15.229 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:20:15.229 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Protect [205, 744, 363, 955] the Employee If the hazard cannot be eliminated or reduced to an acceptable level, the employee must be protected from exposure.') 2023-11-03 13:20:27.151 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:20:27.152 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Fleurance Nature Cleansing Micellar Water with Cornflower - 400 ml') 2023-11-03 13:20:48.078 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:20:48.079 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'tree:[826, 074, 988, 433],[066, 277, 096, 355],[078, 271, 111, 376],[120, 295, 167, 382],[205, 295, 493, 365],[376, 256, 433, 376],[413, 206, 455, 376],[578, 184, 673, 373],[721, 258, 773, 368].') 2023-11-03 13:21:25.124 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:21:25.124 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me what is thisFrame1:[378, 342, 401, 384] and track its trajectory.All trajectories in your reply should conform to the Frame t:[xmin,ymin,xmax,ymax] pattern.', 'This is a person headFrame1:[378, 342, 401, 384];Frame2:[390, 422, 413, 468];Frame3:[389, 308, 411, 355].') 2023-11-03 13:21:38.850 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:21:38.851 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'skyscraper:[628, 295, 807, 880],[135, 501, 242, 832],[252, 453, 345, 762],[368, 423, 478, 783],[505, 426, 608, 794];tower:[138, 501, 231, 839],[148, 753, 202, 955],[205, 758, 257, 965],[232, 462, 332, 745],[244, 755, 311, 957],[350, 430, 474, 925],[465, 501, 504, 713],[498, 421, 617, 907],[632, 710, 692, 835],[647, 758, 748, 927],[647, 325, 797, 768];tree:[000, 531, 137, 618],[000, 610, 148, 952],[227, 665, 255, 760],[318, 901, 371, 970],[378, 899, 491, 997],[454, 674, 520, 798],[480, 922, 580, 997],[531, 822, 550, 927],[555, 873, 612, 997],[595, 627, 655, 785],[595, 792, 722, 997],[707, 867, 870, 997],[775, 792, 860, 944],[780, 518, 998, 768],[868, 419, 998, 635],[878, 832, 998, 997],[902, 755, 998, 882].') 2023-11-03 13:22:45.300 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:22:45.300 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Lamp,Spoon,Plate and Fork in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Lamp:[803, 207, 857, 313],[247, 219, 293, 316],[086, 221, 113, 268];Spoon:[702, 635, 744, 679];Plate:[371, 588, 425, 621],[248, 616, 379, 702],[232, 545, 338, 606],[273, 493, 347, 537],[373, 459, 452, 494],[574, 459, 655, 502],[647, 501, 736, 550],[672, 562, 776, 628],[586, 629, 720, 726],[592, 569, 648, 604],[600, 529, 648, 559],[539, 601, 598, 641],[499, 448, 539, 485];Fork:[580, 681, 622, 741].') 2023-11-03 13:23:11.010 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:23:11.011 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[804, 602, 914, 927],personFrame1:[406, 611, 541, 933],personFrame1:[183, 602, 306, 906],personFrame1:[557, 625, 688, 962],personFrame1:[185, 593, 336, 957] in this video clip.When answering, maintain the format classFrame t:[xmin,ymin,xmax,ymax] without deviation.', 'personFrame1:[804, 602, 914, 927];Frame2:[828, 601, 890, 926],personFrame1:[406, 611, 541, 933];Frame2:[458, 632, 530, 933],personFrame1:[183, 602, 306, 906];Frame2:[219, 611, 273, 902],personFrame1:[557, 625, 688, 962];Frame2:[599, 630, 685, 963],personFrame1:[185, 593, 336, 957];Frame2:[233, 615, 314, 958].') 2023-11-03 13:23:15.309 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:23:15.309 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("I need your expertise in analyzing the image to answer 'QUESTION: Who would [416, 173, 639, 931] and [582, 082, 991, 996] consider their leader ?'. Be sure to clarify how you arrived at the answer, and provide the spatial location of objects using [x1,y1,x2,y2].", '[416, 173, 639, 931] and [582, 082, 991, 996] would consider [065, 102, 531, 999] their leader .') 2023-11-03 13:23:33.280 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:23:33.280 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Baseball Glove:[448, 507, 566, 617];Person:[341, 036, 597, 987],[001, 102, 059, 594];Hat:[398, 039, 532, 106].') 2023-11-03 13:24:03.416 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:24:03.417 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect man and tree in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'man:[788, 338, 972, 858],[000, 195, 256, 791],[070, 548, 646, 998],[288, 213, 560, 853],[460, 226, 647, 835],[631, 406, 700, 761];tree:[745, 141, 903, 331].') 2023-11-03 13:24:37.733 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:24:37.733 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[083, 105, 867, 994] A different and unexpected incline bench press') 2023-11-03 13:24:41.673 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:24:41.673 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect bicycle wheel and wheel in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'bicycle wheel:[170, 930, 202, 999],[328, 883, 470, 999],[357, 808, 510, 999],[397, 490, 414, 563],[531, 530, 560, 629],[569, 537, 595, 608],[649, 605, 693, 792],[721, 690, 758, 780],[787, 630, 799, 697],[801, 500, 836, 595],[835, 501, 858, 580],[872, 463, 883, 527];wheel:[306, 884, 468, 999],[354, 808, 505, 999],[525, 505, 564, 623],[538, 532, 557, 618],[565, 535, 593, 606],[646, 610, 699, 794],[717, 679, 758, 775],[780, 490, 811, 597],[787, 632, 803, 698],[792, 496, 803, 587],[811, 500, 828, 601],[832, 493, 856, 572],[870, 460, 884, 530].') 2023-11-03 13:25:16.148 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:25:16.148 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Laptop in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Laptop:[752, 592, 824, 691].') 2023-11-03 13:25:17.984 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:25:17.985 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Hat:[170, 306, 229, 377],[486, 157, 566, 219];Other Shoes:[525, 771, 573, 819],[063, 855, 104, 969];Person:[346, 159, 656, 806],[762, 003, 972, 482],[040, 305, 292, 968].') 2023-11-03 13:25:37.924 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 13:25:37.925 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect clothing,woman and man in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'clothing:[000, 308, 303, 999],[082, 351, 343, 983],[292, 343, 773, 999],[666, 364, 763, 616],[674, 643, 999, 999];woman:[676, 206, 1000, 980];man:[000, 000, 317, 999],[084, 006, 370, 999],[273, 076, 753, 999],[661, 182, 804, 596].') 2023-11-03 13:26:19.538 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:26:19.539 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'woman:[315, 405, 467, 759],[000, 456, 135, 980],[078, 529, 242, 999],[199, 682, 570, 999];girl:[252, 760, 328, 900],[473, 738, 595, 999],[618, 800, 803, 999],[919, 802, 999, 999];human face:[068, 483, 115, 568],[455, 725, 490, 830].') 2023-11-03 13:26:39.368 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:26:39.368 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[201, 018, 816, 934] Young beautiful blonde woman wearing [409, 337, 654, 925] swimwear holding my body my rules banner annoyed and frustrated shouting with anger, yelling crazy with anger and [665, 179, 771, 328] hand raised') 2023-11-03 13:26:52.820 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:26:52.820 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'woman:[591, 075, 931, 973],[000, 137, 615, 999],[018, 237, 462, 999];human hair:[723, 088, 900, 381],[058, 231, 180, 409],[170, 145, 439, 693];human arm:[000, 348, 446, 647],[015, 414, 112, 608],[015, 513, 270, 999],[238, 553, 322, 903],[604, 337, 711, 907],[800, 363, 937, 799];human hand:[027, 431, 090, 614],[000, 395, 062, 465],[172, 875, 275, 999],[250, 822, 312, 923],[663, 796, 710, 903];sunglasses:[223, 214, 312, 328],[722, 227, 814, 299];human body:[025, 482, 466, 999],[604, 312, 932, 999];human leg:[359, 750, 591, 994],[466, 673, 612, 999];girl:[025, 226, 450, 995];plant:[123, 000, 999, 112],[246, 000, 450, 100],[460, 410, 541, 445].') 2023-11-03 13:26:53.321 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:26:53.322 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect man,human hair and human body in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'man:[000, 398, 114, 748],[128, 361, 156, 436],[225, 397, 318, 544],[550, 305, 602, 496];human hair:[110, 460, 181, 557];human body:[000, 398, 114, 748],[000, 300, 025, 368],[099, 458, 212, 667],[130, 361, 153, 438],[228, 404, 311, 540],[289, 335, 317, 398],[304, 346, 337, 414],[341, 356, 391, 442],[418, 376, 449, 462],[444, 377, 499, 510],[490, 378, 541, 505],[539, 388, 642, 591],[552, 307, 597, 450].') 2023-11-03 13:26:54.529 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:26:54.530 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Person:[630, 353, 952, 922],[002, 427, 284, 939],[188, 292, 417, 869];Backpack:[002, 497, 197, 602];Sneakers:[716, 785, 801, 836],[842, 849, 934, 921],[190, 808, 258, 867],[115, 840, 241, 891];Hat:[218, 326, 308, 381].') 2023-11-03 13:26:59.662 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:26:59.663 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Bench:[001, 826, 094, 997],[135, 662, 662, 927],[659, 656, 820, 739],[777, 520, 933, 615];Person:[113, 237, 204, 552],[128, 343, 274, 871],[256, 346, 431, 871],[301, 334, 335, 430],[355, 277, 403, 448],[502, 395, 637, 546],[441, 354, 537, 524],[461, 262, 496, 427],[475, 247, 529, 412],[515, 235, 585, 428],[564, 336, 595, 422],[591, 338, 665, 488],[658, 331, 763, 566],[646, 274, 696, 405],[728, 281, 789, 431],[756, 348, 825, 589],[845, 317, 940, 615],[828, 348, 905, 586];Sneakers:[219, 763, 264, 816],[191, 790, 233, 869],[330, 788, 358, 846],[347, 813, 415, 869];Handbag:[409, 691, 511, 762],[504, 689, 626, 856];Wine Glass:[677, 510, 705, 595],[686, 482, 715, 581],[632, 500, 660, 585],[630, 491, 658, 576],[528, 473, 555, 564],[474, 474, 500, 564].') 2023-11-03 13:27:03.825 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:27:03.826 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Pickup Truck:[002, 454, 352, 893];Car:[332, 388, 688, 731],[382, 487, 998, 1000];Traffic cone:[404, 613, 439, 772];Chair:[809, 577, 908, 660];Person:[343, 384, 446, 836],[566, 373, 646, 591].') 2023-11-03 13:27:40.819 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 13:27:40.820 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'person:[613, 358, 737, 709],[700, 266, 864, 748];human hair:[608, 387, 713, 530];clothing:[585, 486, 755, 858],[695, 365, 858, 847];human face:[621, 402, 691, 561],[762, 306, 833, 437].') 2023-11-03 13:28:35.619 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:28:35.619 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'man:[195, 226, 388, 947],[000, 235, 038, 316],[074, 213, 192, 768],[152, 195, 278, 731],[220, 208, 276, 654],[364, 549, 623, 999],[386, 226, 456, 612],[387, 547, 630, 778],[595, 179, 676, 277],[612, 602, 868, 993],[924, 285, 939, 367];footwear:[217, 763, 313, 912],[124, 733, 189, 769],[183, 600, 213, 735],[236, 588, 285, 710],[290, 745, 344, 892],[367, 527, 425, 612],[416, 542, 445, 610],[631, 913, 698, 997];human body:[000, 255, 036, 323],[081, 282, 188, 770],[158, 267, 280, 728],[202, 341, 421, 924],[221, 251, 275, 550],[327, 291, 378, 368],[383, 264, 455, 601],[386, 564, 630, 783],[600, 187, 643, 265],[617, 677, 865, 999],[768, 655, 922, 931],[920, 295, 942, 365],[922, 293, 981, 438];woman:[068, 264, 115, 326],[312, 232, 375, 361],[704, 601, 923, 941],[923, 230, 993, 433];auto part:[000, 324, 110, 803],[406, 212, 999, 867],[455, 222, 562, 402],[465, 156, 617, 340];human arm:[076, 319, 113, 429],[129, 312, 180, 466],[200, 379, 250, 653],[228, 289, 258, 358],[363, 677, 449, 925],[367, 385, 415, 651],[380, 289, 412, 369],[556, 663, 622, 778],[566, 740, 621, 945],[599, 179, 643, 277],[611, 733, 670, 958],[936, 289, 959, 435];car:[000, 327, 116, 876],[342, 287, 396, 366],[410, 245, 999, 837].') 2023-11-03 13:28:39.052 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:28:39.053 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Heavy Truck:[001, 082, 877, 706];Chair:[560, 332, 636, 357],[374, 322, 452, 361].') 2023-11-03 13:28:49.590 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:28:49.590 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you point out the trajectory of the black vehicles in right?When detailing trajectories in your response, adhere to the TrackiFrame t:[xmin,ymin,xmax,ymax] format.', 'There is one black vehicles in right.Track1frame1:[628, 490, 783, 781];frame2:[644, 501, 845, 848];frame3:[665, 509, 929, 933];frame4:[689, 525, 1000, 1000].') 2023-11-03 13:29:43.452 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:29:43.452 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'man:[016, 094, 243, 970],[000, 116, 053, 403],[013, 045, 077, 242],[069, 015, 171, 286],[141, 055, 174, 147],[155, 017, 278, 332],[215, 039, 289, 310],[215, 111, 495, 999],[265, 014, 283, 099],[317, 008, 352, 118],[344, 028, 441, 316],[388, 087, 436, 198],[530, 068, 781, 212],[681, 088, 789, 423],[686, 153, 981, 999],[801, 116, 823, 199],[815, 072, 853, 157],[913, 075, 965, 163],[958, 269, 999, 472],[972, 075, 999, 149];dress:[445, 205, 551, 406],[490, 315, 700, 774],[720, 227, 807, 371];suit:[213, 287, 494, 758],[416, 137, 470, 228];woman:[254, 075, 309, 156],[395, 077, 475, 235],[436, 091, 563, 395],[484, 103, 710, 999],[701, 107, 823, 405],[789, 101, 819, 210],[951, 121, 999, 294];human face:[000, 055, 018, 119],[083, 160, 168, 329],[103, 036, 141, 109],[167, 034, 213, 136],[221, 063, 258, 128],[290, 154, 376, 336],[349, 058, 391, 138],[491, 129, 526, 204],[581, 152, 656, 314],[727, 106, 761, 183],[751, 130, 791, 214],[812, 205, 898, 355],[940, 182, 968, 264],[960, 132, 995, 196].') 2023-11-03 13:29:56.224 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 13:29:56.224 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'woman:[558, 274, 752, 941],[000, 201, 039, 320],[000, 156, 151, 533],[058, 174, 293, 983],[231, 253, 425, 967],[379, 276, 595, 981],[737, 359, 884, 986];man:[377, 251, 442, 388],[534, 265, 662, 936],[855, 403, 999, 985];footwear:[256, 911, 307, 975],[519, 919, 564, 990],[621, 920, 655, 998],[662, 925, 694, 998];human face:[000, 218, 020, 289],[051, 183, 100, 263],[183, 230, 241, 308],[355, 307, 400, 388],[497, 333, 537, 398],[616, 335, 666, 421].') 2023-11-03 13:30:01.903 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:30:01.904 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'black pants [325, 500, 753, 923], black sneakers [266, 915, 407, 998] [558, 908, 666, 997], a neutral sweater [418, 196, 753, 518], a black leather jacket [334, 163, 845, 547] and a large black bag [674, 172, 888, 546]') 2023-11-03 13:30:23.790 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:30:23.791 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'clothing:[031, 000, 249, 599],[114, 358, 469, 801],[253, 078, 286, 336],[282, 049, 468, 524],[439, 000, 608, 565],[468, 311, 788, 757],[610, 045, 763, 330],[763, 011, 977, 507],[787, 291, 999, 676],[969, 113, 999, 192].') 2023-11-03 13:31:02.985 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:31:02.985 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Street Lights:[039, 218, 163, 756];Desk:[126, 671, 196, 752],[292, 639, 417, 740],[554, 649, 651, 722],[674, 635, 746, 703],[805, 614, 860, 668],[916, 595, 963, 639],[868, 602, 919, 653],[520, 611, 552, 657],[600, 601, 639, 649];Chair:[131, 692, 180, 752],[301, 657, 355, 749],[346, 662, 399, 748],[388, 647, 422, 728],[497, 670, 554, 722],[535, 662, 588, 719],[590, 650, 643, 726],[645, 645, 693, 712],[673, 631, 709, 701],[709, 631, 752, 696],[794, 618, 834, 676],[797, 611, 832, 672],[826, 611, 861, 662],[223, 627, 263, 713],[172, 616, 214, 714];Person:[810, 540, 846, 633],[767, 542, 813, 656],[194, 525, 247, 628];Backpack:[416, 666, 451, 719];Satchel:[651, 696, 711, 760].') 2023-11-03 13:31:16.433 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:31:16.433 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'suit:[591, 281, 1000, 960];man:[601, 185, 973, 987],[000, 185, 434, 999];human face:[148, 238, 278, 506],[691, 209, 839, 475].') 2023-11-03 13:31:26.633 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:31:26.635 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Collage of selfies, including [510, 665, 722, 948] [738, 661, 951, 949] [508, 360, 723, 644] [508, 054, 721, 338] [278, 667, 492, 949] [745, 368, 953, 644] [044, 070, 261, 339] [047, 361, 261, 645] [277, 077, 491, 341] [065, 688, 260, 951] students and [737, 070, 940, 339] professors with [031, 035, 973, 956] a range of gender presentations. In the middle is "renga" against a blue background') 2023-11-03 13:31:41.393 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 13:31:41.393 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'tree:[000, 233, 662, 532],[732, 416, 905, 503].') 2023-11-03 13:31:43.231 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:31:43.232 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Flag,Street Lights and Boat in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Flag:[369, 527, 388, 557];Street Lights:[061, 454, 087, 587],[294, 457, 313, 584],[689, 444, 748, 770],[767, 472, 785, 614],[866, 410, 929, 898];Boat:[340, 752, 391, 810],[548, 676, 641, 742],[440, 635, 482, 665],[375, 640, 431, 667],[327, 658, 382, 688],[232, 665, 330, 705],[427, 633, 443, 660].') 2023-11-03 13:32:24.261 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:32:24.262 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A mission by spacecraft Chandrayaan-1 that helped discover the presence of water ice on [537, 005, 877, 506] the moon. (Image Source: DNAIndia') 2023-11-03 13:32:43.467 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:32:43.467 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'skyscraper:[010, 257, 131, 819],[115, 393, 161, 654],[191, 646, 275, 843],[208, 376, 286, 595],[244, 796, 290, 929],[285, 745, 394, 999],[298, 355, 382, 669],[371, 615, 411, 761],[382, 689, 464, 987],[383, 424, 426, 615],[433, 556, 504, 798],[453, 454, 537, 753],[481, 359, 530, 470],[537, 404, 560, 484],[556, 673, 616, 851],[568, 542, 614, 687],[570, 380, 609, 550],[587, 745, 730, 999],[642, 341, 721, 773],[698, 507, 758, 716],[747, 359, 778, 548],[747, 669, 893, 999],[791, 031, 860, 557],[865, 259, 886, 322],[886, 462, 999, 976],[910, 216, 928, 322];tower:[000, 264, 142, 837],[190, 342, 293, 611],[288, 373, 395, 703],[386, 678, 458, 974],[433, 557, 504, 820],[440, 359, 536, 723],[523, 822, 595, 999],[568, 383, 620, 664],[603, 346, 732, 992],[737, 352, 795, 562],[798, 024, 860, 549],[885, 449, 998, 998],[910, 197, 932, 316].') 2023-11-03 13:33:03.411 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:33:03.411 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Lamp:[710, 087, 889, 179];Person:[334, 044, 946, 998],[492, 336, 675, 848],[849, 244, 984, 996],[912, 265, 999, 998];Watch:[732, 694, 773, 769];Necklace:[584, 610, 618, 779];Belt:[734, 665, 934, 789];Cup:[543, 826, 606, 938],[612, 623, 648, 683];Other Fish:[001, 836, 240, 967],[352, 648, 460, 747].') 2023-11-03 13:33:20.005 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:33:20.005 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'door:[388, 270, 584, 680];building:[004, 000, 999, 999];furniture:[000, 515, 398, 930],[586, 505, 999, 911];window:[074, 013, 093, 421],[878, 043, 893, 423].') 2023-11-03 13:34:13.442 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:34:13.444 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'clothing:[153, 724, 272, 968],[036, 447, 999, 936];man:[023, 606, 089, 773],[028, 449, 079, 517],[035, 712, 125, 828],[081, 610, 134, 700],[165, 674, 324, 954],[202, 610, 311, 764],[221, 439, 305, 662],[288, 484, 352, 621],[293, 429, 335, 524],[293, 566, 355, 708],[302, 671, 446, 940],[320, 606, 380, 742],[384, 495, 420, 618],[403, 427, 444, 497],[434, 666, 530, 887],[456, 438, 479, 511],[492, 435, 524, 606],[502, 621, 572, 744],[575, 505, 999, 719],[594, 591, 649, 798],[595, 454, 621, 562],[673, 619, 793, 809].') 2023-11-03 13:34:17.103 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:34:17.103 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[565, 398, 643, 852] and track its trajectory.Ensure that each trajectory within your response is articulated using the Frame t:[xmin,ymin,xmax,ymax] specification.', 'This is a personFrame1:[565, 398, 643, 852];Frame2:[543, 416, 625, 850];Frame3:[530, 433, 601, 844];Frame4:[496, 445, 574, 873].') 2023-11-03 13:34:35.001 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:34:35.001 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[832, 647, 883, 764],[774, 649, 823, 726],[694, 637, 767, 779],[647, 667, 706, 738],[623, 641, 704, 716],[483, 596, 600, 807],[300, 182, 558, 816],[059, 135, 498, 755],[104, 517, 303, 865],[001, 539, 104, 806];Sneakers:[346, 685, 434, 754],[269, 683, 395, 747],[203, 824, 271, 863],[298, 753, 411, 801],[415, 759, 497, 818];Volleyball:[547, 170, 640, 235].') 2023-11-03 13:35:06.341 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:35:06.341 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'girl:[400, 251, 760, 496];human face:[517, 273, 657, 409].') 2023-11-03 13:35:14.737 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:35:14.737 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Three month old english creme golden retriever puppy named Buddy sitting in the grass holding a stick [267, 401, 866, 548].') 2023-11-03 13:35:23.241 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:35:23.242 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'bicycle wheel:[787, 643, 874, 822];person:[894, 421, 967, 683],[206, 459, 305, 591],[246, 414, 448, 554],[482, 455, 497, 527],[533, 418, 567, 561],[597, 417, 636, 569],[682, 379, 828, 705],[970, 439, 999, 554];wheel:[792, 652, 867, 819],[207, 549, 263, 626],[663, 596, 721, 732];bicycle:[814, 485, 853, 574],[820, 467, 877, 566];clothing:[310, 445, 330, 522],[332, 445, 352, 520],[343, 425, 378, 513],[419, 454, 439, 522],[481, 460, 499, 531],[532, 431, 564, 559],[600, 430, 631, 569],[682, 424, 765, 542],[853, 425, 867, 493],[867, 427, 900, 507],[882, 451, 962, 642],[883, 320, 909, 394],[911, 243, 940, 313],[936, 239, 966, 305],[960, 317, 986, 383].') 2023-11-03 13:35:27.294 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:35:27.295 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Person:[665, 596, 682, 650];Bench:[606, 729, 674, 797],[494, 724, 567, 793],[500, 705, 556, 755],[601, 708, 660, 742],[774, 672, 839, 708],[862, 674, 896, 709],[762, 663, 794, 693],[816, 663, 870, 691];Desk:[521, 676, 642, 779],[780, 648, 867, 701];Flower:[902, 619, 998, 708];Vase:[925, 693, 972, 739].') 2023-11-03 13:35:35.917 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:35:35.917 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track truckFrame1:[734, 324, 999, 860],carFrame1:[538, 385, 701, 670],carFrame1:[126, 394, 169, 524],carFrame1:[391, 448, 514, 584],carFrame1:[201, 459, 272, 528] in this video clip.Ensure you use the exact format categoryFrame t:[xmin,ymin,xmax,ymax] in your response.', 'truckFrame1:[734, 324, 999, 860];Frame2:[780, 317, 999, 895],carFrame1:[538, 385, 701, 670];Frame2:[560, 378, 737, 680],carFrame1:[126, 394, 169, 524];Frame2:[149, 397, 188, 463],carFrame1:[391, 448, 514, 584];Frame2:[471, 445, 563, 551],carFrame1:[201, 459, 272, 528];Frame2:[110, 463, 207, 548].') 2023-11-03 13:35:36.551 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:35:36.552 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Outdoor area [292, 297, 997, 695] of large storage units with blue doors [001, 007, 112, 995] [762, 435, 803, 584] [282, 443, 318, 524] [548, 441, 609, 564] [492, 442, 546, 559] [612, 441, 679, 572] [682, 437, 759, 577] [443, 441, 491, 554] [815, 395, 961, 654] in a clean environment') 2023-11-03 13:35:52.610 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:35:52.610 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'tower:[131, 078, 769, 863].') 2023-11-03 13:35:59.069 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:35:59.070 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'woman:[494, 261, 658, 606],[002, 236, 056, 340],[065, 234, 105, 361],[101, 207, 139, 278],[168, 199, 221, 290],[203, 217, 246, 354],[252, 278, 461, 603],[380, 223, 439, 412],[448, 243, 495, 350],[548, 151, 591, 258],[623, 155, 668, 278],[660, 144, 721, 296],[717, 203, 744, 289],[913, 254, 999, 493];man:[018, 235, 274, 840],[000, 101, 999, 513],[695, 247, 895, 601];clothing:[243, 348, 446, 628],[000, 156, 999, 999];tableware:[146, 516, 495, 805],[226, 785, 326, 868],[384, 781, 456, 822],[403, 854, 517, 913],[475, 685, 531, 856],[506, 501, 784, 673],[520, 643, 544, 742],[546, 847, 570, 959],[548, 648, 572, 748],[577, 648, 611, 739],[593, 818, 688, 841],[593, 822, 712, 851],[618, 586, 639, 673],[638, 590, 658, 667],[648, 650, 676, 733],[663, 678, 730, 735],[705, 702, 799, 748],[723, 681, 784, 868],[731, 650, 795, 729],[753, 851, 810, 951],[789, 860, 904, 932],[818, 816, 896, 913],[822, 522, 999, 827],[895, 853, 978, 915];human face:[128, 281, 210, 445],[306, 298, 360, 416],[541, 290, 589, 381],[763, 300, 814, 388].') 2023-11-03 13:36:54.699 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:36:54.699 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect human head,mammal,woman,human nose and man in this image.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'human head:[025, 145, 117, 255],[157, 183, 248, 314],[381, 205, 472, 324],[458, 264, 540, 383],[613, 112, 713, 350],[660, 048, 887, 376],[937, 000, 999, 468];mammal:[000, 148, 136, 436],[000, 181, 263, 560],[000, 045, 886, 999],[283, 011, 999, 999],[331, 196, 467, 443],[380, 269, 553, 496];woman:[098, 165, 279, 482],[036, 112, 999, 999],[384, 272, 558, 491];human nose:[660, 164, 701, 241],[735, 298, 765, 382],[931, 176, 992, 299];man:[000, 048, 890, 999].') 2023-11-03 13:37:19.960 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:37:19.961 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The importance of the narrator in j d salingers [096, 346, 614, 987] the catcher in [001, 037, 995, 989] the rye') 2023-11-03 13:37:36.511 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:37:36.511 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'land vehicle:[000, 026, 745, 252],[000, 000, 999, 656];girl:[000, 138, 999, 999];wheel:[810, 420, 999, 698];clothing:[000, 640, 999, 999];human face:[220, 266, 819, 819].') 2023-11-03 13:37:53.160 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:37:53.160 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect Umbrella,Person,Backpack,Sandals and Hat in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Umbrella:[001, 607, 079, 681],[383, 581, 540, 661],[872, 613, 1000, 691],[001, 552, 064, 607],[094, 460, 897, 790];Person:[001, 654, 069, 944],[002, 646, 167, 908],[126, 652, 264, 935],[192, 664, 246, 788],[087, 834, 244, 894],[141, 614, 221, 671],[222, 644, 264, 681],[253, 635, 369, 942],[339, 651, 415, 690],[391, 656, 468, 776],[445, 610, 557, 860],[506, 685, 704, 1000],[547, 634, 623, 754],[640, 636, 714, 779],[684, 649, 755, 754],[603, 701, 826, 999],[760, 645, 805, 687],[780, 659, 889, 789],[897, 659, 1000, 786],[382, 296, 448, 344],[001, 301, 055, 352],[250, 010, 298, 067],[416, 008, 447, 063],[841, 006, 894, 059],[963, 003, 1000, 057],[529, 039, 563, 066];Backpack:[283, 680, 400, 773],[403, 747, 465, 791],[092, 685, 169, 761],[197, 695, 250, 773];Sandals:[001, 885, 063, 907],[212, 903, 261, 938],[659, 978, 704, 999];Hat:[131, 653, 200, 690],[653, 637, 711, 660].') 2023-11-03 13:38:27.266 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:38:27.267 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[006, 013, 975, 991] A fresco depicting an ancient deity discovered in Peru') 2023-11-03 13:38:42.745 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:38:42.745 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Frame:[612, 162, 739, 380],[640, 399, 714, 479],[829, 418, 887, 481],[789, 180, 929, 395],[969, 136, 999, 395];shelf:[000, 135, 079, 585],[080, 120, 299, 639];Chair:[000, 497, 135, 801],[018, 753, 490, 1000];Coffee Table:[283, 655, 847, 971];Couch:[268, 506, 843, 730];Pillow:[278, 512, 385, 638],[387, 517, 499, 643],[614, 527, 768, 638],[719, 512, 836, 641],[520, 517, 646, 634],[397, 503, 528, 636];Bed:[821, 609, 999, 949];Person:[325, 383, 346, 466],[360, 420, 377, 455],[461, 406, 479, 464],[480, 394, 506, 465],[553, 660, 591, 730].') 2023-11-03 13:38:49.958 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:38:49.959 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the black and white baby sheep shifting in this video clip?As you detail the trajectories in your reply, the TrackiFrame t:[xmin,ymin,xmax,ymax] format is the preferred structure.', 'There is one black and white baby sheep shifting.Track1frame:1:[261, 036, 547, 588];frame:2:[206, 025, 592, 585];frame:3:[241, 033, 516, 636].') 2023-11-03 13:38:52.602 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:38:52.603 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[225, 333, 832, 811] This meyer lemon blondie recipe is bright, citrusy and simple -- the perfect dessert for quickly and deliciously pulling you out of even the most dreary of winter funks.') 2023-11-03 13:39:00.147 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:39:00.148 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'SUV:[343, 797, 455, 968],[442, 819, 508, 908],[924, 789, 1000, 921];Sports Car:[495, 834, 588, 955];Car:[611, 827, 703, 950],[733, 829, 824, 943],[830, 824, 938, 932],[827, 884, 1000, 999].') 2023-11-03 13:39:27.653 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:39:27.653 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Hat:[232, 244, 324, 328],[501, 290, 589, 359];Belt:[255, 631, 375, 696];Car:[002, 273, 393, 999];Person:[671, 262, 836, 1000],[491, 289, 651, 1000],[198, 253, 392, 1000],[059, 360, 204, 556].') 2023-11-03 13:39:28.054 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:39:28.055 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Desk and Flower in this image.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Desk:[348, 546, 600, 788];Flower:[754, 429, 877, 534],[583, 667, 905, 999],[937, 474, 999, 572],[754, 429, 877, 534],[583, 667, 905, 999],[937, 474, 999, 572].') 2023-11-03 13:39:59.506 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:39:59.506 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'street light:[273, 271, 299, 571],[291, 299, 307, 433],[307, 316, 320, 418],[622, 366, 632, 430],[686, 354, 698, 510],[719, 340, 736, 531],[881, 315, 909, 565],[888, 374, 941, 511],[948, 376, 983, 515],[955, 437, 983, 568];train:[376, 427, 659, 584].') 2023-11-03 13:40:04.174 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:40:04.174 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Person:[887, 749, 961, 999],[791, 730, 885, 1000],[229, 722, 286, 998];Umbrella:[810, 654, 971, 799];Backpack:[810, 795, 870, 923].') 2023-11-03 13:40:12.602 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:40:12.602 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[293, 356, 413, 748],[024, 409, 070, 746],[042, 716, 330, 969],[068, 362, 167, 712],[134, 339, 214, 563],[174, 335, 215, 479],[214, 307, 262, 510],[224, 307, 269, 526],[241, 327, 309, 560],[266, 306, 309, 479],[331, 288, 361, 375],[369, 245, 407, 339],[424, 271, 454, 379],[497, 418, 574, 648],[504, 283, 547, 379],[539, 263, 572, 377],[596, 267, 650, 379],[616, 322, 744, 751],[639, 503, 849, 968],[693, 253, 735, 381],[709, 309, 794, 647],[754, 260, 788, 347],[782, 509, 976, 968],[823, 291, 875, 572],[852, 262, 913, 493],[910, 413, 977, 924],[925, 299, 977, 439];human hair:[699, 509, 787, 639],[054, 706, 217, 883],[852, 509, 925, 604];human head:[055, 729, 217, 964],[090, 361, 132, 429],[230, 277, 358, 368],[337, 322, 371, 385],[432, 242, 648, 316],[666, 318, 713, 394],[691, 496, 780, 645],[729, 301, 768, 368],[843, 509, 927, 632],[899, 414, 941, 485],[945, 345, 985, 414],[948, 413, 987, 509];man:[798, 434, 966, 1000],[561, 254, 605, 380],[586, 324, 653, 682],[784, 261, 876, 605];clothing:[290, 355, 418, 750],[044, 886, 335, 973],[556, 278, 602, 367],[596, 284, 641, 377],[619, 384, 748, 760],[641, 650, 848, 968],[714, 372, 791, 551],[779, 309, 863, 470],[780, 592, 974, 967],[801, 467, 855, 587],[858, 290, 910, 395],[868, 375, 903, 475];human face:[064, 171, 979, 964];human nose:[064, 290, 415, 422],[430, 262, 634, 306].') 2023-11-03 13:40:13.411 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:40:13.412 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'marinated salmon fillets in [000, 159, 990, 924] an air fryer basket') 2023-11-03 13:40:32.952 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 13:40:32.953 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[204, 020, 787, 807],[583, 452, 973, 893],[928, 476, 1000, 841];Whiteboard:[540, 446, 797, 651],[003, 356, 347, 725];Desk:[003, 745, 916, 999];Chair:[494, 581, 708, 917];Lamp:[735, 124, 1000, 262];Sneakers:[354, 767, 501, 809],[442, 750, 613, 793].') 2023-11-03 13:40:48.138 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:40:48.138 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Blackboard:[002, 003, 215, 618];Chair:[633, 512, 901, 1000],[347, 409, 617, 996];Desk:[561, 455, 951, 997];Storage box:[797, 374, 931, 503];Person:[401, 306, 627, 998];Ballon:[235, 749, 382, 970],[002, 634, 196, 931].') 2023-11-03 13:42:08.820 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:42:08.821 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[693, 531, 711, 561] and track its trajectory.For the trajectories included in the answer, please use the format Frame t:[xmin,ymin,xmax,ymax].', 'This is a traffic signFrame1:[693, 531, 711, 561];Frame2:[654, 532, 673, 562];Frame3:[629, 535, 651, 567];Frame4:[595, 534, 622, 575].') 2023-11-03 13:42:23.003 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:42:23.003 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Students [579, 450, 645, 564] [560, 489, 648, 655] [427, 464, 511, 561] [587, 312, 719, 564] [440, 527, 498, 678] [310, 554, 359, 690] [493, 514, 604, 752] [538, 543, 716, 906] [699, 273, 884, 930] learn about the unique geology of Coral Pink Sand Dunes State Park in Utah, where Navajo sandstone has been transformed over time into mountains [001, 374, 460, 625] and hills of sand [002, 596, 995, 998].') 2023-11-03 13:42:36.249 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:42:36.250 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Storage box:[491, 710, 529, 754];Desk:[444, 659, 634, 788],[104, 610, 211, 700];Person:[043, 545, 131, 883],[046, 565, 075, 712],[308, 556, 335, 683],[481, 573, 501, 658],[512, 505, 563, 729];Stroller:[113, 677, 227, 881];Tent:[563, 449, 868, 797],[786, 454, 932, 734];Flag:[553, 478, 594, 654].') 2023-11-03 13:43:31.342 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:43:31.343 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'shelf:[001, 368, 420, 1000],[356, 307, 588, 828],[539, 278, 685, 652],[689, 284, 830, 552];Person:[307, 231, 381, 537],[248, 267, 300, 348].') 2023-11-03 13:43:36.079 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:43:36.079 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Pie:[002, 621, 082, 709];Person:[120, 096, 438, 1000],[002, 179, 123, 544];Plate:[002, 567, 072, 623],[002, 617, 118, 724];Desk:[429, 551, 894, 1000],[436, 447, 528, 598];Frame:[350, 155, 413, 304];Mirror:[545, 032, 614, 444];Dinning Table:[002, 544, 174, 999];Handbag:[037, 254, 131, 499].') 2023-11-03 13:43:47.538 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:43:47.539 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect bicycle,bicycle wheel,wheel and person in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'bicycle:[000, 475, 228, 934],[106, 443, 373, 827],[502, 484, 601, 695],[665, 458, 885, 624];bicycle wheel:[108, 575, 239, 825],[286, 558, 376, 765],[491, 575, 570, 797],[521, 525, 558, 684],[596, 555, 653, 739],[686, 514, 713, 627],[721, 518, 744, 607],[730, 495, 754, 566],[739, 490, 758, 559],[820, 502, 845, 594],[838, 499, 858, 577];wheel:[112, 571, 240, 830],[288, 556, 375, 770],[492, 575, 566, 797],[596, 555, 651, 744],[686, 515, 712, 622],[723, 520, 740, 609],[731, 495, 750, 567],[745, 498, 756, 560],[821, 505, 840, 593],[841, 505, 856, 581];person:[000, 165, 978, 935],[113, 170, 340, 756],[406, 403, 427, 501],[427, 396, 443, 493],[436, 391, 468, 514],[468, 283, 640, 708],[644, 379, 880, 601],[880, 422, 904, 511],[946, 420, 967, 500].') 2023-11-03 13:43:53.345 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:43:53.346 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'jeans:[251, 485, 310, 841],[650, 576, 677, 831];jacket:[076, 312, 249, 576],[706, 437, 766, 537];woman:[368, 283, 463, 802],[421, 312, 528, 873],[478, 289, 548, 800],[668, 369, 770, 865],[733, 348, 847, 900];man:[051, 253, 165, 884],[074, 249, 255, 844],[240, 249, 343, 831],[533, 334, 673, 910],[633, 316, 715, 867],[677, 286, 778, 460];footwear:[165, 781, 191, 847].') 2023-11-03 13:43:56.101 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:43:56.102 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Person:[051, 132, 793, 884];Glasses:[650, 175, 693, 194];Slippers:[050, 600, 225, 673],[424, 840, 597, 884];Flower:[340, 000, 688, 094],[178, 080, 340, 212],[281, 326, 333, 384];Vase:[359, 042, 636, 120],[213, 123, 329, 171],[294, 379, 332, 445].') 2023-11-03 13:43:56.863 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:43:56.863 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[419, 440, 482, 512] and track its trajectory.Any trajectory details should be arranged according to the Frame t:[xmin,ymin,xmax,ymax] format for clarity.', 'This is a lugeFrame1:[419, 440, 482, 512];Frame2:[431, 439, 495, 510];Frame3:[409, 472, 482, 547];Frame4:[420, 457, 496, 532];Frame5:[431, 463, 506, 543].') 2023-11-03 13:43:59.947 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:43:59.947 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Other Shoes:[433, 943, 477, 985],[535, 948, 581, 999];Boat:[589, 002, 998, 970];Person:[326, 123, 660, 999],[612, 230, 675, 456];Lantern:[796, 204, 835, 298].') 2023-11-03 13:44:03.600 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:44:03.600 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Civil rights figures [638, 308, 851, 996] [382, 214, 614, 995] lead marchers [286, 308, 341, 417] [858, 303, 976, 998] [187, 341, 314, 995] [787, 346, 897, 996] [046, 285, 258, 996] [275, 317, 438, 996] [543, 324, 682, 995] across the Edmund Pettus Bridge during the recreation of the 1965 Selma to Montgomery march in Selma, Ala., March 4, 1990.') 2023-11-03 13:44:19.870 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:44:19.871 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[154, 140, 862, 864] Hash Brown Breakfast Casserole made with a hash brown patty crust, ham, swiss cheese, cheddar cheese and eggs and milk. Only a few minutes of preparation for this easy breakfast casserole recipe!') 2023-11-03 13:45:04.605 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 13:45:04.605 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Flag:[817, 115, 936, 516];Person:[894, 427, 953, 584];Chair:[374, 536, 490, 694],[564, 524, 658, 707];Lamp:[648, 347, 672, 417],[814, 372, 835, 427].') 2023-11-03 13:45:16.846 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:45:16.847 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Person:[454, 592, 475, 642];Pickup Truck:[452, 587, 674, 722];Car:[207, 615, 425, 729],[001, 608, 162, 733],[653, 616, 789, 702].') 2023-11-03 13:45:49.280 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:45:49.280 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'clothing:[206, 602, 685, 981],[000, 000, 998, 998];girl:[001, 546, 252, 998],[053, 194, 222, 617],[132, 300, 383, 824],[195, 412, 648, 998],[317, 225, 433, 651],[415, 123, 602, 660],[521, 354, 998, 998],[580, 100, 888, 625],[606, 041, 773, 240],[765, 208, 857, 410],[862, 356, 995, 740];human face:[056, 080, 127, 222],[113, 250, 208, 455],[163, 044, 222, 160],[223, 375, 365, 649],[247, 139, 328, 287],[293, 000, 350, 108],[328, 235, 415, 445],[358, 000, 396, 063],[406, 095, 467, 235],[458, 514, 618, 804],[462, 180, 577, 451],[632, 134, 777, 369],[650, 460, 821, 774],[751, 011, 802, 080],[775, 220, 857, 412],[781, 098, 880, 264],[865, 059, 900, 175],[897, 417, 998, 626],[948, 007, 998, 147].') 2023-11-03 13:46:17.295 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:46:17.296 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[608, 097, 872, 860] Chris Sharp with [270, 215, 551, 805] his wife and [519, 520, 671, 880] young son, walking along [003, 280, 998, 998] the Embankment in London') 2023-11-03 13:46:25.146 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:46:25.147 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Baseball Glove:[531, 287, 621, 408],[655, 540, 718, 623];Person:[322, 022, 621, 968],[674, 309, 899, 820],[805, 271, 889, 533],[250, 391, 365, 567],[329, 404, 399, 571],[566, 397, 686, 585],[616, 425, 685, 594];Hat:[475, 025, 597, 093],[748, 308, 814, 352];Belt:[405, 393, 571, 437];Sneakers:[791, 774, 876, 823],[725, 773, 775, 816],[468, 905, 592, 970],[318, 898, 438, 960].') 2023-11-03 13:47:21.810 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:47:21.811 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person,Cup and Trolley in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Person:[071, 282, 179, 522],[541, 427, 564, 524];Cup:[158, 372, 177, 418],[176, 383, 193, 426];Trolley:[465, 444, 485, 476].') 2023-11-03 13:48:44.266 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:48:44.267 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect loveseat,woman,man and clothing in this image.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'woman:[244, 101, 505, 838],[430, 082, 834, 938];man:[146, 110, 337, 516],[000, 000, 089, 685],[538, 046, 645, 291],[663, 000, 800, 262];clothing:[000, 041, 100, 632],[045, 000, 106, 101],[119, 000, 220, 097],[123, 273, 516, 857],[130, 118, 351, 543],[438, 226, 835, 826],[557, 086, 657, 287],[683, 013, 810, 317].') 2023-11-03 13:49:10.984 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 13:49:10.985 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'street light:[220, 048, 853, 1000],[200, 829, 334, 983].') 2023-11-03 13:49:32.258 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:49:32.258 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Young pretty business woman [089, 244, 548, 980] sitting at the desk with computer in blue office') 2023-11-03 13:49:39.616 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:49:39.618 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Ladder and Boots in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Ladder:[147, 605, 1000, 882];Boots:[577, 637, 662, 748],[455, 637, 529, 733].') 2023-11-03 13:49:50.503 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:49:50.503 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[524, 512, 564, 762],personFrame1:[445, 543, 564, 781],personFrame1:[741, 526, 811, 779],personFrame1:[460, 523, 528, 768],personFrame1:[262, 501, 333, 716] in this video clip.When answering, maintain the format classFrame t:[xmin,ymin,xmax,ymax] without deviation.', 'personFrame1:[524, 512, 564, 762];Frame2:[507, 506, 549, 762],personFrame1:[445, 543, 564, 781];Frame2:[455, 525, 536, 776],personFrame1:[741, 526, 811, 779];Frame2:[726, 518, 805, 779],personFrame1:[460, 523, 528, 768];Frame2:[463, 506, 529, 752],personFrame1:[262, 501, 333, 716];Frame2:[268, 490, 339, 694].') 2023-11-03 13:50:03.650 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:50:03.650 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Hat:[500, 265, 580, 367];Microphone:[379, 293, 577, 468];Head Phone:[701, 149, 786, 335];Person:[600, 148, 900, 1000],[417, 261, 656, 1000],[087, 287, 383, 1000];Trash bin Can:[898, 601, 999, 1000].') 2023-11-03 13:50:24.033 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:50:24.033 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'D.O.A.: Greatest Shits 1978-1998') 2023-11-03 13:50:37.058 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:50:37.059 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'beard:[122, 411, 574, 737];woman:[567, 327, 1000, 1000];human hair:[555, 335, 925, 737],[180, 366, 580, 485];human head:[543, 320, 898, 636],[148, 251, 599, 723];clothing:[014, 321, 770, 1000],[000, 019, 474, 504],[376, 000, 850, 383],[877, 516, 998, 704];human arm:[667, 014, 880, 219],[000, 000, 195, 053],[198, 000, 407, 045],[400, 000, 646, 192],[929, 786, 999, 998];human eye:[256, 376, 347, 408],[425, 391, 520, 439],[709, 485, 795, 518];human mouth:[261, 508, 452, 573],[630, 581, 793, 646];human body:[000, 000, 478, 656],[000, 618, 795, 999],[383, 000, 854, 690],[568, 358, 999, 999];man:[000, 000, 481, 665],[379, 000, 862, 696];human face:[175, 294, 576, 720],[565, 378, 875, 700];human nose:[323, 373, 439, 479],[646, 500, 733, 571];human hand:[512, 021, 642, 098],[614, 000, 714, 107].') 2023-11-03 13:51:21.496 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:51:21.496 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '210 local Keller Williams Realty Las Vegas [192, 463, 302, 740] [734, 324, 837, 728] agents volunteered their time helping homeless youth through NPHY for the real estate firm’s national RED Day service project for the fifth year in a r ...') 2023-11-03 13:52:08.541 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 13:52:08.541 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect boy and human face in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'boy:[748, 547, 816, 673];human face:[000, 197, 022, 267],[001, 796, 065, 907],[010, 272, 050, 346],[083, 211, 126, 293],[106, 117, 148, 184],[111, 714, 161, 811],[195, 732, 260, 830],[225, 199, 253, 263],[277, 639, 333, 714],[312, 216, 353, 322],[377, 181, 401, 255],[394, 153, 428, 227],[417, 209, 466, 278],[467, 148, 508, 238],[545, 180, 614, 307],[548, 861, 614, 970],[614, 711, 646, 800],[641, 109, 678, 203],[649, 709, 688, 809],[659, 073, 685, 161],[676, 045, 731, 170],[678, 701, 746, 853],[715, 263, 764, 361],[736, 884, 813, 999],[753, 578, 804, 666],[761, 148, 815, 234],[805, 567, 837, 636],[809, 749, 855, 824],[836, 080, 875, 165],[887, 681, 954, 826],[904, 119, 920, 184],[923, 095, 986, 196].') 2023-11-03 13:52:28.851 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:52:28.852 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Protect, clean up and speed up your computer [326, 732, 573, 884] for seniors [040, 559, 494, 885] : use the protection, clean up and optimization tools available in Windows Cover') 2023-11-03 13:52:42.850 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:52:42.850 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Hat:[150, 001, 370, 190],[696, 533, 741, 575];Person:[695, 533, 757, 653];Hanger:[555, 001, 697, 137],[768, 881, 916, 1000];Picture:[937, 148, 1000, 424],[200, 705, 454, 969].') 2023-11-03 13:53:17.633 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:53:17.634 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'human body:[188, 016, 843, 1000],[416, 736, 870, 999],[545, 815, 898, 999],[614, 208, 863, 820];woman:[523, 061, 865, 765];human head:[349, 018, 566, 156],[533, 066, 718, 233],[616, 704, 808, 861];human arm:[703, 155, 864, 281],[145, 201, 388, 753],[416, 827, 558, 993],[540, 876, 629, 999],[788, 745, 880, 810],[819, 239, 880, 680],[824, 718, 907, 900];jeans:[256, 577, 615, 952],[649, 618, 856, 805];boy:[541, 708, 913, 999];human leg:[233, 602, 541, 999];human hair:[494, 667, 706, 820],[504, 081, 799, 322],[596, 711, 823, 868];mammal:[404, 667, 907, 999],[547, 721, 917, 999],[565, 081, 860, 758];man:[155, 008, 872, 999];girl:[423, 677, 874, 999],[548, 713, 908, 999];human face:[376, 033, 530, 178],[537, 116, 650, 222].') 2023-11-03 13:53:46.087 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:53:46.087 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Person:[003, 002, 919, 772],[103, 002, 1000, 1000];Hat:[348, 031, 901, 272];Microphone:[534, 277, 723, 333].') 2023-11-03 13:54:07.935 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:54:07.936 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Get into the spooky spirit without busting your budget with [185, 765, 449, 954] these "spooktacular" Halloween decorations that cost $13 or less! Check them out!') 2023-11-03 13:54:15.633 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:54:15.633 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[534, 475, 570, 654],personFrame1:[481, 519, 527, 648],personFrame1:[200, 400, 235, 541],personFrame1:[759, 365, 821, 494],personFrame1:[258, 406, 300, 622] in this video clip.Use only the categoryFrame t:[xmin,ymin,xmax,ymax] format to structure your answer.', 'personFrame1:[534, 475, 570, 654];Frame2:[553, 481, 589, 661],personFrame1:[481, 519, 527, 648];Frame2:[499, 525, 546, 654],personFrame1:[200, 400, 235, 541];Frame2:[221, 401, 254, 545],personFrame1:[759, 365, 821, 494];Frame2:[751, 358, 814, 504],personFrame1:[258, 406, 300, 622];Frame2:[276, 412, 319, 626].') 2023-11-03 13:54:30.972 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:54:30.973 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[675, 540, 691, 552] and track its trajectory.For the trajectories included in the answer, please use the format Frame t:[xmin,ymin,xmax,ymax].', 'This is a license plate on back of red car driving on roadFrame1:[675, 540, 691, 552];Frame2:[675, 538, 691, 551];Frame3:[675, 540, 691, 554];Frame4:[675, 538, 691, 551].') 2023-11-03 13:54:39.574 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:54:39.575 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Flower:[317, 357, 514, 657],[125, 208, 273, 407],[715, 379, 819, 545];Vase:[424, 632, 494, 706],[426, 698, 495, 767],[769, 522, 812, 592];Desk:[115, 478, 572, 759];Cabinet:[043, 483, 375, 756];Chair:[233, 469, 334, 675],[297, 441, 455, 689],[388, 660, 446, 750],[452, 439, 561, 666],[001, 618, 305, 1000],[784, 477, 948, 682],[331, 447, 435, 571];Pillow:[636, 709, 1000, 1000],[885, 523, 999, 759],[851, 446, 934, 551];Couch:[263, 600, 999, 999];Lamp:[159, 301, 207, 512];Carpet:[181, 623, 831, 999];Bench:[596, 485, 999, 571].') 2023-11-03 13:55:07.116 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:55:07.116 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Cabinet:[001, 219, 999, 996];Leather Shoes:[874, 183, 991, 373],[629, 121, 747, 331];Sneakers:[214, 094, 375, 276],[001, 002, 177, 252],[066, 458, 248, 842],[277, 304, 582, 992],[583, 457, 829, 997],[814, 762, 999, 995];Flag:[363, 150, 620, 521].') 2023-11-03 13:55:45.752 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:55:45.752 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[353, 331, 410, 536],personFrame1:[282, 387, 338, 562],personFrame1:[450, 531, 500, 731],personFrame1:[482, 236, 515, 412],personFrame1:[379, 172, 418, 326],personFrame1:[366, 640, 420, 848],personFrame1:[764, 508, 807, 729] in this video clip.Your response should conform strictly to the categoryFrame t:[xmin,ymin,xmax,ymax] format.', 'personFrame1:[353, 331, 410, 536];Frame2:[355, 319, 400, 525],personFrame1:[282, 387, 338, 562];Frame2:[286, 404, 343, 569],personFrame1:[450, 531, 500, 731];Frame2:[450, 530, 501, 733],personFrame1[482, 236, 515, 412];Frame2:No object,personFrame1:[379, 172, 418, 326];Frame2:[377, 175, 417, 326],personFrame1:[366, 640, 420, 848];Frame2:[366, 638, 419, 837],personFrame1:[764, 508, 807, 729];Frame2:[758, 505, 799, 713].') 2023-11-03 13:56:05.333 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:56:05.333 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'artichoke:[221, 525, 500, 792],[000, 418, 215, 731],[000, 000, 998, 998].') 2023-11-03 13:56:32.865 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:56:32.865 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'OMG! You have to check out [006, 205, 994, 429] these scary-awesome zombie recipes before your next Halloween party!') 2023-11-03 13:56:35.052 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:56:35.052 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'beard:[446, 260, 625, 521];human body:[355, 160, 935, 988],[000, 198, 258, 999];human hair:[438, 103, 650, 365],[000, 210, 076, 520];mammal:[000, 325, 256, 998],[376, 107, 999, 999];human arm:[386, 648, 824, 1000],[000, 928, 125, 999],[130, 580, 266, 999],[388, 936, 423, 999];human ear:[621, 268, 656, 365];human head:[000, 205, 084, 534],[440, 100, 669, 550];man:[389, 101, 999, 999];girl:[000, 188, 257, 999];clothing:[000, 520, 260, 999],[380, 400, 999, 999];human hand:[023, 933, 125, 999],[413, 680, 668, 967].') 2023-11-03 13:56:37.666 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:56:37.667 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Fan:[327, 033, 576, 173];Lamp:[420, 112, 488, 230],[127, 201, 191, 408];Picture:[234, 270, 325, 449];Cabinet:[491, 523, 783, 668],[766, 442, 895, 643];Stool:[929, 753, 999, 860];Chair:[682, 547, 780, 711],[114, 721, 186, 1000],[209, 794, 280, 1000];Couch:[776, 627, 981, 817],[466, 688, 929, 1000],[275, 622, 458, 867];Desk:[355, 763, 472, 928],[002, 828, 178, 999],[141, 645, 228, 742].') 2023-11-03 13:57:00.131 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 13:57:00.132 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person,Gloves and Hockey Stick in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Person:[124, 132, 156, 189],[225, 134, 262, 189],[395, 117, 429, 188],[572, 201, 782, 709],[780, 111, 978, 582],[712, 096, 796, 323];Gloves:[780, 277, 817, 337],[940, 310, 977, 379],[627, 396, 678, 470],[572, 480, 621, 564];Hockey Stick:[438, 454, 649, 680],[668, 223, 942, 359],[000, 836, 109, 884].') 2023-11-03 13:57:11.045 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:57:11.046 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'dog:[145, 124, 203, 268],[175, 096, 286, 279],[312, 313, 845, 999],[331, 150, 402, 239],[360, 127, 442, 261],[451, 271, 703, 870].') 2023-11-03 13:57:13.551 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:57:13.551 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'flower:[073, 000, 999, 980].') 2023-11-03 13:57:20.531 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:57:20.531 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'seven white oil drums [023, 381, 218, 709] [462, 328, 584, 486] [789, 298, 908, 506] [420, 471, 650, 979] [838, 346, 997, 630] with blue light [499, 290, 585, 321] [801, 298, 906, 331] [860, 346, 998, 390] [030, 334, 156, 376] [423, 471, 645, 570] [024, 382, 189, 438] and two black wires coming out of each') 2023-11-03 13:58:00.111 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:58:00.111 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'airplane [367, 391, 393, 439] in the sky disappears behind white clouds [004, 002, 994, 994] leaving contrail [112, 397, 389, 997] - aerospace stock videos & royalty-free footage') 2023-11-03 13:58:21.948 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:58:21.948 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[780, 252, 1000, 999],[589, 073, 808, 999],[165, 017, 496, 998],[019, 048, 965, 997],[001, 205, 238, 999];Helmet:[001, 204, 176, 542],[319, 018, 494, 345],[428, 046, 631, 381],[596, 068, 673, 295],[882, 254, 992, 420];Gloves:[020, 720, 097, 863],[884, 796, 968, 948],[591, 284, 660, 356].') 2023-11-03 13:58:41.480 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:58:41.481 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'food:[106, 000, 996, 988].') 2023-11-03 13:59:01.698 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:59:01.698 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[400, 177, 523, 886] and track its trajectory.Stick to the Frame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'This is a personFrame1:[400, 177, 523, 886];Frame2:[392, 162, 519, 886];Frame3:[376, 176, 495, 852];Frame4:[364, 165, 512, 891];Frame5:[360, 165, 514, 925].') 2023-11-03 13:59:10.475 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:59:10.475 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Protestors [411, 167, 552, 983] [717, 278, 881, 910] [882, 215, 995, 978] [483, 132, 713, 995] [260, 151, 436, 991] [147, 130, 302, 979] [002, 111, 168, 981] look on during the gathering [006, 069, 993, 994] on Sunday with one placard [593, 519, 701, 714] reading 'Save the trees for me'") 2023-11-03 13:59:55.820 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 13:59:55.820 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[220, 491, 543, 934] and track its trajectory.Ensure that each trajectory within your response is articulated using the Frame t:[xmin,ymin,xmax,ymax] specification.', 'This is a giant pandaFrame1:[220, 491, 543, 934];Frame2:[161, 471, 466, 914];Frame3:[161, 519, 434, 889];Frame4:[258, 513, 557, 856].') 2023-11-03 14:00:01.976 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:00:01.978 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '7 Up can taste more lemony if the packaging [524, 096, 669, 934] is yellowed with great design') 2023-11-03 14:00:54.340 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:00:54.341 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Analyze the image and provide an answer to my question 'QUESTION: What are [702, 719, 766, 971] and [073, 666, 146, 970] and [259, 807, 330, 980] and [628, 731, 688, 961] and [126, 836, 184, 992] and [437, 778, 496, 984] and [524, 886, 561, 990] and [717, 555, 766, 760] and [859, 558, 908, 771] and [363, 755, 431, 972] and [914, 542, 966, 788] and [188, 850, 239, 988] and [000, 040, 988, 987] and [767, 543, 821, 778] doing there ?'. Please explain your reasoning and indicate the spatial location of objects using [xmin,ymin,xmax,ymax].", 'People go to arenas like this to watch something that excites them or some form of entertainment . They are there to be entertained .') 2023-11-03 14:00:59.177 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:00:59.177 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[382, 115, 625, 957] and track its trajectory.Any trajectory details should be arranged according to the Frame t:[xmin,ymin,xmax,ymax] format for clarity.', 'This is a personFrame1:[382, 115, 625, 957];Frame2:[421, 120, 756, 967];Frame3:[392, 083, 633, 979];Frame4:[429, 131, 631, 980].') 2023-11-03 14:01:24.549 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:01:24.549 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Upward Graph Infographic - Vector illustration of upward...') 2023-11-03 14:02:00.543 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:02:00.543 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Boat:[002, 724, 129, 901],[072, 573, 347, 890],[346, 562, 578, 865],[490, 497, 706, 853],[637, 489, 783, 843],[759, 565, 956, 837];Crane:[188, 322, 257, 660],[841, 565, 992, 675].') 2023-11-03 14:02:11.419 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:02:11.419 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Person:[601, 835, 658, 953],[674, 842, 699, 946],[748, 920, 777, 953],[794, 853, 826, 956],[824, 889, 868, 941],[864, 857, 907, 972],[919, 858, 953, 967],[978, 901, 999, 999].') 2023-11-03 14:02:23.864 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:02:23.864 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Bedroom with [251, 497, 806, 996] a king log bed, [730, 606, 860, 808] two night stands, [401, 454, 453, 587] [756, 448, 840, 643] two lamps, and [146, 269, 305, 816] deck access at Bootlegger Hill Hideaway, a 2 bedroom cabin rental located in Pigeon Forge') 2023-11-03 14:02:32.441 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:02:32.441 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[256, 409, 301, 627],personFrame1:[320, 198, 357, 375],personFrame1:[267, 483, 318, 695],personFrame1:[885, 494, 939, 711],personFrame1:[127, 593, 178, 825],personFrame1:[707, 462, 762, 638],personFrame1:[324, 134, 367, 316] in this video clip.Keep your response consistent with the classFrame t:[xmin,ymin,xmax,ymax] format.', 'personFrame1:[256, 409, 301, 627];Frame2:[260, 413, 312, 634],personFrame1:[320, 198, 357, 375];Frame2:[283, 202, 340, 379],personFrame1:[267, 483, 318, 695];Frame2:[270, 486, 320, 694],personFrame1:[885, 494, 939, 711];Frame2:[891, 488, 938, 712],personFrame1:[127, 593, 178, 825];Frame2:[130, 597, 176, 829],personFrame1:[707, 462, 762, 638];Frame2:[718, 466, 768, 636],personFrame1:[324, 134, 367, 316];Frame2:[304, 140, 342, 320].') 2023-11-03 14:02:51.830 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:02:51.830 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Desk:[001, 795, 291, 996],[001, 633, 363, 908],[067, 553, 415, 761],[404, 572, 616, 766],[606, 580, 732, 772],[600, 636, 960, 913],[696, 791, 998, 999];Chair:[055, 744, 113, 799],[499, 552, 601, 748],[853, 755, 921, 802];Lamp:[777, 032, 997, 203];Book:[558, 565, 610, 596];Ladder:[971, 284, 999, 626];Blackboard:[001, 248, 346, 494];Projector:[441, 001, 566, 061];TV:[740, 526, 853, 648],[633, 504, 686, 602],[327, 489, 403, 586],[220, 489, 303, 584],[127, 515, 237, 641],[001, 641, 058, 828],[921, 652, 999, 839];Mouse:[848, 814, 876, 845];Computer Box:[908, 792, 999, 910],[735, 629, 854, 684],[123, 625, 242, 673],[001, 781, 078, 899];Speaker:[431, 207, 478, 289].') 2023-11-03 14:03:21.148 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:03:21.149 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Fashion first:\xa0The 51-year-old singer [204, 021, 739, 917] molded style and practicality as she peddled uphill in a pair [251, 368, 611, 799] of holographic leggings') 2023-11-03 14:03:38.417 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:03:38.418 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'C. The three laws [052, 228, 925, 769] of motion described how any object moves when forces act on it.') 2023-11-03 14:03:44.138 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:03:44.138 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'woman:[362, 406, 595, 998],[065, 593, 405, 998],[737, 467, 774, 683],[884, 438, 922, 610];man:[580, 338, 745, 984],[027, 297, 346, 824],[344, 361, 466, 932],[523, 414, 607, 668],[757, 433, 816, 809],[948, 493, 998, 636],[979, 393, 998, 483];tree:[571, 035, 996, 497],[000, 000, 362, 312],[314, 064, 586, 474];clothing:[595, 433, 739, 958],[023, 483, 324, 832],[046, 804, 394, 998],[347, 468, 470, 879],[374, 627, 614, 998],[509, 470, 599, 668],[735, 504, 768, 645],[762, 476, 817, 772],[886, 480, 926, 600],[913, 621, 998, 694],[941, 521, 998, 630],[977, 399, 998, 510];bicycle:[710, 557, 946, 706],[821, 647, 998, 934];bicycle wheel:[714, 602, 752, 687],[803, 604, 823, 687],[822, 757, 929, 898],[827, 640, 891, 702],[974, 824, 998, 943].') 2023-11-03 14:04:40.374 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:04:40.374 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[008, 002, 992, 996] Cute little witch girl sitting on a broomstick halloween illustration') 2023-11-03 14:05:05.890 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:05:05.890 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Street Lights:[369, 110, 435, 565],[332, 280, 369, 531];Car:[315, 499, 357, 555],[215, 498, 289, 592],[088, 502, 136, 561];Pickup Truck:[058, 499, 097, 548];Parking meter:[958, 449, 994, 761].') 2023-11-03 14:05:19.008 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:05:19.008 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given an image,please detect person in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'person:[682, 290, 820, 1000],[532, 014, 633, 1000],[452, 062, 538, 938],[353, 062, 450, 926],[375, 454, 531, 1000],[342, 424, 525, 1000],[304, 350, 410, 986],[165, 338, 282, 972],[002, 352, 125, 1000],[165, 654, 427, 1000],[907, 546, 1000, 1000],[000, 530, 162, 1000],[683, 096, 997, 362],[647, 632, 815, 1000],[419, 822, 561, 1000].') 2023-11-03 14:05:21.123 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:05:21.123 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[641, 396, 686, 466],carFrame1:[841, 309, 1000, 757],carFrame1:[703, 384, 764, 525],carFrame1:[216, 360, 385, 571],carFrame1:[000, 359, 246, 727] in this video clip.Ensure you use the exact format categoryFrame t:[xmin,ymin,xmax,ymax] in your response.', 'carFrame1:[641, 396, 686, 466];Frame2:[635, 383, 687, 463],carFrame1[841, 309, 1000, 757];Frame2:No object,carFrame1:[703, 384, 764, 525];Frame2:[735, 364, 863, 576],carFrame1:[216, 360, 385, 571];Frame2:[000, 308, 236, 710],carFrame1[000, 359, 246, 727];Frame2:No object.') 2023-11-03 14:05:30.141 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:05:30.141 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect man and building in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'man:[406, 628, 653, 1000];building:[302, 000, 997, 761].') 2023-11-03 14:06:38.230 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:06:38.230 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Dynamic Rock Formation Along Ring Road in ICELAND - CIRCA AUGUST, 2014 Stock Footage') 2023-11-03 14:07:01.939 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:07:01.940 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[311, 175, 344, 243] and track its trajectory.Stick to the Frame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'This is a faceFrame1:[311, 175, 344, 243];Frame2:[304, 221, 335, 286];Frame3:[300, 211, 333, 280];Frame4:[312, 184, 344, 251].') 2023-11-03 14:07:02.705 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:07:02.705 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Person:[727, 163, 904, 732],[613, 169, 782, 716],[288, 242, 508, 726],[189, 163, 389, 722];Sneakers:[836, 668, 900, 727],[769, 667, 837, 732],[487, 682, 524, 735],[206, 668, 236, 722];Bow Tie:[611, 672, 662, 717];Gloves:[754, 410, 797, 466],[723, 275, 758, 343];Helmet:[774, 162, 838, 226],[678, 168, 727, 223],[341, 241, 389, 300],[259, 164, 317, 227];Satchel:[836, 079, 888, 130];Hockey Stick:[437, 455, 555, 527].') 2023-11-03 14:07:10.708 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:07:10.708 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Minister Bohara [001, 003, 934, 996] commits no stale goods in market') 2023-11-03 14:07:23.848 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:07:23.849 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'tree:[000, 383, 070, 682],[076, 492, 118, 628],[099, 502, 127, 605],[128, 538, 150, 601],[146, 494, 171, 590],[172, 477, 188, 584],[188, 502, 208, 582],[204, 491, 216, 569],[211, 491, 260, 582],[281, 577, 331, 641],[327, 500, 353, 571],[338, 577, 427, 760],[354, 491, 378, 559],[387, 478, 433, 544],[419, 478, 482, 555],[443, 577, 513, 766],[474, 436, 524, 536],[536, 436, 566, 555],[555, 482, 610, 569],[605, 477, 635, 548],[619, 338, 735, 546],[746, 411, 896, 533],[876, 352, 923, 519],[955, 442, 980, 507],[981, 429, 999, 492].') 2023-11-03 14:08:27.551 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:08:27.552 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[374, 115, 629, 695] and track its trajectory.Ensure the trajectories in your answer follow the Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a lesser pandaFrame1:[374, 115, 629, 695];Frame2:[407, 250, 592, 800];Frame3:[381, 222, 556, 779];Frame4:[357, 145, 531, 769];Frame5:[432, 067, 541, 701].') 2023-11-03 14:08:38.671 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:08:38.671 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Biola's women's cross country team [159, 263, 617, 996] [529, 198, 897, 997] gets ready at the starting line in the NCCAA national championships on November 10, 2018.") 2023-11-03 14:08:52.643 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:08:52.644 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The video shows [748, 463, 793, 572] [521, 514, 569, 614] the young teens waiting at the concrete barrier between the south and north-bound lanes of the freeway, before running out in front of peak-hour on-coming traffic.') 2023-11-03 14:08:59.102 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:08:59.103 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect man,clothing,human face and woman in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'man:[070, 240, 304, 860],[000, 554, 157, 838],[271, 293, 404, 847],[358, 204, 597, 866],[626, 210, 878, 917],[798, 454, 999, 886];clothing:[000, 739, 144, 846],[075, 386, 332, 839],[296, 439, 382, 609],[350, 386, 608, 875],[630, 363, 886, 889],[666, 885, 756, 999],[798, 666, 999, 873];human face:[119, 609, 145, 753],[186, 262, 253, 401],[285, 319, 354, 450],[436, 254, 512, 377],[570, 439, 612, 514],[711, 272, 778, 379];woman:[178, 578, 367, 999],[528, 610, 734, 999].') 2023-11-03 14:09:21.257 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:09:21.258 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the the hyena that is displaying a higher speed of movement.?As you detail the trajectories in your reply, the TrackiFrame t:[xmin,ymin,xmax,ymax] format is the preferred structure.', 'There is one the hyena that is displaying a higher speed of movement..Track1frame:1:[000, 665, 195, 876].') 2023-11-03 14:09:25.661 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:09:25.661 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Scooter,Air Conditioner and Umbrella in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Scooter:[120, 708, 172, 780];Air Conditioner:[928, 017, 1000, 123];Umbrella:[250, 616, 319, 648].') 2023-11-03 14:09:44.745 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:09:44.745 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[217, 330, 386, 568] and track its trajectory.Use the specified Frame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'This is a large civetFrame1:[217, 330, 386, 568];Frame2:[260, 441, 495, 766];Frame3:[171, 541, 398, 876];Frame4:[246, 511, 483, 959];Frame5:[289, 441, 532, 880].') 2023-11-03 14:09:45.757 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:09:45.758 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect tree and woman in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'tree:[000, 000, 440, 323],[436, 000, 539, 286],[613, 000, 785, 353],[888, 000, 999, 283];woman:[787, 334, 915, 968],[221, 327, 312, 999],[249, 139, 445, 995],[636, 327, 694, 442],[662, 380, 783, 999],[763, 321, 820, 440],[891, 305, 935, 600].') 2023-11-03 14:10:03.834 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:10:03.834 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[197, 324, 576, 712] Homemade Cornbread with [002, 183, 434, 420] Jalapeños will make for a perfect side dish with all of your chili, stew, and soup recipes this fall and winter') 2023-11-03 14:10:11.608 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:10:11.608 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'bride [399, 225, 603, 997] with green and white flowers [326, 636, 488, 905] walks to the front of ceremony with father [564, 257, 810, 995]') 2023-11-03 14:10:37.117 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:10:37.117 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'boat:[210, 110, 1000, 722];tree:[189, 023, 366, 317],[000, 000, 999, 366].') 2023-11-03 14:10:56.767 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:10:56.768 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Chair:[142, 786, 756, 999];Desk:[001, 458, 999, 1000];shelf:[135, 253, 864, 444],[874, 283, 999, 998];Picture:[406, 001, 936, 045];Lamp:[204, 183, 328, 560];Storage box:[164, 303, 252, 346],[167, 325, 255, 382],[166, 357, 259, 416],[815, 373, 910, 506];Scissors:[683, 347, 726, 471],[578, 607, 624, 654];Tape:[217, 589, 305, 650];Pen:[837, 436, 870, 490];Folder:[533, 167, 586, 300];Book:[840, 039, 928, 210];Cup:[966, 236, 999, 279],[946, 167, 994, 225],[948, 107, 996, 157];Basin:[733, 348, 835, 426].') 2023-11-03 14:12:21.616 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:12:21.618 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'human body:[208, 157, 924, 1000],[000, 490, 345, 999],[156, 415, 325, 584],[298, 454, 426, 707],[623, 250, 930, 638];human hair:[386, 064, 613, 402],[000, 262, 087, 553],[071, 183, 343, 535],[092, 095, 229, 274],[660, 000, 833, 255];human head:[380, 066, 623, 526],[000, 272, 111, 725],[080, 184, 220, 503],[338, 203, 430, 553],[646, 000, 843, 296],[863, 000, 957, 084];clothing:[251, 261, 931, 989],[000, 483, 345, 999],[610, 245, 928, 578],[810, 088, 995, 486];human arm:[805, 382, 950, 664],[208, 601, 348, 999],[737, 620, 932, 999];man:[000, 271, 349, 999],[319, 057, 931, 999];human face:[000, 515, 069, 723],[081, 259, 209, 519],[345, 259, 438, 532],[387, 182, 574, 555],[643, 096, 721, 288];microphone:[181, 485, 420, 716].') 2023-11-03 14:12:36.184 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:12:36.185 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Rough edges of [001, 203, 999, 995] a cast iron skillet could scratch a glass cooktop') 2023-11-03 14:12:58.346 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:12:58.346 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'new york congressman hakeem jeffries [343, 115, 700, 996] argues against adoption of the budget resolution, charging that republicans are determined to visit cruel and... - punishment stock videos & royalty-free footage') 2023-11-03 14:14:11.338 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:14:11.338 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[586, 318, 880, 560] and track its trajectory.If you're including trajectory details in your reply, the Frame t:[xmin,ymin,xmax,ymax] format is imperative.", 'This is a leopardFrame1:[586, 318, 880, 560];Frame2:[593, 318, 873, 560];Frame3:[601, 318, 864, 550];Frame4:[601, 314, 855, 541];Frame5:[608, 314, 853, 529].') 2023-11-03 14:14:19.741 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 14:14:19.741 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'boat:[020, 030, 1000, 885].') 2023-11-03 14:15:06.174 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 14:15:06.175 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Book in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Book:[095, 574, 279, 761],[081, 788, 228, 966],[262, 587, 397, 751],[413, 590, 542, 756],[553, 593, 677, 754],[797, 605, 946, 755],[678, 594, 810, 753],[786, 391, 882, 573],[653, 382, 762, 566],[544, 391, 640, 567],[434, 385, 530, 572],[309, 362, 420, 567],[204, 384, 298, 556].') 2023-11-03 14:15:25.624 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 14:15:25.624 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Person:[000, 081, 514, 995],[439, 182, 772, 945].') 2023-11-03 14:15:29.076 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:15:29.076 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Car and Person in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Car:[908, 925, 979, 972],[031, 884, 121, 928],[321, 767, 386, 809],[231, 763, 292, 802],[237, 732, 298, 766],[337, 736, 401, 772],[255, 672, 304, 704],[286, 631, 324, 659],[038, 904, 121, 960];Person:[402, 924, 438, 986],[435, 924, 475, 984].') 2023-11-03 14:15:42.670 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:15:42.671 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Bakset,Other Shoes,Bench and Sandals in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Bakset:[558, 678, 639, 758];Other Shoes:[737, 741, 807, 791];Bench:[000, 622, 291, 995],[104, 535, 188, 618];Sandals:[063, 808, 114, 911].') 2023-11-03 14:15:59.055 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:15:59.055 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Purse,Leather Shoes and SUV in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Purse:[728, 243, 774, 309];Leather Shoes:[699, 775, 768, 841];SUV:[507, 068, 951, 425].') 2023-11-03 14:16:02.973 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:16:02.973 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'man:[040, 287, 232, 920],[180, 310, 425, 912],[393, 326, 565, 877],[599, 342, 748, 640],[780, 314, 950, 883];jeans:[446, 535, 546, 865];fashion accessory:[105, 877, 145, 945],[109, 813, 140, 883],[218, 807, 279, 901],[236, 761, 261, 829],[381, 716, 426, 792],[641, 344, 716, 428];footwear:[101, 865, 153, 946],[210, 808, 289, 915],[315, 850, 354, 916],[377, 712, 428, 793],[827, 814, 881, 885].') 2023-11-03 14:16:14.774 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:16:14.774 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Glasses:[264, 314, 384, 387],[435, 347, 553, 438];Handbag:[863, 603, 999, 907];Person:[403, 302, 838, 999],[407, 244, 742, 999],[066, 229, 447, 1000];Bench:[054, 511, 1000, 1000].') 2023-11-03 14:16:29.178 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:16:29.179 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'How to get a motorcycle licence in [034, 004, 972, 946] the UK') 2023-11-03 14:16:58.443 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:16:58.445 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Jaguar XF 2008 2009 2010 2011 2012 2013 2014 2015 CAR COVER - 100% ALL-WEATHER [200, 065, 668, 161]!!') 2023-11-03 14:17:29.468 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 14:17:29.468 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'boat:[000, 000, 561, 812],[711, 494, 999, 710].') 2023-11-03 14:17:38.743 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:17:38.744 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the boats shifting to left-side?When you incorporate trajectories into your answer, adhere to the TrackiFrame t:[xmin,ymin,xmax,ymax] format.,Present the trajectories in your response using the designated TrackiFrame t:[xmin,ymin,xmax,ymax] layout.', 'There are 3 boats shifting to left-side.Track1frame:1:[639, 210, 837, 682];frame:2:[668, 227, 868, 700];frame:3:[683, 238, 881, 715],Track2frame:1:[762, 278, 894, 612];frame:2:[801, 293, 923, 632];frame:3:[814, 302, 932, 650],Track3frame:1:[095, 680, 259, 775];frame:2:[141, 695, 303, 785];frame:3:[171, 705, 332, 798].') 2023-11-03 14:18:15.538 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:18:15.539 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Stool:[492, 582, 547, 685],[478, 556, 547, 663],[532, 813, 672, 1000],[549, 911, 707, 999];Mirror:[518, 297, 589, 596];Flower:[491, 410, 544, 555],[583, 576, 830, 1000];shelf:[001, 326, 257, 701];Person:[380, 321, 444, 570],[350, 349, 407, 553],[191, 341, 245, 425].') 2023-11-03 14:18:38.063 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:18:38.064 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Stroller:[930, 597, 952, 640];Tent:[425, 508, 483, 615].') 2023-11-03 14:18:42.793 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:18:42.793 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Person:[297, 133, 371, 390],[412, 137, 668, 882],[737, 081, 881, 700],[133, 189, 288, 874];Sneakers:[172, 819, 240, 875],[133, 784, 178, 856],[508, 789, 541, 881],[536, 763, 562, 854],[847, 638, 881, 699],[754, 641, 810, 686];Soccer:[423, 790, 493, 895].') 2023-11-03 14:18:51.736 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:18:51.737 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2 and frame3, please tell me what this oneframe:1:[921, 662, 999, 897];frame:2:[733, 665, 819, 748];frame:3:[490, 677, 678, 846] is doing?Briefly articulate your response by taking cues from the provided image frames and trajectory coordinates.', 'This is a/an horse running in a circle') 2023-11-03 14:19:15.763 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:19:15.764 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Car in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Car:[869, 105, 908, 150],[882, 109, 999, 170],[177, 084, 298, 168],[000, 074, 207, 271],[000, 182, 061, 402].') 2023-11-03 14:19:23.591 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:19:23.592 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect Sneakers and Bottle in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Sneakers:[588, 856, 665, 907],[438, 861, 532, 905];Bottle:[815, 321, 899, 543],[390, 334, 459, 529].') 2023-11-03 14:19:46.018 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:19:46.019 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[482, 409, 558, 995] Horacio Santos opens to the door at Bradshaw-Carter funeral home to wait for a client Monday, May 17, 2021 in Houston. [004, 009, 992, 991] The funeral home still has [685, 181, 921, 498] signs posted on its doors requiring [493, 565, 551, 696] masks.') 2023-11-03 14:20:04.698 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:20:04.698 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'There is room in the smallest cottage for [130, 245, 367, 772] a happy loving pair. - Friedrich Schiller') 2023-11-03 14:20:52.702 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:20:52.702 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Miley Cyrus [269, 042, 864, 993] holds her hand [417, 091, 610, 228] to her face [418, 086, 608, 226] to escape having her picture taken as she walks along a sidewalk...') 2023-11-03 14:20:55.508 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:20:55.509 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'February 1962; [242, 310, 900, 838] A man poses for the camera at a social in Ballyduff.') 2023-11-03 14:21:19.189 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:21:19.189 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[709, 463, 743, 572] and track its trajectory.Any trajectory details should be arranged according to the Frame t:[xmin,ymin,xmax,ymax] format for clarity.', 'This is a personFrame1:[709, 463, 743, 572];Frame2:[728, 430, 767, 512];Frame3:[747, 393, 778, 436];Frame4:[765, 376, 808, 465].') 2023-11-03 14:22:26.904 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:22:26.904 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'bicycle:[662, 423, 797, 776],[097, 258, 160, 358],[259, 368, 560, 808],[278, 262, 336, 385],[424, 257, 460, 337],[457, 275, 518, 383],[522, 258, 572, 360],[796, 264, 825, 347];bicycle wheel:[275, 547, 412, 817],[098, 284, 125, 361],[278, 308, 303, 388],[309, 306, 335, 381],[456, 313, 484, 379],[492, 308, 518, 376],[499, 538, 559, 780],[668, 611, 695, 800],[732, 621, 765, 751];person:[672, 305, 786, 750],[278, 166, 333, 355],[302, 350, 411, 788],[448, 196, 477, 333],[456, 196, 513, 334],[526, 206, 571, 343],[655, 226, 681, 323],[789, 208, 830, 350];wheel:[275, 554, 405, 811],[100, 288, 122, 356],[134, 287, 156, 350],[277, 305, 301, 386],[311, 307, 331, 382],[458, 311, 482, 384],[494, 307, 515, 380],[499, 543, 560, 779],[525, 292, 545, 356],[671, 618, 695, 801],[735, 589, 765, 746].') 2023-11-03 14:22:50.907 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:22:50.908 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'With its Wi-Fi capability, rechargeable battery, and bigger, higher-resolution screen, [339, 147, 872, 934] the LeapFrog LeapPad Ultra is a great toy tablet for playing and learning. - LeapFrog LeapPad Ultra') 2023-11-03 14:22:51.827 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:22:51.828 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Person:[032, 367, 548, 1000],[312, 093, 473, 441],[340, 140, 804, 999],[852, 926, 979, 1000],[524, 002, 999, 999];Necklace:[198, 832, 405, 1000];Donut:[124, 128, 211, 324],[433, 518, 518, 657],[480, 420, 599, 625],[645, 466, 770, 589],[797, 720, 907, 922].') 2023-11-03 14:23:07.523 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:23:07.523 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The new SOL applies to: The new SOL applies to all new applications [001, 136, 996, 968] lodged on or after the date of implementation, that is from 1 July 2010.') 2023-11-03 14:23:12.918 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:23:12.918 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Guitar:[153, 117, 572, 514];Satchel:[596, 455, 656, 494],[791, 702, 955, 839];Sandals:[661, 570, 696, 596],[626, 573, 662, 592];Sneakers:[501, 816, 612, 865],[336, 811, 417, 848];Hat:[212, 040, 416, 113];Desk:[706, 458, 840, 572];Bench:[348, 775, 837, 960];Hanger:[001, 288, 036, 339];Plate:[554, 932, 815, 1000];Person:[124, 042, 621, 865],[834, 338, 934, 520],[611, 336, 708, 595],[589, 355, 630, 585].') 2023-11-03 14:23:37.707 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:23:37.708 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Shooting clays [650, 397, 786, 448] can help take your mind off your golf game, if just for a second') 2023-11-03 14:23:52.060 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:23:52.061 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect human body and human leg in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'human body:[029, 060, 086, 192],[150, 000, 243, 361],[156, 294, 428, 725],[227, 000, 261, 126],[260, 252, 476, 565],[261, 000, 275, 111],[321, 000, 399, 240],[389, 361, 896, 996],[403, 113, 550, 351],[415, 203, 745, 717],[526, 096, 606, 290],[578, 421, 997, 996],[856, 000, 916, 165];human leg:[135, 768, 327, 953],[170, 114, 210, 345],[171, 121, 245, 299],[193, 722, 389, 912],[265, 508, 426, 707],[275, 438, 399, 520],[281, 577, 420, 727],[286, 460, 461, 568],[335, 048, 358, 237],[353, 045, 389, 251],[386, 668, 693, 999],[399, 231, 500, 328],[416, 258, 543, 345],[528, 187, 594, 284],[575, 685, 901, 997],[649, 789, 963, 999],[864, 031, 888, 156],[886, 030, 913, 166].') 2023-11-03 14:23:58.002 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 14:23:58.002 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Swan:[407, 744, 631, 963];Boat:[451, 467, 528, 588],[390, 118, 505, 215];Person:[433, 227, 530, 467],[431, 201, 550, 374],[393, 055, 450, 161],[340, 058, 414, 135].') 2023-11-03 14:24:07.218 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:24:07.218 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[568, 227, 727, 856] Woman hula hooping for a crowd of [227, 609, 341, 886] [788, 453, 843, 625] [944, 487, 998, 641] [869, 502, 930, 640] [192, 364, 253, 565] [402, 802, 559, 998] [355, 547, 429, 712] [276, 538, 345, 722] [450, 538, 526, 710] children in Lebanon') 2023-11-03 14:24:24.773 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:24:24.774 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Person:[072, 756, 168, 1000];Cabinet:[289, 691, 530, 897];Lamp:[667, 863, 834, 1000];Chair:[264, 920, 436, 999].') 2023-11-03 14:24:33.320 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:24:33.321 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'wheel:[303, 623, 422, 918],[083, 482, 138, 673];car:[081, 223, 955, 942],[000, 223, 138, 520].') 2023-11-03 14:25:04.245 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:25:04.246 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[316, 005, 927, 922] A wizard conjuring the spirits of the computer with his spells.') 2023-11-03 14:25:31.298 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:25:31.298 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect person,jeans,jacket and plant in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'person:[000, 000, 712, 999],[765, 194, 999, 997];jeans:[115, 769, 534, 999];jacket:[000, 000, 690, 827],[695, 188, 999, 925];plant:[000, 540, 155, 999],[499, 823, 999, 999],[595, 201, 980, 764].') 2023-11-03 14:25:46.103 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:25:46.104 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[623, 430, 681, 513],carFrame1:[585, 438, 644, 539],carFrame1:[000, 372, 535, 832],carFrame1:[867, 428, 999, 677],carFrame1:[856, 387, 985, 568] in this video clip.Keep your response consistent with the classFrame t:[xmin,ymin,xmax,ymax] format.', 'carFrame1:[623, 430, 681, 513];Frame2:[611, 429, 670, 519],carFrame1:[585, 438, 644, 539];Frame2:[573, 441, 631, 539],carFrame1:[000, 372, 535, 832];Frame2:[000, 337, 390, 866],carFrame1:[867, 428, 999, 677];Frame2:[862, 428, 999, 677],carFrame1:[856, 387, 985, 568];Frame2:[842, 391, 972, 577].') 2023-11-03 14:25:47.871 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:25:47.871 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[521, 354, 541, 399] and track its trajectory.Use the specified Frame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'This is a person headFrame1:[521, 354, 541, 399];Frame2:[507, 332, 528, 379];Frame3:[496, 343, 522, 392];Frame4:[490, 328, 516, 377].') 2023-11-03 14:26:03.823 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:26:03.823 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'man:[166, 056, 738, 1000];clothing:[000, 649, 106, 997],[168, 356, 678, 997],[438, 427, 998, 997];human face:[000, 462, 011, 643],[418, 178, 575, 513].') 2023-11-03 14:26:30.940 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:26:30.940 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[580, 282, 826, 595] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a deerFrame1:[580, 282, 826, 595];Frame2:[537, 395, 784, 699];Frame3:[468, 333, 685, 597];Frame4:[448, 309, 696, 565];Frame5:[523, 366, 730, 606].') 2023-11-03 14:26:51.719 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:26:51.719 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'tower:[851, 309, 1000, 830],[075, 362, 248, 772],[244, 354, 334, 751],[327, 437, 396, 799],[403, 513, 458, 770],[556, 606, 604, 772],[610, 509, 653, 608],[650, 434, 708, 742],[720, 341, 848, 731];skyscraper:[643, 330, 879, 803],[848, 373, 999, 792];street light:[042, 665, 056, 790],[093, 696, 103, 781],[114, 685, 123, 793],[166, 683, 182, 799],[192, 698, 200, 787],[333, 288, 368, 842],[383, 650, 401, 755],[392, 538, 406, 806],[457, 677, 473, 775],[466, 700, 479, 770],[658, 649, 671, 761].') 2023-11-03 14:27:02.809 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:27:02.810 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Satchel:[015, 784, 088, 878];Tent:[889, 421, 986, 472];Person:[347, 487, 406, 685],[388, 576, 436, 667],[477, 576, 548, 695];Barrel:[784, 552, 807, 585].') 2023-11-03 14:27:07.109 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:27:07.110 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Image 3 of Ganni Julliard Mohair Sweater in Fiery Red') 2023-11-03 14:27:33.389 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:27:33.390 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[232, 422, 655, 698] and track its trajectory.When detailing trajectories in your response, adhere to the Frame t:[xmin,ymin,xmax,ymax] format.', 'This is a yellow lion playing with three little lions on the grassFrame1:[232, 422, 655, 698];Frame2:[232, 422, 655, 698];Frame3:[232, 400, 647, 681];Frame4:[233, 377, 657, 670];Frame5:[208, 372, 649, 670].') 2023-11-03 14:27:38.963 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:27:38.965 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Helmet:[206, 526, 260, 615];Person:[014, 503, 111, 881],[057, 541, 131, 851],[116, 542, 184, 830],[157, 527, 268, 1000],[245, 507, 317, 853],[218, 327, 266, 418],[073, 330, 118, 461],[047, 249, 094, 361],[030, 140, 077, 269],[038, 067, 085, 175],[100, 071, 144, 187],[345, 394, 399, 509],[568, 001, 610, 094],[759, 001, 801, 082],[880, 001, 929, 122],[932, 126, 980, 254],[939, 213, 989, 327],[936, 284, 982, 371],[951, 366, 994, 534],[726, 413, 796, 483].') 2023-11-03 14:27:48.228 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:27:48.229 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[001, 352, 123, 813] [572, 416, 713, 995] [675, 254, 801, 659] [142, 602, 490, 996] [669, 489, 892, 996] Trump supporters protest for an Arizona recount after the 2020 election.') 2023-11-03 14:28:10.450 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:28:10.450 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[573, 298, 717, 344] and track its trajectory.If you're including trajectory details in your reply, the Frame t:[xmin,ymin,xmax,ymax] format is imperative.", 'This is a stealth bomberFrame1:[573, 298, 717, 344];Frame2:[614, 288, 746, 333];Frame3:[656, 275, 780, 316];Frame4:[696, 268, 812, 309];Frame5:[722, 259, 839, 300].') 2023-11-03 14:28:17.095 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:28:17.096 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[142, 408, 193, 559] and track its trajectory.If you're including trajectory details in your reply, the Frame t:[xmin,ymin,xmax,ymax] format is imperative.", 'This is a personFrame1:[142, 408, 193, 559];Frame2:[070, 400, 127, 547];Frame3:[051, 395, 119, 540];Frame4:[153, 423, 200, 559].') 2023-11-03 14:28:41.234 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 14:28:41.234 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'man:[683, 000, 951, 796];footwear:[798, 675, 895, 769],[693, 761, 843, 844];jeans:[699, 118, 930, 785];human leg:[699, 150, 845, 844],[762, 185, 908, 748];dog:[136, 160, 221, 312],[277, 349, 446, 933],[308, 360, 725, 655].') 2023-11-03 14:28:52.707 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:28:52.708 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[442, 004, 639, 430] [120, 028, 392, 553] [001, 299, 296, 996] [616, 199, 940, 658] [154, 433, 589, 998] [542, 063, 804, 465] [613, 729, 954, 996] [253, 013, 458, 407] [557, 405, 894, 921] kids in a classroom holding [730, 360, 791, 439] [367, 201, 438, 279] [161, 561, 252, 689] [265, 268, 321, 354] [595, 213, 652, 302] hands, sitting in [005, 007, 966, 996] a circle') 2023-11-03 14:29:07.018 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:29:07.018 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'person:[065, 080, 414, 833],[345, 382, 611, 992],[628, 504, 851, 999],[666, 433, 836, 784],[852, 378, 999, 958],[921, 643, 999, 999];cello:[509, 426, 599, 637],[728, 452, 830, 672],[753, 491, 827, 762],[938, 652, 999, 817],[966, 441, 999, 594];violin:[449, 443, 603, 882],[658, 446, 849, 758],[698, 511, 838, 927],[927, 635, 999, 889],[947, 429, 999, 601].') 2023-11-03 14:29:22.351 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 14:29:22.351 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'plant:[015, 206, 275, 975],[000, 000, 998, 998].') 2023-11-03 14:29:51.219 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:29:51.219 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'crowd of people [273, 536, 361, 973] [361, 532, 445, 976] [477, 531, 560, 975] [720, 552, 820, 975] walking during the traffic light [634, 153, 694, 379] [834, 093, 896, 366] with gwanghwamun plaz in seoul, south korea - south stock videos & royalty-free footage') 2023-11-03 14:30:02.572 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 14:30:02.572 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Person:[522, 647, 775, 999].') 2023-11-03 14:30:08.427 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:30:08.427 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'person:[591, 706, 646, 878],[025, 650, 084, 774],[038, 743, 106, 836],[043, 803, 118, 934],[057, 929, 104, 999],[076, 526, 135, 632],[086, 586, 143, 664],[106, 792, 173, 927],[108, 898, 170, 999],[134, 532, 184, 628],[154, 836, 221, 932],[155, 703, 220, 777],[189, 572, 248, 643],[199, 653, 249, 721],[218, 758, 290, 865],[240, 513, 301, 655],[300, 620, 358, 725],[311, 687, 362, 794],[360, 732, 585, 999],[436, 608, 476, 732],[475, 590, 603, 727],[628, 521, 663, 588],[628, 546, 685, 612],[636, 674, 699, 781],[655, 741, 698, 808],[660, 605, 705, 683],[672, 402, 737, 462],[693, 574, 749, 670],[695, 681, 742, 758],[776, 648, 836, 727],[798, 538, 846, 641],[842, 526, 875, 615],[859, 582, 911, 662],[877, 526, 938, 650],[921, 380, 970, 448];clothing:[028, 560, 291, 999],[238, 512, 292, 631],[291, 574, 583, 926],[359, 876, 579, 999],[547, 549, 999, 999],[671, 428, 736, 461].') 2023-11-03 14:30:31.439 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:30:31.440 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'human face:[300, 361, 388, 491],[046, 400, 105, 497],[523, 202, 608, 366],[839, 428, 905, 538];woman:[201, 324, 447, 999];girl:[200, 322, 526, 999];clothing:[000, 500, 160, 818],[260, 505, 444, 999],[451, 351, 731, 999],[768, 545, 978, 999].') 2023-11-03 14:31:02.548 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:31:02.548 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[385, 561, 456, 636] and track its trajectory.Ensure the trajectories in your answer follow the Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a yellow airplane flying in the airFrame1:[385, 561, 456, 636];Frame2:[389, 491, 470, 572];Frame3:[208, 491, 291, 572];Frame4:[231, 658, 318, 747];Frame5:[250, 638, 341, 733].') 2023-11-03 14:31:14.348 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:31:14.348 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[645, 341, 727, 662],personFrame1:[163, 311, 276, 654],personFrame1:[316, 322, 442, 647],personFrame1:[410, 422, 491, 679],personFrame1:[233, 318, 324, 663] in this video clip.Keep your response consistent with the classFrame t:[xmin,ymin,xmax,ymax] format.', 'personFrame1:[645, 341, 727, 662];Frame2:[654, 326, 736, 662],personFrame1:[163, 311, 276, 654];Frame2:[151, 306, 238, 655],personFrame1:[316, 322, 442, 647];Frame2:[300, 310, 396, 656],personFrame1:[410, 422, 491, 679];Frame2:[396, 343, 482, 688],personFrame1:[233, 318, 324, 663];Frame2:[233, 301, 303, 669].') 2023-11-03 14:32:41.492 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:32:41.493 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "[062, 037, 920, 996] Des Linden of the crosses the finish to win the women's division of the 122nd Boston Marathon in Boston, Mass., on April 16, 2018.") 2023-11-03 14:33:26.626 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:33:26.627 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'boy:[038, 336, 522, 966];woman:[747, 184, 952, 511],[756, 134, 918, 241],[880, 097, 999, 348];man:[353, 045, 890, 999];sports equipment:[211, 961, 309, 995],[351, 865, 472, 988],[400, 865, 616, 999];footwear:[198, 959, 308, 999],[401, 869, 466, 983],[405, 873, 608, 999];human face:[236, 402, 364, 522],[450, 118, 592, 249].') 2023-11-03 14:33:31.755 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:33:31.755 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[224, 022, 378, 793] and track its trajectory.For the trajectories included in the answer, please use the format Frame t:[xmin,ymin,xmax,ymax].', 'This is a personFrame1:[224, 022, 378, 793];Frame2:[182, 002, 325, 747];Frame3:[140, 006, 300, 754];Frame4:[153, 008, 336, 800];Frame5:[182, 004, 345, 822].') 2023-11-03 14:33:38.213 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:33:38.214 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Pickup Truck:[001, 858, 145, 988];Car:[189, 869, 366, 995],[296, 899, 463, 999],[418, 919, 552, 999],[554, 929, 686, 999];Boat:[793, 864, 998, 948];Machinery Vehicle:[078, 724, 254, 817].') 2023-11-03 14:34:25.188 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:34:25.189 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Couch,Pillow and Necklace in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Couch:[026, 085, 1000, 631];Pillow:[180, 160, 515, 365];Necklace:[534, 188, 710, 399].') 2023-11-03 14:34:27.048 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:34:27.049 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[450, 504, 506, 682],personFrame1:[829, 543, 919, 744],personFrame1:[421, 515, 465, 669],personFrame1:[145, 567, 234, 781],personFrame1:[620, 497, 657, 675],personFrame1:[834, 550, 915, 765],personFrame1:[476, 520, 518, 701],personFrame1:[733, 493, 775, 662],personFrame1:[554, 555, 664, 744],personFrame1:[742, 504, 811, 674] in this video clip.Your reply should be in alignment with the classFrame t:[xmin,ymin,xmax,ymax] structure.', 'personFrame1:[450, 504, 506, 682];Frame2:[433, 518, 490, 681],personFrame1:[829, 543, 919, 744];Frame2:[818, 532, 878, 739],personFrame1:[421, 515, 465, 669];Frame2:[414, 512, 463, 669],personFrame1:[145, 567, 234, 781];Frame2:[145, 570, 232, 783],personFrame1:[620, 497, 657, 675];Frame2:[618, 500, 665, 667],personFrame1:[834, 550, 915, 765];Frame2:[854, 549, 915, 762],personFrame1:[476, 520, 518, 701];Frame2:[450, 526, 518, 701],personFrame1:[733, 493, 775, 662];Frame2:[726, 492, 773, 661],personFrame1:[554, 555, 664, 744];Frame2:[568, 537, 623, 747],personFrame1:[742, 504, 811, 674];Frame2:[759, 498, 808, 672].') 2023-11-03 14:35:16.434 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:35:16.434 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the bike moves forward in this video clip?Your trajectory details should strictly align with the TrackiFrame t:[xmin,ymin,xmax,ymax] format in your submission.', 'There are 2 bike moves forward.Track1frame:1:[612, 251, 660, 401];frame:2:[612, 251, 660, 401];frame:3:[658, 326, 766, 487],Track2frame:1:[591, 306, 609, 355];frame:2:[591, 306, 609, 355];frame:3:[643, 406, 659, 472].') 2023-11-03 14:35:18.674 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:35:18.674 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[002, 208, 739, 999],[002, 002, 999, 1000],[821, 246, 998, 574];Glasses:[061, 371, 315, 474].') 2023-11-03 14:35:46.052 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:35:46.053 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'mushroom:[013, 613, 397, 1000],[216, 067, 971, 710];plant:[000, 000, 408, 357],[806, 000, 998, 588].') 2023-11-03 14:36:24.681 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:36:24.681 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[001, 004, 913, 997] An election campaign billboard in Moscow for the 2018 Russian presidential election') 2023-11-03 14:36:35.608 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:36:35.608 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Person:[002, 445, 583, 927],[336, 070, 492, 701],[470, 115, 746, 875],[646, 014, 963, 738],[752, 257, 869, 688],[315, 131, 389, 368].') 2023-11-03 14:37:03.560 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:37:03.560 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Chair:[216, 668, 474, 918];Hat:[474, 481, 578, 543];Satchel:[512, 838, 645, 917];Luggage:[902, 731, 999, 896];Person:[114, 155, 924, 754],[231, 480, 914, 999],[638, 539, 885, 910].') 2023-11-03 14:37:11.095 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:37:11.096 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'woman:[487, 113, 916, 948],[000, 018, 276, 707];man:[221, 092, 726, 991],[000, 657, 272, 999],[076, 000, 321, 582];clothing:[000, 679, 260, 999],[000, 313, 267, 748],[074, 230, 328, 584],[263, 351, 995, 999];human face:[000, 107, 100, 320],[141, 000, 253, 236],[360, 161, 495, 432],[660, 165, 771, 390].') 2023-11-03 14:37:27.391 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:37:27.391 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[742, 486, 784, 661],personFrame1:[439, 461, 491, 622],personFrame1:[386, 432, 429, 584],personFrame1:[552, 452, 600, 618],personFrame1:[837, 537, 889, 711],personFrame1:[567, 541, 610, 725],personFrame1:[715, 532, 767, 719],personFrame1:[601, 435, 654, 565],personFrame1:[824, 608, 880, 799],personFrame1:[315, 605, 361, 787] in this video clip.When answering, maintain the format classFrame t:[xmin,ymin,xmax,ymax] without deviation.', 'personFrame1:[742, 486, 784, 661];Frame2:[747, 492, 801, 660],personFrame1:[439, 461, 491, 622];Frame2:[443, 457, 503, 614],personFrame1:[386, 432, 429, 584];Frame2:[398, 448, 441, 583],personFrame1:[552, 452, 600, 618];Frame2:[569, 460, 620, 616],personFrame1:[837, 537, 889, 711];Frame2:[835, 545, 915, 724],personFrame1:[567, 541, 610, 725];Frame2:[573, 554, 641, 725],personFrame1:[715, 532, 767, 719];Frame2:[719, 532, 778, 721],personFrame1:[601, 435, 654, 565];Frame2:[615, 440, 655, 577],personFrame1:[824, 608, 880, 799];Frame2:[819, 612, 887, 793],personFrame1:[315, 605, 361, 787];Frame2:[328, 618, 384, 788].') 2023-11-03 14:37:49.111 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:37:49.112 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "[109, 236, 982, 988] This Homemade Frosted Lemonade is a Chik-fil-a copycat recipe. It's sweet, creamy, lemon-y, refreshing and tasty!") 2023-11-03 14:38:26.235 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:38:26.235 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[452, 331, 555, 541] and track its trajectory.Incorporate the Frame t:[xmin,ymin,xmax,ymax] format when presenting trajectories in your response.', 'This is a scooterFrame1:[452, 331, 555, 541];Frame2:[502, 283, 691, 577];Frame3:[367, 208, 725, 626];Frame4:[146, 226, 608, 766].') 2023-11-03 14:38:29.773 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:38:29.774 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Boat:[009, 732, 171, 938],[151, 320, 817, 963],[167, 796, 818, 999],[773, 375, 999, 792],[903, 232, 999, 485];Car:[000, 241, 110, 382],[518, 254, 670, 388];SUV:[016, 195, 149, 310],[146, 220, 259, 307],[764, 171, 854, 231],[838, 165, 895, 219],[885, 166, 920, 210],[899, 161, 945, 216];Van:[958, 155, 1000, 215];Lifesaver:[490, 438, 570, 574].') 2023-11-03 14:38:50.140 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:38:50.140 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Desk:[089, 545, 567, 866];Chair:[138, 440, 472, 723],[000, 428, 148, 603],[000, 505, 152, 847],[644, 855, 998, 999];Lamp:[251, 303, 314, 350],[060, 329, 108, 370].') 2023-11-03 14:39:27.612 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:39:27.612 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'clothing:[766, 431, 962, 903],[000, 338, 083, 573],[021, 544, 095, 806],[062, 650, 194, 817],[108, 320, 176, 457],[135, 413, 380, 717],[294, 406, 420, 617],[401, 592, 567, 777],[437, 366, 616, 692],[546, 349, 696, 681],[648, 287, 716, 467],[730, 419, 846, 734],[772, 270, 840, 370],[780, 364, 826, 443];chair:[558, 715, 743, 999];woman:[088, 272, 199, 462],[138, 256, 203, 363],[290, 351, 428, 668],[351, 468, 567, 768];table:[000, 705, 603, 999],[063, 345, 119, 421],[380, 336, 584, 472],[831, 327, 998, 393];human face:[145, 542, 177, 671],[175, 522, 213, 615],[188, 349, 232, 430],[327, 362, 363, 428],[493, 324, 533, 390],[614, 326, 647, 399],[728, 406, 769, 485],[818, 441, 865, 538].') 2023-11-03 14:39:30.789 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:39:30.790 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, can you tell me what thisframe:1:[339, 280, 998, 604];frame:2:[363, 223, 997, 653];frame:3:[378, 257, 998, 659] is?With reference to the supplied image frames and trajectory coordinates, please formulate a concise response.', 'This is a/an turtle almost does not change position') 2023-11-03 14:40:04.877 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:40:04.877 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Health workers [740, 096, 998, 993] [001, 098, 532, 995] inject people [446, 331, 696, 996] with a meningitis vaccine at a field post February 14 2007 in Aura Uganda MSF conducted a massive vaccination campaign...') 2023-11-03 14:40:07.393 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:40:07.393 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'girl:[247, 043, 955, 999];tree:[000, 000, 999, 584];clothing:[397, 436, 951, 999].') 2023-11-03 14:40:18.599 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:40:18.600 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Poor man [516, 266, 649, 546] cleans canal [534, 005, 741, 996] in exchange for money to buy food for his family') 2023-11-03 14:40:32.149 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:40:32.150 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Glasses:[138, 316, 248, 382];Person:[002, 209, 367, 1000].') 2023-11-03 14:40:42.076 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:40:42.076 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[113, 001, 991, 459] Parmesan yellow squash rounds make the perfect kid-friendly snack or side dish. Serve with [099, 671, 450, 905] yogurt or marinara sauce for dipping. You can also use zucchini squash in place of yellow squash.') 2023-11-03 14:40:44.688 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:40:44.688 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'wheel:[836, 733, 984, 986],[000, 664, 097, 925],[190, 639, 250, 871],[730, 664, 748, 871],[748, 643, 788, 938];bicycle:[000, 482, 246, 918],[703, 464, 878, 934],[820, 522, 999, 999];bicycle wheel:[000, 672, 090, 923],[189, 640, 248, 875],[722, 674, 750, 877],[748, 633, 787, 940],[818, 732, 999, 999];person:[000, 239, 217, 819],[130, 212, 260, 354],[215, 169, 303, 323],[283, 163, 385, 329],[411, 225, 504, 309],[459, 209, 550, 303],[670, 251, 876, 865],[705, 199, 793, 272],[915, 414, 999, 603];bicycle helmet:[067, 241, 144, 358],[755, 258, 836, 327];footwear:[102, 735, 181, 790],[701, 650, 743, 719];tire:[000, 668, 093, 921],[190, 633, 251, 873],[728, 608, 760, 874],[741, 630, 795, 943],[818, 732, 999, 999].') 2023-11-03 14:41:42.847 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:41:42.847 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'In fact, a T-rex [603, 213, 766, 925] would have been a quick snack for megalodon [001, 027, 564, 769]') 2023-11-03 14:41:50.527 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:41:50.527 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[388, 146, 613, 879];human hair:[034, 423, 164, 685],[276, 000, 378, 105],[410, 093, 509, 465],[494, 203, 567, 286];human head:[391, 084, 508, 288],[063, 445, 162, 614],[267, 002, 374, 209],[492, 200, 570, 357];girl:[376, 142, 649, 897],[019, 443, 169, 828];human arm:[430, 329, 508, 452],[013, 598, 112, 756],[016, 207, 180, 658],[363, 298, 528, 746],[419, 281, 626, 478],[530, 348, 604, 421];human hand:[455, 601, 528, 767],[013, 566, 077, 682],[061, 604, 115, 692],[422, 363, 466, 450],[478, 396, 525, 467];human body:[013, 604, 196, 828],[014, 142, 422, 828],[431, 292, 609, 814];human leg:[144, 576, 293, 849],[278, 556, 411, 837],[379, 582, 555, 888],[458, 497, 525, 710],[520, 475, 587, 796],[559, 592, 652, 857];mammal:[014, 452, 185, 836],[018, 001, 375, 841],[380, 092, 650, 875];man:[017, 000, 402, 847];clothing:[054, 590, 192, 811],[065, 144, 371, 846],[376, 291, 648, 872];human face:[085, 490, 152, 616],[279, 040, 369, 221],[424, 115, 496, 288],[487, 225, 551, 344].') 2023-11-03 14:42:03.823 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:42:03.823 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Bench:[303, 537, 480, 700];Bottle:[684, 626, 705, 694];Satchel:[755, 668, 777, 698],[696, 644, 728, 698];Sneakers:[820, 749, 844, 798],[805, 730, 826, 777],[559, 731, 581, 770],[281, 687, 296, 740],[095, 707, 114, 745];Person:[782, 318, 878, 798],[502, 383, 594, 778],[370, 345, 447, 747],[231, 384, 314, 750],[080, 429, 148, 755].') 2023-11-03 14:42:13.143 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:42:13.144 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'flower:[375, 481, 499, 739],[017, 764, 208, 999],[538, 415, 625, 595];man:[130, 350, 179, 463],[212, 306, 268, 415],[302, 305, 345, 419],[363, 287, 395, 372],[429, 577, 451, 647],[455, 275, 488, 360],[778, 080, 908, 721];footwear:[776, 675, 850, 724].') 2023-11-03 14:42:18.469 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:42:18.470 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Nursing theorists and their work [324, 031, 666, 335] elsevier ebook on intel') 2023-11-03 14:42:21.521 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:42:21.521 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'car:[916, 651, 993, 769];bus:[461, 542, 729, 705];building:[000, 102, 307, 346],[000, 333, 845, 705],[267, 204, 359, 348],[337, 140, 636, 350],[556, 253, 998, 567],[636, 135, 999, 560].') 2023-11-03 14:42:55.483 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:42:55.483 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect computer keyboard,human face,man,sunglasses and woman in this image.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'human face:[003, 361, 096, 519],[098, 399, 117, 464],[214, 314, 303, 465],[402, 282, 477, 409],[552, 326, 628, 440],[601, 454, 681, 564],[668, 383, 732, 543],[722, 248, 901, 528],[946, 284, 999, 410];man:[337, 204, 572, 978],[000, 340, 135, 999],[015, 280, 382, 999],[474, 294, 558, 418],[534, 266, 645, 581];sunglasses:[211, 347, 311, 390];woman:[078, 359, 133, 492],[530, 389, 676, 999],[536, 336, 747, 999],[918, 253, 998, 551].') 2023-11-03 14:43:02.859 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:43:02.859 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'man:[028, 675, 125, 983],[000, 684, 027, 985],[921, 744, 978, 998];woman:[170, 731, 280, 998];building:[000, 000, 998, 998];jeans:[061, 841, 115, 998],[931, 871, 973, 998].') 2023-11-03 14:43:09.963 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:43:09.963 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[182, 022, 817, 978] A silver stainless steel Dacor professional style 30 inch gas range.') 2023-11-03 14:43:36.021 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:43:36.022 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Flower:[261, 453, 341, 489];Lamp:[288, 405, 394, 501],[170, 001, 487, 206];Desk:[270, 504, 443, 530];Bed:[002, 272, 669, 908];Bench:[168, 638, 707, 927];Carpet:[242, 720, 999, 999];Radiator:[643, 524, 1000, 687];Chair:[785, 482, 1000, 720];Cup:[858, 692, 893, 731].') 2023-11-03 14:43:46.229 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:43:46.230 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the the three bears on the move.?When you incorporate trajectories into your answer, adhere to the TrackiFrame t:[xmin,ymin,xmax,ymax] format.,Present the trajectories in your response using the designated TrackiFrame t:[xmin,ymin,xmax,ymax] layout.', 'There are 3 the three bears on the move..Track1frame:1:[502, 361, 655, 506];frame:2:[502, 361, 655, 506];frame:3:[569, 385, 711, 545],Track2frame:1:[316, 336, 583, 670];frame:2:[316, 336, 583, 670];frame:3:[336, 375, 622, 670],Track3frame:1:[244, 447, 361, 562];frame:2:[244, 447, 361, 562];frame:3:[294, 489, 413, 600].') 2023-11-03 14:44:18.418 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:44:18.418 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the spotted fish swimming around.Stick to the TrackiFrame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'There is one spotted fish swimming around.Track1frame:1:[246, 123, 388, 365];frame:2:[246, 123, 388, 365];frame:3:[363, 141, 465, 577].') 2023-11-03 14:44:22.632 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:44:22.632 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '• The SWOT Analysis framework [075, 054, 919, 898] is a very important and useful tool to use in marketing Management and other business applic...') 2023-11-03 14:44:53.556 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:44:53.556 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[200, 169, 481, 1000];clothing:[016, 604, 213, 988],[213, 534, 517, 998],[415, 318, 830, 998];man:[417, 039, 853, 998];human face:[065, 580, 142, 713],[122, 305, 150, 374],[286, 254, 418, 583],[423, 105, 573, 438].') 2023-11-03 14:45:00.184 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 14:45:00.184 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Carpet,Hat and Glasses in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Hat:[472, 233, 610, 334];Glasses:[513, 257, 605, 303].') 2023-11-03 14:45:02.034 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:45:02.034 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'human hair:[536, 273, 623, 391],[131, 385, 251, 551],[256, 309, 334, 370],[330, 279, 392, 391];human head:[141, 395, 246, 555],[250, 311, 335, 450],[326, 287, 392, 389],[368, 240, 411, 306],[504, 168, 538, 237],[536, 279, 605, 382],[667, 284, 743, 420];mammal:[528, 281, 673, 550],[000, 255, 049, 350],[080, 390, 313, 994],[082, 303, 323, 718],[231, 177, 437, 310],[311, 286, 396, 584],[475, 170, 550, 306],[547, 240, 610, 305],[592, 198, 628, 277],[616, 275, 870, 999],[783, 160, 913, 306];man:[624, 278, 870, 788],[078, 315, 335, 708];human arm:[130, 567, 293, 913],[000, 270, 047, 358],[228, 194, 910, 306],[264, 680, 310, 781],[271, 458, 306, 663],[530, 370, 661, 540],[611, 466, 644, 625],[695, 402, 870, 723];human hand:[685, 663, 801, 735],[099, 579, 126, 645];human body:[000, 256, 051, 353],[085, 313, 340, 670],[229, 179, 673, 320],[299, 285, 393, 523],[523, 282, 673, 585],[615, 285, 867, 809],[782, 161, 867, 303];food:[482, 688, 574, 757];woman:[083, 382, 335, 996],[225, 182, 271, 260],[251, 210, 300, 301],[286, 292, 396, 568],[386, 196, 426, 260],[475, 166, 557, 306],[517, 271, 680, 602];table:[308, 495, 821, 999];clothing:[085, 513, 320, 955],[246, 379, 300, 602],[293, 387, 381, 539],[476, 215, 556, 315],[528, 355, 666, 602],[616, 416, 880, 821],[796, 224, 875, 308];human face:[171, 419, 250, 569],[263, 338, 331, 441],[341, 305, 386, 382],[543, 298, 599, 380],[674, 315, 733, 410].') 2023-11-03 14:45:40.522 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:45:40.522 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me what is thisFrame1:[235, 474, 895, 997] and track its trajectory.Your response should highlight trajectories using the established Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a pine snakeFrame1:[235, 474, 895, 997];Frame2:[242, 490, 889, 997];Frame3:[210, 480, 886, 998].') 2023-11-03 14:45:42.835 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:45:42.835 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Attractive caucasian woman portrait [537, 319, 927, 757] sitting on the sofa. Girl [067, 262, 414, 995] wearing sleepwear next to the fireplace in the living room with a guitar. Cozy, comfortable, winter, domestic life concept [009, 010, 994, 988].') 2023-11-03 14:46:09.994 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 14:46:09.994 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Vase,Potted Plant and Person in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Vase:[326, 713, 400, 792],[085, 225, 109, 267],[126, 254, 153, 286];Potted Plant:[657, 687, 749, 890],[737, 732, 825, 920],[656, 453, 679, 507],[169, 240, 199, 290],[003, 258, 034, 388],[352, 398, 530, 908],[771, 019, 803, 131];Person:[646, 556, 668, 644].') 2023-11-03 14:46:44.320 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:46:44.320 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "boromir [129, 008, 994, 997] The UK decides to abandon the controversial ‘meme ban'") 2023-11-03 14:47:18.525 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:47:18.525 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'shelf:[001, 003, 499, 995],[345, 011, 602, 491],[561, 001, 882, 577];Potted Plant:[827, 077, 958, 467];Person:[656, 405, 821, 971];Sandals:[755, 872, 812, 940],[707, 916, 775, 967].') 2023-11-03 14:47:32.554 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:47:32.555 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A b2c rebrand [275, 172, 930, 931] for Canby Telcom included a billboard and newspaper ad [302, 226, 551, 840] to introduce the new look!') 2023-11-03 14:48:00.852 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:48:00.852 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[639, 640, 817, 850] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a hippopotamusFrame1:[639, 640, 817, 850];Frame2:[720, 636, 895, 858];Frame3:[717, 634, 898, 856];Frame4:[715, 634, 898, 856];Frame5:[715, 634, 894, 856].') 2023-11-03 14:48:08.751 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 14:48:08.752 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'council bluffs, iowa, usa: former united states vice president joe biden [060, 146, 366, 995] campaigns for the democratic nomination for the 2020 united states... - democracy stock videos & royalty-free footage') 2023-11-03 14:48:22.950 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:48:22.951 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the white duck walk around?When detailing trajectories in your response, adhere to the TrackiFrame t:[xmin,ymin,xmax,ymax] format.', 'There is one white duck walk around.Track1frame:1:[472, 301, 802, 438];frame:2:[494, 133, 708, 436];frame:3:[446, 132, 728, 565].') 2023-11-03 14:48:23.829 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:48:23.829 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'human body:[563, 215, 898, 883],[026, 193, 433, 901];woman:[031, 171, 331, 845];human head:[655, 210, 790, 408],[109, 179, 285, 390];drink:[210, 792, 265, 952],[675, 742, 715, 878],[906, 825, 951, 890];human arm:[021, 395, 278, 707],[243, 389, 309, 700],[509, 414, 861, 688],[591, 368, 694, 585].') 2023-11-03 14:48:39.927 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:48:39.927 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'flower:[230, 124, 423, 250].') 2023-11-03 14:49:12.186 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:49:12.186 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Boat and Paddle in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Boat:[454, 496, 621, 532],[000, 001, 998, 492];Paddle:[565, 498, 606, 531].') 2023-11-03 14:49:21.622 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:49:21.622 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[088, 313, 909, 608] Amanda Palmer quotes: Eat the pain. Send it back into the void as love.') 2023-11-03 14:49:32.148 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:49:32.149 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[125, 531, 162, 683],personFrame1:[413, 600, 460, 756],personFrame1:[203, 463, 248, 584],personFrame1:[382, 444, 430, 572],personFrame1:[502, 522, 546, 680],personFrame1:[146, 643, 192, 798],personFrame1:[417, 533, 452, 697] in this video clip.When answering, maintain the format classFrame t:[xmin,ymin,xmax,ymax] without deviation.', 'personFrame1:[125, 531, 162, 683];Frame2:[100, 540, 150, 690],personFrame1:[413, 600, 460, 756];Frame2:[417, 586, 464, 755],personFrame1:[203, 463, 248, 584];Frame2:[200, 462, 245, 580],personFrame1:[382, 444, 430, 572];Frame2:[376, 454, 421, 576],personFrame1:[502, 522, 546, 680];Frame2:[509, 520, 546, 676],personFrame1:[146, 643, 192, 798];Frame2:[109, 659, 179, 802],personFrame1:[417, 533, 452, 697];Frame2:[410, 547, 454, 711].') 2023-11-03 14:50:01.866 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:50:01.866 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Bottle:[539, 102, 700, 874],[320, 102, 486, 869];Wine Glass:[744, 325, 921, 925],[093, 329, 270, 922];Plate:[191, 254, 823, 858].') 2023-11-03 14:50:33.613 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:50:33.613 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'helmet:[476, 403, 630, 617],[031, 580, 098, 677],[096, 498, 196, 650];person:[003, 584, 103, 998],[014, 501, 251, 999],[361, 382, 669, 999];land vehicle:[000, 732, 258, 999],[323, 706, 700, 999];tree:[000, 000, 145, 384],[203, 000, 482, 782],[405, 000, 764, 597],[749, 000, 999, 637];window:[080, 282, 146, 452],[131, 128, 184, 192],[239, 139, 295, 212],[703, 303, 758, 488],[713, 593, 754, 657].') 2023-11-03 14:50:38.630 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 14:50:38.630 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Desk:[419, 295, 572, 469];Chair:[417, 270, 523, 459],[549, 272, 638, 451],[623, 266, 711, 466];Person:[057, 432, 099, 528],[238, 145, 314, 459],[459, 205, 515, 455],[295, 479, 389, 634],[344, 460, 524, 674],[685, 635, 746, 705],[949, 459, 999, 546],[638, 460, 722, 593],[548, 453, 640, 580],[577, 353, 648, 514],[334, 437, 388, 530].') 2023-11-03 14:51:06.311 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:51:06.311 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'woman:[139, 431, 378, 1000],[126, 400, 248, 735],[218, 361, 352, 545],[393, 175, 438, 255],[581, 175, 646, 421],[876, 158, 916, 280],[925, 161, 999, 540];man:[465, 403, 786, 972],[000, 265, 041, 372],[000, 320, 081, 556],[026, 303, 145, 613],[195, 148, 271, 405],[273, 233, 309, 329],[286, 241, 411, 377],[314, 238, 391, 384],[338, 160, 365, 224],[340, 330, 482, 522],[358, 227, 407, 301],[397, 160, 530, 406],[505, 329, 638, 503],[636, 235, 686, 365],[653, 245, 736, 384],[673, 390, 943, 905],[678, 340, 811, 582],[712, 214, 748, 286],[845, 238, 916, 334];table:[277, 496, 800, 921],[525, 264, 588, 329],[738, 293, 904, 412];clothing:[000, 359, 092, 575],[018, 307, 057, 401],[057, 340, 155, 509],[130, 512, 246, 770],[159, 593, 445, 992],[196, 190, 262, 396],[262, 175, 292, 252],[278, 265, 313, 345],[287, 272, 358, 364],[320, 257, 391, 386],[336, 180, 365, 252],[337, 388, 492, 507],[418, 191, 530, 381],[420, 197, 436, 260],[435, 254, 504, 369],[441, 560, 788, 999],[491, 381, 650, 505],[591, 223, 640, 362],[625, 276, 669, 386],[647, 278, 727, 405],[690, 398, 810, 600],[726, 477, 953, 940],[855, 272, 912, 345],[889, 185, 921, 286],[943, 212, 996, 522].') 2023-11-03 14:51:20.114 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:51:20.115 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'New Album - Songs For Rojava') 2023-11-03 14:51:33.857 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:51:33.857 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Person:[187, 123, 464, 760],[522, 123, 760, 912];Gloves:[218, 155, 287, 244].') 2023-11-03 14:51:50.082 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:51:50.083 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'sandal:[443, 884, 543, 981],[329, 734, 408, 770],[336, 756, 387, 821];dress:[201, 136, 428, 448],[321, 147, 673, 554];woman:[193, 005, 456, 833];handbag:[630, 530, 739, 795].') 2023-11-03 14:52:53.452 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:52:53.453 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Storage box:[667, 619, 818, 823];Person:[002, 136, 426, 1000],[087, 435, 396, 1000],[650, 250, 827, 779];Dog:[845, 753, 1000, 1000];Hat:[124, 134, 259, 268];Briefcase:[666, 619, 819, 825].') 2023-11-03 14:53:06.201 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:53:06.202 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Speaker,Flower and Person in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Speaker:[937, 734, 999, 887];Flower:[531, 314, 618, 594],[668, 525, 721, 606],[899, 645, 1000, 728],[002, 476, 248, 968],[339, 568, 375, 690],[333, 243, 372, 327];Person:[543, 392, 731, 970],[456, 374, 545, 874],[394, 381, 481, 956],[335, 362, 430, 1000].') 2023-11-03 14:53:41.299 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:53:41.299 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect man,sports uniform and girl in this image.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'man:[263, 027, 534, 799];sports uniform:[325, 205, 526, 798];girl:[000, 397, 128, 993],[034, 352, 270, 999].') 2023-11-03 14:53:45.747 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:53:45.748 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Stop Sign:[449, 184, 833, 467].') 2023-11-03 14:53:58.627 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:53:58.627 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[105, 267, 594, 894] A red 2021 Lexus IS 350 is shown from behind speeding past [570, 106, 995, 669] bleachers at [002, 575, 996, 998] the racetrack.') 2023-11-03 14:54:26.827 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:54:26.828 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect glasses,human face,tree and girl in this image.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'glasses:[331, 833, 428, 999];human face:[201, 238, 413, 617];tree:[000, 000, 980, 999],[290, 000, 618, 698];girl:[000, 178, 612, 999].') 2023-11-03 14:54:50.903 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:54:50.903 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[404, 568, 505, 795] and track its trajectory.If you're including trajectory details in your reply, the Frame t:[xmin,ymin,xmax,ymax] format is imperative.", 'This is a personFrame1:[404, 568, 505, 795];Frame2:[393, 536, 475, 787];Frame3:[439, 558, 540, 811];Frame4:[450, 580, 578, 833];Frame5:[405, 562, 508, 834].') 2023-11-03 14:54:51.606 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:54:51.607 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'natural burial [410, 711, 831, 996] in the meadow [003, 588, 997, 997] at Carolina Memorial Sanctuary with mountains [001, 248, 994, 449] in the background') 2023-11-03 14:55:05.071 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:55:05.072 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Fifteen women [888, 272, 996, 902] [110, 219, 307, 520] [229, 423, 378, 902] [071, 406, 262, 928] representing all eight Tulane sororities [552, 374, 696, 901] [335, 362, 506, 886] [671, 374, 843, 951] traveled to Nkoka, Malawi to build a school in support of women's education this summer.") 2023-11-03 14:55:43.720 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:55:43.720 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect shelf,Other Shoes and Person in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'shelf:[247, 246, 625, 319];Other Shoes:[023, 429, 077, 466];Person:[777, 199, 950, 439],[862, 209, 944, 309],[001, 083, 100, 443].') 2023-11-03 14:55:55.357 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:55:55.357 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Street Lights,Sneakers and Person in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Street Lights:[351, 001, 415, 326],[927, 009, 974, 242];Sneakers:[189, 888, 235, 939],[327, 851, 368, 917],[392, 851, 431, 920],[427, 804, 473, 837],[539, 746, 565, 780],[577, 711, 602, 760],[655, 861, 694, 902],[687, 873, 737, 922];Person:[905, 259, 937, 347].') 2023-11-03 14:56:22.543 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:56:22.543 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'stop motion animation [260, 043, 593, 979] of EL-wire figure kicking a soccer ball [577, 688, 709, 889]') 2023-11-03 14:56:25.729 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:56:25.729 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[209, 450, 253, 664],personFrame1:[805, 570, 847, 794],personFrame1:[468, 425, 503, 613],personFrame1:[216, 529, 269, 763],personFrame1:[506, 371, 539, 552],personFrame1:[953, 460, 988, 652],personFrame1:[492, 849, 530, 1000],personFrame1:[895, 423, 935, 616],personFrame1:[232, 096, 270, 279],personFrame1:[221, 329, 263, 528] in this video clip.Do not deviate from the established classFrame t:[xmin,ymin,xmax,ymax] format in your answer.', 'personFrame1:[209, 450, 253, 664];Frame2:[207, 453, 251, 668],personFrame1:[805, 570, 847, 794];Frame2:[806, 573, 851, 796],personFrame1:[468, 425, 503, 613];Frame2:[468, 431, 504, 621],personFrame1:[216, 529, 269, 763];Frame2:[231, 530, 282, 761],personFrame1:[506, 371, 539, 552];Frame2:[515, 375, 546, 557],personFrame1:[953, 460, 988, 652];Frame2:[952, 451, 987, 645],personFrame1:[492, 849, 530, 1000];Frame2:[494, 851, 533, 1000],personFrame1:[895, 423, 935, 616];Frame2:[895, 423, 935, 616],personFrame1:[232, 096, 270, 279];Frame2:[232, 092, 269, 275],personFrame1:[221, 329, 263, 528];Frame2:[224, 323, 263, 519].') 2023-11-03 14:56:37.713 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:56:37.713 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "[529, 281, 995, 994] My Teacher's in the Computer! book cover. A drawing of a young girl seeing faces on a computer screen is on the cover of the book.") 2023-11-03 14:57:19.701 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:57:19.701 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The lineup [420, 651, 572, 937] [266, 302, 406, 577] [096, 304, 243, 585] [259, 641, 410, 938] [598, 302, 733, 584] [419, 310, 572, 595] for the Democratic presidential candidates debate July 30, 2019 in Detroit. (WDIV') 2023-11-03 14:58:03.641 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:58:03.641 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect footwear,human face,human body,human hair and common sunflower in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'footwear:[370, 849, 418, 992],[159, 927, 208, 998],[252, 926, 286, 998],[453, 873, 485, 992],[538, 921, 575, 998],[610, 929, 646, 998],[696, 920, 736, 987];human face:[030, 209, 999, 537];human body:[380, 284, 535, 933],[002, 233, 073, 424],[076, 371, 141, 515],[115, 417, 297, 993],[280, 344, 393, 748],[520, 331, 623, 608],[655, 326, 756, 536],[761, 390, 871, 536],[863, 436, 934, 670],[869, 498, 999, 938];human hair:[116, 280, 236, 492],[068, 307, 115, 385],[307, 263, 355, 326],[389, 314, 505, 510],[490, 295, 533, 370],[562, 275, 602, 350],[589, 405, 670, 549],[697, 263, 746, 358],[771, 429, 844, 513],[774, 334, 867, 439],[850, 328, 896, 405],[927, 412, 999, 513].') 2023-11-03 14:58:15.989 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:58:15.990 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'sunglasses:[000, 702, 076, 748],[000, 028, 318, 073],[123, 248, 188, 295],[125, 475, 188, 524],[327, 176, 648, 393],[679, 033, 999, 077],[868, 700, 931, 746],[935, 479, 999, 525],[936, 697, 999, 745];woman:[000, 000, 999, 998];girl:[000, 443, 057, 621],[000, 223, 061, 436],[000, 666, 090, 889],[000, 000, 955, 998],[002, 008, 061, 218],[002, 225, 127, 436],[004, 001, 123, 213],[004, 439, 123, 653],[050, 441, 197, 644],[061, 006, 197, 213],[064, 666, 144, 779],[067, 226, 207, 420],[123, 006, 257, 215],[125, 219, 261, 441],[194, 006, 333, 215],[673, 000, 733, 132],[705, 002, 803, 188],[739, 002, 877, 223],[774, 219, 866, 415],[796, 006, 943, 234],[808, 219, 938, 436],[809, 441, 872, 590],[825, 441, 936, 660],[846, 439, 999, 660],[856, 672, 943, 817],[866, 219, 999, 441],[877, 000, 999, 219],[905, 670, 999, 813],[933, 887, 999, 998];human face:[000, 220, 055, 363],[000, 441, 057, 582],[000, 000, 058, 142],[000, 667, 073, 817],[060, 442, 121, 588],[060, 224, 124, 365],[061, 000, 124, 142],[076, 666, 140, 786],[127, 221, 193, 363],[129, 444, 193, 588],[130, 000, 193, 138],[195, 221, 258, 360],[196, 441, 224, 537],[197, 000, 257, 139],[260, 000, 322, 142],[331, 000, 649, 693],[678, 000, 741, 121],[744, 000, 803, 142],[749, 224, 799, 313],[799, 227, 865, 362],[804, 000, 869, 143],[805, 439, 868, 587],[869, 224, 931, 365],[874, 446, 932, 589],[874, 000, 937, 145],[874, 670, 941, 807],[935, 893, 999, 998],[937, 224, 999, 362],[940, 446, 999, 587],[944, 000, 999, 145],[945, 670, 999, 814];glasses:[000, 474, 216, 528],[000, 250, 242, 297],[001, 703, 084, 753],[001, 026, 315, 078],[083, 697, 151, 752],[276, 176, 664, 404],[679, 028, 999, 085],[758, 250, 995, 302],[855, 702, 939, 750],[936, 474, 999, 525],[936, 915, 999, 977],[942, 694, 999, 759].') 2023-11-03 14:58:46.517 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:58:46.518 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Picture:[497, 017, 726, 408],[194, 117, 448, 381];Boots:[209, 937, 257, 999],[257, 896, 309, 999],[360, 906, 428, 999],[424, 908, 487, 999],[157, 940, 218, 999];Person:[692, 112, 999, 999],[550, 218, 750, 1000],[407, 225, 590, 1000],[286, 250, 493, 999],[207, 228, 315, 1000],[011, 064, 252, 999].') 2023-11-03 14:58:47.291 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:58:47.292 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'palm tree:[809, 053, 968, 838],[052, 000, 231, 745],[234, 005, 447, 899],[380, 000, 493, 921],[521, 042, 644, 826],[652, 071, 747, 782];house:[003, 336, 244, 882],[266, 395, 364, 637],[524, 681, 744, 998];porch:[136, 727, 238, 910],[281, 604, 596, 895];swimming pool:[281, 725, 562, 998];window:[010, 448, 101, 650],[029, 745, 097, 884].') 2023-11-03 14:58:55.021 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:58:55.021 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Wall [000, 046, 998, 974] featuring text of text of Rukun Negara, the Malaysian national pledge, on upper concourse level') 2023-11-03 14:59:08.391 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:59:08.392 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Person:[219, 428, 312, 684],[807, 526, 874, 661],[747, 527, 831, 659],[047, 536, 117, 650];Airplane:[135, 411, 906, 684].') 2023-11-03 14:59:44.684 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 14:59:44.685 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'As fresh is if they were done yesterday: Hilma af Klint, at the Serpentine [093, 044, 472, 853]') 2023-11-03 15:00:25.894 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:00:25.895 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Cat,Chair,Blackboard and Person in this image.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Chair:[322, 562, 629, 851],[001, 602, 171, 887];Blackboard:[831, 100, 1000, 509];Person:[149, 404, 681, 875].') 2023-11-03 15:00:27.938 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:00:27.940 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me what is thisFrame1:[379, 345, 462, 406] and track its trajectory.Use the specified Frame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'This is a stealth fighterFrame1:[379, 345, 462, 406];Frame2:[375, 358, 457, 418];Frame3:[364, 355, 453, 426].') 2023-11-03 15:01:03.301 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:01:03.302 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Cabinet,Person and Storage box in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Cabinet:[030, 001, 551, 1000],[000, 001, 104, 999],[551, 001, 993, 998],[953, 341, 999, 832];Person:[118, 784, 158, 843],[063, 001, 090, 044],[712, 649, 737, 692],[373, 478, 403, 511],[957, 721, 995, 762],[156, 242, 208, 290];Storage box:[973, 373, 1000, 518],[000, 353, 030, 529],[000, 604, 062, 668],[000, 868, 090, 999],[000, 648, 078, 855].') 2023-11-03 15:01:19.378 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:01:19.378 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Chair,Frame and Cup in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Chair:[415, 531, 649, 999],[006, 735, 447, 999],[176, 513, 310, 676],[322, 498, 412, 633];Picture:[346, 256, 440, 418],[604, 409, 637, 506],[646, 464, 676, 553],[646, 363, 676, 450],[693, 396, 737, 480],[748, 417, 800, 508],[816, 478, 878, 582],[894, 549, 958, 672],[792, 581, 845, 693],[878, 675, 953, 810],[723, 516, 774, 615];Cup:[282, 647, 314, 710],[297, 655, 329, 716].') 2023-11-03 15:02:03.316 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:02:03.316 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[495, 427, 589, 627],personFrame1:[448, 377, 532, 641],personFrame1:[501, 397, 551, 616],personFrame1:[665, 466, 712, 691],personFrame1:[800, 476, 872, 818],personFrame1:[357, 338, 417, 575],personFrame1:[201, 588, 261, 868] in this video clip.Ensure you use the exact format categoryFrame t:[xmin,ymin,xmax,ymax] in your response.', 'personFrame1:[495, 427, 589, 627];Frame2:[518, 408, 600, 595],personFrame1:[448, 377, 532, 641];Frame2:[441, 372, 517, 636],personFrame1:[501, 397, 551, 616];Frame2:[487, 377, 546, 605],personFrame1:[665, 466, 712, 691];Frame2:[670, 476, 728, 719],personFrame1:[800, 476, 872, 818];Frame2:[774, 480, 858, 802],personFrame1:[357, 338, 417, 575];Frame2:[333, 337, 397, 562],personFrame1:[201, 588, 261, 868];Frame2:[186, 579, 251, 858].') 2023-11-03 15:02:14.802 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:02:14.802 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'person:[807, 565, 935, 913],[245, 553, 365, 937],[371, 575, 456, 782],[431, 069, 535, 269],[453, 017, 618, 281],[488, 585, 569, 766],[606, 582, 676, 775],[702, 569, 778, 766];sculpture:[808, 556, 933, 894],[235, 564, 270, 746],[246, 559, 364, 984],[255, 559, 292, 622],[370, 579, 448, 774],[435, 070, 535, 275],[488, 016, 615, 291],[492, 579, 563, 780],[606, 585, 671, 767],[703, 570, 771, 760].') 2023-11-03 15:02:23.209 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:02:23.209 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Person:[447, 234, 628, 729],[351, 273, 397, 469],[260, 246, 313, 482],[164, 244, 214, 480],[105, 362, 139, 444],[088, 264, 138, 431],[091, 234, 119, 300];Hat:[503, 237, 546, 315];Gloves:[476, 409, 511, 473],[584, 423, 626, 475];Boots:[479, 635, 514, 714];Stroller:[100, 345, 181, 528];Tricycle:[363, 420, 643, 971].') 2023-11-03 15:02:29.482 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:02:29.483 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'plant:[000, 330, 095, 465],[008, 249, 083, 293],[216, 223, 339, 259],[383, 202, 617, 277],[435, 094, 502, 145],[540, 245, 618, 311],[595, 069, 693, 151],[653, 101, 743, 189];umbrella:[733, 009, 890, 257];car:[030, 057, 918, 899],[612, 060, 999, 466].') 2023-11-03 15:02:43.807 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:02:43.807 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Bread,Boots,Handbag and Person in this image.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Boots:[342, 758, 403, 946],[304, 719, 359, 893];Handbag:[237, 612, 358, 764],[274, 365, 428, 561];Person:[275, 180, 468, 948],[674, 274, 789, 731],[668, 305, 727, 434],[755, 438, 849, 756],[873, 418, 908, 536],[912, 428, 929, 468].') 2023-11-03 15:02:51.372 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:02:51.372 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Lamp:[532, 207, 611, 516];Chair:[281, 375, 567, 929];Desk:[531, 494, 692, 779];Pillow:[739, 279, 936, 477],[658, 332, 757, 470],[910, 288, 999, 469];Bed:[666, 459, 997, 1000];Person:[233, 214, 560, 963];Belt:[362, 483, 427, 544];High Heels:[494, 830, 564, 959],[318, 846, 425, 966];Recorder:[598, 447, 671, 510].') 2023-11-03 15:03:00.749 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:03:00.749 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[169, 406, 242, 575] and track its trajectory.When detailing trajectories in your response, adhere to the Frame t:[xmin,ymin,xmax,ymax] format.', 'This is a tricycle wheelFrame1:[169, 406, 242, 575];Frame2:[146, 430, 214, 594];Frame3:[128, 444, 203, 622];Frame4:[089, 475, 164, 661].') 2023-11-03 15:04:20.978 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:04:20.978 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[279, 563, 346, 662],carFrame1:[749, 554, 931, 703],carFrame1:[551, 548, 629, 660],carFrame1:[082, 562, 240, 727],bicycleFrame1:[920, 576, 968, 688],carFrame1:[345, 543, 426, 660],carFrame1:[420, 549, 568, 757] in this video clip.To respond correctly, utilize the specified classFrame t:[xmin,ymin,xmax,ymax] format.', 'carFrame1:[279, 563, 346, 662];Frame2:[223, 567, 318, 686],carFrame1:[749, 554, 931, 703];Frame2:[832, 556, 1000, 764],carFrame1:[551, 548, 629, 660];Frame2:[553, 550, 638, 669],carFrame1:[082, 562, 240, 727];Frame2:[000, 565, 142, 809],bicycleFrame1[920, 576, 968, 688];Frame2:No object,carFrame1:[345, 543, 426, 660];Frame2:[346, 547, 428, 663],carFrame1:[420, 549, 568, 757];Frame2:[419, 552, 568, 767].') 2023-11-03 15:04:25.773 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:04:25.773 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "A man [159, 042, 996, 997] holds up a container [059, 849, 251, 997] of oil and sand as he testifies at last night's hearing") 2023-11-03 15:04:40.173 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:04:40.173 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[263, 491, 316, 577],carFrame1:[310, 498, 354, 557],carFrame1:[624, 457, 767, 614],carFrame1:[564, 488, 630, 571],carFrame1:[000, 478, 142, 682] in this video clip.When answering, maintain the format classFrame t:[xmin,ymin,xmax,ymax] without deviation.', 'carFrame1:[263, 491, 316, 577];Frame2:[220, 500, 284, 595],carFrame1:[310, 498, 354, 557];Frame2:[280, 486, 332, 577],carFrame1:[624, 457, 767, 614];Frame2:[677, 444, 931, 678],carFrame1:[564, 488, 630, 571];Frame2:[589, 497, 672, 583],carFrame1[000, 478, 142, 682];Frame2:No object.') 2023-11-03 15:05:07.767 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:05:07.767 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Bench:[174, 571, 265, 649];Chair:[467, 568, 511, 646];Person:[473, 555, 525, 644].') 2023-11-03 15:06:40.438 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:06:40.438 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'woman:[799, 471, 989, 902],[809, 501, 877, 655];man:[000, 523, 439, 947],[325, 353, 400, 494],[369, 486, 680, 926],[539, 373, 604, 488],[556, 363, 775, 734],[683, 479, 860, 933];vehicle:[000, 000, 999, 998];human face:[428, 512, 525, 664],[741, 484, 793, 578],[809, 543, 867, 611],[874, 492, 935, 606].') 2023-11-03 15:06:58.540 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:06:58.541 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Microphone:[493, 245, 558, 272],[849, 236, 889, 300],[063, 482, 135, 545];Speaker:[165, 511, 289, 597];Person:[741, 324, 959, 598],[251, 387, 568, 1000];Bracelet:[256, 672, 310, 692];Cymbal:[670, 340, 812, 359];Drum:[568, 382, 700, 470],[495, 383, 594, 463],[448, 466, 681, 610],[675, 454, 801, 499];Guitar:[245, 603, 606, 796];Bottle:[847, 758, 937, 937];Cup:[888, 718, 966, 782].') 2023-11-03 15:07:00.138 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:07:00.138 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[439, 395, 745, 897] and track its trajectory.When detailing trajectories in your response, adhere to the Frame t:[xmin,ymin,xmax,ymax] format.', 'This is a chameleon sitting on the wallFrame1:[439, 395, 745, 897];Frame2:[429, 437, 735, 909];Frame3:[441, 444, 748, 962];Frame4:[437, 437, 744, 955];Frame5:[439, 416, 748, 930].') 2023-11-03 15:07:06.714 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:07:06.714 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Projector in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Projector:[557, 078, 634, 103].') 2023-11-03 15:07:14.588 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:07:14.588 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'human head:[473, 540, 648, 857];human body:[475, 541, 696, 998];woman:[473, 551, 701, 998];clothing:[615, 883, 703, 998],[683, 929, 999, 998],[777, 573, 999, 679];human face:[510, 581, 616, 851].') 2023-11-03 15:07:40.867 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:07:40.868 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[606, 472, 652, 632],personFrame1:[842, 367, 998, 1000],personFrame1:[536, 481, 586, 541],bicycleFrame1:[589, 536, 675, 637],carFrame1:[130, 526, 178, 592],personFrame1:[534, 496, 568, 623],personFrame1:[662, 488, 696, 625] in this video clip.For clarity in your response, always adopt the classFrame t:[xmin,ymin,xmax,ymax] format.', 'personFrame1:[606, 472, 652, 632];Frame2:[556, 468, 605, 621],personFrame1:[842, 367, 998, 1000];Frame2:[763, 367, 897, 1000],personFrame1[536, 481, 586, 541];Frame2:No object,bicycleFrame1:[589, 536, 675, 637];Frame2:[548, 528, 632, 637],carFrame1:[130, 526, 178, 592];Frame2:[130, 521, 174, 593],personFrame1[534, 496, 568, 623];Frame2:No object,personFrame1:[662, 488, 696, 625];Frame2:[627, 493, 660, 627].') 2023-11-03 15:08:34.241 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:08:34.242 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'How to best help young athletes [000, 006, 998, 994] prepare for a big sports event') 2023-11-03 15:08:34.882 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:08:34.882 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'clothing:[396, 282, 601, 999],[036, 368, 209, 791],[180, 407, 391, 953],[592, 391, 770, 943],[621, 383, 769, 656],[745, 368, 853, 745],[766, 394, 974, 954],[913, 368, 986, 899],[965, 476, 999, 660];human head:[111, 296, 150, 373],[252, 306, 331, 438],[461, 286, 517, 416],[638, 291, 720, 422],[651, 275, 723, 356],[778, 301, 830, 391],[853, 297, 930, 399];man:[035, 286, 208, 864],[177, 305, 386, 984],[390, 286, 592, 985],[595, 291, 783, 984],[747, 287, 848, 851],[875, 272, 990, 945].') 2023-11-03 15:08:58.090 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:08:58.090 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Person:[173, 078, 845, 992];Hat:[296, 076, 579, 209];Boots:[623, 818, 806, 957];Belt:[368, 526, 606, 586];Glasses:[357, 078, 578, 144];Gun:[377, 682, 570, 936].') 2023-11-03 15:09:06.734 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 15:09:06.735 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Leather Shoes:[658, 714, 709, 771],[527, 709, 605, 788];Sneakers:[517, 703, 545, 768],[436, 702, 473, 765];Couch:[214, 416, 786, 744];Lamp:[229, 193, 272, 438],[735, 192, 771, 412];Microphone:[845, 346, 887, 459];Tripod:[822, 339, 999, 906];Person:[525, 252, 766, 786],[433, 228, 557, 768],[274, 197, 438, 775];Necklace:[484, 358, 506, 391].') 2023-11-03 15:09:07.479 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:09:07.480 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'woman:[711, 808, 935, 1000],[135, 506, 612, 998],[614, 648, 672, 733];human hair:[137, 640, 385, 993],[000, 447, 025, 513],[023, 432, 090, 533],[033, 491, 125, 612],[150, 481, 259, 634],[312, 602, 383, 663],[627, 650, 675, 723],[644, 778, 749, 962],[740, 815, 894, 992],[783, 726, 828, 789],[844, 810, 923, 900];human head:[635, 793, 760, 950],[000, 412, 228, 617],[153, 485, 277, 635],[186, 655, 395, 950],[267, 511, 476, 661],[441, 579, 999, 801];mammal:[125, 600, 407, 1000],[000, 371, 999, 998];man:[052, 492, 278, 966],[000, 443, 083, 731],[088, 449, 134, 525],[104, 471, 179, 632],[240, 538, 313, 670],[310, 570, 413, 757],[406, 628, 528, 958],[407, 466, 568, 598],[482, 609, 551, 844],[586, 417, 616, 514],[611, 697, 771, 986],[615, 549, 642, 624],[615, 649, 706, 805],[633, 490, 999, 651],[690, 456, 999, 549],[737, 708, 796, 819],[753, 725, 832, 834],[768, 587, 790, 680],[785, 598, 810, 693],[808, 583, 835, 677],[808, 566, 837, 653],[822, 715, 867, 833],[835, 767, 959, 998],[846, 593, 871, 695],[882, 620, 910, 717],[889, 757, 999, 998],[934, 601, 965, 712],[943, 776, 999, 907];human body:[000, 440, 025, 679],[028, 435, 100, 997],[052, 485, 276, 998],[105, 476, 183, 664],[122, 511, 618, 998],[260, 532, 307, 656],[297, 566, 410, 754],[482, 601, 547, 822],[600, 792, 765, 998],[617, 700, 773, 900],[620, 488, 999, 624],[673, 569, 692, 645],[727, 577, 759, 664],[732, 832, 940, 998],[757, 728, 841, 853],[764, 589, 786, 679],[800, 511, 821, 584],[806, 606, 825, 690],[830, 711, 865, 843],[841, 595, 867, 690],[885, 622, 907, 714],[940, 593, 961, 703],[951, 784, 999, 929],[967, 591, 983, 683];clothing:[000, 513, 063, 737],[020, 608, 118, 771],[042, 646, 230, 998],[116, 525, 162, 666],[149, 684, 614, 998],[393, 769, 528, 956],[604, 711, 676, 785],[611, 775, 773, 945],[910, 867, 999, 997];human arm:[383, 501, 482, 935],[428, 528, 613, 998],[882, 881, 935, 998].') 2023-11-03 15:09:46.428 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 15:09:46.428 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Bench:[228, 703, 326, 795];Person:[226, 666, 279, 788],[242, 668, 317, 792];Bicycle:[391, 691, 443, 766].') 2023-11-03 15:09:57.994 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:09:57.994 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, can you tell me what thisframe:1:[000, 449, 443, 998];frame:2:[000, 449, 443, 998];frame:3:[000, 448, 446, 998] is?Tailor your answer briefly, using the image frames and trajectory coordinates given.', 'This is a/an boat keeping still in water') 2023-11-03 15:10:34.418 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:10:34.419 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Tuba:[565, 115, 827, 413];Cymbal:[487, 439, 721, 586];Person:[231, 297, 459, 873],[502, 265, 704, 845],[690, 279, 891, 886],[680, 313, 842, 825];Hat:[298, 298, 402, 327],[542, 262, 644, 293];Belt:[257, 492, 400, 521],[714, 499, 869, 522];Gloves:[584, 558, 669, 594].') 2023-11-03 15:10:59.878 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:10:59.878 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'car:[078, 188, 928, 689].') 2023-11-03 15:11:22.194 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:11:22.195 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[181, 333, 691, 519] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a personFrame1:[181, 333, 691, 519];Frame2:[172, 311, 698, 566];Frame3:[283, 279, 665, 643];Frame4:[224, 368, 741, 583].') 2023-11-03 15:11:31.050 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:11:31.050 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'window:[663, 435, 767, 644],[236, 000, 546, 134];land vehicle:[000, 603, 136, 781];building:[127, 000, 999, 998].') 2023-11-03 15:11:40.233 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:11:40.233 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'man:[521, 075, 821, 987],[000, 666, 118, 999],[000, 246, 302, 813],[417, 315, 506, 465],[449, 739, 723, 999],[796, 117, 980, 970],[864, 145, 999, 999];clothing:[513, 198, 803, 932],[000, 811, 120, 999],[000, 353, 297, 818],[236, 498, 359, 669],[348, 403, 483, 555],[553, 945, 713, 999],[821, 315, 973, 958],[868, 348, 999, 999];human face:[000, 678, 052, 867],[016, 271, 095, 416],[330, 382, 371, 460],[595, 195, 677, 347],[822, 192, 920, 340],[919, 211, 966, 373].') 2023-11-03 15:12:04.644 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 15:12:04.645 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Person:[183, 367, 488, 997],[478, 371, 795, 998].') 2023-11-03 15:12:56.442 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:12:56.442 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'image of servpro employees [007, 029, 982, 995] with local school teacher [379, 145, 697, 998] smiling and holding a bucket [452, 445, 686, 864] of cleaning supplies') 2023-11-03 15:13:50.767 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:13:50.768 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Lisa Juen [063, 064, 956, 950]. Brooch: Demons of Age, 2007. Mild steel, enamel, synthetic rubies, silk, stainless steel pin. 13 x 12.5 x 5 cm. Photo by the artist.') 2023-11-03 15:13:51.785 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:13:51.785 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Make a no-sew fleece blanket [001, 011, 996, 901] with a braided edge.') 2023-11-03 15:13:58.085 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:13:58.085 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, what is this objectframe:1:[335, 239, 568, 872];frame:2:[342, 245, 562, 872];frame:3:[357, 254, 557, 872] doing?Please briefly answer the question based on the provided image frames and trajectory coordinates.', 'This is a/an man performing inline skating') 2023-11-03 15:13:58.810 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:13:58.810 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Chair:[127, 680, 798, 1000],[905, 687, 1000, 998],[756, 463, 1000, 987],[240, 454, 684, 691],[001, 458, 175, 994],[001, 333, 097, 451],[007, 263, 122, 437],[001, 234, 171, 408],[031, 195, 218, 311],[086, 176, 238, 269],[123, 145, 269, 238],[156, 128, 283, 201],[170, 114, 300, 197],[025, 044, 120, 160],[052, 079, 092, 176],[001, 072, 063, 191],[104, 341, 475, 748],[171, 278, 417, 355],[206, 229, 435, 368],[209, 201, 440, 354],[252, 179, 436, 308],[277, 147, 435, 234],[296, 127, 440, 207],[500, 345, 851, 788],[499, 291, 742, 426],[483, 235, 703, 437],[470, 210, 658, 376],[476, 174, 636, 325],[477, 150, 610, 271],[471, 127, 598, 253],[469, 100, 584, 206],[468, 066, 567, 137],[862, 341, 1000, 464],[800, 291, 1000, 463],[741, 249, 955, 375],[703, 202, 887, 321],[679, 182, 825, 278],[641, 141, 787, 266],[620, 117, 745, 234],[587, 086, 707, 190],[577, 069, 660, 133],[567, 059, 648, 115],[916, 118, 1000, 277],[879, 104, 952, 255],[847, 086, 911, 229],[818, 072, 908, 200],[786, 056, 879, 174];Moniter:[854, 547, 999, 864],[267, 819, 668, 999],[332, 540, 595, 691],[180, 398, 375, 624],[001, 536, 061, 850],[932, 416, 1000, 473],[860, 336, 997, 451];Person:[087, 001, 178, 158].') 2023-11-03 15:14:01.687 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:14:01.687 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[365, 364, 504, 573] and track its trajectory.When detailing trajectories in your response, adhere to the Frame t:[xmin,ymin,xmax,ymax] format.', 'This is a anteaterFrame1:[365, 364, 504, 573];Frame2:[401, 387, 498, 600];Frame3:[458, 418, 572, 612];Frame4:[447, 387, 616, 599];Frame5:[367, 350, 679, 528].') 2023-11-03 15:14:03.627 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:14:03.627 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'man:[339, 202, 491, 740],[033, 379, 130, 526],[110, 465, 329, 878],[138, 376, 391, 689],[220, 465, 253, 601],[464, 207, 626, 729],[606, 170, 700, 537],[674, 209, 765, 560];clothing:[340, 248, 486, 683],[000, 456, 999, 999],[035, 425, 131, 597],[478, 306, 627, 755],[601, 255, 696, 524],[673, 314, 766, 579];table:[032, 643, 234, 757];human face:[220, 489, 243, 577],[234, 487, 289, 593],[263, 397, 312, 476],[386, 229, 434, 330],[545, 243, 593, 316],[618, 539, 976, 999],[625, 189, 671, 286].') 2023-11-03 15:14:31.029 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:14:31.030 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[446, 300, 504, 529] and track its trajectory.To maintain consistency, ensure that trajectories in your answer match the Frame t:[xmin,ymin,xmax,ymax] setup.', 'This is a personFrame1:[446, 300, 504, 529];Frame2:[460, 229, 507, 562];Frame3:[464, 554, 510, 950];Frame4:[466, 216, 532, 572];Frame5:[428, 162, 483, 547].') 2023-11-03 15:14:35.938 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:14:35.938 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Warning: Greedy Matching • The repeat characters [661, 554, 817, 842] (* and +) push outward in both') 2023-11-03 15:14:40.648 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:14:40.648 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Person:[172, 182, 645, 929];High Heels:[256, 814, 330, 856];Boots:[298, 706, 359, 846];Sneakers:[403, 882, 469, 931];Hat:[678, 525, 763, 598];Handbag:[037, 671, 207, 714].') 2023-11-03 15:15:42.892 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:15:42.893 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect cosmetics,bicycle,motorcycle,wheel and tire in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'bicycle:[038, 081, 705, 491];motorcycle:[078, 018, 975, 888];wheel:[806, 436, 980, 892],[006, 198, 212, 461],[089, 525, 380, 861],[454, 178, 641, 405];tire:[807, 445, 983, 906],[006, 192, 211, 447],[087, 507, 371, 839],[458, 174, 643, 390].') 2023-11-03 15:15:49.480 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:15:49.481 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[686, 310, 995, 997] [003, 398, 201, 996] [001, 484, 121, 995] [203, 243, 372, 996] [344, 369, 593, 996] [550, 389, 694, 996] Students learn about the history of Macy’s and how it came to be. The students learned that the red star found in Macy’s logo stands for success.') 2023-11-03 15:15:58.939 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:15:58.940 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'shelf:[716, 403, 847, 582];Frame:[358, 001, 842, 209];Person:[543, 397, 847, 857];Glasses:[556, 503, 642, 568];Sneakers:[703, 340, 793, 420],[795, 352, 844, 416],[729, 466, 820, 560],[814, 475, 847, 562];Basin:[225, 730, 325, 801],[132, 732, 220, 811];Dog:[399, 569, 581, 752].') 2023-11-03 15:17:52.142 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:17:52.142 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2 and frame3, please tell me what this oneframe:1:[428, 283, 589, 790];frame:2:[429, 276, 595, 798];frame:3:[415, 270, 596, 835] is doing?Please briefly answer the question based on the provided image frames and trajectory coordinates.', 'This is a/an five parrots feeding') 2023-11-03 15:18:29.957 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:18:29.957 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Just Married (2003) is one of the best movies like Funny People [005, 019, 946, 941] (2009') 2023-11-03 15:18:48.143 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:18:48.144 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'footwear:[490, 904, 569, 998],[000, 933, 064, 991],[102, 940, 264, 999],[298, 748, 344, 813],[415, 839, 517, 905];helmet:[160, 145, 334, 270];woman:[303, 306, 493, 804],[412, 315, 642, 999],[877, 409, 920, 545],[944, 432, 999, 620];fashion accessory:[000, 318, 057, 386],[000, 928, 068, 999],[093, 942, 265, 999],[132, 253, 367, 655],[160, 145, 323, 250],[360, 644, 467, 828],[419, 835, 519, 905],[494, 903, 555, 999],[651, 636, 715, 667];man:[037, 134, 381, 999],[577, 336, 695, 695],[650, 337, 773, 663],[725, 351, 810, 605],[882, 390, 957, 597].') 2023-11-03 15:19:05.443 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:19:05.443 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[307, 209, 839, 980] Pretty sweet queen bunk bed idea. Modern and save [026, 009, 992, 991] a lot of') 2023-11-03 15:19:25.000 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:19:25.000 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Guitar:[258, 540, 899, 997];Drum:[001, 928, 095, 997];Person:[223, 109, 678, 998],[001, 683, 085, 939],[536, 780, 677, 999],[660, 825, 742, 999];Necklace:[392, 328, 472, 531];Microphone:[437, 221, 484, 280].') 2023-11-03 15:19:35.854 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:19:35.854 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'human body:[133, 167, 963, 979];man:[037, 068, 966, 985];human eye:[327, 339, 406, 375],[486, 319, 577, 348];suit:[021, 443, 999, 999];human mouth:[418, 480, 554, 511];hat:[105, 048, 796, 330];human hair:[653, 245, 734, 308];human face:[279, 260, 711, 586];human arm:[011, 655, 253, 999],[879, 565, 990, 993];human nose:[405, 330, 516, 438].') 2023-11-03 15:19:51.528 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:19:51.528 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Let your mum know how brilliant she is with this lovely card [004, 001, 997, 995] by Sarah Lovell Art [118, 292, 771, 738].') 2023-11-03 15:20:06.595 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:20:06.595 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "what's wrong with [010, 137, 994, 865] my horse free webinar") 2023-11-03 15:20:16.810 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:20:16.810 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Flag:[417, 428, 439, 493];Street Lights:[797, 015, 899, 933],[602, 368, 641, 723],[545, 492, 566, 656],[406, 467, 428, 669];Parking meter:[888, 511, 938, 991],[673, 554, 697, 782];SUV:[494, 576, 549, 646];Picture:[170, 482, 214, 563];Trash bin Can:[275, 628, 340, 765],[554, 612, 579, 665].') 2023-11-03 15:20:44.409 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:20:44.410 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A woman [444, 023, 726, 998] holds a poster [347, 347, 749, 805] that reads “Brazil does not have a bed in the ICU” as she kneels in front of the Ministry [001, 009, 995, 540] of Health in Brasilia, Brazil.') 2023-11-03 15:21:30.512 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:21:30.513 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Lamp:[568, 514, 633, 726],[414, 517, 472, 706],[145, 416, 195, 729];Chair:[666, 606, 756, 816],[630, 626, 699, 872],[540, 607, 592, 689],[478, 618, 510, 696],[411, 657, 565, 1000],[261, 701, 410, 999],[063, 666, 243, 999];Desk:[396, 695, 667, 925],[174, 783, 291, 961],[232, 667, 404, 732],[734, 936, 999, 999],[814, 816, 999, 968],[846, 786, 999, 835],[887, 738, 999, 797],[906, 709, 998, 746];Couch:[839, 700, 928, 763],[821, 736, 929, 809],[759, 776, 913, 923],[722, 805, 908, 982],[553, 921, 741, 999];Napkin:[738, 959, 847, 999].') 2023-11-03 15:21:50.040 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:21:50.041 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track boatFrame1:[506, 315, 782, 548] in this video clip.To respond correctly, utilize the specified classFrame t:[xmin,ymin,xmax,ymax] format.', 'boatFrame1:[506, 315, 782, 548];Frame2:[467, 351, 746, 579].') 2023-11-03 15:22:15.276 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 15:22:15.277 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'human face:[451, 425, 535, 550],[199, 411, 267, 549],[288, 465, 365, 617],[633, 428, 678, 591];woman:[097, 272, 396, 848],[228, 424, 443, 999],[428, 388, 629, 830];man:[000, 471, 285, 999],[533, 382, 825, 999];clothing:[066, 674, 281, 999],[145, 574, 271, 828],[208, 616, 443, 999],[370, 854, 635, 999],[575, 548, 830, 993].') 2023-11-03 15:23:31.924 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:23:31.924 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Businessman [428, 192, 575, 489] carries a piece [521, 176, 614, 282] of puzzle while he walking on a rope. concept of missing piece') 2023-11-03 15:24:18.655 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:24:18.655 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, can you tell me what thisframe:1:[839, 034, 999, 917];frame:2:[883, 040, 998, 928];frame:3:[977, 122, 999, 938] is?Tailor your answer briefly, using the image frames and trajectory coordinates given.', 'This is a/an cow shake head') 2023-11-03 15:24:20.767 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:24:20.767 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Bench:[159, 342, 313, 389],[833, 361, 999, 414];Cup:[273, 620, 711, 921].') 2023-11-03 15:24:38.568 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:24:38.568 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[746, 082, 969, 878],[653, 130, 837, 919],[489, 150, 625, 890],[383, 091, 503, 890],[258, 103, 496, 894],[019, 134, 173, 872];Sneakers:[024, 814, 074, 872],[113, 815, 170, 874],[256, 841, 311, 887],[345, 850, 435, 894],[435, 833, 503, 894],[505, 842, 561, 888],[558, 852, 582, 883],[708, 858, 746, 911],[778, 862, 830, 918],[747, 839, 783, 883],[847, 846, 893, 889];Gloves:[797, 510, 834, 588],[723, 384, 763, 436];Hat:[392, 092, 456, 186],[711, 131, 779, 219].') 2023-11-03 15:25:01.496 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:25:01.497 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'drink:[411, 174, 663, 869],[000, 156, 142, 799],[124, 127, 289, 818],[287, 423, 411, 815],[665, 122, 813, 856],[818, 063, 998, 867];bottle:[823, 081, 983, 897],[000, 145, 153, 807],[116, 125, 312, 818],[290, 415, 421, 815],[408, 174, 663, 869],[653, 122, 826, 867].') 2023-11-03 15:25:03.112 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:25:03.113 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect girl,woman,man,human face and briefcase in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'girl:[036, 368, 173, 684],[373, 653, 585, 991];woman:[516, 402, 638, 666],[000, 611, 143, 997],[011, 377, 203, 684],[196, 837, 483, 997],[331, 382, 543, 637],[361, 640, 593, 991],[363, 226, 441, 497],[376, 515, 538, 768],[388, 260, 491, 477],[590, 604, 931, 997],[633, 411, 793, 608],[738, 315, 808, 433],[798, 582, 913, 906],[855, 482, 971, 920];man:[043, 373, 388, 997],[135, 231, 255, 442],[260, 208, 318, 408],[375, 171, 421, 306],[428, 160, 453, 242],[470, 237, 556, 493],[536, 226, 585, 342],[591, 304, 658, 415],[610, 224, 648, 315],[615, 351, 720, 600],[660, 244, 721, 340],[716, 315, 770, 417],[778, 357, 845, 484],[783, 457, 880, 644],[965, 342, 998, 408];human face:[096, 415, 158, 517],[156, 435, 270, 626],[176, 271, 225, 360],[313, 868, 426, 997],[380, 742, 475, 817],[420, 542, 468, 635],[433, 420, 485, 497],[505, 253, 541, 322],[561, 415, 600, 488],[696, 651, 766, 820],[813, 444, 863, 515].') 2023-11-03 15:25:19.791 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:25:19.791 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'woman:[638, 374, 867, 997];plant:[314, 529, 468, 728],[315, 404, 522, 494],[468, 675, 587, 776],[531, 422, 736, 606],[821, 517, 999, 780];footwear:[625, 888, 699, 985].') 2023-11-03 15:25:33.907 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:25:33.908 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[376, 350, 431, 666] and track its trajectory.Use the specified Frame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'This is a personFrame1:[376, 350, 431, 666];Frame2:[386, 357, 441, 672];Frame3:[420, 355, 475, 666];Frame4:[442, 354, 496, 660];Frame5:[480, 366, 533, 665].') 2023-11-03 15:25:47.235 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:25:47.235 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the airplane moving to right?Always default to the TrackiFrame t:[xmin,ymin,xmax,ymax] pattern when discussing trajectories in your answer.', 'There is one airplane moving to right.Track1frame:1:[243, 323, 751, 604];frame:2:[275, 260, 738, 531];frame:3:[318, 245, 741, 480].') 2023-11-03 15:26:08.131 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:26:08.131 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'SUV:[933, 482, 1000, 712],[657, 364, 999, 680],[751, 371, 810, 433];Car:[278, 406, 635, 610];Stop Sign:[327, 287, 353, 337];Van:[000, 343, 230, 537];Street Lights:[051, 057, 116, 347];Pickup Truck:[682, 358, 723, 404];Handbag:[259, 413, 291, 497];Leather Shoes:[272, 606, 295, 621],[229, 612, 246, 624],[330, 603, 361, 617],[368, 607, 397, 619],[495, 594, 519, 604],[512, 598, 535, 608];Belt:[516, 473, 539, 480],[353, 460, 390, 480];Person:[494, 368, 585, 608],[329, 360, 397, 619],[226, 382, 295, 624];Watch:[351, 431, 356, 442];Necklace:[365, 383, 379, 401].') 2023-11-03 15:26:19.821 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:26:19.822 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Person:[561, 002, 1000, 1000],[523, 297, 850, 999],[013, 298, 357, 999];Sneakers:[785, 956, 878, 1000];Glasses:[753, 060, 872, 121];Hat:[744, 002, 877, 104].') 2023-11-03 15:26:30.973 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:26:30.974 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Just letting it out is one of [110, 303, 888, 592] the definitions of bad art Picture Quote #1') 2023-11-03 15:26:43.745 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:26:43.746 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[621, 605, 721, 700] and track its trajectory.Your response should highlight trajectories using the established Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a european hareFrame1:[621, 605, 721, 700];Frame2:[655, 440, 794, 542];Frame3:[775, 381, 878, 485];Frame4:[764, 506, 859, 608].') 2023-11-03 15:26:47.051 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:26:47.051 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'wheel:[500, 608, 559, 820],[075, 812, 145, 999],[201, 801, 266, 993],[288, 622, 348, 815],[348, 716, 383, 809],[555, 606, 579, 816],[648, 689, 671, 762],[684, 700, 705, 764],[734, 755, 762, 832],[938, 739, 970, 849],[974, 735, 991, 842];tire:[498, 599, 563, 837],[075, 813, 147, 999],[201, 804, 267, 998],[290, 617, 347, 814],[344, 715, 383, 806],[543, 604, 578, 818],[649, 684, 671, 765],[733, 748, 762, 835],[938, 737, 973, 849],[972, 739, 991, 839];vehicle registration plate:[808, 661, 884, 693];car:[000, 455, 281, 999],[223, 520, 327, 741],[701, 524, 999, 857].') 2023-11-03 15:27:03.699 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:27:03.700 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[437, 244, 855, 380] and track its trajectory.Your response should highlight trajectories using the established Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a alligator lizardFrame1:[437, 244, 855, 380];Frame2:[408, 231, 789, 370];Frame3:[371, 237, 711, 361];Frame4:[332, 226, 644, 364];Frame5:[315, 226, 626, 364].') 2023-11-03 15:27:15.513 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:27:15.513 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'latticed panels spread out on [000, 109, 998, 996] a workshop table, handyman magazine,') 2023-11-03 15:27:47.554 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:27:47.554 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Skating and Skiing shoes:[497, 799, 607, 925],[643, 802, 803, 910],[820, 706, 969, 812],[617, 728, 718, 816],[463, 811, 514, 907],[283, 771, 448, 911],[167, 767, 292, 897],[409, 778, 476, 896],[282, 747, 362, 814],[744, 731, 850, 829];Helmet:[476, 091, 514, 163],[344, 058, 488, 145],[372, 201, 544, 328],[516, 109, 702, 219],[683, 119, 825, 202],[193, 229, 296, 287];Gloves:[181, 385, 292, 462],[821, 480, 918, 565];Person:[283, 105, 920, 921],[440, 118, 927, 910],[164, 201, 544, 903],[169, 059, 566, 895],[243, 091, 517, 808],[596, 468, 966, 824].') 2023-11-03 15:27:55.361 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:27:55.361 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'man:[090, 453, 459, 1000],[000, 327, 999, 999];woman:[415, 341, 498, 448],[547, 385, 618, 616],[789, 408, 869, 627];clothing:[000, 336, 999, 999];human face:[000, 425, 086, 568],[202, 403, 254, 470],[268, 512, 396, 762],[343, 382, 386, 451],[506, 440, 572, 508],[734, 486, 828, 662],[823, 446, 868, 546],[865, 387, 902, 484],[911, 358, 955, 444].') 2023-11-03 15:28:01.505 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 15:28:01.505 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Boat in this image.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Boat:[165, 708, 362, 863],[965, 785, 999, 849],[736, 778, 869, 830].') 2023-11-03 15:28:20.481 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:28:20.482 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[062, 040, 835, 381] Chocolate Devil Pudding Cake. A super easy and soft, moist cake perfect for chocolate lovers . Also recipe for a delicious whipped cream frosting too!') 2023-11-03 15:28:33.809 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:28:33.810 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Framed dog paw prints impressions [302, 461, 704, 781] in clay in wooden frame [142, 018, 859, 984] with photo of a black dog [360, 146, 643, 389]') 2023-11-03 15:29:04.180 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:29:04.182 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'More Info for Toronto Argonauts vs. Ottawa Redblacks') 2023-11-03 15:29:13.803 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:29:13.803 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Bed:[298, 392, 888, 789];Pillow:[708, 358, 899, 468],[583, 346, 720, 422],[587, 355, 660, 425],[675, 372, 768, 466];Cabinet:[838, 685, 1000, 997],[234, 388, 401, 505],[062, 342, 194, 519],[000, 085, 037, 219];Storage box:[000, 703, 153, 1000];Lamp:[329, 314, 361, 402];Cat:[265, 475, 336, 562];Other Fish:[328, 315, 363, 345];Wild Bird:[056, 079, 134, 220];Trash bin Can:[000, 704, 154, 1000].') 2023-11-03 15:29:30.097 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:29:30.098 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Bakset:[681, 716, 724, 784];Tricycle:[063, 789, 249, 1000],[120, 711, 268, 879],[138, 682, 277, 840],[323, 646, 412, 811],[322, 606, 371, 754],[480, 508, 526, 596];SUV:[398, 545, 499, 631],[513, 609, 643, 857];Car:[741, 668, 901, 828];Motorcycle:[721, 569, 762, 629];Umbrella:[364, 605, 421, 683];Person:[690, 691, 723, 849],[701, 734, 736, 884].') 2023-11-03 15:29:32.754 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:29:32.754 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[207, 358, 321, 631] and track its trajectory.For clarity, represent trajectories using the Frame t:[xmin,ymin,xmax,ymax] format in your response.', 'This is a gametarget of a green turtle kicking the enemies in a place surrounded by fireFrame1:[207, 358, 321, 631];Frame2:[247, 366, 382, 629];Frame3:[135, 304, 309, 526];Frame4:[070, 473, 179, 622];Frame5:[087, 441, 167, 616].') 2023-11-03 15:30:46.040 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:30:46.040 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'sunglasses:[553, 130, 628, 170];man:[000, 058, 130, 432],[109, 029, 379, 515],[276, 040, 465, 358],[483, 079, 683, 945];human face:[214, 050, 284, 188],[553, 104, 626, 214],[635, 090, 701, 201].') 2023-11-03 15:30:54.380 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 15:30:54.381 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'tree:[135, 000, 275, 701],[000, 000, 999, 236].') 2023-11-03 15:31:04.777 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:31:04.777 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[392, 247, 756, 721] blue playdough and [023, 518, 356, 998] toys on [002, 001, 995, 997] a marble counter, from Fun Cheap or Free') 2023-11-03 15:31:18.963 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:31:18.963 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the the last duck among the gray ones that goes through the door. in this video clip?When you incorporate trajectories into your answer, adhere to the TrackiFrame t:[xmin,ymin,xmax,ymax] format.,Present the trajectories in your response using the designated TrackiFrame t:[xmin,ymin,xmax,ymax] layout.', 'There is one the last duck among the gray ones that goes through the door..Track1frame:1:[397, 375, 472, 604];frame:2:[390, 373, 466, 609];frame:3:[384, 363, 455, 608].') 2023-11-03 15:31:52.538 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:31:52.538 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'street light:[036, 553, 119, 736],[097, 516, 111, 723],[106, 484, 130, 727],[115, 450, 146, 725],[130, 399, 160, 736],[143, 333, 183, 813],[169, 267, 212, 880],[249, 657, 261, 721],[261, 655, 271, 743],[271, 646, 284, 767],[283, 638, 295, 750],[298, 628, 308, 725],[315, 621, 327, 813],[331, 611, 346, 815],[349, 600, 368, 693],[376, 587, 394, 768],[399, 568, 421, 760],[435, 559, 451, 775],[468, 533, 493, 739],[516, 506, 538, 762],[562, 476, 590, 828],[622, 438, 658, 814],[713, 389, 748, 841],[814, 322, 859, 799],[935, 227, 985, 847];tree:[201, 590, 306, 696],[301, 578, 351, 699],[356, 563, 385, 705],[410, 545, 448, 670],[420, 544, 505, 618],[448, 619, 518, 697],[524, 575, 636, 661],[586, 439, 626, 560],[606, 619, 704, 688],[638, 503, 698, 590],[674, 483, 778, 619],[743, 438, 850, 516],[773, 515, 844, 650],[826, 498, 883, 615],[871, 438, 951, 537],[890, 503, 948, 626],[957, 454, 999, 621],[978, 622, 999, 703].') 2023-11-03 15:32:07.715 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:32:07.715 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Lamp:[317, 122, 342, 149];Desk:[762, 907, 809, 989],[747, 862, 774, 910];Chair:[767, 890, 836, 993];Person:[019, 688, 051, 777],[595, 764, 634, 855],[637, 761, 689, 889],[547, 761, 580, 851],[001, 398, 454, 589];Sneakers:[723, 953, 761, 975];Handbag:[672, 807, 694, 832];Fire Extinguisher:[211, 891, 252, 982];Whiteboard:[035, 925, 211, 1000],[303, 669, 344, 839].') 2023-11-03 15:32:34.906 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:32:34.907 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Helmet in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Helmet:[049, 241, 085, 332],[296, 278, 362, 391],[409, 253, 475, 353],[474, 335, 537, 435],[683, 347, 746, 449].') 2023-11-03 15:32:40.337 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:32:40.338 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'wheel:[680, 430, 895, 556],[028, 460, 166, 508];car:[010, 238, 954, 557].') 2023-11-03 15:32:51.627 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:32:51.627 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[002, 329, 082, 512],[489, 290, 999, 1000],[105, 307, 499, 988],[562, 348, 648, 501];Trolley:[754, 757, 1000, 999].') 2023-11-03 15:33:44.607 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:33:44.608 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Assemblyman Brian Kolb [415, 032, 935, 997] claimed New York state spends more on economic development than the next three highest spending states combined. (Courtesy: Kolb's Facebook page") 2023-11-03 15:34:08.869 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:34:08.870 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2 and frame3, please tell me what this oneframe:1:[248, 283, 999, 557];frame:2:[263, 346, 979, 559];frame:3:[185, 280, 999, 561] is doing?With reference to the supplied image frames and trajectory coordinates, please formulate a concise response.', 'This is a/an the dog that is using its claws to scrape the back of the other dog.') 2023-11-03 15:34:14.969 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:34:14.970 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, can you tell me what thisframe:1:[961, 000, 999, 105] is?With reference to the supplied image frames and trajectory coordinates, please formulate a concise response.', 'This is a/an goldfish swimming upwards on the right') 2023-11-03 15:34:29.528 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:34:29.529 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[334, 263, 732, 769] and track its trajectory.Use the specified Frame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'This is a luggerFrame1:[334, 263, 732, 769];Frame2:[383, 255, 751, 748];Frame3:[423, 326, 772, 787];Frame4:[438, 277, 764, 747].') 2023-11-03 15:35:08.216 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:35:08.217 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Helmet:[548, 222, 649, 312];Person:[514, 223, 769, 1000],[369, 316, 509, 672];Cabinet:[635, 363, 833, 558],[147, 241, 507, 654];Lamp:[080, 094, 159, 271].') 2023-11-03 15:35:15.753 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:35:15.754 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Cat and Potted Plant in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Cat:[370, 345, 888, 637];Potted Plant:[354, 002, 605, 759].') 2023-11-03 15:35:20.739 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:35:20.739 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "An ombre rose and bright blue accent flower [561, 389, 841, 613] created the illusion of a rainbow on James' lapel.") 2023-11-03 15:35:44.063 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:35:44.063 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Candle:[799, 286, 832, 407],[864, 282, 896, 404],[936, 300, 961, 407];Dinning Table:[604, 668, 1000, 998];Chair:[274, 579, 840, 1000];Cup:[941, 712, 999, 782],[970, 669, 999, 718];Wine Glass:[910, 606, 969, 720].') 2023-11-03 15:35:53.693 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:35:53.694 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'plant:[000, 003, 989, 996];fruit:[000, 285, 025, 353],[339, 368, 503, 634].') 2023-11-03 15:35:58.413 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:35:58.414 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[887, 561, 978, 655],carFrame1:[473, 531, 557, 665],carFrame1:[930, 563, 998, 667] in this video clip.Your reply should be in alignment with the classFrame t:[xmin,ymin,xmax,ymax] structure.', 'carFrame1[887, 561, 978, 655];Frame2:No object,carFrame1:[473, 531, 557, 665];Frame2:[466, 527, 549, 658],carFrame1[930, 563, 998, 667];Frame2:No object.') 2023-11-03 15:36:13.350 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:36:13.351 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Lamp:[260, 001, 373, 105],[745, 001, 858, 111];Mirror:[716, 364, 774, 456],[822, 454, 999, 999],[354, 242, 616, 566];Chair:[035, 667, 119, 922],[398, 599, 579, 934];shelf:[264, 170, 690, 575],[636, 695, 826, 999],[271, 583, 699, 875];Carpet:[207, 902, 749, 984];Flag:[764, 131, 819, 435],[808, 100, 860, 443],[913, 084, 968, 414];Person:[390, 430, 449, 554];Sink:[699, 596, 799, 665];Satchel:[653, 634, 793, 725];Deer:[150, 105, 200, 213].') 2023-11-03 15:36:17.581 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:36:17.581 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Person:[017, 679, 059, 780],[018, 670, 058, 771],[110, 697, 149, 869],[142, 698, 182, 873],[139, 683, 178, 860];Umbrella:[589, 616, 673, 728],[654, 616, 726, 723];Handbag:[099, 729, 134, 787].') 2023-11-03 15:36:35.552 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:36:35.552 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me what is thisFrame1:[423, 427, 501, 552] and track its trajectory.Ensure the trajectories in your answer follow the Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a airplane flying above the roadFrame1:[423, 427, 501, 552];Frame2:[423, 427, 501, 552];Frame3:[421, 427, 500, 552].') 2023-11-03 15:37:19.411 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:37:19.411 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Cabinet:[009, 295, 204, 987],[422, 336, 636, 810],[729, 365, 788, 687],[827, 350, 998, 696],[264, 532, 369, 718];Desk:[754, 636, 968, 715],[665, 722, 990, 997],[370, 852, 798, 997];Lamp:[390, 346, 416, 408],[692, 003, 857, 247];Frame:[276, 413, 316, 488];Handbag:[575, 526, 601, 596],[507, 431, 550, 481],[509, 558, 549, 612],[432, 571, 476, 626],[431, 419, 475, 482],[062, 337, 163, 448],[058, 482, 151, 578],[056, 578, 169, 708],[007, 608, 051, 729],[008, 508, 049, 596].') 2023-11-03 15:37:25.305 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:37:25.306 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Street Lights:[623, 267, 641, 359],[850, 348, 876, 409];Bicycle:[952, 431, 986, 463];Car:[880, 409, 906, 433];Person:[525, 264, 669, 934],[661, 275, 765, 789],[777, 393, 816, 543],[806, 387, 834, 532],[948, 411, 963, 460],[960, 409, 978, 463],[353, 362, 387, 494],[378, 284, 525, 891];Gloves:[413, 474, 456, 511],[497, 476, 528, 522],[525, 487, 551, 531],[565, 396, 605, 447];Sneakers:[442, 731, 456, 794],[437, 828, 466, 890],[545, 878, 576, 934],[605, 771, 641, 878],[672, 729, 702, 785],[706, 720, 729, 788].') 2023-11-03 15:37:26.822 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 15:37:26.822 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'person:[353, 710, 408, 900],[006, 855, 133, 999],[095, 860, 185, 999],[186, 901, 264, 999],[207, 674, 282, 923],[243, 889, 348, 999],[418, 879, 515, 999],[520, 929, 587, 999],[771, 932, 838, 999];building:[000, 000, 999, 999].') 2023-11-03 15:37:31.421 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:37:31.421 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the the zebra walking rightward. in this video clip?Use the specified TrackiFrame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'There is one the zebra walking rightward..Track1frame:1:[488, 612, 749, 962];frame:2:[517, 588, 804, 958];frame:3:[555, 627, 867, 968].') 2023-11-03 15:38:10.137 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:38:10.137 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'woman:[417, 367, 710, 959],[683, 331, 931, 999],[778, 086, 999, 999];clothing:[739, 252, 985, 955],[351, 653, 476, 992],[444, 544, 868, 992];man:[346, 551, 477, 999];human face:[170, 635, 215, 738],[242, 582, 283, 657],[438, 428, 553, 630],[845, 094, 960, 329].') 2023-11-03 15:38:48.348 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 15:38:48.348 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'bird:[029, 327, 937, 862].') 2023-11-03 15:39:14.389 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:39:14.389 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[226, 062, 258, 222],personFrame1:[306, 510, 354, 730],personFrame1:[413, 129, 450, 287],personFrame1:[903, 629, 952, 855],personFrame1:[181, 827, 233, 1000],personFrame1:[765, 174, 798, 321],personFrame1:[258, 262, 292, 432],personFrame1:[896, 422, 935, 617],personFrame1:[157, 272, 190, 454],personFrame1:[198, 437, 230, 647] in this video clip.To respond correctly, utilize the specified classFrame t:[xmin,ymin,xmax,ymax] format.', 'personFrame1:[226, 062, 258, 222];Frame2:[225, 062, 258, 222],personFrame1:[306, 510, 354, 730];Frame2:[305, 509, 347, 730],personFrame1:[413, 129, 450, 287];Frame2:[413, 129, 450, 287],personFrame1:[903, 629, 952, 855];Frame2:[893, 630, 945, 851],personFrame1:[181, 827, 233, 1000];Frame2:[184, 841, 235, 1000],personFrame1:[765, 174, 798, 321];Frame2:[764, 175, 797, 322],personFrame1:[258, 262, 292, 432];Frame2:[257, 267, 291, 438],personFrame1:[896, 422, 935, 617];Frame2:[896, 422, 935, 617],personFrame1:[157, 272, 190, 454];Frame2:[155, 275, 187, 461],personFrame1[198, 437, 230, 647];Frame2:No object.') 2023-11-03 15:39:19.942 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:39:19.943 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[257, 380, 308, 588] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a personFrame1:[257, 380, 308, 588];Frame2:[296, 412, 363, 661];Frame3:[320, 408, 364, 650];Frame4:[242, 390, 314, 598].') 2023-11-03 15:39:48.621 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:39:48.622 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[294, 490, 441, 994] [800, 575, 995, 996] [421, 522, 647, 995] Commuters wearing [001, 609, 033, 688] [673, 551, 713, 618] [606, 517, 655, 599] [540, 506, 582, 568] [810, 726, 920, 868] [206, 628, 254, 700] [475, 612, 548, 716] [330, 555, 383, 622] [122, 620, 174, 688] [768, 586, 822, 664] face masks walk on the platform, of a Paris subway, Sunday Oct.25, 2020.') 2023-11-03 15:40:02.158 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 15:40:02.158 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect clothing,bottle and human face in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'clothing:[710, 119, 999, 647];bottle:[145, 116, 299, 595];human face:[507, 070, 717, 578],[910, 605, 975, 857].') 2023-11-03 15:40:18.507 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:40:18.508 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Couch:[002, 154, 199, 572],[318, 173, 999, 714],[688, 449, 999, 714];Lamp:[271, 001, 490, 243];Cabinet:[152, 238, 444, 307];Picture:[784, 001, 1000, 103];Carpet:[002, 549, 997, 1000];Person:[109, 285, 419, 999],[397, 120, 743, 886].') 2023-11-03 15:41:14.561 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:41:14.561 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect car,person and wheel in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'car:[043, 015, 999, 949],[726, 063, 999, 289];person:[765, 009, 928, 207],[461, 000, 534, 202],[535, 001, 574, 196],[535, 000, 695, 214],[745, 000, 783, 121],[944, 000, 999, 125];wheel:[210, 428, 314, 767],[047, 301, 108, 472].') 2023-11-03 15:41:29.685 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:41:29.685 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[446, 566, 485, 701],personFrame1:[590, 651, 632, 784],personFrame1:[361, 697, 395, 854] in this video clip.To respond correctly, utilize the specified classFrame t:[xmin,ymin,xmax,ymax] format.', 'personFrame1:[446, 566, 485, 701];Frame2:[426, 566, 469, 702],personFrame1:[590, 651, 632, 784];Frame2:[582, 659, 633, 786],personFrame1[361, 697, 395, 854];Frame2:No object.') 2023-11-03 15:41:43.383 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:41:43.383 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Lisa Webb and her husband, Keith, pedaled away in a tandem bike following their wedding in 2008 in western Arkansas. Photo courtesy [009, 008, 990, 988] of Lisa Webb.') 2023-11-03 15:42:10.771 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:42:10.771 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect digital clock,clothing,tree and man in this image.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'clothing:[000, 459, 999, 992],[746, 834, 845, 998];tree:[501, 353, 557, 483],[575, 247, 651, 297],[750, 118, 925, 508],[880, 074, 999, 572];man:[010, 466, 655, 966],[658, 495, 673, 609],[673, 495, 693, 577],[718, 500, 733, 624],[741, 498, 758, 564],[765, 500, 781, 572],[811, 520, 837, 594],[853, 503, 875, 612],[868, 520, 904, 661],[949, 511, 970, 637].') 2023-11-03 15:42:38.940 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 15:42:38.941 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect sun hat,human face,boy,human head,human hair and human arm in this image.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'sun hat:[004, 595, 215, 682],[201, 590, 312, 629],[626, 000, 941, 173];human face:[680, 089, 861, 200];boy:[328, 065, 875, 873];human head:[251, 837, 462, 982],[058, 603, 180, 696],[201, 590, 293, 654],[654, 000, 917, 206],[849, 518, 920, 576];human hair:[248, 837, 443, 1000],[647, 073, 882, 164],[847, 518, 917, 621];human arm:[387, 120, 610, 306],[448, 187, 814, 395],[868, 575, 974, 679].') 2023-11-03 15:42:52.276 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 15:42:52.276 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Sneakers in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Sneakers:[523, 332, 561, 371].') 2023-11-03 15:43:40.789 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:43:40.789 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'coat:[444, 185, 730, 763],[103, 327, 326, 930],[808, 261, 988, 532];human body:[825, 207, 977, 780],[075, 335, 331, 999],[120, 313, 145, 400],[144, 318, 183, 423],[345, 277, 420, 368],[393, 216, 758, 999],[435, 279, 520, 361],[694, 258, 769, 301];human head:[498, 166, 635, 383],[348, 267, 413, 322],[691, 246, 755, 277],[863, 195, 927, 298];mammal:[461, 174, 761, 896],[013, 250, 230, 427],[078, 215, 338, 999],[348, 275, 416, 379],[433, 269, 520, 360],[692, 251, 758, 296],[802, 193, 996, 766];man:[088, 230, 341, 965],[115, 299, 151, 402],[368, 294, 395, 370],[398, 161, 763, 999].') 2023-11-03 15:44:16.467 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:44:16.467 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Photo showing two Princeton University researchers [517, 043, 920, 644] [139, 068, 521, 640] sitting at a table [004, 589, 993, 996] on which are displayed AA batteries [027, 745, 085, 964] [190, 745, 246, 964] [135, 666, 178, 828] [352, 725, 403, 930] [594, 735, 644, 945] [807, 760, 866, 983] [053, 624, 094, 757] [936, 684, 982, 852] [721, 728, 773, 928] [471, 745, 523, 967] that they use in their research.') 2023-11-03 15:44:23.269 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:44:23.269 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Hat:[792, 477, 872, 541];Person:[774, 479, 926, 733],[588, 465, 628, 574],[426, 448, 450, 519],[267, 461, 297, 566];Storage box:[558, 612, 704, 664];Scale:[085, 434, 178, 559],[350, 469, 395, 526],[649, 475, 703, 576].') 2023-11-03 15:44:24.173 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:44:24.173 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'two cream colored Allure® Transitional Shades with brown Interior Masterpieces® Fabric Cornices in a room behind [394, 672, 592, 912] a small dark desk with [090, 577, 400, 996] matching chairs') 2023-11-03 15:44:28.104 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:44:28.105 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[321, 294, 553, 814] These mini Adirondack chairs are the perfect place for any beach-loving fairy to spend her summer day at the beach in!') 2023-11-03 15:45:09.304 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:45:09.305 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3,frame4 and frame5,please tell me the trajectory of the vehicles in silver.For the trajectories included in the answer, please use the format TrackiFrame t:[xmin,ymin,xmax,ymax].', 'There is one vehicles in silver.Track1frame5:[450, 477, 475, 528].') 2023-11-03 15:45:18.121 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:45:18.121 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Connect with Mermaid Energy Image [340, 215, 780, 969] of mermaid sitting on a big rock [303, 660, 555, 950] be the ocean [002, 337, 317, 995].') 2023-11-03 15:45:27.259 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 15:45:27.259 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Vase:[225, 424, 399, 872],[563, 414, 736, 856],[500, 104, 608, 339];Flower:[118, 112, 466, 473],[535, 142, 797, 458],[511, 001, 628, 121].') 2023-11-03 15:45:35.874 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:45:35.874 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Car:[308, 523, 367, 575],[907, 523, 979, 580];SUV:[374, 508, 436, 574],[447, 507, 509, 571],[581, 508, 639, 559];Sports Car:[652, 519, 712, 573];Van:[781, 509, 851, 570].') 2023-11-03 15:45:43.207 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:45:43.207 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Photo of pool tables [007, 736, 991, 996] [228, 593, 748, 754] in front of an exposed brick wall [005, 004, 992, 534].') 2023-11-03 15:45:52.071 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:45:52.072 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the the one walking on the far right..Stick to the TrackiFrame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'There is one the one walking on the far right..Track1frame:1:[771, 165, 919, 525];frame:2:[788, 173, 928, 512];frame:3:[793, 157, 934, 510].') 2023-11-03 15:46:11.902 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:46:11.902 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect human arm,clothing,mammal and man in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'human arm:[172, 567, 254, 748],[307, 394, 429, 537],[365, 576, 460, 695],[462, 576, 549, 695],[535, 407, 624, 588],[657, 402, 736, 527],[722, 412, 819, 576];clothing:[732, 298, 875, 863],[155, 536, 341, 807],[305, 356, 428, 753],[545, 369, 730, 782];mammal:[713, 299, 876, 821],[149, 450, 348, 795],[312, 265, 436, 750],[366, 472, 581, 826],[538, 285, 740, 834];man:[150, 447, 339, 807],[259, 455, 666, 878],[310, 263, 437, 753],[541, 292, 739, 820],[722, 291, 875, 858].') 2023-11-03 15:46:37.790 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:46:37.790 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Person:[286, 063, 795, 992];Helmet:[442, 068, 608, 268];Gloves:[316, 430, 426, 578],[660, 450, 765, 593];Sneakers:[594, 670, 640, 813].') 2023-11-03 15:47:24.754 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:47:24.755 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Satchel:[525, 806, 795, 950];Person:[753, 229, 964, 739],[586, 370, 869, 690],[465, 389, 602, 561],[303, 355, 482, 552],[248, 372, 309, 436],[002, 233, 584, 1000];Microphone:[780, 298, 824, 337];Speaker:[679, 147, 785, 199];Chair:[372, 664, 952, 999],[573, 619, 1000, 956],[640, 562, 989, 789];Desk:[273, 566, 770, 897],[442, 490, 740, 571];Fan:[696, 024, 976, 160];Lamp:[236, 002, 295, 051];Air Conditioner:[527, 154, 674, 232].') 2023-11-03 15:48:09.188 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:48:09.189 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the a boy leaping on a trampoline and catch a rugby.Maintain consistency in your answer by using the TrackiFrame t:[xmin,ymin,xmax,ymax] format for trajectories.', 'There is one a boy leaping on a trampoline and catch a rugby.Track1frame:1:[675, 277, 843, 491];frame:2:[658, 255, 825, 485];frame:3:[647, 256, 806, 489].') 2023-11-03 15:48:13.114 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:48:13.114 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'chair:[508, 617, 630, 933],[000, 696, 088, 999],[090, 669, 259, 999],[151, 642, 305, 999],[251, 617, 338, 938],[253, 625, 456, 999],[315, 602, 497, 986],[612, 669, 738, 999],[712, 654, 999, 999];table:[027, 682, 375, 985],[439, 630, 609, 982];curtain:[045, 000, 709, 134].') 2023-11-03 15:48:22.543 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:48:22.544 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[307, 481, 400, 744],personFrame1:[550, 433, 639, 740],personFrame1:[573, 486, 719, 754],personFrame1:[454, 405, 560, 773],personFrame1:[385, 402, 517, 791] in this video clip.Adhere strictly to the format categoryFrame t:[xmin,ymin,xmax,ymax] when providing your answer.', 'personFrame1:[307, 481, 400, 744];Frame2:[317, 420, 419, 741],personFrame1:[550, 433, 639, 740];Frame2:[557, 379, 632, 738],personFrame1:[573, 486, 719, 754];Frame2:[603, 386, 712, 751],personFrame1:[454, 405, 560, 773];Frame2:[478, 327, 598, 770],personFrame1:[385, 402, 517, 791];Frame2:[420, 383, 523, 794].') 2023-11-03 15:48:57.011 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:48:57.011 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'poster:[000, 000, 998, 993].') 2023-11-03 15:49:14.295 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 15:49:14.295 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Poster Premium Do small things with big love') 2023-11-03 15:49:32.210 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:49:32.210 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[000, 535, 224, 705],carFrame1:[652, 506, 707, 566],carFrame1:[381, 501, 494, 610],carFrame1:[828, 490, 951, 657],carFrame1:[000, 536, 071, 605] in this video clip.Ensure you use the exact format categoryFrame t:[xmin,ymin,xmax,ymax] in your response.', 'carFrame1[000, 535, 224, 705];Frame2:No object,carFrame1:[652, 506, 707, 566];Frame2:[609, 511, 668, 578],carFrame1:[381, 501, 494, 610];Frame2:[267, 508, 420, 650],carFrame1:[828, 490, 951, 657];Frame2:[885, 498, 999, 717],carFrame1[000, 536, 071, 605];Frame2:No object.') 2023-11-03 15:49:48.042 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:49:48.043 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Shovel:[001, 642, 348, 709];Hat:[546, 161, 709, 222],[335, 185, 462, 247];Pot:[131, 588, 570, 680];bucket:[191, 322, 326, 423];Cup:[212, 495, 265, 536],[258, 492, 314, 529];Storage box:[496, 579, 839, 727],[001, 414, 349, 536];Person:[402, 162, 814, 611],[527, 233, 578, 278],[881, 190, 971, 334],[326, 185, 525, 589],[139, 187, 246, 396],[243, 208, 334, 323].') 2023-11-03 15:50:35.410 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:50:35.411 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Fan:[160, 001, 452, 122];Lamp:[736, 476, 913, 702],[615, 379, 697, 581];Frame:[815, 317, 893, 477],[601, 374, 666, 415],[041, 376, 108, 419];Couch:[592, 579, 932, 826];Chair:[265, 601, 701, 967],[028, 668, 109, 999];Cabinet:[184, 353, 558, 674];Carpet:[080, 689, 297, 841];Vase:[224, 306, 263, 363];Desk:[024, 575, 083, 678];Stool:[088, 644, 176, 693],[083, 621, 165, 674],[081, 597, 159, 622];TV:[301, 397, 439, 475];Side Table:[701, 683, 932, 898].') 2023-11-03 15:50:38.239 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:50:38.240 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[247, 218, 763, 837] and track its trajectory.Any trajectory details should be arranged according to the Frame t:[xmin,ymin,xmax,ymax] format for clarity.', 'This is a tankFrame1:[247, 218, 763, 837];Frame2:[278, 257, 716, 796];Frame3:[281, 306, 726, 848];Frame4:[271, 265, 735, 812];Frame5:[272, 361, 758, 889].') 2023-11-03 15:50:58.486 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:50:58.486 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[624, 366, 663, 435],carFrame1:[000, 385, 140, 558],carFrame1:[094, 366, 230, 512],carFrame1:[350, 335, 537, 599],carFrame1:[515, 386, 556, 460] in this video clip.Do not deviate from the established classFrame t:[xmin,ymin,xmax,ymax] format in your answer.', 'carFrame1[624, 366, 663, 435];Frame2:No object,carFrame1:[000, 385, 140, 558];Frame2:[000, 416, 064, 563],carFrame1:[094, 366, 230, 512];Frame2:[017, 350, 219, 536],carFrame1:[350, 335, 537, 599];Frame2:[343, 315, 553, 616],carFrame1:[515, 386, 556, 460];Frame2:[527, 384, 567, 456].') 2023-11-03 15:51:07.651 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:51:07.651 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Person:[113, 276, 235, 802],[313, 290, 394, 739],[351, 311, 457, 811],[474, 275, 554, 753],[520, 296, 626, 806],[625, 346, 737, 803],[729, 333, 926, 844].') 2023-11-03 15:51:13.535 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 15:51:13.535 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Person:[035, 468, 142, 692],[181, 467, 276, 672],[318, 455, 423, 666],[445, 442, 555, 659],[609, 436, 703, 650],[748, 447, 839, 638],[869, 415, 962, 644].') 2023-11-03 15:52:16.517 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:52:16.518 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[293, 240, 696, 931] and track its trajectory.Ensure that each trajectory within your response is articulated using the Frame t:[xmin,ymin,xmax,ymax] specification.', 'This is a boneshakerFrame1:[293, 240, 696, 931];Frame2:[341, 233, 713, 883];Frame3:[435, 283, 728, 879];Frame4:[500, 333, 710, 847];Frame5:[531, 372, 682, 801].') 2023-11-03 15:52:28.746 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 15:52:28.746 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Stool,Person and Cup in this image.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Stool:[430, 891, 539, 1000];Person:[269, 022, 752, 997],[495, 215, 799, 999];Cup:[121, 644, 208, 757].') 2023-11-03 15:52:29.668 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:52:29.669 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'woman:[554, 114, 750, 656],[000, 385, 133, 601],[013, 191, 141, 456],[103, 257, 211, 601],[412, 237, 495, 672],[450, 181, 606, 749],[499, 491, 740, 863],[843, 118, 971, 410],[891, 147, 999, 989];clothing:[042, 209, 999, 947];human face:[283, 207, 338, 286],[441, 284, 462, 352],[450, 262, 480, 354],[551, 545, 625, 689],[609, 152, 655, 245],[706, 139, 737, 212],[886, 160, 937, 255].') 2023-11-03 15:52:32.884 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:52:32.884 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'suit:[000, 414, 352, 1000];human hair:[042, 179, 216, 416],[175, 434, 266, 532],[543, 343, 625, 446],[960, 411, 999, 495];tree:[904, 045, 995, 338],[466, 000, 751, 707];human arm:[635, 564, 760, 899],[000, 522, 091, 999],[299, 649, 356, 968],[418, 566, 545, 864],[583, 806, 704, 990];human hand:[585, 831, 685, 993],[300, 848, 342, 965],[487, 740, 546, 865],[676, 305, 735, 480];man:[168, 432, 348, 999],[415, 347, 695, 999],[645, 303, 999, 999];human face:[153, 246, 249, 488],[203, 469, 272, 603],[533, 387, 603, 519],[826, 361, 913, 521].') 2023-11-03 15:52:51.331 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:52:51.331 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[193, 022, 431, 933],personFrame1:[825, 385, 868, 552],personFrame1:[459, 366, 503, 556],personFrame1:[001, 002, 354, 1000],personFrame1:[846, 406, 885, 525] in this video clip.Keep your response consistent with the classFrame t:[xmin,ymin,xmax,ymax] format.', 'personFrame1[193, 022, 431, 933];Frame2:No object,personFrame1:[825, 385, 868, 552];Frame2:[826, 381, 864, 556],personFrame1[459, 366, 503, 556];Frame2:No object,personFrame1:[001, 002, 354, 1000];Frame2:[001, 002, 350, 1000],personFrame1:[846, 406, 885, 525];Frame2:[845, 404, 884, 527].') 2023-11-03 15:52:55.431 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 15:52:55.432 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Chair and Desk in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Chair:[101, 347, 505, 997],[143, 202, 366, 507],[492, 182, 712, 624];Desk:[311, 462, 615, 715].') 2023-11-03 15:53:04.954 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:53:04.954 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Oven,Person and Hat in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Oven:[733, 828, 857, 1000];Person:[596, 179, 864, 1000],[536, 307, 607, 485],[308, 284, 631, 999];Hat:[428, 286, 530, 370],[534, 306, 608, 365],[639, 221, 710, 282].') 2023-11-03 15:53:13.829 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:53:13.830 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[396, 168, 408, 181] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a white racing car moving along trackFrame1:[396, 168, 408, 181];Frame2:[380, 172, 392, 188];Frame3:[369, 177, 382, 193];Frame4:[356, 186, 368, 202].') 2023-11-03 15:54:33.765 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:54:33.766 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the horses walking towards the right following the cow, then stop?For clarity, represent trajectories using the TrackiFrame t:[xmin,ymin,xmax,ymax] format in your response.', 'There are 6 horses walking towards the right following the cow, then stop.Track1frame:1:[000, 571, 061, 746];frame:2:[000, 581, 065, 746];frame:3:[000, 555, 085, 727],Track2frame:1:[118, 552, 205, 631];frame:2:[111, 557, 212, 734];frame:3:[109, 539, 235, 689],Track3frame:1:[185, 611, 275, 798];frame:2:[193, 609, 280, 793];frame:3:[209, 593, 288, 774],Track4frame:1:[109, 590, 192, 776];frame:2:[095, 585, 195, 772];frame:3:[096, 572, 214, 753],Track5frame:1:[241, 569, 347, 725];frame:2:[241, 567, 357, 725];frame:3:[225, 546, 355, 713],Track6frame:1:[168, 579, 245, 751];frame:2:[163, 581, 249, 756];frame:3:[128, 562, 228, 736].') 2023-11-03 15:54:48.905 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:54:48.906 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Young woman [001, 180, 535, 918] with glasses [353, 262, 474, 342] taking a selfie on a old spanish street pretty smart-phone trendy joyful brunette outdoors modern beauty style casual attire [254, 488, 516, 918] fun female') 2023-11-03 15:54:58.039 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:54:58.039 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Towel:[100, 694, 235, 767],[613, 473, 751, 583];Glasses:[750, 621, 830, 652];Fork:[001, 928, 279, 961],[671, 939, 764, 1000],[050, 804, 263, 827],[175, 642, 358, 663];Cup:[870, 831, 993, 944],[765, 724, 861, 797],[103, 815, 218, 920],[179, 723, 271, 802],[751, 639, 835, 709],[718, 590, 766, 641];Wine Glass:[781, 762, 895, 973],[392, 441, 439, 501],[329, 465, 394, 542],[504, 660, 634, 788],[348, 628, 461, 709],[337, 707, 478, 834],[504, 768, 653, 989],[388, 792, 529, 998],[361, 814, 492, 997],[276, 803, 397, 1000],[603, 754, 715, 957];Bottle:[236, 569, 273, 614].') 2023-11-03 15:55:40.285 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:55:40.285 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'person:[343, 185, 731, 958].') 2023-11-03 15:55:51.555 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:55:51.555 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[001, 367, 125, 748] [605, 343, 733, 674] [598, 520, 826, 973] [383, 362, 519, 766] [178, 421, 347, 748] Women and [001, 664, 314, 997] [109, 780, 238, 996] children wash [819, 878, 936, 959] [492, 651, 599, 784] clothes in the aftermath of the devastating hurricane.') 2023-11-03 15:56:05.032 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:56:05.034 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Lamp and Chair in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Lamp:[029, 281, 199, 701];Chair:[001, 482, 179, 728].') 2023-11-03 15:56:16.501 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:56:16.501 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect table,man,clothing and cake stand in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'table:[000, 831, 365, 987];man:[452, 138, 950, 962],[000, 383, 335, 998],[302, 265, 678, 998];clothing:[010, 441, 299, 956],[292, 402, 675, 998],[458, 262, 942, 998].') 2023-11-03 15:56:24.865 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:56:24.865 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Devils Harvest Washed Buffalo Plaid Button-Down Shirt at Urban Outfitters') 2023-11-03 15:56:32.691 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:56:32.692 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect Billards,Sandals and Person in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Sandals:[878, 632, 955, 672];Person:[370, 162, 681, 940],[806, 304, 895, 600].') 2023-11-03 15:56:38.217 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:56:38.217 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Chair:[602, 471, 999, 998];Lamp:[039, 117, 252, 877],[076, 364, 167, 436];Desk:[364, 405, 981, 862];Bottle:[387, 522, 443, 601],[424, 515, 473, 591];Cup:[638, 536, 695, 572];Moniter:[425, 328, 697, 485],[675, 338, 883, 488];Keyboard:[467, 569, 711, 609].') 2023-11-03 15:56:53.530 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 15:56:53.530 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Person:[001, 085, 961, 1000];Bracelet:[712, 299, 804, 346];Necklace:[077, 419, 150, 491];Glasses:[062, 224, 236, 275];Lamp:[331, 100, 513, 284],[485, 262, 564, 318];Mirror:[370, 248, 584, 386];Desk:[520, 521, 712, 679];Chair:[350, 563, 541, 691];Blender:[536, 450, 999, 999];Cutting:[360, 683, 659, 775];Grape:[793, 220, 937, 374].') 2023-11-03 15:57:44.106 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:57:44.106 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'chair:[416, 524, 567, 876],[001, 582, 216, 984],[190, 615, 365, 996],[252, 380, 319, 430],[325, 359, 383, 461],[361, 413, 426, 454],[424, 404, 473, 467],[496, 434, 520, 498],[535, 481, 611, 792],[595, 390, 671, 567],[621, 397, 678, 507],[671, 467, 999, 999],[707, 348, 748, 467],[743, 348, 771, 455],[766, 373, 900, 593],[915, 428, 999, 561];clothing:[154, 512, 338, 845],[000, 431, 083, 575],[056, 510, 193, 762],[256, 337, 329, 462],[381, 361, 439, 433],[386, 506, 538, 698],[425, 365, 511, 481],[510, 460, 571, 637],[621, 337, 680, 447],[691, 327, 753, 417],[827, 401, 906, 527],[854, 329, 925, 382],[876, 181, 909, 265],[906, 384, 990, 465];woman:[060, 399, 213, 722],[168, 389, 360, 830],[386, 322, 440, 416],[389, 407, 526, 784],[426, 316, 520, 451],[505, 376, 586, 620],[522, 318, 599, 458],[582, 303, 611, 386],[736, 303, 781, 401],[771, 301, 799, 378],[878, 166, 913, 309];table:[003, 383, 081, 482],[169, 452, 445, 739],[305, 345, 385, 425],[491, 366, 621, 453],[673, 330, 704, 403],[873, 385, 999, 489],[876, 559, 999, 845].') 2023-11-03 15:58:05.597 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:58:05.598 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'night [430, 209, 768, 994] *david rockefeller sr behind microphone [309, 430, 514, 732] reading script about the un flags around the center teenage mutant ninja turtle dressed as santa bg - rockefeller center video stock e b–roll') 2023-11-03 15:58:36.037 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:58:36.037 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[008, 006, 987, 992] British newspaper completely botches the spelling of two Canadian cities') 2023-11-03 15:59:08.371 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 15:59:08.372 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Truck in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Truck:[001, 562, 075, 634].') 2023-11-03 15:59:34.884 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:59:34.885 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'poster:[000, 047, 357, 635],[355, 046, 771, 633],[740, 018, 999, 628];woman:[000, 781, 055, 847],[000, 824, 120, 956],[740, 826, 882, 996],[875, 833, 930, 946];man:[170, 386, 878, 999],[745, 817, 905, 999];clothing:[000, 855, 102, 948],[105, 283, 938, 420],[255, 450, 856, 999],[695, 855, 737, 920],[758, 848, 799, 913],[785, 857, 886, 953],[877, 850, 913, 937].') 2023-11-03 15:59:54.144 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 15:59:54.144 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[000, 528, 358, 700] and track its trajectory.Ensure that each trajectory within your response is articulated using the Frame t:[xmin,ymin,xmax,ymax] specification.', 'This is a droneFrame1:[000, 528, 358, 700];Frame2:[175, 603, 568, 778];Frame3:[237, 650, 632, 825];Frame4:[266, 363, 665, 568];Frame5:[180, 387, 575, 592].') 2023-11-03 16:00:03.289 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:00:03.289 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[551, 487, 659, 797],personFrame1:[335, 473, 428, 786],personFrame1:[696, 512, 744, 789],personFrame1:[460, 458, 558, 789],personFrame1:[266, 497, 354, 794] in this video clip.When answering, maintain the format classFrame t:[xmin,ymin,xmax,ymax] without deviation.', 'personFrame1:[551, 487, 659, 797];Frame2:[531, 489, 657, 792],personFrame1:[335, 473, 428, 786];Frame2:[340, 495, 432, 784],personFrame1:[696, 512, 744, 789];Frame2:[692, 506, 745, 791],personFrame1:[460, 458, 558, 789];Frame2:[463, 475, 557, 788],personFrame1:[266, 497, 354, 794];Frame2:[287, 511, 357, 793].') 2023-11-03 16:00:05.708 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:00:05.709 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'two glasses of paleo sweet mint iced tea on a tray [003, 433, 994, 808] with fresh mint leaves [533, 241, 836, 434] and purple flowers [754, 292, 994, 544] behind') 2023-11-03 16:00:13.779 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:00:13.780 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Why Autumn is the best time to treat varicose veins [030, 728, 965, 958] main image') 2023-11-03 16:00:34.011 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 16:00:34.011 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Lamp:[780, 204, 854, 271];Person:[745, 379, 919, 997],[381, 458, 533, 1000],[326, 488, 425, 999],[235, 453, 348, 977],[164, 471, 297, 999],[080, 472, 187, 1000],[038, 546, 107, 912],[001, 522, 066, 868].') 2023-11-03 16:00:41.733 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:00:41.733 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Fan,Helmet and Person in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Fan:[288, 026, 354, 087];Helmet:[206, 218, 272, 290],[516, 182, 582, 247],[725, 325, 789, 415],[909, 301, 944, 337],[453, 285, 491, 330];Person:[097, 342, 129, 412],[001, 427, 028, 559],[060, 391, 146, 583],[169, 502, 207, 577],[237, 287, 298, 593],[065, 220, 283, 787],[291, 385, 366, 569],[325, 392, 400, 572],[373, 331, 411, 551],[405, 351, 442, 541],[372, 287, 567, 692],[370, 182, 668, 790],[550, 326, 800, 807],[812, 376, 848, 537],[811, 299, 943, 673],[948, 430, 969, 527],[978, 429, 999, 526],[790, 381, 814, 440],[564, 325, 614, 565].') 2023-11-03 16:00:59.684 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:00:59.684 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Carriage:[148, 292, 828, 806],[689, 274, 1000, 788];Person:[040, 406, 132, 786],[169, 518, 234, 766],[238, 571, 290, 766],[299, 595, 341, 727],[316, 399, 370, 764],[343, 387, 409, 771],[690, 655, 705, 747];Sneakers:[040, 748, 062, 785],[076, 755, 111, 784],[180, 742, 198, 767],[202, 740, 234, 766],[321, 740, 340, 770],[341, 743, 370, 772],[383, 740, 401, 772],[480, 683, 502, 714];Boots:[253, 682, 286, 778];Leather Shoes:[387, 742, 423, 780],[421, 742, 450, 779];Horse:[149, 373, 532, 808],[693, 427, 828, 781];Flower:[524, 326, 567, 382],[671, 314, 708, 368];Bench:[964, 429, 1000, 497];Chair:[826, 434, 903, 516].') 2023-11-03 16:01:30.197 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:01:30.197 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '3D rendering of a compass with [496, 058, 917, 515] a Motivation icon.') 2023-11-03 16:01:45.952 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:01:45.952 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[850, 272, 914, 374],[923, 319, 945, 367],[747, 285, 771, 324],[614, 318, 630, 345],[636, 325, 647, 345],[608, 323, 616, 336],[000, 330, 022, 671],[012, 309, 039, 384];Other Shoes:[000, 649, 023, 669];Glasses:[749, 310, 768, 321];Chair:[749, 322, 800, 383],[809, 329, 854, 390];Bus:[045, 061, 821, 943];Van:[702, 180, 999, 727],[589, 268, 999, 373];Car:[004, 370, 054, 570].') 2023-11-03 16:01:59.764 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:01:59.764 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Person:[530, 241, 936, 995],[151, 530, 489, 999];Guitar:[129, 035, 425, 1000];Microphone:[419, 310, 655, 475].') 2023-11-03 16:02:15.401 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:02:15.402 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'two fbi agent [484, 008, 960, 995] [044, 129, 506, 998] open the back of car and looking here from the scene of mindhunter season 2') 2023-11-03 16:02:17.022 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:02:17.023 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[447, 762, 511, 838],carFrame1:[169, 760, 233, 844],carFrame1:[428, 759, 467, 825],carFrame1:[237, 735, 271, 803],carFrame1:[624, 750, 814, 948] in this video clip.For clarity in your response, always adopt the classFrame t:[xmin,ymin,xmax,ymax] format.', 'carFrame1:[447, 762, 511, 838];Frame2:[490, 767, 578, 865],carFrame1:[169, 760, 233, 844];Frame2:[101, 762, 206, 882],carFrame1:[428, 759, 467, 825];Frame2:[459, 762, 513, 849],carFrame1:[237, 735, 271, 803];Frame2:[223, 736, 269, 811],carFrame1:[624, 750, 814, 948];Frame2:[931, 858, 1000, 1000].') 2023-11-03 16:02:28.768 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:02:28.769 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect microphone,human face,fashion accessory,suit and person in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'microphone:[063, 301, 291, 523];human face:[506, 166, 613, 491];fashion accessory:[545, 532, 595, 652];suit:[190, 444, 866, 999];person:[118, 063, 867, 999].') 2023-11-03 16:02:38.176 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:02:38.177 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[239, 236, 328, 624] [261, 258, 464, 752] [863, 207, 996, 686] [131, 176, 278, 734] [323, 261, 601, 829] Young boys playing football on [006, 478, 993, 994] the Juhu beach in Mumbai on a Sunday morning') 2023-11-03 16:03:26.442 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:03:26.442 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[220, 169, 687, 809] and track its trajectory.All trajectories in your reply should conform to the Frame t:[xmin,ymin,xmax,ymax] pattern.', 'This is a skidderFrame1:[220, 169, 687, 809];Frame2:[265, 088, 658, 658];Frame3:[298, 175, 641, 679];Frame4:[281, 204, 587, 651];Frame5:[284, 203, 561, 621].') 2023-11-03 16:03:28.925 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:03:28.925 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'building:[000, 429, 433, 811],[373, 122, 999, 798];car:[000, 774, 161, 906],[032, 785, 334, 923],[532, 794, 655, 870],[657, 800, 738, 855],[716, 801, 913, 947].') 2023-11-03 16:03:40.376 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:03:40.376 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Person:[391, 710, 499, 999],[292, 737, 379, 1000],[201, 467, 255, 702],[267, 543, 316, 703];Handbag:[604, 467, 638, 533].') 2023-11-03 16:03:48.547 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:03:48.548 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Image of [372, 266, 565, 613] a transparent ball sat in [006, 006, 995, 998] a lush green valley') 2023-11-03 16:03:53.730 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:03:53.731 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[195, 407, 625, 639] and track its trajectory.To maintain consistency, ensure that trajectories in your answer match the Frame t:[xmin,ymin,xmax,ymax] setup.', 'This is a guard shipFrame1:[195, 407, 625, 639];Frame2:[216, 396, 648, 639];Frame3:[238, 402, 669, 639];Frame4:[259, 402, 690, 639];Frame5:[281, 402, 710, 639].') 2023-11-03 16:03:56.386 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:03:56.386 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[141, 372, 175, 500] [191, 388, 230, 515] A homeowners association in Chandler poured a lot of water on the green spaces it manages and still had brown and bare areas.') 2023-11-03 16:04:06.941 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:04:06.941 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[429, 459, 487, 757] and track its trajectory.Incorporate the Frame t:[xmin,ymin,xmax,ymax] format when presenting trajectories in your response.', 'This is a snowmobilFrame1:[429, 459, 487, 757];Frame2:[459, 486, 536, 814];Frame3:[542, 342, 640, 622];Frame4:[667, 450, 707, 597];Frame5:[693, 400, 719, 500].') 2023-11-03 16:04:13.436 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 16:04:13.437 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Bottle,bucket and Person in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Bottle:[185, 238, 346, 366];bucket:[083, 533, 183, 603];Person:[607, 218, 940, 1000],[183, 290, 530, 1000].') 2023-11-03 16:04:22.967 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:04:22.967 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Cell Phone:[725, 380, 760, 400];Person:[072, 369, 167, 641],[114, 248, 262, 573],[369, 420, 538, 682],[485, 280, 665, 678],[672, 286, 832, 635],[833, 229, 958, 644];Stuffed Toy:[389, 207, 565, 580];Trash bin Can:[264, 342, 310, 365],[352, 339, 376, 358];Sandals:[070, 622, 118, 640],[120, 620, 158, 640],[392, 660, 412, 675],[410, 664, 440, 681];Leather Shoes:[154, 553, 199, 574],[192, 546, 260, 564];Glasses:[158, 268, 204, 284],[710, 310, 756, 329];Hat:[444, 207, 566, 275];Belt:[437, 375, 503, 392],[864, 399, 944, 414];Gloves:[387, 394, 430, 422];Other Shoes:[481, 549, 550, 583],[454, 556, 469, 582],[764, 595, 795, 614],[699, 616, 750, 635];Sneakers:[534, 628, 571, 648],[509, 648, 601, 676],[841, 592, 930, 624],[833, 610, 930, 644].') 2023-11-03 16:04:25.677 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:04:25.677 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[000, 436, 198, 737],carFrame1:[886, 467, 988, 527],carFrame1:[428, 483, 557, 667] in this video clip.Keep your response consistent with the classFrame t:[xmin,ymin,xmax,ymax] format.', 'carFrame1:[000, 436, 198, 737];Frame2:[000, 434, 172, 803],carFrame1:[886, 467, 988, 527];Frame2:[912, 475, 967, 541],carFrame1:[428, 483, 557, 667];Frame2:[439, 486, 604, 696].') 2023-11-03 16:04:53.848 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 16:04:53.848 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Storage box:[710, 243, 814, 319];bucket:[435, 517, 486, 598],[078, 477, 171, 610];Person:[405, 378, 505, 588],[690, 372, 929, 834].') 2023-11-03 16:04:56.623 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:04:56.624 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'crumble dough [100, 057, 759, 772] for gluten-free shortbread crust in a pan') 2023-11-03 16:05:03.838 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:05:03.839 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect man,human face,human arm,human hand and human head in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'man:[010, 067, 557, 963],[471, 169, 725, 999];human face:[058, 046, 271, 406],[517, 232, 589, 361];human arm:[262, 410, 581, 732],[573, 627, 681, 790],[631, 637, 726, 694];human hand:[469, 458, 618, 767],[382, 952, 499, 999],[633, 699, 683, 798],[683, 617, 728, 699];human head:[497, 172, 615, 371],[005, 000, 266, 434].') 2023-11-03 16:05:20.636 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:05:20.637 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Person:[053, 202, 953, 799],[611, 180, 771, 470],[712, 171, 857, 462],[341, 236, 464, 476],[250, 245, 370, 484],[001, 000, 117, 048];Helmet:[450, 202, 619, 282];Hat:[278, 247, 348, 282],[395, 238, 445, 271],[117, 282, 161, 301],[675, 182, 719, 200],[781, 173, 820, 191],[900, 196, 941, 214];Sneakers:[049, 478, 117, 503],[775, 720, 947, 785],[052, 690, 148, 799];Belt:[417, 465, 574, 512];Baseball Bat:[142, 245, 536, 351],[167, 276, 205, 396].') 2023-11-03 16:06:01.749 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:06:01.749 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Caleb Carr picture quote - Every human being must find his own way to cope with.. - [070, 107, 929, 836] Friendship quotes') 2023-11-03 16:06:06.438 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:06:06.440 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Tricycle:[880, 446, 1000, 755],[668, 389, 872, 785],[755, 374, 882, 689],[650, 354, 773, 627],[683, 344, 768, 382],[562, 329, 668, 571],[546, 313, 612, 526],[483, 305, 566, 481],[425, 293, 498, 463],[360, 276, 425, 434],[317, 269, 386, 408],[225, 258, 304, 398],[196, 322, 273, 377];Motorcycle:[000, 235, 030, 334];Bicycle:[000, 400, 035, 529],[000, 514, 033, 595];Other Shoes:[769, 735, 795, 765];Bench:[914, 569, 999, 678],[694, 494, 772, 606],[570, 386, 622, 442],[493, 353, 543, 396],[437, 337, 466, 376],[369, 314, 415, 357],[332, 307, 364, 350],[235, 295, 270, 331],[670, 391, 718, 457];Desk:[662, 422, 754, 497];Person:[763, 426, 844, 768],[459, 319, 494, 441],[269, 276, 302, 374],[000, 310, 040, 496];Awning:[919, 446, 1000, 527].') 2023-11-03 16:06:17.413 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:06:17.413 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Three students [500, 298, 635, 998] [567, 083, 834, 994] [326, 367, 525, 998] stand outside together in their lab coats [500, 486, 628, 998] [571, 327, 834, 998] [327, 532, 526, 997]. All three are wearing bowties [537, 486, 610, 552], and two have wigs [593, 084, 742, 318] [511, 298, 634, 484] on and are dressed as "wacky scientists."') 2023-11-03 16:06:28.193 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:06:28.193 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect fox,man and dress in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'man:[757, 291, 976, 781],[110, 291, 287, 536],[443, 303, 748, 773];dress:[030, 254, 655, 1000],[000, 422, 062, 728].') 2023-11-03 16:07:06.856 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:07:06.857 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect woman,man,human face and suit in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[465, 332, 905, 999];man:[100, 000, 490, 999];human face:[691, 396, 806, 589],[252, 060, 384, 302];suit:[081, 099, 464, 990].') 2023-11-03 16:07:51.646 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:07:51.646 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect table,man,clothing,woman and canary in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'table:[260, 576, 651, 795],[000, 437, 069, 674];man:[054, 383, 460, 987],[067, 289, 174, 459],[216, 319, 395, 655],[222, 255, 276, 433],[338, 253, 412, 328],[346, 317, 449, 568],[415, 253, 509, 517],[418, 361, 854, 998],[533, 308, 694, 534],[626, 330, 881, 679],[742, 244, 809, 356],[802, 249, 865, 384],[844, 249, 983, 545];clothing:[851, 286, 987, 621],[054, 272, 088, 359],[063, 557, 463, 992],[079, 336, 183, 428],[209, 275, 265, 436],[209, 401, 440, 792],[264, 211, 326, 319],[329, 267, 414, 355],[346, 397, 457, 577],[415, 280, 501, 437],[438, 471, 860, 998],[539, 370, 706, 565],[560, 274, 665, 423],[738, 280, 802, 364],[773, 422, 869, 610],[794, 302, 887, 415];woman:[073, 286, 169, 462],[274, 266, 348, 422],[494, 222, 522, 303],[571, 232, 604, 309],[659, 281, 764, 417],[895, 253, 948, 330].') 2023-11-03 16:07:56.299 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:07:56.299 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Umbrella in this image.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Umbrella:[045, 634, 117, 703],[243, 635, 299, 684],[365, 630, 420, 677],[672, 619, 737, 677],[765, 615, 817, 690],[840, 610, 910, 679],[918, 604, 1000, 677].') 2023-11-03 16:08:03.472 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:08:03.472 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Bottle:[488, 831, 549, 976],[368, 702, 414, 862],[382, 651, 444, 795],[161, 777, 210, 903],[049, 835, 103, 956],[609, 834, 659, 996],[590, 689, 647, 865];Cup:[107, 738, 155, 840],[131, 731, 185, 818];Person:[653, 402, 923, 891],[381, 441, 548, 743],[031, 483, 273, 810],[894, 378, 999, 799];Canned:[212, 845, 255, 942];Desk:[002, 710, 714, 999];Lamp:[278, 896, 370, 969];Chair:[793, 816, 924, 999].') 2023-11-03 16:08:38.304 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:08:38.305 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Towel:[276, 312, 380, 351];Cabinet:[001, 144, 086, 375],[071, 022, 357, 210],[124, 205, 279, 325],[274, 199, 344, 304],[384, 211, 619, 347],[642, 218, 734, 370],[734, 231, 829, 386],[826, 246, 942, 391],[018, 646, 210, 905],[318, 681, 483, 875],[846, 777, 991, 875],[826, 610, 972, 706],[831, 699, 993, 754],[694, 592, 804, 751],[498, 575, 646, 677],[502, 676, 653, 753],[656, 794, 747, 873],[745, 794, 831, 862],[511, 786, 558, 874],[589, 788, 637, 867],[505, 894, 672, 984],[253, 850, 479, 982];Mirror:[759, 006, 929, 156];Carpet:[064, 427, 712, 573],[001, 926, 310, 996];Pillow:[768, 391, 999, 458],[660, 419, 856, 494];Couch:[618, 333, 1000, 575];Nightstand:[926, 914, 995, 979],[868, 900, 941, 964];Moniter:[451, 110, 607, 226],[283, 723, 380, 867];Desk:[200, 312, 589, 533],[682, 912, 855, 979].') 2023-11-03 16:09:25.223 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:09:25.224 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'man:[021, 000, 993, 1000];glasses:[306, 147, 731, 223];tree:[682, 236, 879, 437],[860, 218, 998, 468];human face:[314, 049, 685, 397].') 2023-11-03 16:09:56.300 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:09:56.300 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[251, 060, 625, 697];Sandals:[299, 584, 397, 695],[404, 624, 506, 697];Glasses:[514, 161, 572, 262];Drying Machine:[084, 221, 297, 634];Towel:[066, 173, 112, 570],[128, 118, 190, 228];Toilet Paper:[190, 122, 219, 166];Tissue:[234, 119, 269, 182];Cleaning Products:[106, 010, 138, 122];Cup:[572, 526, 651, 670],[189, 122, 219, 163];Bottle:[281, 148, 304, 253],[268, 141, 284, 239],[160, 025, 180, 149],[140, 025, 170, 147],[106, 011, 138, 124],[843, 043, 867, 146];Basin:[851, 087, 875, 121];Dishwasher:[775, 228, 911, 633];Oven:[860, 012, 949, 594];shelf:[825, 144, 860, 563],[697, 367, 735, 513],[946, 001, 1000, 592];Frame:[708, 163, 724, 197].') 2023-11-03 16:12:09.166 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:12:09.166 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Speaker and Person in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Speaker:[003, 338, 155, 452],[003, 444, 149, 612];Person:[519, 271, 894, 713].') 2023-11-03 16:12:54.658 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 16:12:54.658 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'footwear:[543, 656, 662, 757],[335, 957, 417, 999];person:[157, 117, 836, 999].') 2023-11-03 16:13:31.646 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:13:31.647 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Man [358, 372, 512, 985] wearing a face mask [428, 425, 476, 478] or covering due to the COVID-19 pandemic, walks past the closed Donelon's Bar [319, 007, 876, 997] in the rural village of Dunmore, west of...") 2023-11-03 16:14:57.496 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:14:57.496 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Person:[289, 903, 405, 999].') 2023-11-03 16:15:13.560 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:15:13.560 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[087, 279, 222, 951],[171, 254, 321, 911],[249, 175, 460, 1000],[486, 214, 622, 932].') 2023-11-03 16:15:15.430 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 16:15:15.430 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Chair:[920, 469, 998, 850];Drum:[300, 219, 409, 333];Person:[673, 028, 961, 936],[515, 309, 643, 853],[387, 269, 575, 865],[632, 325, 784, 824];Bracelet:[713, 214, 741, 265];Book:[652, 385, 792, 468];Leather Shoes:[180, 803, 233, 911],[572, 812, 603, 855],[597, 797, 644, 845],[755, 784, 784, 842],[348, 798, 408, 861];Sandals:[708, 776, 734, 823].') 2023-11-03 16:15:46.912 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:15:46.913 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Being a single 24/7 full time dad [138, 252, 973, 825] you'd think my social life would be booming....you'd be mistaken...oh the lonely life of a handsome unicorn 🦄😭") 2023-11-03 16:15:55.094 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:15:55.095 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Slide:[301, 411, 496, 818].') 2023-11-03 16:16:07.464 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:16:07.464 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'food:[438, 533, 578, 661];man:[145, 042, 457, 953],[166, 139, 239, 631],[413, 121, 488, 541],[511, 181, 522, 358],[517, 183, 539, 383],[525, 140, 583, 414],[536, 100, 725, 847],[668, 107, 745, 551],[725, 141, 767, 463],[741, 033, 884, 602],[812, 129, 999, 758];jeans:[238, 564, 420, 969],[428, 333, 480, 508],[580, 508, 683, 794],[685, 451, 718, 535],[733, 306, 755, 454];shorts:[000, 412, 096, 537],[778, 346, 854, 485];woman:[000, 203, 126, 658],[062, 179, 231, 724],[368, 188, 400, 265],[388, 199, 415, 327],[470, 171, 513, 443],[575, 179, 614, 262],[810, 088, 945, 590];tree:[673, 000, 999, 145];footwear:[815, 698, 878, 762];human face:[083, 189, 125, 270],[171, 145, 208, 215],[251, 000, 337, 172],[453, 138, 475, 203],[613, 158, 669, 265],[812, 076, 855, 140].') 2023-11-03 16:16:16.140 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:16:16.141 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'laptop:[531, 533, 694, 693];person:[742, 390, 938, 909],[000, 654, 225, 999],[039, 361, 416, 852],[417, 376, 635, 847],[439, 652, 643, 936],[576, 689, 818, 999],[825, 683, 999, 999];clothing:[000, 804, 223, 984],[036, 479, 358, 850],[420, 485, 588, 835],[445, 818, 631, 939],[616, 863, 815, 999],[716, 508, 932, 914],[854, 850, 999, 999].') 2023-11-03 16:16:20.898 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:16:20.898 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given an image,please detect person in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'person:[762, 662, 834, 933],[700, 674, 781, 935],[548, 674, 631, 946],[644, 637, 691, 807],[681, 638, 736, 801],[538, 666, 598, 872],[476, 670, 548, 946],[001, 729, 155, 1000],[348, 694, 483, 1000],[295, 692, 372, 1000],[255, 694, 336, 1000],[140, 677, 273, 1000],[091, 687, 186, 1000],[134, 662, 169, 801],[209, 659, 259, 862],[220, 666, 288, 881],[268, 657, 323, 824],[372, 655, 438, 912],[412, 664, 493, 994],[293, 640, 329, 766],[322, 646, 356, 781],[343, 644, 383, 787],[370, 642, 405, 775],[518, 648, 565, 805],[493, 640, 533, 761],[588, 648, 630, 796],[561, 640, 598, 762],[622, 625, 654, 740],[586, 629, 622, 724],[469, 622, 502, 725],[083, 683, 173, 1000],[227, 957, 347, 1000],[425, 600, 606, 651].') 2023-11-03 16:16:22.782 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:16:22.782 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Hat in this image.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Hat:[348, 457, 456, 538].') 2023-11-03 16:17:15.182 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:17:15.183 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Cup:[572, 895, 648, 1000],[850, 844, 928, 1000],[866, 801, 937, 970];Plate:[648, 943, 854, 1000];Frame:[833, 001, 1000, 100];Chair:[329, 567, 538, 1000],[339, 362, 404, 479];Desk:[000, 318, 369, 753];Person:[005, 199, 319, 1000],[182, 537, 467, 999],[350, 222, 531, 622],[460, 153, 645, 353],[403, 131, 713, 1000],[669, 164, 798, 957],[760, 110, 842, 383],[717, 152, 1000, 962],[931, 630, 999, 950];Tie:[124, 423, 209, 745],[766, 428, 855, 832],[525, 520, 649, 918],[004, 493, 198, 834];Handbag:[670, 341, 759, 833];Bow Tie:[296, 764, 345, 846].') 2023-11-03 16:17:30.053 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:17:30.054 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Desk,Trash bin Can,Bench and Cleaning Products in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Desk:[588, 670, 849, 770],[184, 545, 216, 593],[856, 578, 891, 644],[559, 558, 640, 605],[432, 555, 513, 613];Trash bin Can:[004, 691, 101, 780];Bench:[493, 732, 692, 777],[825, 602, 883, 645].') 2023-11-03 16:18:08.097 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:18:08.097 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Bus:[187, 186, 856, 811],[501, 283, 1000, 611],[002, 417, 096, 550].') 2023-11-03 16:18:34.349 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:18:34.350 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Person:[144, 299, 280, 938],[002, 275, 108, 1000],[161, 266, 599, 999],[437, 176, 944, 1000];Cup:[352, 771, 411, 908];Lamp:[296, 203, 387, 292],[810, 211, 872, 359];Tie:[563, 528, 644, 820],[405, 583, 478, 1000].') 2023-11-03 16:18:36.167 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:18:36.168 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Lamp:[378, 041, 626, 256];Desk:[292, 644, 547, 926],[753, 559, 897, 602];Chair:[001, 452, 385, 900],[681, 466, 998, 891],[563, 439, 785, 693];Flower:[604, 591, 870, 890];Vase:[709, 847, 801, 898];Cup:[375, 594, 421, 654];Wine Glass:[795, 542, 829, 601];Pillow:[612, 505, 719, 580].') 2023-11-03 16:19:52.547 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 16:19:52.548 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Keyboard,Bed,Lamp,TV and Glasses in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Keyboard:[001, 378, 054, 424];Bed:[328, 555, 731, 1000];Lamp:[378, 001, 512, 208],[135, 001, 288, 119];TV:[706, 345, 754, 435];Glasses:[272, 428, 326, 492].') 2023-11-03 16:20:03.176 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:20:03.177 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'human body:[686, 117, 850, 863],[168, 082, 841, 872];man:[561, 130, 701, 810],[163, 081, 449, 892],[321, 165, 443, 778],[400, 159, 563, 790],[660, 145, 850, 831];footwear:[221, 780, 278, 881],[329, 689, 367, 752],[384, 709, 428, 785],[476, 704, 508, 787],[528, 665, 562, 739],[660, 754, 717, 831],[728, 777, 765, 858];human arm:[166, 255, 311, 464],[296, 236, 399, 390],[399, 294, 462, 454],[428, 248, 472, 333],[503, 337, 588, 425],[620, 256, 704, 445],[698, 292, 846, 493];human hair:[395, 143, 443, 209],[445, 144, 501, 208],[580, 126, 644, 193];human face:[245, 123, 301, 208],[389, 158, 436, 249],[445, 163, 504, 268],[591, 158, 640, 257],[700, 185, 761, 305].') 2023-11-03 16:20:04.900 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:20:04.900 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'man:[650, 468, 934, 851],[000, 562, 072, 745],[026, 424, 640, 837],[971, 553, 999, 784];clothing:[487, 568, 658, 816],[000, 588, 062, 726],[049, 478, 324, 810],[054, 615, 113, 726],[193, 473, 393, 830],[329, 590, 496, 779],[534, 579, 625, 625],[555, 476, 832, 813],[702, 476, 899, 820],[860, 617, 933, 765],[925, 601, 976, 738],[974, 559, 999, 776];woman:[041, 581, 116, 738],[147, 436, 446, 844],[920, 583, 986, 751];building:[000, 000, 155, 553],[122, 000, 774, 616],[764, 000, 999, 754].') 2023-11-03 16:20:25.686 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:20:25.686 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'man:[108, 218, 341, 971],[000, 286, 030, 422],[078, 301, 109, 415],[352, 274, 455, 666],[420, 285, 463, 543],[438, 246, 565, 739],[561, 335, 595, 412],[600, 293, 661, 529],[643, 298, 681, 490],[658, 281, 719, 519],[865, 296, 909, 458];clothing:[113, 252, 330, 924],[000, 306, 030, 415],[042, 308, 070, 411],[077, 316, 106, 410],[353, 319, 451, 665],[421, 313, 458, 470],[438, 300, 558, 725],[598, 312, 658, 522],[641, 315, 675, 475],[653, 300, 715, 498],[862, 308, 906, 451];sports equipment:[433, 466, 451, 534],[456, 593, 502, 724],[501, 539, 540, 705],[654, 355, 670, 418],[666, 442, 688, 505],[689, 433, 712, 504].') 2023-11-03 16:20:46.029 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 16:20:46.029 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Backpack:[620, 325, 786, 712],[228, 276, 396, 601];Boots:[573, 835, 651, 966],[653, 910, 717, 999],[357, 893, 454, 999];Person:[228, 084, 513, 999],[555, 155, 781, 1000],[515, 178, 673, 968];Car:[686, 002, 999, 133].') 2023-11-03 16:22:27.003 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:22:27.005 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'topping [409, 079, 632, 231] [682, 190, 922, 351] [747, 014, 971, 167] [852, 483, 997, 656] [544, 353, 786, 526] [001, 257, 123, 422] [177, 388, 423, 552] [219, 215, 455, 382] mini cheesecakes with cherry pie filling on a marble countertop') 2023-11-03 16:22:50.158 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:22:50.158 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the a person cutting a bucket in this video clip?When detailing trajectories in your response, adhere to the TrackiFrame t:[xmin,ymin,xmax,ymax] format.', 'There is one a person cutting a bucket.Track1frame:1:[024, 002, 998, 997];frame:2:[014, 002, 998, 997];frame:3:[012, 002, 998, 997].') 2023-11-03 16:23:53.980 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:23:53.981 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[621, 555, 670, 796],personFrame1:[376, 600, 417, 788],personFrame1:[591, 590, 631, 790],personFrame1:[184, 572, 259, 792],personFrame1:[644, 526, 717, 800],personFrame1:[235, 450, 319, 807],personFrame1:[310, 517, 372, 800],personFrame1:[917, 439, 999, 803],personFrame1:[686, 564, 759, 798],personFrame1:[432, 425, 567, 812] in this video clip.When answering, maintain the format classFrame t:[xmin,ymin,xmax,ymax] without deviation.', 'personFrame1:[621, 555, 670, 796];Frame2:[623, 548, 669, 797],personFrame1:[376, 600, 417, 788];Frame2:[377, 594, 416, 790],personFrame1:[591, 590, 631, 790];Frame2:[589, 570, 630, 790],personFrame1:[184, 572, 259, 792];Frame2:[186, 550, 258, 792],personFrame1:[644, 526, 717, 800];Frame2:[647, 522, 705, 801],personFrame1:[235, 450, 319, 807];Frame2:[230, 449, 317, 809],personFrame1:[310, 517, 372, 800];Frame2:[299, 505, 369, 800],personFrame1:[917, 439, 999, 803];Frame2:[906, 454, 999, 801],personFrame1:[686, 564, 759, 798];Frame2:[707, 557, 758, 798],personFrame1:[432, 425, 567, 812];Frame2:[434, 426, 564, 813].') 2023-11-03 16:24:25.246 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 16:24:25.246 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Flag:[313, 171, 344, 344];Person:[306, 417, 364, 595],[391, 414, 443, 493],[441, 400, 492, 492];Boat:[054, 064, 933, 760].') 2023-11-03 16:24:42.023 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:24:42.024 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Microphone:[172, 464, 193, 512],[217, 257, 235, 320],[207, 336, 237, 386],[100, 363, 134, 385],[601, 486, 628, 506],[925, 628, 964, 648];Tripod:[901, 647, 978, 829];Speaker:[939, 548, 999, 705],[764, 799, 906, 929],[562, 832, 709, 966],[283, 872, 403, 966];Person:[556, 415, 788, 904],[738, 499, 785, 678],[814, 498, 890, 703],[350, 261, 477, 441],[262, 267, 346, 708],[093, 454, 245, 837];Lamp:[364, 429, 578, 725],[125, 254, 199, 409],[174, 175, 252, 314],[347, 182, 421, 299],[000, 465, 069, 770],[003, 419, 050, 469];Hat:[284, 268, 321, 299];Sneakers:[317, 657, 333, 689],[291, 657, 306, 690],[559, 855, 587, 899];Pickup Truck:[714, 401, 999, 703];Van:[744, 345, 988, 495];Car:[699, 372, 756, 496];Truck:[676, 240, 988, 440],[337, 259, 700, 633];Piano:[074, 603, 333, 890];Cymbal:[028, 407, 117, 446],[146, 392, 229, 425];Drum:[243, 415, 311, 608],[200, 402, 264, 599],[079, 471, 137, 570];Guitar:[277, 382, 346, 422];Saxophone:[275, 659, 308, 867],[127, 711, 156, 847].') 2023-11-03 16:24:51.267 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:24:51.267 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A man [384, 009, 641, 992], Tony Heaton, is sat in a wheelchair [384, 512, 660, 994] with a prosthetic leg [537, 074, 694, 506] nestled round his shoulders and neck. He is looking at the camera.') 2023-11-03 16:25:02.324 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:25:02.324 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3,frame4 and frame5,please tell me the trajectory of the woman in black shirt.The TrackiFrame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'There is one woman in black shirt.Track1frame1:[361, 263, 747, 633];frame2:[361, 259, 747, 624].') 2023-11-03 16:25:16.504 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:25:16.504 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Lamp:[011, 001, 070, 409],[054, 001, 114, 416],[064, 455, 117, 564];Potted Plant:[042, 487, 080, 558],[001, 460, 043, 568];Couch:[001, 541, 231, 833];Carpet:[001, 618, 250, 962],[209, 538, 369, 580],[509, 585, 945, 810];Chair:[219, 475, 269, 563],[318, 466, 352, 560],[177, 510, 232, 600];Desk:[241, 476, 329, 568],[520, 466, 550, 544];Cabinet:[122, 459, 215, 557],[621, 378, 682, 516],[930, 001, 998, 1000];Frame:[535, 321, 563, 419];Bed:[572, 388, 964, 759];Pillow:[785, 477, 837, 544],[796, 496, 841, 557],[812, 494, 879, 580],[856, 464, 920, 588],[818, 451, 862, 502].') 2023-11-03 16:25:19.071 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:25:19.071 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[324, 014, 524, 818] Flower girl looking anxious as she walks up the aisle') 2023-11-03 16:25:41.683 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:25:41.683 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Children [834, 255, 993, 890] [453, 240, 670, 904] [019, 221, 348, 968] in town of Zadar, where Luka Modric moved to flee conflict as a child, play football [478, 832, 575, 979] (Photo: Getty') 2023-11-03 16:25:42.417 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:25:42.417 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'land vehicle:[050, 681, 131, 758],[795, 665, 802, 728],[829, 652, 893, 733];tree:[000, 250, 466, 827],[508, 000, 999, 885],[526, 140, 678, 787],[737, 000, 850, 999];building:[000, 204, 493, 728],[501, 208, 999, 719].') 2023-11-03 16:25:56.749 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:25:56.750 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect billboard,clothing,human face and man in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'clothing:[000, 447, 032, 693],[091, 641, 226, 797],[163, 414, 213, 520],[181, 546, 242, 645],[248, 553, 338, 647],[320, 658, 438, 812],[370, 560, 442, 643],[400, 418, 601, 572],[453, 562, 530, 643],[464, 662, 586, 820],[575, 566, 653, 637],[592, 469, 656, 530],[631, 499, 702, 568],[679, 555, 771, 658],[739, 680, 900, 826];human face:[141, 578, 178, 644],[360, 593, 395, 665],[516, 607, 548, 677],[801, 655, 843, 727];man:[738, 630, 921, 879],[001, 559, 226, 800],[172, 516, 228, 646],[236, 527, 335, 659],[295, 395, 709, 576],[320, 578, 440, 805],[377, 524, 443, 665],[440, 516, 523, 672],[472, 575, 587, 818],[570, 509, 653, 659],[681, 492, 769, 665].') 2023-11-03 16:26:14.369 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:26:14.369 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'shelf:[846, 280, 1000, 451];Chair:[392, 338, 450, 566];Frame:[668, 011, 727, 095];Person:[001, 037, 308, 687],[413, 211, 549, 477],[620, 090, 851, 506],[668, 374, 998, 1000],[780, 080, 838, 235],[837, 074, 904, 335];Leather Shoes:[613, 865, 672, 923];Cup:[032, 544, 213, 868],[216, 825, 346, 999],[671, 462, 760, 536],[585, 374, 640, 461],[533, 376, 584, 461],[759, 388, 823, 528],[377, 063, 625, 141];Fork:[243, 598, 395, 725];Spoon:[264, 694, 321, 895];Bottle:[001, 686, 091, 876],[626, 412, 665, 508];Plate:[251, 570, 518, 766],[659, 491, 775, 550],[470, 540, 667, 615],[531, 460, 650, 524],[525, 461, 650, 542].') 2023-11-03 16:26:19.463 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:26:19.464 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'brassiere:[564, 295, 789, 591];woman:[456, 012, 914, 989],[249, 222, 290, 411],[340, 239, 381, 396],[398, 235, 437, 349];man:[004, 128, 487, 982],[363, 269, 545, 539];sunglasses:[101, 261, 274, 306];swimwear:[004, 402, 525, 989],[536, 678, 842, 970],[581, 284, 770, 580];tree:[008, 014, 991, 436];human face:[117, 162, 245, 419],[604, 085, 705, 299];glasses:[106, 250, 254, 310].') 2023-11-03 16:26:44.131 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:26:44.132 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the person walking in mall?Stick to the TrackiFrame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'There is one person walking in mall.Track1frame1:[464, 276, 976, 776];frame2:[479, 274, 1000, 772];frame3:[482, 264, 1000, 751].') 2023-11-03 16:26:49.068 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:26:49.068 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[598, 455, 960, 982] [041, 467, 417, 996] Wheels of steel: [002, 029, 318, 998] Dr Helen Miles and [326, 001, 618, 998] Stephen Fearn holding the wheels of the ExoMars Rover model and [580, 003, 998, 998] Dr Matt Gunn showing the mission’s colour swatch that has been inspired by medieval stained glass.') 2023-11-03 16:27:39.914 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:27:39.914 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[344, 092, 735, 904] boris johnson chairs his first reshuffled cabinet after chancellor’s resignation; uk, london; prime minister boris johnson call and response with the... - cabinet stock videos & royalty-free footage') 2023-11-03 16:27:41.731 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:27:41.731 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Happy bride [576, 137, 844, 953] and friends [236, 107, 354, 609] [707, 134, 835, 726] [209, 325, 426, 972] [002, 092, 214, 998] [549, 103, 647, 487] [083, 104, 239, 750] [831, 453, 997, 979] all laughing together outside near a massive mirror [311, 001, 592, 796] for her Mendocino Coast wedding at Spring Ranch') 2023-11-03 16:28:14.617 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:28:14.618 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Hot dog:[395, 618, 421, 665],[425, 615, 444, 649],[446, 613, 465, 645],[449, 657, 467, 706],[420, 446, 446, 481],[447, 448, 473, 484],[475, 450, 497, 484];Hamburger:[492, 679, 509, 719],[429, 720, 450, 783],[514, 699, 534, 745],[470, 681, 491, 722];Bread:[448, 717, 469, 767];Street Lights:[597, 073, 688, 274];SUV:[000, 535, 085, 804];Trolley:[175, 493, 352, 917];Truck:[786, 507, 888, 636];Storage box:[274, 604, 337, 633];Umbrella:[057, 352, 280, 643],[306, 264, 477, 402],[490, 261, 669, 477],[662, 292, 690, 486],[208, 468, 326, 499];Person:[812, 502, 877, 753],[688, 474, 755, 866],[153, 548, 258, 899],[145, 519, 170, 649];Awning:[000, 404, 068, 463];Hat:[185, 547, 208, 578],[694, 475, 728, 507];Handbag:[809, 639, 831, 719].') 2023-11-03 16:28:21.758 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:28:21.758 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[350, 302, 421, 450] and track its trajectory.For the trajectories included in the answer, please use the format Frame t:[xmin,ymin,xmax,ymax].', 'This is a personFrame1:[350, 302, 421, 450];Frame2:[280, 279, 345, 412];Frame3:[067, 275, 120, 427];Frame4:[000, 001, 000, 001].') 2023-11-03 16:28:32.564 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:28:32.564 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[826, 303, 868, 406] Palestinian fishermen riding [003, 520, 465, 761] [799, 381, 993, 513] a boat cast [162, 487, 389, 742] their net as [581, 584, 656, 678] [862, 730, 905, 805] [658, 769, 718, 850] [569, 877, 636, 961] [641, 490, 724, 547] [367, 857, 431, 943] birds fly off the coast of Gaza City February 19, 2020 (photo credit: REUTERS/MOHAMMED SALEM') 2023-11-03 16:28:50.806 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:28:50.807 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect human arm,human head,footwear and person in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'human arm:[646, 375, 851, 804],[097, 437, 224, 836],[127, 464, 324, 540];human head:[658, 191, 841, 438],[118, 188, 248, 346];footwear:[320, 930, 401, 999];person:[610, 154, 906, 977],[091, 175, 386, 999].') 2023-11-03 16:29:18.612 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:29:18.612 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me what is thisFrame1:[442, 519, 706, 784] and track its trajectory.When detailing trajectories in your response, adhere to the Frame t:[xmin,ymin,xmax,ymax] format.', 'This is a gametarget of a car running on the ground Frame1:[442, 519, 706, 784];Frame2:[442, 523, 702, 795];Frame3:[441, 522, 688, 795].') 2023-11-03 16:29:19.375 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:29:19.375 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[664, 562, 732, 812],personFrame1:[339, 451, 404, 686],personFrame1:[392, 552, 458, 805],personFrame1:[500, 404, 540, 581],personFrame1:[633, 505, 681, 740],personFrame1:[432, 447, 497, 670],personFrame1:[496, 518, 574, 793] in this video clip.Your reply should be in alignment with the classFrame t:[xmin,ymin,xmax,ymax] structure.', 'personFrame1:[664, 562, 732, 812];Frame2:[721, 576, 785, 806],personFrame1:[339, 451, 404, 686];Frame2:[357, 476, 423, 716],personFrame1:[392, 552, 458, 805];Frame2:[397, 550, 448, 811],personFrame1:[500, 404, 540, 581];Frame2:[511, 413, 557, 570],personFrame1:[633, 505, 681, 740];Frame2:[671, 513, 728, 750],personFrame1:[432, 447, 497, 670];Frame2:[447, 473, 530, 697],personFrame1:[496, 518, 574, 793];Frame2:[494, 516, 555, 784].') 2023-11-03 16:29:36.955 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:29:36.956 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is thisFrame1:[368, 837, 550, 961] and track its trajectory.Stick to the Frame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'This is a cat walking on the streetFrame1:[368, 837, 550, 961];Frame2:[365, 797, 533, 916];Frame3:[427, 711, 573, 831].') 2023-11-03 16:29:45.276 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:29:45.277 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'swan:[000, 123, 840, 850];duck:[004, 150, 895, 891];goose:[000, 146, 906, 901].') 2023-11-03 16:29:45.779 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:29:45.780 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[443, 389, 585, 514] and track its trajectory.Ensure that each trajectory within your response is articulated using the Frame t:[xmin,ymin,xmax,ymax] specification.', 'This is a wombatFrame1:[443, 389, 585, 514];Frame2:[440, 383, 604, 504];Frame3:[423, 385, 609, 495];Frame4:[410, 400, 581, 527].') 2023-11-03 16:30:14.155 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:30:14.155 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Steel skid plate for the protection of the engine and the gearbox for Skoda Octavia 3 - automatic gearbox [570, 080, 953, 443]') 2023-11-03 16:30:20.853 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 16:30:20.854 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'table:[009, 696, 815, 985];person:[000, 192, 033, 528],[003, 236, 101, 526],[070, 261, 115, 491],[091, 274, 146, 561],[193, 388, 253, 491],[245, 399, 283, 483].') 2023-11-03 16:30:22.466 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:30:22.467 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'woman:[232, 284, 360, 898],[007, 287, 128, 915],[091, 313, 166, 847],[201, 302, 305, 597],[363, 304, 478, 849],[460, 298, 535, 838],[496, 295, 625, 867],[631, 298, 744, 886],[749, 291, 985, 884],[844, 267, 980, 878];footwear:[913, 832, 973, 921];human body:[005, 326, 646, 915],[086, 545, 273, 930],[694, 356, 793, 878],[748, 392, 983, 890],[859, 346, 975, 891];human leg:[018, 504, 964, 930];human hair:[057, 276, 107, 345],[098, 318, 166, 446],[156, 452, 224, 538],[171, 296, 208, 359],[240, 292, 306, 395],[284, 305, 341, 439],[484, 296, 517, 369],[528, 301, 586, 401],[775, 301, 856, 492],[846, 270, 927, 417];man:[112, 455, 270, 923],[121, 296, 212, 559],[198, 270, 248, 411],[336, 270, 400, 808],[377, 254, 446, 401],[441, 297, 499, 753],[568, 301, 653, 835],[690, 261, 798, 881].') 2023-11-03 16:30:52.376 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:30:52.376 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'How Google plans to use 13 balloons [158, 038, 888, 698] to bring Internet access to every person [540, 455, 607, 745] [456, 447, 528, 749] in Sri Lanka.') 2023-11-03 16:31:03.474 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:31:03.474 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'human body:[035, 235, 317, 923],[000, 082, 365, 491],[260, 182, 554, 631],[611, 195, 999, 999];human hair:[457, 212, 522, 315],[013, 159, 067, 229],[067, 140, 129, 233],[106, 231, 262, 435],[153, 179, 252, 242],[274, 192, 312, 262],[338, 201, 412, 275],[411, 196, 460, 287],[665, 217, 686, 350],[680, 214, 720, 300],[690, 233, 782, 473],[775, 247, 931, 510];tableware:[465, 489, 603, 617],[199, 246, 701, 981];man:[695, 229, 995, 954],[003, 168, 084, 375],[055, 147, 167, 460],[074, 182, 330, 681],[144, 158, 196, 286],[261, 193, 445, 629],[391, 199, 465, 447],[467, 189, 538, 355],[627, 203, 689, 364],[635, 215, 768, 497];clothing:[703, 377, 981, 983],[000, 277, 076, 490],[000, 483, 273, 991],[064, 229, 141, 432],[256, 297, 445, 597],[396, 267, 445, 433],[442, 283, 474, 382],[636, 415, 733, 550],[639, 335, 707, 481],[666, 496, 816, 999];woman:[000, 219, 313, 999],[431, 216, 533, 450],[615, 201, 665, 317];table:[091, 525, 757, 999],[403, 335, 672, 520];girl:[000, 239, 307, 999],[686, 239, 778, 533];human face:[038, 190, 065, 257],[195, 306, 268, 470],[356, 222, 410, 343],[709, 263, 761, 383],[739, 329, 811, 561].') 2023-11-03 16:31:23.072 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:31:23.073 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Person:[491, 213, 782, 1000],[386, 307, 516, 987],[153, 312, 455, 1000],[149, 356, 221, 703],[350, 220, 431, 474],[498, 538, 550, 780].') 2023-11-03 16:31:43.977 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:31:43.977 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[784, 422, 918, 518],carFrame1:[452, 397, 505, 470],carFrame1:[612, 407, 656, 482] in this video clip.To respond correctly, utilize the specified classFrame t:[xmin,ymin,xmax,ymax] format.', 'carFrame1:[784, 422, 918, 518];Frame2:[785, 422, 918, 518],carFrame1:[452, 397, 505, 470];Frame2:[452, 397, 505, 470],carFrame1:[612, 407, 656, 482];Frame2:[612, 407, 656, 482].') 2023-11-03 16:31:51.495 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 16:31:51.495 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Cabinet/shelf in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Cabinet:[462, 388, 659, 1000],[635, 387, 777, 996].') 2023-11-03 16:32:04.558 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:32:04.559 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[172, 276, 832, 800] GM is recalling its Chevrolet Equinox for a fuel tank seam. [005, 008, 993, 984] - Photo courtesy of GM.') 2023-11-03 16:32:24.761 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:32:24.761 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[297, 571, 365, 684],carFrame1:[335, 558, 401, 651],carFrame1:[000, 499, 230, 860],carFrame1:[591, 576, 700, 738],truckFrame1:[495, 498, 568, 649] in this video clip.When answering, maintain the format classFrame t:[xmin,ymin,xmax,ymax] without deviation.', 'carFrame1:[297, 571, 365, 684];Frame2:[221, 544, 352, 714],carFrame1:[335, 558, 401, 651];Frame2:[336, 531, 413, 649],carFrame1[000, 499, 230, 860];Frame2:No object,carFrame1:[591, 576, 700, 738];Frame2:[691, 557, 1000, 854],truckFrame1:[495, 498, 568, 649];Frame2:[528, 456, 621, 651].') 2023-11-03 16:32:38.470 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:32:38.470 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Watch:[424, 899, 494, 981];Person:[515, 035, 997, 998],[001, 002, 500, 996],[388, 067, 673, 832],[829, 096, 999, 755];Glasses:[142, 265, 296, 380],[718, 333, 857, 448];Traffic Light:[931, 002, 960, 092];Chicken:[001, 325, 120, 597];Wild Bird:[442, 544, 649, 673].') 2023-11-03 16:32:42.803 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:32:42.803 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect human face,man and fashion accessory in this image.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'human face:[139, 235, 180, 407],[188, 251, 291, 412],[430, 193, 521, 352],[633, 319, 710, 449],[805, 360, 910, 523],[943, 416, 999, 532];man:[321, 203, 630, 854],[025, 212, 313, 667],[029, 229, 499, 999],[886, 313, 999, 688];fashion accessory:[187, 295, 297, 338],[630, 346, 715, 393],[740, 291, 951, 583],[807, 599, 891, 754].') 2023-11-03 16:33:03.624 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:33:03.624 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[285, 161, 423, 897];girl:[306, 111, 415, 910];clothing:[000, 398, 030, 792],[038, 390, 110, 794],[085, 193, 302, 832],[210, 416, 241, 771],[211, 416, 251, 793],[251, 319, 314, 859],[301, 257, 420, 665],[391, 407, 480, 810],[448, 458, 493, 802],[471, 407, 580, 788],[543, 447, 579, 818],[565, 435, 601, 792],[579, 349, 679, 847],[656, 424, 733, 839],[735, 456, 761, 822],[738, 362, 857, 877],[769, 449, 817, 861],[841, 355, 970, 867],[965, 681, 999, 856].') 2023-11-03 16:33:16.450 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 16:33:16.450 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'person:[011, 063, 534, 1000],[000, 429, 071, 840],[378, 286, 999, 999];goggles:[133, 435, 395, 600];sunglasses:[180, 450, 415, 572];flag:[703, 000, 999, 806];human face:[183, 417, 418, 798],[551, 562, 731, 973].') 2023-11-03 16:33:24.386 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:33:24.386 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'suit:[015, 114, 296, 986],[269, 243, 331, 365],[290, 293, 580, 999],[498, 241, 776, 999],[731, 300, 999, 999];man:[009, 004, 318, 973],[265, 196, 336, 362],[286, 093, 553, 999],[491, 038, 776, 999],[726, 105, 999, 999];tie:[308, 265, 325, 361],[445, 379, 518, 864],[622, 294, 676, 745],[791, 375, 848, 615].') 2023-11-03 16:33:41.189 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:33:41.189 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'woman:[226, 145, 395, 509],[535, 251, 679, 556],[668, 235, 811, 524],[759, 593, 982, 885];man:[460, 198, 663, 482],[720, 230, 839, 466];girl:[613, 283, 737, 478];clothing:[010, 014, 073, 411],[084, 227, 143, 388],[270, 398, 446, 513],[455, 210, 614, 488],[560, 342, 652, 555],[620, 303, 725, 468],[682, 263, 799, 503],[735, 270, 833, 449],[759, 623, 883, 754],[763, 641, 945, 860],[851, 724, 963, 876],[856, 702, 949, 837];human face:[337, 147, 393, 289].') 2023-11-03 16:34:48.800 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:34:48.800 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[276, 179, 557, 729] Vork Brothers team member painting an outdoor deck of [009, 579, 996, 996] residential home') 2023-11-03 16:35:35.362 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:35:35.362 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect building,sculpture,tree and person in this image.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'building:[000, 111, 742, 829],[828, 365, 971, 774],[935, 276, 999, 788];sculpture:[671, 530, 762, 824],[503, 325, 683, 800];tree:[499, 258, 604, 775],[643, 462, 691, 696],[833, 000, 999, 205],[873, 439, 922, 683],[875, 213, 999, 777],[890, 399, 945, 698];person:[547, 287, 669, 829],[010, 734, 034, 836],[044, 725, 075, 849],[056, 745, 115, 905],[095, 721, 120, 807],[113, 728, 134, 834],[134, 742, 148, 830],[151, 740, 176, 836],[277, 719, 328, 853],[296, 734, 316, 846],[373, 721, 396, 830],[383, 721, 417, 849],[413, 727, 440, 842],[514, 727, 554, 824],[600, 696, 623, 798],[646, 696, 700, 793],[723, 681, 778, 811],[809, 697, 838, 842],[875, 699, 919, 863],[969, 734, 991, 836].') 2023-11-03 16:35:56.835 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:35:56.836 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Lifesaver in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Lifesaver:[158, 842, 181, 877].') 2023-11-03 16:36:15.076 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:36:15.076 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Candle:[710, 418, 768, 485];Bakset:[513, 788, 794, 999];Toiletry:[674, 476, 846, 585];Towel:[598, 460, 898, 680],[346, 562, 495, 729];Faucet:[526, 124, 590, 265];Sink:[403, 252, 691, 524];Toilet:[164, 252, 428, 755].') 2023-11-03 16:37:13.097 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 16:37:13.098 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[130, 116, 300, 943],[002, 188, 143, 922].') 2023-11-03 16:37:15.012 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:37:15.012 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the horse shifting forward.As you detail the trajectories in your reply, the TrackiFrame t:[xmin,ymin,xmax,ymax] format is the preferred structure.', 'There are 8 horse shifting forward.Track1frame:1:[085, 576, 166, 763];frame:2:[057, 555, 171, 737];frame:3:[069, 513, 176, 701],Track2frame:1:[045, 611, 088, 814];frame:2:[001, 581, 020, 651],Track3frame:1:[211, 574, 324, 713];frame:2:[218, 560, 296, 717];frame:3:[216, 526, 284, 671],Track4frame:1:[232, 645, 327, 861];frame:2:[230, 616, 311, 824];frame:3:[230, 579, 311, 781],Track5frame:1:[221, 609, 282, 810];frame:2:[180, 572, 264, 777];frame:3:[166, 534, 256, 741],Track6frame:1:[389, 565, 444, 762];frame:2:[344, 546, 409, 732];frame:3:[342, 498, 393, 697],Track7frame:1:[340, 567, 402, 782];frame:2:[302, 543, 368, 748];frame:3:[285, 505, 350, 720],Track8frame:1:[000, 651, 039, 864];frame:2:[000, 621, 040, 826];frame:3:[000, 586, 041, 786].') 2023-11-03 16:37:15.918 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:37:15.918 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Backpack:[629, 623, 858, 731];Person:[526, 599, 855, 791].') 2023-11-03 16:37:29.078 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:37:29.078 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'woman:[364, 236, 998, 985];man:[041, 381, 582, 980];dress:[470, 483, 999, 999];girl:[278, 255, 999, 999];human face:[303, 438, 430, 554],[541, 349, 842, 573].') 2023-11-03 16:37:34.924 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:37:34.925 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the the four puppies are feeding on their mother's milk.?Your trajectory details should strictly align with the TrackiFrame t:[xmin,ymin,xmax,ymax] format in your submission.", "There are 4 the four puppies are feeding on their mother's milk..Track1frame:1:[288, 481, 508, 754];frame:2:[271, 480, 521, 818];frame:3:[275, 496, 531, 810],Track2frame:1:[393, 539, 642, 891];frame:2:[398, 547, 643, 886];frame:3:[402, 601, 648, 870],Track3frame:1:[405, 531, 644, 834];frame:2:[422, 494, 644, 834];frame:3:[434, 480, 642, 831],Track4frame:1:[699, 741, 726, 770];frame:2:[699, 742, 726, 769];frame:3:[697, 724, 772, 769].") 2023-11-03 16:38:42.086 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:38:42.086 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the the dog that is running after the other dog.The TrackiFrame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'There is one the dog that is running after the other dog.Track1frame:1:[117, 324, 733, 570];frame:2:[117, 324, 733, 570];frame:3:[457, 194, 631, 446].') 2023-11-03 16:38:55.348 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:38:55.349 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Car:[944, 872, 999, 934],[888, 870, 956, 918],[836, 854, 905, 904],[802, 844, 856, 887],[750, 819, 811, 874],[703, 816, 768, 862],[591, 781, 654, 820],[283, 869, 360, 926].') 2023-11-03 16:38:58.161 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:38:58.161 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "One the best things about [006, 090, 939, 772] these marinated Grilled Veggie Kabobs is that they're equally delicious hot or cold. And your kids can help chop the veggies!") 2023-11-03 16:39:25.819 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:39:25.819 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me the trajectory of the turning cars in this video clip?For clarity, represent trajectories using the TrackiFrame t:[xmin,ymin,xmax,ymax] format in your response.', 'There is one turning cars.Track1frame1:[000, 458, 115, 634];frame2:[000, 456, 091, 648];frame3:[000, 450, 064, 632];frame4:[000, 442, 034, 634].') 2023-11-03 16:39:53.607 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:39:53.607 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given an image,please detect person in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'person:[796, 945, 855, 1000],[374, 738, 420, 852],[330, 705, 371, 845],[314, 636, 354, 777],[264, 685, 307, 834],[271, 630, 309, 759],[242, 667, 277, 813],[218, 680, 262, 849],[199, 659, 240, 810],[225, 623, 269, 752],[117, 660, 164, 759],[008, 566, 047, 667],[101, 566, 137, 659],[087, 531, 125, 605],[194, 586, 228, 695],[352, 585, 384, 694],[397, 592, 428, 709],[503, 649, 541, 789],[559, 547, 593, 625],[259, 962, 293, 1000].') 2023-11-03 16:40:14.603 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:40:14.604 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Satchel:[176, 587, 854, 999];Canned:[561, 357, 842, 857];Tomato:[425, 762, 473, 835],[519, 191, 571, 274],[508, 280, 564, 366],[495, 372, 550, 454],[398, 252, 450, 332],[383, 326, 438, 403];Broccoli:[163, 605, 433, 892];Lemon:[360, 487, 469, 683],[574, 290, 688, 447];Pepper:[427, 040, 516, 285];Carrot:[352, 358, 543, 695],[270, 419, 464, 761].') 2023-11-03 16:40:34.574 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 16:40:34.574 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Sneakers:[309, 848, 412, 963],[336, 847, 420, 984],[631, 779, 694, 864],[652, 820, 707, 900];Hat:[000, 475, 030, 502],[146, 464, 183, 478],[204, 471, 240, 486],[954, 457, 998, 480];Basketball:[650, 363, 819, 493],[194, 590, 248, 641];Person:[196, 046, 826, 984],[583, 305, 836, 900].') 2023-11-03 16:40:42.195 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:40:42.196 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Blue and green jellyfish painted on a woman's arm [122, 003, 695, 998]") 2023-11-03 16:41:02.506 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 16:41:02.507 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Sailboat:[214, 311, 409, 749].') 2023-11-03 16:41:23.898 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:41:23.898 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2 and frame3, please tell me what this oneframe:1:[000, 311, 461, 942];frame:2:[000, 248, 469, 938];frame:3:[000, 232, 483, 940] is doing?Briefly articulate your response by taking cues from the provided image frames and trajectory coordinates.', 'This is a/an the tiger on the left side of the ground, flipping over and lifting its front legs.') 2023-11-03 16:41:27.042 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:41:27.043 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Bench:[747, 647, 924, 746],[923, 663, 1000, 749];Couch:[873, 563, 1000, 669];Vase:[729, 257, 777, 313],[695, 233, 735, 293];Picture:[591, 308, 645, 474],[485, 347, 545, 426];Cabinet:[685, 281, 831, 703];Person:[455, 356, 584, 780],[081, 342, 216, 920].') 2023-11-03 16:41:30.463 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:41:30.464 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[576, 411, 657, 530] and track its trajectory.Use the specified Frame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'This is a bicycle by a boy on the groundFrame1:[576, 411, 657, 530];Frame2:[522, 409, 615, 533];Frame3:[482, 412, 620, 534];Frame4:[435, 412, 618, 537];Frame5:[431, 412, 618, 537].') 2023-11-03 16:41:42.338 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:41:42.339 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Laptop:[943, 571, 999, 799];Camera:[708, 609, 771, 716];Book:[220, 364, 347, 471];Chair:[082, 250, 314, 825],[334, 279, 520, 804],[521, 397, 705, 948];Desk:[560, 299, 970, 668],[152, 373, 818, 1000],[909, 753, 990, 932];Storage box:[160, 469, 266, 590],[000, 265, 113, 601];shelf:[805, 612, 956, 952];Guitar:[170, 020, 266, 456];Cup:[690, 705, 725, 781];Basin:[628, 542, 686, 611].') 2023-11-03 16:41:53.772 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:41:53.772 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Lamp:[523, 002, 646, 352],[210, 002, 343, 352];Frame:[567, 390, 644, 507];shelf:[255, 416, 338, 591];Desk:[049, 892, 526, 999];Bowl:[537, 643, 628, 709],[306, 663, 376, 723];Jug:[147, 633, 194, 687],[087, 796, 169, 962];Person:[648, 337, 818, 999],[281, 314, 580, 1000],[339, 381, 440, 663],[002, 556, 056, 903],[002, 837, 077, 998];Bread:[073, 672, 210, 723];Canned:[099, 891, 150, 999].') 2023-11-03 16:42:19.064 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:42:19.065 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'drink:[803, 802, 935, 1000];human body:[557, 197, 974, 945],[000, 344, 108, 438],[000, 366, 583, 999];human head:[688, 179, 901, 513],[179, 000, 496, 514];man:[020, 000, 647, 1000],[000, 209, 108, 438];human arm:[021, 300, 338, 1000],[081, 346, 115, 422],[475, 496, 608, 915],[562, 528, 938, 999],[870, 602, 999, 999];human hand:[779, 492, 935, 743],[117, 377, 320, 679];beard:[190, 277, 335, 495];human mouth:[231, 294, 315, 381];woman:[573, 202, 999, 999];human hair:[285, 000, 503, 354],[662, 193, 914, 717];girl:[571, 200, 999, 999];human face:[183, 033, 452, 488],[688, 245, 868, 579];human nose:[257, 187, 334, 301],[737, 383, 781, 459].') 2023-11-03 16:43:02.986 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:43:02.986 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, what is this objectframe:1:[325, 787, 901, 984];frame:2:[321, 787, 899, 979];frame:3:[332, 788, 909, 977] doing?Briefly articulate your response by taking cues from the provided image frames and trajectory coordinates.', 'This is a/an lizard does not move') 2023-11-03 16:43:05.805 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:43:05.806 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[368, 595, 736, 1000] and track its trajectory.Stick to the Frame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'This is a lizard creeping on the ground Frame1:[368, 595, 736, 1000];Frame2:[410, 584, 773, 1000];Frame3:[392, 572, 763, 1000];Frame4:[378, 597, 744, 1000].') 2023-11-03 16:43:45.063 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:43:45.063 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'human body:[061, 099, 317, 990],[350, 592, 445, 890],[426, 415, 615, 913],[757, 336, 910, 719];human hair:[179, 115, 301, 232],[791, 417, 849, 489];human head:[167, 105, 323, 265],[779, 334, 848, 430],[790, 428, 852, 500];mammal:[760, 341, 926, 703],[049, 112, 314, 983],[342, 593, 459, 896],[432, 425, 623, 910];man:[041, 056, 322, 1000],[437, 427, 619, 910];shorts:[077, 531, 278, 837],[358, 733, 435, 815],[466, 616, 600, 811];door:[347, 444, 430, 862],[448, 421, 510, 884],[672, 022, 763, 717],[815, 014, 918, 714];human leg:[079, 554, 256, 986],[343, 735, 401, 882],[401, 737, 439, 896],[463, 633, 583, 903],[755, 602, 860, 710];human face:[237, 146, 309, 266];human arm:[047, 264, 190, 618],[421, 649, 454, 752],[433, 488, 530, 680],[536, 477, 624, 681].') 2023-11-03 16:44:10.994 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:44:10.994 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[403, 481, 565, 659] and track its trajectory.When detailing trajectories in your response, adhere to the Frame t:[xmin,ymin,xmax,ymax] format.', 'This is a polar hareFrame1:[403, 481, 565, 659];Frame2:[452, 469, 580, 669];Frame3:[446, 473, 580, 675];Frame4:[445, 483, 585, 677].') 2023-11-03 16:45:20.721 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:45:20.721 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[601, 318, 864, 554] and track its trajectory.Incorporate the Frame t:[xmin,ymin,xmax,ymax] format when presenting trajectories in your response.', 'This is a leopardFrame1:[601, 318, 864, 554];Frame2:[601, 314, 855, 541];Frame3:[603, 314, 854, 522];Frame4:[610, 318, 836, 529];Frame5:[576, 329, 804, 529].') 2023-11-03 16:45:23.456 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:45:23.456 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Boat in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Boat:[548, 351, 701, 527].') 2023-11-03 16:45:26.113 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:45:26.113 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'man:[313, 588, 439, 778],[075, 483, 167, 749],[134, 554, 263, 745],[217, 379, 311, 547],[284, 280, 376, 508],[612, 330, 736, 508],[859, 306, 942, 456],[870, 381, 998, 598];woman:[000, 314, 227, 748],[343, 283, 940, 790],[773, 512, 881, 801],[877, 244, 982, 470];girl:[000, 536, 029, 665],[018, 647, 117, 752],[055, 511, 117, 651],[123, 453, 176, 548],[353, 483, 450, 698],[469, 493, 539, 693],[489, 382, 623, 544],[533, 633, 621, 775],[829, 366, 874, 479],[898, 576, 978, 802];human face:[020, 493, 055, 557],[138, 362, 168, 427],[305, 283, 333, 360],[366, 507, 392, 570],[520, 497, 542, 566],[556, 401, 585, 470],[601, 292, 629, 356],[638, 346, 663, 410],[669, 478, 702, 549],[748, 466, 786, 539],[797, 532, 834, 597],[833, 373, 856, 439],[853, 468, 886, 539],[860, 322, 900, 397],[875, 388, 916, 456].') 2023-11-03 16:46:53.901 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:46:53.902 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'sculpture:[242, 000, 775, 140];tree:[000, 000, 266, 569],[736, 000, 999, 552].') 2023-11-03 16:47:01.687 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:47:01.687 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Trash bin Can:[297, 619, 341, 748],[717, 645, 743, 709];Bus:[181, 395, 662, 830],[052, 566, 130, 600];Van:[632, 594, 753, 694],[768, 549, 1000, 724];Car:[666, 609, 784, 707],[881, 624, 1000, 744],[138, 595, 180, 643],[084, 592, 176, 642],[011, 591, 111, 635];Street Lights:[778, 385, 817, 732],[910, 381, 963, 552];Bicycle:[029, 609, 057, 637],[152, 609, 171, 646];Person:[548, 564, 588, 627],[405, 561, 445, 801].') 2023-11-03 16:47:29.449 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:47:29.449 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[183, 112, 931, 919] and track its trajectory.For the trajectories included in the answer, please use the format Frame t:[xmin,ymin,xmax,ymax].', 'This is a anteaterFrame1:[183, 112, 931, 919];Frame2:[314, 062, 985, 912];Frame3:[310, 000, 996, 843];Frame4:[197, 000, 987, 850].') 2023-11-03 16:47:55.339 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:47:55.340 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A garden room gable with simple lines and [308, 337, 616, 626] clear glazing') 2023-11-03 16:48:06.592 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:48:06.593 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Trolley,Chair and Necklace in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Trolley:[124, 572, 726, 794];Chair:[958, 359, 999, 440];Necklace:[580, 331, 634, 376],[431, 322, 493, 377].') 2023-11-03 16:48:14.540 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:48:14.540 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[414, 616, 462, 719] and track its trajectory.Use the specified Frame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'This is a white car running on the beachFrame1:[414, 616, 462, 719];Frame2:[414, 643, 464, 751];Frame3:[416, 662, 467, 772];Frame4:[418, 668, 471, 779];Frame5:[421, 669, 471, 777].') 2023-11-03 16:48:16.189 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:48:16.189 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[224, 460, 785, 837] a vase wrapped with cutout bark and [002, 032, 418, 477] [572, 149, 998, 579] yellow blooms and thistles for [009, 000, 989, 934] a bright and cheerful wedding centerpiece') 2023-11-03 16:48:30.317 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:48:30.318 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'human body:[805, 445, 973, 953],[000, 236, 060, 427],[000, 526, 999, 999],[257, 446, 966, 586],[291, 180, 377, 339],[409, 259, 510, 336],[489, 200, 517, 290],[510, 148, 592, 332],[589, 234, 699, 342],[780, 288, 869, 376],[782, 192, 838, 309],[826, 192, 916, 355],[901, 174, 998, 363];human hair:[182, 433, 260, 568],[008, 491, 068, 580],[058, 655, 113, 753],[088, 448, 134, 529],[340, 461, 385, 529],[370, 457, 434, 541],[475, 477, 542, 556],[486, 483, 542, 589],[497, 131, 541, 206],[550, 568, 671, 753],[567, 522, 635, 587],[619, 445, 674, 547],[632, 534, 719, 635],[814, 472, 914, 568],[856, 424, 926, 495];human head:[385, 590, 524, 797],[005, 488, 070, 580],[012, 189, 048, 253],[060, 653, 121, 753],[083, 449, 136, 543],[200, 444, 272, 574],[322, 130, 354, 196],[327, 750, 393, 865],[343, 459, 390, 546],[377, 453, 435, 572],[440, 211, 480, 286],[474, 480, 515, 590],[485, 488, 569, 634],[531, 563, 672, 783],[536, 092, 577, 170],[568, 524, 632, 574],[617, 455, 677, 546],[631, 192, 671, 266],[640, 534, 717, 662],[781, 462, 842, 567],[795, 490, 833, 567],[814, 240, 851, 307],[815, 469, 905, 622],[853, 149, 891, 224],[860, 426, 921, 490],[871, 701, 999, 953],[900, 492, 932, 565],[924, 123, 958, 196];man:[110, 455, 324, 987],[000, 188, 074, 421],[000, 418, 995, 996],[283, 126, 385, 357],[419, 209, 512, 346],[511, 092, 592, 334],[587, 190, 704, 359],[765, 141, 847, 292],[825, 141, 918, 321],[902, 132, 999, 342];clothing:[111, 497, 325, 987],[000, 236, 055, 430],[000, 579, 051, 805],[028, 521, 167, 770],[034, 740, 131, 800],[288, 188, 378, 328],[326, 528, 368, 783],[326, 750, 517, 999],[342, 538, 445, 767],[414, 257, 505, 346],[465, 570, 532, 723],[507, 150, 585, 339],[520, 732, 771, 994],[591, 236, 695, 348],[659, 644, 759, 838],[731, 570, 854, 975],[777, 567, 973, 999],[782, 198, 833, 276],[813, 871, 989, 996],[833, 184, 911, 359],[916, 172, 992, 334];mammal:[000, 182, 066, 478],[000, 481, 069, 820],[034, 652, 146, 870],[046, 446, 173, 720],[121, 443, 321, 954],[247, 745, 400, 987],[289, 120, 379, 349],[313, 584, 549, 999],[317, 456, 390, 727],[343, 450, 449, 769],[411, 211, 512, 351],[464, 478, 564, 717],[466, 476, 521, 619],[490, 129, 541, 276],[508, 091, 590, 342],[565, 519, 634, 583],[586, 186, 700, 355],[613, 440, 676, 553],[629, 525, 766, 845],[732, 483, 858, 828],[771, 456, 981, 999],[774, 144, 839, 282],[781, 238, 869, 373],[785, 456, 848, 545],[821, 144, 916, 346],[823, 692, 999, 999],[858, 419, 922, 501],[888, 489, 932, 571],[899, 123, 999, 352];human face:[392, 488, 430, 569],[418, 630, 490, 784],[527, 521, 567, 634],[534, 629, 587, 784],[827, 526, 869, 620];human arm:[000, 245, 064, 347],[000, 459, 999, 999],[290, 207, 307, 270],[358, 216, 380, 349],[411, 272, 471, 349],[471, 270, 509, 347],[492, 202, 517, 288],[511, 170, 591, 278],[574, 180, 592, 272],[587, 259, 617, 332],[653, 233, 702, 338],[777, 207, 807, 288],[784, 279, 834, 349],[856, 307, 866, 372],[897, 209, 918, 310],[918, 199, 999, 301];human nose:[033, 130, 943, 900].') 2023-11-03 16:49:28.741 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:49:28.741 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[259, 549, 336, 660],carFrame1:[093, 505, 154, 575],carFrame1:[920, 635, 999, 735],carFrame1:[602, 577, 644, 640],carFrame1:[803, 602, 859, 667],carFrame1:[219, 543, 265, 610],carFrame1:[666, 607, 774, 664] in this video clip.Do not deviate from the established classFrame t:[xmin,ymin,xmax,ymax] format in your answer.', 'carFrame1:[259, 549, 336, 660];Frame2:[220, 535, 313, 668],carFrame1:[093, 505, 154, 575];Frame2:[049, 493, 127, 564],carFrame1[920, 635, 999, 735];Frame2:No object,carFrame1:[602, 577, 644, 640];Frame2:[596, 569, 657, 640],carFrame1:[803, 602, 859, 667];Frame2:[826, 595, 890, 667],carFrame1:[219, 543, 265, 610];Frame2:[191, 528, 239, 606],carFrame1:[666, 607, 774, 664];Frame2:[688, 598, 769, 668].') 2023-11-03 16:49:57.694 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:49:57.694 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A young business woman [101, 130, 945, 997] or student using a laptop [058, 406, 601, 694] with a strong red background') 2023-11-03 16:50:42.247 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:50:42.247 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Polar Express trips feature [003, 019, 995, 997] a range of festive entertainment for [649, 461, 759, 706] [602, 669, 781, 997] [812, 662, 992, 917] [142, 480, 339, 997] [017, 397, 186, 924] families') 2023-11-03 16:51:07.625 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:51:07.626 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Person:[932, 314, 998, 659],[918, 366, 943, 429],[824, 460, 910, 765],[656, 294, 999, 999],[517, 387, 727, 999],[402, 330, 698, 997],[396, 320, 477, 415],[042, 376, 452, 999],[903, 047, 964, 200].') 2023-11-03 16:51:56.008 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:51:56.008 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect woman,human head,human eye,human nose and flag in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[000, 057, 584, 1000];human head:[054, 025, 486, 365],[571, 031, 842, 346];human eye:[261, 212, 349, 243];human nose:[323, 239, 388, 301].') 2023-11-03 16:52:01.490 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:52:01.490 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[173, 260, 377, 604] and track its trajectory.Ensure the trajectories in your answer follow the Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a personFrame1:[173, 260, 377, 604];Frame2:[218, 246, 418, 604];Frame3:[277, 247, 476, 613];Frame4:[241, 261, 455, 593].') 2023-11-03 16:52:33.549 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:52:33.550 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[487, 017, 765, 977] Ingrid Wiese-Hesson says she was nursing [562, 400, 712, 618] her baby in the back of the Beverly Hills Anthropologie store when she was escorted to the restroom. (Credit: KTLA') 2023-11-03 16:52:51.057 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:52:51.057 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'SpaceX [004, 280, 993, 995] livestreamed the cruise through space, manned by Starman [454, 334, 652, 712] the mannequin') 2023-11-03 16:52:58.610 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:52:58.611 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Air Conditioner:[001, 001, 234, 141];Person:[086, 346, 374, 999],[278, 426, 564, 999],[526, 444, 744, 999],[660, 239, 932, 1000];Bracelet:[169, 874, 199, 937];Necklace:[402, 687, 493, 793];Glasses:[259, 349, 361, 408],[444, 454, 521, 508];Plate:[329, 011, 411, 151],[210, 001, 290, 139].') 2023-11-03 16:53:30.890 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:53:30.890 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect person,human arm,swimwear,human body and ladybug in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'person:[660, 153, 989, 808],[017, 139, 385, 747];human arm:[023, 231, 138, 500],[275, 254, 388, 531],[794, 275, 999, 491];swimwear:[106, 236, 294, 472],[727, 262, 938, 498];human body:[094, 130, 348, 746],[649, 253, 999, 793].') 2023-11-03 16:53:32.494 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:53:32.495 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[560, 261, 788, 595] and track its trajectory.When detailing trajectories in your response, adhere to the Frame t:[xmin,ymin,xmax,ymax] format.', 'This is a cheetahFrame1:[560, 261, 788, 595];Frame2:[546, 273, 840, 725];Frame3:[575, 269, 810, 754];Frame4:[564, 304, 846, 918];Frame5:[611, 070, 864, 702].') 2023-11-03 16:54:23.016 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:54:23.016 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Flower,shelf and Laptop in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Flower:[690, 766, 838, 964];Cabinet:[007, 691, 294, 1000],[290, 849, 818, 1000];Laptop:[112, 622, 224, 717].') 2023-11-03 16:55:09.972 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 16:55:09.973 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Person:[839, 848, 878, 938],[913, 874, 946, 940].') 2023-11-03 16:55:13.306 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:55:13.306 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Person:[701, 150, 901, 653],[429, 197, 590, 648],[092, 236, 329, 772];Boots:[427, 581, 485, 640],[520, 576, 589, 646],[733, 539, 787, 644],[833, 545, 890, 608];Gloves:[285, 416, 331, 481],[531, 370, 579, 439];Helmet:[211, 233, 278, 315],[485, 198, 559, 286],[752, 154, 821, 237];Hockey Stick:[545, 435, 602, 609],[624, 480, 798, 581].') 2023-11-03 16:55:32.812 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:55:32.813 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'football:[565, 725, 617, 798].') 2023-11-03 16:55:52.512 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:55:52.513 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[043, 478, 145, 560],carFrame1:[641, 475, 766, 612],carFrame1:[460, 457, 569, 613] in this video clip.Your reply should be in alignment with the classFrame t:[xmin,ymin,xmax,ymax] structure.', 'carFrame1:[043, 478, 145, 560];Frame2:[002, 475, 130, 562],carFrame1:[641, 475, 766, 612];Frame2:[654, 472, 815, 633],carFrame1:[460, 457, 569, 613];Frame2:[451, 451, 579, 633].') 2023-11-03 16:56:25.426 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:56:25.426 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'frog:[116, 141, 869, 860].') 2023-11-03 16:56:39.406 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:56:39.406 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'FILE: People wear face masks [756, 123, 788, 231] [257, 071, 292, 142] [876, 087, 944, 189] [192, 148, 233, 223] [553, 117, 631, 235] [404, 066, 460, 178] ahead police checkpoint in compliance with state directive to curb the spread of COVID-19 at Ojodu-Berger in Lagos, on 4 May 2020. Picture: AFP.') 2023-11-03 16:57:10.707 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:57:10.708 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[350, 200, 804, 996] and track its trajectory.Any trajectory details should be arranged according to the Frame t:[xmin,ymin,xmax,ymax] format for clarity.', "This is a grevy's zebraFrame1:[350, 200, 804, 996];Frame2:[331, 432, 998, 997];Frame3:[239, 176, 956, 948];Frame4:[410, 197, 998, 997].") 2023-11-03 16:57:14.893 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:57:14.894 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect man and woman in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'man:[000, 684, 222, 1000],[000, 122, 038, 216],[000, 270, 048, 389],[000, 400, 087, 705],[011, 137, 052, 273],[030, 330, 133, 549],[030, 301, 141, 432],[050, 158, 091, 284],[066, 235, 122, 313],[087, 140, 139, 263],[128, 140, 176, 334],[189, 356, 304, 639],[246, 456, 428, 754],[247, 239, 304, 392],[285, 195, 355, 285],[288, 265, 358, 442],[290, 353, 418, 580],[317, 595, 685, 995],[346, 261, 388, 340],[352, 282, 445, 528],[397, 265, 441, 350],[410, 406, 530, 623],[489, 290, 560, 442],[520, 340, 627, 631],[538, 189, 574, 295],[545, 275, 587, 389],[600, 368, 672, 623],[605, 282, 635, 370],[641, 290, 688, 368],[662, 320, 708, 450],[736, 279, 801, 402],[758, 406, 999, 921],[768, 353, 861, 490],[777, 320, 862, 410],[822, 416, 986, 762],[838, 297, 874, 380],[841, 337, 941, 545],[946, 468, 999, 628];woman:[071, 402, 193, 670],[105, 513, 353, 931],[167, 308, 224, 430],[179, 339, 247, 499],[416, 292, 461, 411],[437, 313, 515, 499],[456, 540, 769, 999],[490, 292, 522, 355],[620, 490, 930, 999],[661, 377, 770, 579],[689, 271, 725, 337],[693, 315, 747, 400].') 2023-11-03 16:57:30.393 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:57:30.394 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[728, 575, 905, 951];man:[003, 876, 042, 966],[010, 757, 023, 829],[043, 726, 063, 816],[065, 850, 100, 916],[069, 822, 140, 959],[115, 851, 183, 955],[161, 788, 228, 931],[196, 714, 251, 829],[237, 774, 288, 883],[271, 839, 355, 999],[342, 788, 503, 996],[348, 523, 513, 672],[371, 759, 424, 853],[391, 724, 442, 862],[402, 492, 434, 555],[423, 682, 487, 857],[446, 596, 469, 675],[456, 692, 552, 927],[483, 761, 660, 999],[538, 672, 606, 896],[551, 574, 592, 640],[560, 601, 595, 670],[593, 476, 620, 564],[618, 564, 741, 786],[620, 460, 648, 551],[645, 483, 683, 550],[647, 400, 701, 490],[668, 546, 762, 714],[767, 453, 871, 645],[773, 349, 838, 439],[833, 391, 926, 509],[851, 312, 890, 386],[905, 131, 938, 238],[939, 131, 998, 361],[943, 140, 999, 333];clothing:[010, 158, 999, 999],[353, 911, 458, 995];human face:[003, 008, 999, 935].') 2023-11-03 16:58:07.964 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:58:07.964 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[000, 225, 894, 740] and track its trajectory.Any trajectory details should be arranged according to the Frame t:[xmin,ymin,xmax,ymax] format for clarity.', 'This is a airplaneFrame1:[000, 225, 894, 740];Frame2:[000, 196, 905, 756];Frame3:[000, 182, 939, 750];Frame4:[000, 182, 941, 750].') 2023-11-03 16:58:30.991 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:58:30.992 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[453, 361, 532, 758] and track its trajectory.If you're including trajectory details in your reply, the Frame t:[xmin,ymin,xmax,ymax] format is imperative.", 'This is a personFrame1:[453, 361, 532, 758];Frame2:[456, 354, 523, 719];Frame3:[463, 338, 528, 709];Frame4:[463, 338, 526, 738];Frame5:[469, 351, 543, 715].') 2023-11-03 16:58:41.097 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:58:41.097 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is the track of the woman in red jacket and black dress?For clarity, represent trajectories using the TrackiFrame t:[xmin,ymin,xmax,ymax] format in your response.', 'There is one woman in red jacket and black dress.Track1frame1:[667, 419, 1000, 1000];frame2:[676, 425, 1000, 1000];frame3:[698, 418, 1000, 1000];frame4:[711, 421, 1000, 1000].') 2023-11-03 16:58:41.743 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:58:41.744 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[278, 327, 532, 510] and track its trajectory.If you're including trajectory details in your reply, the Frame t:[xmin,ymin,xmax,ymax] format is imperative.", 'This is a common zebraFrame1:[278, 327, 532, 510];Frame2:[373, 331, 625, 518];Frame3:[414, 343, 656, 525];Frame4:[418, 325, 631, 518];Frame5:[478, 331, 665, 514].') 2023-11-03 16:59:19.721 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:59:19.722 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'man:[835, 000, 989, 863];boy:[846, 000, 999, 998];tree:[083, 160, 290, 302],[104, 423, 191, 613],[328, 651, 351, 766],[371, 588, 423, 710],[418, 199, 498, 471],[583, 513, 616, 591],[664, 042, 751, 153],[766, 452, 818, 595],[780, 513, 850, 808],[809, 101, 869, 224],[980, 000, 997, 199];clothing:[911, 162, 999, 975].') 2023-11-03 16:59:57.758 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 16:59:57.758 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'fixed-wing aircraft:[000, 381, 852, 764].') 2023-11-03 17:00:14.891 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:00:14.891 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[252, 056, 749, 875] a cool young man riding [124, 311, 731, 962] an electric scooter in the mist - electric scooter stock pictures, [392, 634, 703, 706] royalty-free photos & images') 2023-11-03 17:01:02.141 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:01:02.141 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[356, 548, 546, 993] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a mountain bikeFrame1:[356, 548, 546, 993];Frame2:[361, 548, 564, 993];Frame3:[382, 548, 612, 993];Frame4:[382, 541, 613, 993];Frame5:[386, 534, 624, 993].') 2023-11-03 17:01:17.277 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:01:17.277 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[368, 612, 431, 859] and track its trajectory.Ensure the trajectories in your answer follow the Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a personFrame1:[368, 612, 431, 859];Frame2:[342, 606, 414, 858];Frame3:[351, 602, 406, 855];Frame4:[329, 544, 414, 856].') 2023-11-03 17:01:27.059 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:01:27.060 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Bottle:[001, 276, 229, 657],[060, 000, 372, 526],[373, 000, 623, 429],[622, 000, 937, 544],[757, 263, 1000, 677].') 2023-11-03 17:02:28.359 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:02:28.360 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'human body:[165, 263, 521, 1000],[453, 049, 881, 999];woman:[141, 128, 501, 1000];human hair:[249, 134, 427, 424],[498, 048, 694, 255];man:[431, 041, 877, 996];human arm:[796, 467, 885, 988],[163, 495, 464, 825],[246, 520, 520, 964];glasses:[276, 303, 424, 376];computer keyboard:[055, 742, 178, 855];human mouth:[566, 315, 640, 342];human head:[241, 135, 440, 464],[493, 050, 698, 410];human face:[290, 227, 416, 451],[522, 093, 675, 401].') 2023-11-03 17:02:39.415 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:02:39.416 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'chair:[194, 730, 418, 989],[000, 678, 092, 909],[207, 625, 362, 795],[406, 730, 605, 999],[571, 649, 735, 904],[865, 495, 928, 707],[930, 531, 987, 665];table:[007, 592, 171, 899],[371, 628, 557, 768];tree:[011, 026, 501, 521],[597, 192, 999, 539],[925, 000, 999, 111];person:[000, 399, 181, 710],[293, 492, 729, 823],[705, 499, 809, 733],[953, 513, 993, 602].') 2023-11-03 17:02:40.910 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:02:40.911 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[242, 487, 292, 600].') 2023-11-03 17:02:51.540 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:02:51.541 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Dr. Dirk Greineder of Wellesley [042, 410, 206, 826], accused of murdering his wife, is led past his children [180, 422, 319, 866] [325, 417, 495, 877], Britt [001, 408, 074, 685], Kirsten, and Colin, during arraignment at Norfolk...') 2023-11-03 17:03:09.481 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:03:09.481 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'an officer [001, 005, 296, 996] looks at the bullet riddled car of sp malik maqsood photo nni file') 2023-11-03 17:03:14.439 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:03:14.439 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'jacket:[356, 298, 691, 975],[654, 309, 913, 967];woman:[075, 214, 312, 1000];man:[366, 216, 684, 990],[605, 145, 913, 999];coat:[069, 425, 298, 999];tree:[517, 139, 696, 485];human face:[188, 261, 264, 397],[481, 276, 558, 450],[665, 221, 730, 384].') 2023-11-03 17:03:48.527 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:03:48.527 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'How can we use Imported Marbles in a house [002, 001, 996, 993]?') 2023-11-03 17:03:55.311 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:03:55.311 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect goggles and human face in this image.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'goggles:[573, 427, 754, 537];human face:[209, 212, 305, 467],[369, 213, 435, 456],[568, 345, 751, 692],[704, 209, 756, 396],[869, 000, 912, 065],[874, 209, 920, 398],[949, 235, 996, 407],[957, 000, 987, 076].') 2023-11-03 17:04:01.931 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:04:01.931 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A studio portrait of [006, 033, 951, 998] a couple dispute who should do the cleaning using [707, 209, 999, 998] household appliances as photography props') 2023-11-03 17:04:09.604 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:04:09.604 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "[200, 210, 790, 864] Steam Roller on the run after Trevithick's Day photo and [044, 845, 203, 920] fine art print") 2023-11-03 17:04:33.802 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:04:33.803 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[250, 587, 306, 827],personFrame1:[332, 429, 409, 713],personFrame1:[886, 493, 932, 752],personFrame1:[493, 602, 573, 841],personFrame1:[204, 694, 288, 965],personFrame1:[300, 498, 352, 733],personFrame1:[434, 438, 492, 627] in this video clip.To respond correctly, utilize the specified classFrame t:[xmin,ymin,xmax,ymax] format.', 'personFrame1:[250, 587, 306, 827];Frame2:[257, 580, 299, 843],personFrame1:[332, 429, 409, 713];Frame2:[309, 433, 364, 756],personFrame1:[886, 493, 932, 752];Frame2:[912, 501, 980, 768],personFrame1:[493, 602, 573, 841];Frame2:[500, 606, 585, 840],personFrame1:[204, 694, 288, 965];Frame2:[250, 704, 317, 986],personFrame1:[300, 498, 352, 733];Frame2:[278, 493, 334, 743],personFrame1:[434, 438, 492, 627];Frame2:[407, 444, 489, 638].') 2023-11-03 17:04:40.720 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:04:40.720 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[068, 030, 562, 992] [583, 400, 825, 997] Two soldiers wearing [259, 031, 430, 424] gas masks and holding [584, 505, 995, 867] [127, 330, 641, 877] firearms, stand in a haze of [023, 007, 994, 994] green gas. [007, 853, 156, 994] A logo reads "Call of Duty Warzone, MW"') 2023-11-03 17:05:05.124 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:05:05.124 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Chair:[801, 628, 835, 682];Person:[517, 328, 591, 697],[695, 476, 768, 710];Flag:[125, 161, 178, 556].') 2023-11-03 17:06:00.901 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:06:00.901 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "[682, 306, 967, 996] Bob Bradley, VP of sales, Bradford & Bigelow (third from the right), stands with [278, 346, 371, 995] [443, 330, 624, 996] [200, 351, 314, 997] [519, 360, 727, 997] [381, 356, 524, 995] [144, 380, 246, 996] [305, 319, 454, 997] the B&B sales team with the company's new manroland Goss four-color offset web press.") 2023-11-03 17:06:24.026 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:06:24.027 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "[493, 480, 683, 976] [321, 368, 511, 916] People play by [446, 019, 602, 269] the Dome of the Rock on [003, 416, 995, 994] the compound during a snowy morning in Jerusalem's Old City, February 18, 2021. (Reuters") 2023-11-03 17:06:40.518 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:06:40.518 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[397, 175, 546, 548] [432, 320, 542, 740] [687, 267, 825, 836] [853, 285, 992, 866] Happy family picking [226, 577, 482, 758] fresh organic fruits on a farm.') 2023-11-03 17:06:41.421 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:06:41.421 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'man:[054, 527, 138, 746];tree:[000, 000, 999, 901];clothing:[060, 545, 135, 707].') 2023-11-03 17:07:00.071 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:07:00.071 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Desk:[340, 513, 894, 794],[001, 551, 533, 998],[379, 688, 999, 999];Chair:[018, 811, 318, 998];Person:[591, 105, 767, 744],[207, 317, 407, 652],[087, 348, 329, 822],[001, 542, 096, 891],[064, 477, 405, 998];Book:[447, 807, 697, 959];Satchel:[387, 464, 468, 546],[256, 422, 376, 556],[200, 568, 285, 698],[224, 555, 282, 629];Folder:[271, 706, 459, 818];Whiteboard:[516, 122, 644, 435];Cup:[694, 742, 740, 833],[476, 673, 519, 754];Microphone:[232, 233, 268, 297];Projector:[761, 611, 842, 689];Banana:[553, 713, 620, 771].') 2023-11-03 17:07:14.946 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:07:14.946 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'toasted marshmallow topped Rice Krispie treats on [003, 001, 996, 992] a white platter') 2023-11-03 17:07:37.154 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:07:37.154 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'If you want a clean and easy shave, try the Little Beauty Razor [045, 120, 569, 473] (January 9th, 1897') 2023-11-03 17:08:04.809 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:08:04.810 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Person:[232, 336, 363, 849],[370, 370, 443, 823],[424, 739, 458, 903],[507, 531, 590, 884],[375, 428, 510, 1000],[673, 567, 801, 802];Watch:[380, 661, 397, 681];Satchel:[271, 546, 324, 626];Other Shoes:[282, 692, 319, 715],[255, 798, 292, 853],[425, 883, 459, 905];Sneakers:[380, 928, 434, 994],[467, 944, 498, 999],[503, 845, 540, 877],[539, 855, 566, 886],[691, 785, 714, 801],[735, 781, 746, 794];Belt:[413, 634, 502, 656];Hat:[432, 428, 477, 479].') 2023-11-03 17:08:06.578 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:08:06.578 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'person:[150, 152, 773, 975];clothing:[241, 228, 756, 918].') 2023-11-03 17:08:25.083 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:08:25.083 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect cowboy hat,sun hat,man and christmas tree in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'cowboy hat:[059, 243, 366, 729];sun hat:[046, 261, 410, 705];man:[060, 277, 661, 1000],[339, 338, 580, 807].') 2023-11-03 17:08:45.857 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 17:08:45.859 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Person:[746, 636, 769, 732];Hat:[883, 598, 916, 632],[953, 545, 1000, 591].') 2023-11-03 17:08:52.835 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:08:52.836 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Desk:[001, 006, 861, 991];Basin:[045, 030, 823, 941];Spoon:[588, 002, 905, 571];Cup:[015, 157, 160, 402].') 2023-11-03 17:09:23.611 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 17:09:23.612 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'window:[904, 335, 986, 651];shelf:[325, 172, 632, 804].') 2023-11-03 17:09:56.286 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:09:56.286 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Picture/Frame in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Frame:[323, 530, 473, 695].') 2023-11-03 17:10:06.438 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:10:06.439 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the goose moving around and bending forward to eat?Maintain consistency in your answer by using the TrackiFrame t:[xmin,ymin,xmax,ymax] format for trajectories.', 'There are 7 goose moving around and bending forward to eat.Track1frame:1:[366, 374, 519, 853];frame:2:[364, 420, 496, 790];frame:3:[357, 375, 494, 761],Track2frame:1:[583, 219, 678, 656];frame:2:[539, 237, 689, 653];frame:3:[590, 249, 711, 611],Track3frame:1:[425, 001, 538, 105];frame:2:[404, 001, 482, 092];frame:3:[282, 001, 384, 094],Track4frame:1:[222, 143, 425, 453];frame:2:[223, 156, 428, 451];frame:3:[223, 140, 429, 449],Track5frame:1:[453, 013, 533, 320];frame:2:[453, 009, 533, 321];frame:3:[457, 012, 536, 325],Track6frame:1:[693, 250, 920, 642];frame:2:[574, 370, 929, 862];frame:3:[434, 512, 853, 997],Track7frame:1:[953, 797, 999, 907];frame:2:[831, 768, 999, 998];frame:3:[853, 462, 999, 998].') 2023-11-03 17:10:17.750 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:10:17.750 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'woman:[000, 030, 331, 914],[072, 180, 861, 999];girl:[000, 018, 336, 908],[073, 171, 919, 999];clothing:[000, 138, 256, 895],[041, 171, 541, 938],[069, 338, 853, 999],[630, 226, 999, 979],[754, 546, 999, 998].') 2023-11-03 17:10:51.404 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:10:51.404 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Radiator:[001, 340, 059, 733];Chair:[155, 387, 426, 681],[606, 381, 690, 460];Cabinet:[377, 001, 615, 733];Storage box:[451, 004, 524, 074],[534, 101, 594, 185],[536, 157, 595, 220],[525, 224, 601, 294];Lamp:[828, 001, 999, 242];Couch:[673, 453, 1000, 999];Stool:[001, 668, 424, 998];Person:[083, 132, 496, 921],[545, 549, 999, 999],[535, 324, 999, 904],[498, 316, 805, 730];Hat:[220, 302, 297, 370],[884, 327, 976, 444].') 2023-11-03 17:11:27.465 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:11:27.466 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[076, 069, 623, 887] and track its trajectory.Any trajectory details should be arranged according to the Frame t:[xmin,ymin,xmax,ymax] format for clarity.', 'This is a snowplowFrame1:[076, 069, 623, 887];Frame2:[173, 199, 669, 954];Frame3:[317, 241, 753, 940];Frame4:[398, 271, 784, 929];Frame5:[471, 259, 801, 844].') 2023-11-03 17:11:51.064 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:11:51.065 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Leather Shoes:[472, 836, 520, 869],[001, 749, 101, 794],[135, 746, 248, 786],[072, 610, 173, 652],[212, 661, 286, 700],[029, 702, 147, 743];Belt:[410, 478, 564, 572];Handbag:[258, 653, 319, 718],[314, 684, 363, 721],[309, 635, 363, 686];Person:[369, 285, 603, 903],[206, 418, 376, 646],[183, 428, 307, 650],[102, 387, 291, 741],[045, 409, 229, 753],[007, 431, 283, 740],[001, 405, 146, 740],[001, 743, 099, 794].') 2023-11-03 17:12:16.534 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:12:16.534 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[255, 111, 535, 394],[490, 108, 798, 351];Boots:[272, 399, 356, 626],[347, 393, 446, 585],[446, 383, 537, 555],[534, 388, 636, 556];Dog:[189, 542, 929, 1000].') 2023-11-03 17:12:40.746 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:12:40.746 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[017, 001, 389, 998] Pregnant womens coat winter pregnancy loose show thin hundred with the medium-long version of autumn and winter fattening womens clothing size hair coat coat coat') 2023-11-03 17:12:42.703 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:12:42.704 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Goxhill Primary School was one of only two in North Lincolnshire to stay open for the full day today, with [004, 385, 996, 995] pupils wearing [320, 788, 493, 976] [677, 722, 827, 910] [559, 742, 696, 911] fancy dress to mark World Book Day') 2023-11-03 17:13:01.708 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:13:01.708 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[000, 373, 189, 875],carFrame1:[662, 523, 758, 663],carFrame1:[381, 494, 470, 643] in this video clip.Do not deviate from the established classFrame t:[xmin,ymin,xmax,ymax] format in your answer.', 'carFrame1[000, 373, 189, 875];Frame2:No object,carFrame1:[662, 523, 758, 663];Frame2:[716, 499, 909, 750],carFrame1:[381, 494, 470, 643];Frame2:[163, 457, 400, 739].') 2023-11-03 17:13:02.562 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:13:02.563 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[033, 031, 953, 998] Girl holds [161, 481, 817, 785] sign protesting the rise in road tolls at the Turun rally in KL on 31.12.2013') 2023-11-03 17:13:27.428 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:13:27.428 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'tower:[018, 267, 217, 831],[168, 625, 189, 759],[229, 631, 298, 737],[270, 661, 321, 751],[335, 570, 529, 884],[479, 463, 554, 613],[479, 564, 585, 858],[710, 607, 757, 697],[787, 637, 823, 706],[868, 498, 996, 687];skyscraper:[001, 184, 185, 870],[482, 466, 528, 610],[865, 501, 998, 689].') 2023-11-03 17:13:36.728 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:13:36.729 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Lamp:[298, 401, 418, 675];Bed:[372, 591, 879, 786],[083, 677, 880, 1000];Pillow:[362, 517, 438, 647],[122, 562, 272, 759],[180, 569, 360, 760],[382, 539, 501, 647];Radiator:[001, 593, 062, 1000].') 2023-11-03 17:13:39.663 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:13:39.663 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'window:[709, 214, 959, 515].') 2023-11-03 17:14:04.280 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:14:04.281 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Flag:[018, 454, 166, 1000],[308, 240, 396, 652],[062, 330, 182, 610],[773, 369, 809, 460],[555, 002, 853, 352],[360, 188, 388, 307],[497, 176, 524, 235],[516, 312, 542, 431],[542, 329, 572, 456];Umbrella:[127, 579, 397, 936],[385, 526, 584, 679],[494, 558, 960, 979],[807, 707, 998, 1000],[805, 477, 1000, 685],[908, 330, 998, 404],[738, 441, 932, 562],[617, 481, 726, 594],[001, 524, 131, 632],[045, 472, 133, 558],[001, 388, 128, 488],[159, 398, 415, 549],[391, 437, 583, 593],[117, 398, 225, 505],[335, 403, 451, 462],[559, 392, 670, 477],[434, 385, 572, 493];Person:[360, 879, 504, 997],[385, 653, 483, 888],[649, 907, 829, 998],[569, 478, 631, 633],[677, 435, 721, 487],[609, 462, 641, 531],[135, 586, 175, 674],[132, 895, 179, 999],[468, 668, 546, 1000],[538, 619, 606, 992],[681, 528, 719, 592];Hat:[576, 476, 611, 515],[683, 528, 718, 583].') 2023-11-03 17:14:08.851 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:14:08.851 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'clothing:[685, 021, 952, 894],[006, 169, 251, 650],[358, 128, 638, 907];person:[011, 032, 246, 997],[350, 000, 627, 975],[690, 022, 971, 999];hiking equipment:[147, 371, 198, 997],[174, 360, 254, 999],[395, 343, 455, 990],[668, 455, 780, 979];tree:[410, 000, 999, 198].') 2023-11-03 17:14:38.063 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:14:38.063 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[457, 404, 644, 652] and track its trajectory.Stick to the Frame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'This is a white bicycle ride ridden by a girlFrame1:[457, 404, 644, 652];Frame2:[396, 418, 581, 654];Frame3:[407, 444, 596, 688];Frame4:[405, 465, 596, 708].') 2023-11-03 17:14:48.671 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:14:48.671 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'shelf:[815, 226, 977, 775];Frame:[828, 681, 885, 768],[676, 642, 750, 746];Chair:[130, 522, 365, 957],[377, 503, 425, 617];Lamp:[643, 128, 749, 775];Carpet:[082, 728, 875, 999];Pillow:[186, 406, 265, 475],[120, 436, 214, 523];Sneakers:[259, 911, 328, 987],[371, 867, 475, 924];Camera:[372, 384, 462, 447],[757, 208, 872, 318];Tripod:[682, 322, 823, 999],[645, 166, 769, 749];Person:[166, 340, 475, 987],[837, 692, 876, 756].') 2023-11-03 17:15:27.388 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:15:27.388 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Air Conditioner in this image.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Air Conditioner:[488, 001, 696, 096].') 2023-11-03 17:15:59.469 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:15:59.469 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Hat:[364, 462, 407, 519],[173, 465, 225, 519];Satchel:[415, 745, 513, 822];Speaker:[575, 679, 728, 871],[242, 677, 417, 875];Person:[107, 478, 248, 785],[165, 464, 275, 784],[245, 494, 317, 676],[308, 495, 432, 745],[355, 532, 544, 808],[524, 564, 588, 803],[540, 499, 662, 735],[663, 522, 830, 825],[854, 504, 915, 779],[857, 676, 902, 806],[802, 455, 889, 823],[662, 494, 731, 630],[361, 460, 467, 624],[449, 439, 544, 619],[843, 432, 892, 627],[727, 477, 780, 532],[378, 406, 471, 523],[447, 416, 487, 484],[508, 439, 558, 547],[652, 458, 703, 547],[687, 422, 728, 485],[949, 441, 988, 503],[644, 398, 689, 473],[611, 458, 650, 515],[764, 388, 800, 462].') 2023-11-03 17:16:01.973 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:16:01.974 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'computer keyboard:[035, 482, 479, 707].') 2023-11-03 17:16:09.244 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:16:09.245 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Police officers [452, 558, 499, 663] [517, 529, 563, 664] [686, 533, 728, 702] [649, 533, 689, 701] [368, 545, 411, 655] [724, 522, 788, 708] [605, 545, 649, 695] [821, 551, 876, 717] searching a new area of the Milton landfill site') 2023-11-03 17:16:15.253 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:16:15.254 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Leah and Arthur Winter walk down the aisle on June 21, 1949. [230, 237, 794, 953] The Salinas couple celebrated their 70th wedding anniversary June 21, 2019.') 2023-11-03 17:16:54.838 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:16:54.839 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Blue metal roof [002, 097, 996, 992] on an industrial building.') 2023-11-03 17:17:02.773 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:17:02.774 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'tower:[134, 415, 317, 848],[000, 000, 084, 808],[201, 000, 268, 167],[411, 374, 533, 874];skyscraper:[000, 049, 063, 744],[180, 237, 318, 432],[194, 420, 344, 854],[196, 000, 258, 178],[236, 055, 315, 186],[285, 189, 407, 312],[323, 011, 388, 094],[365, 313, 499, 629],[378, 401, 439, 752],[431, 043, 554, 197],[435, 380, 523, 867],[478, 225, 625, 442],[512, 403, 544, 736],[540, 071, 662, 290],[568, 650, 822, 920],[609, 481, 717, 661],[615, 764, 820, 999],[662, 271, 790, 368],[683, 167, 766, 298],[701, 090, 820, 252],[750, 429, 849, 640],[898, 629, 961, 843],[903, 046, 980, 242],[916, 501, 999, 722].') 2023-11-03 17:17:12.785 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:17:12.785 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Person:[100, 471, 737, 999],[398, 549, 746, 999];Street Lights:[244, 415, 317, 635].') 2023-11-03 17:17:32.698 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:17:32.698 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[454, 450, 544, 875] and track its trajectory.To maintain consistency, ensure that trajectories in your answer match the Frame t:[xmin,ymin,xmax,ymax] setup.', 'This is a personFrame1:[454, 450, 544, 875];Frame2:[448, 454, 544, 862];Frame3:[456, 447, 549, 860];Frame4:[450, 462, 544, 860].') 2023-11-03 17:17:56.767 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 17:17:56.767 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Person:[352, 355, 617, 999].') 2023-11-03 17:17:57.921 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:17:57.921 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[019, 429, 197, 825],[176, 580, 625, 970],[149, 596, 480, 924],[491, 602, 923, 944],[673, 211, 875, 886],[471, 258, 703, 899],[421, 260, 578, 921],[369, 072, 859, 607];Sneakers:[375, 550, 456, 601],[315, 583, 409, 635],[320, 600, 445, 631],[613, 553, 668, 600],[591, 602, 673, 646],[639, 603, 706, 648],[514, 933, 625, 968].') 2023-11-03 17:17:59.425 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:17:59.426 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Bench:[311, 531, 561, 842],[197, 398, 349, 551],[075, 325, 205, 414],[007, 225, 080, 349],[777, 159, 907, 249],[655, 130, 774, 207];Stroller:[808, 016, 927, 140];Bicycle:[896, 025, 983, 149],[815, 053, 887, 179];Trolley:[597, 468, 820, 905];Backpack:[811, 316, 948, 553],[150, 410, 215, 521];Person:[779, 232, 891, 774],[677, 325, 769, 585],[474, 360, 623, 811],[367, 358, 479, 776],[360, 317, 468, 572],[565, 300, 678, 501],[456, 253, 523, 411],[343, 199, 429, 356],[274, 161, 368, 300],[219, 140, 304, 268],[279, 280, 379, 504],[238, 239, 318, 552],[200, 207, 265, 520],[122, 174, 219, 378],[075, 162, 141, 348],[068, 147, 145, 303],[075, 063, 143, 172],[001, 121, 083, 353],[463, 002, 513, 169],[707, 041, 758, 192],[842, 002, 904, 099].') 2023-11-03 17:18:19.794 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:18:19.795 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[578, 584, 630, 642],[768, 571, 808, 639],[364, 565, 412, 633];Sailboat:[002, 351, 042, 603],[107, 251, 210, 632],[076, 125, 368, 722],[367, 273, 478, 650],[546, 246, 684, 664],[722, 275, 857, 668],[844, 299, 1000, 681].') 2023-11-03 17:18:24.116 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:18:24.116 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Pillow:[037, 362, 399, 675],[201, 138, 421, 407],[388, 169, 697, 380],[338, 001, 610, 172];Chair:[717, 001, 966, 389];Nightstand:[001, 629, 168, 997];Bed:[157, 215, 999, 1000];Book:[802, 219, 895, 278];Boots:[121, 828, 219, 999];Handbag:[850, 507, 1000, 937];Camera:[786, 663, 876, 760];Stuffed Toy:[711, 414, 834, 533];Wallet:[779, 396, 851, 456],[808, 800, 903, 951].') 2023-11-03 17:18:44.568 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:18:44.569 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[512, 136, 859, 368] heart-shaped white chocolate raspberry scones on [003, 053, 989, 940] a gold baking pan') 2023-11-03 17:18:45.274 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:18:45.274 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Picture of kids bored during the summer months and eating [358, 740, 703, 900] [746, 716, 997, 856] [002, 740, 255, 878] watermelon and [378, 052, 615, 292] the words "Help! My kids are home for the summer!" above [440, 351, 625, 634] [005, 432, 210, 687] [776, 329, 956, 642] the kids\' heads') 2023-11-03 17:20:22.063 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:20:22.063 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[002, 003, 998, 999],[335, 003, 998, 998],[880, 187, 998, 606];Ring:[584, 838, 626, 886].') 2023-11-03 17:20:30.684 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:20:30.684 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Healthy vegan breakfast [162, 082, 966, 944]. dessert. alternative food. pudding with chia seeds, fresh strawberries, blackberries and mint. on a dark stone background, in an old wooden tray. copyspace') 2023-11-03 17:20:34.749 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:20:34.750 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[760, 498, 823, 604],personFrame1:[031, 433, 071, 570],personFrame1:[350, 600, 383, 738],personFrame1:[332, 611, 383, 758],personFrame1:[845, 536, 909, 658],personFrame1:[446, 437, 486, 555],personFrame1:[764, 580, 808, 713] in this video clip.To respond correctly, utilize the specified classFrame t:[xmin,ymin,xmax,ymax] format.', 'personFrame1:[760, 498, 823, 604];Frame2:[746, 500, 796, 608],personFrame1[031, 433, 071, 570];Frame2:No object,personFrame1:[350, 600, 383, 738];Frame2:[296, 605, 353, 750],personFrame1:[332, 611, 383, 758];Frame2:[303, 619, 348, 763],personFrame1:[845, 536, 909, 658];Frame2:[826, 531, 875, 668],personFrame1:[446, 437, 486, 555];Frame2:[393, 445, 427, 568],personFrame1:[764, 580, 808, 713];Frame2:[725, 566, 766, 702].') 2023-11-03 17:20:47.296 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:20:47.296 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'headphones:[171, 125, 676, 840].') 2023-11-03 17:21:04.374 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 17:21:04.374 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'person:[480, 246, 944, 925];drum:[000, 760, 415, 999],[201, 479, 549, 929],[440, 822, 725, 999];microphone:[596, 671, 647, 836],[857, 273, 999, 436].') 2023-11-03 17:21:35.565 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:21:35.566 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "[058, 414, 199, 522] [815, 259, 915, 366] [402, 407, 594, 523] [257, 265, 369, 366] [630, 258, 735, 368] [240, 407, 381, 525] [429, 264, 576, 368] Dos and Don'ts of wearing [268, 776, 340, 811] [282, 299, 352, 338] [279, 435, 342, 469] [842, 625, 909, 662] [073, 763, 148, 806] [471, 764, 537, 805] [109, 458, 173, 492] [837, 300, 908, 343] [650, 304, 730, 349] [443, 306, 563, 364] a medical mask safely") 2023-11-03 17:21:42.751 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:21:42.751 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Gift ideas [492, 261, 981, 823] [007, 147, 518, 718] for Shakespeare lovers include this cookie cutter if they're a baker.") 2023-11-03 17:21:49.546 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:21:49.547 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Cup:[678, 497, 699, 521],[765, 537, 797, 567],[853, 493, 886, 524];Person:[686, 154, 723, 222],[737, 149, 756, 199],[805, 116, 822, 152],[818, 148, 842, 184];Bakset:[884, 007, 951, 163];Lamp:[438, 001, 574, 270];Picture:[676, 135, 738, 231],[849, 085, 926, 175],[868, 239, 912, 329],[840, 407, 885, 446],[886, 404, 907, 440],[776, 306, 811, 343],[754, 152, 801, 203];shelf:[627, 149, 966, 888];Power outlet:[951, 690, 971, 734];Desk:[327, 556, 819, 1000].') 2023-11-03 17:21:52.475 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:21:52.476 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Microphone:[300, 257, 474, 400];Person:[451, 115, 954, 999];Chair:[761, 641, 999, 999];Blackboard:[282, 173, 999, 636].') 2023-11-03 17:22:00.155 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:22:00.155 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[315, 002, 999, 725] and track its trajectory.Any trajectory details should be arranged according to the Frame t:[xmin,ymin,xmax,ymax] format for clarity.', 'This is a leopardFrame1:[315, 002, 999, 725];Frame2:[359, 011, 914, 776];Frame3:[352, 111, 907, 761];Frame4:[406, 001, 887, 906];Frame5:[349, 001, 731, 998].') 2023-11-03 17:22:04.315 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:22:04.316 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Lamp:[424, 038, 593, 480],[666, 093, 689, 133];Chair:[165, 735, 297, 957],[320, 700, 456, 956],[634, 745, 799, 963],[825, 795, 976, 961],[534, 715, 628, 961],[523, 662, 573, 726];Mirror:[891, 343, 973, 573],[050, 356, 115, 550];Flower:[025, 384, 112, 572];Speaker:[789, 493, 813, 552],[890, 472, 930, 565];Person:[928, 667, 975, 770],[879, 580, 970, 736],[807, 650, 955, 958],[796, 611, 863, 695],[725, 613, 776, 700],[648, 606, 675, 709],[631, 609, 787, 947],[541, 601, 578, 667],[611, 596, 661, 712],[576, 596, 618, 670],[521, 610, 655, 906],[374, 605, 444, 710],[352, 584, 395, 674],[303, 580, 345, 688],[288, 606, 395, 872],[243, 566, 297, 700],[165, 588, 304, 912],[136, 501, 206, 648],[075, 553, 137, 684],[025, 578, 086, 714],[047, 557, 078, 630],[072, 549, 108, 630],[888, 534, 961, 618],[735, 513, 784, 640];Cup:[809, 715, 831, 774];Bed:[027, 695, 292, 847].') 2023-11-03 17:22:20.806 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:22:20.807 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'This will take you to [269, 264, 643, 461] [288, 715, 662, 909] the VPN configuration dialog. 3. On the VPNs to Host tab, click the Add button to add a new VPN configuration. This will take you to the VPNs to Host configuration dialog. 4.') 2023-11-03 17:23:04.516 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:23:04.516 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Couch and Pillow in this image.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Couch:[001, 496, 998, 997];Pillow:[879, 701, 983, 799],[811, 726, 999, 883],[856, 786, 999, 926].') 2023-11-03 17:23:41.457 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:23:41.457 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Person:[349, 664, 386, 746].') 2023-11-03 17:23:51.046 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:23:51.047 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[000, 558, 064, 716],personFrame1:[389, 555, 437, 747],personFrame1:[333, 536, 406, 697],personFrame1:[267, 526, 310, 718],personFrame1:[464, 620, 524, 844],personFrame1:[610, 469, 661, 648],personFrame1:[614, 566, 680, 770] in this video clip.Use only the categoryFrame t:[xmin,ymin,xmax,ymax] format to structure your answer.', 'personFrame1:[000, 558, 064, 716];Frame2:[097, 548, 145, 701],personFrame1:[389, 555, 437, 747];Frame2:[406, 544, 451, 723],personFrame1:[333, 536, 406, 697];Frame2:[388, 540, 425, 695],personFrame1:[267, 526, 310, 718];Frame2:[334, 534, 372, 698],personFrame1:[464, 620, 524, 844];Frame2:[490, 604, 539, 812],personFrame1:[610, 469, 661, 648];Frame2:[646, 479, 692, 650],personFrame1:[614, 566, 680, 770];Frame2:[644, 575, 700, 762].') 2023-11-03 17:23:51.864 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:23:51.865 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[191, 406, 512, 994],[473, 001, 999, 1000];Flag:[345, 001, 414, 186];Glasses:[594, 041, 798, 270];Hat:[481, 001, 986, 243];Handbag:[157, 746, 212, 810];Street Lights:[232, 122, 355, 487].') 2023-11-03 17:24:07.285 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:24:07.285 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Person:[254, 187, 357, 382],[318, 173, 426, 393],[407, 155, 508, 399],[489, 137, 608, 420],[602, 118, 752, 444],[747, 088, 951, 482];Hat:[831, 088, 917, 141],[656, 118, 730, 164],[524, 142, 591, 181],[435, 158, 491, 192],[363, 174, 411, 204],[301, 186, 345, 210];Bakset:[532, 516, 740, 974],[052, 355, 171, 543];Grapefruit:[521, 498, 564, 525],[421, 464, 460, 489],[343, 442, 364, 460],[280, 424, 307, 444];Flower:[470, 272, 793, 811],[032, 269, 174, 495];Vase:[531, 674, 739, 973],[054, 440, 148, 545];Desk:[438, 510, 850, 828],[185, 419, 318, 533],[282, 459, 524, 666].') 2023-11-03 17:24:35.736 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:24:35.736 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Actions to be taken for suspected use or abuse by employees or volunteers: If an employee or volunteer is suspected of violating this policy or any other related policies, the College [000, 004, 998, 995] will initiate') 2023-11-03 17:25:15.716 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:25:15.717 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Person:[679, 419, 999, 1000],[367, 421, 708, 999],[360, 416, 488, 999],[048, 533, 443, 999],[002, 139, 167, 999].') 2023-11-03 17:25:17.312 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:25:17.313 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Crafting recipe [293, 158, 681, 843] for a new item, the Soul Mirror.') 2023-11-03 17:25:19.914 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:25:19.914 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[000, 529, 073, 714],carFrame1:[596, 533, 844, 867],carFrame1:[288, 546, 358, 617],carFrame1:[067, 495, 121, 554],carFrame1:[000, 500, 099, 577] in this video clip.Ensure you use the exact format categoryFrame t:[xmin,ymin,xmax,ymax] in your response.', 'carFrame1:[000, 529, 073, 714];Frame2:[000, 570, 032, 725],carFrame1:[596, 533, 844, 867];Frame2:[597, 526, 879, 898],carFrame1:[288, 546, 358, 617];Frame2:[275, 542, 347, 612],carFrame1:[067, 495, 121, 554];Frame2:[046, 488, 104, 551],carFrame1:[000, 500, 099, 577];Frame2:[000, 493, 078, 577].') 2023-11-03 17:25:33.966 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:25:33.968 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Paddle:[494, 681, 526, 725];Person:[292, 624, 386, 786],[449, 628, 491, 720],[822, 588, 874, 748],[974, 534, 999, 708];Boat:[070, 687, 242, 802],[194, 719, 351, 820],[416, 703, 505, 809],[822, 678, 999, 786];Sailboat:[812, 178, 983, 526].') 2023-11-03 17:25:39.011 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:25:39.011 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'man:[757, 583, 988, 851],[045, 485, 145, 550],[099, 713, 583, 953],[291, 559, 414, 600],[319, 444, 396, 520],[319, 629, 589, 746],[422, 494, 545, 548],[617, 410, 667, 494],[666, 408, 694, 504],[688, 422, 750, 511],[698, 533, 868, 595],[852, 473, 945, 525],[873, 406, 909, 470];clothing:[766, 618, 984, 838],[000, 507, 138, 551],[033, 561, 223, 616],[173, 595, 360, 718],[227, 821, 454, 933],[320, 452, 384, 491],[407, 639, 593, 718],[436, 503, 553, 553],[617, 417, 666, 492],[705, 531, 828, 579],[745, 558, 850, 603],[848, 507, 915, 529].') 2023-11-03 17:25:53.329 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:25:53.330 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect wheel,clothing,person,plant and skull in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'wheel:[595, 645, 718, 801],[000, 835, 078, 999],[019, 615, 048, 774],[059, 701, 195, 925],[126, 913, 176, 999],[294, 632, 395, 698],[346, 672, 468, 803],[347, 800, 519, 999],[481, 579, 540, 685],[692, 774, 775, 966],[767, 606, 864, 760],[770, 858, 830, 995],[945, 798, 999, 949];clothing:[411, 377, 485, 595],[240, 429, 273, 584],[725, 430, 760, 541],[869, 432, 941, 637];person:[183, 374, 278, 601],[403, 362, 481, 615],[725, 395, 778, 565],[738, 370, 764, 477],[865, 377, 959, 656],[868, 380, 906, 615];plant:[085, 640, 476, 844],[298, 554, 491, 659],[555, 560, 738, 645],[706, 859, 905, 999],[902, 936, 998, 999].') 2023-11-03 17:26:04.770 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:26:04.770 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Person:[129, 058, 945, 933];Gloves:[347, 492, 486, 625],[143, 293, 280, 443];Helmet:[321, 057, 499, 175];Sneakers:[577, 764, 716, 931],[703, 767, 946, 897];Hockey Stick:[228, 402, 849, 967];Other Balls:[743, 946, 803, 965].') 2023-11-03 17:27:47.449 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:27:47.450 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'raccoon:[220, 203, 722, 681].') 2023-11-03 17:27:51.752 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:27:51.753 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[002, 164, 460, 997] A teenager and [447, 117, 897, 997] her grandmother smile at each other and talk saving money around the holidays.') 2023-11-03 17:28:00.419 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:28:00.419 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Bowl,Cup,Napkin and Bread in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Bowl:[484, 444, 540, 487];Cup:[080, 314, 182, 459],[642, 311, 717, 454];Napkin:[000, 690, 161, 935],[712, 673, 1000, 821];Bread:[726, 505, 810, 558],[410, 548, 543, 622],[145, 548, 228, 611].') 2023-11-03 17:28:16.551 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:28:16.551 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Pen,Chair and Piano in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Pen:[567, 812, 595, 845];Chair:[104, 523, 1000, 999].') 2023-11-03 17:28:55.839 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:28:55.840 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the turn and take out a doll then move outside the room.To ensure accuracy, apply the TrackiFrame t:[xmin,ymin,xmax,ymax] template for every trajectory in your response.', 'There is one turn and take out a doll then move outside the room.Track1frame:1:[092, 001, 505, 988];frame:2:[092, 001, 505, 988];frame:3:[119, 001, 455, 998].') 2023-11-03 17:29:05.713 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:29:05.714 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Roll, count and fill in the heart-shaped cookie cutters in this fun Valentine's game [209, 459, 772, 890] for kids [702, 116, 997, 633]") 2023-11-03 17:29:10.158 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:29:10.158 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[598, 291, 727, 486],carFrame1:[109, 351, 148, 461],carFrame1:[584, 308, 628, 429],carFrame1:[245, 309, 304, 404],carFrame1:[370, 305, 429, 390],motorcycleFrame1:[783, 375, 1000, 675],carFrame1:[544, 323, 579, 388] in this video clip.For clarity in your response, always adopt the classFrame t:[xmin,ymin,xmax,ymax] format.', 'carFrame1:[598, 291, 727, 486];Frame2:[749, 210, 999, 720],carFrame1[109, 351, 148, 461];Frame2:No object,carFrame1:[584, 308, 628, 429];Frame2:[657, 297, 802, 543],carFrame1:[245, 309, 304, 404];Frame2:[073, 307, 208, 461],carFrame1:[370, 305, 429, 390];Frame2:[265, 291, 381, 449],motorcycleFrame1[783, 375, 1000, 675];Frame2:No object,carFrame1:[544, 323, 579, 388];Frame2:[567, 324, 629, 423].') 2023-11-03 17:31:46.192 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:31:46.193 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'In [017, 009, 980, 989] modern war... you will die like a dog for no good reason. [005, 030, 352, 991] - Ernest Hemingway') 2023-11-03 17:31:59.295 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 17:31:59.296 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Handbag and Person in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Satchel:[062, 572, 107, 638];Person:[178, 061, 877, 998],[036, 473, 136, 770],[091, 467, 150, 757].') 2023-11-03 17:32:11.139 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:32:11.140 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect clothing,man and boat in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'clothing:[385, 574, 686, 746];man:[341, 578, 695, 737];boat:[101, 572, 919, 802].') 2023-11-03 17:32:23.206 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:32:23.207 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Men love bread so get him [208, 090, 795, 907] a bread tin for your traditional 10th anniversary tin gift') 2023-11-03 17:32:54.283 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:32:54.284 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Hat:[217, 710, 269, 756],[001, 796, 050, 846],[048, 746, 140, 818],[409, 736, 505, 803],[355, 819, 476, 973],[785, 834, 975, 989];Person:[001, 797, 065, 979],[068, 751, 201, 991],[341, 823, 489, 991],[426, 737, 546, 991],[820, 839, 978, 990],[657, 742, 837, 991],[646, 677, 763, 912],[167, 715, 281, 878],[243, 685, 319, 808],[275, 661, 334, 784],[308, 671, 349, 764],[355, 669, 399, 733],[371, 619, 481, 786],[513, 575, 587, 743],[455, 591, 516, 677],[610, 543, 702, 737],[901, 535, 998, 713],[910, 538, 991, 671],[864, 501, 955, 646],[825, 497, 875, 617],[848, 353, 914, 533],[963, 339, 999, 539],[907, 382, 947, 504],[354, 592, 406, 687],[057, 673, 102, 723],[131, 657, 170, 708],[221, 623, 259, 675],[705, 523, 759, 636],[625, 544, 675, 654],[497, 572, 535, 645].') 2023-11-03 17:33:38.308 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:33:38.308 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Washington Elementary students [255, 314, 354, 612] [534, 343, 648, 551] [511, 515, 792, 995] [398, 338, 576, 590] [312, 520, 601, 995] [739, 440, 909, 996] [002, 421, 197, 784] learn bicycle safety in an after school program.') 2023-11-03 17:34:16.626 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:34:16.626 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Diana [415, 031, 727, 996], Princess of Wales lays a ceremonial foundation stone in a wall [003, 366, 540, 994] at the Royal Marsden Hospital in London, 6th June 1990. She is wearing a suit...') 2023-11-03 17:34:21.263 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:34:21.264 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Paper Texture Brushes for Procreate') 2023-11-03 17:34:32.642 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:34:32.642 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'person:[142, 268, 283, 564],[303, 291, 438, 599],[486, 303, 636, 654],[730, 275, 896, 645];horse:[096, 368, 290, 793],[300, 332, 458, 814],[492, 394, 601, 812],[760, 356, 867, 841];plant:[000, 431, 999, 999].') 2023-11-03 17:34:47.941 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:34:47.941 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'human body:[364, 093, 881, 995],[000, 751, 220, 999];human head:[052, 777, 177, 934],[521, 122, 653, 385];man:[290, 062, 881, 998],[000, 745, 258, 999];human arm:[306, 393, 493, 967],[000, 920, 058, 999],[197, 923, 228, 999],[714, 406, 878, 774];microphone:[625, 238, 723, 343];human hand:[302, 864, 376, 996],[790, 613, 877, 744];guitar:[428, 446, 999, 999];human hair:[076, 754, 185, 885],[448, 000, 694, 283];clothing:[025, 893, 197, 999],[445, 363, 725, 978];human face:[545, 150, 655, 366].') 2023-11-03 17:35:33.046 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:35:33.046 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Potted Plant:[155, 648, 203, 704],[495, 723, 605, 910];Person:[036, 596, 077, 646],[033, 602, 069, 702].') 2023-11-03 17:35:56.319 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:35:56.319 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'man:[185, 105, 432, 965],[471, 353, 677, 999];clothing:[184, 106, 432, 952],[506, 491, 681, 999].') 2023-11-03 17:36:21.027 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:36:21.028 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Handbag:[640, 423, 703, 581];Stool:[381, 324, 453, 433];Lamp:[115, 080, 215, 468];Couch:[002, 469, 649, 1000];Carpet:[508, 770, 961, 1000];Chair:[842, 381, 999, 725];Storage box:[216, 477, 309, 539];Pillow:[209, 533, 297, 687];Person:[580, 699, 818, 1000],[855, 699, 946, 909],[835, 345, 1000, 779],[382, 381, 654, 752],[504, 107, 608, 594];Refrigerator:[197, 136, 299, 394];Microwave:[379, 144, 451, 202].') 2023-11-03 17:36:25.921 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:36:25.921 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[404, 501, 445, 576],truckFrame1:[308, 510, 360, 602],truckFrame1:[088, 432, 309, 676],carFrame1:[000, 502, 174, 807],carFrame1:[347, 512, 392, 585],truckFrame1:[554, 474, 721, 674],carFrame1:[572, 457, 999, 993] in this video clip.For clarity in your response, always adopt the classFrame t:[xmin,ymin,xmax,ymax] format.', 'carFrame1:[404, 501, 445, 576];Frame2:[417, 510, 460, 588],truckFrame1:[308, 510, 360, 602];Frame2:[326, 518, 373, 612],truckFrame1:[088, 432, 309, 676];Frame2:[088, 432, 318, 694],carFrame1:[000, 502, 174, 807];Frame2:[000, 510, 163, 847],carFrame1:[347, 512, 392, 585];Frame2:[361, 521, 404, 593],truckFrame1:[554, 474, 721, 674];Frame2:[580, 478, 800, 740],carFrame1:[572, 457, 999, 993];Frame2:[904, 564, 999, 939].') 2023-11-03 17:36:37.768 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:36:37.768 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video clip including frame1,frame2 and frame3,please tell me what is thisFrame1:[633, 508, 669, 595] and track its trajectory.If you're including trajectory details in your reply, the Frame t:[xmin,ymin,xmax,ymax] format is imperative.", 'This is a personFrame1:[633, 508, 669, 595];Frame2:[615, 513, 651, 596];Frame3:[497, 510, 526, 597].') 2023-11-03 17:36:40.995 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:36:40.996 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the the rabbit standing upright on its hind legs. in this video clip?When detailing trajectories in your response, adhere to the TrackiFrame t:[xmin,ymin,xmax,ymax] format.', 'There is one the rabbit standing upright on its hind legs..Track1frame:1:[349, 000, 645, 494];frame:2:[302, 000, 636, 449];frame:3:[275, 000, 600, 461].') 2023-11-03 17:36:55.386 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:36:55.386 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Person:[455, 582, 545, 774].') 2023-11-03 17:37:51.432 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:37:51.432 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect furniture,swimming pool and motorcycle in this image.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'furniture:[000, 640, 028, 805],[086, 569, 270, 811],[173, 540, 273, 714],[411, 506, 540, 690],[494, 506, 640, 672],[546, 512, 603, 612],[577, 497, 655, 621],[623, 493, 750, 625],[730, 487, 785, 601],[753, 482, 804, 575],[871, 476, 919, 567];swimming pool:[214, 649, 999, 999].') 2023-11-03 17:37:55.491 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:37:55.491 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A smiling child [001, 003, 997, 845] eating a plate of meatballs stroganoff [380, 647, 997, 905] giving the viewer a thumbs [091, 494, 171, 626] up.') 2023-11-03 17:37:58.309 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:37:58.309 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[404, 301, 532, 875] and track its trajectory.Ensure that each trajectory within your response is articulated using the Frame t:[xmin,ymin,xmax,ymax] specification.', 'This is a personFrame1:[404, 301, 532, 875];Frame2:[403, 288, 528, 850];Frame3:[407, 297, 538, 877];Frame4:[416, 298, 527, 854];Frame5:[416, 280, 528, 830].') 2023-11-03 17:38:28.691 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:38:28.692 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'flag:[017, 106, 149, 362],[023, 580, 138, 777],[113, 411, 229, 496],[481, 000, 608, 338],[791, 187, 891, 279],[894, 235, 958, 310];bicycle:[036, 718, 140, 999],[141, 661, 255, 996],[233, 676, 310, 999],[336, 672, 458, 999],[440, 650, 531, 999],[523, 650, 660, 999],[650, 650, 765, 970];bicycle helmet:[226, 365, 283, 434],[280, 370, 325, 450],[355, 345, 426, 415],[453, 393, 508, 461],[560, 360, 623, 428],[661, 367, 713, 434];man:[116, 367, 265, 959],[228, 347, 308, 965],[272, 361, 328, 954],[303, 369, 363, 884],[315, 335, 464, 947],[442, 381, 530, 929],[492, 356, 563, 496],[520, 363, 656, 968],[646, 363, 746, 886],[738, 469, 803, 692],[751, 333, 816, 571],[957, 367, 999, 997];tree:[000, 000, 213, 383],[115, 000, 640, 439],[613, 000, 999, 408];footwear:[293, 869, 326, 948],[431, 818, 454, 901],[538, 806, 566, 887],[609, 909, 643, 976],[648, 812, 676, 884].') 2023-11-03 17:38:29.622 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:38:29.624 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'boat:[112, 687, 322, 832],[000, 596, 088, 740],[000, 696, 143, 813],[000, 507, 643, 638],[045, 593, 235, 738],[247, 626, 374, 703],[253, 626, 361, 692],[305, 650, 402, 723],[340, 688, 507, 827],[476, 638, 551, 721],[505, 622, 661, 827],[506, 617, 591, 675],[578, 593, 657, 684],[620, 558, 741, 754],[685, 461, 762, 534],[720, 681, 866, 842],[795, 575, 917, 734],[870, 681, 999, 842],[884, 536, 943, 600],[905, 607, 999, 705].') 2023-11-03 17:39:29.372 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:39:29.373 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Development for an agriculture farm on Lago di Bolsena overlooking the lake [008, 445, 994, 595]') 2023-11-03 17:40:03.991 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:40:03.992 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[465, 514, 559, 950] a 38 special ammunition cartridge and [029, 035, 941, 762] box') 2023-11-03 17:40:07.094 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:40:07.095 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[574, 515, 596, 611] and track its trajectory.Incorporate the Frame t:[xmin,ymin,xmax,ymax] format when presenting trajectories in your response.', 'This is a personFrame1:[574, 515, 596, 611];Frame2:[721, 511, 746, 605];Frame3:[793, 501, 820, 594];Frame4:[858, 420, 900, 541].') 2023-11-03 17:40:20.897 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:40:20.897 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[194, 306, 742, 660] American swimmer Gertrude Ederle, the first woman to successfully swim the English Channel, 1926 (b/w photo') 2023-11-03 17:40:24.968 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:40:24.968 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'tree:[488, 271, 595, 425],[136, 205, 223, 359],[203, 186, 253, 329],[228, 278, 299, 382],[482, 278, 926, 534].') 2023-11-03 17:41:22.063 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:41:22.064 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Stool:[001, 525, 204, 665],[001, 373, 236, 575];shelf:[001, 000, 321, 379];Chicken:[342, 572, 668, 762];Slippers:[648, 634, 716, 720],[323, 631, 391, 703];Satchel:[484, 000, 707, 056];Hat:[415, 064, 622, 184];Person:[204, 066, 786, 759];Necklace:[526, 355, 561, 375];Ring:[440, 739, 465, 759];Basin:[334, 818, 435, 880],[436, 806, 537, 869],[561, 843, 662, 905],[381, 874, 489, 944],[488, 880, 592, 948],[484, 954, 591, 984];Pot:[806, 257, 999, 498],[217, 424, 385, 485];Bottle:[318, 210, 352, 256].') 2023-11-03 17:41:51.191 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:41:51.191 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track grocery_bagFrame1:[736, 409, 797, 575],shirtFrame1:[508, 345, 625, 666],shirtFrame1:[287, 698, 500, 1000] in this video clip.Do not deviate from the established classFrame t:[xmin,ymin,xmax,ymax] format in your answer.', 'grocery_bagFrame1:[736, 409, 797, 575];Frame2:[713, 394, 797, 609],shirtFrame1:[508, 345, 625, 666];Frame2:[475, 361, 592, 687],shirtFrame1:[287, 698, 500, 1000];Frame2:[287, 698, 500, 1000].') 2023-11-03 17:41:56.906 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:41:56.907 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Episode 28: Top vaccine scientist [002, 003, 995, 996] warns the world: HALT all covid-19 vaccinations immediately') 2023-11-03 17:42:28.466 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:42:28.467 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Hard-boiled eggs [126, 265, 881, 723] marinating in a heavenly soy sauce mixture.') 2023-11-03 17:42:31.591 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:42:31.591 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[118, 300, 558, 716] and track its trajectory.Ensure that each trajectory within your response is articulated using the Frame t:[xmin,ymin,xmax,ymax] specification.', "This is a przewalski's horseFrame1:[118, 300, 558, 716];Frame2:[230, 299, 611, 706];Frame3:[128, 324, 542, 722];Frame4:[060, 297, 493, 731].") 2023-11-03 17:42:49.950 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:42:49.950 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'glasses:[633, 360, 900, 502];woman:[000, 000, 589, 820];man:[428, 113, 999, 998];human face:[097, 038, 157, 122],[288, 186, 384, 300],[618, 174, 886, 756].') 2023-11-03 17:44:12.101 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 17:44:12.102 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Clock:[292, 097, 426, 246];Person:[001, 460, 061, 753],[336, 489, 388, 693],[397, 466, 477, 681],[445, 465, 498, 684],[476, 471, 531, 701],[527, 455, 602, 706],[690, 462, 763, 688];Trash bin Can:[702, 528, 793, 706],[958, 542, 1000, 683].') 2023-11-03 17:44:14.953 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:44:14.953 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[794, 337, 924, 751] [667, 413, 806, 780] Two boys watch [402, 187, 475, 395] cowboys warm up before their events in [000, 385, 995, 998] the arena outside the Great Southland Stampede Rodeo in 2008.') 2023-11-03 17:44:22.596 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 17:44:22.596 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'traffic sign:[153, 166, 558, 433];tree:[000, 000, 999, 536].') 2023-11-03 17:44:43.494 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:44:43.495 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect SUV in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'SUV:[442, 422, 623, 583].') 2023-11-03 17:44:58.155 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:44:58.156 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Hat,Guitar and Glasses in this image.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Hat:[472, 156, 684, 319],[767, 113, 999, 496];Guitar:[201, 711, 581, 1000];Glasses:[132, 389, 368, 538],[687, 246, 826, 420].') 2023-11-03 17:45:14.211 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 17:45:14.211 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '2021 Nitro boat for sale, model of the boat [185, 121, 985, 851] is Z18 Pro & Image # 24 of 41') 2023-11-03 17:45:38.807 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:45:38.807 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'man:[395, 157, 625, 959],[000, 201, 122, 833],[090, 264, 225, 776],[210, 223, 311, 724],[213, 251, 325, 754],[318, 229, 406, 696],[396, 253, 455, 663],[456, 221, 507, 306],[547, 245, 620, 683],[620, 255, 656, 354],[626, 212, 740, 853],[631, 269, 660, 510],[705, 268, 771, 741],[732, 260, 787, 722],[766, 012, 998, 998],[772, 195, 858, 741];clothing:[401, 195, 628, 942],[000, 280, 092, 748],[096, 332, 211, 706],[218, 310, 303, 702],[323, 288, 402, 667],[396, 299, 446, 584],[548, 306, 608, 630],[632, 280, 745, 833],[725, 299, 792, 702],[768, 223, 998, 998],[792, 275, 851, 378].') 2023-11-03 17:45:46.970 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:45:46.971 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[369, 397, 651, 660] and track its trajectory.Stick to the Frame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'This is a water cartFrame1:[369, 397, 651, 660];Frame2:[396, 314, 753, 643];Frame3:[351, 262, 800, 658];Frame4:[345, 254, 809, 661].') 2023-11-03 17:45:54.570 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:45:54.570 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Person:[074, 148, 546, 1000],[002, 217, 088, 456];Sneakers:[356, 900, 414, 996].') 2023-11-03 17:45:56.088 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 17:45:56.089 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'pretzel:[071, 239, 464, 723];doughnut:[478, 162, 938, 575].') 2023-11-03 17:46:14.261 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:46:14.261 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Person:[666, 211, 812, 929],[440, 575, 611, 997],[440, 495, 676, 1000],[001, 193, 198, 999],[093, 492, 134, 764],[132, 438, 244, 998],[230, 254, 343, 998];Hat:[705, 213, 791, 275];Boots:[700, 738, 779, 897],[763, 745, 807, 930];Glasses:[045, 267, 074, 341].') 2023-11-03 17:46:29.118 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:46:29.118 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Person:[436, 332, 642, 1000],[609, 003, 1000, 1000];Necklace:[504, 541, 556, 619],[791, 224, 1000, 521].') 2023-11-03 17:46:34.978 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:46:34.979 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Portrait of a twenty–five–year–old Japanese man named Sugiura Aizō, [010, 103, 943, 989] a samurai and officer on the 1864 Ikeda mission to France, 1864 (albumen print') 2023-11-03 17:46:36.816 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:46:36.816 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Chair:[280, 095, 321, 200],[174, 078, 234, 190],[046, 104, 101, 194],[398, 086, 446, 198],[449, 093, 490, 203],[500, 091, 536, 202],[536, 086, 579, 203],[579, 094, 625, 210],[632, 094, 671, 207],[674, 094, 714, 205],[711, 091, 757, 193];Baseball Bat:[617, 433, 667, 514];Baseball Glove:[741, 738, 783, 823];Person:[086, 130, 136, 232],[138, 143, 193, 247],[260, 164, 293, 241],[221, 161, 279, 311],[001, 435, 053, 517],[220, 462, 318, 823],[182, 735, 243, 814],[110, 715, 175, 813],[026, 692, 094, 808],[449, 402, 625, 876],[696, 531, 767, 721],[720, 577, 848, 884],[801, 519, 916, 888],[874, 550, 912, 643],[915, 538, 975, 666],[950, 551, 1000, 664],[897, 506, 945, 630],[769, 049, 826, 278],[001, 022, 056, 117],[055, 025, 101, 152],[175, 023, 226, 097],[240, 030, 295, 113],[928, 018, 999, 212],[737, 003, 777, 079],[674, 002, 715, 096],[327, 002, 382, 081],[273, 002, 330, 075],[106, 002, 182, 088];Helmet:[776, 576, 822, 648],[792, 517, 837, 575];Sneakers:[763, 829, 804, 883].') 2023-11-03 17:47:16.006 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:47:16.006 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3,frame4 and frame5,what is the track of the man wearing purple shirt and tan shorts?Use the specified TrackiFrame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'There is one man wearing purple shirt and tan shorts.Track1frame1:[067, 196, 313, 1000];frame2:[118, 207, 411, 1000].') 2023-11-03 17:47:16.825 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:47:16.826 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Central Asian national dish Lagman of meat, noodles and vegetables in [198, 155, 806, 895] bowl on [166, 638, 647, 940] a napkin, garlic, parsley on wooden board background') 2023-11-03 17:47:32.959 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:47:32.960 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect human leg,woman,human arm and girl in this image.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'human leg:[059, 515, 118, 753],[146, 541, 189, 739],[326, 398, 393, 765],[385, 414, 455, 808],[465, 403, 531, 778],[527, 345, 565, 646],[560, 092, 610, 250],[642, 073, 683, 254],[683, 398, 763, 807],[765, 403, 851, 866],[849, 331, 937, 720],[946, 432, 992, 642];woman:[827, 029, 991, 650],[305, 058, 453, 774],[339, 010, 539, 786],[649, 050, 888, 841];human arm:[670, 230, 760, 372],[033, 181, 062, 341],[060, 159, 185, 301],[293, 183, 328, 357],[383, 124, 513, 286],[512, 121, 563, 262],[610, 000, 680, 069],[810, 195, 884, 369],[895, 124, 986, 275];girl:[343, 023, 530, 798],[816, 000, 982, 721].') 2023-11-03 17:47:45.584 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:47:45.584 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Cabinet,Desk,Plate and Glasses in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Cabinet:[472, 001, 748, 124];Desk:[091, 401, 447, 626];Plate:[329, 787, 406, 838],[549, 834, 627, 884];Glasses:[318, 464, 397, 504],[712, 539, 813, 590].') 2023-11-03 17:47:55.907 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:47:55.908 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Traffic Light:[525, 492, 587, 665],[479, 509, 522, 609],[886, 495, 939, 619];Car:[000, 728, 045, 893];Street Lights:[048, 625, 088, 817],[612, 359, 660, 763],[757, 620, 795, 733],[687, 596, 714, 829];Van:[098, 731, 212, 851],[415, 713, 444, 754];Lamp:[095, 272, 131, 313];Person:[273, 750, 363, 1000],[725, 746, 805, 999],[755, 729, 829, 999],[702, 724, 748, 957].') 2023-11-03 17:48:05.819 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:48:05.819 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'table:[000, 095, 964, 1000];toy:[043, 233, 112, 276],[063, 254, 133, 291],[082, 259, 146, 302],[121, 285, 197, 336],[146, 295, 227, 359],[173, 320, 254, 378],[208, 325, 293, 395],[240, 350, 324, 419],[270, 376, 365, 450],[307, 390, 404, 478],[347, 422, 449, 512],[390, 455, 500, 543],[441, 482, 547, 578],[491, 515, 598, 611],[543, 547, 651, 659],[598, 585, 709, 692],[656, 622, 767, 741],[718, 656, 829, 784],[788, 703, 898, 832];land vehicle:[045, 232, 116, 276],[082, 261, 146, 300],[120, 280, 197, 339],[145, 294, 225, 358],[175, 318, 259, 380],[210, 323, 291, 400],[241, 350, 326, 415],[275, 378, 366, 449],[308, 390, 408, 476],[348, 425, 450, 513],[392, 452, 502, 540],[428, 472, 556, 590],[491, 515, 599, 610],[543, 549, 650, 652],[598, 583, 710, 694],[654, 614, 766, 743],[718, 660, 828, 786],[788, 699, 898, 833],[857, 741, 967, 886].') 2023-11-03 17:48:11.962 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:48:11.962 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'clothing:[713, 185, 879, 907],[000, 295, 023, 473],[000, 295, 060, 473],[046, 300, 085, 459],[059, 293, 109, 468],[103, 283, 124, 456],[109, 295, 148, 449],[141, 295, 176, 459],[169, 270, 192, 437],[190, 290, 219, 449],[214, 283, 258, 444],[252, 281, 290, 452],[271, 297, 315, 447],[297, 263, 341, 433],[328, 265, 346, 424],[338, 275, 386, 435],[369, 267, 400, 433],[379, 293, 505, 552],[390, 267, 418, 424],[408, 288, 445, 386],[474, 260, 519, 422],[508, 265, 555, 424],[543, 263, 573, 422],[573, 263, 611, 412],[601, 260, 629, 395],[616, 253, 646, 391],[843, 248, 890, 351],[867, 251, 938, 515],[917, 263, 994, 531];person:[000, 265, 022, 510],[000, 267, 081, 521],[046, 273, 078, 482],[061, 263, 128, 501],[091, 265, 131, 437],[103, 265, 145, 461],[144, 281, 188, 484],[156, 248, 210, 449],[186, 260, 222, 456],[205, 255, 262, 464],[248, 248, 285, 461],[266, 267, 317, 454],[294, 232, 332, 461],[323, 236, 358, 444],[343, 246, 387, 447],[363, 242, 528, 574],[370, 234, 403, 452],[389, 239, 433, 472],[415, 253, 453, 383],[478, 234, 518, 454],[509, 236, 551, 440],[543, 241, 570, 435],[577, 239, 615, 417],[609, 232, 632, 415],[623, 229, 653, 400],[637, 209, 999, 645],[706, 142, 874, 972].') 2023-11-03 17:48:32.939 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:48:32.939 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Flower:[315, 106, 603, 252];Vase:[351, 222, 580, 380],[851, 877, 999, 997].') 2023-11-03 17:48:44.909 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:48:44.909 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[404, 048, 489, 627] and track its trajectory.Your response should highlight trajectories using the established Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a personFrame1:[404, 048, 489, 627];Frame2:[404, 051, 489, 626];Frame3:[391, 070, 494, 622];Frame4:[400, 241, 548, 615];Frame5:[364, 000, 457, 591].') 2023-11-03 17:48:46.783 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:48:46.784 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[004, 008, 995, 983] Closeup of the harvest. The combination of black and green olives makes for tasty oil. Photo by Michael Imwalle.') 2023-11-03 17:48:49.459 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:48:49.459 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'man:[070, 015, 849, 983],[000, 501, 136, 993],[697, 492, 784, 590],[729, 506, 843, 688],[925, 499, 999, 740];sculpture:[070, 040, 873, 987];clothing:[090, 156, 838, 972],[745, 575, 972, 863],[925, 551, 999, 765];human face:[384, 050, 540, 240].') 2023-11-03 17:48:50.369 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 17:48:50.370 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Desk:[643, 499, 998, 678],[086, 683, 347, 918];Flower:[178, 725, 229, 806];Lifesaver:[100, 002, 157, 113];TV:[627, 002, 727, 050],[002, 002, 101, 088],[875, 025, 958, 103];Barrel:[345, 669, 391, 757],[689, 483, 743, 531];Cup:[238, 508, 274, 594];Hat:[360, 329, 413, 380],[436, 328, 501, 400];Person:[322, 470, 682, 998],[002, 596, 242, 999],[613, 579, 714, 842],[260, 355, 479, 676].') 2023-11-03 17:48:57.468 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:48:57.469 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Cabinet/shelf in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Cabinet:[097, 663, 192, 870],[359, 629, 468, 772].') 2023-11-03 17:48:58.835 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:48:58.835 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Lamp:[089, 003, 325, 109],[870, 551, 989, 832];Flower:[083, 380, 323, 673];Desk:[002, 704, 413, 999],[794, 799, 999, 1000];Carpet:[397, 814, 631, 998];Frame:[903, 067, 999, 355];Chair:[564, 658, 689, 906];Mirror:[334, 367, 428, 592].') 2023-11-03 17:50:01.601 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:50:01.602 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Desk:[002, 213, 999, 1000];Cup:[208, 458, 408, 731],[419, 346, 587, 518],[409, 438, 651, 704],[657, 385, 821, 727];Plate:[183, 621, 419, 759].') 2023-11-03 17:50:11.348 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:50:11.349 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '2010 SunChaser boat for sale, model of [194, 204, 769, 704] the boat is 820 & [470, 795, 610, 890] Image # 1 of 40') 2023-11-03 17:50:54.116 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:50:54.116 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[575, 141, 785, 808] and track its trajectory.Your response should highlight trajectories using the established Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a arabian camelFrame1:[575, 141, 785, 808];Frame2:[366, 058, 824, 997];Frame3:[000, 001, 886, 998];Frame4:[522, 126, 999, 997];Frame5:[577, 298, 753, 997].') 2023-11-03 17:51:56.682 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:51:56.683 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Pumpkin:[851, 466, 900, 550],[377, 471, 388, 492],[053, 490, 064, 516],[057, 486, 077, 517],[039, 489, 055, 517],[014, 490, 033, 516],[004, 487, 018, 515],[000, 495, 010, 516];Desk:[704, 526, 903, 627],[671, 518, 710, 572],[487, 519, 598, 603],[241, 510, 383, 551],[085, 516, 242, 556],[000, 514, 086, 552];Vase:[828, 521, 853, 551],[596, 531, 608, 538],[591, 510, 596, 534],[463, 502, 473, 510],[460, 530, 470, 538],[415, 502, 426, 513],[032, 504, 037, 516],[711, 924, 773, 967],[651, 438, 660, 448],[665, 424, 669, 433],[648, 422, 656, 430],[384, 525, 391, 530],[498, 503, 502, 510],[548, 510, 570, 535],[570, 510, 590, 536],[636, 488, 649, 501],[650, 489, 660, 501];Stool:[964, 510, 987, 525];Potted Plant:[599, 484, 622, 510],[651, 379, 669, 404];Blackboard:[532, 840, 593, 937],[159, 703, 179, 763].') 2023-11-03 17:53:27.858 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:53:27.858 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Desk:[089, 555, 127, 701];Stool:[749, 923, 830, 1000];Flower:[460, 254, 518, 307];Fan:[511, 025, 592, 079];Frame:[672, 156, 746, 278];Car:[089, 465, 132, 530];Person:[630, 254, 921, 998],[552, 130, 803, 502],[329, 123, 504, 516],[457, 322, 675, 1000],[299, 341, 497, 1000],[102, 295, 328, 1000];Necklace:[538, 482, 589, 587];Tie:[702, 498, 738, 660],[619, 328, 652, 498].') 2023-11-03 17:54:36.731 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:54:36.731 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track truckFrame1:[849, 680, 1000, 1000],truckFrame1:[405, 430, 494, 610],personFrame1:[173, 526, 233, 733] in this video clip.Please strictly follow the format categoryFrame t:[xmin,ymin,xmax,ymax] to answer the question.', 'truckFrame1[849, 680, 1000, 1000];Frame2:No object,truckFrame1:[405, 430, 494, 610];Frame2:[429, 409, 540, 620],personFrame1:[173, 526, 233, 733];Frame2:[000, 511, 086, 873].') 2023-11-03 17:54:58.070 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:54:58.071 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect man,woman,human face and clothing in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'man:[550, 018, 704, 225],[000, 169, 137, 482],[000, 343, 271, 999],[002, 240, 246, 603],[034, 052, 204, 299],[141, 057, 274, 270],[152, 440, 596, 999],[205, 169, 488, 609],[220, 082, 335, 308],[584, 140, 792, 596],[653, 000, 798, 191],[672, 076, 747, 215],[686, 381, 936, 962],[725, 130, 936, 528],[772, 000, 893, 195],[839, 035, 991, 257],[893, 147, 999, 473];woman:[304, 024, 436, 300],[401, 195, 676, 616],[418, 000, 522, 279],[706, 528, 999, 999];human face:[102, 081, 150, 177],[166, 290, 237, 437],[183, 393, 223, 520],[240, 121, 274, 228],[338, 217, 406, 343],[484, 000, 518, 085],[487, 258, 564, 343],[624, 041, 670, 123],[685, 000, 725, 067],[698, 240, 778, 314],[789, 472, 839, 575],[814, 200, 897, 298],[844, 060, 864, 125];clothing:[000, 251, 150, 457],[000, 334, 150, 598],[000, 435, 257, 976],[038, 088, 144, 284],[140, 110, 290, 298],[158, 536, 602, 999],[216, 266, 475, 649],[234, 191, 328, 288],[316, 062, 423, 248],[412, 056, 502, 277],[414, 270, 644, 615],[514, 132, 623, 298],[556, 054, 680, 255],[579, 203, 755, 589],[656, 002, 792, 165],[676, 138, 737, 439],[686, 526, 999, 978],[720, 222, 959, 516],[751, 867, 986, 999],[752, 015, 887, 177],[860, 106, 992, 217],[913, 180, 999, 455].') 2023-11-03 17:55:21.145 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:55:21.145 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'woman:[091, 291, 385, 860],[473, 051, 779, 822];bowl:[552, 920, 760, 980],[709, 832, 919, 935],[951, 866, 997, 944];clothing:[114, 295, 355, 857],[457, 170, 779, 777];mixing bowl:[540, 921, 757, 999],[710, 826, 925, 938],[946, 864, 998, 935];girl:[098, 258, 395, 921];tree:[000, 000, 998, 530];human face:[173, 286, 250, 369],[567, 218, 632, 291].') 2023-11-03 17:55:45.814 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:55:45.814 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Storage box:[891, 236, 998, 431];Person:[303, 085, 909, 998];Glasses:[518, 218, 585, 266];Refrigerator:[425, 008, 944, 999].') 2023-11-03 17:56:03.643 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:56:03.643 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'street light:[023, 260, 059, 655],[200, 525, 237, 656],[885, 483, 944, 711];tree:[000, 351, 440, 592],[699, 492, 999, 611].') 2023-11-03 17:56:29.256 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:56:29.256 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'two young prairie dogs [344, 339, 498, 685] [417, 405, 647, 720] sitting at the entrance to a burrow in the prairie [006, 005, 995, 998]') 2023-11-03 17:57:10.751 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:57:10.752 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[007, 045, 991, 996] Flavorful pork and vegetables cooked with garlic and ginger, then wrapped and pan fried in a dumpling wrapper. These Garlic Ginger Pork Potstickers are a super tasty dinner for any night!') 2023-11-03 17:57:17.306 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:57:17.307 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Person:[727, 210, 773, 348];Sports Car:[082, 378, 923, 838];Car:[774, 222, 853, 309],[001, 255, 084, 363],[018, 239, 158, 354],[108, 240, 214, 348],[213, 217, 338, 331],[343, 238, 438, 316];SUV:[110, 212, 280, 342],[257, 214, 387, 319].') 2023-11-03 17:57:26.114 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:57:26.115 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect Storage box and Lamp in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Storage box:[151, 600, 272, 683],[040, 663, 153, 717],[352, 595, 382, 613];Lamp:[411, 048, 583, 289],[449, 295, 540, 434],[464, 428, 520, 467],[600, 581, 640, 623],[331, 563, 361, 580],[482, 000, 526, 035],[483, 289, 508, 317].') 2023-11-03 17:57:41.912 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:57:41.913 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the man in white seated on the left-side?When detailing trajectories in your response, adhere to the TrackiFrame t:[xmin,ymin,xmax,ymax] format.', 'There is one man in white seated on the left-side.Track1frame:1:[248, 297, 364, 500];frame:2:[254, 300, 373, 504];frame:3:[254, 297, 371, 506].') 2023-11-03 17:58:20.664 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:58:20.664 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[475, 277, 612, 784] and track its trajectory.For the trajectories included in the answer, please use the format Frame t:[xmin,ymin,xmax,ymax].', 'This is a personFrame1:[475, 277, 612, 784];Frame2:[440, 278, 554, 783];Frame3:[371, 268, 476, 789];Frame4:[331, 260, 440, 805];Frame5:[332, 204, 448, 831].') 2023-11-03 17:58:38.869 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:58:38.870 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'tower:[039, 095, 458, 701],[010, 455, 077, 711],[276, 252, 490, 725],[539, 479, 620, 713],[565, 410, 942, 730];skyscraper:[000, 442, 090, 657],[286, 245, 497, 685];tree:[000, 561, 999, 999].') 2023-11-03 17:58:55.270 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:58:55.270 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Flower:[450, 365, 931, 999];Tie:[332, 484, 410, 729];Person:[043, 037, 507, 998],[135, 707, 521, 999],[500, 374, 793, 1000].') 2023-11-03 17:59:00.454 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:59:00.455 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'man:[160, 232, 254, 444],[248, 371, 334, 627],[431, 509, 497, 629],[496, 900, 579, 998],[671, 383, 784, 651],[773, 299, 907, 600],[875, 533, 999, 988];human face:[459, 533, 497, 601].') 2023-11-03 17:59:18.362 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 17:59:18.362 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'woman:[013, 172, 146, 639],[163, 248, 370, 659],[337, 217, 554, 605],[447, 191, 580, 574];clothing:[596, 125, 775, 652],[000, 195, 686, 713],[013, 771, 108, 905],[730, 219, 854, 592].') 2023-11-03 17:59:38.816 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:59:38.816 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Heather Knight [157, 048, 796, 995] shows her frustration after being dismissed for 60 in the third ODI.') 2023-11-03 17:59:53.043 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 17:59:53.043 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Chair:[443, 553, 656, 1000],[447, 478, 608, 904],[057, 412, 292, 718];Lamp:[356, 381, 428, 566];Flower:[474, 358, 579, 471];Vase:[523, 484, 550, 555];Picture:[663, 001, 747, 313],[282, 117, 351, 301],[190, 121, 259, 304];Carpet:[051, 748, 274, 968];Desk:[019, 717, 133, 998],[327, 578, 359, 754];Person:[208, 176, 235, 234].') 2023-11-03 18:00:11.651 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:00:11.652 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'These No-Bake Peppermint Chocolate Bites [143, 117, 961, 733] are simple, healthy and ready in minutes. This gluten-free recipe would make a great addition to your holiday cookie tray!') 2023-11-03 18:00:17.503 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:00:17.504 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'clothing:[223, 261, 849, 791];man:[180, 086, 887, 963];human face:[432, 158, 591, 298].') 2023-11-03 18:00:30.052 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:00:30.052 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'sunglasses:[217, 291, 370, 366];goggles:[215, 289, 367, 374];woman:[127, 183, 978, 999].') 2023-11-03 18:01:06.712 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:01:06.712 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, can you tell me what thisframe:2:[600, 379, 686, 661];frame:3:[465, 268, 643, 658] is?Tailor your answer briefly, using the image frames and trajectory coordinates given.', 'This is a/an man following the girl run towards us then squat down and look at the cloth') 2023-11-03 18:01:30.482 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:01:30.488 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Person:[884, 783, 913, 952],[845, 800, 889, 955],[666, 686, 768, 975],[741, 697, 794, 860].') 2023-11-03 18:01:45.948 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:01:45.948 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[926, 595, 999, 863],carFrame1:[610, 442, 666, 543],carFrame1:[584, 437, 636, 519],carFrame1:[327, 421, 423, 496],carFrame1:[540, 420, 609, 478],carFrame1:[396, 419, 452, 477],carFrame1:[139, 426, 319, 605] in this video clip.The prescribed format for your answer is classFrame t:[xmin,ymin,xmax,ymax]. Please follow it closely.', 'carFrame1[926, 595, 999, 863];Frame2:No object,carFrame1:[610, 442, 666, 543];Frame2:[632, 435, 726, 574],carFrame1:[584, 437, 636, 519];Frame2:[596, 438, 665, 534],carFrame1:[327, 421, 423, 496];Frame2:[298, 412, 405, 498],carFrame1:[540, 420, 609, 478];Frame2:[538, 416, 617, 479],carFrame1:[396, 419, 452, 477];Frame2:[383, 412, 442, 478],carFrame1:[139, 426, 319, 605];Frame2:[000, 408, 235, 692].') 2023-11-03 18:01:59.616 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:01:59.616 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Play guess that wine with matthew rhys video [354, 005, 940, 998]') 2023-11-03 18:02:04.421 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:02:04.422 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Picture:[934, 080, 999, 324];Radiator:[842, 576, 1000, 830];Person:[301, 519, 922, 999],[002, 289, 360, 1000],[342, 306, 621, 768],[130, 346, 373, 971];Necklace:[447, 688, 606, 785].') 2023-11-03 18:02:06.422 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:02:06.422 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[000, 558, 097, 705],carFrame1:[219, 388, 557, 960],truckFrame1:[482, 434, 561, 517],busFrame1:[565, 001, 999, 972],carFrame1:[496, 476, 556, 648] in this video clip.When answering, maintain the format classFrame t:[xmin,ymin,xmax,ymax] without deviation.', 'carFrame1:[000, 558, 097, 705];Frame2:[000, 558, 097, 708],carFrame1:[219, 388, 557, 960];Frame2:[219, 388, 557, 960],truckFrame1:[482, 434, 561, 517];Frame2:[482, 434, 559, 512],busFrame1:[565, 001, 999, 972];Frame2:[559, 001, 999, 917],carFrame1:[496, 476, 556, 648];Frame2:[496, 476, 556, 646].') 2023-11-03 18:02:18.122 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:02:18.122 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'person:[637, 150, 802, 955],[183, 101, 345, 959],[359, 202, 463, 877];car:[809, 309, 986, 756],[000, 334, 035, 409],[000, 334, 203, 625],[066, 315, 205, 356],[308, 280, 660, 683],[790, 354, 846, 573];wheel:[160, 504, 221, 630],[568, 560, 611, 686],[804, 536, 817, 644],[825, 593, 860, 726];footwear:[212, 815, 293, 890],[240, 851, 293, 949],[367, 780, 404, 845],[620, 949, 683, 999],[703, 906, 773, 964];jeans:[365, 508, 463, 817],[647, 623, 776, 917].') 2023-11-03 18:02:35.046 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:02:35.046 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[322, 232, 627, 508] and track its trajectory.Ensure that each trajectory within your response is articulated using the Frame t:[xmin,ymin,xmax,ymax] specification.', 'This is a space shuttleFrame1:[322, 232, 627, 508];Frame2:[285, 233, 615, 515];Frame3:[285, 244, 628, 526];Frame4:[295, 260, 654, 548].') 2023-11-03 18:02:51.201 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:02:51.202 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Shango has manifested in the man wearing [389, 415, 536, 556] the yellow shirt and he is about to give his blessings and guidance. Photograph by Horace Ové') 2023-11-03 18:04:28.035 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:04:28.035 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'human body:[205, 158, 480, 1000],[394, 180, 611, 998],[660, 238, 764, 578],[864, 197, 958, 628];human hair:[491, 199, 572, 319],[301, 139, 419, 273];human head:[298, 140, 436, 330],[489, 194, 577, 358];man:[189, 106, 491, 973],[394, 196, 618, 998],[660, 241, 749, 484],[862, 191, 956, 642];clothing:[204, 185, 456, 888],[387, 279, 591, 928];human hand:[423, 478, 502, 573],[458, 594, 502, 661];human leg:[210, 770, 404, 988],[430, 597, 511, 998],[505, 566, 592, 897];human face:[360, 188, 419, 363],[501, 266, 568, 361];human arm:[291, 357, 503, 677],[371, 496, 448, 551],[398, 316, 504, 667],[576, 418, 617, 508].') 2023-11-03 18:04:41.486 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:04:41.486 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'human head:[853, 445, 960, 615],[200, 350, 326, 625],[530, 130, 672, 366],[622, 328, 745, 556];human body:[000, 346, 456, 998],[387, 126, 792, 838],[773, 426, 998, 755];human hair:[183, 413, 332, 530],[533, 211, 645, 285],[625, 330, 741, 466],[855, 438, 956, 585];man:[007, 343, 453, 998],[390, 123, 785, 841],[778, 438, 998, 736];clothing:[025, 535, 446, 998],[436, 281, 796, 796],[487, 486, 935, 998],[892, 550, 995, 755];human face:[206, 443, 296, 611],[560, 216, 640, 366],[636, 373, 732, 546],[872, 498, 933, 578].') 2023-11-03 18:05:56.694 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:05:56.695 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[189, 325, 454, 758] and track its trajectory.All trajectories in your reply should conform to the Frame t:[xmin,ymin,xmax,ymax] pattern.', 'This is a snow leopardFrame1:[189, 325, 454, 758];Frame2:[243, 179, 420, 591];Frame3:[211, 239, 489, 545];Frame4:[285, 389, 582, 727].') 2023-11-03 18:06:02.651 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:06:02.651 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Ballon,Stroller,Handbag,Soccer,Baseball Bat and Surveillance Camera in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Ballon:[320, 205, 523, 466],[500, 198, 710, 433],[143, 260, 238, 363],[139, 107, 180, 162],[227, 402, 276, 475];Stroller:[643, 699, 722, 906];Satchel:[118, 704, 164, 809].') 2023-11-03 18:06:09.561 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:06:09.562 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Satchel:[512, 120, 818, 476],[681, 122, 999, 487],[388, 343, 609, 648],[271, 169, 414, 473],[121, 235, 281, 507],[014, 243, 153, 467],[001, 506, 176, 999],[055, 545, 201, 999],[170, 512, 331, 833],[284, 561, 438, 980],[359, 612, 542, 999],[483, 600, 627, 835],[567, 606, 704, 1000],[598, 511, 774, 843],[761, 542, 959, 998],[891, 543, 1000, 849],[586, 861, 917, 1000],[417, 855, 563, 973],[728, 001, 925, 087],[069, 001, 268, 088];Cabinet:[001, 087, 999, 999];Briefcase:[099, 908, 246, 996].') 2023-11-03 18:06:23.796 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:06:23.796 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Desk:[000, 732, 395, 1000],[733, 676, 999, 1000],[832, 591, 999, 683],[355, 627, 638, 937],[554, 582, 722, 716];Napkin:[046, 636, 112, 800],[013, 609, 065, 745],[523, 564, 550, 641],[455, 556, 480, 627],[598, 529, 620, 577],[652, 531, 672, 586];Flower:[690, 593, 924, 999],[823, 503, 890, 598],[777, 523, 878, 686],[000, 496, 036, 621];Chair:[532, 695, 787, 999],[299, 654, 500, 1000],[248, 661, 392, 1000],[704, 598, 736, 738],[822, 614, 965, 679],[775, 576, 808, 639],[511, 582, 617, 641],[548, 582, 634, 676];Candle:[503, 624, 520, 652],[808, 692, 825, 728];Lamp:[710, 350, 740, 372];Picture:[128, 419, 165, 465];Plate:[247, 753, 325, 791],[978, 763, 999, 792],[967, 708, 1000, 734];Fork:[192, 772, 246, 801],[403, 643, 438, 664];Wine Glass:[055, 725, 116, 866],[024, 686, 067, 786],[521, 616, 546, 667],[453, 603, 478, 646],[872, 654, 910, 730];Cup:[804, 676, 838, 727],[654, 571, 669, 593];Person:[476, 445, 540, 552].') 2023-11-03 18:06:50.093 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:06:50.093 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Truck and Sneakers in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Truck:[001, 233, 153, 357];Sneakers:[266, 919, 347, 971].') 2023-11-03 18:06:50.967 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:06:50.967 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'sculpture:[215, 332, 608, 970];person:[202, 315, 588, 928].') 2023-11-03 18:07:11.283 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:07:11.283 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Closeup view of a white plate containing Homemade Healthy Crepes [152, 232, 766, 718] folded over and topped with apple slices and cinnamon.') 2023-11-03 18:07:13.168 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:07:13.169 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the drinking water without altering location, side facing us.The TrackiFrame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'There are 2 drinking water without altering location, side facing us.Track1frame:1:[143, 373, 222, 498];frame:2:[146, 373, 223, 497];frame:3:[146, 375, 222, 498],Track2frame:1:[386, 449, 482, 617];frame:2:[399, 445, 482, 613];frame:3:[396, 452, 479, 615].') 2023-11-03 18:07:29.706 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:07:29.706 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you point out the trajectory of the man wearing plaid shirt and tan pants?Stick to the TrackiFrame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'There is one man wearing plaid shirt and tan pants.Track1frame1:[752, 073, 1000, 294];frame2:[753, 067, 1000, 283];frame3:[752, 064, 1000, 277];frame4:[751, 062, 1000, 274].') 2023-11-03 18:08:06.108 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:08:06.108 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Color ceramic production [201, 074, 869, 991] is pulled from the shelves 】 【 the south China sea guanyin Buddha carp guanyin bodhisattva') 2023-11-03 18:09:48.181 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:09:48.182 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the the initial aggressor among the giraffes.?When you incorporate trajectories into your answer, adhere to the TrackiFrame t:[xmin,ymin,xmax,ymax] format.,Present the trajectories in your response using the designated TrackiFrame t:[xmin,ymin,xmax,ymax] layout.', 'There is one the initial aggressor among the giraffes..Track1frame:1:[262, 296, 506, 885];frame:2:[262, 296, 506, 885];frame:3:[274, 392, 590, 902].') 2023-11-03 18:09:51.683 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:09:51.683 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'clothing:[376, 414, 685, 990],[000, 478, 999, 999];woman:[653, 439, 807, 999],[851, 488, 923, 714],[866, 536, 999, 999];man:[000, 345, 219, 999],[111, 327, 244, 999],[207, 396, 313, 999],[290, 406, 411, 999],[376, 252, 676, 999];human face:[095, 368, 168, 658],[189, 376, 238, 506],[343, 433, 384, 530],[641, 420, 665, 485],[786, 468, 823, 533].') 2023-11-03 18:09:55.284 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:09:55.284 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me what is thisFrame1:[586, 545, 724, 745] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a lionFrame1:[586, 545, 724, 745];Frame2:[602, 522, 707, 756];Frame3:[562, 500, 643, 800].') 2023-11-03 18:10:08.093 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:10:08.093 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[254, 317, 489, 678] and track its trajectory.Incorporate the Frame t:[xmin,ymin,xmax,ymax] format when presenting trajectories in your response.', 'This is a personFrame1:[254, 317, 489, 678];Frame2:[323, 405, 613, 784];Frame3:[321, 367, 629, 775];Frame4:[250, 393, 573, 823].') 2023-11-03 18:10:51.086 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:10:51.086 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[777, 543, 899, 706],carFrame1:[275, 639, 343, 714],carFrame1:[393, 578, 497, 738] in this video clip.To respond correctly, utilize the specified classFrame t:[xmin,ymin,xmax,ymax] format.', 'carFrame1:[777, 543, 899, 706];Frame2:[844, 505, 999, 751],carFrame1:[275, 639, 343, 714];Frame2:[267, 643, 338, 719],carFrame1:[393, 578, 497, 738];Frame2:[390, 575, 503, 747].') 2023-11-03 18:11:04.440 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:11:04.440 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A man [539, 051, 963, 863] and a toddler [055, 094, 512, 906] playing with blocks [143, 738, 997, 998].') 2023-11-03 18:11:13.965 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:11:13.965 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[486, 080, 822, 368] and track its trajectory.To maintain consistency, ensure that trajectories in your answer match the Frame t:[xmin,ymin,xmax,ymax] setup.', 'This is a doveFrame1:[486, 080, 822, 368];Frame2:[485, 089, 822, 373];Frame3:[485, 093, 808, 392];Frame4:[486, 100, 802, 392].') 2023-11-03 18:11:14.697 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:11:14.697 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Tennis:[471, 476, 534, 567];Person:[528, 081, 925, 1000],[525, 002, 702, 802];Leather Shoes:[546, 760, 639, 799];Sneakers:[608, 946, 722, 999],[853, 911, 925, 1000].') 2023-11-03 18:11:28.105 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:11:28.106 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Prada is going to have a show for Christmas, in their window display, with cute polar bear dolls [471, 357, 628, 746] [173, 364, 294, 732] dressed in colorful dresses [917, 751, 998, 996] [177, 496, 290, 696] [274, 488, 405, 696] [385, 486, 507, 693].') 2023-11-03 18:12:02.377 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:12:02.378 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[540, 072, 989, 994] A character walks through a hallway, as dozens of [292, 597, 418, 730] [511, 355, 647, 462] [559, 266, 797, 431] doll arms reach out in a screenshot from Little Nightmares 2') 2023-11-03 18:12:07.704 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:12:07.705 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the the swimming fish in the water.?Your trajectory details should strictly align with the TrackiFrame t:[xmin,ymin,xmax,ymax] format in your submission.', 'There are 6 the swimming fish in the water..Track1frame:1:[682, 282, 920, 875];frame:2:[664, 294, 839, 865];frame:3:[658, 329, 842, 858],Track2frame:1:[000, 048, 260, 481];frame:2:[000, 100, 232, 473];frame:3:[000, 142, 238, 623],Track3frame:1:[169, 399, 513, 966];frame:2:[165, 299, 504, 880];frame:3:[244, 390, 507, 916],Track4frame:1:[511, 655, 610, 986];frame:2:[517, 677, 601, 979];frame:3:[515, 731, 613, 980],Track5frame:1:[254, 138, 514, 590];frame:2:[364, 137, 504, 662];frame:3:[322, 151, 502, 548],Track6frame:1:[038, 163, 354, 741];frame:2:[074, 178, 341, 809];frame:3:[058, 204, 333, 701].') 2023-11-03 18:12:25.057 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:12:25.058 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'car:[043, 212, 825, 792],[000, 189, 100, 490],[243, 193, 505, 363];woman:[002, 082, 051, 198],[054, 084, 163, 421],[253, 065, 320, 358],[713, 143, 753, 363];clothing:[000, 109, 043, 204],[001, 096, 536, 413],[005, 120, 072, 195],[060, 119, 150, 277],[062, 085, 107, 215],[078, 102, 115, 172],[116, 277, 173, 419],[148, 100, 246, 380],[250, 104, 321, 345],[313, 124, 605, 235],[341, 117, 383, 205],[424, 120, 486, 219],[533, 130, 591, 223],[595, 172, 825, 289],[604, 148, 714, 222],[704, 175, 728, 240],[721, 178, 754, 269],[775, 174, 820, 262],[796, 122, 927, 572],[908, 110, 995, 580].') 2023-11-03 18:12:30.031 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:12:30.031 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[405, 480, 468, 595],[158, 464, 225, 789],[355, 483, 398, 525];Satchel:[223, 569, 273, 670],[193, 582, 220, 675],[159, 573, 196, 680];Leather Shoes:[189, 761, 225, 780],[160, 766, 190, 788];Sneakers:[222, 789, 249, 866],[249, 835, 270, 875];Moniter:[375, 524, 431, 584];Cup:[599, 613, 611, 634],[586, 614, 598, 633],[569, 608, 588, 634],[655, 405, 666, 436],[645, 405, 656, 436],[632, 417, 646, 437];Bottle:[609, 565, 620, 632],[657, 461, 668, 492],[646, 461, 658, 492],[619, 393, 634, 437],[606, 395, 620, 438],[173, 381, 178, 410];Flower:[316, 206, 421, 269],[573, 254, 652, 333];Desk:[757, 728, 878, 951],[640, 705, 760, 925],[485, 714, 605, 942];Cabinet:[547, 379, 670, 499],[373, 374, 390, 477];Carpet:[154, 832, 271, 917].') 2023-11-03 18:13:01.551 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:13:01.551 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Gun:[111, 442, 766, 913],[001, 439, 082, 617];Person:[419, 180, 1000, 998],[001, 056, 566, 996],[002, 134, 105, 253],[559, 119, 680, 229],[656, 149, 776, 303],[735, 157, 846, 314],[809, 136, 879, 253];Hat:[416, 180, 654, 367];Belt:[724, 672, 1000, 718].') 2023-11-03 18:13:04.079 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:13:04.079 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[089, 719, 192, 998],[075, 482, 155, 687],[094, 722, 135, 997],[327, 385, 380, 568],[242, 373, 289, 553],[047, 352, 094, 534],[281, 278, 335, 409],[454, 273, 503, 417],[799, 248, 835, 389],[974, 183, 998, 265],[879, 193, 916, 242],[620, 261, 663, 413],[152, 251, 182, 296],[183, 246, 213, 294];Hat:[121, 718, 165, 770],[093, 483, 122, 519];Pickup Truck:[001, 095, 076, 169],[165, 086, 252, 156],[696, 185, 747, 247],[464, 176, 499, 229];Car:[193, 213, 284, 292],[232, 130, 283, 173],[400, 187, 470, 279];SUV:[177, 127, 229, 191];Van:[298, 098, 360, 150],[117, 179, 194, 242];Cow:[946, 499, 1000, 626],[862, 397, 914, 534],[608, 463, 738, 588],[444, 490, 618, 618],[028, 366, 175, 468];Horse:[409, 303, 589, 472],[220, 416, 439, 620],[249, 321, 394, 456],[169, 410, 333, 590],[001, 389, 161, 605],[766, 282, 845, 435].') 2023-11-03 18:13:07.666 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:13:07.666 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Chair:[026, 002, 097, 056],[093, 002, 204, 063],[201, 006, 278, 070],[337, 002, 407, 041],[460, 011, 518, 110],[511, 009, 612, 059],[614, 012, 714, 059],[726, 071, 803, 119],[147, 068, 260, 106],[034, 053, 145, 103],[128, 107, 232, 178],[407, 106, 455, 170],[460, 110, 562, 178],[567, 109, 672, 187],[665, 115, 775, 170],[775, 114, 881, 170],[878, 112, 985, 166],[746, 169, 858, 207],[856, 162, 885, 205],[025, 095, 130, 154];Baseball Bat:[408, 037, 498, 221];Person:[001, 002, 068, 169],[199, 027, 490, 199],[259, 002, 352, 125],[380, 002, 499, 112],[513, 002, 617, 115],[604, 002, 740, 118],[877, 061, 973, 223],[973, 123, 1000, 203],[089, 130, 605, 971];Helmet:[240, 131, 349, 282];Gloves:[391, 216, 428, 270],[347, 248, 391, 288];Belt:[267, 510, 370, 548];Sneakers:[089, 853, 144, 970],[501, 880, 604, 967];Hat:[906, 061, 947, 104].') 2023-11-03 18:13:08.415 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:13:08.415 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[584, 273, 833, 995] [344, 139, 723, 994] [002, 410, 163, 997] [182, 420, 368, 997] Myanmar protesters hold [126, 673, 154, 760] [202, 514, 226, 608] [419, 671, 462, 761] candles and shout slogans as they stage a protest against recent electricity price increases in Yangon on November 7, 2013. (Ye Aung Thu/AFP/Getty Images') 2023-11-03 18:13:19.250 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:13:19.250 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Minack Theatre Cornwall Penzance As You Like It 2') 2023-11-03 18:13:37.810 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:13:37.811 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect clothing,girl,human face and woman in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'clothing:[073, 320, 165, 576],[303, 452, 456, 699],[393, 482, 536, 681],[436, 513, 626, 857],[491, 501, 738, 910],[496, 399, 565, 516],[623, 434, 653, 513],[640, 580, 846, 926],[689, 456, 770, 547],[701, 597, 999, 999],[770, 400, 818, 485],[796, 471, 895, 576],[875, 437, 933, 511];girl:[060, 213, 181, 780],[440, 450, 627, 871],[548, 382, 620, 510],[592, 475, 843, 999],[685, 373, 770, 549],[750, 327, 804, 410],[796, 395, 889, 595],[822, 323, 867, 413],[860, 355, 916, 458],[871, 375, 946, 526],[900, 411, 999, 623];human face:[110, 229, 144, 301],[400, 313, 982, 609];woman:[036, 197, 195, 660],[282, 400, 534, 790],[310, 385, 460, 757],[436, 449, 632, 870],[494, 423, 737, 952],[592, 471, 848, 999],[691, 370, 777, 534],[754, 322, 806, 412],[803, 388, 895, 585],[822, 319, 873, 410],[870, 369, 946, 532],[871, 350, 916, 448],[896, 412, 999, 610].') 2023-11-03 18:14:15.484 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:14:15.485 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Car:[000, 000, 329, 174],[591, 000, 723, 082],[802, 000, 931, 049];Van:[364, 000, 589, 103];Stroller:[890, 006, 955, 073];Pen:[573, 671, 714, 749],[754, 545, 791, 617],[910, 577, 927, 643],[926, 579, 933, 646],[932, 595, 944, 645],[941, 574, 957, 629],[943, 579, 963, 643],[955, 589, 971, 655],[965, 593, 978, 655],[965, 608, 987, 655],[974, 626, 999, 655],[932, 635, 941, 649];Trash bin Can:[632, 043, 663, 085];Desk:[223, 517, 999, 1000];Chair:[857, 016, 882, 058];Cup:[896, 594, 1000, 725];Other Shoes:[000, 203, 033, 221],[093, 217, 106, 232],[069, 221, 083, 235],[254, 192, 271, 206],[288, 192, 302, 202],[344, 249, 363, 262],[578, 085, 594, 094],[763, 075, 778, 085],[758, 067, 769, 078],[826, 067, 843, 079],[841, 070, 854, 078],[989, 098, 997, 107];Satchel:[014, 043, 036, 102],[719, 004, 738, 025];Belt:[255, 046, 273, 059];Sandals:[413, 242, 442, 264],[395, 236, 427, 253];Glasses:[813, 257, 863, 390];Sneakers:[726, 079, 741, 089];High Heels:[909, 071, 925, 083];Cell Phone:[741, 475, 802, 555],[355, 045, 373, 061];Person:[370, 299, 621, 868],[449, 078, 952, 689],[000, 045, 033, 221],[019, 000, 106, 234],[205, 000, 304, 205],[325, 058, 379, 261],[358, 000, 442, 263],[498, 000, 574, 153],[487, 014, 511, 111],[576, 000, 596, 094];Bracelet:[729, 464, 761, 506];Key:[809, 530, 842, 567];Watch:[368, 027, 375, 040].') 2023-11-03 18:14:28.425 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:14:28.425 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Hat in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Hat:[396, 013, 884, 439].') 2023-11-03 18:14:29.243 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:14:29.243 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'ArcGIS supports a range [017, 046, 283, 547] [418, 164, 555, 800] [144, 307, 365, 731] [753, 081, 886, 648] [854, 196, 968, 729] [528, 026, 668, 665] of mobile devices') 2023-11-03 18:14:40.472 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:14:40.473 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'architecture club students [485, 473, 558, 878] [862, 460, 962, 865] [538, 477, 651, 888] [338, 477, 438, 892] [134, 478, 249, 909] [413, 477, 512, 891] [264, 522, 354, 902] [779, 497, 873, 873] [647, 475, 766, 884] [001, 448, 146, 912] on a field trip') 2023-11-03 18:14:51.016 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:14:51.016 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[543, 691, 604, 893] and track its trajectory.If you're including trajectory details in your reply, the Frame t:[xmin,ymin,xmax,ymax] format is imperative.", 'This is a personFrame1:[543, 691, 604, 893];Frame2:[547, 717, 617, 907];Frame3:[552, 706, 610, 910];Frame4:[539, 703, 601, 913].') 2023-11-03 18:14:59.439 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 18:14:59.439 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Cabinet,Pillow,Potted Plant,Picture and Moniter/TV in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Cabinet:[713, 656, 866, 842];Pillow:[356, 578, 398, 658],[395, 582, 469, 656];Potted Plant:[947, 221, 1000, 550];Frame:[912, 367, 997, 536],[043, 394, 132, 559];TV:[761, 530, 855, 713].') 2023-11-03 18:15:10.880 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:15:10.881 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[263, 486, 320, 678],personFrame1:[363, 483, 427, 680],personFrame1:[674, 473, 754, 686],personFrame1:[397, 533, 456, 735],personFrame1:[602, 495, 662, 668],personFrame1:[747, 444, 802, 658],personFrame1:[445, 462, 485, 656],personFrame1:[641, 481, 700, 659],personFrame1:[525, 453, 568, 660],personFrame1:[800, 458, 864, 669] in this video clip.Do not deviate from the established classFrame t:[xmin,ymin,xmax,ymax] format in your answer.', 'personFrame1:[263, 486, 320, 678];Frame2:[281, 486, 334, 677],personFrame1:[363, 483, 427, 680];Frame2:[381, 489, 440, 677],personFrame1:[674, 473, 754, 686];Frame2:[670, 461, 727, 694],personFrame1:[397, 533, 456, 735];Frame2:[382, 520, 449, 738],personFrame1[602, 495, 662, 668];Frame2:No object,personFrame1:[747, 444, 802, 658];Frame2:[743, 441, 790, 657],personFrame1:[445, 462, 485, 656];Frame2:[436, 454, 486, 658],personFrame1[641, 481, 700, 659];Frame2:No object,personFrame1:[525, 453, 568, 660];Frame2:[519, 473, 570, 654],personFrame1:[800, 458, 864, 669];Frame2:[788, 448, 847, 673].') 2023-11-03 18:15:39.012 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:15:39.013 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'a teamster [129, 486, 256, 902] uses a horse cart [129, 714, 399, 996] to transport a crate past the brick warehouse [130, 005, 839, 884] in otaru. - horsedrawn stock videos & royalty-free footage') 2023-11-03 18:15:39.895 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:15:39.896 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'footwear:[566, 754, 627, 849],[234, 900, 273, 966];woman:[000, 351, 020, 432],[000, 464, 161, 999],[040, 363, 073, 426],[040, 416, 101, 518],[064, 369, 097, 448],[083, 445, 160, 631],[100, 402, 163, 495],[115, 488, 278, 987],[143, 354, 189, 452],[157, 407, 215, 516],[186, 366, 225, 443],[211, 372, 255, 469],[241, 444, 415, 907],[283, 369, 328, 464],[311, 408, 385, 656],[328, 385, 377, 455],[337, 356, 377, 432],[377, 412, 533, 753],[498, 383, 531, 450],[520, 412, 557, 496],[580, 403, 612, 496],[603, 408, 747, 730],[646, 145, 706, 298],[650, 396, 796, 707],[673, 110, 717, 288],[693, 370, 716, 439],[768, 376, 821, 501],[796, 378, 822, 459],[806, 372, 836, 435],[850, 404, 922, 596],[884, 390, 958, 569],[900, 369, 977, 564];man:[000, 376, 043, 461],[080, 391, 118, 496],[198, 430, 309, 830],[228, 338, 248, 401],[251, 387, 323, 496],[448, 393, 530, 519],[461, 421, 630, 840],[505, 295, 534, 370],[530, 395, 697, 785],[700, 391, 818, 693],[738, 391, 830, 648],[817, 393, 920, 608],[921, 374, 999, 555].') 2023-11-03 18:15:47.629 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:15:47.629 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[164, 512, 224, 686],personFrame1:[214, 563, 268, 755],personFrame1:[632, 463, 673, 630],personFrame1:[456, 441, 492, 623],personFrame1:[455, 481, 497, 694],personFrame1:[682, 445, 723, 588],personFrame1:[214, 415, 258, 562] in this video clip.The prescribed format for your answer is classFrame t:[xmin,ymin,xmax,ymax]. Please follow it closely.', 'personFrame1[164, 512, 224, 686];Frame2:No object,personFrame1:[214, 563, 268, 755];Frame2:[200, 563, 265, 755],personFrame1:[632, 463, 673, 630];Frame2:[628, 448, 676, 613],personFrame1:[456, 441, 492, 623];Frame2:[455, 443, 495, 625],personFrame1:[455, 481, 497, 694];Frame2:[453, 483, 496, 693],personFrame1:[682, 445, 723, 588];Frame2:[699, 441, 737, 590],personFrame1:[214, 415, 258, 562];Frame2:[211, 415, 257, 561].') 2023-11-03 18:16:30.231 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:16:30.231 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[772, 422, 903, 688],personFrame1:[574, 429, 671, 687],personFrame1:[028, 383, 175, 743] in this video clip.Use only the categoryFrame t:[xmin,ymin,xmax,ymax] format to structure your answer.', 'personFrame1:[772, 422, 903, 688];Frame2:[801, 420, 881, 688],personFrame1:[574, 429, 671, 687];Frame2:[607, 431, 683, 687],personFrame1:[028, 383, 175, 743];Frame2:[000, 387, 189, 744].') 2023-11-03 18:16:38.983 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:16:38.984 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'person:[021, 411, 132, 956],[000, 453, 062, 953],[145, 425, 303, 977],[270, 491, 345, 860],[339, 460, 456, 940],[452, 525, 540, 858],[461, 482, 488, 615],[539, 472, 636, 942],[645, 425, 707, 610],[670, 500, 786, 920],[688, 398, 731, 515],[751, 360, 792, 500],[753, 493, 843, 871],[765, 406, 800, 501],[816, 398, 870, 498],[828, 506, 886, 853],[845, 360, 875, 456],[863, 359, 921, 462],[891, 484, 985, 871],[906, 360, 958, 480],[948, 510, 996, 834],[961, 443, 992, 515];clothing:[131, 460, 273, 961],[000, 358, 999, 975].') 2023-11-03 18:16:41.661 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 18:16:41.661 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Bowl:[921, 193, 1000, 235];Desk:[000, 409, 999, 1000],[698, 205, 999, 480];Chair:[552, 112, 659, 289],[810, 079, 991, 212];Flower:[856, 000, 1000, 089];Vase:[940, 075, 999, 210];Cake:[214, 112, 875, 837];Car:[345, 140, 495, 222].') 2023-11-03 18:16:55.076 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:16:55.076 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'person:[021, 614, 079, 884],[084, 600, 150, 884];tree:[027, 026, 188, 665],[360, 000, 970, 726],[903, 229, 985, 499];building:[000, 000, 989, 699];window:[238, 468, 248, 545];car:[293, 643, 401, 737].') 2023-11-03 18:18:30.016 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:18:30.016 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, can you tell me what thisframe:1:[564, 278, 891, 849];frame:2:[422, 174, 781, 880];frame:3:[411, 223, 789, 880] is?Given the image frames and trajectory coordinates, please respond succinctly to the question.Considering the image frames and trajectory coordinates provided, offer a brief answer.', 'This is a/an two pandas are involved in a fight.') 2023-11-03 18:18:55.204 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:18:55.204 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Satchel:[227, 620, 387, 749],[608, 578, 837, 793];Person:[552, 343, 917, 999],[185, 231, 384, 674],[370, 214, 563, 719].') 2023-11-03 18:18:59.422 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:18:59.422 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[272, 069, 872, 1000],[653, 287, 973, 977];Speaker:[036, 492, 347, 739],[653, 865, 999, 999],[003, 739, 346, 1000];Guitar:[664, 460, 1000, 730].') 2023-11-03 18:19:44.563 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 18:19:44.564 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect Person in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[782, 441, 886, 716].') 2023-11-03 18:19:59.526 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:19:59.526 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'boy:[141, 537, 999, 990];human face:[316, 354, 557, 528],[357, 520, 680, 663].') 2023-11-03 18:20:28.959 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:20:28.959 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect human face and man in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'human face:[038, 278, 113, 434],[110, 207, 165, 320],[120, 542, 262, 778],[135, 289, 205, 393],[234, 342, 328, 555],[328, 293, 377, 388],[421, 038, 465, 106],[422, 368, 451, 443],[452, 255, 546, 442],[703, 334, 781, 443],[843, 328, 941, 515];man:[055, 264, 437, 870],[086, 160, 164, 352],[359, 191, 752, 870],[579, 245, 788, 666],[708, 282, 957, 772].') 2023-11-03 18:20:42.447 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:20:42.447 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Helmet:[583, 061, 670, 194],[367, 053, 460, 166];Gloves:[528, 389, 624, 491],[331, 400, 416, 512],[478, 252, 518, 299];Sneakers:[853, 753, 920, 871],[539, 706, 586, 835],[370, 746, 476, 919],[222, 729, 326, 915];Hockey Stick:[037, 378, 585, 626],[499, 276, 674, 684];Person:[224, 053, 534, 901],[509, 061, 922, 856].') 2023-11-03 18:20:45.101 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:20:45.101 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Boat:[002, 606, 101, 939],[103, 201, 846, 820],[468, 386, 972, 698],[890, 504, 999, 652].') 2023-11-03 18:21:03.095 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:21:03.096 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[415, 336, 689, 816] and track its trajectory.Ensure that each trajectory within your response is articulated using the Frame t:[xmin,ymin,xmax,ymax] specification.', 'This is a personFrame1:[415, 336, 689, 816];Frame2:[501, 314, 708, 775];Frame3:[607, 259, 804, 675];Frame4:[592, 247, 801, 626].') 2023-11-03 18:21:37.203 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:21:37.204 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'wheel:[922, 604, 986, 727],[223, 914, 355, 999];clothing:[503, 347, 731, 963],[303, 424, 543, 999];car:[738, 397, 996, 744],[110, 469, 188, 529],[251, 413, 534, 561],[675, 446, 810, 657];man:[312, 267, 727, 999],[501, 358, 549, 424].') 2023-11-03 18:21:50.177 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:21:50.178 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'clothing:[166, 211, 243, 574],[222, 225, 281, 333],[244, 262, 315, 620],[281, 281, 340, 647],[316, 302, 368, 509],[333, 280, 436, 672],[389, 337, 458, 719],[449, 323, 521, 693],[468, 349, 520, 739],[494, 318, 604, 701],[565, 329, 663, 760],[601, 367, 713, 785],[700, 300, 785, 756],[741, 322, 854, 730],[837, 311, 891, 709].') 2023-11-03 18:22:02.712 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 18:22:02.713 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'vintage finds piled in the back [004, 003, 994, 998] of a Honda') 2023-11-03 18:22:48.962 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:22:48.962 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'man:[570, 308, 736, 993];window:[048, 011, 307, 665];house:[000, 000, 999, 999],[088, 382, 161, 432].') 2023-11-03 18:22:50.655 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:22:50.656 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Picture:[001, 382, 045, 538],[736, 392, 779, 445];Mirror:[123, 394, 199, 520];Chair:[200, 574, 259, 659];Couch:[151, 624, 630, 960],[748, 567, 958, 849];Pillow:[251, 622, 317, 656],[504, 655, 574, 690],[760, 606, 872, 640],[788, 584, 889, 641];Clock:[634, 397, 672, 446];Lamp:[877, 436, 918, 497],[886, 493, 946, 739];Cabinet:[954, 002, 998, 1000];Person:[416, 460, 499, 571];Moniter:[348, 455, 508, 577];Side Table:[001, 715, 133, 865],[827, 723, 936, 909].') 2023-11-03 18:23:04.268 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:23:04.269 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Glasses:[744, 124, 807, 203];Backpack:[319, 162, 549, 381];Handbag:[000, 332, 228, 600],[063, 273, 124, 373];Sneakers:[218, 716, 252, 797];Marker:[316, 830, 386, 876],[471, 863, 557, 952],[502, 841, 590, 918],[497, 826, 601, 915],[536, 822, 616, 913],[546, 810, 621, 903],[516, 753, 613, 816],[595, 768, 640, 862],[569, 562, 607, 634];Pencil:[815, 527, 881, 565];Cup:[614, 472, 663, 588],[171, 104, 217, 193];Bottle:[819, 558, 902, 790],[913, 488, 973, 668];Flask:[870, 491, 931, 744];Desk:[094, 482, 999, 999];Person:[002, 327, 473, 993],[392, 178, 709, 681],[586, 001, 842, 553],[834, 285, 1000, 625],[752, 818, 999, 1000];Bracelet:[883, 879, 911, 951],[902, 884, 951, 965].') 2023-11-03 18:23:44.546 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:23:44.546 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'human body:[020, 335, 258, 985],[273, 257, 301, 359],[295, 272, 317, 348],[312, 270, 333, 367],[323, 305, 351, 400],[347, 314, 380, 417],[382, 307, 408, 398],[396, 210, 590, 944],[484, 303, 781, 999],[743, 279, 956, 999],[897, 336, 921, 464],[944, 359, 999, 600];woman:[021, 323, 263, 1000],[350, 317, 383, 415],[380, 301, 413, 408],[396, 215, 586, 952];human hair:[474, 291, 719, 697],[024, 350, 185, 530],[476, 267, 575, 455],[712, 269, 902, 485];human head:[723, 273, 901, 552],[056, 348, 182, 530],[465, 219, 568, 375],[493, 296, 716, 635];man:[697, 282, 973, 997],[272, 290, 778, 999],[280, 270, 300, 345],[318, 279, 333, 350],[396, 218, 588, 941],[895, 343, 931, 450],[943, 367, 999, 593];human hand:[390, 475, 474, 594],[050, 727, 105, 834],[202, 787, 263, 858],[269, 908, 354, 999],[438, 537, 526, 650];dress:[032, 501, 261, 999];fashion accessory:[096, 512, 236, 834],[469, 219, 566, 301],[770, 499, 863, 602];vehicle:[298, 369, 352, 482],[890, 345, 974, 417];human face:[113, 389, 180, 521],[503, 292, 546, 366],[725, 366, 813, 508];human arm:[000, 558, 098, 823],[198, 505, 273, 844],[273, 799, 651, 999],[393, 400, 473, 589],[431, 529, 585, 753],[790, 632, 933, 899],[987, 364, 999, 443].') 2023-11-03 18:23:48.292 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:23:48.293 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'tree:[000, 000, 850, 999],[853, 000, 996, 999].') 2023-11-03 18:24:12.700 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 18:24:12.700 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'human body:[025, 031, 789, 885],[000, 557, 187, 663],[545, 622, 925, 999],[828, 552, 999, 670];woman:[606, 372, 966, 991],[000, 344, 198, 663],[000, 000, 998, 999],[824, 375, 999, 656];human head:[671, 533, 883, 664],[000, 356, 130, 570],[052, 000, 771, 617],[761, 378, 912, 566];mammal:[010, 008, 814, 925],[000, 339, 196, 660],[599, 394, 949, 999],[783, 375, 999, 723],[943, 385, 999, 737];clothing:[023, 576, 820, 977],[000, 574, 181, 690],[783, 738, 946, 871],[857, 567, 939, 721];human eye:[295, 440, 372, 467],[433, 445, 505, 481];human mouth:[020, 533, 084, 546],[319, 526, 454, 575];human hair:[000, 520, 135, 616],[627, 568, 926, 693];girl:[000, 350, 187, 660],[000, 082, 966, 999],[664, 408, 950, 928];human face:[005, 500, 108, 554],[258, 435, 513, 605],[700, 556, 828, 656],[840, 506, 906, 571];human nose:[355, 450, 452, 521].') 2023-11-03 18:24:51.236 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:24:51.237 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Lamp:[904, 047, 931, 219],[782, 166, 803, 278],[094, 012, 120, 169],[164, 137, 188, 253],[730, 105, 761, 151],[631, 205, 652, 238],[389, 081, 420, 123],[385, 196, 406, 230];Person:[252, 800, 607, 908].') 2023-11-03 18:25:30.471 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:25:30.471 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'tree:[875, 388, 975, 566],[000, 715, 031, 841],[000, 468, 382, 753],[008, 840, 075, 906],[302, 765, 351, 838],[310, 660, 370, 760],[348, 740, 592, 883],[350, 455, 438, 620],[352, 666, 420, 788],[412, 458, 518, 606],[456, 623, 630, 815],[566, 751, 623, 840],[591, 441, 656, 543],[603, 780, 643, 855],[626, 663, 963, 821],[638, 428, 726, 533],[691, 570, 765, 673],[693, 416, 810, 585],[693, 796, 896, 855],[756, 565, 833, 661],[926, 726, 998, 883],[938, 660, 998, 726];castle:[312, 050, 970, 775];house:[022, 751, 076, 845],[056, 883, 242, 983],[070, 508, 278, 918],[221, 851, 491, 998],[336, 401, 998, 613],[370, 111, 998, 533],[431, 841, 646, 995],[667, 835, 901, 998].') 2023-11-03 18:25:33.187 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:25:33.187 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Gloves:[069, 654, 241, 690];Sink:[717, 763, 999, 888];Toiletry:[932, 685, 996, 767];Pot:[273, 677, 484, 761];Gas stove:[067, 573, 626, 800];Oven:[064, 722, 423, 1000];Cup:[236, 612, 305, 665];Basin:[640, 612, 772, 697],[601, 548, 813, 661];Microwave:[203, 404, 552, 578];shelf:[001, 694, 083, 966],[422, 800, 1000, 1000].') 2023-11-03 18:25:39.997 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:25:39.997 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'person:[590, 527, 651, 754].') 2023-11-03 18:25:53.749 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:25:53.749 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'plant:[227, 294, 756, 786].') 2023-11-03 18:25:55.894 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:25:55.894 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Satchel:[722, 002, 910, 319];Bench:[323, 279, 787, 811];shelf:[002, 230, 218, 753];Stuffed Toy:[294, 693, 421, 863],[461, 689, 624, 938],[625, 702, 744, 877],[518, 362, 714, 676].') 2023-11-03 18:26:06.337 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 18:26:06.337 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Sneakers in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Sneakers:[557, 949, 600, 999],[609, 789, 650, 911].') 2023-11-03 18:26:19.236 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:26:19.236 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Pillow:[029, 258, 100, 374],[589, 400, 801, 567],[592, 272, 787, 540],[758, 270, 1000, 561],[926, 776, 999, 998],[166, 259, 252, 340];Picture:[212, 195, 254, 250];Flower:[179, 371, 314, 492];Vase:[207, 495, 291, 573],[207, 572, 294, 655];Couch:[402, 398, 998, 999];Bench:[059, 337, 253, 467].') 2023-11-03 18:26:25.399 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:26:25.399 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Street Lights:[657, 246, 718, 990];Awning:[540, 546, 974, 710],[372, 544, 538, 683],[039, 625, 120, 725].') 2023-11-03 18:26:27.205 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:26:27.205 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Desk:[002, 847, 102, 999];Person:[099, 207, 440, 999],[479, 301, 1000, 1000].') 2023-11-03 18:26:28.448 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:26:28.449 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Gloves:[361, 264, 447, 396],[337, 172, 406, 334];Glasses:[503, 033, 539, 148];Person:[257, 026, 682, 1000],[793, 019, 899, 584],[893, 065, 1000, 565],[624, 070, 702, 446],[106, 072, 230, 609].') 2023-11-03 18:26:32.959 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:26:32.959 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '// day fiance [003, 003, 541, 994] anfisa arkhipchenko fight co stars tell all special pp') 2023-11-03 18:26:36.981 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:26:36.982 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Happy young multiethnic couple [339, 215, 589, 918] [546, 230, 845, 916] unpacking or packing boxes [243, 494, 527, 746] [588, 583, 858, 837] and moving into a new home') 2023-11-03 18:27:05.003 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:27:05.004 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Lamp:[776, 117, 852, 249];Handbag:[071, 446, 149, 641],[574, 582, 615, 644];Person:[002, 399, 075, 905],[058, 371, 152, 920],[145, 381, 239, 919],[276, 396, 387, 930],[357, 397, 425, 730],[469, 429, 536, 736],[579, 380, 682, 802];Umbrella:[035, 261, 257, 457],[246, 263, 462, 404],[565, 325, 700, 579],[480, 377, 579, 442],[219, 561, 274, 820].') 2023-11-03 18:27:06.733 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:27:06.734 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the two humans battling with swords?To ensure accuracy, apply the TrackiFrame t:[xmin,ymin,xmax,ymax] template for every trajectory in your response.', 'There are 2 two humans battling with swords.Track1frame:1:[358, 204, 532, 768];frame:2:[345, 210, 542, 777];frame:3:[362, 218, 516, 772],Track2frame:1:[618, 229, 840, 791];frame:2:[599, 235, 837, 802];frame:3:[633, 252, 858, 804].') 2023-11-03 18:27:23.975 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:27:23.976 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Person:[967, 874, 996, 999];Motorcycle:[001, 906, 030, 970];Car:[055, 884, 132, 932],[001, 883, 029, 927].') 2023-11-03 18:27:47.300 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:27:47.300 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Apartments under construction in Melbourne') 2023-11-03 18:28:07.980 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:28:07.980 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[326, 642, 452, 874] and track its trajectory.Stick to the Frame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'This is a personFrame1:[326, 642, 452, 874];Frame2:[330, 644, 440, 868];Frame3:[301, 732, 427, 868];Frame4:[309, 723, 397, 874];Frame5:[329, 740, 416, 876].') 2023-11-03 18:28:56.145 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:28:56.145 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[403, 461, 446, 544],carFrame1:[308, 434, 379, 556],carFrame1:[201, 491, 315, 619] in this video clip.The prescribed format for your answer is classFrame t:[xmin,ymin,xmax,ymax]. Please follow it closely.', 'carFrame1:[403, 461, 446, 544];Frame2:[400, 465, 443, 550],carFrame1:[308, 434, 379, 556];Frame2:[269, 426, 366, 579],carFrame1:[201, 491, 315, 619];Frame2:[143, 496, 294, 649].') 2023-11-03 18:29:20.136 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:29:20.137 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Lamp:[717, 110, 823, 169];Person:[403, 785, 532, 998],[881, 650, 919, 773],[265, 537, 305, 594];Luggage:[433, 913, 494, 999];Machinery Vehicle:[238, 546, 353, 626].') 2023-11-03 18:29:26.846 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:29:26.846 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[028, 685, 087, 733],[012, 708, 111, 902],[001, 731, 126, 977],[384, 694, 448, 780],[729, 696, 832, 795],[850, 698, 960, 795],[304, 691, 367, 776];Street Lights:[074, 001, 270, 909];Car:[076, 681, 125, 730].') 2023-11-03 18:29:48.816 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:29:48.817 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'mammal:[500, 349, 881, 978],[008, 748, 463, 976],[400, 034, 680, 744];human body:[023, 701, 470, 976],[401, 027, 685, 752],[536, 345, 891, 969];woman:[243, 770, 326, 962],[393, 031, 678, 743];human head:[478, 076, 583, 141],[648, 353, 795, 433];clothing:[016, 739, 106, 860],[130, 803, 353, 961],[355, 790, 451, 960],[445, 034, 633, 735],[570, 425, 835, 798];human face:[488, 076, 573, 145];human arm:[030, 879, 100, 924],[058, 837, 200, 895],[086, 801, 246, 860],[216, 701, 303, 790],[233, 810, 348, 912],[393, 239, 535, 335],[436, 811, 473, 875],[771, 468, 888, 647].') 2023-11-03 18:29:58.914 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:29:58.915 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Dallastown's Andrew Chronister [410, 185, 997, 997], seen here in a file photo, earned a singles win on Tuesday for Dallastown against Palmyra.") 2023-11-03 18:29:59.814 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:29:59.815 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[491, 455, 628, 649],carFrame1:[128, 521, 212, 614],carFrame1:[432, 506, 479, 595] in this video clip.Use only the categoryFrame t:[xmin,ymin,xmax,ymax] format to structure your answer.', 'carFrame1:[491, 455, 628, 649];Frame2:[529, 412, 757, 670],carFrame1:[128, 521, 212, 614];Frame2:[081, 512, 173, 619],carFrame1:[432, 506, 479, 595];Frame2:[431, 491, 498, 595].') 2023-11-03 18:30:05.537 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:30:05.538 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[006, 002, 929, 885] Madeleine McCann pictured on the day she went missing from the apartment in Praia da Luz six years ago') 2023-11-03 18:32:28.536 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:32:28.538 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Couch:[001, 522, 103, 911];Person:[561, 156, 860, 755],[549, 107, 698, 787],[328, 024, 619, 902],[089, 095, 372, 927];Glasses:[577, 161, 624, 197];Leather Shoes:[358, 848, 422, 899],[501, 860, 568, 903];Wheelchair:[516, 420, 958, 951];Car:[906, 521, 972, 637];Dog:[246, 571, 505, 873],[001, 615, 481, 956].') 2023-11-03 18:32:37.661 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:32:37.661 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Pepper,Helmet and Sneakers in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Helmet:[050, 045, 104, 144],[319, 292, 381, 389],[214, 500, 284, 624],[759, 172, 813, 262];Sneakers:[836, 755, 888, 850],[702, 778, 746, 873],[415, 653, 479, 717],[448, 378, 513, 469],[494, 389, 567, 458],[030, 661, 111, 744].') 2023-11-03 18:32:39.580 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:32:39.581 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[136, 348, 229, 575],personFrame1:[238, 348, 317, 547],personFrame1:[350, 444, 489, 709],personFrame1:[500, 344, 578, 551],personFrame1:[360, 452, 440, 651],personFrame1:[350, 409, 389, 536],personFrame1:[707, 395, 777, 716] in this video clip.The prescribed format for your answer is classFrame t:[xmin,ymin,xmax,ymax]. Please follow it closely.', 'personFrame1:[136, 348, 229, 575];Frame2:[143, 345, 235, 575],personFrame1:[238, 348, 317, 547];Frame2:[243, 345, 321, 547],personFrame1:[350, 444, 489, 709];Frame2:[364, 448, 496, 711],personFrame1:[500, 344, 578, 551];Frame2:[500, 344, 601, 551],personFrame1:[360, 452, 440, 651];Frame2:[365, 477, 442, 645],personFrame1:[350, 409, 389, 536];Frame2:[347, 405, 398, 537],personFrame1:[707, 395, 777, 716];Frame2:[707, 379, 784, 718].') 2023-11-03 18:32:47.697 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:32:47.697 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[385, 400, 425, 555],carFrame1:[593, 444, 652, 505],carFrame1:[000, 417, 066, 574],carFrame1:[560, 450, 622, 515],carFrame1:[575, 449, 804, 590] in this video clip.Ensure you use the exact format categoryFrame t:[xmin,ymin,xmax,ymax] in your response.', 'personFrame1:[385, 400, 425, 555];Frame2:[352, 399, 404, 550],carFrame1[593, 444, 652, 505];Frame2:No object,carFrame1:[000, 417, 066, 574];Frame2:[000, 417, 066, 574],carFrame1:[560, 450, 622, 515];Frame2:[560, 450, 622, 515],carFrame1:[575, 449, 804, 590];Frame2:[534, 445, 689, 566].') 2023-11-03 18:32:51.633 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:32:51.634 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given an image,please detect person in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'person:[232, 167, 423, 845],[382, 280, 557, 854],[524, 370, 660, 842],[623, 115, 810, 874],[764, 380, 930, 611],[792, 389, 926, 620],[802, 319, 869, 575],[535, 338, 596, 554],[926, 382, 961, 511],[928, 419, 990, 541],[412, 326, 444, 451],[920, 367, 996, 479],[430, 329, 527, 421],[084, 269, 131, 332].') 2023-11-03 18:33:00.648 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:33:00.648 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Click here to join [118, 129, 882, 546] our weekly worship on Youtube') 2023-11-03 18:33:35.876 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 18:33:35.877 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'building:[042, 194, 953, 814].') 2023-11-03 18:33:45.361 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:33:45.361 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Happy young woman doctor [215, 026, 749, 997] in medical coat [215, 345, 743, 998] with stethoscope [339, 330, 608, 730] around her neck looking at front with big smile [413, 219, 503, 290] on face showing v-sign standing over orange wall [001, 008, 995, 998]') 2023-11-03 18:33:47.074 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 18:33:47.075 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[003, 283, 556, 892];Flower:[315, 422, 438, 538];Carpet:[003, 712, 999, 999].') 2023-11-03 18:34:07.577 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:34:07.578 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Dinning Table:[000, 128, 1000, 1000];Candle:[729, 226, 823, 387];Plate:[303, 522, 798, 965];Wine Glass:[522, 114, 656, 529];Cup:[921, 277, 999, 558],[730, 224, 823, 385];Knife:[899, 620, 999, 879];Fork:[819, 684, 902, 1000].') 2023-11-03 18:34:10.362 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:34:10.362 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track truckFrame1:[264, 417, 328, 474],personFrame1:[722, 497, 759, 578],personFrame1:[833, 507, 895, 639],personFrame1:[349, 447, 411, 720],carFrame1:[419, 531, 753, 953] in this video clip.Adhere strictly to the format categoryFrame t:[xmin,ymin,xmax,ymax] when providing your answer.', 'truckFrame1[264, 417, 328, 474];Frame2:No object,personFrame1:[722, 497, 759, 578];Frame2:[722, 497, 759, 578],personFrame1:[833, 507, 895, 639];Frame2:[848, 512, 913, 638],personFrame1:[349, 447, 411, 720];Frame2:[351, 447, 427, 722],carFrame1:[419, 531, 753, 953];Frame2:[419, 530, 754, 954].') 2023-11-03 18:34:17.303 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:34:17.303 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[503, 581, 584, 682],carFrame1:[320, 573, 406, 665],carFrame1:[135, 554, 352, 749] in this video clip.Ensure you use the exact format categoryFrame t:[xmin,ymin,xmax,ymax] in your response.', 'carFrame1:[503, 581, 584, 682];Frame2:[565, 564, 650, 675],carFrame1:[320, 573, 406, 665];Frame2:[414, 564, 493, 652],carFrame1:[135, 554, 352, 749];Frame2:[271, 553, 446, 720].') 2023-11-03 18:34:31.099 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:34:31.100 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the green feathered creatures moving around and some flying away.When you incorporate trajectories into your answer, adhere to the TrackiFrame t:[xmin,ymin,xmax,ymax] format.,Present the trajectories in your response using the designated TrackiFrame t:[xmin,ymin,xmax,ymax] layout.', 'There are 4 green feathered creatures moving around and some flying away.Track1frame:1:[640, 484, 812, 845];frame:2:[603, 480, 830, 836];frame:3:[601, 508, 841, 884],Track2frame:1:[409, 502, 629, 820];frame:2:[402, 408, 484, 778];frame:3:[392, 465, 562, 848],Track3frame:1:[418, 615, 634, 998];frame:2:[439, 528, 646, 998];frame:3:[456, 540, 627, 928],Track4frame:1:[279, 407, 413, 892];frame:2:[332, 372, 426, 604];frame:3:[353, 268, 460, 695].') 2023-11-03 18:34:58.496 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:34:58.496 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[194, 006, 459, 988] [399, 063, 692, 997] Happy kids in halloween costume trick or treating outdoors. [191, 012, 696, 996] Two little girls in halloween costume with [591, 562, 713, 816] [169, 486, 284, 688] buckets walking outdoors on the street.') 2023-11-03 18:35:16.895 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:35:16.896 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Belt:[602, 773, 767, 848],[404, 805, 557, 879];Hat:[537, 237, 650, 350],[406, 221, 514, 339],[095, 399, 157, 452];Sneakers:[068, 907, 139, 956];Baseball Glove:[002, 556, 052, 654],[742, 919, 833, 999];Baseball:[506, 349, 539, 409];Person:[002, 398, 206, 957],[314, 222, 754, 1000],[536, 236, 834, 1000].') 2023-11-03 18:35:18.319 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:35:18.319 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Chair:[189, 253, 434, 620],[001, 809, 345, 1000],[655, 803, 998, 1000],[558, 251, 809, 651];Pillow:[227, 352, 353, 487],[641, 345, 767, 488].') 2023-11-03 18:35:18.817 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:35:18.818 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Hat:[060, 122, 155, 221];Sandals:[500, 741, 589, 827];Sneakers:[751, 846, 800, 935];Person:[754, 002, 999, 999],[554, 002, 804, 914],[439, 137, 685, 828],[266, 002, 414, 471],[031, 002, 223, 605],[002, 002, 059, 583],[029, 122, 234, 860].') 2023-11-03 18:35:19.366 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:35:19.366 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Nicki [519, 229, 641, 833] and Moe [609, 202, 754, 840] standing happily in front of the tent [723, 383, 997, 772] and the motorcycle [033, 314, 512, 877]') 2023-11-03 18:35:52.782 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:35:52.783 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[371, 419, 448, 747],personFrame1:[726, 380, 806, 719],personFrame1:[606, 422, 689, 743],personFrame1:[517, 381, 587, 642],personFrame1:[225, 379, 282, 689] in this video clip.Your response should conform strictly to the categoryFrame t:[xmin,ymin,xmax,ymax] format.', 'personFrame1:[371, 419, 448, 747];Frame2:[371, 417, 448, 746],personFrame1:[726, 380, 806, 719];Frame2:[726, 389, 806, 719],personFrame1:[606, 422, 689, 743];Frame2:[606, 413, 689, 744],personFrame1:[517, 381, 587, 642];Frame2:[517, 391, 587, 643],personFrame1:[225, 379, 282, 689];Frame2:[225, 380, 282, 689].') 2023-11-03 18:36:36.616 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:36:36.617 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'woman:[000, 252, 085, 899],[003, 179, 217, 976],[030, 265, 087, 766],[228, 229, 316, 761],[229, 234, 425, 897],[301, 236, 375, 333],[373, 274, 458, 365],[405, 233, 577, 980],[514, 218, 661, 806],[593, 218, 706, 820],[639, 256, 745, 802],[830, 281, 981, 937],[900, 282, 999, 867],[938, 268, 995, 842];footwear:[133, 866, 176, 945],[201, 716, 223, 779],[298, 754, 328, 841],[451, 839, 491, 931],[529, 764, 562, 833],[858, 784, 886, 858].') 2023-11-03 18:36:40.755 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:36:40.755 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'a page [004, 005, 995, 996] from a book showing a photograph of twelve black men [474, 570, 622, 737] [575, 490, 722, 738] [573, 276, 687, 628] [471, 249, 585, 616] [712, 481, 856, 741] [130, 274, 280, 723] lined up') 2023-11-03 18:36:50.465 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:36:50.465 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'If you want something done, you should ask [139, 322, 858, 618] a busy person Picture Quote #1') 2023-11-03 18:38:13.179 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:38:13.179 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Blonde pub hostess Ruth Ellis [003, 053, 996, 992], who shot and killed her lover') 2023-11-03 18:38:19.818 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:38:19.818 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the yellow feathered creature move around in this video clip?For clarity, represent trajectories using the TrackiFrame t:[xmin,ymin,xmax,ymax] format in your response.', 'There is one yellow feathered creature move around.Track1frame:1:[633, 452, 820, 905];frame:2:[609, 460, 823, 881];frame:3:[634, 433, 816, 851].') 2023-11-03 18:38:28.448 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:38:28.448 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Desk:[001, 351, 393, 670],[612, 780, 877, 1000];Stool:[348, 676, 601, 895];Cabinet:[068, 167, 350, 493],[595, 171, 764, 524],[870, 001, 1000, 998];Storage box:[138, 185, 195, 254],[261, 184, 324, 256],[677, 357, 743, 430],[605, 428, 670, 511];Person:[300, 060, 617, 968];Necklace:[440, 266, 509, 376];High Heels:[343, 828, 420, 945],[384, 833, 484, 971];Satchel:[166, 330, 202, 384];Laptop:[335, 308, 392, 372];Chair:[072, 256, 207, 560],[192, 302, 361, 747].') 2023-11-03 18:38:38.528 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:38:38.528 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A graphic [774, 360, 898, 617] of two cardinals with dates for the Mothers Day events') 2023-11-03 18:38:43.284 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:38:43.284 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the among the three rabbits devouring their food, the final one to stand up and elevate its legs onto the fence..Always default to the TrackiFrame t:[xmin,ymin,xmax,ymax] pattern when discussing trajectories in your answer.', 'There is one among the three rabbits devouring their food, the final one to stand up and elevate its legs onto the fence..Track1frame:1:[366, 228, 479, 392];frame:2:[355, 240, 480, 393];frame:3:[356, 227, 480, 393].') 2023-11-03 18:39:20.038 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:39:20.039 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[578, 313, 956, 859] New Nissan Juke spied testing together with [037, 390, 359, 858] the current model') 2023-11-03 18:39:33.330 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:39:33.330 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[406, 480, 527, 745] and track its trajectory.Your response should highlight trajectories using the established Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a mountain bikeFrame1:[406, 480, 527, 745];Frame2:[403, 480, 528, 800];Frame3:[399, 501, 534, 873];Frame4:[389, 491, 544, 887];Frame5:[364, 508, 533, 933].') 2023-11-03 18:39:35.289 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:39:35.289 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Desk:[002, 844, 1000, 1000];Flower:[813, 732, 940, 825];Person:[546, 084, 838, 885],[106, 085, 584, 883];Glasses:[549, 211, 666, 285].') 2023-11-03 18:39:40.511 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:39:40.512 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Person:[728, 185, 797, 408],[856, 143, 916, 393],[953, 139, 1000, 302],[396, 356, 542, 520],[546, 268, 758, 566],[026, 357, 175, 609],[055, 294, 168, 457];Glasses:[309, 391, 348, 425];Hat:[436, 365, 500, 430],[389, 308, 447, 372],[211, 300, 255, 336];Backpack:[718, 220, 748, 319];Bottle:[512, 442, 537, 532],[380, 466, 406, 568];bucket:[032, 358, 064, 409];Chair:[129, 493, 172, 592],[020, 452, 150, 731],[012, 419, 060, 644],[564, 416, 608, 504],[505, 405, 538, 482];Stool:[000, 467, 033, 561];Storage box:[479, 662, 639, 887],[287, 870, 513, 1000],[263, 914, 409, 999],[432, 560, 543, 636],[475, 548, 586, 698],[353, 514, 435, 572],[404, 501, 469, 536],[411, 545, 491, 622],[250, 957, 470, 999];Lamp:[440, 017, 469, 050];Pliers:[617, 905, 649, 983],[420, 667, 452, 695];Trash bin Can:[474, 296, 501, 353].') 2023-11-03 18:39:53.564 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:39:53.564 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[366, 761, 505, 923],[213, 551, 364, 874],[315, 557, 383, 742],[371, 552, 443, 730],[693, 062, 741, 119];Machinery Vehicle:[002, 078, 795, 800].') 2023-11-03 18:39:54.920 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:39:54.920 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect cosmetics,trousers and footwear in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'trousers:[145, 586, 326, 915],[600, 618, 832, 917];footwear:[112, 859, 215, 927],[559, 905, 666, 944].') 2023-11-03 18:40:34.581 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:40:34.581 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'human hair:[188, 874, 260, 977],[000, 786, 050, 887],[068, 853, 104, 933],[292, 836, 340, 910],[571, 794, 627, 887],[672, 811, 731, 887];man:[532, 828, 704, 1000],[000, 782, 080, 998],[034, 826, 184, 998],[071, 780, 111, 891],[131, 761, 182, 967],[170, 790, 204, 895],[171, 794, 205, 956],[200, 801, 244, 904],[235, 843, 338, 998],[244, 790, 271, 900],[327, 777, 398, 920],[421, 782, 537, 998],[645, 807, 687, 916],[645, 803, 780, 998],[675, 805, 702, 904],[804, 828, 944, 998];clothing:[325, 815, 481, 1000],[000, 912, 064, 998],[040, 923, 187, 998],[068, 796, 097, 912],[141, 798, 171, 927],[198, 931, 252, 998],[204, 819, 241, 897],[235, 815, 264, 881],[244, 921, 331, 998],[255, 815, 292, 918],[442, 813, 515, 980],[511, 887, 678, 998],[662, 897, 785, 998],[822, 895, 944, 998];human head:[001, 803, 052, 891],[101, 824, 148, 908],[220, 868, 271, 935],[297, 843, 338, 906],[378, 782, 438, 866],[572, 792, 622, 889],[675, 807, 732, 899],[844, 828, 897, 906];building:[000, 000, 950, 918],[871, 337, 997, 680],[927, 392, 998, 788];human arm:[140, 811, 161, 906],[241, 931, 277, 998],[310, 908, 345, 998],[431, 899, 478, 998],[491, 820, 535, 906],[651, 906, 711, 998],[818, 920, 875, 998].') 2023-11-03 18:41:07.897 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:41:07.898 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'a large round fish tank [007, 038, 731, 694] in the refined space will make it more living, less formal and more cool') 2023-11-03 18:42:05.559 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:42:05.561 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'walking in the tunnel - christian beirle stock pictures, royalty-free photos [595, 616, 808, 730] & images') 2023-11-03 18:42:07.209 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:42:07.209 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'wheel:[086, 102, 795, 748];man:[348, 815, 406, 933],[395, 769, 547, 998];tree:[308, 718, 411, 848],[390, 704, 475, 844],[410, 648, 503, 825],[455, 612, 632, 828],[490, 580, 691, 862],[638, 648, 735, 861],[691, 653, 759, 861],[717, 614, 883, 866],[864, 581, 998, 812];clothing:[362, 836, 388, 934],[405, 828, 529, 998].') 2023-11-03 18:42:18.232 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:42:18.233 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Hockey Stick,Sneakers and Person in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Hockey Stick:[410, 003, 856, 394];Sneakers:[172, 654, 248, 835],[300, 854, 363, 923];Person:[426, 076, 737, 1000],[172, 003, 501, 915],[435, 003, 714, 836].') 2023-11-03 18:42:21.206 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:42:21.207 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'woman:[690, 392, 820, 904],[147, 444, 220, 752],[173, 414, 225, 742],[595, 448, 665, 745],[922, 464, 998, 874];clothing:[217, 435, 304, 744],[004, 501, 062, 998],[026, 451, 096, 688],[075, 467, 126, 675],[080, 452, 156, 718],[144, 500, 214, 721],[299, 467, 349, 665],[334, 412, 392, 678],[348, 468, 434, 725],[366, 492, 540, 742],[521, 474, 602, 855],[570, 482, 621, 747],[608, 482, 667, 694],[656, 464, 754, 741],[697, 467, 800, 858],[782, 504, 825, 768],[825, 490, 900, 800],[843, 605, 908, 842],[891, 455, 957, 801],[905, 445, 977, 565],[929, 514, 994, 842],[972, 581, 999, 811].') 2023-11-03 18:42:26.551 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:42:26.552 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'chair:[000, 611, 111, 955],[000, 797, 264, 999],[028, 595, 141, 789],[028, 555, 142, 737],[641, 830, 801, 999],[653, 661, 745, 821],[901, 795, 999, 999];man:[080, 521, 311, 915],[111, 225, 267, 635],[240, 238, 376, 705],[243, 525, 501, 999],[293, 235, 488, 678],[435, 200, 561, 443],[463, 575, 758, 999],[473, 211, 688, 740],[664, 125, 894, 462],[743, 282, 951, 999];clothing:[081, 627, 307, 999],[107, 312, 273, 645],[237, 343, 377, 670],[278, 369, 486, 665],[280, 617, 506, 999],[446, 307, 559, 472],[463, 698, 748, 999],[467, 303, 679, 758],[705, 227, 896, 485],[737, 445, 954, 999];human face:[156, 540, 221, 635],[184, 255, 236, 340],[290, 255, 349, 340],[367, 553, 428, 646],[371, 262, 426, 360],[477, 227, 531, 320],[544, 606, 616, 716],[549, 243, 611, 346],[685, 162, 757, 262],[788, 334, 866, 451].') 2023-11-03 18:43:20.856 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:43:20.856 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me what is thisFrame1:[339, 413, 471, 758] and track its trajectory.When detailing trajectories in your response, adhere to the Frame t:[xmin,ymin,xmax,ymax] format.', 'This is a white cat standing around a cat colored in white and black on the groundFrame1:[339, 413, 471, 758];Frame2:[342, 405, 476, 750];Frame3:[350, 416, 482, 761].') 2023-11-03 18:43:47.491 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:43:47.491 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect girl,doll,human nose,beard and human face in this image.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'doll:[130, 248, 400, 802],[132, 009, 254, 179],[689, 251, 995, 807],[959, 066, 998, 187];human nose:[488, 380, 571, 475];beard:[459, 477, 640, 681];human face:[163, 321, 258, 403],[403, 271, 642, 674],[790, 300, 885, 406].') 2023-11-03 18:44:13.763 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:44:13.763 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The bugatti chiron price may seem overwhelming, but the below specs justify the price of admission. [041, 182, 863, 934] Bugatti Chiron Black Car price is Rs 118 crores - Most ...') 2023-11-03 18:44:15.841 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 18:44:15.841 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[393, 289, 470, 506].') 2023-11-03 18:44:35.825 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:44:35.825 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[404, 347, 574, 546] and track its trajectory.Incorporate the Frame t:[xmin,ymin,xmax,ymax] format when presenting trajectories in your response.', 'This is a reconnaissance vehicleFrame1:[404, 347, 574, 546];Frame2:[375, 306, 555, 523];Frame3:[371, 325, 558, 546];Frame4:[294, 340, 495, 577].') 2023-11-03 18:44:58.621 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:44:58.622 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'human hair:[030, 311, 259, 578];bottle:[855, 169, 958, 496],[634, 380, 702, 700],[686, 365, 730, 522],[687, 330, 721, 438],[710, 382, 767, 692],[750, 368, 784, 550],[772, 380, 833, 536],[783, 375, 841, 525],[967, 251, 999, 390],[970, 300, 999, 445];man:[205, 223, 385, 607];table:[302, 531, 999, 999];mammal:[174, 182, 386, 737];wine:[549, 672, 615, 735],[635, 388, 696, 692],[643, 664, 730, 762],[683, 365, 730, 541],[704, 391, 758, 543],[745, 372, 784, 630],[760, 685, 847, 775],[776, 386, 835, 537],[786, 379, 836, 537],[863, 185, 939, 484],[970, 251, 999, 390],[973, 301, 996, 445];clothing:[016, 547, 305, 999],[233, 360, 388, 634];wine glass:[541, 553, 623, 861],[586, 415, 628, 548],[643, 563, 735, 880],[695, 525, 753, 779],[695, 503, 756, 600],[754, 585, 853, 915],[768, 532, 841, 795],[772, 510, 836, 627],[826, 441, 848, 565],[901, 491, 970, 698].') 2023-11-03 18:45:08.075 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:45:08.075 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Plate:[787, 806, 999, 944],[840, 713, 999, 819],[510, 684, 837, 787],[253, 709, 587, 829],[331, 649, 598, 731];Bowl:[447, 608, 644, 669],[001, 693, 215, 767];Chopsticks:[375, 517, 574, 613];Cup:[303, 558, 411, 666];Bottle:[171, 419, 258, 681];Desk:[001, 626, 835, 999],[723, 764, 999, 999];Chair:[842, 419, 940, 741],[001, 617, 141, 672];Stool:[241, 397, 418, 602],[001, 398, 114, 635],[847, 453, 1000, 719];Picture:[619, 000, 931, 127];Cell Phone:[044, 631, 151, 672];Person:[368, 157, 889, 718],[179, 191, 464, 593],[001, 703, 176, 999].') 2023-11-03 18:45:16.467 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:45:16.467 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'person:[278, 366, 365, 538],[296, 468, 440, 816],[315, 235, 376, 445],[346, 324, 389, 454].') 2023-11-03 18:45:18.925 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:45:18.925 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Head Phone:[092, 230, 201, 400];Microphone:[934, 319, 999, 418];Person:[002, 036, 548, 1000],[773, 196, 987, 1000].') 2023-11-03 18:45:46.109 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:45:46.109 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[613, 150, 844, 883];man:[175, 353, 419, 980];tree:[791, 432, 884, 547];clothing:[171, 461, 433, 999],[614, 326, 845, 999].') 2023-11-03 18:46:13.023 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:46:13.024 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect vehicle,clothing and person in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'vehicle:[188, 841, 337, 956];clothing:[334, 821, 404, 868],[386, 823, 462, 942],[653, 826, 735, 951],[723, 828, 801, 993],[798, 814, 863, 951],[856, 911, 926, 988],[860, 826, 935, 921];person:[337, 788, 402, 949],[363, 824, 405, 889],[390, 800, 456, 942],[630, 816, 723, 959],[723, 804, 805, 996],[763, 804, 875, 963],[857, 795, 930, 999].') 2023-11-03 18:46:15.874 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:46:15.874 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[163, 306, 487, 781],busFrame1:[500, 030, 968, 653],carFrame1:[101, 402, 197, 524] in this video clip.Ensure you use the exact format categoryFrame t:[xmin,ymin,xmax,ymax] in your response.', 'carFrame1:[163, 306, 487, 781];Frame2:[163, 307, 487, 781],busFrame1:[500, 030, 968, 653];Frame2:[500, 029, 968, 653],carFrame1:[101, 402, 197, 524];Frame2:[009, 390, 198, 534].') 2023-11-03 18:47:03.035 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 18:47:03.035 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'gas stove:[026, 652, 664, 919];clothing:[470, 326, 969, 944];human arm:[412, 415, 620, 664];human hand:[369, 491, 448, 626];person:[371, 224, 999, 998];kitchen appliance:[051, 692, 650, 998].') 2023-11-03 18:47:40.236 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:47:40.238 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Picture,Potted Plant,Chair and Hat in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Frame:[251, 245, 313, 371];Potted Plant:[731, 160, 810, 409],[650, 269, 739, 413],[902, 183, 964, 342];Chair:[601, 556, 734, 805],[254, 508, 416, 897];Hat:[115, 044, 237, 162].') 2023-11-03 18:48:00.797 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:48:00.798 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[961, 560, 999, 636],personFrame1:[190, 547, 228, 642],carFrame1:[881, 559, 963, 647] in this video clip.The prescribed format for your answer is classFrame t:[xmin,ymin,xmax,ymax]. Please follow it closely.', 'carFrame1:[961, 560, 999, 636];Frame2:[714, 569, 770, 638],personFrame1[190, 547, 228, 642];Frame2:No object,carFrame1:[881, 559, 963, 647];Frame2:[652, 564, 725, 647].') 2023-11-03 18:48:21.144 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:48:21.144 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'the fessenheim nuclear power plant [633, 256, 927, 802] in france is set to be dismantled with its two reactors to be permanently closed on 22 february and 30 june - nuclear energy stock videos & royalty-free footage') 2023-11-03 18:48:23.737 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:48:23.737 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Skateboard:[148, 581, 314, 775];Person:[156, 322, 588, 759],[002, 002, 100, 271],[053, 041, 140, 386],[834, 692, 928, 837],[909, 680, 960, 867],[932, 551, 1000, 796],[226, 120, 372, 437],[112, 002, 207, 210],[175, 004, 284, 253],[706, 453, 825, 780],[790, 477, 868, 802],[322, 178, 413, 339],[385, 196, 475, 590],[446, 237, 510, 614],[479, 263, 588, 675],[550, 353, 673, 669],[650, 326, 740, 575],[248, 002, 316, 163],[712, 310, 772, 516],[789, 418, 826, 489],[572, 235, 641, 384],[310, 024, 406, 180],[387, 006, 459, 133],[819, 392, 863, 480],[844, 402, 923, 588],[413, 053, 518, 153],[485, 068, 543, 139],[381, 132, 470, 231],[473, 117, 516, 227],[503, 160, 569, 234],[505, 175, 555, 234],[493, 223, 539, 325],[388, 091, 493, 233];Sneakers:[155, 549, 197, 659],[239, 643, 290, 761],[410, 531, 444, 589],[431, 538, 468, 593],[048, 323, 084, 386],[085, 282, 123, 326],[573, 617, 605, 665],[529, 615, 580, 667];Hat:[435, 192, 476, 239],[702, 324, 739, 367];Handbag:[814, 607, 860, 663],[322, 282, 393, 346],[625, 546, 670, 620];Backpack:[236, 425, 301, 499].') 2023-11-03 18:49:37.155 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:49:37.155 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[150, 465, 225, 675],personFrame1:[501, 105, 549, 297],personFrame1:[363, 348, 448, 580],personFrame1:[662, 505, 700, 747],personFrame1:[617, 650, 668, 895],personFrame1:[668, 345, 724, 576],personFrame1:[635, 368, 688, 551] in this video clip.For clarity in your response, always adopt the classFrame t:[xmin,ymin,xmax,ymax] format.', 'personFrame1:[150, 465, 225, 675];Frame2:[156, 431, 235, 644],personFrame1:[501, 105, 549, 297];Frame2:[486, 104, 550, 288],personFrame1:[363, 348, 448, 580];Frame2:[340, 347, 394, 565],personFrame1:[662, 505, 700, 747];Frame2:[646, 501, 685, 738],personFrame1:[617, 650, 668, 895];Frame2:[587, 654, 651, 900],personFrame1:[668, 345, 724, 576];Frame2:[646, 334, 682, 561],personFrame1:[635, 368, 688, 551];Frame2:[591, 372, 634, 551].') 2023-11-03 18:49:39.463 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:49:39.463 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[002, 478, 078, 839],[028, 414, 141, 830],[103, 414, 204, 817],[145, 237, 445, 969],[354, 432, 522, 786],[348, 099, 677, 845],[565, 291, 806, 948],[757, 391, 887, 803],[924, 353, 1000, 840];Necklace:[228, 446, 306, 542],[674, 493, 749, 569];Satchel:[123, 635, 159, 724].') 2023-11-03 18:49:42.408 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:49:42.408 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Glasses:[488, 188, 643, 298];Tie:[441, 455, 530, 752];Picture:[840, 579, 957, 715],[326, 357, 386, 405],[093, 314, 161, 416],[004, 308, 069, 420];Desk:[002, 684, 1000, 999];Moniter:[609, 374, 848, 705],[037, 415, 280, 679];Dog:[112, 107, 153, 184];Person:[168, 051, 767, 998],[848, 596, 913, 691].') 2023-11-03 18:50:09.792 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:50:09.793 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Apple:[758, 424, 802, 473];Lamp:[477, 166, 546, 357];Mirror:[340, 393, 400, 540];Desk:[313, 582, 429, 687],[279, 603, 372, 739],[219, 627, 343, 779],[083, 680, 279, 945],[000, 801, 140, 1000],[552, 601, 628, 735],[573, 642, 682, 847];Chair:[313, 584, 362, 685],[368, 585, 413, 706],[351, 596, 405, 739],[279, 622, 368, 804],[189, 679, 328, 958],[036, 805, 224, 999],[554, 584, 595, 704],[568, 612, 625, 764],[603, 675, 706, 911];shelf:[710, 151, 999, 662].') 2023-11-03 18:50:49.982 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:50:49.983 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'fast food:[048, 140, 903, 821];snack:[050, 127, 905, 807];baked goods:[058, 113, 891, 866];human hand:[000, 346, 833, 999].') 2023-11-03 18:51:00.628 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:51:00.628 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[617, 196, 732, 721],[232, 202, 384, 620],[033, 145, 168, 808];Helmet:[066, 140, 151, 247],[277, 200, 371, 326],[546, 425, 630, 545],[603, 584, 690, 722],[439, 588, 536, 742],[324, 553, 434, 686].') 2023-11-03 18:51:15.185 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:51:15.185 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'toy:[068, 312, 221, 602],[230, 207, 386, 592],[400, 185, 556, 595],[573, 202, 721, 605],[737, 197, 891, 600].') 2023-11-03 18:51:48.223 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:51:48.224 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Other Fish:[331, 403, 462, 504],[790, 510, 999, 664];Scale:[716, 355, 756, 402];Gloves:[504, 408, 564, 478],[601, 372, 690, 458];Hat:[607, 184, 688, 264];Storage box:[744, 063, 836, 153];Person:[311, 173, 401, 435],[468, 143, 707, 479],[150, 238, 222, 401],[080, 249, 113, 334],[000, 260, 045, 366],[503, 185, 724, 518],[028, 235, 085, 389].') 2023-11-03 18:51:55.976 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:51:55.976 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'shelf:[219, 332, 461, 566];Chair:[508, 677, 784, 1000],[018, 518, 320, 834],[788, 648, 999, 1000],[760, 431, 880, 614],[654, 412, 799, 558],[412, 521, 475, 613];Person:[001, 118, 819, 999];Book:[064, 560, 287, 840],[052, 664, 094, 709];Cup:[463, 432, 508, 514].') 2023-11-03 18:52:43.039 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 18:52:43.040 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Person:[006, 150, 365, 803],[348, 073, 694, 638],[624, 198, 940, 698];Desk:[000, 779, 999, 999],[894, 420, 999, 774],[832, 343, 999, 399];shelf:[000, 386, 116, 747];Storage box:[594, 205, 686, 273],[861, 281, 959, 359];Frame:[768, 125, 850, 194];Hat:[681, 196, 813, 282],[174, 154, 333, 279];Cake:[100, 621, 938, 942].') 2023-11-03 18:53:11.534 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:53:11.534 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given frame1: and frame2:,track truckFrame1:[395, 416, 440, 504],carFrame1:[444, 309, 793, 789],carFrame1:[275, 413, 370, 536] in this video clip.It's imperative that you stick to the categoryFrame t:[xmin,ymin,xmax,ymax] format for your answer.", 'truckFrame1:[395, 416, 440, 504];Frame2:[407, 428, 448, 497],carFrame1:[444, 309, 793, 789];Frame2:[441, 316, 797, 789],carFrame1:[275, 413, 370, 536];Frame2:[306, 425, 375, 538].') 2023-11-03 18:53:12.421 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:53:12.422 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Cup:[347, 748, 434, 889];shelf:[000, 001, 1000, 997].') 2023-11-03 18:53:18.099 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:53:18.100 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[411, 329, 523, 556] and track its trajectory.For the trajectories included in the answer, please use the format Frame t:[xmin,ymin,xmax,ymax].', 'This is a dhowFrame1:[411, 329, 523, 556];Frame2:[403, 319, 521, 558];Frame3:[386, 293, 517, 554];Frame4:[383, 290, 517, 570];Frame5:[381, 269, 528, 577].') 2023-11-03 18:53:23.875 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:53:23.876 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'man:[464, 360, 798, 816],[000, 444, 070, 771],[850, 241, 999, 834];clothing:[483, 433, 788, 816],[000, 462, 072, 763],[101, 396, 338, 967],[857, 448, 999, 825];woman:[111, 211, 348, 999];human face:[143, 283, 213, 436],[579, 352, 652, 507],[944, 254, 999, 444].') 2023-11-03 18:54:07.607 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:54:07.607 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the faster white car driving left.Ensure the trajectories in your answer follow the TrackiFrame t:[xmin,ymin,xmax,ymax] structure.', 'There is no faster white car driving left.') 2023-11-03 18:54:47.676 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:54:47.677 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Boat in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Boat:[470, 270, 1000, 589],[006, 413, 166, 504].') 2023-11-03 18:55:04.818 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:55:04.818 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[054, 027, 993, 925] Heritage rhododendrons blooming late October. These are 8m high, possibly the largest in Australia') 2023-11-03 18:55:19.784 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:55:19.785 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Watch [288, 004, 718, 994] the 5G-DRIVE project video') 2023-11-03 18:56:12.552 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:56:12.552 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[465, 282, 548, 679] and track its trajectory.Stick to the Frame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'This is a personFrame1:[465, 282, 548, 679];Frame2:[483, 280, 566, 684];Frame3:[496, 294, 571, 687];Frame4:[541, 276, 606, 719].') 2023-11-03 18:56:46.305 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:56:46.305 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'baked goods:[388, 576, 893, 992],[000, 000, 110, 253],[000, 864, 165, 999],[000, 206, 218, 518],[002, 450, 127, 654],[050, 382, 538, 954],[135, 000, 388, 224],[322, 000, 693, 458],[592, 037, 987, 580],[641, 000, 910, 074],[861, 747, 999, 999];snack:[000, 000, 385, 527],[003, 868, 162, 999],[051, 390, 871, 999],[304, 000, 993, 584],[860, 734, 999, 999];cake:[047, 379, 539, 954],[312, 258, 600, 402],[315, 000, 689, 301],[345, 337, 600, 460],[420, 471, 878, 999],[600, 390, 932, 585],[615, 275, 969, 508],[615, 034, 987, 421].') 2023-11-03 18:56:53.374 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 18:56:53.375 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Person:[906, 741, 999, 999],[151, 831, 189, 867];Hat:[928, 742, 992, 810];Chair:[670, 769, 773, 831],[830, 768, 905, 817],[783, 763, 854, 806],[760, 757, 809, 799],[613, 761, 699, 815],[697, 752, 762, 792],[651, 747, 715, 781],[562, 755, 647, 805],[557, 739, 623, 772],[538, 733, 579, 765],[515, 749, 589, 798],[471, 741, 541, 783],[434, 736, 496, 776],[398, 733, 454, 766].') 2023-11-03 18:58:38.934 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:58:38.935 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[014, 518, 802, 731] and track its trajectory.Your response should highlight trajectories using the established Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a millipedeFrame1:[014, 518, 802, 731];Frame2:[001, 455, 666, 781];Frame3:[127, 534, 834, 883];Frame4:[244, 494, 931, 861].') 2023-11-03 18:59:08.805 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:59:08.806 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Storage box:[316, 301, 529, 531],[017, 444, 298, 571],[129, 392, 207, 442];Desk:[077, 188, 611, 448];Chair:[694, 002, 1000, 626];shelf:[735, 018, 999, 469];Converter:[054, 582, 176, 669];Moniter:[198, 002, 551, 222];Computer Box:[002, 162, 118, 415];Person:[220, 101, 770, 857].') 2023-11-03 18:59:38.865 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 18:59:38.866 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect woman,man,dress and girl in this image.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'woman:[014, 316, 196, 977],[184, 305, 312, 537],[220, 275, 817, 999],[463, 334, 783, 999];man:[311, 238, 445, 415],[840, 226, 978, 974];dress:[213, 445, 310, 549],[227, 554, 498, 999],[465, 646, 708, 999];girl:[211, 267, 812, 999].') 2023-11-03 19:00:01.962 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:00:01.963 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'boy:[000, 000, 178, 719];man:[000, 000, 166, 361],[092, 727, 730, 964],[134, 000, 276, 277],[255, 114, 488, 717],[410, 001, 608, 483],[449, 466, 690, 729],[599, 000, 686, 447],[699, 468, 881, 971],[850, 466, 999, 820];clothing:[008, 023, 144, 318],[144, 012, 260, 189],[157, 737, 348, 957],[251, 433, 431, 768],[253, 241, 478, 486],[273, 057, 346, 238],[341, 783, 568, 978],[394, 019, 586, 329],[454, 492, 666, 828],[586, 098, 673, 298],[632, 295, 772, 540],[636, 775, 855, 959],[686, 540, 811, 788],[699, 057, 814, 281],[850, 760, 996, 894],[865, 524, 982, 735];human face:[024, 000, 085, 067],[130, 820, 191, 891],[189, 000, 252, 061],[311, 011, 372, 118],[378, 154, 443, 287],[499, 000, 555, 080],[501, 514, 561, 618],[599, 001, 672, 111],[696, 239, 764, 349],[776, 000, 831, 093],[786, 508, 848, 623],[935, 499, 999, 614].') 2023-11-03 19:00:18.219 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:00:18.220 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect girl,woman and clothing in this image.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'girl:[134, 221, 499, 998];woman:[131, 219, 503, 998];clothing:[136, 402, 488, 965],[091, 433, 234, 998],[122, 807, 209, 998],[195, 401, 266, 491],[489, 320, 970, 998],[878, 279, 999, 998].') 2023-11-03 19:00:30.938 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:00:30.939 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Person:[045, 144, 733, 946];Sneakers:[558, 823, 731, 948].') 2023-11-03 19:00:43.029 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:00:43.030 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'billboard:[116, 412, 380, 548];tree:[743, 443, 999, 914],[853, 132, 999, 762];building:[000, 048, 900, 931],[851, 477, 999, 896].') 2023-11-03 19:00:55.583 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:00:55.584 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'man:[236, 214, 278, 394],[475, 287, 535, 483],[757, 275, 796, 370],[892, 235, 957, 483];girl:[015, 365, 337, 809],[070, 355, 117, 448],[113, 353, 341, 743],[248, 330, 283, 415],[429, 334, 482, 522],[550, 332, 603, 486],[600, 339, 670, 460],[657, 339, 728, 519],[714, 330, 796, 522],[774, 323, 846, 514],[832, 318, 899, 518],[877, 320, 943, 501];footwear:[126, 905, 229, 968],[155, 827, 223, 873],[268, 716, 331, 753].') 2023-11-03 19:01:06.901 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:01:06.902 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'motorcycle:[832, 231, 977, 389],[000, 220, 022, 387],[013, 205, 125, 367],[107, 183, 205, 366],[135, 015, 870, 900],[140, 196, 603, 568],[255, 174, 323, 306],[303, 205, 430, 340],[516, 189, 650, 322],[653, 194, 798, 380];wheel:[150, 624, 319, 873],[023, 647, 146, 701],[088, 287, 125, 371],[394, 273, 428, 337],[650, 287, 703, 354],[655, 597, 841, 829],[749, 302, 798, 379],[825, 311, 891, 386],[969, 333, 999, 408];tire:[668, 588, 830, 819],[027, 657, 140, 699],[086, 292, 123, 368],[136, 662, 313, 875],[648, 295, 702, 358],[753, 305, 795, 376],[826, 318, 886, 387],[973, 334, 999, 410];person:[021, 114, 103, 411].') 2023-11-03 19:01:20.535 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:01:20.535 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[293, 275, 660, 716] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a humveeFrame1:[293, 275, 660, 716];Frame2:[329, 277, 684, 712];Frame3:[381, 295, 728, 722];Frame4:[443, 250, 784, 687];Frame5:[453, 220, 792, 641].') 2023-11-03 19:01:33.868 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 19:01:33.868 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect footwear,woman,dress and human face in this image.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'footwear:[364, 888, 434, 920],[428, 851, 528, 913];woman:[280, 066, 735, 997];dress:[287, 235, 561, 997];human face:[421, 104, 535, 193].') 2023-11-03 19:01:56.629 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:01:56.629 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Jamie Vardy [235, 041, 793, 998] could be a dangerous fantasy option in Gameweek 13') 2023-11-03 19:01:58.519 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:01:58.519 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Lamp:[199, 037, 257, 173],[819, 002, 923, 213];Chair:[003, 639, 131, 1000];Dinning Table:[407, 858, 1000, 1000];Flower:[749, 574, 822, 635];Tie:[409, 263, 474, 426],[649, 549, 731, 880];Plate:[609, 895, 885, 974],[866, 859, 1000, 927];Cup:[693, 913, 810, 1000],[808, 902, 907, 993];Person:[085, 365, 959, 935],[049, 397, 654, 1000],[525, 123, 881, 535],[003, 108, 870, 526],[003, 173, 384, 653];Bracelet:[312, 891, 370, 934],[529, 828, 593, 862];Necklace:[156, 314, 275, 365].') 2023-11-03 19:02:11.846 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:02:11.846 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'man:[524, 410, 606, 890],[584, 400, 672, 902],[727, 492, 775, 656];tree:[000, 000, 142, 430],[000, 366, 999, 572],[107, 000, 855, 495];clothing:[365, 520, 457, 861],[457, 481, 552, 880],[518, 468, 618, 848],[590, 442, 675, 882],[721, 517, 765, 662],[899, 538, 934, 643].') 2023-11-03 19:02:28.036 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:02:28.036 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The State Government has moved to shut down [002, 300, 995, 957] a troubled homeless camp in Fremantle amid criticism from [286, 275, 362, 471] advocates.') 2023-11-03 19:02:42.183 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:02:42.183 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'human body:[505, 103, 889, 974],[000, 499, 143, 999],[234, 397, 588, 999],[688, 883, 814, 999],[935, 607, 999, 825];woman:[230, 114, 551, 1000],[000, 105, 140, 999],[693, 638, 890, 999],[799, 725, 999, 999],[927, 458, 999, 827];human hair:[316, 174, 458, 377],[000, 108, 073, 448],[610, 060, 710, 160],[718, 646, 865, 887],[803, 729, 999, 999],[940, 470, 999, 700];human head:[731, 661, 890, 910],[000, 112, 067, 435],[320, 167, 463, 414],[601, 063, 716, 289],[801, 741, 999, 999],[964, 460, 999, 599];mammal:[487, 084, 839, 1000],[000, 110, 145, 999],[238, 163, 584, 999],[692, 641, 876, 999],[797, 731, 999, 999],[930, 453, 999, 814];clothing:[243, 327, 528, 986],[000, 491, 076, 999],[491, 221, 844, 999],[690, 885, 814, 999],[932, 601, 999, 819];human arm:[015, 658, 128, 838],[036, 601, 148, 697],[238, 467, 307, 999],[376, 441, 589, 750],[489, 246, 691, 506],[748, 278, 831, 644];human ear:[600, 153, 620, 221],[701, 153, 713, 225];man:[491, 062, 846, 999];human face:[014, 237, 069, 397],[318, 228, 413, 419],[617, 134, 701, 278],[791, 713, 871, 877].') 2023-11-03 19:02:48.365 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:02:48.365 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'People [005, 031, 996, 996] [732, 466, 997, 995] line up outside Nonghyup Hanaro Mart in Seoul on Thursday to buy face masks [532, 296, 579, 358] [034, 222, 080, 281] [673, 549, 746, 637] [201, 207, 249, 263] [378, 373, 449, 474] [816, 571, 907, 661] [152, 239, 202, 294] [282, 235, 331, 291] [001, 704, 076, 872] [012, 419, 091, 507] [462, 311, 531, 394] [053, 867, 188, 994] [321, 775, 432, 915] to protect themselves from the coronavirus.') 2023-11-03 19:03:35.896 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:03:35.897 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The exterior [007, 002, 989, 990] of the Bedford Presbyterian Church under a bright blue sky and fluffy white clouds. A nearby tree [522, 032, 991, 872] with fiery red fall foliage provides a striking color contrast. Bedford, New Hampshire.') 2023-11-03 19:04:19.506 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:04:19.507 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect boy and girl in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'boy:[201, 090, 840, 999];girl:[195, 201, 839, 999].') 2023-11-03 19:04:36.329 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:04:36.329 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'chair:[000, 503, 998, 998].') 2023-11-03 19:05:22.822 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:05:22.823 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'human body:[357, 346, 609, 937],[912, 542, 998, 799];woman:[335, 339, 614, 936];human hair:[405, 367, 521, 460];human leg:[465, 587, 578, 947];building:[269, 100, 675, 623],[665, 199, 795, 594],[828, 110, 998, 594].') 2023-11-03 19:05:28.308 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:05:28.308 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Fans [002, 329, 998, 998] look on as Leicester City players [208, 685, 327, 926] ride in an open top bus through the city centre during a victory parade after winning the Championship League in 2014 (Photo by Matthew Lewis/Getty Images') 2023-11-03 19:05:48.088 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:05:48.089 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Distance education. a schoolgirl [330, 271, 778, 919] in pink headphones [512, 326, 735, 575] studying homework during their online lesson at home via the internet. social distance during quarantine') 2023-11-03 19:06:11.733 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:06:11.733 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[874, 423, 999, 836],carFrame1:[714, 428, 836, 600],carFrame1:[522, 472, 563, 547],truckFrame1:[657, 423, 725, 545],carFrame1:[759, 445, 951, 667],busFrame1:[425, 406, 514, 488],carFrame1:[414, 478, 460, 552] in this video clip.Adhere strictly to the format categoryFrame t:[xmin,ymin,xmax,ymax] when providing your answer.', 'carFrame1[874, 423, 999, 836];Frame2:No object,carFrame1:[714, 428, 836, 600];Frame2:[721, 402, 887, 622],carFrame1:[522, 472, 563, 547];Frame2:[491, 461, 541, 545],truckFrame1:[657, 423, 725, 545];Frame2:[648, 406, 729, 542],carFrame1:[759, 445, 951, 667];Frame2:[796, 414, 1000, 758],busFrame1:[425, 406, 514, 488];Frame2:[388, 389, 489, 487],carFrame1:[414, 478, 460, 552];Frame2:[364, 469, 405, 557].') 2023-11-03 19:06:28.406 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:06:28.407 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "I-Ride: It's the future of electric skateboards [044, 435, 954, 985], today") 2023-11-03 19:06:38.188 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:06:38.188 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Observe the image and give me some insight into the question 'QUESTION: Why is [070, 009, 689, 979] taking off her sunglasses ? \nANSWER: [070, 009, 689, 979] wants the driver to see her eyes . You should explain the reason for the above answer.'. Don't forget to incorporate your reasoning process and mention the bounding boxes of related objects within square brackets.", '[070, 009, 689, 979] is making a comment to the driver .') 2023-11-03 19:06:56.740 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:06:56.741 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'man:[352, 188, 835, 988];hat:[454, 216, 591, 366];flower:[000, 800, 597, 997],[825, 819, 997, 997];human face:[468, 355, 562, 494].') 2023-11-03 19:07:23.825 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:07:23.826 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'group of [676, 375, 835, 998] [773, 282, 996, 995] [215, 377, 487, 998] four people near [051, 001, 753, 551] a small wind turbine, set in the woods at the Centre for Alternative Technology in Wales') 2023-11-03 19:07:24.311 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:07:24.312 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[197, 272, 646, 845] and track its trajectory.All trajectories in your reply should conform to the Frame t:[xmin,ymin,xmax,ymax] pattern.', 'This is a polo ponyFrame1:[197, 272, 646, 845];Frame2:[469, 345, 813, 884];Frame3:[509, 327, 826, 891];Frame4:[444, 318, 789, 902];Frame5:[391, 326, 721, 919].') 2023-11-03 19:07:39.101 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:07:39.101 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'person:[029, 433, 090, 557],[000, 442, 019, 547],[074, 413, 126, 501],[114, 472, 188, 675],[116, 423, 169, 492],[175, 421, 238, 505],[228, 486, 326, 655],[237, 393, 279, 470],[274, 426, 321, 536],[303, 529, 405, 699],[308, 395, 343, 465],[343, 392, 410, 553],[456, 457, 518, 582],[463, 377, 516, 519],[476, 547, 580, 790],[521, 426, 590, 513],[524, 390, 583, 496],[593, 393, 642, 494],[593, 478, 712, 778],[623, 590, 738, 959],[624, 426, 675, 532],[693, 371, 743, 461],[696, 421, 753, 540],[721, 474, 851, 651],[780, 371, 835, 461],[817, 428, 883, 542],[857, 388, 905, 483];table:[007, 532, 196, 730],[095, 630, 717, 999],[100, 497, 317, 537],[160, 454, 408, 551],[373, 378, 483, 506],[393, 548, 868, 907],[455, 506, 880, 751],[666, 438, 836, 518];chair:[150, 543, 195, 646],[396, 635, 426, 713],[678, 716, 810, 999],[849, 529, 896, 732],[880, 459, 911, 540];desk:[091, 693, 368, 888];building:[000, 000, 999, 999].') 2023-11-03 19:07:59.755 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:07:59.756 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Backpack:[168, 381, 237, 465];Gloves:[384, 381, 476, 488];Person:[528, 369, 590, 562],[323, 340, 482, 771],[058, 342, 237, 745];Bicycle:[333, 561, 472, 907],[061, 545, 257, 845],[530, 450, 582, 592].') 2023-11-03 19:08:05.586 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:08:05.586 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Pillow:[019, 667, 118, 791],[016, 578, 090, 659],[749, 571, 804, 660],[792, 578, 838, 640],[631, 560, 702, 634];Chair:[015, 633, 292, 999],[016, 554, 162, 736],[316, 512, 459, 697],[638, 665, 999, 999];Lamp:[365, 381, 441, 517],[535, 487, 588, 586];Frame:[436, 221, 477, 313],[438, 312, 481, 388],[437, 388, 481, 464],[013, 244, 097, 502];Cabinet:[517, 577, 603, 706],[325, 500, 457, 646];Potted Plant:[410, 041, 645, 659],[183, 286, 301, 642];Bench:[577, 537, 881, 806];Desk:[583, 917, 820, 1000],[043, 552, 183, 690];Carpet:[253, 768, 604, 981];Stool:[150, 596, 223, 693],[780, 665, 875, 813];Person:[862, 429, 944, 788];Cup:[598, 880, 640, 946];TV:[043, 440, 167, 566].') 2023-11-03 19:08:14.777 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 19:08:14.777 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Satchel:[731, 437, 886, 713];Backpack:[029, 277, 135, 589];Person:[002, 156, 152, 890],[115, 159, 310, 999],[267, 206, 391, 744],[354, 181, 461, 679],[407, 145, 629, 828],[497, 122, 630, 783],[603, 719, 732, 828],[593, 004, 866, 1000],[799, 221, 894, 982].') 2023-11-03 19:08:29.711 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:08:29.711 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Anti-government protesters [082, 664, 195, 957] clash with riot police [470, 710, 552, 987] in Madrid, on the outskirts of Bogota, Colombia, on May 28, 2021.') 2023-11-03 19:08:46.367 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:08:46.367 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect billiard table,table,tree,building and person in this image.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'table:[030, 792, 095, 843],[138, 812, 239, 937],[330, 801, 416, 920],[551, 785, 603, 865],[608, 788, 676, 854],[903, 783, 987, 865];tree:[237, 433, 316, 633],[306, 394, 419, 631],[401, 471, 518, 575],[470, 378, 614, 575],[605, 367, 781, 563],[623, 228, 906, 694],[820, 202, 999, 702];building:[291, 000, 999, 771];person:[000, 716, 042, 783],[006, 734, 071, 853],[090, 692, 178, 981],[181, 732, 233, 822],[216, 730, 294, 910],[234, 696, 268, 762],[279, 697, 326, 786],[314, 736, 357, 818],[365, 742, 431, 903],[405, 732, 475, 918],[878, 753, 924, 864],[918, 742, 965, 863].') 2023-11-03 19:09:37.358 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:09:37.358 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[426, 498, 457, 654] and track its trajectory.For the trajectories included in the answer, please use the format Frame t:[xmin,ymin,xmax,ymax].', 'This is a rhodesian ridgebackFrame1:[426, 498, 457, 654];Frame2:[412, 503, 449, 649];Frame3:[397, 500, 438, 650];Frame4:[407, 503, 450, 636];Frame5:[431, 499, 455, 631].') 2023-11-03 19:09:48.355 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:09:48.356 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[156, 336, 677, 741] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a mountain beaverFrame1:[156, 336, 677, 741];Frame2:[116, 361, 664, 744];Frame3:[101, 375, 535, 762];Frame4:[108, 387, 534, 770].') 2023-11-03 19:10:49.525 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:10:49.526 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect computer monitor,table,clothing,desk and person in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'computer monitor:[517, 373, 672, 616],[643, 377, 772, 638];table:[453, 586, 997, 998];clothing:[187, 352, 553, 871];desk:[000, 307, 146, 998],[387, 574, 999, 998];person:[217, 297, 711, 973].') 2023-11-03 19:10:54.413 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:10:54.413 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'man:[898, 369, 991, 678],[003, 269, 999, 720];clothing:[615, 448, 799, 721],[000, 350, 999, 679];woman:[019, 505, 175, 702],[208, 479, 382, 708],[717, 455, 778, 544],[888, 472, 931, 557].') 2023-11-03 19:10:58.683 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:10:58.683 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the aerial display of four fighter jets in motion.?The TrackiFrame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'There are 4 aerial display of four fighter jets in motion..Track1frame:1:[209, 369, 640, 617];frame:2:[215, 388, 646, 631];frame:3:[165, 333, 599, 547],Track2frame:1:[256, 236, 602, 457];frame:2:[264, 273, 592, 480];frame:3:[229, 319, 574, 642],Track3frame:1:[649, 598, 835, 704];frame:2:[652, 633, 839, 738];frame:3:[596, 663, 782, 766],Track4frame:1:[242, 241, 453, 349];frame:2:[249, 276, 461, 382];frame:3:[209, 320, 421, 428].') 2023-11-03 19:11:10.562 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:11:10.563 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect table,office building and person in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'table:[095, 420, 886, 989];office building:[086, 000, 961, 990];person:[360, 211, 452, 420],[663, 256, 841, 478],[840, 244, 999, 484].') 2023-11-03 19:11:27.313 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 19:11:27.313 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[076, 261, 792, 993];clothing:[194, 505, 750, 999];human face:[545, 345, 805, 608].') 2023-11-03 19:11:33.484 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:11:33.484 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[532, 324, 616, 909] and track its trajectory.Your response should highlight trajectories using the established Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a personFrame1:[532, 324, 616, 909];Frame2:[541, 337, 625, 812];Frame3:[569, 343, 653, 766];Frame4:[555, 351, 638, 780];Frame5:[502, 332, 640, 838].') 2023-11-03 19:11:36.064 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:11:36.065 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'These Funfetti Chocolate Chip Cookies [003, 001, 996, 999] are perfectly soft and chewy. Loaded with two kinds of chocolate chips and rainbow sprinkles — these are the perfect cookies for a celebration!') 2023-11-03 19:12:12.301 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:12:12.301 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect tire,building,van and wheel in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'tire:[071, 787, 176, 936],[568, 712, 636, 830],[817, 666, 872, 764],[942, 645, 983, 729];building:[000, 086, 998, 761];van:[455, 505, 904, 836];wheel:[051, 785, 165, 947],[569, 712, 636, 833],[816, 668, 871, 769],[944, 651, 984, 723].') 2023-11-03 19:12:38.572 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 19:12:38.573 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect Plate,Glasses,Cup and Chair in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Plate:[537, 627, 674, 709],[355, 665, 432, 723];Glasses:[440, 636, 527, 702];Cup:[426, 496, 491, 646];Chair:[844, 771, 1000, 1000],[498, 439, 691, 566],[199, 510, 236, 663].') 2023-11-03 19:12:58.664 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:12:58.664 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Person:[140, 049, 375, 475];Plate:[083, 560, 434, 714],[127, 530, 287, 576];Cup:[166, 434, 278, 561],[389, 307, 421, 387],[018, 440, 075, 508],[516, 278, 552, 374];Spoon:[379, 513, 479, 579];Basin:[085, 323, 131, 357];Tea pot:[043, 077, 093, 146],[002, 080, 044, 148],[025, 266, 078, 341],[001, 266, 024, 338];Bottle:[702, 169, 732, 285],[737, 168, 767, 285],[678, 320, 736, 534],[777, 193, 801, 282];Coffee Machine:[001, 148, 159, 364];Laptop:[421, 272, 999, 865].') 2023-11-03 19:13:01.426 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:13:01.427 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect man,tire,clothing and land vehicle in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'man:[000, 476, 063, 987],[216, 106, 281, 414],[326, 461, 484, 997],[484, 461, 616, 997],[600, 514, 658, 861],[649, 512, 715, 729],[689, 508, 753, 746],[735, 542, 775, 695],[753, 504, 841, 627],[792, 634, 843, 751],[838, 497, 989, 997],[849, 506, 895, 612],[878, 348, 889, 419],[946, 614, 998, 997];tire:[746, 640, 760, 742];clothing:[000, 400, 015, 506],[000, 527, 061, 997],[215, 165, 267, 419],[330, 563, 447, 997],[440, 551, 529, 608],[515, 563, 630, 997],[649, 578, 700, 740],[695, 576, 750, 755],[752, 551, 816, 631],[778, 655, 846, 751],[821, 548, 856, 629],[829, 702, 930, 997],[843, 540, 981, 778],[849, 555, 889, 621],[898, 368, 909, 438],[938, 578, 980, 736],[950, 621, 1000, 1000],[958, 578, 996, 712],[975, 372, 993, 436];land vehicle:[192, 089, 421, 448],[035, 380, 433, 776].') 2023-11-03 19:13:11.386 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:13:11.386 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[167, 363, 430, 729] Delicious tapas in Granada - a must have for [003, 021, 995, 991] every itinerary') 2023-11-03 19:13:46.377 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:13:46.377 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[001, 000, 844, 564],[712, 062, 999, 574],[401, 044, 645, 332];Bracelet:[001, 338, 045, 354],[684, 448, 773, 490];Necklace:[238, 208, 392, 356];Tent:[553, 089, 779, 188];Awning:[455, 000, 796, 093];Dinning Table:[001, 494, 1000, 1000];Napkin:[653, 645, 1000, 906],[665, 526, 958, 621];Desk:[631, 268, 923, 447];Chair:[038, 411, 626, 507];Bowl:[768, 679, 999, 835],[001, 674, 560, 1000],[441, 550, 641, 649];Plate:[594, 648, 999, 966],[422, 760, 694, 1000];Pot:[018, 489, 449, 666];Knife:[590, 496, 706, 619];Fork:[624, 498, 756, 583];Cup:[709, 222, 761, 306],[596, 199, 618, 242];Wine Glass:[637, 207, 673, 256];Pie:[074, 524, 374, 632];Glasses:[176, 066, 423, 134].') 2023-11-03 19:13:52.313 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:13:52.313 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Cabinet:[001, 356, 089, 635],[088, 404, 217, 617],[370, 567, 464, 667];Desk:[272, 593, 377, 667],[662, 808, 937, 997],[001, 940, 219, 1000];Chair:[059, 723, 188, 977];Person:[835, 714, 999, 999],[463, 619, 680, 998],[253, 759, 465, 998],[436, 601, 563, 999],[080, 579, 265, 989],[195, 505, 297, 713],[503, 368, 648, 563];Trash bin Can:[807, 669, 903, 741];TV:[469, 334, 699, 644],[827, 546, 850, 655];Printer:[756, 574, 827, 638];Speaker:[641, 283, 669, 336],[496, 292, 520, 335];Bakset:[677, 710, 813, 821],[015, 641, 095, 720].') 2023-11-03 19:13:53.127 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:13:53.128 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("With careful inspection of the image , please address the question 'QUESTION: Where are [545, 231, 614, 491] and [014, 319, 127, 618] and [770, 212, 827, 442] and [547, 345, 681, 961] and [877, 242, 999, 808] and [257, 239, 340, 419] and [609, 198, 676, 378] and [660, 433, 895, 974] and [005, 396, 287, 991] and [811, 376, 960, 855] and [229, 327, 414, 956] and [911, 235, 952, 363] and [645, 261, 755, 646] and [350, 229, 430, 372] and [362, 275, 461, 661] and [174, 270, 279, 585] and [332, 441, 599, 962] and [443, 207, 490, 386] and [456, 251, 565, 541] , and [727, 256, 804, 440] ?' by explaining your reasoning process and using [xmin,ymin,xmax,ymax] to denote the spatial locations of relevant objects.", '[545, 231, 614, 491] and [014, 319, 127, 618] and [770, 212, 827, 442] and [547, 345, 681, 961] and [877, 242, 999, 808] and [257, 239, 340, 419] and [609, 198, 676, 378] and [660, 433, 895, 974] and [005, 396, 287, 991] and [811, 376, 960, 855] and [229, 327, 414, 956] and [911, 235, 952, 363] and [645, 261, 755, 646] and [350, 229, 430, 372] and [362, 275, 461, 661] and [174, 270, 279, 585] and [332, 441, 599, 962] and [443, 207, 490, 386] and [456, 251, 565, 541] , and [727, 256, 804, 440] are of school age . [545, 231, 614, 491] and [014, 319, 127, 618] and [770, 212, 827, 442] and [547, 345, 681, 961] and [877, 242, 999, 808] and [257, 239, 340, 419] and [609, 198, 676, 378] and [660, 433, 895, 974] and [005, 396, 287, 991] and [811, 376, 960, 855] and [229, 327, 414, 956] and [911, 235, 952, 363] and [645, 261, 755, 646] and [350, 229, 430, 372] and [362, 275, 461, 661] and [174, 270, 279, 585] and [332, 441, 599, 962] and [443, 207, 490, 386] and [456, 251, 565, 541] , and [727, 256, 804, 440] are sitting on school desks . many of the students have textbooks on their desks . [545, 231, 614, 491] and [014, 319, 127, 618] and [770, 212, 827, 442] and [547, 345, 681, 961] and [877, 242, 999, 808] and [257, 239, 340, 419] and [609, 198, 676, 378] and [660, 433, 895, 974] and [005, 396, 287, 991] and [811, 376, 960, 855] and [229, 327, 414, 956] and [911, 235, 952, 363] and [645, 261, 755, 646] and [350, 229, 430, 372] and [362, 275, 461, 661] and [174, 270, 279, 585] and [332, 441, 599, 962] and [443, 207, 490, 386] and [456, 251, 565, 541] , and [727, 256, 804, 440] are in a classroom .') 2023-11-03 19:13:54.126 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:13:54.126 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'effie [205, 058, 795, 852] the elephant by lily & george') 2023-11-03 19:13:56.352 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:13:56.352 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Person:[305, 664, 344, 827];Umbrella:[279, 606, 358, 672],[733, 433, 747, 478].') 2023-11-03 19:14:01.502 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:14:01.503 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'girl:[000, 000, 862, 998];clothing:[000, 534, 736, 998];human face:[124, 044, 627, 691].') 2023-11-03 19:14:07.703 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:14:07.703 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Other Shoes,Person and Handbag/Satchel in this image.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Other Shoes:[001, 829, 035, 872],[001, 959, 024, 992],[875, 960, 919, 976];Person:[001, 775, 035, 990],[062, 761, 120, 852],[315, 755, 376, 856],[968, 807, 1000, 895],[630, 789, 653, 906],[861, 728, 978, 977],[816, 733, 923, 952],[405, 756, 457, 900],[667, 747, 760, 892];Handbag:[079, 781, 102, 814],[865, 832, 919, 860],[749, 818, 786, 862].') 2023-11-03 19:14:44.062 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:14:44.062 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect tree,window and house in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'tree:[000, 255, 084, 358],[000, 000, 528, 780];window:[501, 402, 534, 545],[600, 334, 746, 460];house:[001, 329, 141, 685],[311, 315, 517, 689],[415, 198, 981, 729],[700, 524, 992, 910],[934, 301, 999, 564].') 2023-11-03 19:15:05.171 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:15:05.171 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the man dressed in black pants and a white shirt.For clarity, represent trajectories using the TrackiFrame t:[xmin,ymin,xmax,ymax] format in your response.', 'There is one man dressed in black pants and a white shirt.Track1frame1:[661, 137, 1000, 897];frame2:[663, 139, 1000, 900];frame3:[657, 144, 1000, 914].') 2023-11-03 19:15:17.226 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:15:17.226 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'tree:[558, 027, 968, 923],[000, 000, 575, 210],[169, 455, 199, 526],[172, 373, 235, 515],[229, 361, 293, 518],[326, 354, 413, 540],[347, 580, 488, 794],[370, 419, 451, 546].') 2023-11-03 19:15:39.379 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:15:39.380 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'wheel:[112, 642, 196, 764],[022, 635, 064, 739],[023, 642, 055, 777],[174, 626, 235, 743],[227, 600, 287, 701],[250, 597, 305, 687],[346, 586, 411, 680],[384, 582, 450, 671],[427, 635, 528, 788],[443, 528, 609, 631],[469, 617, 569, 755],[572, 635, 656, 757],[601, 665, 708, 757],[606, 566, 992, 669],[718, 653, 813, 800],[794, 669, 895, 822],[876, 662, 988, 795];bicycle:[023, 498, 985, 831],[433, 577, 656, 779];bicycle wheel:[015, 510, 990, 824];house:[180, 200, 873, 582];plant:[018, 548, 321, 638],[025, 793, 047, 887],[025, 912, 371, 975],[045, 752, 246, 928],[291, 764, 424, 948],[552, 714, 768, 950],[825, 725, 985, 959],[884, 494, 992, 528].') 2023-11-03 19:15:43.622 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:15:43.622 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Chair:[665, 431, 869, 994];Lamp:[332, 275, 382, 331];Desk:[113, 485, 512, 952],[001, 597, 358, 996];Person:[333, 200, 846, 1000];Book:[119, 606, 219, 671];Barrel:[105, 532, 163, 613];Glasses:[076, 762, 171, 824];Trash bin Can:[852, 841, 1000, 1000];Cup:[217, 444, 257, 527];TV:[353, 285, 509, 558];Keyboard:[392, 554, 488, 629];Laptop:[204, 521, 385, 641];Tissue:[155, 432, 228, 541];Toiletry:[062, 474, 097, 593].') 2023-11-03 19:17:08.346 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:17:08.347 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Man gets on one knee to propose to his girlfriend [606, 672, 658, 755] on Wyoming mountain [022, 741, 995, 997]') 2023-11-03 19:17:33.034 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:17:33.034 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Person:[163, 002, 960, 999],[400, 129, 534, 487],[904, 157, 1000, 813];Gloves:[282, 217, 451, 892],[838, 825, 962, 1000];Boots:[158, 791, 235, 999].') 2023-11-03 19:17:50.187 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:17:50.187 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Bottle,Person and Bicycle in this image.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Bottle:[505, 613, 547, 716],[543, 616, 573, 717];Person:[908, 422, 999, 779];Bicycle:[909, 537, 1000, 807],[341, 470, 698, 908],[396, 487, 670, 835],[000, 639, 121, 1000].') 2023-11-03 19:18:09.352 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:18:09.352 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Close up of [341, 396, 577, 898] a green juice and a mango smoothie in [603, 396, 927, 996] glass jars on [002, 432, 997, 995] a wooden table') 2023-11-03 19:18:23.065 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:18:23.066 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'San Francisco Family Photographer captures the playful father [409, 174, 654, 450] looking in on his children [228, 369, 540, 987] and their cardboard box home [002, 007, 995, 997]') 2023-11-03 19:18:33.264 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:18:33.265 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect human face,train,footwear and girl in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'human face:[440, 329, 461, 393],[566, 333, 610, 398],[679, 352, 724, 423],[683, 225, 720, 298];train:[000, 000, 999, 999];footwear:[437, 859, 473, 926],[525, 886, 596, 999],[580, 859, 627, 999];girl:[400, 318, 500, 928],[453, 277, 526, 442],[524, 310, 562, 419],[566, 233, 676, 444].') 2023-11-03 19:18:59.475 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:18:59.475 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[483, 306, 615, 643];human hair:[495, 313, 582, 452],[007, 444, 138, 604],[406, 353, 471, 423],[940, 210, 999, 315];human head:[005, 451, 150, 651],[067, 010, 151, 145],[134, 739, 425, 999],[291, 000, 350, 098],[395, 364, 461, 472],[505, 312, 576, 424],[696, 291, 789, 425],[940, 236, 999, 417];mammal:[318, 337, 484, 685],[000, 102, 094, 477],[000, 453, 247, 999],[053, 019, 279, 620],[140, 737, 416, 999],[254, 000, 391, 561],[495, 314, 640, 670],[566, 072, 607, 235],[621, 286, 835, 734],[838, 086, 861, 183],[840, 524, 999, 999],[906, 063, 936, 140],[933, 227, 999, 560];man:[000, 462, 237, 955],[000, 085, 100, 499],[061, 013, 270, 633],[252, 000, 393, 554],[331, 360, 515, 716],[569, 072, 611, 230],[616, 285, 832, 747],[941, 225, 999, 454];footwear:[711, 637, 803, 744],[577, 561, 623, 669];human arm:[498, 469, 592, 597],[026, 658, 245, 889],[065, 143, 113, 396],[089, 595, 249, 792],[196, 145, 241, 396],[260, 114, 288, 355],[331, 427, 395, 703],[361, 126, 400, 293],[438, 400, 515, 514],[542, 372, 647, 496],[655, 395, 730, 629],[693, 351, 823, 577];hat:[176, 740, 428, 999],[696, 282, 787, 410];human hand:[051, 277, 075, 340],[693, 505, 758, 579].') 2023-11-03 19:19:15.224 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 19:19:15.224 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'T-Shirt - Vintage Cassette audio') 2023-11-03 19:19:16.156 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:19:16.157 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Desk:[190, 194, 521, 503];Person:[474, 097, 723, 628],[687, 271, 999, 999],[100, 337, 440, 885];Book:[487, 305, 691, 494];Sneakers:[317, 796, 440, 863];Sandals:[841, 917, 913, 1000];Bakset:[501, 069, 601, 203].') 2023-11-03 19:19:27.027 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:19:27.028 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "[526, 014, 998, 973] A preview screen from student Yasmin's vlog, 'Staying connected to your friends and family while social distancing'") 2023-11-03 19:19:37.218 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:19:37.220 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[612, 512, 692, 627] and track its trajectory.Incorporate the Frame t:[xmin,ymin,xmax,ymax] format when presenting trajectories in your response.', 'This is a scotch terrierFrame1:[612, 512, 692, 627];Frame2:[451, 466, 538, 597];Frame3:[384, 485, 477, 579];Frame4:[427, 287, 507, 475].') 2023-11-03 19:19:41.979 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:19:41.980 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect human nose,human head,man and human face in this image.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'human nose:[252, 408, 280, 491],[642, 242, 679, 335];human head:[067, 166, 318, 576],[318, 333, 380, 490],[463, 271, 508, 387],[539, 296, 594, 426],[644, 025, 832, 448];man:[621, 074, 961, 1000],[000, 153, 326, 998],[276, 342, 377, 998],[420, 273, 538, 998],[534, 284, 643, 998];human face:[160, 366, 278, 590],[340, 376, 375, 487],[463, 303, 501, 381],[542, 326, 581, 412],[642, 196, 763, 460].') 2023-11-03 19:19:54.881 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:19:54.881 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[521, 643, 673, 994],[181, 626, 245, 826],[273, 620, 333, 767],[321, 598, 394, 740],[435, 631, 503, 791],[810, 646, 911, 893];man:[700, 738, 879, 1000],[520, 616, 597, 810];drink:[652, 865, 678, 977],[686, 818, 711, 921];tree:[000, 000, 998, 998];clothing:[187, 665, 237, 810],[281, 682, 333, 805],[326, 658, 371, 805],[433, 686, 486, 775],[509, 665, 566, 750],[513, 766, 638, 998],[640, 738, 660, 851],[736, 786, 879, 998],[833, 686, 913, 907];building:[373, 337, 497, 654].') 2023-11-03 19:20:01.312 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:20:01.312 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[468, 400, 484, 466] and track its trajectory.When detailing trajectories in your response, adhere to the Frame t:[xmin,ymin,xmax,ymax] format.', 'This is a italian greyhoundFrame1:[468, 400, 484, 466];Frame2:[483, 439, 502, 491];Frame3:[495, 455, 509, 497];Frame4:[493, 451, 506, 503];Frame5:[413, 454, 429, 503].') 2023-11-03 19:20:16.157 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:20:16.158 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[229, 073, 524, 994] Nick Mannisto shows off his cheese pizza he created in his mobile food truck. (Courtesy Photos') 2023-11-03 19:21:39.188 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:21:39.188 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect SUV in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'SUV:[191, 476, 229, 520].') 2023-11-03 19:22:06.093 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:22:06.094 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'human body:[578, 043, 999, 995],[208, 675, 846, 999],[923, 451, 971, 545],[943, 360, 999, 601];human hair:[216, 300, 575, 838],[651, 045, 919, 335],[950, 305, 999, 379];human head:[653, 095, 918, 495],[222, 303, 576, 941];human hand:[515, 667, 716, 988],[641, 485, 780, 779],[931, 535, 999, 716];human eye:[248, 642, 322, 701],[386, 618, 473, 674],[693, 320, 764, 362],[806, 317, 880, 355];boy:[598, 040, 999, 991];human mouth:[328, 817, 433, 871],[745, 451, 813, 495];woman:[929, 430, 972, 540],[940, 299, 999, 588];girl:[101, 262, 848, 999];human face:[230, 444, 540, 941],[662, 166, 900, 554];human arm:[516, 680, 853, 999],[644, 483, 790, 858],[931, 529, 999, 923],[944, 353, 995, 495];human nose:[325, 687, 405, 792],[762, 356, 815, 437].') 2023-11-03 19:22:24.600 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:22:24.600 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[334, 185, 568, 676] and track its trajectory.For the trajectories included in the answer, please use the format Frame t:[xmin,ymin,xmax,ymax].', 'This is a eyeFrame1:[334, 185, 568, 676];Frame2:[482, 177, 715, 675];Frame3:[518, 185, 716, 667];Frame4:[308, 146, 508, 628];Frame5:[403, 137, 595, 629].') 2023-11-03 19:22:42.498 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:22:42.498 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Dinning Table:[079, 588, 972, 1000];Storage box:[137, 549, 334, 685];Bread:[629, 788, 716, 840],[611, 733, 687, 794];Laptop:[538, 343, 759, 644];Plate:[080, 707, 318, 825],[224, 740, 470, 869],[441, 799, 640, 894],[595, 735, 838, 862],[741, 720, 918, 795],[764, 636, 941, 723];Fork:[710, 723, 868, 818].') 2023-11-03 19:22:46.559 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:22:46.560 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'human body:[000, 031, 300, 766],[266, 476, 868, 999],[298, 352, 461, 635],[696, 118, 843, 465],[735, 309, 999, 995];woman:[000, 030, 321, 725],[270, 133, 865, 999],[310, 092, 538, 628];human hair:[000, 605, 123, 1000],[018, 016, 196, 363],[065, 000, 234, 156],[366, 099, 511, 407],[448, 135, 758, 611],[821, 061, 999, 404];human head:[429, 122, 703, 506],[000, 027, 188, 308],[118, 000, 251, 177],[226, 010, 276, 167],[359, 121, 505, 389],[448, 000, 592, 127],[651, 000, 735, 057],[724, 000, 812, 063],[795, 000, 918, 144],[824, 072, 999, 386];mammal:[002, 652, 130, 993],[000, 062, 036, 354],[000, 000, 046, 075],[000, 008, 320, 724],[110, 000, 361, 508],[205, 003, 323, 251],[258, 134, 881, 999],[273, 000, 400, 131],[325, 000, 452, 349],[343, 000, 638, 389],[568, 000, 663, 100],[602, 000, 736, 172],[646, 000, 806, 248],[685, 000, 996, 482],[743, 058, 999, 999],[874, 527, 999, 999];man:[738, 083, 998, 996],[341, 000, 643, 330],[921, 501, 999, 994];clothing:[331, 459, 863, 975],[000, 297, 294, 716],[015, 181, 357, 444],[250, 140, 327, 247],[267, 009, 410, 164],[306, 361, 471, 637],[321, 028, 450, 166],[566, 000, 663, 087],[606, 037, 728, 151],[638, 046, 804, 230],[686, 104, 848, 423],[735, 312, 999, 916];human arm:[260, 635, 417, 955],[182, 340, 320, 714],[205, 161, 355, 497],[681, 187, 816, 471],[734, 603, 875, 999],[743, 427, 934, 892];human mouth:[488, 406, 554, 461];human ear:[640, 361, 658, 425],[794, 000, 804, 066],[826, 199, 850, 291],[980, 235, 999, 302];human face:[001, 860, 059, 999],[055, 080, 175, 315],[161, 044, 216, 174],[414, 172, 469, 392],[452, 177, 636, 517],[468, 001, 570, 142],[724, 000, 802, 060],[808, 000, 902, 141],[857, 121, 979, 382];microphone:[383, 473, 522, 808];human nose:[495, 311, 546, 399].') 2023-11-03 19:22:57.940 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:22:57.941 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video clip including frame1,frame2 and frame3, can you tell me what thisframe:1:[156, 002, 483, 895];frame:2:[156, 002, 483, 895];frame:3:[152, 002, 484, 952] is?Using the given image frames and trajectory coordinates, please provide a concise answer.Refer to the supplied image frames and trajectory coordinates for a short response to the question.Please give a succinct reply by referring to the image frames and trajectory coordinates provided.Make sure your response is brief and references the image frames and trajectory coordinates you've been given.Draw upon the provided image frames and trajectory coordinates for a concise answer to the question.Relying on the image frames and trajectory coordinates shared, please keep your answer brief.", 'This is a/an the left one of two dancing persons') 2023-11-03 19:23:12.013 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 19:23:12.014 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'human hand:[262, 639, 613, 972],[479, 510, 748, 950];clothing:[000, 000, 605, 997],[600, 000, 998, 997].') 2023-11-03 19:23:24.023 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:23:24.023 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Plate:[087, 286, 629, 823],[300, 140, 553, 304],[071, 547, 642, 985],[824, 282, 999, 579].') 2023-11-03 19:23:29.070 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:23:29.071 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[278, 129, 798, 999],[472, 056, 1000, 1000],[624, 001, 999, 888],[831, 001, 999, 537];Hat:[000, 548, 260, 873],[281, 134, 595, 355],[472, 054, 648, 226],[624, 001, 839, 106],[834, 001, 999, 060];Glasses:[355, 336, 527, 399];Belt:[962, 726, 999, 794],[882, 873, 958, 976].') 2023-11-03 19:23:59.067 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:23:59.068 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Bonn, Germany - June 8 2019: [057, 062, 568, 997] Wilson Cruz (*1973, American actor - Star Trek: Discovery) is happy to meet fans at FedCon 28, a four day sci-fi convention. FedCon 28 took place Jun 7-10 2019.') 2023-11-03 19:24:02.386 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:24:02.386 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "detail shot of Laurie Kang's sculpture 'Worm;' shot is a top-down view of [002, 003, 828, 999] the silver aluminum sculpture") 2023-11-03 19:24:10.896 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:24:10.896 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'woman:[615, 519, 775, 992],[188, 645, 355, 999],[250, 768, 435, 999],[485, 429, 662, 806],[578, 424, 666, 639],[744, 504, 870, 830],[777, 623, 965, 999],[865, 449, 938, 612],[896, 522, 999, 954];boy:[677, 274, 809, 701];man:[372, 571, 586, 969],[496, 496, 618, 924],[674, 269, 823, 698];girl:[000, 000, 999, 999];human face:[147, 111, 202, 269],[283, 729, 338, 797],[350, 815, 430, 958],[415, 497, 457, 581],[445, 600, 511, 728],[556, 513, 601, 622],[603, 438, 645, 513],[662, 572, 721, 689],[699, 281, 732, 356],[800, 540, 843, 607],[844, 661, 902, 764],[868, 467, 915, 561],[912, 559, 975, 658].') 2023-11-03 19:24:36.747 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:24:36.747 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'dessert:[312, 568, 495, 777],[003, 050, 332, 381],[003, 178, 945, 761];pastry:[320, 552, 488, 766],[002, 362, 140, 591],[095, 407, 251, 647],[133, 327, 235, 472],[192, 465, 340, 718],[240, 362, 351, 523],[265, 281, 359, 416],[323, 407, 480, 548],[342, 320, 463, 482],[354, 252, 465, 380],[435, 348, 575, 493],[446, 215, 538, 312],[448, 426, 617, 690],[452, 275, 558, 397],[532, 248, 635, 312],[533, 288, 645, 397],[544, 180, 647, 291],[565, 380, 720, 631],[644, 210, 732, 278],[645, 285, 740, 391],[653, 342, 805, 538],[713, 243, 808, 376],[760, 300, 892, 500],[845, 263, 955, 433].') 2023-11-03 19:24:44.572 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:24:44.572 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'AAMI – [032, 125, 569, 903] Tradie Work Mates – Win 1 of [572, 052, 938, 427] 5 prize packages valued at $160 each') 2023-11-03 19:24:51.649 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:24:51.649 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Person:[002, 002, 509, 999],[782, 002, 999, 1000];Van:[271, 002, 883, 177];Handbag:[474, 251, 746, 616].') 2023-11-03 19:24:56.476 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:24:56.476 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'drink:[751, 590, 807, 788],[597, 613, 647, 805],[645, 668, 673, 805],[705, 698, 743, 798];clothing:[023, 330, 421, 803],[563, 420, 900, 750];person:[000, 215, 467, 998],[567, 228, 903, 998];table:[001, 623, 998, 998];bottle:[603, 578, 643, 808],[747, 576, 790, 798].') 2023-11-03 19:25:21.373 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:25:21.373 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Dinning Table:[040, 573, 731, 1000];Napkin:[130, 762, 199, 817],[525, 822, 625, 877];Wine Glass:[469, 660, 529, 873],[293, 635, 348, 821],[279, 574, 327, 691],[354, 522, 392, 647],[429, 492, 462, 604];Plate:[360, 818, 465, 875],[550, 801, 648, 849],[477, 749, 590, 791],[180, 741, 271, 790];Basin:[596, 741, 667, 807],[478, 701, 595, 773],[582, 705, 641, 755],[426, 619, 501, 663],[309, 771, 390, 844],[217, 689, 280, 743],[347, 622, 398, 665];Chopsticks:[619, 681, 654, 726];Person:[046, 346, 291, 1000],[037, 169, 118, 551],[019, 323, 350, 711],[182, 168, 355, 471],[326, 355, 447, 637],[454, 324, 616, 597],[660, 370, 782, 855],[636, 352, 999, 981],[698, 386, 1000, 998];Other Shoes:[246, 933, 293, 999];Leather Shoes:[630, 942, 706, 984].') 2023-11-03 19:25:44.121 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:25:44.121 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the man standing in front and lifting dumbbells?As you detail the trajectories in your reply, the TrackiFrame t:[xmin,ymin,xmax,ymax] format is the preferred structure.', 'There is one man standing in front and lifting dumbbells.Track1frame:1:[415, 275, 581, 997];frame:2:[414, 275, 581, 997];frame:3:[416, 275, 581, 997].') 2023-11-03 19:25:45.049 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:25:45.050 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect human face,human head,microphone and man in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'human face:[177, 219, 411, 410];human head:[126, 172, 439, 415];microphone:[365, 198, 463, 288];man:[092, 165, 999, 999].') 2023-11-03 19:25:56.870 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:25:56.870 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'suit:[000, 468, 461, 981];tie:[141, 713, 207, 997],[746, 880, 787, 998];man:[017, 262, 432, 998],[317, 202, 571, 998],[522, 121, 999, 998];human face:[079, 368, 207, 611],[338, 357, 533, 654],[541, 258, 775, 712].') 2023-11-03 19:26:15.690 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:26:15.691 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Person:[673, 737, 807, 1000],[598, 730, 894, 982],[370, 671, 633, 999],[170, 479, 638, 999],[109, 460, 405, 999],[538, 299, 769, 744],[673, 078, 887, 629];Helmet:[684, 083, 770, 170],[535, 303, 619, 399],[453, 517, 547, 606],[225, 461, 304, 574],[681, 738, 771, 810],[597, 727, 698, 833],[624, 979, 695, 999],[867, 950, 955, 999],[240, 959, 325, 999];Gloves:[672, 312, 718, 389],[553, 504, 623, 577],[624, 496, 700, 577],[300, 476, 357, 551],[603, 947, 661, 999],[734, 276, 784, 353];Sneakers:[805, 554, 891, 640],[701, 541, 787, 629],[141, 967, 241, 999],[348, 928, 391, 999],[686, 673, 771, 741],[660, 700, 698, 731];Hat:[468, 671, 549, 762];Hockey Stick:[345, 358, 673, 500],[797, 819, 870, 1000],[328, 000, 353, 484];Paddle:[205, 844, 229, 999].') 2023-11-03 19:26:43.343 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:26:43.343 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Storage box:[596, 022, 731, 144],[522, 070, 641, 133],[793, 085, 969, 302],[751, 129, 857, 296],[931, 566, 1000, 689],[209, 676, 256, 820],[172, 683, 211, 833];Desk:[896, 674, 1000, 826];Radiator:[000, 824, 143, 1000];Potted Plant:[000, 580, 146, 804];Air Conditioner:[000, 021, 098, 181];Power outlet:[106, 192, 139, 236].') 2023-11-03 19:26:58.862 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:26:58.863 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'image of a emerald pearl granite with [433, 097, 529, 208] bird ornament for a product listing for [150, 057, 855, 932] a headstone') 2023-11-03 19:27:14.478 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:27:14.479 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[885, 547, 954, 675],personFrame1:[922, 535, 988, 686],carFrame1:[241, 521, 332, 632] in this video clip.For clarity in your response, always adopt the classFrame t:[xmin,ymin,xmax,ymax] format.', 'personFrame1:[885, 547, 954, 675];Frame2:[937, 539, 999, 683],personFrame1:[922, 535, 988, 686];Frame2:[920, 522, 1000, 673],carFrame1:[241, 521, 332, 632];Frame2:[238, 517, 330, 630].') 2023-11-03 19:27:37.272 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:27:37.273 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[605, 552, 638, 662],personFrame1:[398, 755, 439, 861],personFrame1:[575, 397, 614, 483] in this video clip.Please strictly follow the format categoryFrame t:[xmin,ymin,xmax,ymax] to answer the question.', 'personFrame1:[605, 552, 638, 662];Frame2:[628, 555, 662, 658],personFrame1:[398, 755, 439, 861];Frame2:[417, 775, 459, 854],personFrame1[575, 397, 614, 483];Frame2:No object.') 2023-11-03 19:27:59.376 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 19:27:59.376 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Van,Street Lights and Peach in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Van:[604, 407, 674, 499];Street Lights:[358, 222, 409, 465].') 2023-11-03 19:28:36.186 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:28:36.186 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect human hair,human head,human face and man in this image.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'human hair:[085, 229, 117, 328],[462, 277, 536, 364],[808, 260, 860, 337];human head:[083, 142, 227, 357],[000, 130, 999, 434],[456, 201, 572, 375],[805, 209, 891, 348];human face:[000, 214, 003, 282],[005, 231, 044, 316],[066, 286, 105, 350],[140, 227, 212, 345],[173, 308, 396, 433],[443, 191, 483, 260],[523, 273, 570, 368],[555, 239, 786, 352],[848, 247, 893, 343];man:[758, 197, 913, 950],[000, 136, 999, 999].') 2023-11-03 19:28:44.880 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:28:44.880 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Street Lights:[170, 550, 224, 706],[252, 589, 271, 703],[573, 456, 648, 730],[731, 640, 747, 674];Car:[241, 692, 301, 727],[122, 689, 215, 737];SUV:[799, 652, 1000, 738].') 2023-11-03 19:28:52.616 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:28:52.616 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[565, 370, 656, 772],[026, 376, 114, 647],[213, 360, 268, 625],[355, 372, 405, 666];tree:[015, 038, 515, 468],[576, 000, 999, 529];footwear:[378, 724, 441, 805];shorts:[143, 491, 180, 557],[380, 587, 443, 654],[471, 515, 515, 581],[575, 576, 642, 615];land vehicle:[788, 391, 861, 470];human leg:[033, 529, 057, 650],[073, 530, 103, 635],[133, 532, 154, 626],[153, 530, 194, 628],[217, 519, 250, 622],[380, 648, 398, 794],[420, 646, 436, 794],[461, 569, 488, 665],[499, 575, 520, 673],[516, 549, 531, 636],[565, 573, 604, 747],[628, 584, 655, 763];man:[075, 365, 134, 623],[128, 362, 203, 638],[367, 374, 468, 801],[388, 360, 413, 429],[434, 350, 466, 662],[453, 341, 531, 675],[499, 348, 553, 656],[554, 378, 658, 766];human arm:[053, 434, 077, 542],[148, 426, 164, 509],[211, 422, 225, 505],[364, 476, 386, 626],[436, 458, 463, 606],[562, 454, 578, 547],[620, 451, 652, 575].') 2023-11-03 19:29:24.794 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:29:24.794 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Microphone:[154, 422, 196, 585],[527, 413, 567, 592],[877, 444, 937, 596];Desk:[002, 552, 998, 1000];Person:[326, 670, 639, 999],[000, 265, 123, 559],[218, 242, 420, 569],[454, 264, 644, 570],[664, 288, 817, 577],[861, 267, 1000, 592].') 2023-11-03 19:29:40.887 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:29:40.887 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect teapot,saucer and drawer in this image.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'teapot:[773, 431, 999, 586];saucer:[796, 617, 944, 649].') 2023-11-03 19:29:46.689 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:29:46.689 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Person:[141, 297, 656, 705],[463, 236, 679, 676];Flag:[816, 367, 966, 474];Bench:[639, 400, 822, 682];Glasses:[364, 328, 480, 362];Hat:[366, 294, 498, 362];Helmet:[475, 233, 599, 294];Gloves:[564, 440, 647, 486],[138, 432, 213, 493];Leather Shoes:[432, 632, 520, 700],[206, 642, 260, 703];Boots:[607, 594, 675, 674];Mask:[497, 292, 578, 329];Motorcycle:[032, 363, 1000, 853].') 2023-11-03 19:29:47.427 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:29:47.428 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect woman,human face,man and suit in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'woman:[586, 176, 711, 804],[228, 136, 350, 885],[341, 194, 448, 872],[435, 148, 549, 845],[688, 129, 783, 826],[765, 122, 875, 854];human face:[056, 104, 102, 204],[162, 122, 206, 224],[185, 074, 225, 167],[250, 168, 295, 250],[307, 144, 344, 215],[376, 230, 413, 306],[420, 138, 449, 220],[471, 162, 506, 248],[538, 110, 577, 211],[627, 184, 660, 251],[713, 146, 753, 217],[813, 162, 846, 238],[872, 102, 927, 209];man:[012, 027, 136, 896],[109, 105, 240, 932],[124, 061, 255, 821],[287, 130, 368, 821],[410, 123, 463, 792],[508, 096, 621, 804],[833, 067, 994, 998];suit:[016, 067, 131, 917],[107, 202, 241, 894],[115, 171, 258, 786],[415, 213, 461, 736],[508, 171, 622, 751],[596, 246, 711, 482],[688, 222, 778, 820],[763, 236, 864, 854],[828, 162, 992, 956].') 2023-11-03 19:29:50.066 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:29:50.066 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'jacket:[255, 221, 489, 713],[468, 267, 600, 643],[583, 273, 725, 704];footwear:[592, 856, 691, 968],[485, 857, 553, 887];woman:[411, 326, 589, 999];man:[282, 200, 482, 925],[464, 165, 632, 902],[575, 240, 732, 990];girl:[411, 323, 596, 998].') 2023-11-03 19:30:06.976 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:30:06.977 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2 and frame3, please tell me what this oneframe:3:[989, 239, 999, 311] is doing?Please briefly answer the question based on the provided image frames and trajectory coordinates.', 'This is a/an bird moving a bit') 2023-11-03 19:30:20.699 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:30:20.699 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Sebastien Buemi of Switzerland and [136, 132, 775, 677] Scuderia Toro Rosso clips [742, 178, 997, 931] a wall as he drives in the final practice session prior to qualifying for the Canadian...') 2023-11-03 19:30:21.670 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:30:21.670 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given an image,please detect person in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'person:[908, 447, 1000, 1000],[713, 754, 791, 1000],[690, 429, 811, 1000],[583, 545, 706, 1000],[839, 431, 949, 972],[929, 412, 1000, 918],[753, 422, 836, 820],[541, 427, 629, 875],[514, 425, 602, 850],[598, 450, 665, 787],[433, 420, 516, 837],[321, 454, 461, 1000],[153, 341, 322, 1000],[053, 500, 156, 995],[001, 466, 063, 952],[001, 458, 025, 1000],[077, 770, 180, 960],[055, 487, 133, 929],[287, 454, 363, 793],[346, 437, 406, 725],[388, 439, 440, 716],[416, 456, 485, 808],[501, 454, 535, 610],[876, 441, 934, 756],[836, 439, 910, 791],[809, 454, 870, 768],[673, 445, 720, 670],[618, 443, 667, 729],[255, 445, 381, 522],[359, 920, 441, 1000],[450, 806, 535, 964],[796, 806, 915, 922],[538, 856, 680, 997],[920, 470, 956, 575].') 2023-11-03 19:30:36.802 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:30:36.803 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'human body:[309, 579, 401, 731],[066, 671, 189, 838],[143, 695, 222, 845],[197, 736, 274, 881],[305, 168, 417, 219],[348, 161, 432, 222],[374, 619, 432, 699],[384, 183, 505, 246],[502, 540, 576, 679],[505, 796, 610, 896],[574, 698, 700, 886],[678, 085, 766, 223],[794, 139, 865, 247],[878, 585, 938, 676];man:[307, 581, 403, 729],[178, 710, 270, 879],[298, 163, 409, 214],[357, 134, 421, 226],[401, 175, 501, 232],[498, 522, 578, 677],[575, 686, 680, 883].') 2023-11-03 19:30:51.841 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:30:51.842 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[525, 278, 903, 737] chocolate truffle with [043, 054, 861, 816] a card saying i love you - greeting card stock pictures, [594, 624, 806, 721] royalty-free photos & images') 2023-11-03 19:31:30.955 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:31:30.956 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Car:[002, 560, 147, 713];Bus:[171, 263, 973, 789].') 2023-11-03 19:31:58.397 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:31:58.397 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Frame:[001, 231, 031, 324],[001, 140, 037, 233];Cabinet:[100, 263, 236, 435],[867, 604, 1000, 999];Person:[736, 375, 798, 494],[709, 359, 744, 454],[740, 346, 774, 419],[440, 287, 912, 1000],[153, 146, 896, 999],[239, 315, 290, 425];Gloves:[778, 623, 896, 825];Hat:[457, 148, 613, 292];Glasses:[534, 670, 586, 822];Chair:[001, 728, 056, 998],[001, 626, 102, 912],[119, 650, 254, 952].') 2023-11-03 19:32:23.822 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:32:23.823 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[445, 209, 876, 654] and track its trajectory.To maintain consistency, ensure that trajectories in your answer match the Frame t:[xmin,ymin,xmax,ymax] setup.', 'This is a grizzlyFrame1:[445, 209, 876, 654];Frame2:[573, 220, 997, 637];Frame3:[654, 202, 998, 648];Frame4:[618, 220, 998, 648].') 2023-11-03 19:32:38.572 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:32:38.572 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'boy:[326, 216, 576, 572];clothing:[000, 000, 191, 928],[348, 426, 557, 549],[528, 315, 697, 553],[838, 000, 999, 155];auto part:[101, 000, 276, 159],[147, 000, 911, 256],[166, 147, 758, 998],[186, 135, 686, 705],[418, 489, 698, 713],[623, 010, 999, 739],[716, 707, 999, 998];human face:[405, 299, 498, 485];car:[122, 000, 999, 998].') 2023-11-03 19:32:46.422 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:32:46.422 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect tick,pencil case,human leg,human body,woman,trousers,table and human hair in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'human leg:[013, 429, 063, 724],[063, 425, 148, 716],[253, 506, 338, 667],[778, 560, 871, 755],[835, 579, 958, 790];human body:[815, 398, 975, 796],[008, 268, 147, 720],[249, 379, 388, 673],[427, 389, 496, 466];woman:[010, 183, 161, 740],[244, 313, 390, 673],[430, 342, 503, 460],[538, 338, 596, 425],[596, 356, 653, 472];trousers:[024, 439, 133, 714],[253, 501, 340, 672],[799, 553, 978, 746];table:[330, 460, 761, 708];human hair:[013, 185, 105, 339],[455, 339, 503, 402],[598, 360, 649, 428].') 2023-11-03 19:32:52.810 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:32:52.811 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2 and frame3, please tell me what this oneframe:1:[850, 362, 859, 383];frame:2:[849, 361, 859, 382];frame:3:[848, 361, 858, 382] is doing?Please briefly answer the question based on the provided image frames and trajectory coordinates.', 'This is a/an person seated down and looking down') 2023-11-03 19:32:54.920 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:32:54.921 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect person,sports equipment,jacket and footwear in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'person:[867, 097, 985, 795],[000, 261, 059, 804],[000, 019, 999, 812],[027, 218, 158, 809],[111, 109, 256, 723],[243, 178, 366, 706],[291, 240, 530, 784],[336, 081, 459, 715],[414, 183, 683, 798],[462, 117, 516, 266],[511, 097, 556, 169],[583, 105, 638, 536],[590, 090, 720, 639],[668, 120, 800, 687],[733, 058, 842, 651],[959, 066, 999, 804];sports equipment:[000, 736, 059, 809],[073, 754, 217, 807],[548, 721, 646, 809],[858, 702, 970, 777];jacket:[110, 159, 258, 491],[000, 353, 054, 575],[035, 255, 147, 521],[265, 236, 366, 470],[335, 117, 469, 412],[486, 243, 685, 509],[607, 169, 697, 383],[677, 189, 794, 452],[854, 189, 997, 482];footwear:[502, 724, 533, 792],[584, 729, 642, 807],[772, 619, 840, 652],[889, 612, 946, 738],[967, 679, 999, 789].') 2023-11-03 19:33:09.305 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:33:09.305 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'suit:[088, 105, 316, 867],[270, 227, 532, 895],[834, 259, 998, 861];woman:[425, 102, 719, 979],[626, 224, 720, 452],[723, 245, 843, 731];man:[087, 098, 320, 869],[000, 463, 104, 823],[275, 060, 529, 900],[831, 163, 991, 910];dress:[470, 456, 703, 999];chair:[662, 730, 732, 975],[698, 572, 910, 806],[830, 593, 999, 999];table:[000, 820, 536, 999];flower:[249, 870, 365, 999],[331, 803, 420, 930],[335, 857, 416, 999],[403, 866, 445, 950],[423, 918, 489, 997],[542, 880, 653, 999].') 2023-11-03 19:33:47.490 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:33:47.491 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Caribou [349, 420, 666, 735] blending with the boulders [592, 230, 799, 364] [780, 208, 983, 366] [283, 286, 669, 568] [322, 667, 500, 808] [147, 099, 487, 249] [588, 488, 901, 754] on the beach in Newfoundland.') 2023-11-03 19:34:08.751 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:34:08.751 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Cardio Equipment | [004, 005, 995, 994] Our fitness center has all the cardio and weight training equipment including [087, 340, 578, 993] treadmills and spinning bikes.') 2023-11-03 19:34:17.475 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:34:17.475 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Advertisement poster [009, 007, 980, 993] with the word "Pears" in upper left corner. Features a small child [118, 239, 786, 699] and puppy near a fireplace. A basket of coals has spilled, and the child and puppy are covered in soot marks.') 2023-11-03 19:34:23.343 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:34:23.343 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect girl,woman and plant in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'girl:[000, 279, 028, 590],[650, 356, 702, 594],[705, 257, 781, 607],[753, 262, 811, 577],[836, 324, 911, 602];woman:[461, 148, 702, 889],[095, 194, 173, 557],[184, 240, 270, 606],[231, 163, 286, 309],[271, 238, 356, 571],[582, 201, 646, 348],[641, 236, 704, 417],[698, 243, 788, 611],[746, 253, 815, 593],[916, 207, 999, 565];plant:[000, 023, 108, 311],[075, 408, 125, 519],[146, 106, 340, 280],[173, 411, 323, 555],[371, 029, 470, 250],[511, 000, 999, 569].') 2023-11-03 19:34:38.200 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:34:38.200 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the two birds in cage?For the trajectories included in the answer, please use the format TrackiFrame t:[xmin,ymin,xmax,ymax].', 'There is one two birds in cage.Track1frame:3:[097, 069, 183, 299].') 2023-11-03 19:34:44.981 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:34:44.982 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[002, 132, 108, 999],[068, 543, 132, 870],[098, 536, 163, 867],[225, 482, 315, 907],[297, 595, 528, 999],[439, 378, 555, 999],[458, 264, 598, 999],[516, 150, 665, 999],[606, 129, 772, 999],[804, 416, 916, 936],[918, 520, 999, 871];Helmet:[002, 132, 108, 327],[297, 605, 366, 764],[456, 263, 541, 434],[517, 152, 613, 339],[604, 133, 706, 307].') 2023-11-03 19:34:49.513 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:34:49.514 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'clothing:[115, 766, 295, 986],[034, 781, 156, 907],[077, 800, 205, 913],[258, 852, 399, 999],[380, 826, 523, 999],[454, 948, 626, 999],[552, 820, 633, 876],[621, 767, 724, 843],[727, 823, 831, 946],[727, 939, 999, 999];woman:[234, 678, 407, 997],[380, 724, 539, 999],[441, 831, 603, 999],[542, 742, 644, 883],[587, 833, 756, 999],[681, 542, 911, 970];girl:[356, 731, 518, 999],[432, 831, 600, 999],[586, 826, 753, 999].') 2023-11-03 19:35:17.732 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:35:17.732 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'flower:[518, 031, 880, 468],[310, 637, 339, 699],[343, 142, 553, 415],[361, 460, 974, 975],[741, 671, 780, 735].') 2023-11-03 19:35:19.138 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:35:19.139 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A course worker [406, 515, 645, 893] takes his shirt [528, 600, 637, 827] off to enjoy warm weather during the snowboard parallel slalom competition on February 22.') 2023-11-03 19:35:25.355 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:35:25.356 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Jackson the Bully Basset as a puppy [461, 322, 737, 814] standing in between two jack [159, 375, 490, 887]-o- [718, 468, 956, 851]lanterns with a person [032, 002, 793, 631] holding him in place') 2023-11-03 19:35:45.885 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:35:45.885 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Traffic Light:[301, 318, 358, 411];Car:[422, 595, 999, 1000],[001, 550, 098, 628];Traffic Sign:[263, 424, 331, 492];Person:[311, 541, 380, 702].') 2023-11-03 19:35:47.857 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:35:47.858 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Train:[223, 550, 351, 644],[097, 554, 183, 627],[178, 557, 227, 627],[002, 391, 091, 996],[342, 529, 1000, 757].') 2023-11-03 19:37:35.703 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 19:37:35.703 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'person:[165, 199, 434, 884];footwear:[250, 826, 392, 893],[264, 720, 424, 777];football:[671, 748, 791, 838].') 2023-11-03 19:37:47.307 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:37:47.308 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the the two zebras that are currently fighting.?To ensure accuracy, apply the TrackiFrame t:[xmin,ymin,xmax,ymax] template for every trajectory in your response.', 'There are 2 the two zebras that are currently fighting..Track1frame:1:[051, 321, 600, 849];frame:2:[051, 321, 600, 849];frame:3:[214, 242, 652, 876],Track2frame:1:[232, 293, 671, 862];frame:2:[232, 293, 671, 862];frame:3:[365, 249, 925, 790].') 2023-11-03 19:39:00.367 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:39:00.368 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Ezekiel 36:1 1 Also, thou [032, 230, 527, 911] son of man, prophesy unto the mountains of Israel, and say, Ye mountains of Israel, hear the word of the LORD:') 2023-11-03 19:39:12.555 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:39:12.555 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Chair in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Chair:[469, 724, 534, 872],[526, 738, 588, 895],[543, 686, 581, 732],[582, 689, 616, 756],[626, 717, 659, 775],[671, 748, 715, 799],[722, 754, 773, 945],[576, 787, 628, 929],[622, 785, 673, 954],[663, 828, 725, 978],[734, 550, 776, 666],[773, 568, 828, 717],[834, 611, 883, 718],[888, 607, 931, 752],[690, 591, 738, 707],[730, 627, 776, 734],[800, 636, 849, 757],[842, 658, 897, 784].') 2023-11-03 19:40:05.422 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:40:05.422 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'a spring work look with a mustard sleeveless tee [412, 171, 668, 509], white pants [416, 490, 656, 849], tan mules and a tan bag [241, 323, 522, 543]') 2023-11-03 19:40:58.182 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:40:58.182 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person,Moniter,Book,Frame and Desk in this image.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[897, 002, 958, 160];TV:[351, 176, 917, 772];Book:[071, 816, 404, 1000];Picture:[814, 002, 1000, 204],[914, 237, 965, 385];Desk:[914, 373, 999, 504].') 2023-11-03 19:41:13.459 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:41:13.459 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[245, 024, 687, 978] Sydne Style shows how to wear [320, 330, 604, 963] black high waisted pants with a crop top for summer outfit ideas in guess gingham') 2023-11-03 19:41:29.404 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:41:29.404 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'flag:[201, 000, 651, 999];woman:[536, 392, 999, 999];girl:[273, 518, 656, 999];clothing:[381, 813, 640, 999],[630, 679, 993, 996];human face:[486, 644, 621, 840],[644, 515, 778, 776].') 2023-11-03 19:41:53.651 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:41:53.651 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[002, 037, 991, 924] Freezer Omelette Muffin Cups: [002, 205, 256, 426] [136, 686, 448, 910] [002, 434, 284, 657] [579, 045, 892, 259] [246, 074, 562, 298] [337, 318, 664, 547] [727, 304, 997, 527] [460, 560, 778, 788] Omelettes baked in muffin tins and frozen for the perfect protein-packed, quick and easy breakfast!') 2023-11-03 19:41:54.580 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:41:54.581 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Sneakers:[416, 946, 450, 999];Person:[342, 256, 538, 998].') 2023-11-03 19:42:00.628 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:42:00.628 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'window:[020, 000, 180, 585];door:[613, 165, 725, 721],[768, 210, 843, 550],[900, 148, 995, 828];window blind:[000, 000, 143, 554].') 2023-11-03 19:42:41.400 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:42:41.401 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[716, 011, 936, 479] an orange litter bin hanging on [002, 005, 996, 926] a wall by [000, 621, 665, 993] a bench with [691, 713, 743, 945] a bottle on the floor that did not fit in it in Hamburg Germany') 2023-11-03 19:42:49.761 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:42:49.762 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Lamp in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Lamp:[100, 001, 160, 172],[345, 001, 405, 171],[590, 001, 648, 169],[836, 001, 895, 168].') 2023-11-03 19:42:55.744 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:42:55.744 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Lamp:[605, 223, 671, 481],[224, 083, 359, 774];Nightstand:[539, 576, 696, 718];Pillow:[787, 407, 998, 732],[837, 452, 999, 743],[743, 427, 869, 719];Bed:[418, 110, 999, 999];Picture:[518, 109, 627, 280],[410, 111, 507, 272];Chair:[319, 393, 536, 905];Desk:[098, 561, 267, 957];Stool:[110, 658, 261, 967];Flower:[077, 319, 215, 595].') 2023-11-03 19:42:57.655 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:42:57.655 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Other Shoes:[448, 573, 519, 635],[619, 575, 681, 624];Gloves:[695, 368, 755, 460];Camera:[682, 279, 730, 314];Person:[450, 110, 807, 629].') 2023-11-03 19:43:08.898 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:43:08.898 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'People [445, 183, 673, 992] stand outside a closed vaccination centre [002, 164, 604, 997] due stock shortages of Covid-19 coronavirus vaccines in Mumbai on May 6, 2021.') 2023-11-03 19:43:09.732 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:43:09.733 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Wine Glass:[370, 573, 438, 786];Tissue:[002, 565, 118, 799];Towel:[033, 762, 400, 961];Fan:[433, 112, 653, 206];Frame:[225, 002, 364, 333];Cabinet:[002, 002, 288, 254],[373, 767, 505, 999];Couch:[439, 496, 588, 672],[552, 532, 714, 819];Carpet:[499, 853, 713, 946];Coffee Table:[447, 566, 550, 706];Person:[509, 002, 999, 1000].') 2023-11-03 19:43:14.639 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:43:14.640 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Desk:[659, 863, 707, 911];Person:[499, 823, 578, 998],[807, 789, 999, 999];Bicycle:[008, 886, 085, 999].') 2023-11-03 19:43:24.057 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:43:24.057 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'person:[428, 182, 584, 575],[134, 280, 270, 384],[134, 398, 270, 490],[154, 177, 263, 264],[289, 282, 421, 376],[297, 170, 426, 268],[581, 177, 713, 275],[727, 170, 860, 272],[734, 280, 863, 381];human head:[441, 188, 530, 266],[288, 286, 417, 336],[609, 182, 682, 234],[760, 174, 833, 228],[760, 284, 847, 340];clothing:[290, 192, 410, 270],[131, 317, 260, 378],[134, 454, 267, 498],[158, 200, 257, 258],[299, 332, 415, 384],[434, 274, 538, 476],[589, 223, 714, 270],[732, 216, 865, 270],[732, 315, 865, 385];poster:[000, 000, 999, 999];dog:[148, 605, 289, 698],[350, 536, 562, 670],[552, 390, 858, 663].') 2023-11-03 19:43:31.531 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:43:31.531 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "[363, 056, 962, 998] robert fairchild (jerry mulligan) and [050, 092, 407, 998] leanne michelle cope (lise dassin) star in christopher wheeldon's an american in paris at the dominion theatre") 2023-11-03 19:43:34.090 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:43:34.091 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video clip including frame1,frame2 and frame3,can you tell me what is thisFrame1:[489, 241, 663, 941] and track its trajectory.If you're including trajectory details in your reply, the Frame t:[xmin,ymin,xmax,ymax] format is imperative.", 'This is a personFrame1:[489, 241, 663, 941];Frame2:[496, 235, 653, 864];Frame3:[535, 281, 668, 883].') 2023-11-03 19:43:48.631 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:43:48.632 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'butterfly:[017, 048, 217, 178],[018, 842, 090, 938],[026, 283, 092, 371],[028, 512, 091, 600],[031, 603, 098, 694],[036, 383, 112, 507],[041, 178, 102, 271],[075, 692, 136, 772],[100, 276, 271, 373],[101, 515, 172, 608],[102, 846, 180, 934],[105, 617, 181, 683],[113, 189, 186, 267],[126, 400, 210, 490],[155, 703, 262, 774],[161, 779, 263, 844],[176, 850, 247, 934],[181, 503, 265, 602],[191, 615, 262, 692],[211, 168, 297, 271],[217, 074, 310, 169],[228, 385, 403, 507],[247, 848, 318, 942],[265, 613, 338, 692],[267, 696, 321, 762],[267, 510, 338, 596],[281, 287, 406, 362],[308, 072, 387, 162],[321, 166, 421, 291],[323, 692, 457, 786],[330, 851, 451, 936],[348, 787, 463, 851],[352, 515, 415, 605],[352, 612, 423, 679],[390, 060, 561, 181],[408, 354, 545, 514],[416, 295, 487, 356],[416, 501, 581, 617],[423, 617, 497, 686],[437, 167, 618, 295],[460, 698, 536, 781],[461, 858, 532, 925],[470, 787, 537, 848],[497, 287, 577, 357],[500, 615, 561, 686],[535, 856, 596, 930],[540, 693, 657, 763],[551, 779, 608, 846],[566, 613, 645, 688],[566, 388, 837, 532],[568, 061, 663, 167],[576, 531, 695, 602],[580, 293, 656, 357],[596, 848, 693, 940],[608, 775, 675, 848],[640, 167, 748, 288],[647, 615, 708, 683],[658, 694, 728, 763],[672, 277, 775, 390],[680, 763, 765, 844],[685, 061, 775, 168],[692, 524, 761, 608],[703, 853, 782, 944],[721, 612, 785, 688],[737, 693, 800, 765],[771, 524, 856, 610],[772, 162, 870, 285],[775, 279, 872, 385],[777, 772, 865, 856],[782, 074, 858, 162],[790, 615, 863, 679],[793, 851, 860, 942],[797, 696, 866, 767].') 2023-11-03 19:44:00.217 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:44:00.217 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'woman:[415, 149, 529, 922],[165, 024, 300, 495],[339, 072, 446, 434];man:[745, 014, 940, 878],[000, 098, 105, 374],[071, 100, 186, 527],[301, 000, 396, 160],[301, 043, 417, 315],[415, 000, 503, 191],[504, 000, 882, 999],[772, 000, 876, 162];shorts:[000, 873, 091, 999],[098, 392, 175, 487],[171, 265, 242, 334],[805, 384, 890, 510],[932, 403, 999, 520];boy:[000, 437, 141, 999],[001, 403, 124, 997],[015, 376, 108, 634],[051, 340, 117, 889];swimwear:[000, 876, 093, 999],[080, 760, 128, 832],[188, 453, 260, 718],[265, 446, 338, 721],[301, 475, 373, 761],[348, 551, 436, 825],[439, 338, 516, 478],[801, 380, 886, 509],[932, 404, 999, 511];girl:[179, 320, 275, 996],[260, 312, 353, 998],[295, 334, 417, 989],[345, 405, 436, 999],[431, 171, 546, 999].') 2023-11-03 19:44:19.282 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:44:19.283 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[270, 272, 664, 704] and track its trajectory.Ensure the trajectories in your answer follow the Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a platypusFrame1:[270, 272, 664, 704];Frame2:[207, 041, 631, 710];Frame3:[023, 002, 601, 135];Frame4:[015, 002, 568, 500];Frame5:[107, 085, 817, 587].') 2023-11-03 19:44:50.497 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:44:50.497 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'sculpture:[661, 297, 983, 1000];fountain:[000, 306, 999, 995];skyscraper:[002, 288, 135, 505],[194, 276, 415, 481].') 2023-11-03 19:45:05.121 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:45:05.121 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[316, 078, 643, 998] Magician in HK standing in the middle of magic hat and [107, 659, 160, 748] vase performing stage magic with [326, 230, 521, 833] colorful silks .') 2023-11-03 19:45:19.800 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 19:45:19.801 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Zebra:[088, 147, 998, 974],[023, 010, 789, 1000],[086, 002, 833, 472].') 2023-11-03 19:45:22.737 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:45:22.737 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect human body,human hair,man and human hand in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'human body:[393, 189, 678, 998];human hair:[313, 068, 474, 296],[493, 188, 617, 315];man:[076, 167, 565, 943],[398, 182, 683, 998];human hand:[374, 456, 585, 760],[367, 740, 502, 890],[562, 467, 629, 573],[583, 522, 685, 697].') 2023-11-03 19:45:26.195 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:45:26.195 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'human body:[307, 492, 607, 1000],[000, 537, 102, 714],[038, 530, 272, 889],[102, 516, 179, 676],[225, 462, 281, 581],[231, 528, 391, 854],[471, 662, 758, 999],[481, 410, 558, 508],[530, 429, 595, 515],[550, 560, 625, 743],[598, 441, 645, 543],[618, 555, 754, 714],[732, 436, 833, 557],[758, 454, 880, 652],[832, 515, 968, 927],[906, 453, 999, 695];woman:[551, 523, 756, 1000],[000, 484, 136, 789],[091, 443, 180, 613],[250, 445, 395, 859],[546, 482, 634, 728],[762, 397, 873, 555];clothing:[565, 628, 765, 984],[008, 545, 121, 688],[085, 509, 178, 661],[120, 513, 275, 861],[217, 463, 283, 558],[238, 532, 385, 859],[298, 527, 480, 877],[323, 621, 581, 996],[482, 397, 553, 513],[519, 424, 603, 511],[546, 571, 627, 732],[631, 399, 674, 463],[696, 428, 756, 507],[713, 582, 750, 686],[725, 397, 793, 488],[736, 513, 795, 659],[756, 424, 831, 530],[792, 457, 874, 551],[833, 511, 960, 851],[918, 439, 999, 684];human hair:[000, 363, 999, 699],[483, 512, 559, 626];man:[129, 438, 278, 837],[215, 399, 283, 570],[294, 512, 575, 996],[348, 441, 476, 876],[394, 376, 440, 476],[444, 409, 503, 581],[488, 383, 558, 499],[518, 374, 604, 508],[586, 403, 646, 570],[598, 439, 755, 745],[630, 364, 683, 460],[644, 389, 699, 482],[726, 364, 786, 471],[739, 383, 835, 541],[844, 464, 976, 907],[922, 389, 999, 606];human face:[180, 449, 220, 533],[328, 458, 363, 535],[390, 460, 430, 560],[446, 422, 485, 511],[470, 544, 529, 658],[549, 513, 593, 590],[618, 553, 671, 676],[644, 449, 696, 533];human arm:[108, 553, 133, 690],[134, 555, 162, 759],[157, 551, 275, 755],[258, 560, 371, 716],[289, 646, 435, 789],[471, 682, 621, 999],[518, 441, 550, 522],[523, 696, 582, 999],[558, 715, 747, 999],[710, 597, 745, 696],[742, 549, 788, 650];human hand:[291, 710, 323, 773],[468, 939, 535, 999],[508, 934, 564, 999].') 2023-11-03 19:45:28.738 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:45:28.739 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Lobster:[570, 573, 825, 885],[406, 528, 630, 694];Lamp:[246, 174, 395, 759];Plate:[709, 720, 926, 836],[177, 801, 598, 905],[000, 791, 149, 937],[000, 648, 101, 748],[808, 870, 1000, 1000];Cup:[117, 532, 253, 797],[028, 449, 084, 637];Bottle:[396, 519, 424, 617],[401, 184, 474, 588],[217, 200, 297, 357],[107, 285, 261, 695],[075, 199, 152, 653],[000, 313, 031, 647];Canned:[146, 503, 228, 536].') 2023-11-03 19:45:42.868 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:45:42.869 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is thisFrame1:[485, 263, 491, 284] and track its trajectory.For the trajectories included in the answer, please use the format Frame t:[xmin,ymin,xmax,ymax].', 'This is a european hareFrame1:[485, 263, 491, 284];Frame2:[496, 284, 504, 309];Frame3:[500, 284, 508, 310].') 2023-11-03 19:46:22.498 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:46:22.498 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, can you tell me what thisframe:1:[674, 516, 714, 650];frame:2:[713, 507, 748, 624];frame:3:[750, 497, 779, 596] is?Craft a concise reply using the image frames and trajectory specifics you have at hand.', 'This is a/an the rider on the motorized-bicycle, operating to the right, and his bike.') 2023-11-03 19:46:51.649 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 19:46:51.650 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'baked goods:[202, 199, 828, 763].') 2023-11-03 19:47:08.942 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:47:08.943 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'knowing the difference between [713, 204, 928, 394] types of essays and being able to 5 which image matches what type') 2023-11-03 19:48:13.473 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 19:48:13.473 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Cup:[943, 224, 999, 413],[903, 123, 1000, 327];Bottle:[710, 001, 776, 094];Person:[001, 249, 759, 1000];Cat:[106, 328, 745, 783];Other Shoes:[527, 292, 621, 406];shelf:[651, 363, 1000, 909];Desk:[000, 001, 999, 444];Mouse:[732, 136, 824, 254];Keyboard:[090, 001, 685, 295];Scissors:[573, 005, 682, 058].') 2023-11-03 19:49:07.402 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:49:07.403 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, what is this objectframe:1:[076, 002, 941, 764];frame:2:[076, 002, 941, 764];frame:3:[086, 002, 929, 766] doing?Briefly articulate your response by taking cues from the provided image frames and trajectory coordinates.', 'This is a/an a man cleaning shoes') 2023-11-03 19:50:20.828 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:50:20.829 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Coconut:[404, 182, 437, 234];Bakset:[502, 330, 573, 405];Backpack:[550, 194, 584, 251];Car:[461, 221, 999, 959],[778, 211, 832, 255];Bicycle:[496, 325, 608, 404];Person:[039, 110, 221, 954],[165, 070, 438, 865],[553, 172, 589, 319],[696, 176, 734, 352],[740, 154, 782, 345].') 2023-11-03 19:50:37.323 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:50:37.323 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Lamp:[562, 002, 658, 156],[970, 110, 1000, 187],[085, 002, 174, 169];Chair:[002, 414, 054, 633],[528, 408, 588, 620],[057, 453, 118, 724];Person:[434, 152, 923, 996],[105, 043, 623, 996],[913, 433, 998, 581],[851, 246, 922, 473],[946, 304, 993, 379],[911, 303, 947, 390],[489, 338, 570, 464],[420, 349, 459, 429],[407, 306, 438, 359],[107, 333, 137, 401],[056, 408, 111, 611];Glasses:[230, 150, 374, 203],[612, 461, 731, 604];Tie:[299, 341, 398, 572],[608, 443, 699, 889];Wine Glass:[964, 544, 996, 648];Plate:[925, 812, 1000, 883],[920, 742, 999, 786],[921, 691, 999, 741];Desk:[474, 380, 625, 457],[910, 360, 953, 427],[908, 434, 1000, 499].') 2023-11-03 19:51:00.116 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:51:00.117 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'When combined, the small individual contributors of caring, friendship, forgiveness, and love, Jimmy Carter Picture Quote [109, 193, 887, 797]') 2023-11-03 19:51:44.030 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:51:44.031 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[319, 547, 824, 998] A man kisses an icon of Soviet dictator Joseph Stalin next to his grave in front of [653, 002, 891, 432] the Kremlin wall in Moscow.') 2023-11-03 19:51:45.502 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:51:45.502 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Drum:[436, 507, 505, 599];Guitar:[750, 480, 853, 533];Person:[360, 481, 424, 734],[464, 374, 539, 543],[173, 413, 233, 517],[743, 393, 812, 656];Speaker:[629, 581, 736, 656];Tripod:[691, 429, 783, 664],[490, 493, 529, 598],[518, 487, 561, 571],[422, 464, 457, 604].') 2023-11-03 19:52:00.561 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:52:00.561 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'an astronaut [454, 088, 734, 834] gardening on the moon - out of context stock pictures, royalty-free photos [598, 626, 796, 719] & images') 2023-11-03 19:52:09.058 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:52:09.059 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'person:[045, 384, 335, 973],[154, 465, 314, 595],[339, 093, 758, 512],[537, 651, 722, 884],[575, 242, 943, 854];sculpture:[031, 378, 343, 962],[343, 100, 758, 509],[545, 656, 712, 889],[562, 246, 954, 887].') 2023-11-03 19:52:24.800 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:52:24.800 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect bottle,woman,toy,clothing and human face in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'woman:[000, 048, 999, 999];toy:[000, 018, 358, 671];clothing:[062, 361, 345, 564],[101, 469, 999, 999];human face:[386, 219, 743, 454].') 2023-11-03 19:53:16.197 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:53:16.197 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Other Shoes:[822, 800, 875, 829],[880, 802, 921, 829],[925, 734, 957, 783],[935, 698, 951, 721];Sneakers:[787, 699, 809, 723],[764, 702, 790, 728];SUV:[696, 528, 768, 657],[435, 518, 707, 696];Street Lights:[377, 332, 421, 468],[349, 282, 377, 473],[513, 453, 537, 548];Car:[049, 542, 417, 744],[000, 559, 044, 612],[000, 603, 052, 658];Fire Truck:[037, 460, 454, 654];Person:[755, 462, 810, 729],[821, 376, 929, 829].') 2023-11-03 19:53:25.569 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 19:53:25.569 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Wild Bird:[418, 268, 489, 382].') 2023-11-03 19:53:33.460 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:53:33.461 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Gloves,Person and Shovel in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Gloves:[525, 313, 565, 374],[598, 351, 634, 408];Person:[490, 091, 737, 412],[001, 205, 103, 936];Shovel:[544, 375, 600, 420],[863, 390, 939, 475],[905, 712, 1000, 816].') 2023-11-03 19:54:02.242 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:54:02.242 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'woman:[000, 092, 998, 999];clothing:[000, 654, 998, 999];human face:[288, 151, 729, 453].') 2023-11-03 19:54:03.250 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:54:03.250 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person,Car and Desk in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Person:[276, 735, 329, 841],[457, 741, 491, 790],[530, 737, 576, 812],[894, 737, 929, 791],[927, 727, 966, 795];Car:[167, 787, 718, 996],[708, 789, 999, 997],[724, 776, 927, 881];Desk:[001, 833, 177, 979].') 2023-11-03 19:54:18.135 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:54:18.135 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Two drivers died after [633, 349, 954, 579] a head-on collision in Essa Township, south of Barrie, Ont., early Saturday.') 2023-11-03 19:54:19.455 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:54:19.455 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'chair:[762, 724, 878, 943],[411, 716, 506, 894],[528, 832, 683, 999],[568, 734, 672, 979];human body:[106, 570, 231, 918],[362, 543, 404, 826],[852, 237, 928, 407];woman:[106, 562, 228, 914],[388, 550, 426, 634],[852, 232, 934, 413];table:[867, 703, 1000, 948],[305, 807, 492, 997],[477, 733, 587, 938];man:[104, 582, 225, 911],[360, 545, 398, 815];human leg:[198, 761, 290, 917],[368, 656, 399, 817];human hair:[149, 603, 230, 649];human arm:[158, 656, 242, 761],[357, 589, 398, 657].') 2023-11-03 19:54:22.128 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:54:22.128 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Person:[548, 527, 933, 1000],[001, 689, 109, 1000],[108, 725, 427, 1000],[429, 744, 656, 1000],[867, 707, 999, 1000].') 2023-11-03 19:54:37.337 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:54:37.338 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Scissors:[342, 721, 384, 876],[243, 238, 266, 266];Tape:[616, 736, 747, 795];Storage box:[000, 403, 197, 615];Desk:[000, 549, 999, 1000],[753, 241, 893, 490],[768, 330, 1000, 781];shelf:[000, 162, 119, 433],[226, 275, 379, 612],[957, 008, 993, 174];Chair:[903, 254, 999, 394];Person:[955, 277, 1000, 411],[898, 174, 993, 388],[472, 159, 873, 1000],[256, 183, 572, 808],[515, 001, 771, 474],[330, 001, 569, 290],[331, 140, 468, 347],[341, 083, 387, 152],[316, 136, 343, 210],[257, 134, 335, 282],[209, 103, 261, 289],[206, 062, 225, 114],[267, 075, 300, 117],[324, 084, 345, 117],[000, 001, 112, 402];Bracelet:[482, 797, 544, 834],[289, 696, 330, 735];Moniter:[703, 039, 921, 254];Cell Phone:[247, 239, 267, 265];Glasses:[551, 075, 663, 154],[405, 025, 443, 064],[343, 132, 379, 153];Gloves:[345, 181, 386, 223];Other Shoes:[911, 359, 939, 386].') 2023-11-03 19:54:44.980 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:54:44.981 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[290, 174, 434, 545] and track its trajectory.Any trajectory details should be arranged according to the Frame t:[xmin,ymin,xmax,ymax] format for clarity.', 'This is a fireboatFrame1:[290, 174, 434, 545];Frame2:[290, 157, 452, 565];Frame3:[279, 165, 460, 602];Frame4:[255, 116, 458, 600].') 2023-11-03 19:54:49.899 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:54:49.899 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'There are so many different ways you can make [636, 781, 883, 917] [147, 758, 390, 878] [088, 258, 393, 420] [597, 526, 959, 655] [077, 510, 414, 687] headbands. Check out this post to learn how to make headbands with fabric 10 different ways.') 2023-11-03 19:55:09.338 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:55:09.338 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect human face,human arm,man and human hair in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'human face:[237, 096, 388, 363],[440, 065, 553, 284],[576, 205, 691, 434],[693, 013, 717, 118],[840, 085, 861, 156];human arm:[772, 410, 873, 953],[056, 410, 222, 999],[123, 240, 391, 417],[641, 107, 803, 325],[706, 391, 830, 480],[834, 119, 996, 703];man:[045, 062, 433, 979],[563, 000, 671, 262],[647, 000, 825, 415],[817, 000, 999, 999];human hair:[570, 144, 744, 503],[224, 050, 390, 234],[406, 000, 608, 239],[574, 000, 644, 080],[702, 000, 781, 089],[850, 027, 935, 137].') 2023-11-03 19:55:15.492 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:55:15.493 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'balloon:[567, 303, 785, 999];woman:[356, 707, 421, 866];man:[013, 812, 141, 999],[055, 829, 199, 999],[113, 727, 236, 979],[213, 575, 261, 638],[221, 746, 316, 930],[223, 640, 304, 767],[229, 786, 312, 999],[281, 678, 362, 863],[300, 571, 319, 644],[317, 598, 340, 691],[321, 614, 371, 737],[358, 648, 404, 742],[359, 744, 531, 924],[440, 428, 483, 575],[480, 620, 528, 704],[488, 689, 521, 847],[517, 570, 550, 665],[568, 518, 590, 585],[568, 560, 622, 763],[610, 445, 658, 518],[665, 315, 794, 780],[738, 313, 783, 378],[756, 040, 999, 954],[802, 383, 998, 999];jeans:[179, 882, 210, 967],[454, 507, 479, 575],[725, 522, 796, 745],[818, 543, 931, 931].') 2023-11-03 19:55:41.276 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:55:41.277 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Clock:[241, 354, 316, 455];shelf:[349, 303, 456, 488],[503, 270, 647, 501],[127, 365, 176, 559],[501, 608, 642, 727],[465, 608, 501, 738],[357, 582, 422, 613],[426, 300, 504, 423];Couch:[001, 678, 941, 999];Chair:[099, 575, 260, 747],[045, 557, 173, 812],[001, 537, 123, 819];Faucet:[213, 548, 249, 588];Cleaning Products:[408, 548, 432, 588];Cup:[159, 521, 199, 561];Projector:[011, 001, 172, 102];Tissue:[611, 576, 641, 616].') 2023-11-03 19:55:52.721 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:55:52.721 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Cat:[338, 206, 849, 934].') 2023-11-03 19:56:11.642 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:56:11.642 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Concentrated boy [328, 011, 727, 873] plays in developing game at the table [078, 556, 999, 941] with colorful cards [853, 818, 997, 937] [477, 819, 579, 892] [357, 738, 457, 803] [518, 770, 616, 838] [439, 749, 534, 816] [388, 801, 488, 872] with his parent indoors poster') 2023-11-03 19:56:48.968 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:56:48.969 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Machinery Vehicle:[002, 018, 673, 984];Stroller:[905, 448, 989, 653];Person:[033, 245, 089, 426],[295, 206, 458, 1000],[509, 302, 624, 841],[591, 164, 752, 999],[709, 439, 999, 999],[679, 225, 802, 658],[736, 422, 778, 838],[771, 310, 860, 651];Bracelet:[760, 581, 789, 612];Watch:[761, 443, 780, 458];Hat:[592, 167, 677, 260],[360, 211, 418, 267],[939, 245, 975, 283],[055, 246, 088, 274],[799, 321, 813, 346];Glasses:[372, 261, 415, 281],[939, 266, 963, 288],[866, 290, 884, 304],[682, 267, 700, 284];Boots:[368, 962, 458, 999],[622, 934, 689, 999];Sandals:[730, 873, 751, 902],[570, 792, 620, 823],[505, 801, 549, 838];Sneakers:[746, 782, 775, 835],[606, 732, 616, 781],[965, 429, 977, 449],[983, 430, 996, 456];Satchel:[782, 453, 794, 493];Backpack:[521, 364, 600, 424];Slippers:[950, 445, 976, 459];Belt:[907, 435, 945, 453];Other Shoes:[867, 426, 888, 445].') 2023-11-03 19:57:34.132 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:57:34.133 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Guitar in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Guitar:[372, 502, 632, 941].') 2023-11-03 19:57:59.683 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 19:57:59.684 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'person:[236, 242, 862, 625],[477, 491, 999, 999],[609, 500, 816, 685];clothing:[379, 313, 821, 565];microphone:[332, 315, 414, 377],[371, 873, 450, 933].') 2023-11-03 19:58:12.878 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:58:12.879 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[400, 534, 447, 718],personFrame1:[285, 539, 329, 725],personFrame1:[490, 528, 535, 718],personFrame1:[070, 541, 129, 754],personFrame1:[622, 504, 659, 666],personFrame1:[746, 500, 788, 665],personFrame1:[636, 518, 680, 702],personFrame1:[784, 516, 830, 691],personFrame1:[565, 526, 614, 713],personFrame1:[852, 518, 899, 690] in this video clip.Please strictly follow the format categoryFrame t:[xmin,ymin,xmax,ymax] to answer the question.', 'personFrame1:[400, 534, 447, 718];Frame2:[386, 536, 428, 723],personFrame1:[285, 539, 329, 725];Frame2:[268, 540, 310, 729],personFrame1:[490, 528, 535, 718];Frame2:[470, 530, 516, 720],personFrame1:[070, 541, 129, 754];Frame2:[061, 551, 118, 751],personFrame1[622, 504, 659, 666];Frame2:No object,personFrame1:[746, 500, 788, 665];Frame2:[754, 500, 789, 666],personFrame1:[636, 518, 680, 702];Frame2:[622, 520, 667, 708],personFrame1:[784, 516, 830, 691];Frame2:[770, 515, 816, 690],personFrame1:[565, 526, 614, 713];Frame2:[553, 526, 593, 716],personFrame1:[852, 518, 899, 690];Frame2:[841, 519, 882, 688].') 2023-11-03 19:59:06.003 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:59:06.003 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[481, 066, 771, 996] Not even the bell ringer knew that [427, 362, 496, 433] a $1000 bill was donated in [399, 601, 508, 838] her bucket') 2023-11-03 19:59:16.635 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 19:59:16.636 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Hat:[366, 416, 415, 503],[405, 339, 457, 414];Person:[340, 413, 456, 926],[328, 338, 517, 917],[316, 284, 417, 937],[206, 262, 346, 938].') 2023-11-03 19:59:19.471 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:59:19.472 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Car:[015, 150, 631, 763];Bicycle:[538, 235, 611, 354];Scooter:[933, 259, 999, 458];Person:[659, 149, 738, 415].') 2023-11-03 19:59:33.498 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 19:59:33.498 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[280, 504, 698, 641] and track its trajectory.Incorporate the Frame t:[xmin,ymin,xmax,ymax] format when presenting trajectories in your response.', 'This is a stealth bomberFrame1:[280, 504, 698, 641];Frame2:[256, 580, 713, 740];Frame3:[234, 461, 739, 650];Frame4:[236, 469, 783, 673];Frame5:[182, 511, 794, 743].') 2023-11-03 20:00:14.161 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:00:14.162 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given frame1: and frame2:,track carFrame1:[616, 518, 712, 619],carFrame1:[066, 428, 183, 531],carFrame1:[294, 458, 342, 518] in this video clip.It's imperative that you stick to the categoryFrame t:[xmin,ymin,xmax,ymax] format for your answer.", 'carFrame1:[616, 518, 712, 619];Frame2:[620, 515, 720, 618],carFrame1[066, 428, 183, 531];Frame2:No object,carFrame1:[294, 458, 342, 518];Frame2:[000, 405, 183, 565].') 2023-11-03 20:00:19.085 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:00:19.085 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Lamp:[707, 052, 774, 104],[206, 377, 333, 452],[384, 394, 505, 468],[523, 425, 646, 493],[656, 419, 776, 485],[778, 437, 887, 500],[889, 448, 999, 579],[033, 361, 176, 547];Potted Plant:[180, 325, 847, 436];Stool:[487, 801, 624, 963],[439, 775, 537, 933],[319, 808, 450, 984],[135, 801, 265, 971],[896, 760, 1000, 905],[477, 749, 582, 910];Desk:[274, 782, 449, 969],[519, 767, 667, 880];Person:[170, 701, 389, 921],[276, 699, 417, 845];Awning:[154, 514, 1000, 673];Blackboard:[694, 662, 807, 717].') 2023-11-03 20:01:26.714 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:01:26.714 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2 and frame3, please tell me what this oneframe:1:[472, 548, 877, 685];frame:2:[472, 550, 855, 686];frame:3:[468, 553, 849, 686] is doing?Ensure your answer is to the point, taking into account the provided image frames and trajectory coordinates.', 'This is a/an tiger eat') 2023-11-03 20:01:35.361 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:01:35.361 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Lamp:[351, 001, 416, 083],[804, 031, 864, 147],[895, 001, 935, 073],[891, 173, 936, 256],[961, 153, 991, 211],[795, 205, 813, 276];Desk:[237, 622, 536, 999],[931, 682, 1000, 740],[000, 590, 288, 1000];Cabinet:[821, 556, 917, 674];Bench:[936, 884, 1000, 999];Frame:[959, 426, 1000, 450];Person:[249, 336, 433, 647];Speaker:[849, 015, 892, 074];Bow Tie:[299, 434, 337, 501];Backpack:[938, 733, 999, 813];Bottle:[341, 432, 367, 533],[447, 522, 476, 651];Cup:[409, 357, 433, 407],[411, 592, 431, 629];Bowl:[337, 594, 397, 668];Barrel:[177, 180, 337, 358],[324, 227, 440, 383],[432, 266, 517, 400],[514, 308, 586, 419],[154, 373, 300, 551],[329, 388, 412, 518],[409, 409, 488, 518],[484, 424, 549, 518],[541, 433, 587, 520],[110, 574, 240, 598];Plate:[398, 630, 464, 662].') 2023-11-03 20:02:08.490 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:02:08.491 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A charity worker [419, 017, 635, 968] educates children [842, 300, 997, 942] [754, 285, 960, 872] [109, 446, 347, 997] [117, 324, 334, 778] [295, 252, 460, 766] [667, 239, 811, 957] on how to prevent and identify the Ebola virus in their communities at Freetown, Sierra Leone') 2023-11-03 20:02:39.046 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:02:39.046 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Stetson Allie [299, 067, 705, 982] will move from first base to the outfield [007, 007, 992, 513] this season (Mark Olson / MiLB') 2023-11-03 20:02:40.719 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 20:02:40.719 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'human head:[550, 139, 666, 280];man:[501, 162, 726, 385];window:[843, 641, 911, 777],[000, 296, 042, 485],[666, 265, 727, 355],[794, 245, 866, 350],[823, 417, 891, 565],[938, 225, 999, 336],[953, 401, 999, 556],[973, 639, 999, 795];clothing:[485, 136, 736, 386];human face:[567, 159, 645, 308].') 2023-11-03 20:02:56.600 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:02:56.600 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Person:[042, 099, 575, 999],[527, 122, 999, 1000].') 2023-11-03 20:03:07.085 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:03:07.085 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Samaritan's Purse DC-8 cargo plane brought the disaster team to northern Italy, along with [289, 232, 612, 747] 20 tonnes of tents and medical equipment they would need.") 2023-11-03 20:03:09.572 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:03:09.573 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Bicycle:[000, 458, 187, 750];Scooter:[821, 525, 1000, 1000];Rickshaw:[320, 269, 972, 894];Street Lights:[530, 015, 614, 297],[239, 064, 282, 188];Person:[390, 299, 659, 923],[680, 243, 734, 345],[616, 155, 854, 592],[556, 188, 780, 573],[866, 183, 893, 225],[849, 195, 866, 224],[634, 196, 653, 230],[216, 182, 245, 223];Umbrella:[563, 074, 818, 207];Other Shoes:[434, 881, 496, 923],[630, 721, 659, 814],[557, 541, 579, 571],[562, 546, 608, 573];Bakset:[889, 850, 1000, 999].') 2023-11-03 20:03:34.120 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:03:34.120 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Whiteboard:[002, 002, 238, 292],[246, 002, 572, 306],[569, 002, 920, 266];Toiletry:[701, 749, 857, 999];Person:[002, 214, 035, 467],[166, 267, 432, 825],[238, 407, 560, 725],[333, 043, 490, 721],[475, 428, 771, 986],[769, 354, 906, 743],[833, 314, 999, 998];Other Shoes:[167, 746, 209, 823],[401, 638, 497, 694],[506, 606, 558, 724],[297, 553, 333, 605],[796, 686, 835, 734];Sandals:[474, 902, 556, 982];Chair:[002, 460, 101, 806];Stool:[036, 441, 092, 733];Desk:[193, 398, 403, 775],[258, 450, 574, 888],[383, 511, 759, 960],[790, 493, 923, 791],[651, 448, 799, 562],[523, 409, 775, 478].') 2023-11-03 20:03:40.519 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:03:40.519 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[192, 651, 265, 825] [519, 655, 589, 829] [804, 645, 875, 827] [562, 638, 624, 827] [601, 640, 679, 829] [749, 639, 818, 829] [379, 652, 465, 826] [669, 642, 739, 828] Crew gathers at [002, 111, 996, 965] the end of the timber frame barn raising') 2023-11-03 20:03:49.100 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:03:49.100 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'man:[363, 401, 608, 997],[851, 000, 999, 394];plant:[000, 000, 999, 999];clothing:[415, 586, 603, 994],[910, 000, 999, 196].') 2023-11-03 20:03:51.796 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:03:51.796 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Couch,Picture and Flower in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Couch:[019, 497, 479, 929],[687, 405, 987, 617],[222, 400, 502, 563];Frame:[451, 209, 502, 293],[786, 209, 846, 298],[005, 183, 079, 301];Flower:[490, 269, 604, 474].') 2023-11-03 20:04:00.898 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:04:00.899 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[424, 218, 593, 817] and track its trajectory.Incorporate the Frame t:[xmin,ymin,xmax,ymax] format when presenting trajectories in your response.', 'This is a rhodesian ridgebackFrame1:[424, 218, 593, 817];Frame2:[328, 187, 613, 922];Frame3:[213, 070, 605, 974];Frame4:[218, 077, 618, 951].') 2023-11-03 20:04:48.168 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:04:48.168 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'woman:[467, 381, 796, 990],[000, 485, 208, 810],[000, 755, 266, 999],[303, 402, 491, 864],[678, 279, 801, 513];chair:[291, 665, 425, 999],[377, 836, 575, 999];man:[033, 434, 306, 802];girl:[294, 404, 498, 861],[460, 407, 874, 999];television:[036, 122, 406, 441];clothing:[000, 669, 273, 999],[000, 840, 275, 999],[000, 585, 289, 958],[032, 530, 298, 803],[255, 730, 428, 999],[295, 548, 485, 880],[461, 646, 501, 711],[461, 608, 874, 999],[687, 365, 803, 505].') 2023-11-03 20:05:05.160 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:05:05.160 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect shelf,Satchel and Backpack in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'shelf:[403, 001, 999, 883];Handbag:[735, 414, 827, 484];Backpack:[103, 634, 253, 916].') 2023-11-03 20:05:23.504 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:05:23.505 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'goggles:[546, 331, 756, 466];woman:[436, 155, 986, 975];sunglasses:[526, 352, 732, 456];man:[319, 358, 564, 935];human face:[540, 255, 734, 654].') 2023-11-03 20:05:34.039 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:05:34.040 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Carlos Onan Galo Perez [268, 229, 471, 802] (with a yellow rope [281, 235, 627, 979]) and members [183, 453, 257, 814] [884, 366, 997, 674] [784, 434, 889, 691] [594, 408, 793, 743] [365, 502, 498, 788] [518, 479, 657, 760] of his family atop a freight train [000, 636, 999, 996] that migrants call La Bestia (the Beast). The train travels through Mexico toward the U.S.') 2023-11-03 20:06:10.888 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:06:10.888 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[406, 258, 494, 588],truckFrame1:[629, 001, 999, 965],carFrame1:[382, 189, 437, 590],carFrame1:[088, 591, 226, 827],truckFrame1:[112, 434, 285, 762] in this video clip.For clarity in your response, always adopt the classFrame t:[xmin,ymin,xmax,ymax] format.', 'carFrame1:[406, 258, 494, 588];Frame2:[406, 250, 499, 597],truckFrame1:[629, 001, 999, 965];Frame2:[657, 001, 999, 962],carFrame1:[382, 189, 437, 590];Frame2:[380, 178, 437, 597],carFrame1:[088, 591, 226, 827];Frame2:[065, 596, 219, 851],truckFrame1:[112, 434, 285, 762];Frame2:[102, 431, 284, 762].') 2023-11-03 20:06:44.218 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:06:44.218 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, can you tell me what thisframe:1:[000, 365, 068, 567];frame:2:[015, 382, 081, 583];frame:3:[013, 386, 078, 585] is?Craft a concise reply using the image frames and trajectory specifics you have at hand.', 'This is a/an step to the right-side') 2023-11-03 20:06:51.348 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:06:51.348 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Over 40 fashion blogger Wardrobe Oxygen [159, 045, 628, 984] in a cold shoulder sweater [171, 189, 633, 528], JAG jeans [192, 491, 546, 934] and Payless sandals [429, 832, 535, 979] [322, 916, 423, 985]') 2023-11-03 20:07:10.165 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:07:10.165 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track monkeyFrame1:[606, 001, 817, 679],monkeyFrame1:[316, 659, 453, 1000] in this video clip.Keep your response consistent with the classFrame t:[xmin,ymin,xmax,ymax] format.', 'monkeyFrame1:[606, 001, 817, 679];Frame2:[604, 031, 750, 862],monkeyFrame1:[316, 659, 453, 1000];Frame2:[328, 827, 460, 1000].') 2023-11-03 20:07:28.647 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:07:28.648 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given frame1: and frame2:,track personFrame1:[192, 730, 238, 856],personFrame1:[166, 386, 199, 475],personFrame1:[329, 348, 361, 443] in this video clip.It's imperative that you stick to the categoryFrame t:[xmin,ymin,xmax,ymax] format for your answer.", 'personFrame1:[192, 730, 238, 856];Frame2:[195, 727, 240, 862],personFrame1:[166, 386, 199, 475];Frame2:[179, 397, 217, 483],personFrame1:[329, 348, 361, 443];Frame2:[323, 366, 358, 450].') 2023-11-03 20:07:30.566 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:07:30.566 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[099, 323, 186, 726],[383, 095, 567, 918],[593, 289, 665, 782],[631, 304, 768, 942],[721, 280, 785, 406],[748, 312, 830, 872];man:[000, 302, 020, 524],[000, 288, 041, 557],[015, 275, 079, 609],[074, 271, 126, 622],[131, 260, 174, 375],[133, 273, 200, 426],[179, 277, 221, 363],[181, 245, 373, 999],[183, 260, 251, 450],[322, 247, 409, 676],[484, 219, 541, 688],[509, 264, 635, 912],[644, 254, 689, 409],[695, 273, 783, 297],[783, 253, 996, 999];tree:[048, 228, 171, 268],[523, 224, 793, 304],[828, 195, 956, 281];jeans:[013, 444, 032, 551],[028, 461, 078, 596],[091, 478, 119, 609],[120, 557, 173, 716],[205, 682, 345, 999],[403, 467, 495, 775],[652, 681, 738, 930],[818, 704, 946, 999].') 2023-11-03 20:07:48.114 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:07:48.114 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'toy:[038, 127, 381, 936];clothing:[063, 310, 367, 881];flowerpot:[396, 417, 640, 654];person:[036, 178, 378, 949];houseplant:[110, 011, 983, 703].') 2023-11-03 20:08:10.474 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:08:10.474 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'person:[268, 388, 338, 680],[710, 497, 855, 876];tree:[000, 023, 304, 443],[426, 000, 584, 332];footwear:[694, 758, 747, 821].') 2023-11-03 20:08:17.328 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:08:17.328 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Enjoy the flavors of fall with [219, 289, 566, 998] this grilled pork loin, served over [033, 086, 811, 993] a bed of farro with\xa0[512, 672, 642, 828] [458, 523, 561, 643] [109, 458, 224, 608] [361, 288, 476, 479] [212, 200, 331, 333] roasted harvest vegetables and [586, 002, 743, 124] [130, 215, 266, 386] ripe plums.') 2023-11-03 20:08:23.515 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:08:23.515 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[531, 509, 544, 564] and track its trajectory.Use the specified Frame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'This is a rhodesian ridgebackFrame1:[531, 509, 544, 564];Frame2:[530, 502, 545, 557];Frame3:[525, 498, 543, 550];Frame4:[521, 497, 539, 549].') 2023-11-03 20:08:32.677 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:08:32.678 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'AWESOME AFRICA [002, 003, 995, 996] a South African gift shop offering authentic products made by local crafters') 2023-11-03 20:09:05.213 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:09:05.214 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'car:[118, 719, 323, 879],[000, 697, 080, 793],[400, 529, 498, 566];building:[000, 126, 413, 756],[390, 261, 786, 556];window:[185, 450, 220, 539],[188, 304, 234, 392],[240, 315, 276, 401],[245, 450, 271, 535],[286, 321, 315, 407],[290, 452, 317, 547],[333, 343, 353, 417],[334, 460, 358, 537],[365, 350, 384, 413],[373, 470, 389, 539].') 2023-11-03 20:09:14.487 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:09:14.488 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Stuffed Toy:[141, 096, 352, 589],[297, 217, 427, 567],[355, 310, 488, 594],[401, 421, 533, 745],[475, 325, 611, 649],[454, 108, 596, 440],[336, 087, 490, 348],[587, 181, 797, 613];Book:[747, 236, 929, 552],[001, 313, 212, 664];Person:[115, 209, 142, 288];Plate:[740, 618, 812, 689],[696, 665, 741, 703],[572, 683, 643, 763],[525, 766, 571, 813],[298, 589, 393, 692],[366, 636, 431, 704],[339, 718, 383, 754];Cup:[689, 627, 734, 684],[532, 733, 572, 800],[345, 682, 383, 743],[425, 736, 472, 841],[400, 787, 452, 893],[390, 848, 443, 971],[280, 733, 323, 829],[269, 786, 318, 847],[266, 840, 322, 952];Spoon:[605, 685, 625, 749];Tea pot:[303, 563, 378, 663];Jug:[287, 754, 401, 959];Desk:[000, 414, 1000, 999].') 2023-11-03 20:09:42.610 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:09:42.611 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'SUV:[000, 445, 562, 827];Air Conditioner:[025, 223, 090, 284];Flower:[551, 657, 614, 694],[757, 653, 834, 731],[658, 794, 914, 944];Vase:[549, 684, 606, 737],[773, 705, 816, 737],[923, 713, 953, 742],[685, 884, 902, 999];Bench:[937, 667, 988, 743],[068, 527, 175, 581];Chair:[186, 519, 208, 579].') 2023-11-03 20:09:49.573 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:09:49.573 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'man:[500, 821, 576, 991],[293, 861, 331, 970],[378, 863, 410, 937],[455, 825, 501, 976],[545, 834, 566, 970],[708, 800, 820, 999];window:[045, 468, 100, 676],[015, 120, 053, 204],[161, 525, 211, 670],[239, 558, 270, 687],[353, 054, 411, 131],[461, 075, 500, 141],[545, 086, 594, 150],[835, 141, 861, 204],[913, 000, 960, 076];house:[000, 037, 110, 213];tree:[035, 135, 148, 220],[629, 145, 812, 365];clothing:[295, 862, 331, 973],[356, 864, 387, 952],[381, 857, 413, 920],[455, 835, 507, 980],[495, 826, 565, 999],[568, 868, 627, 999],[702, 822, 816, 999].') 2023-11-03 20:10:16.168 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:10:16.168 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[311, 294, 764, 824] and track its trajectory.Incorporate the Frame t:[xmin,ymin,xmax,ymax] format when presenting trajectories in your response.', 'This is a slothFrame1:[311, 294, 764, 824];Frame2:[413, 352, 640, 924];Frame3:[303, 580, 532, 997];Frame4:[075, 636, 505, 932].') 2023-11-03 20:10:38.934 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:10:38.934 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "I know he's a player, but I want to be the girl [005, 006, 990, 997] that he hangs up hes jersey for and leaves the game.") 2023-11-03 20:10:46.669 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:10:46.669 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Portrait of a group of happy multiracial university graduates with [626, 510, 669, 712] [544, 521, 584, 758] [693, 504, 738, 728] [379, 561, 420, 807] [481, 565, 525, 786] [309, 586, 356, 844] diplomas in [545, 682, 600, 777] [367, 717, 430, 810] [472, 716, 538, 814] [270, 771, 352, 854] [615, 678, 679, 769] [694, 676, 762, 789] their hands.') 2023-11-03 20:11:02.828 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:11:02.828 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Person:[272, 340, 308, 533],[305, 322, 348, 450],[847, 343, 993, 597],[910, 678, 1000, 889],[903, 557, 997, 690],[514, 722, 686, 958],[655, 648, 727, 722],[307, 563, 443, 857],[744, 916, 894, 999].') 2023-11-03 20:11:04.268 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:11:04.268 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'dress:[428, 501, 997, 998];woman:[000, 000, 997, 998];man:[000, 092, 997, 998].') 2023-11-03 20:12:23.625 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:12:23.625 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Handbag:[116, 599, 152, 668],[499, 723, 579, 843],[762, 540, 798, 584],[401, 594, 444, 633];Sneakers:[268, 818, 301, 853],[848, 870, 873, 916];Leather Shoes:[425, 792, 473, 826],[596, 759, 634, 795];Picture:[081, 096, 330, 421],[509, 096, 645, 332],[787, 078, 988, 349];Chair:[867, 837, 972, 972],[719, 822, 871, 980],[627, 871, 729, 999],[438, 911, 535, 999],[165, 685, 336, 998],[001, 711, 147, 999],[317, 655, 481, 998],[481, 631, 629, 1000],[609, 613, 791, 931],[756, 449, 797, 561],[627, 469, 667, 575],[479, 480, 519, 593],[332, 490, 369, 611],[240, 956, 323, 999],[768, 590, 896, 846],[908, 579, 998, 826];Desk:[001, 541, 994, 883];Person:[001, 475, 170, 856],[218, 440, 336, 853],[342, 416, 477, 824],[491, 422, 633, 799],[642, 396, 784, 781],[771, 400, 859, 552],[778, 483, 935, 908],[920, 516, 999, 819],[895, 702, 1000, 954],[118, 156, 163, 365],[826, 122, 948, 311],[539, 133, 613, 293],[826, 534, 876, 575],[167, 162, 224, 354],[226, 159, 292, 353];Book:[001, 670, 070, 711],[049, 693, 117, 739],[024, 610, 106, 686],[193, 672, 257, 716],[435, 637, 517, 686],[577, 574, 659, 609],[655, 614, 715, 652],[334, 653, 407, 692],[447, 592, 519, 630],[219, 623, 284, 663],[275, 610, 350, 650],[694, 564, 759, 596],[923, 938, 998, 998];Backpack:[116, 597, 157, 669].') 2023-11-03 20:12:31.752 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:12:31.753 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("I urge you to inspect the picture and supply a response to my question 'QUESTION: Why has [629, 544, 928, 734] landed on the water ?'. Please share your thought process, and reveal the location of identified objects using the format [xmin,ymin,xmax,ymax].", '[196, 258, 347, 712] and [176, 665, 453, 710] have been damaged and can not be sailed back , and they have all retreated to [406, 203, 481, 650] , but [406, 203, 481, 650] can not carry them all safely . [629, 544, 928, 734] will take [355, 641, 387, 692] and [434, 654, 444, 684] and [301, 654, 320, 687] to shore .') 2023-11-03 20:12:36.639 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:12:36.640 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Basin:[588, 830, 675, 872],[703, 878, 762, 935];Vase:[703, 878, 764, 933],[614, 319, 692, 356],[282, 362, 477, 409];Flower:[588, 197, 747, 334],[344, 318, 389, 372];Potted Plant:[494, 275, 606, 346],[371, 182, 545, 344],[121, 293, 577, 489].') 2023-11-03 20:12:58.199 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:12:58.200 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the the person in the leading position among the pair of cyclists traveling in the same direction..To ensure accuracy, apply the TrackiFrame t:[xmin,ymin,xmax,ymax] template for every trajectory in your response.', 'There is one the person in the leading position among the pair of cyclists traveling in the same direction..Track1frame:1:[483, 387, 543, 644];frame:2:[482, 390, 543, 647];frame:3:[474, 395, 542, 644].') 2023-11-03 20:13:00.826 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:13:00.826 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Person:[000, 244, 216, 798],[248, 286, 556, 730],[463, 270, 639, 614],[756, 278, 884, 458],[924, 159, 1000, 394],[776, 187, 905, 460];Tent:[789, 101, 958, 196];Other Shoes:[090, 752, 139, 799];Hat:[001, 247, 098, 313],[413, 305, 497, 383];Head Phone:[380, 286, 502, 427],[507, 288, 587, 365];Microphone:[597, 401, 678, 486],[434, 447, 526, 569];Laptop:[725, 378, 866, 499],[684, 444, 817, 540],[385, 522, 684, 803];Converter:[679, 641, 756, 686];Extention Cord:[757, 576, 803, 660];Camera:[396, 626, 451, 779];Tripod:[361, 777, 458, 978];Tablet:[189, 647, 402, 846];Bottle:[213, 677, 301, 1000],[147, 736, 222, 966],[195, 703, 233, 867];Plate:[706, 185, 746, 233],[036, 961, 127, 999];Chair:[066, 571, 247, 806],[233, 525, 262, 664],[693, 383, 741, 490],[716, 338, 775, 441],[897, 338, 929, 405];Storage box:[930, 365, 964, 412],[969, 359, 999, 414];Desk:[844, 431, 999, 769],[000, 481, 965, 1000].') 2023-11-03 20:13:17.795 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:13:17.796 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Person:[002, 012, 634, 526],[684, 003, 999, 604];Bottle:[751, 269, 866, 569];Spoon:[064, 706, 425, 863],[027, 899, 388, 999].') 2023-11-03 20:13:34.515 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:13:34.515 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Boat:[775, 305, 1000, 930];Flag:[134, 396, 142, 404],[160, 344, 168, 405],[267, 274, 289, 304],[268, 310, 284, 331],[575, 528, 580, 534],[628, 601, 632, 609],[870, 466, 901, 645];Person:[819, 566, 859, 780],[841, 573, 856, 608];Lifesaver:[922, 533, 976, 659],[969, 522, 999, 658],[348, 549, 359, 588],[396, 551, 405, 588],[394, 557, 403, 594],[590, 601, 593, 611],[591, 612, 594, 621],[578, 602, 581, 613],[577, 613, 580, 621];Satchel:[832, 605, 858, 676];Glasses:[851, 590, 856, 596].') 2023-11-03 20:13:51.413 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:13:51.413 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Caledonia’s success comes from [578, 331, 884, 996] [291, 313, 639, 996] its people') 2023-11-03 20:13:55.073 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:13:55.073 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[343, 003, 995, 677] A stand mixer whipping up pavlova') 2023-11-03 20:14:10.524 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:14:10.524 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'drink:[750, 401, 845, 803],[474, 629, 584, 723];human hair:[433, 219, 595, 438],[012, 160, 218, 397],[272, 219, 336, 274],[281, 277, 348, 337],[341, 212, 426, 298];bottle:[721, 409, 841, 804];clothing:[009, 304, 315, 840];human hand:[020, 488, 108, 629],[432, 604, 496, 643],[449, 541, 510, 608];woman:[000, 169, 331, 999],[373, 219, 645, 647];human head:[018, 168, 228, 440],[166, 177, 286, 351],[277, 150, 322, 217],[285, 280, 370, 351],[345, 217, 416, 293],[431, 213, 569, 398];mammal:[000, 172, 333, 924],[166, 178, 433, 370],[264, 150, 323, 235],[275, 280, 369, 368],[276, 225, 340, 285],[347, 217, 424, 293],[374, 218, 638, 657];man:[203, 183, 441, 346];wine glass:[440, 678, 548, 925];human face:[141, 218, 225, 403],[215, 220, 289, 332],[435, 283, 481, 380];human arm:[000, 357, 329, 677],[256, 325, 355, 375],[356, 287, 435, 351],[374, 391, 526, 606],[426, 402, 632, 637].') 2023-11-03 20:14:17.276 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:14:17.276 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Bottle:[095, 552, 102, 590];Camera:[904, 548, 925, 611];Person:[991, 432, 999, 594],[873, 489, 943, 695],[828, 381, 891, 564],[780, 362, 839, 567],[609, 085, 804, 374],[597, 224, 807, 978],[563, 267, 712, 910],[543, 374, 633, 564],[429, 235, 540, 886],[348, 244, 568, 922],[245, 385, 283, 524],[199, 029, 504, 956],[419, 436, 451, 608],[040, 374, 095, 591],[000, 364, 038, 562];Watch:[792, 297, 804, 311];Bracelet:[073, 483, 081, 495];Hat:[000, 364, 022, 387];Slippers:[041, 580, 053, 589],[080, 578, 093, 590];Backpack:[093, 461, 121, 520];Sneakers:[241, 863, 289, 959],[361, 886, 402, 948],[347, 852, 405, 906],[402, 871, 446, 922],[434, 799, 490, 882],[474, 805, 507, 876],[575, 856, 622, 908],[597, 828, 654, 939],[671, 856, 711, 910],[676, 907, 706, 932],[456, 897, 504, 954],[402, 921, 445, 953],[723, 893, 805, 976];Sandals:[554, 547, 569, 563],[614, 547, 633, 562];Other Shoes:[881, 675, 901, 697],[917, 678, 936, 695],[436, 584, 451, 605];Belt:[719, 273, 774, 289].') 2023-11-03 20:14:19.541 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:14:19.541 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'flower:[291, 453, 346, 650],[053, 036, 251, 648],[282, 000, 520, 550],[586, 000, 801, 816].') 2023-11-03 20:14:27.692 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:14:27.692 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the sheep moving right?For clarity, represent trajectories using the TrackiFrame t:[xmin,ymin,xmax,ymax] format in your response.', 'There are 6 sheep moving right.Track1frame:1:[169, 350, 491, 975];frame:2:[377, 294, 579, 814];frame:3:[394, 274, 624, 803],Track2frame:1:[573, 568, 633, 635];frame:2:[795, 270, 902, 596];frame:3:[792, 262, 904, 591],Track3frame:1:[663, 315, 875, 645];frame:2:[784, 297, 994, 641];frame:3:[826, 300, 999, 643],Track4frame:1:[492, 207, 878, 772];frame:2:[485, 225, 815, 778];frame:3:[481, 225, 810, 779],Track5frame:1:[652, 187, 849, 583];frame:2:[642, 189, 824, 589];frame:3:[637, 188, 825, 591],Track6frame:1:[000, 332, 403, 949];frame:2:[137, 239, 497, 836];frame:3:[162, 228, 500, 831].') 2023-11-03 20:15:10.538 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:15:10.538 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[231, 355, 680, 783] and track its trajectory.For the trajectories included in the answer, please use the format Frame t:[xmin,ymin,xmax,ymax].', 'This is a pole horseFrame1:[231, 355, 680, 783];Frame2:[222, 257, 664, 741];Frame3:[120, 312, 593, 794];Frame4:[180, 287, 607, 746];Frame5:[094, 294, 604, 734].') 2023-11-03 20:15:27.367 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:15:27.368 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[817, 525, 985, 662],carFrame1:[456, 538, 536, 654],carFrame1:[228, 557, 284, 628],carFrame1:[260, 547, 299, 617],carFrame1:[517, 458, 815, 808] in this video clip.Adhere strictly to the format categoryFrame t:[xmin,ymin,xmax,ymax] when providing your answer.', 'carFrame1[817, 525, 985, 662];Frame2:No object,carFrame1:[456, 538, 536, 654];Frame2:[495, 541, 657, 742],carFrame1:[228, 557, 284, 628];Frame2:[174, 572, 261, 682],carFrame1:[260, 547, 299, 617];Frame2:[237, 555, 292, 648],carFrame1:[517, 458, 815, 808];Frame2:[816, 583, 999, 994].') 2023-11-03 20:15:29.578 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 20:15:29.579 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect person and rifle in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'person:[000, 084, 580, 804],[000, 000, 922, 603];rifle:[298, 288, 841, 745].') 2023-11-03 20:15:54.543 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:15:54.543 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Lamp:[298, 322, 341, 431],[138, 154, 237, 337],[144, 350, 226, 423],[000, 323, 046, 389],[398, 001, 573, 354],[642, 199, 742, 434];Person:[360, 597, 485, 1000],[614, 719, 856, 1000],[017, 449, 076, 541],[108, 457, 144, 557],[956, 570, 1000, 641].') 2023-11-03 20:16:11.726 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:16:11.726 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'woman:[590, 193, 787, 990],[020, 250, 189, 999],[128, 218, 376, 999],[327, 302, 407, 478],[344, 181, 561, 999],[701, 198, 927, 999],[932, 306, 999, 996];human hair:[385, 191, 532, 423],[077, 252, 186, 437],[171, 218, 313, 437],[194, 186, 288, 397],[309, 303, 339, 386],[339, 299, 408, 395],[501, 212, 556, 329],[599, 197, 758, 463],[754, 187, 880, 378],[868, 239, 950, 340],[943, 312, 985, 399],[983, 305, 999, 411];human arm:[866, 451, 945, 970],[038, 395, 081, 569],[124, 448, 205, 793],[348, 407, 413, 850],[541, 419, 607, 835],[579, 401, 635, 611],[905, 437, 961, 515],[920, 500, 951, 625];dress:[024, 369, 188, 997],[343, 397, 576, 999],[598, 371, 802, 977],[692, 413, 939, 999],[903, 486, 999, 999];suit:[000, 390, 063, 996],[120, 406, 376, 996],[520, 313, 578, 530],[585, 359, 635, 669],[848, 344, 926, 453],[936, 399, 999, 484];man:[000, 232, 083, 999],[302, 313, 356, 429],[574, 296, 646, 660],[840, 236, 943, 478],[934, 320, 999, 491];human face:[000, 281, 036, 383],[229, 249, 312, 420],[430, 204, 503, 359],[655, 234, 722, 383],[768, 254, 848, 403],[891, 267, 948, 375].') 2023-11-03 20:16:33.610 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:16:33.610 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[017, 158, 311, 564],[000, 558, 389, 998],[230, 192, 381, 394],[824, 141, 914, 267];man:[448, 194, 891, 931],[000, 091, 097, 278],[881, 202, 998, 447];girl:[103, 618, 402, 998],[114, 350, 326, 652],[251, 374, 482, 714],[639, 665, 986, 998];clothing:[000, 250, 078, 324],[000, 147, 324, 614],[000, 727, 381, 998],[011, 168, 091, 254],[049, 622, 181, 740],[110, 422, 337, 621],[162, 688, 414, 984],[265, 482, 448, 671],[360, 310, 898, 860],[438, 434, 633, 548],[753, 785, 996, 984],[791, 215, 839, 294],[841, 138, 906, 275],[862, 204, 998, 427],[895, 124, 969, 261];human face:[000, 115, 072, 210],[072, 625, 166, 690],[133, 192, 215, 264],[173, 361, 244, 435],[267, 218, 341, 282],[374, 414, 448, 478],[421, 357, 494, 402],[580, 254, 679, 337],[583, 358, 648, 428].') 2023-11-03 20:16:34.569 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:16:34.569 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Yara Shahidi [194, 052, 842, 995] in a snakeskin pantsuit [196, 343, 843, 998] in front of a black and whitie background') 2023-11-03 20:16:46.389 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:16:46.389 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Animal color cartoon showing a lion with binoculars on a safari man\'s shoulders. [010, 010, 524, 585] The lion says, "Oooo, there\'s a gazelle in the grass".') 2023-11-03 20:16:55.389 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:16:55.389 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'woman:[730, 324, 910, 549],[812, 317, 998, 590];man:[413, 104, 705, 968],[606, 155, 727, 451],[702, 313, 912, 552],[711, 323, 850, 526],[778, 313, 843, 358],[875, 302, 993, 517];trousers:[435, 521, 658, 912],[732, 410, 832, 489],[788, 429, 890, 543],[923, 477, 998, 584].') 2023-11-03 20:16:56.967 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:16:56.967 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Person:[743, 533, 769, 609],[081, 602, 110, 689];Machinery Vehicle:[277, 269, 424, 587],[950, 539, 999, 709];Truck:[828, 452, 998, 603].') 2023-11-03 20:17:06.513 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 20:17:06.513 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Truck:[340, 320, 562, 579],[538, 300, 749, 584],[829, 266, 1000, 591];Street Lights:[476, 191, 541, 318].') 2023-11-03 20:17:10.582 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:17:10.582 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Person:[452, 195, 619, 787],[616, 075, 926, 1000];Street Lights:[447, 132, 469, 308],[629, 046, 663, 215];Car:[593, 298, 648, 382];Sneakers:[096, 942, 186, 999],[085, 907, 172, 970],[183, 825, 248, 899],[284, 820, 306, 867],[299, 866, 382, 937],[333, 841, 384, 882],[013, 667, 046, 721],[066, 695, 081, 725];Slippers:[356, 770, 423, 812],[865, 949, 933, 991],[923, 922, 999, 974];Boots:[482, 683, 518, 786],[556, 681, 613, 773];Backpack:[866, 399, 978, 641];Hat:[512, 195, 561, 243].') 2023-11-03 20:17:49.006 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:17:49.006 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[141, 309, 611, 817] hand showing [000, 008, 998, 992] a digital image of human brain in the network - [225, 118, 466, 419] brain stock pictures, royalty-free photos & images') 2023-11-03 20:18:10.599 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:18:10.600 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect woman,human face,fast food and clothing in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'woman:[462, 161, 594, 297],[000, 132, 334, 998],[107, 000, 267, 351],[200, 135, 299, 349],[300, 137, 457, 330],[689, 156, 870, 431],[727, 147, 794, 338];human face:[014, 361, 133, 529],[132, 214, 194, 313],[168, 027, 210, 101],[781, 199, 830, 264],[849, 243, 921, 334],[954, 277, 998, 379];fast food:[197, 676, 331, 767],[094, 968, 166, 998],[253, 521, 343, 575],[323, 426, 426, 470],[379, 839, 509, 911],[434, 400, 498, 450],[436, 441, 511, 482],[464, 743, 570, 833],[594, 566, 725, 645],[658, 367, 725, 395],[704, 421, 810, 460],[764, 575, 915, 673],[794, 727, 903, 781],[802, 883, 900, 957],[826, 486, 954, 545],[903, 792, 998, 887];clothing:[000, 524, 166, 866],[102, 080, 228, 344],[137, 299, 205, 413],[222, 233, 292, 346],[307, 192, 450, 313],[468, 197, 591, 310],[589, 179, 741, 328],[723, 235, 797, 339],[741, 258, 869, 433],[836, 299, 998, 509].') 2023-11-03 20:18:29.808 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:18:29.808 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'woman:[000, 514, 289, 932],[003, 173, 487, 409];man:[040, 515, 290, 931],[079, 037, 737, 553],[631, 385, 962, 840];girl:[000, 521, 285, 928],[018, 173, 496, 418];clothing:[020, 160, 362, 421],[020, 496, 379, 964],[239, 004, 692, 500],[604, 468, 968, 795];human face:[059, 678, 256, 810],[335, 218, 429, 312],[487, 384, 603, 501],[657, 634, 782, 787].') 2023-11-03 20:19:22.808 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 20:19:22.808 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'dress:[357, 293, 744, 730];woman:[018, 126, 858, 820];girl:[067, 129, 872, 793];plant:[030, 266, 476, 863],[733, 381, 999, 665].') 2023-11-03 20:19:26.290 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:19:26.291 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Bear:[493, 393, 678, 548].') 2023-11-03 20:19:38.842 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 20:19:38.842 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'window:[631, 594, 700, 824],[301, 804, 321, 890];house:[289, 000, 999, 982].') 2023-11-03 20:20:04.388 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:20:04.388 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'person:[164, 317, 414, 960];flag:[635, 059, 760, 780];clothing:[152, 474, 425, 994];human face:[230, 387, 301, 538].') 2023-11-03 20:20:10.311 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:20:10.311 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Microphone:[437, 114, 554, 221],[525, 137, 624, 207];Person:[003, 173, 136, 424],[137, 101, 578, 938],[670, 135, 867, 583],[731, 153, 999, 654];Backpack:[693, 243, 883, 421];Hat:[234, 102, 345, 169].') 2023-11-03 20:20:42.278 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:20:42.279 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Spoon,Person,Plate and Slippers in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Spoon:[741, 838, 764, 921];Person:[001, 001, 040, 341],[004, 001, 127, 352],[121, 001, 198, 339],[122, 109, 533, 863];Plate:[654, 819, 793, 949];Slippers:[274, 777, 417, 855].') 2023-11-03 20:21:12.113 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:21:12.114 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Chair in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Chair:[127, 686, 269, 975],[587, 815, 830, 976],[515, 700, 653, 999],[753, 689, 866, 997],[793, 645, 942, 886],[895, 616, 988, 823],[751, 564, 825, 687],[810, 552, 866, 676],[732, 541, 779, 640],[659, 565, 708, 692],[598, 592, 685, 729],[504, 605, 608, 753],[459, 573, 539, 735].') 2023-11-03 20:21:21.498 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:21:21.499 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'bicycle wheel:[000, 729, 998, 998],[042, 849, 078, 931],[045, 820, 082, 885],[071, 843, 131, 917],[092, 895, 177, 998],[111, 855, 186, 944],[161, 920, 288, 998],[300, 965, 405, 998],[495, 943, 588, 998],[611, 847, 662, 953],[652, 874, 727, 982],[701, 888, 793, 998],[808, 911, 928, 998],[912, 920, 998, 998];wheel:[000, 768, 998, 998];maple:[092, 000, 998, 804];tire:[000, 797, 998, 998].') 2023-11-03 20:21:38.606 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:21:38.607 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'man:[000, 223, 066, 998],[000, 072, 123, 553],[020, 274, 490, 998],[078, 149, 250, 808],[150, 045, 224, 170],[214, 018, 329, 219],[232, 165, 523, 885],[385, 213, 519, 616],[391, 184, 478, 440],[426, 064, 566, 589],[484, 191, 823, 761],[485, 323, 848, 998],[575, 108, 690, 253],[700, 203, 850, 614],[740, 138, 852, 508],[748, 331, 999, 998],[824, 278, 999, 819],[830, 282, 921, 691],[844, 108, 941, 286],[931, 140, 999, 294];clothing:[000, 548, 148, 810],[001, 663, 499, 998],[020, 399, 128, 599],[043, 235, 106, 399],[307, 496, 538, 854],[390, 457, 518, 607],[431, 316, 575, 599],[466, 565, 804, 765],[478, 671, 849, 998],[755, 415, 845, 609],[761, 718, 999, 998],[773, 360, 851, 505],[804, 552, 916, 708];human face:[023, 078, 085, 246],[122, 327, 285, 684],[216, 089, 310, 211],[453, 091, 554, 319],[699, 242, 771, 355],[737, 168, 822, 352],[845, 122, 927, 298].') 2023-11-03 20:22:25.600 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:22:25.601 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'clothing:[408, 639, 485, 791],[720, 641, 780, 750];swimming pool:[053, 469, 972, 959];human body:[018, 538, 070, 605],[121, 666, 224, 693],[168, 471, 585, 551],[429, 588, 567, 795],[608, 601, 710, 647],[658, 574, 786, 819],[716, 553, 750, 616],[778, 486, 818, 641],[913, 574, 962, 662];human leg:[431, 741, 575, 791],[736, 701, 755, 791],[752, 706, 775, 798],[788, 540, 804, 628];woman:[413, 597, 576, 795],[665, 570, 777, 806],[783, 452, 814, 634];human arm:[421, 641, 489, 720],[800, 498, 814, 572].') 2023-11-03 20:22:34.231 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:22:34.231 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'woman:[341, 485, 662, 986],[000, 271, 040, 337],[000, 404, 077, 644],[000, 445, 398, 999],[055, 334, 138, 468],[070, 320, 191, 497],[146, 344, 261, 518],[196, 352, 341, 536],[438, 302, 496, 385],[503, 340, 560, 466],[550, 340, 632, 445],[550, 363, 685, 702];man:[655, 350, 936, 979],[376, 362, 546, 627],[655, 296, 753, 500];clothing:[346, 605, 668, 980],[000, 360, 751, 999],[653, 496, 931, 999];human face:[241, 475, 311, 598],[448, 374, 493, 472],[452, 529, 543, 703],[571, 403, 620, 479],[758, 405, 840, 520].') 2023-11-03 20:22:46.106 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:22:46.106 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Baseball Bat:[664, 137, 763, 300];Person:[918, 175, 1000, 407],[678, 067, 803, 204],[770, 210, 839, 312],[664, 164, 820, 633],[594, 211, 683, 323],[510, 158, 692, 436],[513, 357, 616, 533],[380, 109, 475, 295],[296, 106, 379, 264],[249, 168, 343, 414],[152, 173, 251, 390],[062, 173, 171, 395],[001, 173, 062, 403],[001, 002, 072, 115],[019, 008, 111, 168],[112, 002, 209, 169],[164, 002, 245, 078],[459, 002, 558, 124],[548, 002, 642, 118],[630, 002, 709, 118],[317, 304, 523, 787],[140, 475, 377, 764],[011, 399, 186, 773],[014, 574, 168, 1000];Helmet:[095, 576, 170, 667],[126, 394, 184, 509],[234, 471, 295, 588].') 2023-11-03 20:22:50.711 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:22:50.711 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[261, 447, 383, 733] and track its trajectory.If you're including trajectory details in your reply, the Frame t:[xmin,ymin,xmax,ymax] format is imperative.", 'This is a mountain bikeFrame1:[261, 447, 383, 733];Frame2:[295, 393, 414, 679];Frame3:[314, 315, 432, 601];Frame4:[308, 312, 427, 609].') 2023-11-03 20:22:56.535 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:22:56.535 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[204, 250, 738, 632] and track its trajectory.Any trajectory details should be arranged according to the Frame t:[xmin,ymin,xmax,ymax] format for clarity.', 'This is a puntFrame1:[204, 250, 738, 632];Frame2:[215, 248, 661, 603];Frame3:[249, 255, 640, 600];Frame4:[287, 282, 626, 626];Frame5:[285, 391, 605, 685].') 2023-11-03 20:23:09.296 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:23:09.297 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Photo of our favorite Peruvian roast chicken recipe showing [057, 034, 961, 936] the chicken in [000, 004, 994, 997] a cast iron skillet with a small container of...') 2023-11-03 20:23:10.739 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:23:10.739 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Traffic cone:[847, 796, 955, 927];Hat:[648, 111, 746, 159];Other Shoes:[156, 601, 225, 647];Skating and Skiing shoes:[014, 800, 202, 921],[376, 789, 554, 915],[669, 677, 774, 758],[788, 674, 860, 759];Person:[013, 183, 747, 919],[078, 256, 304, 649],[620, 113, 864, 761].') 2023-11-03 20:23:27.067 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:23:27.068 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Bacon, lettuce [274, 556, 478, 843], tomato, onions and croutons [175, 307, 299, 448] [189, 693, 332, 854] in a white bowl [004, 011, 995, 924]') 2023-11-03 20:23:35.070 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:23:35.070 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'fountain:[046, 136, 933, 861];person:[137, 490, 164, 578],[336, 494, 360, 573],[840, 489, 865, 568],[863, 521, 898, 586],[931, 492, 999, 607];tree:[000, 000, 999, 489];umbrella:[000, 457, 103, 585].') 2023-11-03 20:23:57.788 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:23:57.788 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Repairing a sump pump in a basement with [712, 512, 995, 995] a red LED light illuminating the pit and pipe work for draining [105, 327, 750, 881] ground water') 2023-11-03 20:23:58.219 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:23:58.219 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A group of [839, 506, 892, 676] [293, 373, 350, 549] [368, 326, 411, 543] [419, 345, 467, 556] [191, 331, 257, 575] [721, 403, 770, 609] [486, 327, 545, 559] [552, 331, 607, 550] [119, 392, 185, 589] [607, 346, 665, 565] [667, 366, 722, 571] [018, 398, 085, 632] photographers taking a picture of a sunrise from [002, 516, 993, 995] a mountain top') 2023-11-03 20:24:39.296 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:24:39.297 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'building:[000, 001, 999, 999];window:[000, 775, 075, 856],[000, 416, 077, 501],[000, 553, 123, 733],[018, 000, 193, 105],[031, 638, 191, 793],[063, 831, 139, 915],[072, 500, 150, 568],[109, 081, 250, 246],[125, 713, 250, 843],[148, 880, 223, 957],[177, 214, 316, 358],[193, 764, 307, 898],[198, 624, 268, 698],[216, 067, 295, 156],[243, 315, 355, 450],[260, 877, 328, 933],[294, 178, 357, 258],[300, 404, 396, 527],[305, 879, 387, 971],[320, 004, 380, 103],[339, 283, 394, 351],[350, 479, 435, 586],[355, 918, 425, 999],[364, 125, 423, 199],[380, 366, 435, 435],[385, 549, 471, 653],[387, 947, 455, 999],[400, 223, 450, 295],[423, 607, 496, 700],[455, 656, 512, 747],[485, 711, 541, 788],[499, 436, 530, 501],[505, 759, 567, 826].') 2023-11-03 20:24:44.522 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:24:44.522 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Get more gold in "Avabel Online" with [115, 518, 886, 869] [822, 084, 961, 631] these handy farming tips.') 2023-11-03 20:24:57.867 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:24:57.867 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect stop sign,human face,man and woman in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'human face:[000, 500, 058, 661],[060, 398, 117, 505],[171, 430, 218, 561],[214, 432, 368, 674],[387, 424, 422, 511],[803, 279, 996, 615];man:[036, 363, 540, 976],[054, 359, 200, 715],[636, 193, 999, 995];woman:[355, 365, 500, 791],[000, 444, 132, 930],[149, 400, 254, 681].') 2023-11-03 20:25:17.613 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:25:17.613 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[345, 750, 418, 850] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', "This is a red basketball on a woman's handFrame1:[345, 750, 418, 850];Frame2:[350, 758, 422, 858];Frame3:[343, 650, 416, 750];Frame4:[337, 616, 410, 716];Frame5:[350, 647, 422, 744].") 2023-11-03 20:25:25.067 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:25:25.068 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Person:[463, 334, 657, 573],[355, 036, 408, 177],[169, 009, 203, 093],[000, 003, 039, 339],[265, 105, 319, 177],[000, 417, 040, 491];Car:[674, 037, 933, 182],[706, 001, 911, 085],[457, 061, 672, 181],[156, 001, 372, 181],[961, 026, 1000, 176];Traffic Light:[165, 001, 207, 098];SUV:[009, 001, 177, 179];Helmet:[482, 337, 591, 493];Gloves:[621, 462, 656, 515],[526, 468, 570, 517].') 2023-11-03 20:25:26.474 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:25:26.475 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Helmet:[643, 459, 699, 557];Bicycle:[107, 530, 471, 997];Motorcycle:[712, 510, 1000, 851],[816, 430, 1000, 822],[614, 542, 916, 915],[531, 466, 875, 978],[908, 415, 997, 609].') 2023-11-03 20:25:29.382 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:25:29.382 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[719, 341, 771, 415],truckFrame1:[225, 340, 286, 443],busFrame1:[511, 311, 594, 396] in this video clip.Keep your response consistent with the classFrame t:[xmin,ymin,xmax,ymax] format.', 'carFrame1:[719, 341, 771, 415];Frame2:[730, 336, 781, 408],truckFrame1:[225, 340, 286, 443];Frame2:[095, 332, 204, 465],busFrame1:[511, 311, 594, 396];Frame2:[479, 304, 575, 395].') 2023-11-03 20:25:47.408 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:25:47.408 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Person:[001, 404, 533, 999],[538, 252, 1000, 873],[479, 220, 715, 587],[001, 191, 459, 847],[001, 267, 095, 575],[001, 153, 095, 414],[029, 065, 149, 221],[351, 161, 460, 447];Watch:[391, 672, 439, 753];Handbag:[449, 309, 503, 428];Chopsticks:[707, 751, 799, 866];Basin:[723, 771, 808, 849],[938, 903, 998, 993],[687, 865, 849, 1000];Plate:[834, 885, 973, 999];Cup:[501, 619, 558, 756],[401, 485, 450, 611].') 2023-11-03 20:25:51.931 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:25:51.931 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Couch:[017, 454, 480, 1000];Lamp:[389, 353, 434, 498],[544, 042, 571, 235];Chair:[604, 476, 1000, 999],[762, 428, 928, 798];Flower:[732, 187, 996, 426];Vase:[831, 413, 911, 587];Carpet:[206, 719, 720, 1000];Stool:[547, 500, 606, 680];Cabinet:[547, 121, 576, 334],[574, 161, 604, 340],[632, 189, 673, 299];Refrigerator:[650, 299, 697, 585];Bed:[586, 517, 869, 884].') 2023-11-03 20:25:52.816 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:25:52.816 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A dog [002, 004, 996, 731] looking sad next to a bowl [281, 591, 522, 870] of pet food because they have a food allergy.') 2023-11-03 20:25:55.461 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:25:55.462 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect clothing,woman and window blind in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'clothing:[000, 617, 054, 872],[145, 615, 235, 866],[311, 581, 537, 999],[500, 557, 999, 999];woman:[132, 580, 234, 884],[310, 569, 379, 836],[345, 597, 536, 999],[496, 571, 527, 679],[546, 555, 647, 999],[592, 594, 748, 999],[709, 558, 743, 705],[732, 558, 775, 675],[750, 537, 807, 649].') 2023-11-03 20:25:56.238 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 20:25:56.239 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'food:[108, 111, 844, 887].') 2023-11-03 20:25:59.843 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:25:59.843 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the shifting motorized-bicycle?For the trajectories included in the answer, please use the format TrackiFrame t:[xmin,ymin,xmax,ymax].', 'There are 4 shifting motorized-bicycle.Track1frame:1:[304, 409, 433, 516];frame:2:[306, 387, 462, 529];frame:3:[342, 373, 507, 527],Track2frame:1:[625, 382, 738, 466];frame:2:[668, 336, 796, 457];frame:3:[707, 301, 839, 444],Track3frame:1:[315, 389, 432, 468];frame:2:[328, 345, 473, 472];frame:3:[365, 330, 521, 461],Track4frame:1:[116, 389, 240, 477];frame:2:[107, 362, 263, 487];frame:3:[137, 336, 307, 483].') 2023-11-03 20:26:01.326 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:26:01.326 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[414, 113, 578, 690],[000, 504, 137, 999],[557, 773, 672, 1000];Lamp:[264, 162, 304, 260],[296, 210, 331, 295];Stool:[323, 912, 398, 998];Ladder:[394, 001, 545, 999].') 2023-11-03 20:26:06.634 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:26:06.635 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Trash bin Can:[151, 801, 260, 992],[282, 698, 379, 883],[201, 758, 264, 813],[356, 657, 390, 722];Whiteboard:[737, 457, 791, 536],[929, 425, 999, 522];Moniter:[500, 405, 587, 499],[602, 519, 631, 594];Printer:[888, 518, 997, 720];Backpack:[202, 677, 235, 744];Chair:[901, 594, 1000, 975],[712, 562, 753, 614],[489, 561, 569, 784],[383, 561, 454, 730],[236, 573, 310, 741],[471, 557, 510, 759];Cabinet:[800, 717, 900, 1000],[686, 637, 793, 803],[572, 638, 702, 816];Desk:[893, 699, 1000, 1000],[210, 596, 480, 717],[552, 610, 662, 646],[682, 607, 772, 645];Person:[242, 514, 333, 708].') 2023-11-03 20:26:11.476 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:26:11.476 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video clip including frame1,frame2 and frame3, what is this objectframe:1:[768, 492, 788, 562];frame:2:[738, 474, 784, 549];frame:3:[753, 474, 790, 537] doing?Using the given image frames and trajectory coordinates, please provide a concise answer.Refer to the supplied image frames and trajectory coordinates for a short response to the question.Please give a succinct reply by referring to the image frames and trajectory coordinates provided.Make sure your response is brief and references the image frames and trajectory coordinates you've been given.Draw upon the provided image frames and trajectory coordinates for a concise answer to the question.Relying on the image frames and trajectory coordinates shared, please keep your answer brief.", 'This is a/an second truck shifting to the left-side') 2023-11-03 20:26:15.871 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:26:15.871 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'human body:[172, 228, 360, 537],[356, 255, 702, 642];human hair:[406, 259, 518, 379],[225, 207, 285, 298];human leg:[294, 449, 366, 518],[510, 494, 695, 575];woman:[337, 243, 697, 646];mammal:[181, 206, 365, 525],[338, 239, 718, 685];man:[185, 215, 369, 525];clothing:[291, 475, 356, 530],[351, 375, 516, 559];human arm:[288, 322, 372, 461],[476, 395, 653, 635].') 2023-11-03 20:26:41.383 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:26:41.384 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'football:[918, 283, 991, 373];boy:[317, 146, 676, 859],[348, 326, 761, 962],[455, 247, 638, 780].') 2023-11-03 20:27:30.843 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:27:30.843 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Father of the (almost every) year: [151, 157, 857, 827] Coffee Mug') 2023-11-03 20:28:04.625 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:28:04.625 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "closeup of Shazia's diorama featuring a miniature self portrait sitting at [000, 464, 998, 902] the kitchen table with [017, 369, 375, 542] a laptop and [616, 481, 856, 567] [250, 507, 599, 701] art materials laid out on it, there is also [701, 339, 960, 551] a bag of groceries on a chair beside them.") 2023-11-03 20:28:12.518 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:28:12.519 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Hat:[799, 398, 937, 502],[930, 427, 1000, 517],[783, 465, 846, 555],[569, 421, 730, 579],[694, 531, 770, 624],[190, 404, 350, 529];Cup:[404, 895, 435, 974],[424, 748, 457, 826];Person:[000, 366, 149, 1000],[081, 404, 416, 999],[298, 494, 469, 838],[360, 261, 539, 745],[449, 424, 827, 1000],[584, 366, 687, 701],[898, 266, 1000, 458],[729, 244, 899, 585];Necklace:[102, 604, 142, 680],[467, 383, 492, 425].') 2023-11-03 20:28:18.702 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:28:18.702 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Hands [498, 787, 746, 997] placing an icepack [430, 496, 744, 903] in the freezer [007, 009, 985, 995]') 2023-11-03 20:28:24.972 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:28:24.972 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'person:[160, 191, 498, 1000];guitar:[085, 371, 495, 885];microphone:[313, 304, 405, 375].') 2023-11-03 20:28:32.297 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:28:32.297 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'clothing:[042, 131, 596, 718].') 2023-11-03 20:28:55.155 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:28:55.156 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect woman and clothing in this image.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[709, 310, 997, 1000];clothing:[030, 383, 188, 933],[784, 652, 998, 912].') 2023-11-03 20:29:12.570 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 20:29:12.570 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Sneakers:[303, 153, 405, 270],[338, 267, 430, 349],[195, 701, 318, 735];Person:[302, 152, 986, 562],[459, 305, 604, 721],[263, 263, 505, 712],[148, 257, 358, 739].') 2023-11-03 20:29:14.492 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:29:14.493 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Flag:[152, 228, 198, 309],[848, 238, 896, 344];Car:[053, 823, 132, 914],[109, 821, 215, 929],[265, 813, 391, 956],[683, 816, 811, 946],[863, 817, 949, 916].') 2023-11-03 20:29:20.958 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:29:20.958 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'boat:[201, 154, 963, 965].') 2023-11-03 20:29:39.209 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:29:39.210 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the adult man moving and dancing in the center of the field?For clarity, represent trajectories using the TrackiFrame t:[xmin,ymin,xmax,ymax] format in your response.', 'There is one adult man moving and dancing in the center of the field.Track1frame:1:[352, 191, 758, 997];frame:2:[412, 222, 722, 995];frame:3:[460, 256, 720, 997].') 2023-11-03 20:30:05.973 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:30:05.974 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect bicycle wheel,wheel,clothing and person in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'bicycle wheel:[446, 545, 832, 1000],[000, 487, 050, 581],[098, 495, 165, 568],[212, 510, 404, 920];wheel:[481, 560, 834, 987],[001, 470, 048, 568],[106, 481, 154, 562],[209, 516, 398, 916];clothing:[084, 425, 159, 500];person:[060, 387, 162, 512],[093, 406, 165, 550].') 2023-11-03 20:31:01.311 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:31:01.312 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[388, 280, 533, 660],personFrame1:[448, 243, 675, 755],personFrame1:[357, 282, 481, 584],personFrame1:[329, 313, 426, 547],personFrame1:[678, 443, 756, 667] in this video clip.Your response should conform strictly to the categoryFrame t:[xmin,ymin,xmax,ymax] format.', 'personFrame1:[388, 280, 533, 660];Frame2:[378, 266, 520, 667],personFrame1:[448, 243, 675, 755];Frame2:[445, 242, 696, 765],personFrame1:[357, 282, 481, 584];Frame2:[341, 278, 453, 593],personFrame1:[329, 313, 426, 547];Frame2:[304, 308, 402, 555],personFrame1:[678, 443, 756, 667];Frame2:[680, 434, 759, 665].') 2023-11-03 20:31:43.719 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:31:43.720 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Horse:[247, 164, 571, 618],[620, 130, 825, 528];Hat:[271, 544, 525, 706],[017, 027, 099, 104],[361, 037, 424, 085];Leather Shoes:[085, 813, 136, 877];Person:[179, 543, 528, 1000],[002, 226, 138, 881],[002, 020, 177, 618],[338, 017, 478, 478],[662, 015, 770, 379].') 2023-11-03 20:32:08.943 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:32:08.944 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[296, 727, 633, 1000],[774, 681, 1000, 1000],[000, 633, 157, 1000],[399, 441, 493, 736],[656, 377, 782, 945],[869, 368, 995, 833],[000, 750, 119, 999];Handbag:[112, 723, 173, 999],[580, 530, 655, 673],[662, 467, 705, 669];Hat:[777, 371, 832, 420];Speaker:[103, 329, 206, 568];Microphone:[394, 686, 427, 727];Air Conditioner:[698, 074, 788, 122].') 2023-11-03 20:32:46.254 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:32:46.255 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[230, 350, 743, 522] and track its trajectory.Stick to the Frame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'This is a pt boatFrame1:[230, 350, 743, 522];Frame2:[198, 350, 713, 522];Frame3:[177, 348, 691, 520];Frame4:[177, 347, 697, 523];Frame5:[190, 348, 716, 522].') 2023-11-03 20:33:02.586 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:33:02.587 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[028, 279, 865, 810] and track its trajectory.For clarity, represent trajectories using the Frame t:[xmin,ymin,xmax,ymax] format in your response.', 'This is a platypusFrame1:[028, 279, 865, 810];Frame2:[023, 272, 854, 829];Frame3:[023, 262, 890, 877];Frame4:[025, 262, 895, 875].') 2023-11-03 20:33:06.124 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:33:06.125 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'boy:[006, 308, 120, 762],[053, 467, 176, 790],[098, 291, 195, 738],[161, 472, 268, 777],[187, 287, 276, 570],[271, 295, 333, 635],[327, 269, 416, 614],[390, 482, 495, 766],[405, 259, 571, 594],[488, 465, 611, 764],[491, 289, 580, 553],[498, 299, 727, 725],[612, 469, 743, 771],[630, 287, 732, 738],[716, 274, 806, 723],[795, 263, 891, 743],[873, 250, 982, 747];sports equipment:[480, 723, 537, 805];clothing:[013, 383, 106, 547],[013, 370, 111, 727],[023, 540, 062, 732],[056, 530, 173, 732],[098, 351, 196, 592],[146, 545, 181, 611],[171, 542, 265, 721],[180, 371, 276, 560],[185, 644, 253, 714],[198, 536, 265, 555],[260, 355, 337, 611],[281, 545, 383, 723],[292, 635, 370, 727],[327, 347, 410, 594],[345, 532, 403, 596],[390, 527, 480, 747],[407, 321, 527, 491],[410, 656, 476, 728],[411, 327, 558, 586],[420, 448, 495, 579],[496, 540, 600, 699],[498, 358, 580, 553],[508, 628, 580, 708],[508, 500, 586, 596],[565, 357, 648, 600],[580, 482, 640, 598],[613, 528, 716, 732],[641, 347, 736, 577],[651, 648, 732, 715],[686, 517, 725, 581],[716, 351, 807, 614],[732, 517, 781, 667],[748, 650, 866, 721],[750, 517, 843, 712],[797, 343, 883, 590],[815, 525, 880, 609],[876, 515, 967, 598],[880, 327, 985, 609].') 2023-11-03 20:33:11.255 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:33:11.256 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[321, 097, 782, 997] Woman with "Save our future" markered on [378, 283, 594, 694] face during Climate Strike protest.') 2023-11-03 20:33:32.980 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:33:32.980 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'tree:[198, 307, 323, 476],[052, 308, 145, 432],[202, 079, 298, 216],[231, 925, 285, 999],[290, 864, 365, 991],[361, 285, 425, 437],[386, 881, 459, 978],[424, 139, 689, 437],[465, 891, 517, 980],[501, 631, 574, 725],[506, 795, 562, 909],[564, 697, 667, 860],[578, 592, 649, 686],[630, 325, 818, 637],[806, 870, 926, 999],[840, 240, 999, 389],[890, 423, 999, 620],[955, 146, 999, 279];window:[000, 473, 095, 554],[076, 740, 139, 750],[086, 783, 149, 802],[091, 039, 145, 127],[161, 000, 301, 175],[366, 024, 413, 105],[402, 089, 470, 215],[631, 058, 786, 245],[684, 930, 760, 985].') 2023-11-03 20:33:56.995 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:33:56.996 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Students [001, 385, 192, 997] [776, 314, 893, 646] [798, 399, 998, 995] [556, 273, 673, 581] [417, 277, 573, 601] [216, 285, 389, 604] wearing masks [013, 458, 096, 564] [001, 362, 048, 423] [811, 369, 860, 433] [598, 316, 637, 371] [476, 337, 526, 405] [261, 356, 312, 439] [922, 486, 998, 613] listen to a lecture during a welding class [006, 007, 994, 995].') 2023-11-03 20:34:06.511 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:34:06.511 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Potted Plant:[120, 001, 828, 586],[795, 239, 997, 500],[120, 001, 828, 586],[795, 239, 997, 500];Desk:[068, 478, 945, 868],[068, 478, 945, 868];Vase:[244, 348, 389, 598],[244, 348, 389, 598];Chair:[183, 510, 458, 789],[317, 471, 516, 710],[661, 521, 919, 778],[568, 554, 843, 857],[165, 608, 533, 941].') 2023-11-03 20:34:10.710 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:34:10.710 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person and Helmet in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Person:[139, 488, 281, 997],[319, 371, 501, 858],[764, 462, 994, 999],[001, 039, 129, 523];Helmet:[058, 038, 120, 128],[247, 264, 303, 325],[288, 283, 349, 377],[433, 372, 496, 469],[189, 487, 257, 585],[522, 310, 585, 408],[574, 098, 631, 187],[714, 360, 785, 469],[683, 408, 727, 510],[832, 465, 899, 565].') 2023-11-03 20:34:33.753 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:34:33.754 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Fan:[383, 556, 458, 723],[001, 583, 075, 732];Lamp:[868, 248, 901, 317];Person:[652, 598, 801, 999],[589, 651, 681, 1000],[275, 598, 398, 999],[200, 599, 254, 789],[117, 622, 215, 775],[541, 636, 614, 910],[671, 609, 745, 705],[902, 603, 942, 734],[327, 515, 373, 608];Helmet:[481, 742, 519, 792];Satchel:[894, 660, 928, 711].') 2023-11-03 20:34:42.481 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:34:42.481 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Cabinet:[002, 002, 588, 297],[740, 121, 1000, 477];Person:[299, 360, 959, 1000],[773, 533, 1000, 856],[002, 002, 227, 339],[603, 866, 998, 998];Leather Shoes:[064, 284, 213, 340];Oven:[393, 002, 999, 400];Kettle:[851, 002, 931, 051];bucket:[616, 868, 999, 1000].') 2023-11-03 20:34:44.111 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:34:44.112 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Hikers [025, 561, 104, 855] [083, 475, 171, 666] [325, 466, 388, 779] [001, 545, 085, 835] [465, 440, 568, 765] [423, 485, 495, 788] [360, 475, 442, 828] [085, 585, 204, 866] [236, 483, 363, 996] take an uphill trek to a waterfall [481, 213, 830, 626] in the forest [007, 000, 994, 995] near the Dawes Glacier, a stop on the Safari Explorer cruise.') 2023-11-03 20:35:39.132 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:35:39.132 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[128, 357, 285, 548],[110, 362, 390, 584],[676, 257, 777, 440],[838, 340, 913, 509];Hat:[716, 257, 747, 284];Boat:[038, 001, 942, 869];Lifesaver:[765, 403, 846, 490].') 2023-11-03 20:35:48.779 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:35:48.779 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'man:[543, 589, 660, 943],[190, 645, 242, 788],[435, 594, 548, 962];tree:[000, 570, 038, 690],[078, 513, 235, 716],[231, 090, 999, 718].') 2023-11-03 20:35:54.170 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:35:54.170 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect tree,land vehicle and window in this image.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'tree:[088, 000, 499, 513],[736, 000, 999, 680];land vehicle:[230, 255, 926, 941];window:[473, 000, 662, 136],[493, 335, 646, 505].') 2023-11-03 20:36:22.796 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:36:22.796 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[383, 442, 451, 786] and track its trajectory.For the trajectories included in the answer, please use the format Frame t:[xmin,ymin,xmax,ymax].', 'This is a personFrame1:[383, 442, 451, 786];Frame2:[335, 456, 425, 798];Frame3:[308, 437, 379, 777];Frame4:[298, 440, 368, 784];Frame5:[285, 439, 374, 792].') 2023-11-03 20:36:45.353 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:36:45.353 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'palm tree:[000, 201, 085, 501],[595, 304, 763, 552];person:[230, 468, 271, 680],[468, 515, 483, 610],[513, 514, 540, 608];chair:[572, 588, 609, 657],[579, 600, 632, 707],[626, 600, 676, 704],[668, 605, 709, 717],[694, 670, 760, 778],[766, 665, 826, 795];vehicle:[000, 480, 109, 592],[285, 477, 337, 555],[491, 475, 538, 548];clothing:[227, 482, 271, 675],[239, 515, 266, 667],[464, 531, 483, 608],[511, 511, 538, 601],[836, 482, 906, 610],[960, 430, 999, 607].') 2023-11-03 20:36:50.243 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:36:50.243 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[157, 295, 710, 677] and track its trajectory.Ensure the trajectories in your answer follow the Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a dumpcartFrame1:[157, 295, 710, 677];Frame2:[146, 295, 713, 684];Frame3:[146, 295, 720, 688];Frame4:[148, 280, 739, 686];Frame5:[148, 276, 746, 681].') 2023-11-03 20:37:07.499 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 20:37:07.500 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Person:[001, 000, 998, 1000],[066, 000, 999, 1000];Bracelet:[456, 386, 857, 618],[180, 636, 566, 903];Ring:[926, 502, 996, 585].') 2023-11-03 20:37:08.440 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:37:08.441 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'man:[221, 407, 850, 955];clothing:[197, 548, 500, 820],[358, 496, 849, 951];flower:[086, 911, 188, 980],[000, 728, 131, 847];balloon:[661, 285, 929, 493];girl:[181, 478, 519, 881].') 2023-11-03 20:37:09.387 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:37:09.388 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'human body:[466, 506, 543, 684],[105, 524, 204, 653],[638, 538, 763, 682],[828, 398, 878, 570],[866, 390, 930, 663];mammal:[468, 523, 551, 684],[141, 525, 212, 631],[636, 534, 775, 671],[823, 425, 862, 571],[867, 389, 930, 666];clothing:[456, 520, 535, 680],[114, 540, 212, 635],[662, 539, 770, 654],[870, 423, 925, 655];person:[108, 529, 216, 655],[456, 511, 547, 674],[644, 535, 772, 674],[825, 393, 873, 582],[867, 386, 928, 687].') 2023-11-03 20:37:31.936 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:37:31.936 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Stool:[354, 257, 515, 675];Desk:[002, 820, 506, 1000];Chair:[793, 355, 1000, 999];shelf:[886, 003, 1000, 250];Couch:[757, 225, 999, 369];Hat:[541, 107, 779, 319];Handbag:[283, 456, 361, 718];Person:[261, 110, 941, 999],[002, 003, 349, 846],[312, 003, 561, 496],[636, 003, 810, 266],[818, 059, 1000, 253];Book:[279, 717, 377, 934].') 2023-11-03 20:37:37.595 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:37:37.595 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'human body:[224, 725, 365, 961],[000, 711, 294, 976],[317, 720, 999, 999];human hair:[538, 818, 793, 978],[875, 723, 943, 798];human head:[515, 823, 817, 964],[000, 695, 955, 783];mammal:[217, 748, 379, 935],[005, 708, 755, 956],[525, 783, 764, 999],[735, 728, 805, 933],[786, 716, 999, 999];man:[240, 745, 383, 971],[406, 718, 515, 937],[488, 744, 645, 999],[534, 790, 800, 999],[714, 723, 816, 923],[791, 716, 999, 998];clothing:[238, 748, 377, 967],[000, 810, 077, 883],[111, 728, 175, 892],[410, 771, 499, 874],[476, 735, 517, 832],[710, 760, 809, 868],[776, 738, 850, 837],[795, 796, 999, 999];human leg:[114, 786, 829, 995];woman:[161, 764, 259, 976],[425, 744, 501, 958];human arm:[248, 779, 266, 848],[336, 783, 368, 850],[440, 776, 467, 873].') 2023-11-03 20:37:48.623 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:37:48.623 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[657, 522, 719, 838],personFrame1:[597, 520, 656, 830],personFrame1:[362, 548, 420, 812],personFrame1:[242, 488, 321, 847],personFrame1:[441, 457, 507, 847],personFrame1:[298, 537, 336, 821],personFrame1:[653, 549, 697, 815] in this video clip.Your reply should be in alignment with the classFrame t:[xmin,ymin,xmax,ymax] structure.', 'personFrame1:[657, 522, 719, 838];Frame2:[653, 450, 726, 836],personFrame1:[597, 520, 656, 830];Frame2:[583, 487, 650, 828],personFrame1:[362, 548, 420, 812];Frame2:[370, 561, 413, 813],personFrame1:[242, 488, 321, 847];Frame2:[221, 450, 303, 848],personFrame1:[441, 457, 507, 847];Frame2:[426, 461, 509, 840],personFrame1:[298, 537, 336, 821];Frame2:[295, 531, 333, 822],personFrame1:[653, 549, 697, 815];Frame2:[640, 520, 696, 815].') 2023-11-03 20:37:58.015 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:37:58.015 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Card of [256, 468, 461, 758] cute raccoon in [003, 431, 477, 968] the carriage and [198, 228, 317, 431] [091, 246, 209, 442] [134, 089, 251, 293] balloons') 2023-11-03 20:38:07.604 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:38:07.604 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A sliced papaya [223, 041, 435, 930] on a pink background next to a banana on a blue background.') 2023-11-03 20:38:10.056 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:38:10.056 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[354, 248, 527, 637] and track its trajectory.Your response should highlight trajectories using the established Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a sloth bearFrame1:[354, 248, 527, 637];Frame2:[345, 192, 520, 608];Frame3:[263, 000, 611, 725];Frame4:[141, 034, 725, 998].') 2023-11-03 20:38:13.132 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:38:13.132 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Desk:[629, 825, 998, 1000];Chair:[023, 780, 220, 1000];Cabinet:[002, 397, 248, 696];Cell Phone:[879, 867, 1000, 951];Glasses:[341, 370, 531, 487];Cup:[738, 837, 860, 1000];Person:[206, 166, 732, 1000],[830, 487, 1000, 883],[096, 503, 180, 710].') 2023-11-03 20:38:27.113 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:38:27.114 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'HUDWAY Glass uses your phones reflection for a heads-up-display (HUD) in your car [046, 031, 890, 949] - Phone Heads-Up-Display HUD for car') 2023-11-03 20:39:00.299 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:39:00.300 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Rescue workers remove one of a number of bodies from the wreckage [023, 686, 537, 938] of homes destroyed by a mudslide near Oso, Wash., Monday, March 24, 2014.') 2023-11-03 20:39:27.982 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:39:27.983 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Scrutinize the image and respond to my inquiry: 'QUESTION: Where are [336, 670, 473, 967] and [102, 330, 156, 470] and [001, 504, 032, 729] and [348, 516, 433, 696] and [128, 318, 162, 419] and [121, 149, 160, 258] and [591, 285, 668, 486] and [352, 278, 386, 444] and [552, 578, 658, 928] and [419, 257, 445, 386] and [020, 503, 106, 767] and [137, 380, 218, 584] and [640, 349, 722, 753] and [373, 233, 419, 448] and [025, 183, 053, 280] and [074, 597, 132, 785] and [277, 518, 338, 635] , and [258, 632, 353, 960] ? \nANSWER: [336, 670, 473, 967] and [102, 330, 156, 470] and [001, 504, 032, 729] and [348, 516, 433, 696] and [128, 318, 162, 419] and [121, 149, 160, 258] and [591, 285, 668, 486] and [352, 278, 386, 444] and [552, 578, 658, 928] and [419, 257, 445, 386] and [020, 503, 106, 767] and [137, 380, 218, 584] and [640, 349, 722, 753] and [373, 233, 419, 448] and [025, 183, 053, 280] and [074, 597, 132, 785] and [277, 518, 338, 635] , and [258, 632, 353, 960] are at a lawn picnic . You should explain the reason for the above answer.'. Make sure you mention the reasoning process and clarify the bounding boxes of relevant objects using square brackets.", '[336, 670, 473, 967] and [102, 330, 156, 470] and [001, 504, 032, 729] and [348, 516, 433, 696] and [128, 318, 162, 419] and [121, 149, 160, 258] and [591, 285, 668, 486] and [352, 278, 386, 444] and [552, 578, 658, 928] and [419, 257, 445, 386] and [020, 503, 106, 767] and [137, 380, 218, 584] and [640, 349, 722, 753] and [373, 233, 419, 448] and [025, 183, 053, 280] and [074, 597, 132, 785] and [277, 518, 338, 635] , and [258, 632, 353, 960] are in the front yard of a house . there is a bbq grill . there are tables with food and drinks .') 2023-11-03 20:39:34.467 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:39:34.467 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[457, 350, 496, 530],personFrame1:[245, 364, 284, 572],personFrame1:[895, 423, 935, 616],personFrame1:[030, 299, 072, 487],personFrame1:[368, 308, 404, 475],personFrame1:[421, 337, 465, 539],personFrame1:[294, 375, 327, 571],personFrame1:[092, 529, 136, 761],personFrame1:[236, 141, 274, 333],personFrame1:[196, 437, 243, 657] in this video clip.Adhere strictly to the format categoryFrame t:[xmin,ymin,xmax,ymax] when providing your answer.', 'personFrame1:[457, 350, 496, 530];Frame2:[459, 356, 497, 536],personFrame1:[245, 364, 284, 572];Frame2:[244, 368, 280, 573],personFrame1:[895, 423, 935, 616];Frame2:[895, 423, 935, 616],personFrame1:[030, 299, 072, 487];Frame2:[030, 299, 072, 487],personFrame1:[368, 308, 404, 475];Frame2:[380, 312, 415, 483],personFrame1:[421, 337, 465, 539];Frame2:[421, 342, 467, 539],personFrame1:[294, 375, 327, 571];Frame2:[291, 381, 325, 575],personFrame1:[092, 529, 136, 761];Frame2:[098, 531, 143, 763],personFrame1:[236, 141, 274, 333];Frame2:[236, 137, 274, 327],personFrame1:[196, 437, 243, 657];Frame2:[201, 428, 246, 646].') 2023-11-03 20:40:46.089 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:40:46.090 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'File image of [829, 270, 976, 986] [698, 274, 849, 907] [446, 241, 609, 964] [264, 260, 372, 903] [085, 259, 240, 931] people wearing [589, 250, 630, 309] [074, 307, 113, 362] [226, 163, 267, 224] [669, 269, 712, 327] [392, 302, 435, 361] [298, 310, 341, 375] [884, 323, 934, 400] [747, 319, 796, 387] [148, 301, 194, 364] [503, 284, 550, 362] masks in the Brisbane CBD.') 2023-11-03 20:40:48.137 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:40:48.138 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A smiling 70-year-old woman [136, 079, 858, 995] with a pork-pie hat [384, 076, 694, 385] and dark-rimmed glasses [404, 232, 614, 315] enjoying listing to a podcast on her mobile phone [691, 403, 842, 585] with earphones [418, 316, 729, 994].') 2023-11-03 20:41:02.687 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:41:02.688 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[289, 484, 380, 628] and track its trajectory.For clarity, represent trajectories using the Frame t:[xmin,ymin,xmax,ymax] format in your response.', 'This is a bovineFrame1:[289, 484, 380, 628];Frame2:[264, 487, 344, 603];Frame3:[260, 450, 345, 585];Frame4:[282, 446, 360, 567].') 2023-11-03 20:42:33.204 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:42:33.205 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect woman and human face in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'woman:[433, 382, 660, 999],[589, 175, 999, 999];human face:[033, 061, 136, 506],[349, 223, 445, 464],[527, 403, 632, 617],[671, 310, 701, 431],[910, 517, 955, 582].') 2023-11-03 20:42:43.649 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:42:43.650 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Person:[038, 162, 134, 863],[127, 150, 217, 870],[208, 139, 299, 879],[285, 140, 370, 882],[355, 140, 444, 888],[430, 127, 523, 889],[513, 120, 609, 891],[606, 116, 705, 892],[699, 118, 800, 899],[791, 104, 891, 902],[873, 092, 979, 908];Sneakers:[621, 826, 688, 900],[747, 828, 784, 900],[808, 840, 871, 905],[898, 834, 961, 909],[223, 800, 266, 878],[251, 799, 290, 881],[296, 817, 333, 883],[369, 815, 435, 889],[445, 821, 507, 896],[526, 824, 588, 897].') 2023-11-03 20:42:44.540 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:42:44.541 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person,Bracelet and Tennis Racket in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[205, 022, 680, 999],[001, 088, 120, 558];Bracelet:[460, 584, 538, 653],[548, 388, 621, 452];Tennis Racket:[400, 524, 985, 742].') 2023-11-03 20:43:13.111 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:43:13.112 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'DIY mitten ornament [133, 138, 732, 750] on a Christmas tree [002, 000, 996, 995]') 2023-11-03 20:43:17.878 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:43:17.878 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Golf Club:[305, 590, 412, 905],[595, 570, 688, 899];Person:[146, 147, 416, 954],[602, 130, 838, 947];Hat:[327, 149, 411, 282];Sneakers:[171, 849, 234, 931],[213, 865, 327, 951].') 2023-11-03 20:43:30.177 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:43:30.178 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Thoughts on turning [029, 003, 971, 322] the page…To 41') 2023-11-03 20:43:37.714 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:43:37.714 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, what is this objectframe:2:[937, 239, 996, 355];frame:3:[378, 265, 806, 548] doing?Ensure your answer is to the point, taking into account the provided image frames and trajectory coordinates.', 'This is a/an zebras shifting from far right-side to far left-side') 2023-11-03 20:43:42.449 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:43:42.449 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The perfect combination [058, 006, 985, 981]...chocolate and peppermint. This chocolate peppermint cookies recipe is bound to be a hit for the upcoming holidays. BONUS...it also freezes really well.') 2023-11-03 20:43:43.353 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:43:43.353 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[612, 121, 657, 218],[792, 132, 823, 220],[730, 198, 755, 253];Handbag:[000, 280, 095, 542];Hanger:[251, 002, 335, 055];Picture:[773, 110, 847, 242],[585, 086, 692, 261];Carpet:[237, 818, 832, 1000].') 2023-11-03 20:43:56.285 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:43:56.286 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, can you tell me what thisframe:1:[502, 527, 596, 760];frame:2:[514, 565, 586, 779];frame:3:[419, 363, 703, 828] is?Please briefly answer the question based on the provided image frames and trajectory coordinates.', 'This is a/an the pandas having fun together.') 2023-11-03 20:44:06.416 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:44:06.416 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'person:[021, 663, 100, 913],[103, 551, 364, 909],[553, 310, 689, 670],[789, 408, 870, 776],[853, 463, 984, 767],[858, 397, 908, 491],[923, 400, 979, 534];clothing:[026, 694, 094, 857],[108, 707, 187, 893],[124, 669, 201, 762],[188, 637, 271, 893],[280, 614, 358, 786],[556, 356, 681, 650],[786, 420, 911, 776],[845, 521, 966, 705],[938, 463, 976, 534].') 2023-11-03 20:44:34.560 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:44:34.561 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[000, 402, 882, 666] [110, 208, 965, 469] New cars with [055, 532, 159, 659] [737, 339, 840, 469] [180, 303, 280, 426] alloy rims being hauled away from ground zero. Notice the rust created by WTC nuclear blast') 2023-11-03 20:44:55.418 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:44:55.419 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me the trajectory of the left vehicles in silver in this video clip?Ensure the trajectories in your answer follow the TrackiFrame t:[xmin,ymin,xmax,ymax] structure.', 'There is one left vehicles in silver.Track1frame1:[466, 464, 487, 506];frame2:[447, 464, 471, 514];frame3:[424, 474, 453, 528];frame4:[396, 474, 430, 536].') 2023-11-03 20:45:09.113 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:45:09.113 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'man:[845, 046, 985, 417],[285, 060, 371, 244],[570, 025, 675, 182],[840, 050, 898, 137];clothing:[001, 188, 346, 970],[286, 137, 362, 235],[423, 500, 995, 999],[449, 152, 585, 345],[568, 080, 686, 163],[688, 074, 727, 141],[708, 089, 811, 255],[848, 080, 977, 302],[965, 074, 999, 205];boy:[428, 134, 490, 254];woman:[000, 000, 554, 999],[410, 050, 595, 369],[670, 019, 732, 169];girl:[003, 000, 556, 999],[668, 027, 739, 152],[798, 056, 855, 146];human face:[177, 071, 288, 306],[291, 087, 326, 161],[483, 087, 539, 196].') 2023-11-03 20:46:38.080 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:46:38.082 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'human body:[000, 184, 201, 940],[000, 000, 075, 285],[084, 017, 336, 470],[125, 403, 481, 999],[341, 153, 631, 737],[476, 289, 802, 923];woman:[332, 169, 601, 745],[000, 000, 109, 305],[080, 013, 334, 458];human hair:[603, 290, 788, 566],[000, 000, 073, 057],[001, 290, 161, 505],[061, 019, 223, 225],[148, 408, 341, 747],[350, 159, 545, 406];human head:[350, 155, 527, 385],[000, 000, 061, 134],[000, 273, 138, 492],[080, 020, 197, 210],[157, 410, 336, 657],[610, 329, 751, 560];clothing:[109, 588, 503, 972],[000, 066, 105, 287],[000, 224, 184, 838],[075, 153, 333, 442],[346, 301, 616, 730],[477, 511, 765, 941],[843, 211, 999, 731];human arm:[193, 670, 413, 960],[007, 513, 100, 864],[007, 127, 168, 428],[256, 203, 322, 435],[356, 452, 423, 655],[460, 416, 557, 740],[517, 615, 600, 838],[538, 630, 724, 905],[963, 430, 999, 823];human leg:[222, 820, 480, 999],[348, 496, 475, 610],[453, 534, 606, 737],[476, 732, 741, 932];mammal:[000, 000, 083, 285],[000, 132, 179, 844],[087, 015, 335, 452],[125, 379, 473, 999],[341, 155, 608, 740],[492, 304, 796, 923],[840, 220, 999, 807];girl:[000, 130, 183, 840],[137, 410, 470, 971],[340, 170, 620, 744],[482, 282, 801, 940];human face:[000, 020, 052, 127],[083, 101, 150, 213],[388, 284, 481, 397],[613, 416, 684, 560].') 2023-11-03 20:47:07.799 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:47:07.799 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[518, 286, 727, 999],[216, 126, 464, 1000],[504, 325, 581, 862],[796, 331, 893, 718],[378, 111, 476, 410],[002, 541, 085, 707],[143, 249, 266, 813],[048, 415, 091, 594],[169, 382, 260, 728];Satchel:[512, 588, 594, 804].') 2023-11-03 20:47:18.087 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:47:18.087 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'Couch:[002, 490, 311, 999];Pillow:[002, 856, 150, 999],[390, 610, 611, 750];Side Table:[163, 522, 342, 777];Lamp:[182, 285, 306, 567];Coffee Table:[367, 724, 613, 998];Carpet:[210, 777, 754, 998];Frame:[761, 171, 830, 383];Person:[582, 143, 906, 999],[755, 002, 1000, 999].') 2023-11-03 20:47:58.344 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:47:58.344 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Lamp:[431, 001, 501, 067];Frame:[895, 133, 954, 444],[280, 252, 349, 393];Couch:[569, 543, 740, 898];shelf:[421, 496, 594, 769];Bed:[001, 509, 397, 1000];Desk:[248, 507, 439, 744],[742, 651, 862, 892].') 2023-11-03 20:48:05.203 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:48:05.203 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the kitten shifting around?As you detail the trajectories in your reply, the TrackiFrame t:[xmin,ymin,xmax,ymax] format is the preferred structure.', 'There are 4 kitten shifting around.Track1frame:1:[113, 284, 390, 998];frame:2:[113, 284, 390, 998];frame:3:[119, 240, 397, 909],Track2frame:1:[347, 206, 520, 957];frame:2:[347, 206, 520, 957];frame:3:[328, 439, 542, 918],Track3frame:1:[496, 287, 674, 940];frame:2:[496, 287, 674, 940];frame:3:[493, 292, 622, 888],Track4frame:1:[586, 360, 869, 998];frame:2:[586, 360, 869, 998];frame:3:[601, 219, 877, 998].') 2023-11-03 20:48:41.631 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:48:41.631 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[553, 446, 608, 536],carFrame1:[405, 459, 444, 531],carFrame1:[573, 464, 611, 550],carFrame1:[608, 460, 709, 601],carFrame1:[589, 464, 635, 568],carFrame1:[215, 454, 344, 619],carFrame1:[000, 444, 261, 723] in this video clip.Your response should conform strictly to the categoryFrame t:[xmin,ymin,xmax,ymax] format.', 'carFrame1:[553, 446, 608, 536];Frame2:[559, 419, 622, 525],carFrame1:[405, 459, 444, 531];Frame2:[379, 432, 430, 514],carFrame1:[573, 464, 611, 550];Frame2:[582, 441, 639, 547],carFrame1:[608, 460, 709, 601];Frame2:[641, 434, 844, 644],carFrame1:[589, 464, 635, 568];Frame2:[605, 441, 688, 581],carFrame1:[215, 454, 344, 619];Frame2:[023, 412, 276, 662],carFrame1:[000, 444, 261, 723];Frame2:[000, 515, 080, 771].') 2023-11-03 20:48:52.579 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:48:52.579 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the the feline outside the entrance, abruptly dashing indoors.?To ensure accuracy, apply the TrackiFrame t:[xmin,ymin,xmax,ymax] template for every trajectory in your response.', 'There is one the feline outside the entrance, abruptly dashing indoors..Track1frame:1:[757, 501, 804, 568];frame:2:[757, 501, 804, 568];frame:3:[735, 477, 793, 690].') 2023-11-03 20:49:02.685 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:49:02.686 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Bench in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Bench:[459, 725, 614, 856],[001, 774, 229, 881].') 2023-11-03 20:49:09.038 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:49:09.039 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'plant:[046, 670, 999, 999].') 2023-11-03 20:49:13.426 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:49:13.426 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Lamp:[296, 084, 412, 347],[032, 398, 138, 530];Person:[023, 525, 058, 571],[135, 507, 224, 619],[316, 521, 361, 567],[544, 464, 620, 759],[737, 415, 917, 991],[792, 225, 895, 431],[891, 354, 972, 662];Hat:[803, 424, 853, 475].') 2023-11-03 20:49:24.206 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:49:24.206 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[346, 368, 578, 694] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a gecko creeping in waterFrame1:[346, 368, 578, 694];Frame2:[339, 380, 574, 706];Frame3:[328, 383, 563, 702];Frame4:[307, 381, 542, 705].') 2023-11-03 20:49:33.506 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:49:33.506 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Cup:[275, 790, 368, 987];Person:[627, 374, 965, 999],[366, 093, 658, 1000],[002, 350, 111, 999],[007, 202, 406, 1000];Ballon:[096, 196, 167, 334],[057, 328, 133, 464],[076, 446, 149, 583].') 2023-11-03 20:49:56.546 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:49:56.546 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'jodie foster [047, 100, 969, 990] at the 1995 academy awards arrivals at the shrine auditorium in los angeles, california on march 27, 1995. - 67th annual academy awards stock videos & royalty-free footage') 2023-11-03 20:50:06.233 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:50:06.233 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The healthy pineapple upside down cake [250, 076, 770, 998] that takes 90 seconds to cook') 2023-11-03 20:50:09.556 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:50:09.556 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Ian Ashford [002, 249, 362, 998] with his animatronic witch [497, 148, 995, 996] for Halloween, which is catching the attention of shoppers outside his Castle Costumes shop in Norfolk Street [002, 735, 717, 996].') 2023-11-03 20:50:13.136 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:50:13.137 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The designers [264, 306, 593, 995] with a squad [672, 163, 861, 993] [501, 178, 709, 997] [000, 157, 155, 998] [783, 191, 923, 995] [101, 242, 302, 994] [875, 187, 997, 998] of underwear-clad models (Photo: Parke & Ronen).') 2023-11-03 20:50:28.378 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:50:28.379 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[259, 156, 578, 568] and track its trajectory.For clarity, represent trajectories using the Frame t:[xmin,ymin,xmax,ymax] format in your response.', 'This is a manateeFrame1:[259, 156, 578, 568];Frame2:[246, 156, 583, 564];Frame3:[239, 154, 594, 558];Frame4:[234, 156, 621, 555].') 2023-11-03 20:50:48.157 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:50:48.157 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Person:[293, 610, 326, 716],[728, 844, 758, 955],[954, 830, 986, 953],[662, 561, 694, 684],[612, 503, 668, 639],[469, 512, 526, 581],[683, 079, 721, 170],[715, 068, 754, 166],[557, 157, 587, 240],[580, 149, 618, 229];Street Lights:[029, 540, 081, 914],[004, 755, 033, 890];Motorcycle:[908, 879, 964, 944];Horse:[257, 623, 329, 746],[635, 529, 668, 699].') 2023-11-03 20:50:58.123 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:50:58.123 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[644, 523, 747, 637] and track its trajectory.Stick to the Frame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'This is a border collieFrame1:[644, 523, 747, 637];Frame2:[634, 518, 703, 638];Frame3:[635, 482, 676, 617];Frame4:[629, 466, 668, 604];Frame5:[628, 462, 666, 600].') 2023-11-03 20:51:01.390 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:51:01.391 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2 and frame3, please tell me what this oneframe:2:[000, 489, 030, 752];frame:3:[000, 383, 103, 723] is doing?With reference to the supplied image frames and trajectory coordinates, please formulate a concise response.', 'This is a/an car driving forward on road next to bicycles') 2023-11-03 20:51:06.367 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:51:06.367 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect clothing,man,tree and food in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'clothing:[015, 174, 237, 1000],[121, 392, 226, 879],[133, 299, 231, 613],[194, 279, 307, 470],[215, 307, 510, 999],[421, 278, 681, 968];man:[143, 128, 232, 617],[196, 196, 368, 946],[418, 196, 681, 999];tree:[323, 000, 999, 477];food:[554, 705, 720, 804],[686, 811, 792, 914],[765, 704, 806, 812],[771, 844, 947, 999].') 2023-11-03 20:51:11.015 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:51:11.016 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'wheel:[251, 601, 321, 840],[128, 460, 161, 619],[841, 578, 968, 784];tire:[846, 542, 968, 773],[124, 470, 159, 619],[258, 596, 325, 848];vehicle registration plate:[587, 528, 676, 595];car:[000, 315, 085, 998],[126, 209, 802, 837],[737, 246, 999, 797].') 2023-11-03 20:51:26.954 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:51:26.954 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'man:[832, 410, 946, 767],[000, 410, 140, 720],[179, 395, 712, 746],[712, 406, 815, 731];clothing:[829, 443, 939, 717],[039, 451, 140, 631],[220, 434, 318, 606],[235, 579, 309, 710],[320, 442, 409, 625],[385, 443, 500, 710],[476, 454, 560, 621],[543, 484, 639, 650],[612, 454, 700, 620],[720, 454, 810, 632].') 2023-11-03 20:51:29.345 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:51:29.346 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[125, 153, 547, 729],carFrame1:[765, 196, 999, 788],carFrame1:[577, 352, 644, 441],carFrame1:[774, 342, 857, 445],carFrame1:[720, 349, 792, 435] in this video clip.When answering, maintain the format classFrame t:[xmin,ymin,xmax,ymax] without deviation.', 'carFrame1:[125, 153, 547, 729];Frame2:[124, 157, 547, 729],carFrame1:[765, 196, 999, 788];Frame2:[767, 200, 999, 788],carFrame1:[577, 352, 644, 441];Frame2:[576, 352, 643, 441],carFrame1:[774, 342, 857, 445];Frame2:[766, 346, 858, 450],carFrame1:[720, 349, 792, 435];Frame2:[720, 349, 791, 435].') 2023-11-03 20:51:29.890 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:51:29.891 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Bakset:[727, 634, 819, 700],[410, 527, 529, 583],[408, 610, 546, 669];Satchel:[053, 589, 142, 667];Person:[175, 273, 427, 818];Cabinet:[325, 356, 521, 617],[637, 477, 745, 610],[000, 640, 185, 831],[761, 338, 1000, 618];Storage box:[402, 554, 441, 599],[407, 534, 500, 582],[436, 529, 529, 574],[405, 610, 545, 668],[623, 728, 691, 774],[417, 139, 495, 209],[419, 201, 493, 269],[510, 218, 547, 289],[494, 172, 539, 227];Frame:[561, 312, 602, 417];Barrel:[733, 917, 803, 999];Scale:[000, 456, 191, 1000].') 2023-11-03 20:52:40.937 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:52:40.938 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'woman:[430, 366, 555, 943],[856, 435, 911, 633];girl:[003, 400, 173, 947],[599, 503, 674, 749];footwear:[087, 895, 166, 947],[173, 811, 245, 835].') 2023-11-03 20:52:46.478 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:52:46.478 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'This week\'s book blurb [002, 004, 996, 992]: "Mommy\'s Khimar"') 2023-11-03 20:53:03.228 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:53:03.229 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Bicycle:[002, 639, 109, 998];Trash bin Can:[002, 532, 145, 956].') 2023-11-03 20:53:08.747 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:53:08.747 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[296, 529, 365, 643] and track its trajectory.Any trajectory details should be arranged according to the Frame t:[xmin,ymin,xmax,ymax] format for clarity.', 'This is a hudson bay collared lemmingFrame1:[296, 529, 365, 643];Frame2:[275, 506, 341, 602];Frame3:[279, 537, 345, 625];Frame4:[285, 545, 341, 639];Frame5:[265, 518, 317, 600].') 2023-11-03 20:53:25.878 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:53:25.879 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Picture:[109, 001, 238, 248],[708, 004, 845, 257],[956, 056, 1000, 198];Cabinet:[764, 355, 1000, 878];Person:[280, 197, 491, 1000],[256, 089, 731, 1000],[134, 049, 196, 208],[744, 066, 790, 218],[774, 040, 815, 217],[957, 304, 999, 398];Necklace:[522, 288, 600, 452];Glasses:[377, 192, 473, 247],[487, 556, 524, 660];Cup:[315, 210, 352, 298],[386, 087, 447, 193],[598, 092, 648, 202];Bottle:[070, 417, 107, 519];Moniter:[272, 020, 475, 219],[474, 016, 679, 219];Ice cream:[441, 396, 492, 486],[590, 048, 646, 117],[392, 046, 442, 116];Cake:[213, 218, 253, 268].') 2023-11-03 20:53:30.707 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:53:30.707 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'boy:[601, 008, 999, 822];toy:[151, 567, 273, 671],[160, 461, 250, 554],[211, 365, 286, 446],[218, 000, 286, 064],[316, 314, 393, 385];man:[000, 000, 306, 579];dog:[000, 158, 680, 999];human face:[576, 000, 755, 267].') 2023-11-03 20:54:44.848 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:54:44.848 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, can you tell me what thisframe:1:[838, 114, 998, 708];frame:2:[852, 102, 997, 708];frame:3:[852, 116, 998, 706] is?Given the image frames and trajectory coordinates, please respond succinctly to the question.Considering the image frames and trajectory coordinates provided, offer a brief answer.', 'This is a/an man standing next to the car door and turning slightly') 2023-11-03 20:54:47.088 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 20:54:47.089 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect Motorcycle in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Motorcycle:[547, 363, 706, 731],[677, 350, 792, 614],[730, 419, 916, 719].') 2023-11-03 20:54:53.028 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:54:53.029 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[880, 550, 951, 826] [121, 585, 179, 764] [918, 556, 993, 833] [249, 536, 365, 917] [164, 613, 239, 769] [651, 541, 719, 841] [039, 565, 106, 849] [443, 571, 498, 759] [722, 538, 811, 821] The girls volleyball teams work on hitting and bumping during a recent practice.') 2023-11-03 20:55:29.558 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:55:29.559 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\ndetect Laptop,Person and Handbag/Satchel in this image.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Laptop:[546, 716, 760, 931];Person:[854, 611, 999, 1000],[538, 401, 953, 999],[384, 365, 769, 998],[205, 325, 511, 924],[002, 403, 305, 951],[002, 406, 084, 798];Satchel:[555, 884, 723, 992].') 2023-11-03 20:56:03.824 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:56:03.825 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Hat:[002, 239, 044, 337];Formula 1 :[077, 515, 343, 693],[406, 442, 597, 596];Person:[002, 239, 130, 999].') 2023-11-03 20:56:31.531 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:56:31.532 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Soccer:[749, 726, 870, 839];Person:[068, 563, 850, 949],[053, 175, 448, 742],[345, 101, 592, 765],[301, 071, 893, 732];Sneakers:[710, 643, 850, 729],[399, 893, 518, 948].') 2023-11-03 20:56:32.306 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:56:32.307 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'My Favorite Things GIVEAWAY 2015! From a label maker and clear elastic hair ties, to a bluetooth waterproof speaker [039, 260, 255, 515] and body buttercream and more!!') 2023-11-03 20:56:41.111 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:56:41.112 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'How to use a Type G power charger [333, 375, 440, 586] for recharging your iPhone 4S from a St Helenian power outlet') 2023-11-03 20:56:47.965 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:56:47.966 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The newly-opened motorway [003, 332, 994, 668] running alongside the main Lancaster to Carlisle railway line. Photo - Laing’s 30 minute documentary/BFI') 2023-11-03 20:56:58.944 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:56:58.944 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[274, 198, 360, 440] [452, 161, 534, 615] [333, 218, 415, 542] [516, 118, 704, 839] [369, 177, 525, 739] [117, 146, 349, 998] Commuters walk along [046, 416, 997, 995] the platform at a tube station in London on April 22, 2020, as Britain remains under lockdown during the COVID-19 pandemic. AFP') 2023-11-03 20:57:05.459 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 20:57:05.460 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Stop Sign:[201, 551, 277, 657];Car:[240, 740, 310, 812],[298, 741, 341, 795],[328, 740, 354, 787],[375, 737, 435, 804],[447, 745, 528, 839],[952, 835, 999, 1000];Bus:[384, 692, 410, 738];Street Lights:[486, 454, 558, 843],[488, 590, 525, 771].') 2023-11-03 20:57:42.398 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 20:57:42.398 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'wheel:[320, 573, 398, 750],[235, 480, 279, 569];tire:[310, 572, 433, 761],[235, 472, 258, 554];car:[073, 278, 999, 996].') 2023-11-03 20:58:22.791 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:58:22.791 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect submarine,woman,building and man in this image.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'woman:[922, 666, 983, 848],[000, 464, 012, 579],[004, 485, 030, 614],[036, 487, 098, 685],[169, 512, 212, 707],[306, 540, 333, 688],[319, 533, 352, 698],[572, 588, 605, 738],[644, 581, 669, 707],[733, 616, 784, 781],[794, 646, 855, 809],[863, 635, 905, 827];building:[002, 000, 340, 520],[876, 011, 998, 711];man:[466, 577, 591, 983],[012, 462, 059, 622],[079, 483, 122, 637],[091, 487, 144, 662],[134, 488, 166, 651],[145, 492, 188, 679],[198, 501, 245, 679],[225, 527, 276, 703],[280, 514, 320, 711],[352, 529, 390, 685],[429, 538, 458, 627],[769, 629, 830, 811],[833, 638, 894, 837].') 2023-11-03 20:58:49.680 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:58:49.680 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Traffic Light:[415, 438, 431, 502],[463, 353, 485, 439],[810, 032, 852, 175],[375, 533, 392, 567],[128, 225, 151, 310],[152, 452, 168, 519];Pickup Truck:[946, 478, 1000, 560];SUV:[886, 506, 969, 591],[517, 591, 680, 730];Car:[814, 534, 919, 624],[685, 570, 806, 672];Person:[515, 613, 557, 756],[217, 608, 235, 679].') 2023-11-03 20:59:03.342 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:59:03.342 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'football helmet:[019, 021, 166, 212],[581, 022, 703, 219];person:[000, 007, 436, 999],[047, 140, 446, 999],[315, 476, 371, 846],[342, 349, 406, 519],[366, 446, 502, 911],[379, 357, 438, 492],[430, 045, 873, 999],[461, 333, 530, 650],[507, 503, 607, 908],[616, 432, 739, 930],[706, 373, 778, 757],[710, 348, 786, 436],[788, 359, 871, 776],[831, 442, 943, 895],[917, 361, 991, 523],[933, 397, 996, 895];plant:[000, 857, 080, 999],[149, 895, 318, 999],[368, 884, 451, 999],[478, 882, 833, 999],[788, 811, 999, 999].') 2023-11-03 20:59:11.954 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:59:11.954 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[340, 268, 561, 855] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a puttererFrame1:[340, 268, 561, 855];Frame2:[296, 268, 491, 913];Frame3:[136, 263, 404, 999];Frame4:[058, 257, 402, 999];Frame5:[049, 255, 407, 999].') 2023-11-03 20:59:24.481 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:59:24.481 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'skyscraper:[000, 000, 554, 999].') 2023-11-03 20:59:25.459 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 20:59:25.460 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Sneakers in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Sneakers:[870, 622, 971, 661],[897, 649, 999, 694].') 2023-11-03 20:59:47.646 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 20:59:47.647 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the scared monkeys running away?Use the specified TrackiFrame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'There are 4 scared monkeys running away.Track1frame:1:[035, 248, 128, 470];frame:2:[084, 251, 161, 450];frame:3:[144, 225, 262, 459],Track2frame:1:[106, 153, 313, 427];frame:2:[191, 157, 373, 427];frame:3:[262, 209, 484, 450],Track3frame:1:[185, 192, 396, 485];frame:2:[236, 190, 433, 500];frame:3:[338, 216, 490, 516],Track4frame:1:[399, 109, 512, 375];frame:2:[460, 133, 559, 396];frame:3:[544, 164, 636, 412].') 2023-11-03 21:00:06.524 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:00:06.524 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect ladle,man and clothing in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'man:[054, 392, 293, 723],[252, 321, 706, 958],[650, 354, 952, 891];clothing:[049, 508, 290, 925],[050, 723, 310, 962],[256, 482, 717, 961],[665, 505, 964, 799],[692, 781, 961, 961].') 2023-11-03 21:00:16.065 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:00:16.066 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Stool:[193, 866, 249, 918];Cabinet:[388, 663, 507, 771],[740, 634, 873, 874];Desk:[322, 811, 452, 935],[571, 642, 630, 812];Frame:[569, 443, 587, 495],[586, 443, 627, 534];Person:[483, 056, 696, 302],[617, 504, 723, 961],[456, 670, 596, 946],[205, 678, 296, 919];Necklace:[651, 576, 674, 654];Awning:[217, 379, 892, 442],[000, 183, 233, 259];Other Shoes:[457, 921, 502, 946];Handbag:[490, 621, 522, 703],[492, 625, 555, 740];Basin:[330, 793, 356, 816],[355, 788, 385, 813],[384, 769, 440, 809],[428, 771, 463, 815],[419, 701, 468, 747],[387, 626, 442, 666];Pot:[433, 594, 515, 669];Bottle:[500, 091, 607, 155].') 2023-11-03 21:00:20.928 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:00:20.929 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[393, 499, 434, 558],personFrame1:[637, 449, 673, 673],carFrame1:[601, 483, 640, 551],carFrame1:[000, 513, 139, 726],carFrame1:[421, 471, 514, 621],carFrame1:[484, 486, 643, 703],carFrame1:[199, 470, 256, 573] in this video clip.Keep your response consistent with the classFrame t:[xmin,ymin,xmax,ymax] format.', 'carFrame1:[393, 499, 434, 558];Frame2:[393, 499, 434, 558],personFrame1:[637, 449, 673, 673];Frame2:[637, 449, 673, 673],carFrame1:[601, 483, 640, 551];Frame2:[601, 483, 640, 551],carFrame1:[000, 513, 139, 726];Frame2:[000, 513, 139, 727],carFrame1:[421, 471, 514, 621];Frame2:[421, 471, 514, 621],carFrame1:[484, 486, 643, 703];Frame2:[484, 486, 643, 703],carFrame1:[199, 470, 256, 573];Frame2:[199, 470, 256, 573].') 2023-11-03 21:00:50.557 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:00:50.557 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'person:[233, 676, 351, 986],[006, 042, 140, 315],[014, 338, 933, 996],[111, 049, 990, 240].') 2023-11-03 21:01:21.237 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:01:21.237 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[139, 384, 547, 476] and track its trajectory.Ensure the trajectories in your answer follow the Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a stealth bomberFrame1:[139, 384, 547, 476];Frame2:[131, 391, 554, 489];Frame3:[109, 394, 540, 501];Frame4:[100, 394, 558, 501].') 2023-11-03 21:01:28.348 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:01:28.348 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Knit every day sweaters [070, 283, 238, 632] [681, 590, 766, 810] for dogs [064, 187, 269, 659] [669, 501, 822, 976] [553, 162, 724, 463] [318, 430, 488, 652]') 2023-11-03 21:01:58.899 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:01:58.900 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Lamp:[593, 166, 862, 269],[211, 205, 418, 292];Mirror:[593, 243, 896, 543],[226, 269, 451, 520];shelf:[040, 589, 411, 888],[409, 617, 995, 985];TV:[423, 498, 573, 606].') 2023-11-03 21:02:22.422 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:02:22.423 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[053, 367, 829, 887] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a hammerheadFrame1:[053, 367, 829, 887];Frame2:[076, 366, 801, 904];Frame3:[073, 379, 815, 862];Frame4:[122, 342, 879, 912];Frame5:[277, 330, 852, 765].') 2023-11-03 21:02:27.768 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:02:27.768 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Hat in this image.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Hat:[715, 138, 999, 276],[105, 102, 719, 328],[001, 275, 101, 336].') 2023-11-03 21:02:28.802 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:02:28.802 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "[580, 227, 671, 575] TOPSHOT Policemen beat [073, 224, 170, 546] a demonstrator during a rally of former Ivory Coast President Laurent Gbagbo's supporters to protest against his absence on...") 2023-11-03 21:02:40.090 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:02:40.090 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Person:[450, 318, 813, 820],[409, 225, 688, 496],[203, 040, 381, 374];Desk:[056, 191, 787, 389];Slippers:[608, 812, 713, 845],[277, 357, 349, 374],[225, 354, 281, 371];Bakset:[954, 751, 1000, 939];Handbag:[214, 179, 286, 221],[037, 536, 161, 619].') 2023-11-03 21:02:44.145 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:02:44.146 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Bicycle:[514, 385, 850, 804],[317, 312, 592, 645],[022, 276, 185, 441];Person:[029, 202, 128, 402],[254, 197, 471, 573],[551, 239, 815, 713].') 2023-11-03 21:03:18.215 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:03:18.216 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'pumpkin:[226, 623, 433, 793],[076, 518, 290, 669],[281, 488, 463, 633],[379, 413, 513, 501],[429, 581, 618, 730],[477, 332, 611, 423],[502, 414, 664, 548],[584, 518, 735, 618],[611, 315, 955, 493],[652, 445, 810, 568],[752, 253, 999, 385];fruit:[120, 747, 267, 791];plant:[000, 000, 059, 090],[000, 198, 125, 291],[000, 540, 370, 999],[000, 035, 500, 257],[148, 000, 266, 059],[280, 000, 380, 040],[373, 000, 717, 176],[676, 000, 925, 161],[833, 000, 999, 091].') 2023-11-03 21:03:23.696 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:03:23.696 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track faucetFrame1:[293, 551, 510, 856],personFrame1:[562, 138, 1000, 1000],personFrame1:[461, 129, 1000, 869] in this video clip.Please strictly follow the format categoryFrame t:[xmin,ymin,xmax,ymax] to answer the question.', 'faucetFrame1:[293, 551, 510, 856];Frame2:[289, 516, 510, 856],personFrame1:[562, 138, 1000, 1000];Frame2:[544, 169, 1000, 1000],personFrame1:[461, 129, 1000, 869];Frame2:[446, 115, 1000, 865].') 2023-11-03 21:03:45.977 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:03:45.978 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the person spray painting the painting on the ground?The TrackiFrame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'There is one person spray painting the painting on the ground.Track1frame:1:[605, 002, 998, 847];frame:2:[598, 002, 998, 847];frame:3:[584, 002, 998, 843].') 2023-11-03 21:03:47.733 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 21:03:47.734 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Person:[054, 001, 926, 940];High Heels:[054, 269, 595, 937],[497, 222, 926, 911].') 2023-11-03 21:04:09.158 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 21:04:09.158 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Chair:[596, 202, 660, 228],[759, 201, 821, 250];Car:[039, 077, 965, 926],[102, 122, 263, 253],[394, 123, 439, 208];Person:[865, 108, 906, 220],[689, 153, 742, 215],[595, 150, 631, 218],[503, 073, 524, 135],[534, 062, 557, 162],[115, 118, 154, 198],[000, 052, 043, 605].') 2023-11-03 21:04:18.458 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:04:18.458 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Lamp:[001, 934, 108, 1000],[001, 789, 026, 877];Person:[871, 324, 960, 569],[627, 245, 999, 997],[664, 359, 695, 416],[491, 359, 521, 407],[507, 162, 728, 996],[290, 147, 553, 993],[035, 060, 354, 999];Bench:[951, 002, 1000, 100],[833, 002, 893, 132],[721, 033, 774, 155],[624, 055, 680, 163],[531, 081, 584, 189],[454, 100, 504, 203],[004, 157, 050, 240],[066, 157, 092, 240],[077, 253, 107, 296],[039, 243, 071, 282],[013, 235, 041, 273],[127, 156, 157, 242],[156, 186, 189, 233],[313, 135, 350, 234],[729, 161, 761, 221],[768, 160, 801, 218],[800, 154, 828, 213];Bed:[840, 068, 1000, 228];Gas stove:[336, 667, 618, 896].') 2023-11-03 21:04:19.290 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:04:19.290 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'tie [111, 198, 896, 847] embroidery floss to make a tassel') 2023-11-03 21:04:27.970 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:04:27.970 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track bicycleFrame1:[103, 486, 167, 550] in this video clip.Use only the categoryFrame t:[xmin,ymin,xmax,ymax] format to structure your answer.', 'bicycleFrame1:[103, 486, 167, 550];Frame2:[103, 486, 167, 548].') 2023-11-03 21:04:28.859 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 21:04:28.860 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Frame:[052, 310, 193, 446],[001, 293, 025, 460];Storage box:[373, 638, 608, 750],[229, 814, 484, 1000];Person:[216, 367, 345, 832],[338, 265, 478, 656],[261, 623, 435, 852],[639, 659, 726, 725],[388, 695, 549, 927],[471, 832, 644, 998],[635, 710, 836, 999],[599, 690, 795, 999],[001, 848, 047, 975];Hat:[369, 265, 432, 314].') 2023-11-03 21:04:41.416 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:04:41.417 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'High Heels:[662, 797, 944, 911],[352, 738, 416, 813];Person:[219, 113, 949, 913];TV:[038, 354, 124, 401];Lamp:[073, 132, 193, 278];Desk:[492, 323, 861, 507].') 2023-11-03 21:05:05.347 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:05:05.348 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The MIller Planisphere [067, 115, 929, 878], a tool for viewing what the night sky looks like on any day of the year') 2023-11-03 21:05:50.214 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:05:50.214 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Cow in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Cow:[362, 572, 920, 999],[001, 458, 165, 639],[001, 520, 251, 871],[205, 280, 411, 556],[602, 266, 669, 356],[608, 375, 834, 574],[441, 385, 628, 518],[847, 239, 939, 411],[821, 411, 980, 580],[939, 290, 998, 418],[717, 374, 857, 510].') 2023-11-03 21:06:00.475 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:06:00.476 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[472, 646, 500, 737],[416, 666, 443, 744];Flag:[820, 330, 842, 388];Awning:[711, 399, 1000, 537],[542, 520, 734, 617];Storage box:[728, 705, 778, 743];bucket:[314, 651, 346, 713];Motorcycle:[295, 709, 429, 997],[401, 678, 476, 747],[457, 673, 523, 728],[557, 653, 586, 710],[652, 668, 687, 728];Car:[188, 639, 314, 824],[357, 637, 417, 725];Pickup Truck:[289, 650, 375, 753];Truck:[506, 643, 565, 721],[897, 521, 1000, 824].') 2023-11-03 21:06:29.023 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:06:29.024 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3,frame4 and frame5,what is the track of the man standing and waiting?When detailing trajectories in your response, adhere to the TrackiFrame t:[xmin,ymin,xmax,ymax] format.', 'There is one man standing and waiting.Track1frame1:[820, 547, 1000, 1000];frame2:[820, 547, 1000, 1000].') 2023-11-03 21:06:48.939 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:06:48.940 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[389, 334, 755, 776] and track its trajectory.Your response should highlight trajectories using the established Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a cheviotFrame1:[389, 334, 755, 776];Frame2:[382, 328, 754, 769];Frame3:[398, 328, 748, 791];Frame4:[401, 329, 740, 807].') 2023-11-03 21:06:55.472 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:06:55.473 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[560, 287, 600, 471] and track its trajectory.Ensure that each trajectory within your response is articulated using the Frame t:[xmin,ymin,xmax,ymax] specification.', 'This is a personFrame1:[560, 287, 600, 471];Frame2:[551, 362, 597, 466];Frame3:[551, 374, 591, 477];Frame4:[552, 363, 596, 476];Frame5:[559, 352, 602, 475].') 2023-11-03 21:07:06.367 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 21:07:06.368 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Helmet,Sneakers and Gloves in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Helmet:[346, 247, 418, 342],[623, 003, 702, 079],[794, 043, 851, 105];Sneakers:[781, 556, 819, 615],[646, 579, 710, 672],[697, 596, 779, 697],[297, 599, 368, 715];Gloves:[617, 315, 661, 385],[667, 150, 710, 250],[766, 272, 814, 327],[764, 170, 816, 230].') 2023-11-03 21:07:15.335 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:07:15.335 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'After 395 days, international travel between New Zealand and Australia has opened, in which [100, 087, 364, 598] [630, 152, 977, 998] travellers were warmly greeted at Wellington airport.') 2023-11-03 21:07:18.307 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:07:18.307 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'bicycle:[485, 656, 578, 751];person:[483, 605, 586, 715];tree:[000, 000, 460, 071],[000, 045, 473, 145],[000, 055, 478, 410],[072, 159, 381, 420],[130, 236, 314, 444],[244, 297, 347, 509],[283, 350, 383, 523],[298, 273, 421, 595],[381, 185, 504, 666],[419, 302, 519, 632],[512, 090, 804, 836],[522, 311, 623, 611],[558, 000, 999, 320],[576, 369, 666, 676],[666, 260, 999, 680],[694, 123, 999, 374],[700, 241, 831, 763],[806, 361, 903, 802],[856, 414, 939, 751],[905, 345, 999, 523],[936, 532, 999, 751].') 2023-11-03 21:07:24.893 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:07:24.893 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[361, 233, 804, 995] Diane Lane wearing [419, 382, 801, 721] a striped tunic and [385, 689, 703, 924] red skirt; circa 1970; New York.') 2023-11-03 21:07:44.722 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:07:44.722 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'cattle:[406, 117, 703, 890],[130, 032, 201, 180],[243, 040, 338, 215],[503, 055, 561, 203],[881, 155, 999, 492].') 2023-11-03 21:07:50.003 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:07:50.003 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect car and girl in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'car:[638, 130, 999, 463];girl:[333, 195, 464, 759].') 2023-11-03 21:08:09.243 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:08:09.243 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'man:[325, 851, 486, 981],[006, 424, 116, 500],[556, 397, 621, 489],[678, 697, 810, 826],[686, 183, 745, 285],[839, 344, 991, 489],[841, 834, 898, 975],[841, 171, 997, 332];food:[209, 704, 264, 789],[539, 040, 659, 165];girl:[232, 391, 309, 493],[346, 504, 498, 657],[382, 010, 496, 161],[384, 257, 413, 320],[419, 246, 443, 326],[439, 253, 464, 324];clothing:[185, 771, 329, 826],[221, 424, 244, 491],[238, 436, 315, 495],[340, 008, 470, 165],[344, 906, 417, 995],[344, 516, 494, 663],[376, 114, 492, 167],[417, 257, 437, 320],[441, 263, 464, 326],[462, 246, 501, 322],[688, 881, 790, 993],[690, 730, 798, 828],[835, 504, 993, 659],[912, 532, 991, 642];human face:[262, 687, 338, 802],[342, 348, 498, 491],[382, 851, 448, 951],[456, 028, 498, 138],[562, 716, 661, 824],[692, 193, 741, 267],[731, 344, 826, 497],[835, 012, 924, 165],[841, 857, 887, 961],[845, 344, 993, 497],[885, 181, 995, 279],[900, 512, 973, 567].') 2023-11-03 21:09:06.895 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:09:06.895 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'person:[251, 076, 850, 999],[788, 127, 916, 346].') 2023-11-03 21:09:27.560 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:09:27.560 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the monkey shifting to the right-side.For the trajectories included in the answer, please use the format TrackiFrame t:[xmin,ymin,xmax,ymax].', 'There is no monkey shifting to the right-side.') 2023-11-03 21:09:31.313 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:09:31.314 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Other Shoes:[293, 836, 423, 920];Sandals:[753, 869, 874, 938];Flower:[143, 378, 485, 521];Frame:[481, 195, 584, 262];Cabinet:[599, 247, 998, 405];Person:[071, 217, 681, 954],[344, 232, 675, 946],[592, 236, 981, 945].') 2023-11-03 21:10:07.833 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:10:07.833 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'window:[011, 350, 076, 486],[000, 138, 073, 265],[002, 000, 070, 036],[157, 000, 221, 086],[161, 183, 224, 285],[166, 373, 221, 481],[293, 021, 353, 126],[298, 203, 345, 293],[298, 383, 345, 475],[401, 060, 443, 160],[403, 381, 442, 470],[404, 216, 444, 311],[486, 248, 530, 335],[488, 393, 523, 471],[490, 100, 526, 171],[560, 125, 595, 201],[563, 395, 600, 471],[564, 265, 591, 338];house:[000, 000, 710, 566],[633, 108, 998, 540].') 2023-11-03 21:10:40.007 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:10:40.007 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Cabinet:[243, 206, 550, 741];Desk:[272, 526, 731, 904];Chair:[254, 688, 573, 999],[572, 499, 724, 749],[603, 522, 768, 842],[672, 556, 861, 872],[812, 613, 999, 985];Frame:[405, 586, 628, 658],[746, 559, 916, 844],[896, 591, 959, 688],[763, 552, 918, 835];Carpet:[223, 687, 832, 999];Lamp:[654, 154, 700, 218].') 2023-11-03 21:11:20.437 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:11:20.438 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[561, 429, 653, 887] and track its trajectory.Use the specified Frame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'This is a personFrame1:[561, 429, 653, 887];Frame2:[596, 398, 692, 909];Frame3:[596, 405, 711, 943];Frame4:[608, 431, 707, 955].') 2023-11-03 21:11:48.858 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:11:48.859 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[544, 116, 603, 390] [149, 045, 255, 595] [046, 036, 181, 638] [599, 148, 666, 373] [001, 004, 086, 672] [911, 156, 974, 448] [322, 260, 429, 505] [734, 135, 813, 440] [664, 152, 744, 437] Students and [800, 096, 901, 454] teachers watching as [881, 528, 977, 606] [814, 663, 930, 737] [688, 749, 820, 842] [539, 787, 680, 942] student-built model cars take off from the starting line.') 2023-11-03 21:12:36.618 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:12:36.618 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Couch:[717, 627, 996, 817],[001, 534, 461, 890];Carpet:[001, 696, 994, 998];Pillow:[158, 528, 350, 634],[206, 536, 424, 638];Chair:[272, 482, 381, 552],[095, 478, 209, 550],[580, 515, 719, 628],[360, 476, 435, 578];Flower:[216, 448, 274, 482],[235, 412, 356, 453];Desk:[363, 460, 616, 566],[077, 479, 455, 579];Cabinet:[671, 330, 746, 504];Lamp:[224, 331, 345, 417];Person:[616, 628, 958, 794],[015, 500, 437, 822],[323, 127, 494, 354];Skating and Skiing shoes:[343, 804, 489, 875],[455, 772, 573, 835];Leather Shoes:[433, 310, 485, 348];Dog:[577, 619, 720, 739];Bakset:[631, 598, 787, 678];Side Table:[432, 609, 583, 735],[505, 648, 632, 754].') 2023-11-03 21:14:27.266 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:14:27.267 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'People [651, 515, 694, 688] [151, 527, 213, 716] [535, 498, 576, 667] [001, 686, 084, 996] [873, 516, 935, 737] [783, 607, 885, 892] [579, 522, 627, 694] [944, 537, 997, 809] [196, 721, 270, 996] [096, 779, 219, 996] wait in line for a last-minute COVID-19 vaccine event at Seattle University after a freezer failure at a nearby hospital on January 29, 2021...') 2023-11-03 21:14:33.714 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:14:33.714 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'boy:[085, 040, 304, 672],[220, 443, 454, 792],[773, 133, 998, 999];clothing:[761, 240, 978, 970],[110, 105, 288, 655],[215, 482, 465, 798],[364, 033, 440, 205],[428, 130, 622, 529],[564, 251, 679, 725],[581, 155, 666, 267],[665, 272, 766, 794],[743, 385, 839, 879],[813, 057, 963, 190];plant:[000, 000, 752, 999].') 2023-11-03 21:14:52.875 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:14:52.876 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'woman:[024, 322, 041, 525],[051, 317, 076, 506],[061, 619, 151, 997],[100, 536, 128, 725],[131, 610, 180, 965],[173, 366, 199, 516],[248, 520, 341, 997],[293, 430, 331, 564],[339, 460, 360, 592],[396, 447, 473, 933],[545, 516, 599, 785],[863, 329, 895, 686];clothing:[000, 311, 021, 557],[020, 481, 077, 930],[021, 382, 038, 463],[046, 387, 054, 453],[050, 391, 074, 476],[061, 536, 086, 626],[063, 769, 145, 990],[073, 368, 083, 481],[095, 343, 132, 532],[121, 626, 164, 854],[126, 352, 142, 504],[137, 373, 178, 652],[150, 684, 175, 990],[174, 594, 253, 995],[195, 373, 231, 500],[255, 778, 342, 997],[319, 458, 336, 580],[320, 612, 337, 751],[331, 495, 352, 585],[370, 428, 400, 576],[398, 467, 433, 656],[400, 495, 431, 652],[415, 670, 466, 900],[429, 721, 550, 997],[480, 486, 498, 557],[500, 513, 519, 585],[519, 580, 551, 811],[543, 608, 592, 820],[557, 783, 686, 997],[643, 642, 697, 845],[680, 679, 780, 990],[774, 792, 820, 997],[816, 783, 905, 997],[861, 391, 895, 684],[920, 679, 999, 997],[993, 269, 999, 400].') 2023-11-03 21:15:13.274 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:15:13.275 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'woman:[062, 042, 521, 982],[373, 074, 999, 998],[621, 314, 999, 998];clothing:[057, 361, 518, 947],[372, 456, 799, 979],[630, 729, 999, 998];human face:[269, 092, 426, 423],[567, 163, 738, 528],[782, 415, 958, 818].') 2023-11-03 21:15:29.409 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:15:29.410 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track busFrame1:[465, 462, 523, 573],carFrame1:[254, 500, 320, 582],carFrame1:[564, 470, 701, 659] in this video clip.For clarity in your response, always adopt the classFrame t:[xmin,ymin,xmax,ymax] format.', 'busFrame1:[465, 462, 523, 573];Frame2:[468, 472, 517, 571],carFrame1:[254, 500, 320, 582];Frame2:[254, 498, 321, 583],carFrame1:[564, 470, 701, 659];Frame2:[533, 482, 632, 625].') 2023-11-03 21:15:44.080 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:15:44.081 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Winston Samuels [355, 033, 706, 997] with his wife Marilyn [149, 269, 465, 998], and their two sons [645, 138, 950, 996] Courtney (left) and Joel.') 2023-11-03 21:15:54.301 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:15:54.301 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Don Knight | The Herald Bulletin [463, 132, 707, 996] Lapel graduates toss their mortar board [319, 180, 384, 276] [733, 238, 783, 312] [613, 159, 667, 243]'s in the air at the conclusion of their commencement ceremony on Saturday.") 2023-11-03 21:15:56.069 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:15:56.069 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Street Lights in this image.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Street Lights:[748, 133, 781, 545],[372, 141, 397, 387],[014, 011, 055, 758].') 2023-11-03 21:16:04.587 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:16:04.588 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Person:[256, 522, 583, 998],[144, 324, 370, 795],[865, 693, 999, 971],[782, 352, 973, 639],[508, 210, 535, 246];Book:[756, 620, 908, 667],[773, 700, 932, 784],[482, 723, 654, 826];Backpack:[001, 499, 069, 599];Leather Shoes:[128, 603, 165, 634];Belt:[169, 698, 284, 741];Handbag:[483, 497, 575, 626];Glasses:[743, 741, 826, 788];Pencil:[777, 529, 809, 563];Blackboard:[927, 212, 967, 273];Cup:[726, 661, 770, 741],[803, 568, 834, 625],[617, 528, 643, 579],[563, 577, 638, 682],[323, 378, 350, 415];Plate:[680, 718, 746, 760].') 2023-11-03 21:16:19.837 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 21:16:19.838 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect woman and jeans in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'woman:[157, 600, 264, 910],[320, 595, 348, 866],[365, 581, 429, 918],[859, 556, 981, 858];jeans:[225, 695, 334, 954].') 2023-11-03 21:16:26.878 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:16:26.879 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'woman:[206, 062, 373, 500],[000, 257, 228, 949],[640, 010, 767, 239],[658, 152, 988, 637];man:[115, 253, 563, 972],[297, 121, 710, 622],[320, 415, 980, 999],[353, 142, 470, 302],[433, 003, 515, 253],[546, 030, 676, 301],[706, 049, 883, 345],[803, 000, 953, 134],[829, 026, 939, 330];wine:[900, 551, 988, 947],[243, 324, 288, 529],[453, 532, 497, 703],[501, 450, 551, 747],[874, 784, 897, 859],[956, 555, 995, 904];clothing:[111, 459, 613, 974],[000, 498, 143, 959],[213, 216, 365, 501],[326, 815, 981, 999],[433, 107, 516, 245],[520, 251, 710, 607],[648, 119, 748, 245],[672, 366, 988, 657],[828, 115, 938, 317],[927, 186, 999, 359];human face:[160, 276, 228, 500],[260, 088, 336, 228],[344, 170, 423, 305],[463, 015, 526, 124],[486, 161, 595, 363],[690, 034, 737, 142],[741, 192, 821, 432],[838, 042, 888, 150].') 2023-11-03 21:16:52.578 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:16:52.578 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Glasses:[260, 017, 378, 074];Hat:[257, 002, 384, 038];Towel:[640, 303, 770, 512];bucket:[257, 344, 391, 608];Car:[898, 267, 999, 420];Person:[111, 002, 495, 476],[498, 002, 887, 548],[737, 215, 778, 341],[764, 219, 848, 516],[788, 270, 874, 509],[857, 261, 934, 421],[535, 002, 1000, 762];Plate:[469, 140, 625, 302],[376, 474, 469, 559],[396, 556, 616, 682],[226, 479, 680, 727];Cup:[408, 357, 516, 543],[453, 412, 593, 663];Spoon:[492, 121, 626, 460];Bottle:[279, 400, 328, 652];Ice cream:[411, 334, 518, 393],[453, 405, 592, 467].') 2023-11-03 21:17:38.549 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:17:38.549 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[418, 533, 468, 602] in this video clip.Keep your response consistent with the classFrame t:[xmin,ymin,xmax,ymax] format.', 'carFrame1:[418, 533, 468, 602];Frame2:[408, 531, 459, 602].') 2023-11-03 21:17:54.702 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:17:54.703 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'human body:[000, 606, 285, 981],[226, 229, 517, 998],[446, 314, 825, 998];woman:[000, 506, 280, 1000],[217, 224, 528, 998],[449, 316, 822, 998],[608, 167, 721, 376],[713, 178, 851, 483];human hair:[563, 312, 753, 651],[000, 470, 247, 794],[332, 220, 502, 423];human head:[060, 487, 255, 758],[381, 227, 514, 450],[560, 314, 711, 585],[606, 184, 644, 259];clothing:[007, 715, 269, 998],[231, 423, 514, 998],[453, 593, 825, 998];human face:[148, 640, 244, 751],[426, 323, 500, 446],[589, 403, 691, 581];human arm:[041, 813, 149, 998],[200, 764, 292, 998],[219, 470, 475, 794],[436, 470, 521, 693],[446, 587, 704, 951],[698, 593, 825, 935].') 2023-11-03 21:18:26.554 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:18:26.555 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Lamp:[516, 251, 588, 577],[815, 188, 893, 463];Pillow:[935, 384, 998, 488],[678, 577, 759, 882],[735, 870, 854, 998],[423, 565, 735, 828],[437, 798, 691, 870],[392, 838, 755, 998];Couch:[873, 390, 997, 600];Bed:[693, 698, 996, 999].') 2023-11-03 21:18:34.476 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:18:34.476 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Helmet:[799, 154, 851, 259],[700, 253, 754, 340],[635, 240, 685, 329],[498, 263, 546, 357],[342, 217, 395, 297],[318, 177, 373, 253],[225, 213, 289, 309],[123, 215, 172, 285];Hat:[465, 106, 513, 177];Handbag:[327, 596, 429, 749];Person:[699, 155, 907, 637],[631, 251, 787, 683],[548, 243, 692, 707],[431, 268, 588, 741],[420, 109, 569, 419],[224, 091, 425, 523],[220, 129, 371, 627],[092, 173, 246, 622],[023, 269, 113, 503];Paddle:[723, 221, 972, 815],[700, 525, 832, 840],[467, 378, 713, 883],[595, 323, 644, 842],[267, 427, 318, 668].') 2023-11-03 21:18:49.044 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:18:49.044 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,what is thisFrame1:[503, 430, 782, 679] and track its trajectory.Incorporate the Frame t:[xmin,ymin,xmax,ymax] format when presenting trajectories in your response.', 'This is a horse cartFrame1:[503, 430, 782, 679];Frame2:[492, 404, 795, 668];Frame3:[485, 401, 817, 688];Frame4:[454, 408, 797, 700].') 2023-11-03 21:18:49.947 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:18:49.947 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you point out the trajectory of the human hand?The TrackiFrame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'There is one human hand.Track1frame:2:[784, 000, 999, 313].') 2023-11-03 21:18:51.688 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:18:51.688 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'person:[111, 590, 373, 855];chest of drawers:[423, 729, 911, 929];book:[000, 704, 122, 815],[000, 810, 411, 999],[045, 525, 111, 581],[237, 485, 306, 541],[288, 566, 375, 627],[401, 627, 468, 676],[421, 545, 486, 595],[630, 574, 714, 636],[661, 476, 735, 531];drawer:[431, 760, 580, 798],[431, 883, 585, 922],[434, 842, 588, 880],[439, 801, 580, 837],[588, 760, 734, 794],[588, 838, 739, 876],[593, 880, 742, 924],[596, 803, 732, 838],[744, 805, 886, 838],[744, 758, 893, 794],[744, 843, 904, 881],[752, 883, 901, 920].') 2023-11-03 21:19:04.309 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:19:04.309 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,please tell me what is thisFrame1:[295, 070, 417, 290] and track its trajectory.Ensure the trajectories in your answer follow the Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a footballFrame1:[295, 070, 417, 290];Frame2:[758, 408, 932, 702];Frame3:[742, 416, 921, 716];Frame4:[720, 450, 903, 759];Frame5:[712, 498, 897, 818].') 2023-11-03 21:19:21.128 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:19:21.128 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A family [543, 141, 762, 698] [441, 290, 644, 996] [126, 418, 378, 997] [548, 257, 780, 995] having fun together in the kitchen [006, 007, 993, 998]') 2023-11-03 21:19:21.999 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:19:21.999 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Sneakers:[734, 804, 770, 885],[371, 823, 406, 894],[300, 782, 346, 879],[221, 920, 272, 1000];Helmet:[631, 243, 709, 336],[358, 186, 445, 285],[213, 212, 291, 310];Hat:[435, 369, 479, 416];Bicycle:[614, 580, 775, 1000];Person:[157, 205, 348, 1000],[289, 186, 445, 979],[391, 368, 488, 983],[488, 696, 600, 992],[608, 242, 793, 996].') 2023-11-03 21:19:35.759 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:19:35.759 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[253, 238, 565, 397] and track its trajectory.The Frame t:[xmin,ymin,xmax,ymax] format should be employed for the trajectories within your answer.', 'This is a white airplane landing on groundFrame1:[253, 238, 565, 397];Frame2:[257, 219, 575, 380];Frame3:[239, 202, 559, 363];Frame4:[235, 200, 553, 361];Frame5:[231, 200, 556, 369].') 2023-11-03 21:20:05.667 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 21:20:05.667 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Crab:[346, 437, 876, 790].') 2023-11-03 21:20:29.809 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:20:29.810 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[205, 235, 778, 967];clothing:[164, 624, 881, 999];human face:[438, 318, 636, 714].') 2023-11-03 21:20:31.693 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:20:31.693 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'suit:[494, 278, 955, 700],[000, 414, 721, 999],[768, 433, 999, 999];man:[073, 233, 706, 974],[355, 340, 547, 608],[514, 355, 558, 432],[524, 231, 985, 999],[760, 168, 999, 999],[775, 290, 929, 369];glasses:[136, 304, 337, 351],[901, 282, 999, 323];human face:[207, 301, 343, 420],[500, 355, 538, 433],[523, 368, 563, 433],[533, 276, 658, 399],[787, 315, 861, 361],[909, 201, 999, 406].') 2023-11-03 21:20:47.864 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:20:47.865 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Carpet:[032, 638, 274, 690],[001, 670, 999, 999];Person:[362, 425, 415, 578],[327, 416, 364, 575],[466, 424, 502, 551],[521, 413, 574, 502],[593, 419, 646, 507],[671, 418, 695, 538],[534, 368, 569, 462],[762, 426, 791, 484],[839, 426, 863, 473],[889, 424, 919, 473],[848, 419, 907, 593],[944, 411, 999, 556];Cup:[456, 586, 478, 637].') 2023-11-03 21:20:48.669 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:20:48.669 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[276, 037, 634, 681] [712, 379, 996, 993] [243, 184, 859, 997] Huaorani natives and Yasunidos ecologist group activists march in Quito on April 12, 2014 toward the National Electoral Council to leave the...') 2023-11-03 21:21:01.504 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:21:01.505 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, what is this objectframe:1:[002, 505, 035, 660];frame:2:[002, 505, 035, 660] doing?Please briefly answer the question based on the provided image frames and trajectory coordinates.', 'This is a/an guy standing at the side without moving') 2023-11-03 21:21:14.110 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:21:14.110 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[068, 242, 790, 831] mosaics in a suitcase') 2023-11-03 21:21:19.405 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:21:19.406 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the fishes swimming and moving in this video clip?Ensure the trajectories in your answer follow the TrackiFrame t:[xmin,ymin,xmax,ymax] structure.', 'There are 10 fishes swimming and moving.Track1frame:1:[335, 235, 443, 335];frame:2:[357, 247, 449, 384];frame:3:[369, 274, 468, 434],Track2frame:1:[407, 123, 529, 193];frame:2:[444, 118, 562, 218];frame:3:[482, 125, 582, 243],Track3frame:1:[220, 697, 302, 793];frame:2:[223, 618, 279, 769];frame:3:[197, 509, 240, 702],Track4frame:1:[190, 103, 250, 231];frame:2:[198, 119, 272, 241];frame:3:[231, 106, 299, 335],Track5frame:1:[161, 490, 273, 564];frame:2:[157, 351, 248, 557];frame:3:[175, 295, 242, 485],Track6frame:1:[091, 573, 179, 791];frame:2:[113, 436, 151, 715];frame:3:[117, 347, 178, 610],Track7frame:1:[466, 001, 510, 255];frame:2:[455, 001, 507, 268];frame:3:[456, 001, 524, 252],Track8frame:1:[730, 031, 825, 178];frame:2:[720, 068, 842, 188];frame:3:[710, 046, 820, 214],Track9frame:1:[797, 431, 853, 611];frame:2:[828, 574, 882, 775];frame:3:[816, 529, 879, 702],Track10frame:1:[738, 524, 768, 785];frame:2:[735, 533, 765, 804];frame:3:[728, 537, 761, 809].') 2023-11-03 21:21:38.479 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:21:38.479 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Lamp:[552, 183, 909, 627];Traffic Light:[716, 276, 814, 411],[461, 365, 587, 511],[293, 408, 443, 628],[079, 513, 228, 724];Frame:[878, 224, 999, 514];Glasses:[758, 481, 857, 710];Sink:[001, 245, 721, 1000];Moniter:[654, 801, 882, 999].') 2023-11-03 21:21:53.745 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:21:53.745 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Speaker:[269, 095, 379, 186];Basketball:[310, 189, 365, 278];Chair:[338, 669, 373, 834],[490, 703, 528, 820];Storage box:[123, 115, 180, 188],[174, 144, 220, 186];Person:[928, 556, 1000, 803],[861, 560, 953, 809],[778, 580, 856, 817],[542, 378, 778, 999],[514, 422, 631, 1000],[230, 405, 535, 1000];Sneakers:[275, 867, 317, 892],[261, 835, 320, 912],[197, 823, 215, 880],[184, 827, 202, 898],[161, 814, 197, 907],[496, 874, 521, 948],[589, 883, 607, 914],[485, 955, 531, 999],[822, 787, 855, 812],[799, 792, 815, 815],[922, 777, 951, 808],[906, 780, 923, 810],[950, 768, 968, 803],[736, 778, 754, 815].') 2023-11-03 21:22:13.445 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:22:13.445 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Ninja Heroes 1 1 0 Mod Apk Max Gold 26 Silver 5d Video Dailymotion') 2023-11-03 21:22:26.460 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:22:26.461 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Woman [108, 120, 564, 998] with Down syndrome smiling at a dog [618, 161, 970, 994] that's sitting on the lap of a man sitting beside her.") 2023-11-03 21:22:54.984 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:22:54.984 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[490, 351, 557, 697],[759, 337, 906, 766].') 2023-11-03 21:23:30.641 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:23:30.641 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'jacket:[145, 258, 338, 635];person:[026, 288, 118, 555],[028, 169, 926, 999],[055, 325, 148, 627],[169, 222, 426, 921],[420, 205, 486, 648],[449, 186, 595, 858],[668, 154, 923, 999];hiking equipment:[055, 320, 157, 426],[063, 460, 073, 573],[135, 347, 240, 680],[373, 587, 449, 666],[465, 301, 583, 439],[553, 327, 879, 666],[801, 581, 905, 999],[891, 538, 916, 998];tree:[000, 000, 167, 434],[161, 000, 222, 331],[225, 000, 999, 484];footwear:[278, 845, 328, 923],[347, 782, 430, 837],[756, 907, 854, 994].') 2023-11-03 21:23:40.155 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:23:40.156 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'An illustration depicting people [087, 430, 263, 693] [565, 151, 720, 736] [229, 158, 382, 737] [695, 236, 879, 771] [411, 197, 571, 654] with sleeping bags [712, 394, 791, 504] [505, 510, 617, 646] [336, 444, 477, 678] on the sidewalks of Old Town [251, 813, 996, 997]') 2023-11-03 21:23:46.956 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:23:46.956 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'top tips for a perfect sparkler tunnel - [266, 214, 668, 998] bride and groom playing with [002, 505, 265, 875] sparklers') 2023-11-03 21:23:51.849 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:23:51.849 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, what is this objectframe:1:[750, 011, 909, 452];frame:2:[787, 051, 958, 451];frame:3:[796, 055, 951, 447] doing?Please briefly answer the question based on the provided image frames and trajectory coordinates.', 'This is a/an goat shifting around') 2023-11-03 21:24:40.798 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:24:40.799 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Hat,Person and Sneakers in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Hat:[598, 003, 711, 087];Person:[598, 003, 783, 1000],[531, 203, 591, 397],[178, 035, 412, 901];Sneakers:[209, 842, 293, 902],[284, 635, 344, 690].') 2023-11-03 21:24:56.340 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:24:56.341 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, "Halloween Melons [269, 010, 907, 820] - Mummy Melon [498, 112, 907, 808] and Jack-o-Melon Carving how-to's plus tips and recipes for the watermelon flesh. Allergy-friendly and plant-based!") 2023-11-03 21:25:07.206 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:25:07.206 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'house:[000, 004, 490, 434];plant:[040, 885, 095, 973],[058, 000, 999, 808],[154, 785, 235, 849],[250, 928, 419, 985],[266, 813, 411, 856],[408, 832, 500, 880],[412, 694, 493, 745],[437, 930, 505, 999],[621, 968, 775, 995],[659, 934, 792, 971],[663, 830, 999, 947].') 2023-11-03 21:25:26.769 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 21:25:26.770 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'cat:[101, 167, 413, 810],[305, 189, 999, 845].') 2023-11-03 21:25:30.368 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:25:30.368 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Hat,Horse and Traffic Light in this image.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Hat:[950, 288, 988, 408],[916, 344, 950, 454],[865, 318, 899, 424],[831, 339, 860, 443],[796, 271, 835, 387],[771, 287, 797, 420],[722, 128, 773, 363],[677, 268, 725, 388],[614, 186, 666, 359],[597, 207, 629, 338],[538, 181, 585, 349],[453, 176, 501, 306],[402, 181, 428, 337],[360, 132, 415, 310],[263, 104, 319, 286],[226, 084, 251, 255],[167, 003, 228, 215],[108, 064, 166, 244],[001, 003, 042, 126];Horse:[887, 572, 999, 999],[792, 470, 855, 997],[743, 464, 801, 997],[682, 480, 778, 999],[673, 437, 738, 995],[528, 529, 697, 998],[364, 449, 522, 997],[277, 362, 428, 997],[251, 424, 371, 998],[096, 625, 260, 996],[001, 638, 091, 997];Traffic Light:[290, 010, 340, 287].') 2023-11-03 21:25:53.092 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:25:53.093 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Satchel:[577, 563, 723, 741],[736, 574, 869, 707],[428, 668, 591, 839];Person:[825, 427, 953, 817],[713, 417, 873, 871],[579, 430, 723, 877],[413, 404, 604, 870],[305, 448, 446, 872],[196, 450, 328, 873],[034, 445, 214, 872],[002, 314, 126, 778],[149, 306, 269, 635],[239, 289, 369, 676],[373, 285, 481, 584],[469, 288, 589, 530],[576, 257, 697, 540],[706, 274, 815, 515],[799, 284, 924, 535].') 2023-11-03 21:26:06.819 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:26:06.819 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Stone path [005, 828, 996, 997] and steps [104, 652, 961, 837] leading to a colonial portico [007, 005, 988, 841] with fluted columns [775, 216, 907, 669] [098, 214, 238, 670] [241, 218, 376, 669] [628, 220, 763, 670] and royal blue double doors [384, 384, 603, 646]') 2023-11-03 21:26:11.627 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:26:11.628 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Street Lights:[939, 311, 993, 816],[101, 273, 153, 832];Awning:[775, 490, 835, 545].') 2023-11-03 21:26:14.458 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:26:14.458 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '1890s large die-cut of [214, 037, 763, 965] girl holding [263, 124, 420, 207] a caged bird away from [159, 462, 526, 976] a cat') 2023-11-03 21:26:31.647 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:26:31.648 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect human face,clothing and man in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'human face:[086, 154, 210, 385],[189, 032, 272, 196],[250, 381, 378, 567],[310, 622, 341, 688],[373, 216, 527, 478],[432, 054, 522, 175],[521, 000, 603, 131],[653, 211, 746, 400],[723, 000, 790, 120],[846, 155, 976, 382];clothing:[000, 166, 042, 369],[000, 480, 216, 995],[000, 306, 258, 581],[065, 000, 169, 090],[120, 506, 343, 999],[203, 171, 319, 324],[243, 000, 333, 140],[285, 073, 402, 275],[320, 140, 593, 428],[373, 315, 795, 636],[514, 090, 681, 315],[529, 270, 800, 445],[600, 000, 698, 108],[658, 058, 826, 201],[685, 310, 999, 711],[738, 642, 999, 999];man:[516, 000, 693, 308].') 2023-11-03 21:26:34.510 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:26:34.511 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Desk:[126, 513, 361, 775];Lamp:[271, 243, 307, 321];Chair:[162, 543, 237, 658],[133, 560, 168, 620];Person:[180, 633, 273, 939],[191, 377, 268, 639],[401, 401, 461, 677],[477, 365, 549, 546],[613, 362, 671, 673],[654, 389, 729, 693],[751, 451, 822, 669],[557, 362, 632, 675];Luggage:[393, 729, 465, 902];Backpack:[184, 806, 257, 941];Trash bin Can:[070, 632, 195, 996],[513, 537, 588, 740];TV:[126, 375, 200, 452].') 2023-11-03 21:27:02.231 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:27:02.232 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'west point from [003, 840, 995, 997] the hudson river, new york - west point military academy stock pictures, royalty-free photos & images') 2023-11-03 21:27:04.105 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:27:04.106 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A slice [093, 501, 962, 911] of no bake mint cheesecake with a biscuit base on a white plate. with a fork [187, 681, 990, 899]. Caption reads: Choc chip mint cheesecake') 2023-11-03 21:27:17.275 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:27:17.276 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'woman:[407, 077, 998, 992];fashion accessory:[542, 093, 846, 997],[567, 098, 823, 433],[612, 704, 850, 981],[757, 156, 823, 388],[782, 466, 846, 758];human face:[573, 222, 809, 718].') 2023-11-03 21:27:33.164 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 21:27:33.164 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Person:[384, 147, 999, 548],[567, 325, 629, 549].') 2023-11-03 21:28:19.125 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:28:19.126 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'beetle:[311, 435, 443, 810];plant:[000, 000, 999, 999].') 2023-11-03 21:29:53.135 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:29:53.135 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Man [136, 039, 994, 995] in car covering his mouth as he yawns') 2023-11-03 21:30:07.395 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:30:07.396 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[000, 326, 497, 999] and track its trajectory.Incorporate the Frame t:[xmin,ymin,xmax,ymax] format when presenting trajectories in your response.', 'This is a dragon-lion danceFrame1:[000, 326, 497, 999];Frame2:[148, 531, 514, 860];Frame3:[163, 440, 495, 873];Frame4:[140, 599, 461, 999].') 2023-11-03 21:30:12.048 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:30:12.049 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Chair:[835, 649, 949, 853],[445, 627, 532, 798],[583, 647, 688, 832],[686, 622, 768, 779],[272, 728, 395, 1000],[092, 839, 284, 1000],[885, 665, 954, 794];Person:[002, 539, 103, 685],[113, 523, 157, 633],[173, 458, 261, 620],[237, 464, 289, 608],[035, 594, 182, 757],[236, 603, 337, 872],[024, 601, 320, 1000],[396, 507, 483, 644],[418, 471, 460, 636],[484, 481, 525, 559],[467, 534, 593, 789],[575, 524, 624, 667],[578, 539, 669, 802],[653, 545, 779, 807],[625, 510, 657, 572],[647, 510, 699, 611],[684, 497, 722, 593],[906, 528, 999, 950],[880, 567, 993, 820],[961, 505, 1000, 560];Umbrella:[753, 382, 907, 520],[551, 404, 760, 530],[399, 381, 586, 530],[411, 256, 774, 583],[054, 335, 301, 582],[001, 066, 432, 774],[125, 451, 172, 577];Book:[886, 526, 987, 599];Desk:[001, 761, 140, 999],[559, 630, 680, 793].') 2023-11-03 21:30:26.467 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:30:26.468 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Wall View 002') 2023-11-03 21:30:28.254 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:30:28.254 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'bird:[500, 716, 701, 926],[000, 652, 079, 833],[000, 565, 145, 678],[049, 635, 229, 771],[128, 733, 382, 936],[133, 587, 251, 725],[242, 623, 449, 722],[268, 455, 365, 510],[271, 500, 372, 551],[288, 544, 395, 618],[301, 630, 540, 804],[407, 494, 568, 577],[454, 481, 560, 527],[478, 413, 577, 468],[489, 638, 617, 758],[553, 508, 693, 594],[577, 434, 680, 485],[587, 469, 736, 537],[614, 655, 866, 791],[622, 551, 802, 671],[632, 620, 866, 799],[636, 427, 705, 458],[682, 408, 766, 468],[742, 379, 801, 447],[784, 390, 826, 456],[835, 348, 998, 458];tree:[599, 025, 957, 327],[000, 126, 292, 575],[000, 020, 630, 354],[302, 212, 378, 279],[514, 238, 663, 272].') 2023-11-03 21:30:42.829 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:30:42.829 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Street Lights:[503, 388, 553, 999];Car:[662, 447, 942, 645],[847, 457, 961, 509],[896, 479, 979, 662],[759, 712, 977, 971],[659, 695, 978, 837],[000, 724, 423, 899],[256, 529, 284, 625];Bicycle:[781, 644, 850, 789],[218, 943, 365, 999];Truck:[275, 379, 728, 803];Backpack:[241, 859, 297, 935];Person:[242, 821, 329, 999],[749, 479, 776, 520],[604, 547, 637, 607].') 2023-11-03 21:30:51.138 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:30:51.138 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[661, 467, 695, 553],carFrame1:[601, 483, 639, 548],carFrame1:[000, 513, 139, 727],carFrame1:[030, 501, 144, 595],carFrame1:[200, 470, 256, 571],carFrame1:[670, 462, 883, 708],carFrame1:[394, 499, 434, 558] in this video clip.Your reply should be in alignment with the classFrame t:[xmin,ymin,xmax,ymax] structure.', 'carFrame1:[661, 467, 695, 553];Frame2:[661, 467, 695, 555],carFrame1:[601, 483, 639, 548];Frame2:[601, 483, 639, 548],carFrame1:[000, 513, 139, 727];Frame2:[000, 513, 139, 727],carFrame1:[030, 501, 144, 595];Frame2:[031, 501, 144, 595],carFrame1:[200, 470, 256, 571];Frame2:[200, 470, 256, 571],carFrame1:[670, 462, 883, 708];Frame2:[670, 462, 883, 708],carFrame1:[394, 499, 434, 558];Frame2:[394, 499, 434, 558].') 2023-11-03 21:30:55.954 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:30:55.954 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Lamp:[571, 001, 716, 191],[263, 375, 330, 495];Couch:[612, 488, 774, 769],[718, 549, 892, 882],[263, 494, 403, 821];Picture:[410, 262, 457, 343],[468, 271, 513, 350];Cabinet:[281, 401, 686, 708];Desk:[399, 536, 555, 734];Carpet:[231, 851, 992, 998].') 2023-11-03 21:31:44.387 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:31:44.387 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person,TV,shelf and Tissue in this image.The category:[xmin,ymin,xmax,ymax] format should be rigorously followed in your response.', 'Person:[232, 405, 393, 1000];TV:[884, 203, 961, 350],[636, 415, 784, 669];shelf:[057, 637, 1000, 1000],[690, 274, 917, 551],[585, 486, 635, 668];Tissue:[469, 652, 567, 783].') 2023-11-03 21:32:01.953 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:32:01.954 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'woman:[650, 157, 934, 938],[000, 086, 288, 660],[254, 065, 417, 393],[386, 040, 482, 310],[595, 000, 672, 167],[626, 007, 783, 393],[790, 079, 999, 704];man:[295, 110, 678, 969],[000, 004, 076, 337],[000, 192, 394, 999],[124, 000, 278, 466],[171, 000, 262, 242],[250, 004, 345, 221],[520, 000, 648, 254],[755, 000, 827, 136],[760, 000, 872, 278],[844, 000, 931, 165],[914, 000, 999, 109];clothing:[298, 247, 672, 962],[000, 123, 055, 332],[000, 448, 327, 999],[172, 158, 236, 322],[188, 058, 258, 224],[249, 100, 294, 239],[262, 191, 415, 358],[339, 000, 430, 193],[521, 049, 611, 218],[608, 044, 673, 158],[655, 365, 941, 999],[760, 000, 809, 125],[764, 023, 880, 274],[823, 242, 999, 681],[865, 025, 919, 113];chair:[000, 329, 057, 404],[240, 271, 389, 680],[280, 671, 768, 999],[376, 877, 666, 999],[599, 377, 795, 999],[611, 221, 650, 596],[807, 668, 987, 999];human face:[029, 000, 999, 555].') 2023-11-03 21:32:09.537 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:32:09.538 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track carFrame1:[431, 611, 529, 736],carFrame1:[651, 559, 783, 662],carFrame1:[674, 570, 967, 818] in this video clip.Ensure you use the exact format categoryFrame t:[xmin,ymin,xmax,ymax] in your response.', 'carFrame1:[431, 611, 529, 736];Frame2:[447, 602, 546, 728],carFrame1:[651, 559, 783, 662];Frame2:[710, 530, 896, 682],carFrame1:[674, 570, 967, 818];Frame2:[873, 653, 1000, 930].') 2023-11-03 21:32:15.681 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 21:32:15.682 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect car,vehicle registration plate,man and clothing in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'car:[068, 024, 162, 091],[478, 218, 997, 996];vehicle registration plate:[679, 873, 897, 975];man:[279, 000, 493, 996],[547, 045, 816, 535];clothing:[145, 000, 344, 595],[286, 116, 487, 996],[544, 095, 809, 521].') 2023-11-03 21:32:31.524 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:32:31.524 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[003, 003, 995, 991] Image of the liver') 2023-11-03 21:33:07.785 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 21:33:07.786 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Bicycle,Tricycle and SUV in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Bicycle:[794, 849, 861, 969],[909, 857, 971, 973];Tricycle:[265, 798, 341, 892];SUV:[417, 809, 530, 885].') 2023-11-03 21:33:09.428 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 21:33:09.428 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Necklace in this image.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Necklace:[914, 499, 970, 586].') 2023-11-03 21:33:12.557 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 4 samples! 2023-11-03 21:33:12.558 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Leopard & cheetah blouse') 2023-11-03 21:33:32.304 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:33:32.305 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect woman,human face,sunglasses and goggles in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'woman:[000, 058, 979, 999];human face:[483, 166, 747, 521];sunglasses:[460, 284, 822, 390];goggles:[481, 259, 809, 385].') 2023-11-03 21:33:36.221 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:33:36.222 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'tree:[551, 196, 816, 767],[013, 009, 141, 252],[013, 260, 186, 492],[192, 013, 335, 253],[335, 012, 464, 230],[428, 084, 499, 176],[533, 065, 595, 169],[611, 053, 656, 165];window:[738, 732, 903, 971];building:[010, 507, 333, 734],[014, 278, 330, 495],[023, 749, 283, 985],[067, 019, 295, 248],[359, 264, 658, 744],[653, 568, 993, 738].') 2023-11-03 21:33:52.077 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:33:52.077 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Three t-shirts [429, 170, 581, 489] [238, 201, 411, 539] [585, 272, 752, 583] and a tote bag [433, 504, 526, 786] are displayed against a blue backdrop [355, 025, 642, 905].') 2023-11-03 21:34:15.550 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:34:15.551 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'person:[663, 256, 776, 520],[000, 314, 036, 473],[006, 127, 020, 225],[034, 218, 043, 297],[041, 221, 051, 295],[046, 296, 128, 459],[346, 257, 474, 532],[428, 317, 520, 550],[435, 265, 509, 421],[853, 246, 999, 709],[968, 666, 999, 770];chair:[000, 348, 041, 456],[025, 328, 097, 446],[226, 327, 278, 423],[271, 328, 322, 420],[323, 318, 378, 418],[490, 318, 555, 423],[540, 327, 598, 437],[704, 358, 786, 476],[721, 368, 823, 542],[859, 457, 999, 636],[933, 561, 999, 667];human body:[000, 353, 036, 473],[046, 307, 128, 461],[350, 278, 446, 535],[438, 290, 509, 435],[634, 306, 760, 521],[854, 323, 999, 707];human leg:[000, 389, 036, 465],[055, 370, 071, 453],[095, 360, 124, 443],[360, 358, 430, 527],[438, 481, 473, 544],[464, 458, 520, 544],[636, 401, 680, 507],[671, 406, 752, 518],[850, 502, 949, 691],[865, 515, 999, 702];mammal:[000, 312, 036, 466],[005, 127, 028, 225],[037, 225, 058, 306],[045, 292, 128, 457],[120, 389, 154, 457],[347, 258, 470, 530],[416, 265, 511, 462],[525, 444, 588, 527],[632, 261, 761, 528],[766, 602, 853, 671],[848, 241, 999, 706],[853, 102, 877, 187];footwear:[866, 661, 931, 697];human arm:[078, 324, 115, 388],[415, 294, 440, 384],[430, 400, 444, 471],[648, 341, 696, 408],[661, 346, 728, 422],[912, 379, 999, 474].') 2023-11-03 21:34:45.625 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:34:45.626 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Person:[259, 546, 291, 659],[900, 510, 999, 819],[434, 385, 708, 999],[670, 295, 999, 998].') 2023-11-03 21:34:50.031 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:34:50.032 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Person:[001, 319, 121, 863],[099, 607, 194, 999],[098, 384, 246, 962],[188, 306, 218, 385],[231, 275, 386, 975],[367, 260, 487, 827];Necklace:[126, 691, 164, 731];Hat:[028, 326, 076, 375];Gloves:[007, 598, 028, 661];Leather Shoes:[018, 802, 060, 838],[070, 827, 100, 861];Backpack:[127, 463, 197, 620],[363, 324, 474, 552];Sandals:[170, 860, 214, 910],[104, 908, 174, 953],[395, 777, 443, 817];Slippers:[091, 967, 143, 1000],[335, 917, 383, 968];Sneakers:[446, 751, 479, 813];Satchel:[334, 483, 362, 572];Traffic cone:[425, 776, 486, 999];Bus:[230, 001, 999, 1000];Stop Sign:[030, 112, 063, 181];Bicycle:[073, 348, 120, 420],[134, 362, 179, 417];Car:[671, 319, 714, 388].') 2023-11-03 21:34:56.757 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:34:56.757 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the the motorcycle at the front of the boat.?For the trajectories included in the answer, please use the format TrackiFrame t:[xmin,ymin,xmax,ymax].', 'There is one the motorcycle at the front of the boat..Track1frame:1:[420, 297, 517, 458];frame:2:[438, 254, 536, 412];frame:3:[441, 226, 540, 381].') 2023-11-03 21:35:26.381 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:35:26.381 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[749, 310, 918, 681] [567, 305, 756, 692] [250, 305, 441, 668] [084, 326, 278, 671] The Usual Suspects (1995) is one of the best movies like Inside Man (2006') 2023-11-03 21:35:30.586 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:35:30.587 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'chair:[376, 658, 678, 977],[000, 370, 868, 907],[905, 396, 981, 457];clothing:[119, 553, 229, 799],[026, 567, 128, 711],[048, 508, 105, 572],[090, 512, 146, 588],[203, 508, 261, 658],[236, 461, 321, 600],[400, 496, 591, 676],[414, 643, 688, 865],[543, 434, 597, 536],[605, 470, 690, 578],[753, 458, 812, 550],[885, 345, 913, 441];person:[021, 341, 865, 792],[412, 577, 692, 919],[896, 332, 913, 452],[976, 429, 999, 514];desk:[933, 472, 999, 869];table:[196, 363, 999, 994];building:[000, 000, 999, 999].') 2023-11-03 21:35:38.275 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:35:38.275 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3,frame4 and frame5,can you tell me the trajectory of the a person wearing all black in this video clip?Use the specified TrackiFrame t:[xmin,ymin,xmax,ymax] format for all trajectories in your reply.', 'There is one a person wearing all black.Track1frame1:[530, 334, 1000, 879];frame2:[531, 332, 1000, 875].') 2023-11-03 21:35:51.049 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 21:35:51.049 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect SUV in this image.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'SUV:[645, 464, 828, 519],[846, 395, 1000, 538],[149, 456, 348, 518].') 2023-11-03 21:36:00.820 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:36:00.821 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect billiard table,human arm,man,human hair and human head in this image.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'human arm:[081, 726, 152, 921],[196, 670, 256, 816],[260, 368, 285, 458],[450, 740, 501, 999],[460, 369, 499, 478],[465, 671, 564, 933],[467, 361, 518, 549],[519, 805, 570, 999],[895, 597, 941, 745];man:[742, 468, 953, 987],[000, 327, 999, 999];human hair:[000, 359, 999, 999],[602, 544, 728, 730];human head:[603, 545, 738, 746],[000, 595, 046, 705],[000, 353, 999, 497],[006, 484, 057, 550],[029, 537, 075, 611],[099, 540, 180, 637],[155, 473, 196, 537],[206, 505, 245, 570],[231, 773, 378, 999],[243, 516, 316, 607],[275, 686, 379, 788],[295, 542, 343, 626],[361, 573, 450, 723],[367, 471, 436, 547],[426, 510, 486, 595],[432, 581, 493, 685],[533, 466, 587, 531],[540, 535, 583, 605],[568, 518, 635, 623],[625, 473, 725, 583],[717, 405, 766, 471],[760, 471, 844, 589],[817, 418, 850, 484],[919, 526, 987, 597].') 2023-11-03 21:36:05.246 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 3 samples! 2023-11-03 21:36:05.246 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'human head:[443, 295, 590, 629],[683, 442, 776, 697],[748, 079, 999, 708];clothing:[187, 498, 741, 985],[691, 678, 999, 997];human arm:[021, 345, 298, 951];woman:[000, 197, 763, 999];human hair:[504, 308, 701, 690],[756, 294, 898, 631].') 2023-11-03 21:36:13.878 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:36:13.879 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Heavy Truck:[891, 256, 999, 563];Chair:[313, 410, 466, 770];Bottle:[555, 026, 593, 165];Other Shoes:[710, 886, 778, 942],[593, 855, 637, 910];Boots:[449, 773, 486, 866],[403, 756, 441, 848],[259, 869, 313, 986];Person:[886, 465, 922, 583],[545, 140, 777, 938],[404, 166, 526, 863],[331, 302, 463, 804],[178, 093, 361, 983],[024, 338, 084, 549].') 2023-11-03 21:36:25.913 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:36:25.913 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, '[732, 286, 995, 994] [332, 421, 675, 998] Demonstrators display [959, 611, 997, 995] [281, 004, 407, 262] [407, 235, 545, 476] [092, 489, 336, 775] [627, 623, 755, 828] [338, 479, 475, 681] [787, 532, 996, 822] [027, 042, 173, 323] placards during a protest held to call for an end to racial injustice, Sunday, Aug. 9, 2020, in Boston, triggered by the death of George Floyd.') 2023-11-03 21:36:46.045 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:36:46.046 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[414, 177, 533, 617] and track its trajectory.Ensure the trajectories in your answer follow the Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a personFrame1:[414, 177, 533, 617];Frame2:[428, 116, 513, 632];Frame3:[417, 166, 563, 561];Frame4:[426, 253, 563, 733].') 2023-11-03 21:37:17.902 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 21:37:17.902 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'Cymbal:[541, 412, 623, 466],[700, 406, 786, 463];Drum:[582, 560, 686, 706],[665, 478, 723, 578];Piano:[042, 450, 206, 498];Person:[618, 416, 742, 531],[399, 118, 621, 916],[269, 361, 327, 429];Sneakers:[520, 812, 611, 900],[442, 838, 517, 915].') 2023-11-03 21:37:33.630 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:37:33.630 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Picture:[822, 014, 999, 273];Person:[071, 355, 150, 491],[261, 178, 442, 858],[500, 195, 673, 856],[764, 093, 916, 774],[734, 345, 938, 724];Sneakers:[324, 776, 362, 850],[368, 778, 405, 843],[559, 781, 595, 849],[606, 779, 643, 849],[798, 678, 851, 766],[831, 666, 868, 737].') 2023-11-03 21:37:44.225 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:37:44.225 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Maintain strict adherence to the format category:[xmin,ymin,xmax,ymax] when presenting your answer.', 'shelf:[032, 076, 319, 739];Frame:[699, 169, 830, 370],[852, 161, 998, 375];Potted Plant:[518, 169, 702, 513];Chair:[275, 443, 498, 920],[619, 452, 853, 893],[358, 404, 546, 785],[456, 394, 637, 696],[649, 394, 838, 661],[724, 434, 933, 833];Carpet:[045, 640, 1000, 1000];Plate:[134, 144, 175, 226],[062, 151, 098, 222],[199, 164, 232, 230],[066, 238, 101, 323],[192, 243, 232, 326];Cup:[070, 156, 092, 222],[529, 475, 553, 529],[484, 499, 520, 532].') 2023-11-03 21:37:49.273 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:37:49.273 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'The pyramid dome of the Louvre Museum in Paris lit up in the evening. There is a crowd [000, 738, 995, 998] standing outside waiting to get in the doors.') 2023-11-03 21:38:05.187 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:38:05.187 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'woman:[133, 182, 436, 1000];man:[515, 030, 856, 992],[056, 378, 103, 568],[376, 320, 423, 593],[416, 275, 515, 791],[470, 308, 525, 649];clothing:[123, 315, 416, 979],[060, 401, 101, 576],[373, 370, 410, 591],[416, 330, 503, 601],[425, 606, 488, 786],[476, 345, 516, 443],[523, 300, 851, 997].') 2023-11-03 21:38:51.348 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:38:51.348 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'person:[000, 346, 175, 936],[441, 279, 556, 751],[509, 298, 565, 689];drum:[166, 492, 281, 636],[221, 440, 265, 503];guitar:[011, 502, 261, 678],[507, 390, 579, 550].') 2023-11-03 21:39:10.310 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:39:10.310 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'shelf:[250, 212, 724, 716],[864, 253, 999, 640];Person:[285, 332, 352, 519],[342, 340, 420, 535],[417, 345, 473, 516],[538, 353, 592, 527],[583, 360, 654, 523],[649, 362, 700, 513],[927, 467, 983, 600],[978, 487, 1000, 593],[271, 624, 324, 706].') 2023-11-03 21:39:21.407 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:39:21.407 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,what is thisFrame1:[200, 061, 402, 554] and track its trajectory.Any trajectory details should be arranged according to the Frame t:[xmin,ymin,xmax,ymax] format for clarity.', 'This is a eyeFrame1:[200, 061, 402, 554];Frame2:[265, 079, 461, 573];Frame3:[324, 050, 528, 543];Frame4:[250, 015, 455, 526];Frame5:[208, 053, 431, 558].') 2023-11-03 21:39:28.377 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:39:28.377 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Person:[261, 072, 675, 954],[261, 072, 675, 954];Sneakers:[444, 880, 534, 951];Hat:[018, 187, 082, 221],[157, 153, 244, 194],[294, 145, 364, 189],[345, 153, 430, 190],[418, 160, 466, 189],[683, 065, 781, 110],[656, 074, 695, 124],[965, 105, 1000, 154];Glasses:[683, 108, 756, 144];Watch:[592, 227, 623, 254].') 2023-11-03 21:39:29.896 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:39:29.896 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'person:[898, 177, 981, 520],[000, 460, 162, 939],[160, 418, 282, 710],[198, 633, 504, 997],[409, 266, 460, 339],[446, 260, 496, 385],[475, 260, 521, 360],[498, 260, 550, 370],[531, 272, 576, 408],[565, 264, 610, 425],[612, 250, 664, 464],[667, 235, 726, 470],[695, 260, 751, 456],[739, 237, 795, 425],[796, 235, 862, 383],[826, 225, 895, 414],[867, 222, 901, 325],[953, 227, 992, 327],[960, 237, 998, 550];clothing:[003, 639, 129, 981],[175, 491, 264, 775],[182, 812, 475, 997],[235, 312, 423, 358],[451, 306, 490, 385],[515, 293, 543, 377],[535, 308, 573, 408],[565, 312, 603, 420],[596, 285, 620, 387],[614, 285, 665, 454],[653, 287, 678, 450],[673, 285, 709, 462],[693, 300, 742, 454],[743, 289, 796, 456],[792, 272, 815, 462],[801, 268, 856, 341],[840, 283, 900, 356],[890, 245, 917, 354],[901, 258, 953, 493],[964, 264, 990, 495],[964, 270, 998, 522];car:[228, 339, 665, 668].') 2023-11-03 21:40:06.478 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:40:06.478 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Lamp:[669, 176, 923, 260],[059, 143, 285, 229];Potted Plant:[291, 002, 687, 1000];Person:[585, 562, 623, 726],[543, 546, 582, 680],[558, 654, 592, 735],[390, 635, 431, 710],[087, 638, 163, 993],[007, 719, 088, 913];Satchel:[009, 795, 085, 913];Chair:[613, 907, 691, 989],[940, 888, 999, 1000],[804, 834, 852, 997],[945, 757, 989, 879],[885, 772, 934, 888],[781, 773, 826, 996],[794, 759, 834, 973],[742, 728, 795, 905],[701, 735, 740, 907],[660, 720, 704, 887],[593, 741, 657, 937],[263, 726, 317, 921],[309, 885, 392, 996],[513, 874, 550, 999],[390, 829, 462, 902],[325, 712, 364, 886];Desk:[851, 903, 1000, 998],[799, 798, 930, 891],[677, 745, 775, 904],[584, 753, 657, 935],[308, 899, 511, 997],[237, 736, 351, 815].') 2023-11-03 21:40:09.405 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:40:09.406 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[229, 468, 300, 738],personFrame1:[492, 259, 536, 490],personFrame1:[155, 468, 257, 768],personFrame1:[454, 458, 535, 734],personFrame1:[624, 526, 725, 830],personFrame1:[506, 301, 567, 551],personFrame1:[701, 197, 750, 427] in this video clip.Please strictly follow the format categoryFrame t:[xmin,ymin,xmax,ymax] to answer the question.', 'personFrame1:[229, 468, 300, 738];Frame2:[229, 462, 309, 727],personFrame1:[492, 259, 536, 490];Frame2:[496, 269, 557, 505],personFrame1:[155, 468, 257, 768];Frame2:[148, 472, 252, 772],personFrame1:[454, 458, 535, 734];Frame2:[454, 444, 531, 708],personFrame1:[624, 526, 725, 830];Frame2:[660, 487, 735, 816],personFrame1:[506, 301, 567, 551];Frame2:[492, 302, 557, 568],personFrame1:[701, 197, 750, 427];Frame2:[683, 193, 725, 420].') 2023-11-03 21:40:16.906 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:40:16.907 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Lamp:[272, 171, 340, 257],[561, 215, 615, 282],[790, 001, 986, 349];shelf:[701, 250, 792, 466],[644, 275, 702, 468],[461, 270, 567, 466],[370, 299, 453, 466],[246, 290, 370, 367],[148, 283, 248, 463],[001, 271, 149, 359],[183, 586, 276, 805],[392, 572, 490, 605];Carpet:[001, 815, 239, 918],[376, 892, 607, 999];Chair:[517, 673, 705, 1000],[506, 609, 667, 967],[718, 596, 827, 724];Stool:[468, 734, 526, 977],[627, 697, 703, 736],[683, 679, 780, 732];Desk:[495, 679, 1000, 1000];Candle:[841, 594, 877, 639],[871, 540, 897, 579];Tea pot:[251, 526, 294, 578];Coffee Machine:[188, 502, 228, 581];Microwave:[249, 366, 369, 459];Oven:[275, 595, 390, 731];Refrigerator:[002, 369, 184, 849];Pot:[484, 515, 524, 558];Gas stove:[254, 555, 377, 588].') 2023-11-03 21:40:18.097 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:40:18.097 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.For your response, please adhere to the specified category:[xmin,ymin,xmax,ymax] format.', 'Car:[005, 291, 944, 734];SUV:[001, 247, 233, 404];Pickup Truck:[199, 284, 287, 310].') 2023-11-03 21:40:24.500 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:40:24.500 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[105, 316, 531, 533] and track its trajectory.All trajectories in your reply should conform to the Frame t:[xmin,ymin,xmax,ymax] pattern.', 'This is a stealth bomberFrame1:[105, 316, 531, 533];Frame2:[345, 473, 690, 676];Frame3:[431, 533, 724, 704];Frame4:[386, 545, 634, 708];Frame5:[382, 541, 607, 691].') 2023-11-03 21:40:30.550 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:40:30.550 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'man:[221, 465, 396, 958],[034, 355, 136, 721],[112, 455, 265, 776],[234, 390, 303, 599],[318, 426, 421, 632],[328, 289, 406, 500],[389, 317, 420, 496],[408, 403, 467, 576],[408, 478, 590, 999],[431, 395, 476, 474],[466, 399, 522, 505],[494, 386, 527, 450],[509, 380, 595, 778],[510, 368, 558, 482],[572, 393, 633, 511],[621, 363, 672, 503],[649, 359, 723, 507],[688, 397, 871, 945],[868, 361, 999, 868],[881, 272, 955, 559];clothing:[228, 528, 394, 933],[034, 392, 145, 703],[110, 528, 250, 772],[241, 424, 306, 586],[330, 500, 453, 632],[339, 320, 399, 495],[383, 351, 418, 501],[401, 612, 578, 999],[456, 451, 522, 503],[510, 453, 598, 650],[581, 432, 636, 553],[625, 397, 669, 498],[660, 412, 725, 500],[681, 489, 874, 879],[742, 407, 828, 498],[876, 462, 999, 786],[882, 297, 958, 509];woman:[583, 393, 638, 532].') 2023-11-03 21:41:05.247 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:41:05.247 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Frame:[515, 002, 652, 082],[543, 353, 647, 551],[449, 376, 516, 445],[380, 353, 434, 444],[286, 393, 353, 459],[095, 286, 148, 377],[345, 272, 413, 341],[360, 157, 412, 253],[317, 055, 370, 149],[251, 080, 306, 171],[190, 118, 242, 213],[263, 267, 323, 379],[165, 221, 245, 328],[093, 462, 266, 579];Person:[641, 227, 864, 999].') 2023-11-03 21:41:14.917 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:41:14.919 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'Cell Phone:[616, 705, 665, 736];Cup:[330, 470, 376, 520];Wine Glass:[479, 426, 533, 611],[446, 417, 494, 585];Truck:[526, 001, 999, 1000];Tie:[596, 475, 645, 868];Belt:[583, 836, 658, 879];Other Shoes:[479, 970, 511, 999],[545, 972, 563, 999];Watch:[691, 771, 706, 796].') 2023-11-03 21:41:43.292 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:41:43.292 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Can [000, 276, 997, 998] a notary refuse to notarize a document?') 2023-11-03 21:41:57.756 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:41:57.757 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given frame1: and frame2:,track personFrame1:[342, 311, 418, 525],personFrame1:[600, 430, 681, 629],personFrame1:[667, 433, 747, 611],personFrame1:[759, 306, 804, 475],personFrame1:[289, 501, 352, 670],personFrame1:[367, 283, 407, 493],personFrame1:[358, 494, 416, 759] in this video clip.The prescribed format for your answer is classFrame t:[xmin,ymin,xmax,ymax]. Please follow it closely.', 'personFrame1:[342, 311, 418, 525];Frame2:[349, 294, 425, 509],personFrame1:[600, 430, 681, 629];Frame2:[603, 422, 691, 620],personFrame1:[667, 433, 747, 611];Frame2:[703, 394, 796, 588],personFrame1:[759, 306, 804, 475];Frame2:[768, 280, 809, 468],personFrame1:[289, 501, 352, 670];Frame2:[265, 513, 335, 731],personFrame1:[367, 283, 407, 493];Frame2:[367, 276, 429, 484],personFrame1:[358, 494, 416, 759];Frame2:[362, 484, 411, 748].') 2023-11-03 21:42:12.214 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:42:12.214 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3,frame4 and frame5,can you point out the trajectory of the woman holding a brown bag and gray hoodie?Ensure the trajectories in your answer follow the TrackiFrame t:[xmin,ymin,xmax,ymax] structure.', 'There is one woman holding a brown bag and gray hoodie.Track1frame1:[750, 393, 1000, 1000];frame2:[732, 398, 1000, 1000].') 2023-11-03 21:43:00.056 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:43:00.056 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Tie:[863, 589, 902, 682];Chair:[787, 762, 999, 998],[688, 692, 851, 977];Stool:[652, 476, 691, 576];Person:[086, 314, 150, 463],[118, 365, 200, 601],[143, 296, 218, 423],[203, 308, 251, 412],[206, 360, 263, 469],[180, 403, 276, 674],[245, 426, 309, 660],[254, 317, 298, 432],[296, 300, 363, 427],[278, 376, 330, 469],[300, 396, 363, 643],[339, 371, 406, 477],[350, 325, 391, 407],[353, 430, 428, 639],[399, 370, 439, 464],[390, 321, 442, 406],[415, 427, 488, 631],[438, 349, 489, 487],[455, 318, 502, 413],[491, 358, 542, 423],[474, 427, 552, 629],[525, 399, 598, 621],[714, 367, 758, 475],[785, 368, 834, 538],[879, 359, 939, 615],[821, 455, 965, 740],[744, 533, 851, 752],[850, 616, 1000, 856].') 2023-11-03 21:43:01.136 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:43:01.136 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Cabinet:[839, 163, 993, 852],[947, 511, 999, 960];Lamp:[520, 376, 634, 578];Nightstand:[524, 559, 658, 711],[001, 580, 058, 840];Bed:[001, 061, 947, 998];Pillow:[289, 547, 387, 703],[353, 557, 555, 661],[084, 542, 304, 644].') 2023-11-03 21:43:19.597 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:43:19.597 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Flag:[538, 631, 640, 739],[627, 640, 731, 736],[708, 643, 778, 745],[779, 647, 847, 748];Street Lights:[089, 262, 297, 1000],[729, 475, 883, 912];Car:[395, 861, 796, 998].') 2023-11-03 21:43:25.005 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:43:25.006 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,can you tell me what is thisFrame1:[203, 445, 742, 711] and track its trajectory.Stick to the Frame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'This is a destroyer escortFrame1:[203, 445, 742, 711];Frame2:[188, 442, 727, 708];Frame3:[182, 442, 721, 708];Frame4:[167, 444, 706, 710].') 2023-11-03 21:43:36.779 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:43:36.779 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.The format category:[xmin,ymin,xmax,ymax] should be strictly observed in your answer.', 'Tie:[237, 341, 323, 514];Person:[649, 334, 999, 1000],[346, 211, 810, 999],[003, 139, 412, 999];Flower:[610, 589, 876, 777].') 2023-11-03 21:44:03.681 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:44:03.681 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'person:[270, 578, 347, 871],[012, 488, 052, 564],[106, 488, 135, 564],[208, 493, 224, 570],[224, 496, 260, 577],[253, 488, 283, 569];tree:[000, 000, 819, 866],[795, 489, 999, 756],[819, 007, 999, 467];car:[000, 746, 046, 905],[000, 587, 090, 679],[102, 646, 292, 834],[525, 622, 633, 715],[563, 582, 653, 639],[576, 505, 645, 570],[677, 498, 749, 569],[740, 456, 828, 535].') 2023-11-03 21:44:10.871 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:44:10.871 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, what is this objectframe:1:[080, 322, 590, 840];frame:2:[080, 322, 590, 840];frame:3:[092, 341, 454, 810] doing?Tailor your answer briefly, using the image frames and trajectory coordinates given.', 'This is a/an yellow bird moving forward') 2023-11-03 21:44:30.344 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 21:44:30.345 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Boat in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Boat:[590, 650, 665, 685],[800, 648, 869, 681],[715, 613, 801, 677].') 2023-11-03 21:44:50.834 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:44:50.835 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3, can you tell me what thisframe:1:[447, 401, 744, 600];frame:2:[411, 482, 735, 693];frame:3:[388, 373, 733, 601] is?Ensure your answer is to the point, taking into account the provided image frames and trajectory coordinates.', 'This is a/an the trailing plane of the two in motion.') 2023-11-03 21:44:59.960 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:44:59.960 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'Person:[347, 038, 895, 999];Guitar:[382, 132, 983, 698].') 2023-11-03 21:45:03.953 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:45:03.954 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[260, 411, 648, 991] and track its trajectory.Stick to the Frame t:[xmin,ymin,xmax,ymax] format when including trajectories in your answer.', 'This is a gecko staying in a transparent ballFrame1:[260, 411, 648, 991];Frame2:[280, 352, 674, 951];Frame3:[282, 336, 678, 938];Frame4:[289, 334, 678, 930].') 2023-11-03 21:45:21.936 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:45:21.937 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Bicycle:[722, 418, 975, 755],[218, 534, 560, 839];Backpack:[396, 358, 493, 506];Sneakers:[421, 729, 467, 787];Handbag:[924, 373, 966, 437];Person:[307, 347, 477, 787],[817, 258, 942, 708].') 2023-11-03 21:45:31.140 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:45:31.141 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect human face,poster,clothing and person in this image.Maintain strict adherence to the format category:[x0,y0,x1,y1] when presenting your answer.', 'human face:[087, 626, 146, 730],[088, 365, 142, 466],[093, 119, 147, 212],[540, 186, 578, 255],[702, 178, 749, 256],[865, 208, 915, 296];poster:[032, 003, 966, 997];clothing:[059, 448, 157, 526],[060, 188, 161, 264],[063, 712, 156, 786],[191, 193, 963, 871],[510, 233, 606, 297],[681, 260, 776, 309],[842, 256, 954, 370];person:[912, 612, 963, 812],[058, 342, 155, 521],[058, 079, 157, 260],[060, 602, 157, 779],[191, 120, 309, 882],[298, 269, 433, 863],[364, 142, 421, 267],[388, 364, 487, 858],[445, 401, 552, 850],[511, 162, 604, 297],[526, 436, 620, 841],[572, 461, 662, 841],[637, 479, 720, 829],[676, 152, 777, 328],[689, 521, 758, 820],[753, 536, 808, 813],[809, 583, 859, 815],[839, 131, 950, 360],[856, 633, 898, 811].') 2023-11-03 21:45:44.021 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 21:45:44.021 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When composing your answer, be sure to consistently utilize the category:[xmin,ymin,xmax,ymax] structure.', 'Gloves:[186, 785, 389, 999];Helmet:[456, 120, 734, 544];Person:[190, 117, 937, 999].') 2023-11-03 21:45:59.465 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:45:59.465 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect bicycle wheel,bicycle and tire in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'bicycle wheel:[060, 514, 562, 974],[000, 546, 079, 992],[460, 513, 999, 975];bicycle:[000, 231, 094, 999],[026, 322, 213, 997],[026, 242, 335, 961],[270, 271, 432, 985],[340, 288, 538, 966],[465, 310, 665, 961],[548, 296, 767, 927],[681, 319, 868, 951],[769, 310, 955, 944],[784, 295, 999, 928];tire:[000, 551, 083, 993],[000, 549, 202, 998],[068, 530, 266, 990],[121, 528, 385, 979],[223, 527, 999, 991].') 2023-11-03 21:46:13.177 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:46:13.177 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your output should conform exactly to the category:[xmin,ymin,xmax,ymax] format.', 'person:[061, 239, 350, 736],[253, 201, 393, 611];plant:[000, 001, 999, 999];sheep:[323, 139, 999, 718].') 2023-11-03 21:47:36.314 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:47:36.315 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Boat in this image.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Boat:[654, 709, 860, 739].') 2023-11-03 21:47:54.527 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:47:54.528 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'Visit the lush grounds [001, 004, 995, 998] of Royal Hawaiian Hotel - one of the best things to do in Waikiki away from the crowds [318, 663, 485, 926]') 2023-11-03 21:47:56.335 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:47:56.335 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please ensure that your response strictly adheres to the category:[xmin,ymin,xmax,ymax] format.', 'Car:[000, 293, 037, 328],[000, 310, 232, 414],[339, 297, 450, 363],[441, 304, 502, 336];Street Lights:[108, 203, 127, 311],[285, 172, 302, 294];Bicycle:[191, 315, 205, 331],[645, 319, 684, 376];Bus:[494, 256, 594, 332],[000, 269, 078, 311],[954, 279, 1000, 315];Truck:[556, 261, 662, 333];Tricycle:[907, 309, 929, 348];Tent:[192, 440, 827, 1000];Person:[650, 288, 682, 362],[764, 289, 789, 331],[917, 281, 959, 410],[965, 290, 999, 414],[957, 294, 981, 401],[958, 289, 969, 323],[910, 292, 924, 332],[591, 289, 598, 301],[525, 285, 530, 297],[422, 291, 452, 388],[291, 290, 332, 405],[259, 299, 274, 334],[115, 324, 135, 342];Lamp:[876, 120, 885, 147],[614, 096, 625, 133],[642, 165, 648, 179],[370, 142, 379, 169],[809, 183, 814, 201],[968, 198, 974, 213];Satchel:[948, 352, 960, 391],[760, 316, 767, 329];Sandals:[979, 406, 988, 413],[971, 401, 981, 411],[964, 395, 974, 401],[962, 387, 970, 398],[940, 402, 947, 409],[917, 401, 929, 411],[314, 376, 328, 386],[297, 391, 308, 405];Hat:[963, 289, 969, 295],[935, 281, 947, 290];Helmet:[915, 292, 921, 298];Other Shoes:[658, 353, 666, 362];Sneakers:[428, 359, 437, 374],[428, 381, 438, 388];Bakset:[446, 334, 460, 351],[329, 338, 347, 359],[218, 366, 251, 413],[084, 385, 130, 433].') 2023-11-03 21:47:58.416 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:47:58.416 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.When submitting your answer, maintain the category:[xmin,ymin,xmax,ymax] structure consistently.', 'dress:[258, 335, 450, 956];suit:[028, 203, 274, 992],[786, 359, 999, 998];woman:[670, 209, 841, 1000],[253, 246, 485, 998];man:[798, 121, 984, 1000],[016, 076, 276, 998];human body:[013, 289, 265, 998],[273, 407, 489, 998],[631, 381, 841, 998],[788, 365, 999, 998];human hair:[051, 071, 199, 327],[313, 247, 408, 418],[721, 182, 830, 414],[888, 205, 966, 354];fashion accessory:[166, 356, 223, 643],[431, 623, 495, 836],[808, 454, 898, 928];human face:[160, 160, 218, 339],[366, 284, 412, 422],[718, 240, 763, 378],[826, 227, 909, 436];human arm:[030, 372, 280, 822],[288, 455, 430, 731],[436, 495, 488, 651],[623, 419, 715, 825],[641, 410, 843, 720],[795, 483, 999, 907].') 2023-11-03 21:49:00.092 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:49:00.092 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Potted Plant,Person and Chair in this image.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Potted Plant:[688, 295, 768, 482],[423, 294, 513, 462],[193, 301, 271, 462],[001, 320, 042, 496];Person:[251, 079, 413, 972],[001, 743, 031, 879],[001, 674, 098, 820],[114, 749, 185, 881],[116, 643, 176, 744],[160, 677, 216, 798],[193, 732, 265, 868],[240, 729, 322, 876],[296, 769, 360, 906],[288, 641, 362, 754],[204, 677, 272, 799],[186, 638, 227, 716],[250, 643, 278, 729],[161, 516, 229, 646],[038, 550, 152, 627],[068, 533, 142, 611],[109, 342, 154, 485],[187, 453, 232, 495],[240, 421, 272, 497],[404, 585, 461, 686],[433, 622, 466, 688],[444, 748, 530, 828],[585, 764, 660, 851],[513, 592, 570, 688],[565, 594, 610, 689],[602, 587, 659, 691],[625, 315, 654, 487],[657, 344, 684, 481],[727, 357, 764, 484],[862, 338, 894, 490],[970, 337, 1000, 459],[961, 424, 993, 495],[960, 596, 999, 701],[747, 604, 793, 692],[854, 689, 948, 845];Chair:[010, 553, 200, 663],[007, 569, 216, 665],[001, 620, 036, 669],[796, 415, 847, 493],[891, 433, 938, 493],[614, 425, 658, 483],[110, 426, 175, 486],[022, 429, 092, 497].') 2023-11-03 21:49:24.214 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:49:24.214 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("\nDetect all.It's essential that your answer aligns with the category:[x0,y0,x1,y1] format.", 'helicopter:[171, 380, 335, 567].') 2023-11-03 21:49:41.562 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:49:41.563 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given a video clip including frame1,frame2,frame3 and frame4,please tell me what is thisFrame1:[175, 306, 288, 439] and track its trajectory.If you're including trajectory details in your reply, the Frame t:[xmin,ymin,xmax,ymax] format is imperative.", 'This is a cotswoldFrame1:[175, 306, 288, 439];Frame2:[231, 309, 338, 433];Frame3:[205, 376, 314, 495];Frame4:[105, 453, 220, 576].') 2023-11-03 21:50:05.118 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 2 samples! 2023-11-03 21:50:05.118 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'Glasses:[098, 185, 246, 271],[307, 225, 388, 275],[701, 127, 777, 169],[779, 351, 865, 393];Cabinet:[862, 235, 999, 466];Storage box:[815, 168, 957, 315];Desk:[239, 776, 901, 999];Person:[688, 269, 1000, 999],[667, 054, 902, 632],[418, 155, 602, 552];Tape:[707, 816, 780, 860].') 2023-11-03 21:50:19.748 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:50:19.749 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video cluo including frame1,frame2,frame3,frame4 and frame5,can you tell me what is thisFrame1:[349, 198, 854, 456] and track its trajectory.Ensure the trajectories in your answer follow the Frame t:[xmin,ymin,xmax,ymax] structure.', 'This is a pt boatFrame1:[349, 198, 854, 456];Frame2:[334, 195, 850, 466];Frame3:[325, 133, 864, 418];Frame4:[338, 144, 907, 448];Frame5:[345, 155, 932, 465].') 2023-11-03 21:50:24.638 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:50:24.638 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ("Given frame1: and frame2:,track personFrame1:[304, 303, 371, 523],personFrame1:[223, 451, 291, 617],personFrame1:[516, 472, 594, 613],personFrame1:[851, 433, 932, 609],personFrame1:[166, 316, 234, 522],personFrame1:[696, 462, 755, 619],personFrame1:[375, 460, 445, 625] in this video clip.It's imperative that you stick to the categoryFrame t:[xmin,ymin,xmax,ymax] format for your answer.", 'personFrame1:[304, 303, 371, 523];Frame2:[300, 303, 379, 522],personFrame1:[223, 451, 291, 617];Frame2:[230, 424, 290, 618],personFrame1:[516, 472, 594, 613];Frame2:[520, 437, 593, 614],personFrame1:[851, 433, 932, 609];Frame2:[832, 444, 950, 608],personFrame1:[166, 316, 234, 522];Frame2:[166, 307, 235, 522],personFrame1:[696, 462, 755, 619];Frame2:[694, 457, 755, 619],personFrame1:[375, 460, 445, 625];Frame2:[378, 483, 450, 629].') 2023-11-03 21:50:34.554 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:50:34.554 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the lizard lying still, then crawling forward, then to the right in this video clip?Ensure the trajectories in your answer follow the TrackiFrame t:[xmin,ymin,xmax,ymax] structure.', 'There is one lizard lying still, then crawling forward, then to the right.Track1frame:1:[663, 388, 741, 451];frame:2:[551, 381, 738, 450];frame:3:[547, 374, 742, 449].') 2023-11-03 21:50:52.191 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:50:52.191 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,please tell me the trajectory of the the cat sit down at first, then fight and lay down, then move away.Ensure the trajectories in your answer follow the TrackiFrame t:[xmin,ymin,xmax,ymax] structure.', 'There is one the cat sit down at first, then fight and lay down, then move away.Track1frame:1:[001, 378, 509, 782];frame:2:[001, 380, 527, 826];frame:3:[001, 393, 487, 878].') 2023-11-03 21:51:04.159 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:51:04.159 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[074, 020, 541, 937];Helmet:[317, 016, 446, 141];Gloves:[063, 315, 166, 473],[434, 436, 538, 564];Sneakers:[219, 722, 281, 876],[093, 822, 183, 937];Hockey Stick:[163, 419, 978, 625].') 2023-11-03 21:51:06.617 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:51:06.618 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,can you tell me the trajectory of the the white tiger on the left rear in this video clip?Maintain consistency in your answer by using the TrackiFrame t:[xmin,ymin,xmax,ymax] format for trajectories.', 'There is one the white tiger on the left rear.Track1frame:1:[409, 093, 546, 249];frame:2:[409, 093, 546, 249];frame:3:[484, 077, 599, 183].') 2023-11-03 21:53:06.618 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:53:06.619 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect all.To maintain clarity, use the prescribed category:[x0,y0,x1,y1] format for your answer.', 'Sneakers:[857, 811, 914, 871],[907, 825, 976, 885];High Heels:[792, 845, 826, 892];Person:[452, 202, 813, 1000].') 2023-11-03 21:53:38.700 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:53:38.700 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Ensure your response adheres strictly to the format category:[xmin,ymin,xmax,ymax]', 'woman:[298, 375, 405, 867],[091, 351, 143, 539],[458, 372, 542, 926],[586, 381, 697, 908],[912, 379, 950, 515];man:[768, 273, 908, 955],[000, 308, 165, 930],[090, 345, 225, 980],[188, 358, 293, 947],[225, 362, 248, 432],[294, 366, 340, 583],[300, 277, 483, 904],[391, 301, 628, 881],[591, 336, 795, 945],[882, 376, 914, 443],[951, 339, 999, 637];boy:[098, 344, 225, 979];tree:[852, 293, 921, 374],[911, 286, 968, 367];footwear:[120, 890, 196, 940],[128, 915, 179, 986],[199, 864, 240, 944],[377, 824, 422, 896],[480, 816, 503, 884],[503, 822, 545, 901],[563, 809, 591, 882],[666, 855, 722, 923],[722, 870, 754, 934],[777, 909, 843, 938].') 2023-11-03 21:54:01.662 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:54:01.662 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'human body:[347, 465, 440, 902],[316, 485, 374, 885],[349, 745, 540, 999],[477, 458, 579, 891],[606, 749, 794, 999],[666, 445, 727, 717],[720, 460, 769, 684],[766, 449, 823, 743],[820, 483, 861, 773],[847, 671, 998, 999],[850, 453, 905, 768],[889, 491, 946, 671],[946, 487, 999, 798];woman:[349, 471, 454, 881],[363, 737, 534, 999],[605, 731, 794, 999],[800, 478, 860, 778],[847, 664, 999, 999],[890, 476, 946, 689];human hair:[358, 768, 454, 912],[365, 468, 408, 534],[682, 735, 775, 883],[868, 460, 906, 526],[937, 664, 999, 837],[954, 489, 985, 567];human head:[229, 828, 384, 994],[312, 476, 358, 542],[363, 464, 413, 548],[363, 745, 461, 906],[533, 455, 564, 537],[681, 740, 773, 892],[913, 652, 999, 770];man:[288, 479, 398, 772],[477, 454, 584, 889],[665, 438, 737, 726],[720, 456, 777, 708],[765, 442, 828, 746],[827, 450, 875, 580];clothing:[336, 473, 465, 784],[316, 507, 373, 894],[362, 863, 539, 999],[477, 519, 582, 869],[597, 898, 797, 999],[638, 833, 675, 914],[667, 485, 999, 769],[767, 780, 879, 991];mammal:[240, 818, 385, 999],[308, 471, 372, 841],[347, 464, 476, 925],[362, 739, 536, 999],[476, 453, 581, 889],[593, 727, 801, 999],[609, 713, 693, 897],[667, 441, 741, 711],[672, 670, 880, 999],[722, 449, 777, 708],[763, 435, 834, 769],[818, 471, 865, 777],[832, 449, 876, 615],[840, 761, 999, 999],[875, 447, 907, 559],[887, 473, 949, 723],[911, 654, 999, 862],[931, 475, 948, 553],[936, 477, 996, 646];girl:[943, 494, 999, 639];building:[000, 000, 977, 908];human face:[376, 480, 413, 546].') 2023-11-03 21:54:15.342 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:54:15.342 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Consistently apply the category:[xmin,ymin,xmax,ymax] format to your answer.', 'flag:[198, 224, 764, 647].') 2023-11-03 21:54:19.748 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:54:19.748 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('Given a video clip including frame1,frame2 and frame3,what is the track of the woman walking down the hallway while using cell phone?Ensure the trajectories in your answer follow the TrackiFrame t:[xmin,ymin,xmax,ymax] structure.', 'There is one woman walking down the hallway while using cell phone.Track1frame1:[233, 364, 514, 1000];frame2:[213, 346, 477, 991];frame3:[203, 344, 457, 996].') 2023-11-03 21:54:59.318 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:54:59.318 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\ndetect Person,Cup and Bottle in this image.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'Person:[543, 230, 886, 904],[222, 082, 615, 954],[049, 045, 485, 999],[001, 184, 129, 1000];Cup:[862, 778, 937, 911];Bottle:[102, 699, 190, 998],[222, 705, 304, 1000],[467, 852, 639, 949],[744, 639, 813, 948],[794, 473, 882, 892],[957, 627, 999, 918].') 2023-11-03 21:55:11.506 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:55:11.507 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Your answer should be structured precisely according to the category:[xmin,ymin,xmax,ymax] format.', 'clothing:[724, 218, 924, 944],[013, 084, 073, 270],[090, 084, 138, 218],[232, 122, 282, 221],[282, 101, 338, 218],[327, 164, 390, 300],[404, 167, 451, 283],[417, 265, 497, 407],[489, 334, 543, 441],[544, 253, 590, 417],[608, 167, 737, 342],[891, 396, 999, 726];man:[000, 000, 592, 998],[610, 009, 738, 350],[610, 141, 744, 581],[616, 514, 706, 757],[723, 254, 999, 734],[735, 121, 877, 421],[969, 071, 999, 264];human face:[625, 565, 668, 710],[652, 195, 703, 355],[661, 027, 695, 140],[788, 331, 831, 469],[796, 178, 831, 267],[941, 318, 980, 441],[970, 111, 999, 250].') 2023-11-03 21:55:14.725 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:102 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:55:14.725 | INFO | mmgpt.data.dataset.pair_webdataset:token_processor:103 - (None, 'A group of people [466, 176, 623, 957] [123, 190, 313, 957] [783, 133, 933, 961] [261, 223, 424, 955] [002, 143, 158, 957] [636, 167, 796, 955] wearing tank tops [346, 198, 437, 473] [799, 241, 921, 502] [477, 296, 593, 558] [278, 323, 402, 565] [661, 288, 768, 541] [023, 253, 137, 515] with the Michigan State Spartan logo [684, 407, 751, 532], shorts [828, 447, 926, 549] [669, 488, 762, 585] [036, 437, 139, 540] and headbands [172, 159, 235, 218] [426, 218, 489, 311] [793, 152, 856, 248] [142, 221, 201, 316] [018, 159, 082, 248] [480, 188, 542, 268] [342, 140, 405, 204] [273, 257, 333, 332] lean forward, getting ready to run.') 2023-11-03 21:56:03.406 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:114 - exceeding max length 2048, ignore last 1 samples! 2023-11-03 21:56:03.407 | INFO | mmgpt.data.dataset.interpair_webdataset:token_processor:115 - ('\nDetect all.Please make sure your answer follows the category:[xmin,ymin,xmax,ymax] configuration precisely.', 'Bowl:[556, 623, 687, 741],[438, 587, 556, 652];Barrel:[356, 378, 380, 413],[111, 394, 175, 503];Frame:[020, 001, 098, 032],[242, 001, 341, 118],[325, 159, 363, 196],[353, 197, 389, 232];Cabinet:[465, 062, 657, 381],[306, 311, 392, 409];Fan:[812, 085, 865, 188];Stool:[004, 470, 085, 564],[234, 412, 292, 482],[563, 503, 638, 576];Storage box:[074, 479, 507, 615],[622, 473, 758, 560],[071, 430, 110, 493].')