runtime error

Exit code: 1. Reason:
Downloading shards: 0%| | 0/4 [00:00<?, ?it/s]
Downloading shards: 25%|██▌ | 1/4 [00:08<00:25, 8.64s/it]
Downloading shards: 50%|█████ | 2/4 [00:16<00:16, 8.42s/it]
Downloading shards: 75%|███████▌ | 3/4 [00:24<00:08, 8.22s/it]
Downloading shards: 100%|██████████| 4/4 [00:27<00:00, 6.08s/it]
Downloading shards: 100%|██████████| 4/4 [00:27<00:00, 6.93s/it]
Loading checkpoint shards: 0%| | 0/4 [00:00<?, ?it/s]
Loading checkpoint shards: 100%|██████████| 4/4 [00:00<00:00, 102927.71it/s]
Traceback (most recent call last):
  File "/home/user/app/app.py", line 17, in <module>
    model = AutoModelForCausalLM.from_pretrained(model_name, device_map="auto", torch_dtype=torch.float16)
  File "/usr/local/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py", line 564, in from_pretrained
    return model_class.from_pretrained(
  File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 4342, in from_pretrained
    dispatch_model(model, **device_map_kwargs)
  File "/usr/local/lib/python3.10/site-packages/accelerate/big_modeling.py", line 500, in dispatch_model
    raise ValueError(
ValueError: You are trying to offload the whole model to the disk. Please use the `disk_offload` function instead.
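
The ValueError above is raised when the device map resolved for `device_map="auto"` places every module on disk, which usually means no GPU was visible and the available CPU memory was judged too small for the checkpoint. A minimal sketch of one possible workaround follows, assuming the model fits in RAM once `device_map` is dropped. The helper names `pick_load_kwargs` and `load_model`, and the choice of `float32` on CPU, are illustrative assumptions and are not taken from `app.py`.

```python
def pick_load_kwargs(cuda_available: bool) -> dict:
    """Choose loading options based on whether a CUDA GPU is visible.

    Hypothetical helper: the keys mirror from_pretrained arguments,
    except "dtype_name", which load_model() resolves to a torch dtype.
    """
    if cuda_available:
        # With a GPU present, let accelerate place the layers automatically.
        return {"device_map": "auto", "dtype_name": "float16"}
    # Without a GPU, skip accelerate's dispatch and load plainly on CPU.
    # Assumption: float32 is used here because half precision on CPU
    # can be slow or unsupported for some operations.
    return {"device_map": None, "dtype_name": "float32"}


def load_model(model_name: str):
    """Load a causal LM with options chosen by pick_load_kwargs()."""
    # Imported lazily so the helper above can be used without torch installed.
    import torch
    from transformers import AutoModelForCausalLM

    kwargs = pick_load_kwargs(torch.cuda.is_available())
    dtype = getattr(torch, kwargs.pop("dtype_name"))
    return AutoModelForCausalLM.from_pretrained(
        model_name, torch_dtype=dtype, **kwargs
    )
```

If the checkpoint does not fit in CPU memory either, this sketch will not help; a smaller model or GPU-backed hardware would be needed instead.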
