dacorvo HF Staff commited on
Commit
6266253
·
verified ·
1 Parent(s): 9726c79

Update inference-cache-config/qwen-moe.json

Browse files
inference-cache-config/qwen-moe.json CHANGED
@@ -15,7 +15,7 @@
15
  {
16
  "batch_size": 8,
17
  "sequence_length": 4096,
18
- "num_cores": 8,
19
  "auto_cast_type": "bf16"
20
  }
21
  ]
 
15
  {
16
  "batch_size": 8,
17
  "sequence_length": 4096,
18
+ "num_cores": 16,
19
  "auto_cast_type": "bf16"
20
  }
21
  ]