hunterhector commited on
Commit
85faff5
·
verified ·
1 Parent(s): 64d16d0

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -71,7 +71,7 @@ We deploy K2-THINK on Cerebras Wafer-Scale Engine (WSE) systems, leveraging the
71
  | Platform | Throughput (tokens/sec) | Example: 32k-token response (time) |
72
  | --------------------------------- | ----------------------: | ---------------------------------: |
73
  | **Cerebras WSE (our deployment)** | **\~2,000** | **\~16 s** |
74
- | Typical **H100/H200** GPU setup | \~200 | \~160 s |
75
 
76
  ---
77
 
 
71
  | Platform | Throughput (tokens/sec) | Example: 32k-token response (time) |
72
  | --------------------------------- | ----------------------: | ---------------------------------: |
73
  | **Cerebras WSE (our deployment)** | **\~2,000** | **\~16 s** |
74
+ | Typical Cloud Service setup | \~200 | \~160 s |
75
 
76
  ---
77