Update README.md
Browse files
README.md
CHANGED
@@ -71,7 +71,7 @@ We deploy K2-THINK on Cerebras Wafer-Scale Engine (WSE) systems, leveraging the
|
|
71 |
| Platform | Throughput (tokens/sec) | Example: 32k-token response (time) |
|
72 |
| --------------------------------- | ----------------------: | ---------------------------------: |
|
73 |
| **Cerebras WSE (our deployment)** | **\~2,000** | **\~16 s** |
|
74 |
-
| Typical
|
75 |
|
76 |
---
|
77 |
|
|
|
71 |
| Platform | Throughput (tokens/sec) | Example: 32k-token response (time) |
|
72 |
| --------------------------------- | ----------------------: | ---------------------------------: |
|
73 |
| **Cerebras WSE (our deployment)** | **\~2,000** | **\~16 s** |
|
74 |
+
| Typical Cloud Service setup | \~200 | \~160 s |
|
75 |
|
76 |
---
|
77 |
|