Towards Thinking-Optimal Scaling of Test-Time Compute for LLM Reasoning
Paper
β’
2502.18080
β’
Published
β’
2
I'm glad you found it helpful!
Yes, this is planned. I was originally planning to write an article about training with the training operator, but now I'm wondering if I should skip that and focus on training with the new trainer instead.
PS: Kubeflow is migrating their training component from v1 (Kubeflow Training Operator) to v2 (Kubeflow Trainer).