Update README.md
Browse files
README.md
CHANGED
@@ -114,11 +114,12 @@ outputs = pipeline(
|
|
114 |
print(outputs[0]["generated_text"][-1]['content'])
|
115 |
````
|
116 |
|
|
|
|
|
|
|
117 |
To use the model with GenSelect inference, we recommend following our
|
118 |
-
[reference implementation in NeMo-Skills](https://github.com/NVIDIA/NeMo-Skills/blob/main/nemo_skills/pipeline/genselect.py).
|
119 |
-
|
120 |
-
[prompt](https://github.com/NVIDIA/NeMo-Skills/blob/main/nemo_skills/prompt/config/openmath/genselect.yaml) for the math problems.
|
121 |
-
We will add the prompt we used for the coding problems and a reference implementation soon!
|
122 |
|
123 |
You can learn more about GenSelect in these papers:
|
124 |
* [AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset](https://arxiv.org/abs/2504.16891)
|
@@ -130,7 +131,7 @@ You can learn more about GenSelect in these papers:
|
|
130 |
If you find the data useful, please cite:
|
131 |
```
|
132 |
@article{ahmad2025opencodereasoning,
|
133 |
-
title={OpenCodeReasoning: Advancing Data Distillation for Competitive Coding},
|
134 |
author={Wasi Uddin Ahmad, Sean Narenthiran, Somshubra Majumdar, Aleksander Ficek, Siddhartha Jain, Jocelyn Huang, Vahid Noroozi, Boris Ginsburg},
|
135 |
year={2025},
|
136 |
eprint={2504.01943},
|
@@ -142,7 +143,7 @@ If you find the data useful, please cite:
|
|
142 |
|
143 |
```
|
144 |
@misc{ahmad2025opencodereasoningiisimpletesttime,
|
145 |
-
title={OpenCodeReasoning-II: A Simple Test Time Scaling Approach via Self-Critique},
|
146 |
author={Wasi Uddin Ahmad and Somshubra Majumdar and Aleksander Ficek and Sean Narenthiran and Mehrzad Samadi and Jocelyn Huang and Siddhartha Jain and Vahid Noroozi and Boris Ginsburg},
|
147 |
year={2025},
|
148 |
eprint={2507.09075},
|
@@ -154,7 +155,7 @@ If you find the data useful, please cite:
|
|
154 |
|
155 |
```
|
156 |
@misc{moshkov2025aimo2winningsolutionbuilding,
|
157 |
-
title={AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset},
|
158 |
author={Ivan Moshkov and Darragh Hanley and Ivan Sorokin and Shubham Toshniwal and Christof Henkel and Benedikt Schifferer and Wei Du and Igor Gitman},
|
159 |
year={2025},
|
160 |
eprint={2504.16891},
|
@@ -164,6 +165,16 @@ If you find the data useful, please cite:
|
|
164 |
}
|
165 |
```
|
166 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
167 |
## Additional Information:
|
168 |
|
169 |
### Deployment Geography:
|
|
|
114 |
print(outputs[0]["generated_text"][-1]['content'])
|
115 |
````
|
116 |
|
117 |
+
We have added [a simple transformer-based script](https://huggingface.co/nvidia/OpenReasoning-Nemotron-7B/blob/main/genselect_hf.py) in this repo to illustrate GenSelect.
|
118 |
+
To learn how to use the models in GenSelect mode with NeMo-Skills, see our [documentation](https://nvidia.github.io/NeMo-Skills/releases/openreasoning/evaluation/).
|
119 |
+
|
120 |
To use the model with GenSelect inference, we recommend following our
|
121 |
+
[reference implementation in NeMo-Skills](https://github.com/NVIDIA/NeMo-Skills/blob/main/nemo_skills/pipeline/genselect.py). Alternatively, you can manually extract the summary from all solutions and use this
|
122 |
+
[prompt](https://github.com/NVIDIA/NeMo-Skills/blob/main/nemo_skills/prompt/config/openmath/genselect.yaml) for the math problems. We will add the prompt we used for the coding problems and a reference implementation soon!
|
|
|
|
|
123 |
|
124 |
You can learn more about GenSelect in these papers:
|
125 |
* [AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset](https://arxiv.org/abs/2504.16891)
|
|
|
131 |
If you find the data useful, please cite:
|
132 |
```
|
133 |
@article{ahmad2025opencodereasoning,
|
134 |
+
title={{OpenCodeReasoning: Advancing Data Distillation for Competitive Coding}},
|
135 |
author={Wasi Uddin Ahmad, Sean Narenthiran, Somshubra Majumdar, Aleksander Ficek, Siddhartha Jain, Jocelyn Huang, Vahid Noroozi, Boris Ginsburg},
|
136 |
year={2025},
|
137 |
eprint={2504.01943},
|
|
|
143 |
|
144 |
```
|
145 |
@misc{ahmad2025opencodereasoningiisimpletesttime,
|
146 |
+
title={{OpenCodeReasoning-II: A Simple Test Time Scaling Approach via Self-Critique}},
|
147 |
author={Wasi Uddin Ahmad and Somshubra Majumdar and Aleksander Ficek and Sean Narenthiran and Mehrzad Samadi and Jocelyn Huang and Siddhartha Jain and Vahid Noroozi and Boris Ginsburg},
|
148 |
year={2025},
|
149 |
eprint={2507.09075},
|
|
|
155 |
|
156 |
```
|
157 |
@misc{moshkov2025aimo2winningsolutionbuilding,
|
158 |
+
title={{AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset}},
|
159 |
author={Ivan Moshkov and Darragh Hanley and Ivan Sorokin and Shubham Toshniwal and Christof Henkel and Benedikt Schifferer and Wei Du and Igor Gitman},
|
160 |
year={2025},
|
161 |
eprint={2504.16891},
|
|
|
165 |
}
|
166 |
```
|
167 |
|
168 |
+
```
|
169 |
+
@inproceedings{toshniwal2025genselect,
|
170 |
+
title={{GenSelect: A Generative Approach to Best-of-N}},
|
171 |
+
author={Shubham Toshniwal and Ivan Sorokin and Aleksander Ficek and Ivan Moshkov and Igor Gitman},
|
172 |
+
booktitle={2nd AI for Math Workshop @ ICML 2025},
|
173 |
+
year={2025},
|
174 |
+
url={https://openreview.net/forum?id=8LhnmNmUDb}
|
175 |
+
}
|
176 |
+
```
|
177 |
+
|
178 |
## Additional Information:
|
179 |
|
180 |
### Deployment Geography:
|