stoshniwal commited on
Commit
8f7dc80
·
verified ·
1 Parent(s): 0ec0d2c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +18 -7
README.md CHANGED
@@ -114,11 +114,12 @@ outputs = pipeline(
114
  print(outputs[0]["generated_text"][-1]['content'])
115
  ````
116
 
 
 
 
117
  To use the model with GenSelect inference, we recommend following our
118
- [reference implementation in NeMo-Skills](https://github.com/NVIDIA/NeMo-Skills/blob/main/nemo_skills/pipeline/genselect.py).
119
- Alternatively, you can manually extract the summary from all solutions and use this
120
- [prompt](https://github.com/NVIDIA/NeMo-Skills/blob/main/nemo_skills/prompt/config/openmath/genselect.yaml) for the math problems.
121
- We will add the prompt we used for the coding problems and a reference implementation soon!
122
 
123
  You can learn more about GenSelect in these papers:
124
  * [AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset](https://arxiv.org/abs/2504.16891)
@@ -130,7 +131,7 @@ You can learn more about GenSelect in these papers:
130
  If you find the data useful, please cite:
131
  ```
132
  @article{ahmad2025opencodereasoning,
133
- title={OpenCodeReasoning: Advancing Data Distillation for Competitive Coding},
134
  author={Wasi Uddin Ahmad, Sean Narenthiran, Somshubra Majumdar, Aleksander Ficek, Siddhartha Jain, Jocelyn Huang, Vahid Noroozi, Boris Ginsburg},
135
  year={2025},
136
  eprint={2504.01943},
@@ -142,7 +143,7 @@ If you find the data useful, please cite:
142
 
143
  ```
144
  @misc{ahmad2025opencodereasoningiisimpletesttime,
145
- title={OpenCodeReasoning-II: A Simple Test Time Scaling Approach via Self-Critique},
146
  author={Wasi Uddin Ahmad and Somshubra Majumdar and Aleksander Ficek and Sean Narenthiran and Mehrzad Samadi and Jocelyn Huang and Siddhartha Jain and Vahid Noroozi and Boris Ginsburg},
147
  year={2025},
148
  eprint={2507.09075},
@@ -154,7 +155,7 @@ If you find the data useful, please cite:
154
 
155
  ```
156
  @misc{moshkov2025aimo2winningsolutionbuilding,
157
- title={AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset},
158
  author={Ivan Moshkov and Darragh Hanley and Ivan Sorokin and Shubham Toshniwal and Christof Henkel and Benedikt Schifferer and Wei Du and Igor Gitman},
159
  year={2025},
160
  eprint={2504.16891},
@@ -164,6 +165,16 @@ If you find the data useful, please cite:
164
  }
165
  ```
166
 
 
 
 
 
 
 
 
 
 
 
167
  ## Additional Information:
168
 
169
  ### Deployment Geography:
 
114
  print(outputs[0]["generated_text"][-1]['content'])
115
  ````
116
 
117
+ We have added [a simple transformer-based script](https://huggingface.co/nvidia/OpenReasoning-Nemotron-7B/blob/main/genselect_hf.py) in this repo to illustrate GenSelect.
118
+ To learn how to use the models in GenSelect mode with NeMo-Skills, see our [documentation](https://nvidia.github.io/NeMo-Skills/releases/openreasoning/evaluation/).
119
+
120
  To use the model with GenSelect inference, we recommend following our
121
+ [reference implementation in NeMo-Skills](https://github.com/NVIDIA/NeMo-Skills/blob/main/nemo_skills/pipeline/genselect.py). Alternatively, you can manually extract the summary from all solutions and use this
122
+ [prompt](https://github.com/NVIDIA/NeMo-Skills/blob/main/nemo_skills/prompt/config/openmath/genselect.yaml) for the math problems. We will add the prompt we used for the coding problems and a reference implementation soon!
 
 
123
 
124
  You can learn more about GenSelect in these papers:
125
  * [AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset](https://arxiv.org/abs/2504.16891)
 
131
  If you find the data useful, please cite:
132
  ```
133
  @article{ahmad2025opencodereasoning,
134
+ title={{OpenCodeReasoning: Advancing Data Distillation for Competitive Coding}},
135
  author={Wasi Uddin Ahmad, Sean Narenthiran, Somshubra Majumdar, Aleksander Ficek, Siddhartha Jain, Jocelyn Huang, Vahid Noroozi, Boris Ginsburg},
136
  year={2025},
137
  eprint={2504.01943},
 
143
 
144
  ```
145
  @misc{ahmad2025opencodereasoningiisimpletesttime,
146
+ title={{OpenCodeReasoning-II: A Simple Test Time Scaling Approach via Self-Critique}},
147
  author={Wasi Uddin Ahmad and Somshubra Majumdar and Aleksander Ficek and Sean Narenthiran and Mehrzad Samadi and Jocelyn Huang and Siddhartha Jain and Vahid Noroozi and Boris Ginsburg},
148
  year={2025},
149
  eprint={2507.09075},
 
155
 
156
  ```
157
  @misc{moshkov2025aimo2winningsolutionbuilding,
158
+ title={{AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset}},
159
  author={Ivan Moshkov and Darragh Hanley and Ivan Sorokin and Shubham Toshniwal and Christof Henkel and Benedikt Schifferer and Wei Du and Igor Gitman},
160
  year={2025},
161
  eprint={2504.16891},
 
165
  }
166
  ```
167
 
168
+ ```
169
+ @inproceedings{toshniwal2025genselect,
170
+ title={{GenSelect: A Generative Approach to Best-of-N}},
171
+ author={Shubham Toshniwal and Ivan Sorokin and Aleksander Ficek and Ivan Moshkov and Igor Gitman},
172
+ booktitle={2nd AI for Math Workshop @ ICML 2025},
173
+ year={2025},
174
+ url={https://openreview.net/forum?id=8LhnmNmUDb}
175
+ }
176
+ ```
177
+
178
  ## Additional Information:
179
 
180
  ### Deployment Geography: