nvidia
/

OpenReasoning-Nemotron-1.5B

Text Generation

text-generation-inference

Model card Files Files and versions

smajumdar94 commited on Jul 16

Commit

9833ada

·

verified ·

1 Parent(s): 80c8336

Update README.md

Files changed (1) hide show

README.md +4 -4

README.md CHANGED Viewed

@@ -14,7 +14,7 @@ tags:
 # OpenReasoning-Nemotron-1.5B Overview
 ## Description: <br>
-OpenReasoning-Nemotron-1.5B is a large language model (LLM) which is a derivative of Qwen2.5-1.5B-Instruct (AKA the reference model). It is a reasoning model that is post-trained for reasoning about math, code and science solution generation. The model supports a context length of 48K tokens. The OpenReasoning model is available in the following sizes: 1.5B, 7B and 14B and 32B. <br>
 This model is ready for commercial/non-commercial research use. <br>
@@ -88,7 +88,7 @@ messages = [
 ]
 outputs = pipeline(
     messages,
-    max_new_tokens=48000,
 )
 print(outputs[0]["generated_text"][-1]['content'])
 ````
@@ -167,13 +167,13 @@ Network Architecture: Qwen-1.5B-Instruct
 **Input Type(s):** Text <br>
 **Input Format(s):** String <br>
 **Input Parameters:** One-Dimensional (1D) <br>
-**Other Properties Related to Input:** Context length up to 48,000 tokens <br>
 ## Output: <br>
 **Output Type(s):** Text <br>
 **Output Format:** String <br>
 **Output Parameters:** One-Dimensional (1D) <br>
-**Other Properties Related to Output:** Context length up to 48,000 tokens <br>
 Our AI models are designed and/or optimized to run on NVIDIA GPU-accelerated systems. By leveraging NVIDIA’s hardware (e.g. GPU cores) and software frameworks (e.g., CUDA libraries), the model achieves faster training and inference times compared to CPU-only solutions. <br>

 # OpenReasoning-Nemotron-1.5B Overview
 ## Description: <br>
+OpenReasoning-Nemotron-1.5B is a large language model (LLM) which is a derivative of Qwen2.5-1.5B-Instruct (AKA the reference model). It is a reasoning model that is post-trained for reasoning about math, code and science solution generation. The model supports a context length of 64K tokens. The OpenReasoning model is available in the following sizes: 1.5B, 7B and 14B and 32B. <br>
 This model is ready for commercial/non-commercial research use. <br>
 ]
 outputs = pipeline(
     messages,
+    max_new_tokens=64000,
 )
 print(outputs[0]["generated_text"][-1]['content'])
 ````
 **Input Type(s):** Text <br>
 **Input Format(s):** String <br>
 **Input Parameters:** One-Dimensional (1D) <br>
+**Other Properties Related to Input:** Context length up to 64,000 tokens <br>
 ## Output: <br>
 **Output Type(s):** Text <br>
 **Output Format:** String <br>
 **Output Parameters:** One-Dimensional (1D) <br>
+**Other Properties Related to Output:** Context length up to 64,000 tokens <br>
 Our AI models are designed and/or optimized to run on NVIDIA GPU-accelerated systems. By leveraging NVIDIA’s hardware (e.g. GPU cores) and software frameworks (e.g., CUDA libraries), the model achieves faster training and inference times compared to CPU-only solutions. <br>