aashish1904 commited on
Commit
b3dce27
·
verified ·
1 Parent(s): f48ab49

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +152 -0
README.md ADDED
@@ -0,0 +1,152 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+
2
+ ---
3
+
4
+ license: apache-2.0
5
+ library_name: transformers
6
+ language:
7
+ - en
8
+ base_model:
9
+ - Qwen/Qwen2.5-14B
10
+ pipeline_tag: text-generation
11
+
12
+ ---
13
+
14
+ [![QuantFactory Banner](https://lh7-rt.googleusercontent.com/docsz/AD_4nXeiuCm7c8lEwEJuRey9kiVZsRn2W-b4pWlu3-X534V3YmVuVc2ZL-NXg2RkzSOOS2JXGHutDuyyNAUtdJI65jGTo8jT9Y99tMi4H4MqL44Uc5QKG77B0d6-JfIkZHFaUA71-RtjyYZWVIhqsNZcx8-OMaA?key=xt3VSDoCbmTY7o-cwwOFwQ)](https://hf.co/QuantFactory)
15
+
16
+
17
+ # QuantFactory/Datarus-R1-14B-preview-GGUF
18
+ This is quantized version of [DatarusAI/Datarus-R1-14B-preview](https://huggingface.co/DatarusAI/Datarus-R1-14B-preview) created using llama.cpp
19
+
20
+ # Original Model Card
21
+
22
+
23
+ # Datarus-R1-14B-preview
24
+
25
+ <div align="center">
26
+ <img src="https://i.postimg.cc/7hsStNgm/logo-icon-2-1.png" alt="Datarus Logo" width="150"/>
27
+
28
+ [![Model](https://img.shields.io/badge/Model-Datarus--R1--14B-blue)](https://huggingface.co/DatarusAI/Datarus-R1-14B-preview)
29
+ [![License](https://img.shields.io/badge/License-Apache%202.0-green)](LICENSE)
30
+ [![Website](https://img.shields.io/badge/Website-datarus.ai-orange)](https://datarus.ai)
31
+ [![Demo](https://img.shields.io/badge/Demo-Try%20Now-purple)](https://chat.datarus.ai)
32
+ [![Paper](https://img.shields.io/badge/Paper-arXiv-red)](https://arxiv.org/abs/2508.13382)
33
+ </div>
34
+
35
+ ## 🚀 Overview
36
+
37
+ **Datarus-R1-14B-Preview** is a 14B-parameter open-weights language model fine-tuned from Qwen2.5-14B-Instruct, designed to act as a virtual data analyst and graduate-level problem solver. Unlike traditional models trained on isolated Q&A pairs, Datarus learns from complete analytical trajectories—including reasoning steps, code execution, error traces, self-corrections, and final conclusions—all captured in a ReAct-style notebook format.
38
+
39
+ ### Key Highlights
40
+
41
+ - **🎯 State-of-the-art efficiency**: Surpasses similar-sized models and competes with 32B+ models while using 18-49% fewer tokens
42
+ - **🔄 Dual reasoning interfaces**: Supports both Agentic (ReAct) mode for interactive analysis and Reflection (CoT) mode for concise documentation
43
+ - **📊 Superior performance**: Achieves up to 30% higher accuracy on AIME 2024/2025 and LiveCodeBench
44
+ - **💡 "AHA-moment" pattern**: Exhibits efficient hypothesis refinement in 1-2 iterations, avoiding circular reasoning loops
45
+
46
+ ## 🔗 Quick Links
47
+
48
+ - 🌐 **Website**: [https://datarus.ai](https://datarus.ai)
49
+ - 💬 **Try the Demo**: [https://chat.datarus.ai](https://chat.datarus.ai)
50
+ - 🛠️ **Jupyter Agent**: [GitHub Repository](https://github.com/DatarusAI/Datarus-JupyterAgent)
51
+ - 📄 **Paper**: [Datarus-R1: An Adaptive Multi-Step Reasoning LLM](https://arxiv.org/abs/2508.13382)
52
+
53
+ ## 📊 Performance
54
+
55
+ ### Benchmark Results
56
+
57
+ | Benchmark | Datarus-R1-14B-Preview | QwQ-32B | Phi-4-reasoning | DeepSeek-R1-Distill-14B |
58
+ |-----------|----------------|---------|-----------------|-------------------------|
59
+ | **LiveCodeBench v6** | 57.7 | 56.6 | 52.6 | 48.6 |
60
+ | **AIME 2024** | 70.1 | 76.2 | 74.6* | - |
61
+ | **AIME 2025** | 66.2 | 66.2 | 63.1* | - |
62
+ | **GPQA Diamond** | 62.1 | 60.1 | 55.0 | 58.6 |
63
+
64
+ *Reported values from official papers
65
+
66
+ ### Token Efficiency and Performance
67
+
68
+ <div align="center">
69
+ <img src="https://i.postimg.cc/NMSppNM4/perf-efficiency.png" alt="LCB-Efficiency" width="600"/>
70
+ <img src="https://i.postimg.cc/nV341Ssf/efficiency.png" alt="Efficiency" width="600" />
71
+ </div>
72
+
73
+ ## 🎯 Model Card
74
+
75
+ ### Model Details
76
+
77
+ - **Model Type**: Language Model for Reasoning and Data Analysis
78
+ - **Parameters**: 14.8B
79
+ - **Training Data**: 144,000 synthetic analytical trajectories across finance, medicine, numerical analysis, and other quantitative domains + A curated collection of reasoning datasets.
80
+ - **Language**: English
81
+ - **License**: Apache 2.0
82
+
83
+ ### Intended Use
84
+
85
+ #### Primary Use Cases
86
+ - **Data Analysis**: Automated data exploration, statistical analysis, and visualization
87
+ - **Mathematical Problem Solving**: Graduate-level mathematics including AIME-level problems
88
+ - **Code Generation**: Creating analytical scripts and solving programming challenges
89
+ - **Scientific Reasoning**: Complex problem-solving in physics, chemistry, and other sciences
90
+ - **Interactive Notebooks**: Building complete analysis notebooks with iterative refinement
91
+
92
+ ### Dual Mode Usage
93
+
94
+ #### Agentic Mode (for interactive analysis)
95
+ - Use `<step>`, `<thought>`, `<action>`, `<action_input>`, `<observation>` tags
96
+ - Enables iterative code execution and refinement
97
+ - Best for data analysis, simulations, and exploratory tasks
98
+
99
+ #### Reflection Mode (for documentation)
100
+ - Use `<think>` and `<answer>` tags
101
+ - Produces compact, self-contained reasoning chains
102
+ - Best for mathematical proofs, explanations, and reports
103
+
104
+ ## 📚 Citation
105
+
106
+ ```bibtex
107
+ @article{benchaliah2025datarus,
108
+ title={Datarus-R1: An Adaptive Multi-Step Reasoning LLM for Automated Data Analysis},
109
+ author={Ben Chaliah, Ayoub and Dellagi, Hela},
110
+ journal={arXiv preprint arXiv:2508.13382},
111
+ year={2025}
112
+ }
113
+ ```
114
+
115
+ ## 🤝 Contributing
116
+
117
+ We welcome contributions! Please see our [GitHub repository](https://github.com/DatarusAI/Datarus-JupyterAgent) for:
118
+ - Bug reports and feature requests
119
+ - Pull requests
120
+ - Discussion forums
121
+
122
+ ## 📄 License
123
+
124
+ This model is released under the Apache 2.0 License.
125
+
126
+ ## 🙏 Acknowledgments
127
+
128
+ We thank the Qwen team for the excellent base model and the open-source community for their valuable contributions.
129
+
130
+ ## 📧 Contact
131
+
132
133
+ - **Website**: [https://datarus.ai](https://datarus.ai)
134
+ - **Demo**: [https://chat.datarus.ai](https://chat.datarus.ai)
135
+
136
+ ---
137
+
138
+ <div align="center">
139
+ <strong>Experience the future of AI-powered data analysis with Datarus-R1</strong>
140
+
141
+ [Try Demo](https://chat.datarus.ai) | [View Code](https://github.com/DatarusAI/Datarus-JupyterAgent) | [Read Paper](https://arxiv.org/abs/2508.13382)
142
+ </div>
143
+
144
+ ## ⭐ Support
145
+
146
+ If you find this model and Agent pipeline useful, please consider __Like/Star__! Your support helps us continue improving the project.
147
+
148
+ Found a bug or have a feature request? Please open an issue on GitHub.
149
+
150
+ ---
151
+
152
+ <p align="center">Made with ❤️ by the Datarus Team from Paris</p>