Update README.md
Browse files
README.md
CHANGED
@@ -7,6 +7,8 @@ base_model:
|
|
7 |
## Overview
|
8 |
Arctic-Text2SQL-R1-7B is a 7-billion-parameter Text-to-SQL model fine-tuned using Group Relative Policy Optimization (GRPO) with a simple execution-based reward signal. It converts natural language questions into executable SQL queries.
|
9 |
|
|
|
|
|
10 |
## Key Features
|
11 |
|
12 |
- **Lightweight RL formulation**: Uses only execution correctness and syntax validity as rewards.
|
|
|
7 |
## Overview
|
8 |
Arctic-Text2SQL-R1-7B is a 7-billion-parameter Text-to-SQL model fine-tuned using Group Relative Policy Optimization (GRPO) with a simple execution-based reward signal. It converts natural language questions into executable SQL queries.
|
9 |
|
10 |
+
Read more in our paper: [Arctic-Text2SQL-R1: Simple Rewards, Strong Reasoning in Text-to-SQL](https://arxiv.org/abs/2505.20315).
|
11 |
+
|
12 |
## Key Features
|
13 |
|
14 |
- **Lightweight RL formulation**: Uses only execution correctness and syntax validity as rewards.
|