SoarAILabs
/

KiteResolve-20B

@@ -1,11 +1,5 @@
 ---
 license: apache-2.0
----
-Here's a professional and engaging model card for your KiteResolve-20B model:
-```markdown
----
-license: mit
 base_model: openai/gpt-oss-20b
 tags:
 - merge-conflicts
@@ -14,16 +8,22 @@ tags:
 - code-generation
 - version-control
 - devops
-language:
 - en
 pipeline_tag: text-generation
 library_name: transformers
 datasets:
 - SoarAILabs/merge-conflict-dataset
 metrics:
-- bleu
-- rouge
-- exact_match
 model-index:
 - name: KiteResolve-20B
   results:
@@ -36,7 +36,7 @@ model-index:
       name: Exact Match
     - type: bleu
       value: 54.83
-      name: BLEU Score
     - type: rouge-l
       value: 67.10
       name: ROUGE-L
@@ -47,7 +47,7 @@ model-index:
 *Developed by [Soar AI Labs](https://huggingface.co/SoarAILabs)*
 <div align="center">
-  <img src="https://img.shields.io/badge/License-MIT-blue.svg" alt="License">
   <img src="https://img.shields.io/badge/Model-20B%20Parameters-red.svg" alt="Parameters">
   <img src="https://img.shields.io/badge/Task-Code%20Generation-green.svg" alt="Task">
   <img src="https://img.shields.io/badge/BLEU-54.83-orange.svg" alt="BLEU Score">
@@ -59,27 +59,27 @@ model-index:
 ### ✨ Key Features
-- 🎯 **20% Exact Match Accuracy** on real-world merge conflicts
-- 📈 **43.64% BLEU Score Improvement** over base model
-- 🌐 **Multi-Language Support**: Java, JavaScript, Python, C#, TypeScript, and more
-- ⚡ **Fast Inference**: Optimized for CLI and webhook integrations
-- 🔧 **Production Ready**: Designed for enterprise Git workflows
 ## 📊 Performance Metrics
 | Metric | Score | Improvement |
 |--------|-------|-------------|
-| **Exact Match** | 20.0% | ↗️ 20.0% |
-| **BLEU Score** | 54.83% | ↗️ +43.64% |
-| **ROUGE-L** | 67.10% | ↗️ +33.65% |
-*Evaluated on 20 held-out samples from real-world merge conflicts*
 ## 🛠️ Usage
 ### Quick Start
-```
 from transformers import AutoModelForCausalLM, AutoTokenizer
 from unsloth.chat_templates import get_chat_template
@@ -101,119 +101,12 @@ function calculateTotal(items) {
 >>>>>>> theirs
 """
-messages = [{"role": "user", "content": f"Resolve this merge conflict:\n```{conflict}```
 prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
 inputs = tokenizer([prompt], return_tensors="pt")
 outputs = model.generate(**inputs, max_new_tokens=200, do_sample=False)
-resolution = tokenizer.decode(outputs[inputs['input_ids'].shape:], skip_special_tokens=True)[1]
 print(resolution)
 ```
-### Integration Examples
-#### GitHub Webhook Integration
-```
-# Perfect for automated PR conflict resolution
-@app.route('/webhook', methods=['POST'])
-def handle_merge_conflict():
-    conflict_data = request.json
-    resolution = model.resolve_conflict(conflict_data['conflict'])
-    create_resolution_commit(resolution)
-    return {"status": "resolved"}
-```
-## 🎯 Intended Use Cases
-### Primary Applications
-- **Automated CI/CD Pipelines**: Resolve conflicts in merge requests automatically
-- **Developer Productivity Tools**: Speed up code integration workflows
-- **Git Workflow Automation**: Reduce manual intervention in version control
-- **Code Review Assistance**: Pre-resolve conflicts before human review
-### Supported Scenarios
-- ✅ Simple syntactic conflicts (variable names, imports)
-- ✅ Formatting and whitespace conflicts
-- ✅ Method signature changes
-- ✅ Configuration file updates
-- ⚠️ Complex semantic conflicts may require human review
-## 🏗️ Training Details
-### Base Model
-- **Architecture**: GPT-OSS-20B (20 billion parameters)
-- **Fine-tuning Method**: Full parameter fine-tuning with LoRA adapters
-- **Training Framework**: Unsloth for efficient training
-### Training Data
-- **Dataset Size**: 956 curated merge conflict examples
-- **Data Sources**: Real-world GitHub repositories
-- **Languages**: Java, JavaScript, Python, C#, TypeScript, Go, Rust
-- **Conflict Types**: Syntactic, semantic, and formatting conflicts
-### Training Configuration
-- **Batch Size**: Optimized for merge conflict patterns
-- **Learning Rate**: Fine-tuned for code generation
-- **Epochs**: Trained until convergence on validation set
-- **Hardware**: NVIDIA A100 GPUs
-## 🔍 Evaluation
-### Test Methodology
-- **Evaluation Set**: 20 held-out real-world merge conflicts
-- **Metrics**: Exact Match, BLEU, ROUGE-L, Character Similarity
-- **Comparison**: Benchmarked against GPT-OSS-20B base model
-- **Validation**: Human expert review of generated resolutions
-### Sample Results
-```
-Sample Conflict Type: JavaScript import statements
-Expected: import { helper } from './utils';
-Generated: import { helper } from './utils';
-Result: ✅ Exact Match
-```
-## 🏢 About Soar AI Labs
-**Soar AI Labs** develops cutting-edge AI solutions for software development workflows. Our mission is to eliminate friction in the development process through intelligent automation.
-### Our Products
-- 🪁 **KiteResolve**: AI-powered merge conflict resolution
-- 🔧 **Developer Tools**: CLI utilities and IDE integrations
-- 🚀 **Future**: More AI-powered DevOps solutions coming soon
-## 📚 Citation
-```
-@misc{kiteResolve2025,
-  title={KiteResolve-20B: Fine-tuned GPT-OSS for Automated Merge Conflict Resolution},
-  author={Soar AI Labs},
-  year={2025},
-  publisher={Hugging Face},
-  url={https://huggingface.co/SoarAILabs/KiteResolve-20B}
-}
-```
-## 📄 License
-This model is released under the MIT License. See the [LICENSE](LICENSE) file for details.
-## 🤝 Contributing
-Interested in improving KiteResolve? We welcome contributions!
-- 🐛 **Report Issues**: Found a conflict type we don't handle well?
-- 💡 **Feature Requests**: Ideas for new capabilities?
-- 🔧 **Pull Requests**: Code improvements and extensions
-Visit our [GitHub Organization](https://github.com/SoarAILabs) to get involved.
----
-<div align="center">
-  <strong>Built with ❤️ by Soar AI Labs</strong><br>
-  <em>Elevating developer productivity through AI</em>
-</div>
-```

 ---
 license: apache-2.0
 base_model: openai/gpt-oss-20b
 tags:
 - merge-conflicts
 - code-generation
 - version-control
 - devops
+languages:
 - en
 pipeline_tag: text-generation
 library_name: transformers
 datasets:
 - SoarAILabs/merge-conflict-dataset
 metrics:
+- name: exact_match
+  type: exact_match
+  value: 20.0
+- name: bleu
+  type: bleu
+  value: 54.83
+- name: rouge-l
+  type: rouge
+  value: 67.10
 model-index:
 - name: KiteResolve-20B
   results:
       name: Exact Match
     - type: bleu
       value: 54.83
+      name: BLEU
     - type: rouge-l
       value: 67.10
       name: ROUGE-L
 *Developed by [Soar AI Labs](https://huggingface.co/SoarAILabs)*
 <div align="center">
+  <img src="https://img.shields.io/badge/License-Apache%202.0-blue.svg" alt="License">
   <img src="https://img.shields.io/badge/Model-20B%20Parameters-red.svg" alt="Parameters">
   <img src="https://img.shields.io/badge/Task-Code%20Generation-green.svg" alt="Task">
   <img src="https://img.shields.io/badge/BLEU-54.83-orange.svg" alt="BLEU Score">
 ### ✨ Key Features
+- 🎯 **20% Exact Match Accuracy** on real-world merge conflicts
+- 📈 **43.6% BLEU Score Improvement** over base model
+- 🌐 **Multi-Language Support**: Java, JavaScript, Python, C#, TypeScript, and more
+- ⚡ **Fast Inference**: Optimized for CLI and webhook integrations
+- 🔧 **Production Ready**: Designed for enterprise Git workflows
 ## 📊 Performance Metrics
 | Metric | Score | Improvement |
 |--------|-------|-------------|
+| **Exact Match** | 20.0 | – |
+| **BLEU Score** | 54.83 | ↗️ +43.6% |
+| **ROUGE-L** | 67.10 | ↗️ +33.7% |
+*Evaluated on 20 held-out samples from real-world merge conflicts.*
 ## 🛠️ Usage
 ### Quick Start
+```python
 from transformers import AutoModelForCausalLM, AutoTokenizer
 from unsloth.chat_templates import get_chat_template
 >>>>>>> theirs
 """
+messages = [{"role": "user", "content": f"Resolve this merge conflict:\n```{conflict}```"}]
 prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
 inputs = tokenizer([prompt], return_tensors="pt")
 outputs = model.generate(**inputs, max_new_tokens=200, do_sample=False)
+resolution = tokenizer.decode(outputs[0], skip_special_tokens=True)
 print(resolution)
 ```