|
--- |
|
datasets: |
|
- agentlans/crash-course |
|
base_model: |
|
- google/gemma-2-9b-it |
|
- FuseAI/FuseChat-Gemma-2-9B-Instruct |
|
- jsgreenawalt/gemma-2-9B-it-advanced-v2.1 |
|
tags: |
|
- gemma2 |
|
language: |
|
- en |
|
pipeline_tag: text-generation |
|
license: gemma |
|
--- |
|
# Gemma2-9B-AdvancedFuse |
|
|
|
Gemma2-9B-AdvancedFuse is an experimental, open-source large language model (LLM) with 9 billion parameters. |
|
It aims to combine the strengths of [FuseAI/FuseChat-Gemma-2-9B-Instruct](https://huggingface.co/fuseai/fusechat-gemma-2-9b-instruct) and |
|
[jsgreenawalt/gemma-2-9B-it-advanced-v2.1](https://huggingface.co/jsgreenawalt/gemma-2-9b-it-advanced-v2.1) through additive linear merging, |
|
further fine-tuned on a 12K row dataset from [agentlans/crash-course](https://huggingface.co/datasets/agentlans/crash-course) |
|
for enhanced chat and instruct performance, including math and multilingual prompts. |
|
|
|
## Capabilities |
|
- **Text Generation:** Generates coherent emails, summaries, and notes. This model card was primarily generated by the model itself. |
|
- **Instruction Following:** Demonstrates strong ability to understand and execute instructions in conversational settings. |
|
- **Roleplaying:** Can engage in third-person narrative roleplay but may exhibit common GPT expressions or clichés. |
|
|
|
### Limitations |
|
As with most large language models: |
|
- **Factual Errors:** May generate incorrect or outdated information due to data biases. |
|
- **Mathematical Operations:** Struggles with mathematical calculations requiring symbolic reasoning despite its finetuning data. |
|
- **Handling Unsafe Input:** May generate unsafe, biased, or malicious content if provided inappropriate input. Careful prompt engineering is recommended. |
|
|
|
### Model Usage Guidelines |
|
1. Use clear and specific instructions for optimal performance. |
|
2. Verify generated outputs for factual accuracy when critical information is involved. |
|
3. Avoid providing inputs that could lead to harmful or unethical responses. |
|
4. Consider using human review, especially in high-stakes applications. |