---
language: en
license: apache-2.0
base_model: google/gemma-7b
tags:
- financial-sentiment-analysis
- fine-tuned
- peft
- lora
- financial-phrasebank
- gemma
datasets:
- financial_phrasebank
metrics:
- accuracy
- f1
- precision
- recall
model-index:
- name: trained-gemma-sentences_allagree
  results:
  - task:
      type: text-classification
      name: Financial Sentiment Analysis
    dataset:
      type: financial_phrasebank
      name: Financial PhraseBank
      config: sentences_allagree
    metrics:
    - type: accuracy
      value: 0.876
      name: Accuracy
    - type: f1
      value: 0.870
      name: F1 Score
    - type: precision
      value: 0.875
      name: Precision
    - type: recall
      value: 0.865
      name: Recall
---
# Trained Gemma Sentences_Allagree
## Model Description
Gemma-7B fine-tuned for financial sentiment analysis using LoRA (Low-Rank Adaptation) on the Financial PhraseBank dataset, restricted to sentences with 100% annotator agreement (the `sentences_allagree` configuration).
## Model Details
- **Base Model**: google/gemma-7b
- **Fine-tuning Method**: LoRA (Low-Rank Adaptation)
- **Dataset**: Financial PhraseBank (sentences with 100% annotator agreement)
- **Task**: Financial Sentiment Analysis (3-class: positive, negative, neutral)
- **Language**: English
## Performance
| Metric | Value |
|--------|-------|
| Accuracy | 87.6% |
| F1 Score | 87.0% |
| Precision | 87.5% |
| Recall | 86.5% |
## Training Details
This model was fine-tuned as part of a Final Year Project on Financial Sentiment Analysis and Stock Prediction. The training used:
- **Training Framework**: Transformers + PEFT
- **Quantization**: 4-bit quantization using BitsAndBytes
- **Hardware**: CUDA-enabled GPU
- **Hyperparameter Optimization**: Extensive Optuna-based tuning
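The exact training scripts are not reproduced in this card. The sketch below shows how such a setup is typically wired together with Transformers, PEFT, and BitsAndBytes; the quantization settings and LoRA hyperparameters (rank, alpha, dropout, target modules) are illustrative assumptions, not the values found by the Optuna search.
```python
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model
import torch

# 4-bit quantization config; NF4 with fp16 compute is a common choice,
# but the exact settings used in training are an assumption here
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,
)

base_model = AutoModelForCausalLM.from_pretrained(
    "google/gemma-7b",
    quantization_config=bnb_config,
    device_map="auto",
)

# LoRA adapter; r/alpha/dropout and target modules are illustrative, not the tuned values
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(base_model, lora_config)
model.print_trainable_parameters()  # only the LoRA weights are trainable
```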
## Usage
```python
from transformers import AutoTokenizer, AutoModelForCausalLM
from peft import PeftModel
import torch

# Load base model and tokenizer
base_model = AutoModelForCausalLM.from_pretrained(
    "google/gemma-7b",
    torch_dtype=torch.float16,
    device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained("google/gemma-7b")

# Load the fine-tuned LoRA adapter on top of the base model
model = PeftModel.from_pretrained(base_model, "jengyang/trained-gemma-sentences_allagree-financial-sentiment")

# Prepare input
text = "The company reported strong quarterly earnings, exceeding analyst expectations."
prompt = f"Classify the sentiment of this financial text as positive, negative, or neutral: {text}\n\nSentiment:"

# Tokenize (moving tensors to the model's device) and generate greedily
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
with torch.no_grad():
    outputs = model.generate(
        **inputs,
        max_new_tokens=10,
        do_sample=False,
        pad_token_id=tokenizer.eos_token_id
    )
response = tokenizer.decode(outputs[0], skip_special_tokens=True)
print(response)
```
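Because the model generates free-form text, the predicted class must be parsed out of the continuation. A minimal heuristic, continuing from the `prompt` and `response` variables above (an illustrative sketch, not part of the released code):
```python
# Strip the prompt and look for the first class keyword in the continuation;
# falling back to "neutral" is an arbitrary choice for this sketch
completion = response[len(prompt):].strip().lower()
label = next((c for c in ("positive", "negative", "neutral") if c in completion), "neutral")
print(label)
```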
## Training Data
The model was trained on the Financial PhraseBank dataset, specifically using sentences where 100% of annotators agreed on the sentiment label. This ensures higher quality and consistency in the training data.
The Financial PhraseBank contains sentences drawn from English-language financial news, each categorized as:
- **Positive**: Favorable financial news
- **Negative**: Unfavorable financial news
- **Neutral**: Factual financial information without clear sentiment
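The exact train/test split used for fine-tuning is not reproduced here, but the 100%-agreement subset can be loaded directly from the Hugging Face Hub (on recent `datasets` versions the dataset may be published as `takala/financial_phrasebank` and require `trust_remote_code=True`):
```python
from datasets import load_dataset

# "sentences_allagree" keeps only sentences where all annotators agreed
dataset = load_dataset("financial_phrasebank", "sentences_allagree")

# Labels are integers: 0 = negative, 1 = neutral, 2 = positive
example = dataset["train"][0]
print(example["sentence"], example["label"])
```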
## Evaluation
The model was evaluated on a held-out test set from the Financial PhraseBank dataset. The evaluation metrics reflect performance on financial sentiment classification with the 100% agreement threshold.
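The evaluation script itself is not included in this card; as a reference, the reported metrics can be computed from model predictions along these lines (macro averaging is an assumption, and the `labels`/`preds` values below are dummies for illustration):
```python
from sklearn.metrics import accuracy_score, precision_recall_fscore_support

# Dummy predictions and gold labels for illustration; in practice these come
# from running the model over the held-out Financial PhraseBank test split
labels = ["positive", "neutral", "negative", "neutral"]
preds = ["positive", "neutral", "neutral", "neutral"]

accuracy = accuracy_score(labels, preds)
precision, recall, f1, _ = precision_recall_fscore_support(
    labels, preds, average="macro", zero_division=0
)
print(f"accuracy={accuracy:.3f} precision={precision:.3f} recall={recall:.3f} f1={f1:.3f}")
```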
**Note**: Gemma models in this series achieved up to 87.6% accuracy, a strong result on the `sentences_allagree` configuration of the Financial PhraseBank.
## Limitations and Bias
- The model is specifically designed for financial text sentiment analysis
- Performance may vary on non-financial text or different domains
- The model reflects the biases present in the Financial PhraseBank dataset
- Results should be interpreted within the context of financial sentiment analysis
- The model may not capture nuanced sentiment in complex financial scenarios
## Intended Use
**Intended Use Cases:**
- Financial news sentiment analysis
- Investment research and analysis
- Automated financial content classification
- Academic research in financial NLP
**Out-of-Scope Use Cases:**
- General-purpose sentiment analysis
- Medical or legal text analysis
- Real-time trading decisions without human oversight
## Citation
If you use this model, please cite:
```bibtex
@misc{trained_gemma_sentences_allagree,
  title={Trained Gemma Sentences_Allagree: Fine-tuned gemma-7b for Financial Sentiment Analysis},
  author={Final Year Project},
  year={2024},
  howpublished={\url{https://huggingface.co/jengyang/trained-gemma-sentences_allagree-financial-sentiment}}
}
```
## Model Card Authors
This model card was generated as part of a Final Year Project on Financial Sentiment Analysis and Stock Prediction.