Add comprehensive model card

e59db30 verified 2 months ago

4.91 kB

	---
	language: en
	license: apache-2.0
	base_model: google/gemma-7b
	tags:
	- financial-sentiment-analysis
	- fine-tuned
	- peft
	- lora
	- financial-phrasebank
	- gemma
	datasets:
	- financial_phrasebank
	metrics:
	- accuracy
	- f1
	- precision
	- recall
	model-index:
	- name: trained-gemma-sentences_allagree
	results:
	- task:
	type: text-classification
	name: Financial Sentiment Analysis
	dataset:
	type: financial_phrasebank
	name: Financial PhraseBank
	config: sentences_allagree
	metrics:
	- type: accuracy
	value: 0.876
	name: Accuracy
	- type: f1
	value: 0.870
	name: F1 Score
	- type: precision
	value: 0.875
	name: Precision
	- type: recall
	value: 0.865
	name: Recall
	---

	# Trained Gemma Sentences_Allagree

	## Model Description

	Gemma-7B fine-tuned on financial sentiment (100% agreement threshold). This model was fine-tuned using LoRA (Low-Rank Adaptation) on the Financial PhraseBank dataset with 100% annotator agreement threshold.

	## Model Details

	- Base Model: google/gemma-7b
	- Fine-tuning Method: LoRA (Low-Rank Adaptation)
	- Dataset: Financial PhraseBank (sentences with 100% annotator agreement)
	- Task: Financial Sentiment Analysis (3-class: positive, negative, neutral)
	- Language: English

	## Performance

	\| Metric \| Value \|
	\|--------\|-------\|
	\| Accuracy \| 87.6% \|
	\| F1 Score \| 87.0% \|
	\| Precision \| 87.5% \|
	\| Recall \| 86.5% \|

	## Training Details

	This model was fine-tuned as part of a Final Year Project on Financial Sentiment Analysis and Stock Prediction. The training used:

	- Training Framework: Transformers + PEFT
	- Quantization: 4-bit quantization using BitsAndBytes
	- Hardware: CUDA-enabled GPU
	- Hyperparameter Optimization: Extensive Optuna-based tuning

	## Usage

	```python
	from transformers import AutoTokenizer, AutoModelForCausalLM
	from peft import PeftModel
	import torch

	# Load base model and tokenizer
	base_model = AutoModelForCausalLM.from_pretrained(
	"google/gemma-7b",
	torch_dtype=torch.float16,
	device_map="auto"
	)
	tokenizer = AutoTokenizer.from_pretrained("google/gemma-7b")

	# Load fine-tuned model
	model = PeftModel.from_pretrained(base_model, "jengyang/trained-gemma-sentences_allagree-financial-sentiment")

	# Prepare input
	text = "The company reported strong quarterly earnings, exceeding analyst expectations."
	prompt = f"Classify the sentiment of this financial text as positive, negative, or neutral: {text}\n\nSentiment:"

	# Tokenize and generate
	inputs = tokenizer(prompt, return_tensors="pt")
	with torch.no_grad():
	outputs = model.generate(
	**inputs,
	max_new_tokens=10,
	do_sample=False,
	pad_token_id=tokenizer.eos_token_id
	)

	response = tokenizer.decode(outputs[0], skip_special_tokens=True)
	print(response)
	```

	## Training Data

	The model was trained on the Financial PhraseBank dataset, specifically using sentences where 100% of annotators agreed on the sentiment label. This ensures higher quality and consistency in the training data.

	The Financial PhraseBank contains financial news headlines categorized into:
	- Positive: Favorable financial news
	- Negative: Unfavorable financial news
	- Neutral: Factual financial information without clear sentiment

	## Evaluation

	The model was evaluated on a held-out test set from the Financial PhraseBank dataset. The evaluation metrics reflect performance on financial sentiment classification with the 100% agreement threshold.

	Note: Gemma models in this series achieved up to 87.6% accuracy, representing state-of-the-art performance on financial sentiment analysis tasks.

	## Limitations and Bias

	- The model is specifically designed for financial text sentiment analysis
	- Performance may vary on non-financial text or different domains
	- The model reflects the biases present in the Financial PhraseBank dataset
	- Results should be interpreted within the context of financial sentiment analysis
	- The model may not capture nuanced sentiment in complex financial scenarios

	## Intended Use

	Intended Use Cases:
	- Financial news sentiment analysis
	- Investment research and analysis
	- Automated financial content classification
	- Academic research in financial NLP

	Out-of-Scope Use Cases:
	- General-purpose sentiment analysis
	- Medical or legal text analysis
	- Real-time trading decisions without human oversight

	## Citation

	If you use this model, please cite:

	```bibtex
	@misc{trained_gemma_sentences_allagree,
	title={Trained Gemma Sentences_Allagree: Fine-tuned gemma-7b for Financial Sentiment Analysis},
	author={Final Year Project},
	year={2024},
	howpublished={\url{https://huggingface.co/jengyang/trained-gemma-sentences_allagree-financial-sentiment}}
	}
	```

	## Model Card Authors

	This model card was generated as part of a Final Year Project on Financial Sentiment Analysis and Stock Prediction.