Move evaluation to front
README.md
CHANGED
@@ -22,14 +22,23 @@ pipeline_tag: summarization
 ## **Model Details**
 This is a **LoRA fine-tuned adapter** built on [**meta-llama/Llama-3.2-1B-Instruct**](https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct). It is designed for scientific paper summarization tasks and leverages **Low-Rank Adaptation (LoRA)** to enhance model performance efficiently while maintaining a low computational overhead.
 
-
+
+## **Performance Comparison**
+| Model | ROUGE-1 | ROUGE-2 | ROUGE-3 | ROUGE-L |
+|---------------------------|----------|----------|----------|----------|
+| **Llama-3.2-1B-Instruct** | 36.69 | 7.47 | 1.95 | 19.36 |
+| **Llama-PaperSummarization-LoRA** | **41.56** | **11.31** | **2.67** | **21.86** |
+
+The model was evaluated on a **6K-sample test set** using **ROUGE scores** with the following settings:
+- **Decoding Strategy**: Beam search (beam size = 4)
+
 
 ## **Dataset**
 The model was fine-tuned on the [**armanc/scientific_papers**](https://huggingface.co/datasets/armanc/scientific_papers) dataset. Below are the details of the dataset splits:
 - **Training Set**: 20K samples
 - **Validation Set**: 6K samples
+- **Test Set**: 6K samples
 
----
 
 ## **LoRA Configuration**
 - **Trainable Parameters**: 850K (~7% of base model parameters)

@@ -46,19 +55,6 @@ The model was fine-tuned on the [**armanc/scientific_papers**](https://huggingfa
 - **Training Duration**: 28 hours
 - **Training Scripts**: [gabe-zhang/paper2summary](https://github.com/gabe-zhang/paper2summary)
 
----
-
-## **Evaluation**
-The model was evaluated on a **6K-sample test set** using **ROUGE scores** with the following settings:
-- **Decoding Strategy**: Beam search (beam size = 4)
-
-### **Performance Comparison**
-| Model | ROUGE-1 | ROUGE-2 | ROUGE-3 | ROUGE-L |
-|---------------------------|----------|----------|----------|----------|
-| **Llama-3.2-1B-Instruct** | 36.69 | 7.47 | 1.95 | 19.36 |
-| **Llama-PaperSummarization-LoRA** | **41.56** | **11.31** | **2.67** | **21.86** |
-
----
 
 ## **License**
 This repository contains a **LoRA fine-tuned adapter** derived from the Llama 3.2 model.
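For the **Model Details** section above, a minimal sketch of how an adapter like this is typically loaded on top of its base model with Transformers and PEFT follows; `adapter_id` is a placeholder rather than a repo id taken from this card.

```python
# Minimal sketch: attach a LoRA adapter to the frozen Llama-3.2-1B-Instruct base model.
# "adapter_id" is a placeholder; substitute the Hub id or local path of this adapter.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_id = "meta-llama/Llama-3.2-1B-Instruct"
adapter_id = "<this-adapter-repo-or-path>"  # placeholder, not stated in the card

tokenizer = AutoTokenizer.from_pretrained(base_id)
base_model = AutoModelForCausalLM.from_pretrained(base_id, torch_dtype=torch.bfloat16)

# PeftModel.from_pretrained loads the LoRA weights and wraps the base model for inference.
model = PeftModel.from_pretrained(base_model, adapter_id)
model.eval()
```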
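The **Dataset** section lists the split sizes but not which configuration of the corpus was used or how the 20K/6K/6K samples were drawn. The sketch below loads the dataset with the `datasets` library; the `arxiv` config, the fixed seed, and the shuffled subsampling are assumptions, not details from the card.

```python
# Sketch: load armanc/scientific_papers and draw subsets of the sizes quoted in the card.
# The "arxiv" config and the subsampling strategy are assumptions.
from datasets import load_dataset

# trust_remote_code is needed on datasets versions that still run dataset loading scripts.
ds = load_dataset("armanc/scientific_papers", "arxiv", trust_remote_code=True)

train = ds["train"].shuffle(seed=42).select(range(20_000))            # 20K training samples
validation = ds["validation"].shuffle(seed=42).select(range(6_000))   # 6K validation samples
test = ds["test"].shuffle(seed=42).select(range(6_000))               # 6K test samples

print(train[0]["abstract"][:200])  # each record carries an "article" and its "abstract"
```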
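The **LoRA Configuration** section reports roughly 850K trainable parameters (on the order of 0.07% of the 1.24B base parameters), but the rank, scaling, and target modules are not visible in this view. The values below are illustrative assumptions; with `r=8` on the `q_proj`/`v_proj` projections of Llama-3.2-1B they land close to the quoted 850K.

```python
# Sketch of a LoRA setup of comparable size; r, lora_alpha, lora_dropout, and
# target_modules are assumed values, not the configuration used for this adapter.
from transformers import AutoModelForCausalLM
from peft import LoraConfig, TaskType, get_peft_model

base_model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-3.2-1B-Instruct")

lora_config = LoraConfig(
    task_type=TaskType.CAUSAL_LM,
    r=8,                                  # assumed rank
    lora_alpha=16,                        # assumed scaling factor
    lora_dropout=0.05,                    # assumed dropout
    target_modules=["q_proj", "v_proj"],  # assumed target projections
)

model = get_peft_model(base_model, lora_config)
model.print_trainable_parameters()  # prints trainable vs. total parameter counts
```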
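Finally, the evaluation described in the card (ROUGE-1/2/3/L on a 6K-sample test set, beam search with beam size 4) can be reproduced along the following lines. The prompt wording, generation length, and single-example scoring shown here are assumptions; the actual evaluation code lives in the linked gabe-zhang/paper2summary repository.

```python
# Sketch: summarize with beam search (num_beams=4) and score with ROUGE-1/2/3/L.
# Prompt format and max_new_tokens are assumptions, not settings taken from the card.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from rouge_score import rouge_scorer

base_id = "meta-llama/Llama-3.2-1B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(base_id)
model = AutoModelForCausalLM.from_pretrained(base_id, torch_dtype=torch.bfloat16)
# For the adapter's numbers, wrap the base model with PeftModel as in the loading sketch above.

def summarize(article: str, max_new_tokens: int = 256) -> str:
    messages = [{"role": "user", "content": f"Summarize the following paper:\n\n{article}"}]
    input_ids = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True, return_tensors="pt"
    )
    with torch.no_grad():
        output = model.generate(
            input_ids, num_beams=4, do_sample=False, max_new_tokens=max_new_tokens
        )
    return tokenizer.decode(output[0, input_ids.shape[1]:], skip_special_tokens=True)

# rouge_score supports arbitrary ROUGE-N, so ROUGE-3 can be reported alongside 1/2/L.
scorer = rouge_scorer.RougeScorer(["rouge1", "rouge2", "rouge3", "rougeL"], use_stemmer=True)
scores = scorer.score(target="reference abstract ...", prediction=summarize("paper text ..."))
print({name: round(s.fmeasure * 100, 2) for name, s in scores.items()})
```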