shreyasmeher/ConflLlama

Text Classification

shreyasmeher committed 17 days ago

Commit c92d20a · verified · 1 parent: 2f4249a

Update README.md

Files changed (1)

README.md +20 -12

@@ -40,9 +40,11 @@ inference:
 # ConflLlama: Domain-Specific LLM for Conflict Event Classification
-\<p align="center"\>
-\<img src="images/logo.png" alt="Project Logo" width="300"/\>
-\</p\>
 **ConflLlama** is a large language model fine-tuned to classify conflict events from text descriptions. This repository contains the GGUF quantized models (q4\_k\_m, q8\_0, and BF16) based on **Llama-3.1 8B**, which have been adapted for the specialized domain of political violence research.
@@ -93,9 +95,11 @@ The most significant improvements were observed in historically difficult-to-cla
       * Alpha (`lora_alpha`): 16
       * Target Modules: `q_proj`, `k_proj`, `v_proj`, `o_proj`, `gate_proj`, `up_proj`, `down_proj`
-\<p align="center"\>
-\<img src="images/model-arch.png" alt="Model Training Architecture" width="800"/\>
-\</p\>
 ### Training Data
@@ -103,9 +107,11 @@ The most significant improvements were observed in historically difficult-to-cla
   * **Time Period**: The training dataset consists of 171,514 events that occurred before January 1, 2017. The test set includes 38,192 events from 2017 onwards.
   * **Preprocessing**: The pipeline filters data by date, cleans text summaries, and combines primary, secondary, and tertiary attack types into a single multi-label field.
-\<p align="center"\>
-\<img src="images/preprocessing.png" alt="Data Preprocessing Pipeline" width="800"/\>
-\</p\>
 -----
@@ -134,9 +140,11 @@ This model is designed for academic and research purposes within the fields of p
 ## Training Logs
-\<p align="center"\>
-\<img src="images/training.png" alt="Training Logs" width="800"/\>
-\</p\>
 The training logs show a successful training run with healthy convergence patterns:

 # ConflLlama: Domain-Specific LLM for Conflict Event Classification
+<p align="center">
+  <img src="images/logo.png" alt="Project Logo" width="300"/>
+</p>
 **ConflLlama** is a large language model fine-tuned to classify conflict events from text descriptions. This repository contains the GGUF quantized models (q4\_k\_m, q8\_0, and BF16) based on **Llama-3.1 8B**, which have been adapted for the specialized domain of political violence research.
       * Alpha (`lora_alpha`): 16
       * Target Modules: `q_proj`, `k_proj`, `v_proj`, `o_proj`, `gate_proj`, `up_proj`, `down_proj`
+<p align="center">
+  <img src="images/model-arch.png" alt="Model Training Architecture" width="800"/>
+</p>
 ### Training Data
   * **Time Period**: The training dataset consists of 171,514 events that occurred before January 1, 2017. The test set includes 38,192 events from 2017 onwards.
   * **Preprocessing**: The pipeline filters data by date, cleans text summaries, and combines primary, secondary, and tertiary attack types into a single multi-label field.
+<p align="center">
+  <img src="images/preprocessing.png" alt="Data Preprocessing Pipeline" width="800"/>
+</p>
 -----
 ## Training Logs
+<p align="center">
+  <img src="images/training.png" alt="Training Logs" width="800"/>
+</p>
 The training logs show a successful training run with healthy convergence patterns: