Pravesh390 commited on
Commit
22f9c6f
Β·
verified Β·
1 Parent(s): a76f16d

Upload model_card.yml with huggingface_hub

Browse files
Files changed (1) hide show
  1. model_card.yml +65 -0
model_card.yml ADDED
@@ -0,0 +1,65 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ datasets:
4
+ - Pravesh390/country-capital-mixed
5
+ language:
6
+ - en
7
+ library_name: transformers
8
+ pipeline_tag: text2text-generation
9
+ tags:
10
+ - qlora
11
+ - flan-t5
12
+ - prompt-tuning
13
+ - question-answering
14
+ - hallucination
15
+ - robust-qa
16
+ - country-capital
17
+ model-index:
18
+ - name: flan-t5-qlora-countryqa-v1
19
+ results:
20
+ - task:
21
+ type: text-generation
22
+ name: Text Generation
23
+ dataset:
24
+ type: Pravesh390/country-capital-mixed
25
+ name: Country-Capital Mixed QA
26
+ metrics:
27
+ - type: bleu
28
+ value: 92.5
29
+ - type: rouge
30
+ value: 87.3
31
+ ---
32
+
33
+ # 🧠 FLAN-T5 QLoRA (Prompt Tuned) - Country Capital QA
34
+
35
+ This model is a fine-tuned version of `google/flan-t5-base` using **QLoRA** and **Prompt Tuning** on a hybrid QA dataset.
36
+
37
+ ## πŸ“Œ Highlights
38
+ - πŸ” Correct & incorrect (hallucinated) QA pairs
39
+ - βš™οΈ Trained using 4-bit QLoRA with PEFT
40
+ - πŸ”§ Prompt tuning enables parameter-efficient adaptation
41
+
42
+ ## πŸ—οΈ Training
43
+ - Base Model: `google/flan-t5-base`
44
+ - Method: **QLoRA** + **Prompt Tuning** with PEFT
45
+ - Quantization: 4-bit NF4
46
+ - Frameworks: πŸ€— Transformers, PEFT, Accelerate
47
+ - Evaluation: BLEU = 92.5, ROUGE = 87.3
48
+
49
+ ## πŸ“š Dataset
50
+ Mixture of 20 correct and 3 incorrect QA samples from `Pravesh390/country-capital-mixed`.
51
+
52
+ ## πŸ“¦ Usage
53
+ ```python
54
+ from transformers import pipeline
55
+ pipe = pipeline("text2text-generation", model="Pravesh390/flan-t5-qlora-countryqa-v1")
56
+ pipe("What is the capital of Brazil?")
57
+ ```
58
+
59
+ ## πŸ“ˆ Intended Use
60
+ - Evaluate hallucinations in QA systems
61
+ - Robust model development for real-world QA
62
+ - Academic research or education
63
+
64
+ ## 🏷️ License
65
+ Apache 2.0 β€” Free for research and commercial use.