8lianno commited on
Commit
ea717fd
·
verified ·
1 Parent(s): 2090192

Upload README.md

Browse files
Files changed (1) hide show
  1. README.md +178 -6
README.md CHANGED
@@ -1,8 +1,180 @@
1
- # Persian Poetry Generator Model
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
2
 
3
- Base Model: meta-llama/Llama-3.1-8B-Instruct
4
- Special Tokens: ["شروع_شعر", "پایان_شعر", "مصرع"]
5
- Training Date: 2024-12-05
6
 
7
- This model is fine-tuned for generating Persian poetry with proper rhythm and rhyme.
8
-
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - fa
4
+ tags:
5
+ - text-generation
6
+ - persian
7
+ - poetry
8
+ - peft
9
+ - quantization
10
+ - llama
11
+ base_model: meta-llama/Llama-3.1-8B-Instruct
12
+ finetuned_from: meta-llama/Llama-3.1-8B-Instruct
13
+ library_name: transformers
14
+ pipeline_tag: text-generation
15
+ trust_remote_code: true
16
+ special_tokens:
17
+ additional_special_tokens:
18
+ - "[شروع_شعر]"
19
+ - "[پایان_شعر]"
20
+ - "[مصرع]"
21
+ quantization:
22
+ load_in_4bit: true
23
+ bnb_4bit_quant_type: nf4
24
+ bnb_4bit_compute_dtype: float16
25
+ bnb_4bit_use_double_quant: true
26
+ ---
27
+ # Model Card for llama_poetry_fa
28
 
29
+ **Model Name:** llama_poetry_fa
30
+ **Model URL:** https://huggingface.co/8lianno/llama_poetry_fa
 
31
 
32
+ This model is trained on a Llama 3.1-based checkpoint, focusing on generating Persian poetry. It returns responses in Persian poetic form, incorporating suitable rhyme and rhythm based on the given prompt.
33
+
34
+ {{ model_summary | default("", true) }}
35
+
36
+ ## Model Details
37
+
38
+ ### Model Description
39
+
40
+ {{ model_description | default("", true) }}
41
+
42
+ - **Developed by:** {{ developers | default("[More Information Needed]", true)}}
43
+ - **Funded by [optional]:** {{ funded_by | default("[More Information Needed]", true)}}
44
+ - **Shared by [optional]:** {{ shared_by | default("[More Information Needed]", true)}}
45
+ - **Model type:** {{ model_type | default("Causal Language Model (Llama-based)", true)}}
46
+ - **Language(s) (NLP):** {{ language | default("fa", true)}}
47
+ - **License:** {{ license | default("[More Information Needed]", true)}}
48
+ - **Finetuned from model [optional]:** {{ base_model | default("meta-llama/Llama-3.1-8B-Instruct", true)}}
49
+
50
+ ### Model Sources [optional]
51
+
52
+ - **Repository:** {{ repo | default("https://huggingface.co/8lianno/llama_poetry_fa", true)}}
53
+ - **Paper [optional]:** {{ paper | default("[More Information Needed]", true)}}
54
+ - **Demo [optional]:** {{ demo | default("[More Information Needed]", true)}}
55
+
56
+ ## Uses
57
+
58
+ ### Direct Use
59
+
60
+ {{ direct_use | default("Intended for generating Persian poems in response to user queries.", true)}}
61
+
62
+ ### Downstream Use [optional]
63
+
64
+ {{ downstream_use | default("[More Information Needed]", true)}}
65
+
66
+ ### Out-of-Scope Use
67
+
68
+ {{ out_of_scope_use | default("Not suitable for factual QA tasks, non-Persian text generation, or high-stakes decision making.", true)}}
69
+
70
+ ## Bias, Risks, and Limitations
71
+
72
+ {{ bias_risks_limitations | default("The model may produce culturally specific or biased content and may not reflect all poetic traditions accurately.", true)}}
73
+
74
+ ### Recommendations
75
+
76
+ {{ bias_recommendations | default("Users should review generated poems critically for cultural biases or inappropriate content before use.", true)}}
77
+
78
+ ## How to Get Started with the Model
79
+
80
+ {{ get_started_code | default("See the included poetry_generator.py for an example of loading and generating poems.", true)}}
81
+
82
+ ## Training Details
83
+
84
+ ### Training Data
85
+
86
+ {{ training_data | default("The model was trained on a collection of Persian poetry and literature. More details needed.", true)}}
87
+
88
+ ### Training Procedure
89
+
90
+ #### Preprocessing [optional]
91
+
92
+ {{ preprocessing | default("[More Information Needed]", true)}}
93
+
94
+ #### Training Hyperparameters
95
+
96
+ - **Training regime:** {{ training_regime | default("Fine-tuned with LoRA and 4-bit quantization.", true)}}
97
+
98
+ #### Speeds, Sizes, Times [optional]
99
+
100
+ {{ speeds_sizes_times | default("[More Information Needed]", true)}}
101
+
102
+ ## Evaluation
103
+
104
+ ### Testing Data, Factors & Metrics
105
+
106
+ #### Testing Data
107
+
108
+ {{ testing_data | default("Internal Persian poetry test samples.", true)}}
109
+
110
+ #### Factors
111
+
112
+ {{ testing_factors | default("[More Information Needed]", true)}}
113
+
114
+ #### Metrics
115
+
116
+ {{ testing_metrics | default("[More Information Needed]", true)}}
117
+
118
+ ### Results
119
+
120
+ {{ results | default("[More Information Needed]", true)}}
121
+
122
+ #### Summary
123
+
124
+ {{ results_summary | default("", true) }}
125
+
126
+ ## Model Examination [optional]
127
+
128
+ {{ model_examination | default("[More Information Needed]", true)}}
129
+
130
+ ## Environmental Impact
131
+
132
+ - **Hardware Type:** {{ hardware_type | default("[More Information Needed]", true)}}
133
+ - **Hours used:** {{ hours_used | default("[More Information Needed]", true)}}
134
+ - **Cloud Provider:** {{ cloud_provider | default("[More Information Needed]", true)}}
135
+ - **Compute Region:** {{ cloud_region | default("[More Information Needed]", true)}}
136
+ - **Carbon Emitted:** {{ co2_emitted | default("[More Information Needed]", true)}}
137
+
138
+ ## Technical Specifications [optional]
139
+
140
+ ### Model Architecture and Objective
141
+
142
+ {{ model_specs | default("Llama-based causal language model fine-tuned for Persian poetry.", true)}}
143
+
144
+ ### Compute Infrastructure
145
+
146
+ {{ compute_infrastructure | default("[More Information Needed]", true)}}
147
+
148
+ #### Hardware
149
+
150
+ {{ hardware_requirements | default("[More Information Needed]", true)}}
151
+
152
+ #### Software
153
+
154
+ {{ software | default("[More Information Needed]", true)}}
155
+
156
+ ## Citation [optional]
157
+
158
+ **BibTeX:**
159
+
160
+ {{ citation_bibtex | default("[More Information Needed]", true)}}
161
+
162
+ **APA:**
163
+
164
+ {{ citation_apa | default("[More Information Needed]", true)}}
165
+
166
+ ## Glossary [optional]
167
+
168
+ {{ glossary | default("[More Information Needed]", true)}}
169
+
170
+ ## More Information [optional]
171
+
172
+ {{ more_information | default("[More Information Needed]", true)}}
173
+
174
+ ## Model Card Authors [optional]
175
+
176
+ {{ model_card_authors | default("[More Information Needed]", true)}}
177
+
178
+ ## Model Card Contact
179
+
180
+ {{ model_card_contact | default("[More Information Needed]", true)}}