AventIQ-AI
/

T5-News-Summarization

Safetensors

Model card Files Files and versions

xet

Community

ayushsinha commited on Aug 5

Commit

0cd3fe7

verified ·

1 Parent(s): 4b721b5

Create README.md

Browse files

Files changed (1) hide show

README.md +106 -0

README.md ADDED Viewed

	@@ -0,0 +1,106 @@

+# Text-to-Text Transfer Transformer Quantized Model for News Summarization
+This repository hosts a quantized version of the T5 model, fine-tuned specifically for text summarization of news. The model extracts concise summaries from semi-structured or unstructured news texts, making it ideal for POS systems, kitchen displays, and chat-based food order logging.
+## Model Details
+  - **Field:**	Description
+  - **Model Architecture** T5 (Text-to-Text Transfer Transformer)
+  - **Task**	Text Summarization for News
+  - **Input Format**	Free-form order text (includes Order ID, Customer, Items, etc.)
+  - **Quantization**	8-bit (int8) using bitsandbytes
+  - **Framework**	Hugging Face Transformers
+  - **Base Model**	t5-base
+  - **Dataset**   Custom
+## Usage
+## Installation
+```sh
+pip install transformers accelerate bitsandbytes torch
+```
+### Loading the Model
+```python
+from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
+import torch
+device = "cuda" if torch.cuda.is_available() else "cpu"
+model_name = "AventIQ-AI/T5-News-Summarization"
+tokenizer = AutoTokenizer.from_pretrained(model_name)
+model = AutoModelForSeq2SeqLM.from_pretrained(model_name, load_in_8bit=True, device_map="auto")
+def test_summarization(model, tokenizer):
+    user_text = input("\nEnter your News text:\n")
+    inputs = tokenizer("summarize: " + user_text, return_tensors="pt", truncation=True, max_length=512).to(model.device)
+    output = model.generate(
+        **inputs,
+        max_new_tokens=100,
+        num_beams=5,
+        length_penalty=0.8,
+        early_stopping=True
+    )
+    summary = tokenizer.decode(output[0], skip_special_tokens=True)
+    return summary
+print("\n📝 **Model Summary:**")
+print(test_summarization(model, tokenizer))
+```
+## ROUGE Evaluation Results
+After fine-tuning the **T5-Small** model for text summarization, we obtained the following **ROUGE** scores:
+| **Metric**  | **Score**  | **Meaning** |
+|-------------|-----------|-------------|
+| **ROUGE-1**  | **0.4125** (~41%)	| Overlap of **unigrams** between reference and summary. |
+| **ROUGE-2**  | **0.2167** (~22%)	| Overlap of **bigrams**, indicating fluency. |
+| **ROUGE-L**  | **0.3421** (~34%)	| Longest common subsequence matching structure. |
+| **ROUGE-Lsum**  | **0.3644** (~36%) | Sentence-level summarization effectiveness. |
+## Fine-Tuning Details
+### Dataset
+Custom-labeled food order dataset containing fields like Order ID, Customer, and Order Details. The model was trained to extract clean, natural summaries from noisy or inconsistent order formats.
+### Training
+  - Number of epochs: 3
+  - Batch size: 4
+  - Evaluation strategy: epoch
+  - Learning rate: 3e-5
+### Quantization
+Post-training 8-bit quantization using bitsandbytes library with Hugging Face integration. This reduced the model size and improved inference speed with negligible impact on summarization quality.
+## Repository Structure
+```
+.
+├── model/               # Contains the quantized model files
+├── tokenizer_config/    # Tokenizer configuration and vocabulary files
+├── model.safetensors/   # Quantized model weights
+├── README.md            # Model documentation
+```
+## Limitations
+ - The model may misinterpret or misformat input with excessive noise or missing key fields.
+ - Quantized versions may show slight accuracy loss compared to full-precision models.
+ - Best suited for English-language food order formats.
+## Contributing
+Contributions are welcome! If you have suggestions, feature requests, or improvements, feel free to open an issue or submit a pull request.