Comprehensive Performance Analysis: What This Model Is Actually Good At

#5
by Potheadxx - opened

I just completed a thorough performance analysis of this model and wanted to share findings that could help the community:

🔍 Key Findings

✅ Excellent Performance (4.9/5):

  • Creative writing and story continuation
  • Text completion in Bengali
  • Mixed Bengali-English content generation
  • Simple math operations

❌ Poor Performance (0-25% accuracy):

  • Factual question answering
  • Structured instruction following (Alpaca format)
  • Information retrieval tasks
  • RAG applications (despite the name)

📊 Testing Methodology

  • 7 different task categories tested
  • Multiple prompt formats (simple, structured, Alpaca-style)
  • Performance benchmarking on dual Tesla T4 GPUs
  • Real-world use case evaluation

💡 Bottom Line

This model is excellent for creative applications but not suitable for factual Q&A or information retrieval. The "RAG" in the name is misleading - it's better described as a creative writing assistant.

🎯 Use Cases

✅ RECOMMENDED FOR:

  • Creative writing assistance
  • Content brainstorming
  • Story/blog generation
  • Text completion
  • Language learning exercises

❌ NOT RECOMMENDED FOR:

  • Factual question answering
  • Educational Q&A systems
  • Information retrieval
  • Production chatbots

📊 Full Analysis

Complete no[tebook with benchmarks and methodology:
https://www.kaggle.com/code/risadmalik/bengali-llama-reality-check-hassanaliemon-bn-r-8b

Hope this helps others evaluate if this model fits their use case! Happy to answer any questions about the testing methodology.

Thanks a lot for the details test analysis on the repo, I really appreciate it.

Yes it has problem on RAG related things and the core reason is, I didn't get any opensource embedding model that is great at Bangla embedding. I had to train a one from scratch which was not feasible due to computation limitation

I hope you got my point and it is a great pleasure for me that some one took interest on it. If you have any further thoughts then feel free to reach me.

N.B kaggle is saying it can't find that page I mean this one https://www.kaggle.com/code/risadmalik/fork-of-bengali-llama-reality-check-hassan-b615d9

Hello ,

Thanks for your valuable time you gave to reply. I understand the RAG part, as im currently facing that issue actually.

here is the corrected link: https://www.kaggle.com/code/risadmalik/bengali-llama-reality-check-hassanaliemon-bn-r-8b

and thanks for the model, had fun playing with it

Sign up or log in to comment