AI & ML interests

retrieval augmented generation, grounded generation, large language models, LLMs, question answering, chatbot

Recent Activity

ofermend  published a Space about 21 hours ago
vectara/Supermicro-assistant
ofermend  updated a Space 1 day ago
vectara/cfpb-assistant
ofermend  updated a Space 1 day ago
vectara/Justice-Harvard
View all activity

vectara's activity

clefourrier 
posted an update 8 days ago
view post
Post
1735
Gemma3 family is out! Reading the tech report, and this section was really interesting to me from a methods/scientific fairness pov.

Instead of doing over-hyped comparisons, they clearly state that **results are reported in a setup which is advantageous to their models**.
(Which everybody does, but people usually don't say)

For a tech report, it makes a lot of sense to report model performance when used optimally!
On leaderboards on the other hand, comparison will be apples to apples, but in a potentially unoptimal way for a given model family (like some user interact sub-optimally with models)

Also contains a cool section (6) on training data memorization rate too! Important to see if your model will output the training data it has seen as such: always an issue for privacy/copyright/... but also very much for evaluation!

Because if your model knows its evals by heart, you're not testing for generalization.
ofermend 
posted an update 8 months ago
ofermend 
posted an update 11 months ago
view post
Post
1756
If you are a debate fan or did this as an extracurricular activity as a kid, you might have fun with this demo - debate bot. Debate against AI/RAG:

vectara/debate-bot
·