CompVis

university
Activity Feed

AI & ML interests

Generative approaches for visual synthesis, Invertible deep models for explainable AI, Deep metric and representation learning, self-supervised learning paradigms

Recent Activity

CompVis's activity

akhaliqΒ 
posted an update 30 days ago
view post
Post
5419
Google drops Gemini 2.0 Flash Thinking

a new experimental model that unlocks stronger reasoning capabilities and shows its thoughts. The model plans (with thoughts visible), can solve complex problems with Flash speeds, and more

now available in anychat, try it out: akhaliq/anychat
  • 1 reply
Β·
anton-lΒ 
posted an update 30 days ago
view post
Post
2235
Introducing πŸ“π…π’π§πžπŒπšπ­π‘: the best public math pre-training dataset with 50B+ tokens!
HuggingFaceTB/finemath

Math remains challenging for LLMs and by training on FineMath we see considerable gains over other math datasets, especially on GSM8K and MATH.

We build the dataset by:
πŸ› οΈ carefully extracting math data from Common Crawl;
πŸ”Ž iteratively filtering and recalling high quality math pages using a classifier trained on synthetic annotations to identify math reasoning and deduction.

We conducted a series of ablations comparing the performance of Llama-3.2-3B-Base after continued pre-training on FineMath and observe notable gains compared to the baseline model and other public math datasets.

We hope this helps advance the performance of LLMs on math and reasoning! πŸš€
We’re also releasing all the ablation models as well as the evaluation code.

HuggingFaceTB/finemath-6763fb8f71b6439b653482c2
kliyerΒ 
in CompVis/cleandift about 1 month ago

fix website link

1
#2 opened about 1 month ago by
qwertyforce
stefan-baumannΒ 
in CompVis/cleandift about 1 month ago
akhaliqΒ 
posted an update about 2 months ago
view post
Post
6530
QwQ-32B-Preview is now available in anychat

A reasoning model that is competitive with OpenAI o1-mini and o1-preview

try it out: akhaliq/anychat
  • 1 reply
Β·