Joe Rocca

rocca

AI & ML interests

None yet

Recent Activity

reacted to merve's post with 🤗 about 5 hours ago
So many open releases at Hugging Face past week 🤯 recapping all here ⤵️ https://huggingface.co/collections/merve/march-21-releases-67dbe10e185f199e656140ae

👀 Multimodal
> Mistral AI released a 24B vision LM, both base and instruction FT versions, sota 🔥 (OS)
> with IBM we released SmolDocling, a sota 256M document parser with Apache 2.0 license (OS)
> SpatialLM is a new vision LM that outputs 3D bounding boxes, comes with 0.5B (QwenVL based) and 1B (Llama based) variants
> SkyWork released SkyWork-R1V-38B, new vision reasoning model (OS)

💬 LLMs
> NVIDIA released new Nemotron models in 49B and 8B with their post-training dataset
> LG released EXAONE, new reasoning models in 2.4B, 7.8B and 32B
> Dataset: Glaive AI released a new reasoning dataset of 22M+ examples
> Dataset: NVIDIA released new helpfulness dataset HelpSteer3
> Dataset: OpenManusRL is a new agent dataset based on ReAct framework (OS)
> Open-R1 team released OlympicCoder, new competitive coder model in 7B and 32B
> Dataset: GeneralThought-430K is a new reasoning dataset (OS)

🖼️ Image Generation/Computer Vision
> Roboflow released RF-DETR, new real-time sota object detector (OS) 🔥
> YOLOE is a new real-time zero-shot object detector with text and visual prompts 🥹
> Stability AI released Stable Virtual Camera, a new novel view synthesis model
> Tencent released Hunyuan3D-2mini, new small and fast 3D asset generation model
> ByteDance released InfiniteYou, new realistic photo generation model
> StarVector is a new 8B model that generates svg from images
> FlexWorld is a new model that expands 3D views (OS)

🎤 Audio
> Sesame released CSM-1B new speech generation model (OS)

🤖 Robotics
> NVIDIA released GR00T, new robotics model for generalized reasoning and skills, along with the dataset

*OS ones have Apache 2.0 or MIT license
liked a model 2 days ago
zzzrw/DeepMesh
liked a model 3 days ago
Roblox/cube3d-v0.1

Organizations

DeepGHS

rocca's activity

New activity in lodestones/Chroma 4 days ago

Diffusers Roadmap?

2
#5 opened 7 days ago by
Impulse2000
New activity in ibm-ai-platform/llama-13b-accelerator 8 months ago

70B model?

5
#1 opened 10 months ago by
rocca
New activity in TheBloke/LLaMA2-13B-Tiefighter-GPTQ over 1 year ago

TGI "Fast Tokenizer" support?

#1 opened over 1 year ago by
rocca
New activity in tiiuae/falcon-40b-instruct almost 2 years ago

Fix "Finetuned from model" link

#26 opened almost 2 years ago by
rocca
New activity in rocca/lyra-v2-soundstream over 2 years ago

Conversion to ONNX

4
#1 opened over 2 years ago by
roseman
New activity in rocca/rwkv-4-pile-web over 2 years ago
New activity in rocca/openai-clip-js almost 3 years ago

This is really exciting

2
#1 opened almost 3 years ago by
victor