alkinun's picture

alkinun

AtAndDev

AI & ML interests

LLMs, Alignment, Merging, Unsloth, DPO, SFT, ORPO, SPIN..

Recent Activity

liked a model about 15 hours ago
XeroCodes/lynx-8b
reacted to merve's post with 🤗 1 day ago
So many open releases at Hugging Face past week 🤯 recapping all here ⤵️ https://huggingface.co/collections/merve/march-21-releases-67dbe10e185f199e656140ae 👀 Multimodal > Mistral AI released a 24B vision LM, both base and instruction FT versions, sota 🔥 (OS) > with IBM we released SmolDocling, a sota 256M document parser with Apache 2.0 license (OS) > SpatialLM is a new vision LM that outputs 3D bounding boxes, comes with 0.5B (QwenVL based) and 1B (Llama based) variants > SkyWork released SkyWork-R1V-38B, new vision reasoning model (OS) 💬 LLMs > NVIDIA released new Nemotron models in 49B and 8B with their post-training dataset > LG released EXAONE, new reasoning models in 2.4B, 7.8B and 32B > Dataset: Glaive AI released a new reasoning dataset of 22M+ examples > Dataset: NVIDIA released new helpfulness dataset HelpSteer3 > Dataset: OpenManusRL is a new agent dataset based on ReAct framework (OS) > Open-R1 team released OlympicCoder, new competitive coder model in 7B and 32B > Dataset: GeneralThought-430K is a new reasoning dataset (OS) 🖼️ Image Generation/Computer Vision > Roboflow released RF-DETR, new real-time sota object detector (OS) 🔥 > YOLOE is a new real-time zero-shot object detector with text and visual prompts 🥹 > Stability AI released Stable Virtual Camera, a new novel view synthesis model > Tencent released Hunyuan3D-2mini, new small and fast 3D asset generation model > ByteDance released InfiniteYou, new realistic photo generation model > StarVector is a new 8B model that generates svg from images > FlexWorld is a new model that expands 3D views (OS) 🎤 Audio > Sesame released CSM-1B new speech generation model (OS) 🤖 Robotics > NVIDIA released GR00T, new robotics model for generalized reasoning and skills, along with the dataset *OS ones have Apache 2.0 or MIT license
View all activity

Organizations

ESPnet's profile picture CVPR Demo Track's profile picture BigScience Biomedical Datasets's profile picture ONNXConfig for all's profile picture video-p2p-library's profile picture Gradio-Themes-Party's profile picture Gradio-Blocks-Party's profile picture scikit-learn's profile picture Open-Source AI Meetup's profile picture lora concepts library's profile picture OpenBuddy Community's profile picture ECCV 2022's profile picture Kornia AI's profile picture Tune a video concepts library's profile picture SIGGRAPH 2022's profile picture Interspeech2022's profile picture Stable Diffusion concepts library's profile picture SIGGRAPH Asia 2022 Demos's profile picture Stable Diffusion Dreambooth Concepts Library's profile picture Musika's profile picture Blog-explorers's profile picture OpenSky's profile picture ICCV2023's profile picture ICML2023's profile picture huggingPartyParis's profile picture Multi🤖Transformers's profile picture Team Tonic's profile picture That Time I got Reincarnated as a Hugging Face Organization's profile picture ZeroGPU Explorers's profile picture Pirates Party for all software open source's profile picture MLX Community's profile picture recipe research's profile picture Narra's profile picture Social Post Explorers's profile picture Cognitive Computations's profile picture M4-ai's profile picture Spinner-GPT-4's profile picture Dev Mode Explorers's profile picture Stable Diffusion Community (Unofficial, Non-profit)'s profile picture Hugging Face Discord Community's profile picture Nerdy Face's profile picture OpenEndedLM's profile picture open/ acc's profile picture Data Is Better Together Contributor's profile picture None yet's profile picture

AtAndDev's activity

New activity in reciperesearch/dolphin-sft-v0.1-preference 4 months ago

aaa

#2 opened 5 months ago by
wepe1
New activity in mattshumer/Reflection-Llama-3.1-70B 6 months ago
New activity in AtAndDev/ShortKing-3b-v0.2 12 months ago
New activity in open-llm-leaderboard/open_llm_leaderboard over 1 year ago

Benchmarks full names

2
#303 opened over 1 year ago by
AtAndDev
New activity in AtAndDev/ShortKing-3b-v0.2 over 1 year ago
New activity in cognitivecomputations/dolphin over 1 year ago

Number of samples?

1
#7 opened over 1 year ago by
PhilipMay
New activity in Chat-Error/wizard_alpaca_dolly_orca over 1 year ago

License / Creadits

2
#2 opened over 1 year ago by
AtAndDev
New activity in AtAndDev/ShortKing-1.4b-v0.1 over 1 year ago
New activity in vicgalle/alpaca-gpt4 over 1 year ago

License

1
#3 opened over 1 year ago by
AtAndDev
New activity in bigscience/bloom about 2 years ago

Code generation

1
#147 opened over 2 years ago by
celestialme
New activity in bigscience/test-bloomd-6b3 over 2 years ago

Create README.md

#1 opened over 2 years ago by
AtAndDev

Create README.md

#1 opened over 2 years ago by
AtAndDev

Create README.md

#1 opened over 2 years ago by
AtAndDev