cfahlgren1 (Caleb Fahlgren)

reactedto paasthaamz's post with 👍 about 1 month ago

Post

3491

test

3 replies

·

reactedto AdinaY's post with 🔥 about 2 months ago

Post

1486

DeepSeek R1 dropped one year ago 🐳 and a lot has changed.

With @irenesolaiman , we’re launching a blog series about how that moment reshaped AI + open source in 2025, starting with strategic shifts and the explosion of new open models in China!

https://huggingface.co/blog/huggingface/one-year-since-the-deepseek-moment

reactedto Wauplin's post with 🔥 8 months ago

Post

3464

Say hello to hf: a faster, friendlier Hugging Face CLI ✨

We are glad to announce a long-awaited quality-of-life improvement: the Hugging Face CLI has been officially renamed from huggingface-cli to hf!

So... why this change?

Typing huggingface-cli constantly gets old fast. More importantly, the CLI’s command structure became messy as new features were added over time (upload, download, cache management, repo management, etc.). Renaming the CLI is a chance to reorganize commands into a clearer, more consistent format.

We decided not to reinvent the wheel and instead follow a well-known CLI pattern: hf <resource> <action>. Isn't hf auth login easier to type and remember?

The full rationale, implementation details, and migration notes are in the blog post: https://huggingface.co/blog/hf-cli

6 replies

·

reactedto AdinaY's post with 🔥 8 months ago

Post

3442

Qwen3-Coder 💻 agentic code model by Alibaba Qwen team🚀

Qwen/Qwen3-Coder-480B-A35B-Instruct

✨ 480B total, 35B activated MoE
✨ Agentic Coding + Browser Use → Top code model performance
✨ 256K context (up to 1M via Yarn) for repo-scale understanding

2 replies

·

reactedto severo's post with 🔥 8 months ago

Post

2382

Today, three unrelated Slacks popped up at the same time with enthusiastic messages about the new Qwen model.

And all of them mentioned @simonw 's post!

#TopInfluencer

posted an update 9 months ago

Post

1040

I ran the Anthropic Misalignment Framework for a few top models and added it to a dataset: cfahlgren1/anthropic-agentic-misalignment-results

You can read the reasoning traces of the models trying to blackmail the user and perform other actions. It's very interesting!!

posted an update 10 months ago

Post

419

Really nice to see AllenAI drop the Reward-Bench-2 dataset and leaderboard from their new paper all on the hub! 👏

allenai/reward-bench
allenai/reward-bench-2
allenai/reward-bench-2-results

Great work @natolambert , allenai and others!! 🤗

posted an update 10 months ago

Post

1739

Yesterday, we dropped a new conversational viewer for datasets on the hub! 💬

Actually being able to view and inspect your data is extremely important. This is a big step in making data more accessible and actionable for everyone.

Here's some datasets you can try it out on:
• mlabonne/FineTome-100k
• Salesforce/APIGen-MT-5k
• open-thoughts/OpenThoughts2-1M
• allenai/tulu-3-sft-mixture

Any other good ones?

1 reply

·

repliedto severo's post 11 months ago

Awesome! How did you find it? SEO?

It's been a long while since i've updated anything there, so hopefully most things continue to work 🤣🤞. My journey to HF came from there though from what feels like long ago https://x.com/calebfahlgren/status/1785827324283281572

repliedto OFT's post 12 months ago

It depends on the compute time on the GPU which would show why some calls may be more expensive than others. For example, Flux will be more expensive if it takes longer than another request.

Source: https://huggingface.co/docs/inference-providers/en/pricing#hf-inference-cost

reactedto OFT's post with 👀 12 months ago

Post

2164

HF's new system makes me feel like they are not transparent on their pricing and making me feel they are not trustworthy.

6 replies

·

repliedto OFT's post 12 months ago

Curious what is making it confusing?

reactedto onekq's post with 🚀 about 1 year ago

Post

2352

Introducing 🎉 OneSQL-v0.1🥳, our first text-to-SQL model based on Qwen2.5-Coder. This model has achieved an EX score of 63.33 on the BIRD leaderboard (https://bird-bench.github.io/).

The model family includes 7B and 32B,
onekq-ai/onesql-v01-qwen-67d8e3eb1611c5532bb90c5f
and can be also found on Ollama (https://ollama.com/onekq/OneSQL-v0.1-Qwen)

My goal is to make OneSQL the most usable open-weights model for text-to-SQL. I'm currently working on best practices to help users use this model the right away and avoid pitfalls. After that, I plan to train the next version to push for a higher EX score.

Enjoy this model and feel free to share comments/questions 🤗

2 replies

·

reactedto jasoncorkill's post with 👍 about 1 year ago

Post

2869

Integrating human feedback is vital for evolving AI models. Boost quality, scalability, and cost-effectiveness with our crowdsourcing tool!

..Or run A/B tests and gather thousands of responses in minutes. Upload two images, ask a question, and watch the insights roll in!

Check it out here and let us know your feedback: https://app.rapidata.ai/compare

reactedto vikhyatk's post with 🔥 about 1 year ago

Post

6222

🚨 New VQA + captioning dataset! moondream/megalith-mdqa

Images from Megalith, captioned using Moondream, then transformed to short-form QA.

9M+ images, 6-10 QA pairs per image.

posted an update about 1 year ago

Post

2356

If you haven't seen yet, we just released Inference Providers 🔀

> 4 new serverless inference providers on the Hub 🤯
> Use your HF API key or personal key with all providers 🔑
> Chat with Deepseek R1, V3, and more on HF Hub 🐋
> We support Sambanova, TogetherAI, Replicate, and Fal.ai 💪

Best of all, we don't charge any markup on top of the provider 🫰 Have you tried it out yet? HF Pro accounts get $2 of free usage for the provider inference.

repliedto their post about 1 year ago

Hey @Sayan01

If you click Save Query then you'll see Download as an option in the menu

reactedto hexgrad's post with 🔥 about 1 year ago

Post

22138

📣 Looking for labeled, high-quality synthetic audio/TTS data 📣 Have you been or are you currently calling API endpoints from OpenAI, ElevenLabs, etc? Do you have labeled audio data sitting around gathering dust? Let's talk! Join https://discord.gg/QuGxSWBfQy or comment down below.

If your data exceeds quantity & quality thresholds and is approved into the next hexgrad/Kokoro-82M training mix, and you permissively DM me the data under an effective Apache license, then I will DM back the corresponding voicepacks for YOUR data if/when the next Apache-licensed Kokoro base model drops.

What does this mean? If you've been calling closed-source TTS or audio API endpoints to:
- Build voice agents
- Make long-form audio, like audiobooks or podcasts
- Handle customer support, etc
Then YOU can contribute to the training mix and get useful artifacts in return. ❤️

More details at hexgrad/Kokoro-82M#21

25 replies

·

reactedto merve's post with ❤️ about 1 year ago

Post

3746

What a beginning to this year in open ML 🤠
Let's unwrap! merve/jan-10-releases-677fe34177759de0edfc9714

Multimodal 🖼️
> ByteDance released SA2VA: a family of vision LMs that can take image, video, text and visual prompts
> moondream2 is out with new capabilities like outputting structured data and gaze detection!
> Dataset: Alibaba DAMO lab released multimodal textbook — 22k hours worth of samples from instruction videos 🤯
> Dataset: SciCap captioning on scientific documents benchmark dataset is released along with the challenge!

LLMs 💬
> Microsoft released Phi-4, sota open-source 14B language model 🔥
> Dolphin is back with Dolphin 3.0 Llama 3.1 8B 🐬🐬
> Prime-RL released Eurus-2-7B-PRIME a new language model trained using PRIME alignment
> SmallThinker-3B is a new small reasoning LM based on Owen2.5-3B-Instruct 💭
> Dataset: QWQ-LONGCOT-500K is the dataset used to train SmallThinker, generated using QwQ-32B-preview 📕
> Dataset: @cfahlgren1 released React Code Instructions: a dataset of code instruction-code pairs 📕
> Dataset: Qwen team is on the roll, they just released CodeElo, a dataset of code preferences 👩🏻‍💻

Embeddings 🔖
> @MoritzLaurer released zero-shot version of ModernBERT large 👏
> KaLM is a new family of performant multilingual embedding models with MIT license built using Qwen2-0.5B

Image/Video Generation ⏯️
> NVIDIA released Cosmos, a new family of diffusion/autoregressive World Foundation Models generating worlds from images, videos and texts 🔥
> Adobe released TransPixar: a new text-to-video model that can generate assets with transparent backgrounds (a first!)
> Dataset: fal released cosmos-openvid-1m Cosmos-tokenized OpenVid-1M with samples from OpenVid-1M

Others
> Prior Labs released TabPFNv2, the best tabular transformer is out for classification and regression
> Metagene-1 is a new RNA language model that can be used for pathogen detection, zero-shot embedding and genome understanding

posted an update about 1 year ago

Post

1793

Wow, I just added Langfuse tracing to the Deepseek Artifacts app and it's really nice 🔥

It allows me to visualize and track more things along with the cfahlgren1/react-code-instructions dataset.

It was just added as a one click Docker Space template, so it's super easy to self host 💪

Caleb Fahlgren PRO

AI & ML interests

Recent Activity

Organizations

Caleb Fahlgren PRO

AI & ML interests

Recent Activity

Organizations

cfahlgren1's activity