598 1018 3695

Victor Mustar PRO

victor

victormustar

AI & ML interests

Building the UX of this website

Recent Activity

upvoted a changelog about 11 hours ago

Static Spaces can now have a build step

upvoted an article about 12 hours ago

Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H

reacted to ginipick's post with 🔥 about 12 hours ago

🎨 FLUX VIDEO Generation - All-in-One AI Image/Video/Audio Generator 🚀 Introduction FLUX VIDEO Generation is an all-in-one AI creative tool that generates images, videos, and audio from text prompts, powered by NVIDIA H100 GPU for lightning-fast processing! https://huggingface.co/spaces/ginigen/Flux-VIDEO ✨ Key Features 1️⃣ Text → Image → Video 🖼️➡️🎬 Generate high-quality images from Korean/English prompts Transform still images into natural motion videos Multiple size presets (Instagram, YouTube, Facebook, etc.) Demo: 1-4 seconds / Full version: up to 60 seconds 2️⃣ Image Aspect Ratio Change 🎭 Freely adjust image aspect ratios Expand images with outpainting technology 5 alignment options (Center, Left, Right, Top, Bottom) Real-time preview functionality 3️⃣ Video + Audio Generation 🎵 Add AI-generated audio to videos Korean prompt support (auto-translation) Context-aware sound generation Powered by MMAudio technology 🛠️ Tech Stack Image Generation: FLUX, Stable Diffusion XL Video Generation: TeaCache optimization Audio Generation: MMAudio (44kHz high-quality) Outpainting: ControlNet Union Infrastructure: NVIDIA H100 GPU for ultra-fast generation 💡 How to Use Select your desired tab Enter your prompt (Korean/English supported!) Adjust settings Click generate button 🎯 Use Cases 📱 Social media content creation 🎥 YouTube Shorts/Reels 📊 Presentation materials 🎨 Creative artwork 🎵 Background sound generation

View all activity

Organizations

victor's activity

upvoted a changelog about 11 hours ago

Changelog

Static Spaces can now have a build step

12 days ago

• 82

upvoted an article about 12 hours ago

Article

Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H

•

about 15 hours ago

• 40

reacted to ginipick's post with 🔥 about 12 hours ago

Post

3912

🎨 FLUX VIDEO Generation - All-in-One AI Image/Video/Audio Generator

🚀 Introduction
FLUX VIDEO Generation is an all-in-one AI creative tool that generates images, videos, and audio from text prompts, powered by NVIDIA H100 GPU for lightning-fast processing!

ginigen/Flux-VIDEO

✨ Key Features
1️⃣ Text → Image → Video 🖼️➡️🎬

Generate high-quality images from Korean/English prompts
Transform still images into natural motion videos
Multiple size presets (Instagram, YouTube, Facebook, etc.)
Demo: 1-4 seconds / Full version: up to 60 seconds

2️⃣ Image Aspect Ratio Change 🎭

Freely adjust image aspect ratios
Expand images with outpainting technology
5 alignment options (Center, Left, Right, Top, Bottom)
Real-time preview functionality

3️⃣ Video + Audio Generation 🎵

Add AI-generated audio to videos
Korean prompt support (auto-translation)
Context-aware sound generation
Powered by MMAudio technology

🛠️ Tech Stack

Image Generation: FLUX, Stable Diffusion XL
Video Generation: TeaCache optimization
Audio Generation: MMAudio (44kHz high-quality)
Outpainting: ControlNet Union
Infrastructure: NVIDIA H100 GPU for ultra-fast generation

💡 How to Use

Select your desired tab
Enter your prompt (Korean/English supported!)
Adjust settings
Click generate button

🎯 Use Cases

📱 Social media content creation
🎥 YouTube Shorts/Reels
📊 Presentation materials
🎨 Creative artwork
🎵 Background sound generation

1 reply

liked a model about 20 hours ago

lllyasviel/FramePack_F1_I2V_HY_20250503

Updated May 3 • 101k • 31

liked a Space about 21 hours ago

Rag Mcp Server

🏢

This space defines a RAG MCP server

reacted to frascuchon's post with 👍 about 21 hours ago

Post

2550

Hey! I built RAG MCP Server Space, a simple Gradio MCP server for RAG systems that allows you to search relevant results without passing huge contexts to your LLM.

You can use this space to integrate with your agents and improve the efficiency of your search results. Feel free to try it out and let me know if you have any feedback or questions!

frascuchon/rag-mcp-server

Thanks for checking it out!

reacted to prithivMLmods's post with 👍 about 21 hours ago

Post

4491

OpenAI, Google, Hugging Face, and Anthropic have released guides and courses on building agents, prompting techniques, scaling AI use cases, and more. Below are 10+ minimalistic guides and courses that may help you in your progress. 📖

⤷ Agents Companion : https://www.kaggle.com/whitepaper-agent-companion
⤷ Building Effective Agents : https://www.anthropic.com/engineering/building-effective-agents
⤷ Guide to building agents by OpenAI : https://cdn.openai.com/business-guides-and-resources/a-practical-guide-to-building-agents.pdf
⤷ Prompt engineering by Google : https://www.kaggle.com/whitepaper-prompt-engineering
⤷ Google: 601 real-world gen AI use cases : https://cloud.google.com/transform/101-real-world-generative-ai-use-cases-from-industry-leaders
⤷ Prompt engineering by IBM : https://www.ibm.com/think/topics/prompt-engineering-guide
⤷ Prompt Engineering by Anthropic : https://docs.anthropic.com/en/docs/build-with-claude/prompt-engineering/overview
⤷ Scaling AI use cases : https://cdn.openai.com/business-guides-and-resources/identifying-and-scaling-ai-use-cases.pdf
⤷ Prompting Guide 101 : https://services.google.com/fh/files/misc/gemini-for-google-workspace-prompting-guide-101.pdf
⤷ AI in the Enterprise by OpenAI : https://cdn.openai.com/business-guides-and-resources/ai-in-the-enterprise.pdf

by HF🤗 :
⤷ AI Agents Course by Huggingface : https://huggingface.co/learn/agents-course/unit0/introduction
⤷ Smol-agents Docs : https://huggingface.co/docs/smolagents/en/tutorials/building_good_agents
⤷ MCP Course by Huggingface : https://huggingface.co/learn/mcp-course/unit0/introduction
⤷ Other Course (LLM, Computer Vision, Deep RL, Audio, Diffusion, Cookbooks, etc..) : https://huggingface.co/learn

2 replies

liked a model about 21 hours ago

nvidia/Nemotron-Research-Reasoning-Qwen-1.5B

Updated 1 day ago • 280 • 79

upvoted 2 papers about 21 hours ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published 1 day ago • 102

R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing

Paper • 2505.21600 • Published 7 days ago • 67

upvoted a paper 1 day ago

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published 4 days ago • 105

updated a Space 1 day ago

brutal-design

🐳

published a Space 1 day ago

brutal-design

🐳

liked a dataset 1 day ago

dleemiller/irish_penny_journal

Viewer • Updated 4 days ago • 4.09k • 108 • 2

upvoted an article 1 day ago

Article

System Prompt Learning: Teaching LLMs to Learn Problem-Solving Strategies from Experience

•

2 days ago

• 9

reacted to codelion's post with 🚀 1 day ago

Post

2874

🧠 We just implemented Andrej Karpathy's "third paradigm" for LLM learning!

System Prompt Learning (SPL) enables LLMs to automatically learn problem-solving strategies from experience, rather than relying on static prompts.

🚀 How it works:
Your LLM builds a database of effective strategies, selects the best ones for each problem, and refines them over time based on success rates.

📊 Results across math benchmarks:
Arena Hard: 29% → 37.6% (+8.6%)
AIME24: 23.33% → 30% (+6.67%)
OptILLMBench: 61% → 65% (+4%)

The best part? All strategies are human-readable and the system gets progressively better at problem types you use frequently.

✨ Key benefits:
🔄 Cumulative learning over time
📖 Transparent, inspectable strategies
🔌 Works with any OpenAI-compatible API
⚡ Simple integration: just add "spl-" prefix to your model

Built as an open-source plugin in optillm. After 500 queries, our system developed 129 strategies and refined 97 of them!

This feels like a genuine step toward AI that learns from experience while staying completely interpretable.

🔗 GitHub: https://github.com/codelion/optillm/tree/main/optillm/plugins/spl
📖 Full article: https://huggingface.co/blog/codelion/system-prompt-learning
🐦 Original Karpathy tweet: https://x.com/karpathy/status/1921368644069765486

Have you experimented with advanced system prompting? What strategies would you want your LLM to learn?

liked a model 1 day ago

dleemiller/Penny-1.7B

Text Generation • Updated 1 day ago • 125 • 20

reacted to Kseniase's post with 🚀 1 day ago

Post

1649

13 Awesome MCP Servers

MCP changed how agents connect with tools.

After writing the most read explanation of MCP on Hugging Face (https://huggingface.co/blog/Kseniase/mcp), we chose this 13 awesome MCP servers that you can work with:

1. Agentset MCP -> https://github.com/agentset-ai/mcp-server
For efficient and quick building of intelligent, doc-based apps using open-source Agentset platform for RAG

2. GitHub MCP Server -> https://github.com/github/github-mcp-server
Integrates GitHub APIs into your workflow, allowing to build AI tools and apps that interact with GitHub's ecosystem

3. arXiv MCP -> https://github.com/andybrandt/mcp-simple-arxiv
Allows working with research papers on arXiv through effective search and access to their metadata, abstracts, and links

4. MCP Run Python -> https://github.com/pydantic/pydantic-ai/tree/main/mcp-run-python
Enables to run Python code in a sandbox via Pyodide in Deno, so it can be isolated from the rest of the operating system

5. Safe Local Python Executor -> https://github.com/maxim-saplin/mcp_safe_local_python_executor
A lightweight tool for running LLM-generated Python code locally, using Hugging Face’s LocalPythonExecutor (from smolagents framework) and exposing it via MCP for AI assistant integration

6. Cursor MCP Installer -> https://github.com/matthewdcage/cursor-mcp-installer
Allows to automatically add MCP servers to Cursor for development convenience

7. Basic Memory -> https://memory.basicmachines.co/docs/introduction
This knowledge management system connects to LLMs and lets you build a persistent semantic graph from AI conversations with AI agents

Read further in the comments 👇

If you like it, also subscribe to the Turing Post: https://www.turingpost.com/subscribe

1 reply

liked a Space 1 day ago

PlayDiffusion

🎨

Generate audio modifications and speech

reacted to clem's post with 🔥 1 day ago

Post

4963

Today, we're unveiling two new open-source AI robots! HopeJR for $3,000 & Reachy Mini for $300 🤖🤖🤖

Let's go open-source AI robotics!

5 replies