Nielly

Nielly
Β·

AI & ML interests

None yet

Recent Activity

Organizations

None yet

Nielly's activity

upvoted 4 articles 1 day ago
view article
Article

Open-R1: a fully open reproduction of DeepSeek-R1

β€’ 649
view article
Article

Our Transformers Code Agent beats the GAIA benchmark!

β€’ 59
view article
Article

Open-source DeepResearch – Freeing our search agents

β€’ 658
reacted to Jaward's post with πŸ”₯ 1 day ago
view post
Post
2896
ByteDance drops OmniHumanπŸ”₯
This is peak SOTA performance - flawless natural gestures with perfect lip sync and facial expressions. This is the second time they've released SOTA level talking-heads only this time with hands and body motion.
Project: https://omnihuman-lab.github.io/
  • 2 replies
Β·
reacted to openfree's post with πŸ”₯ 7 days ago
view post
Post
6313
πŸ“š Multilingual RAG Chatbot with PDF Support

Chat naturally with your documents! 🌟

✨ Key Features:
β€’ 🌏 Multilingual Q&A support (English, Korean, etc.)
β€’ πŸ“„ Real-time PDF and text file processing
β€’ πŸ” Context-aware accurate responses
β€’ ⚑ Intuitive Chainlit-powered chat interface

πŸ› οΈ Tech Stack:
β€’ πŸ’» Clean, documented open-source code
β€’ 🀝 User-friendly Chainlit UI
β€’ πŸ“Š Vector database for efficient retrieval
β€’ πŸ”„ Real-time streaming responses

πŸ“± Try it now!
β†’ Demo: openfree/PDF-RAG

πŸ”§ Special Features:
β€’ πŸ“Š Support for PDF/text files up to 2MB
β€’ 🎯 Precise context understanding
β€’ ⚑ Fast response time
β€’ πŸ”’ Secure file handling

Full source code available - ready to integrate into your projects!

#RAG #NLP #Chatbot #OpenSource #PDFProcessing
reacted to victor's post with πŸš€ 9 days ago
view post
Post
2949
Finally, an open-source AI that turns your lyrics into full songs is hereβ€”meet YuE! Unlike other tools that only create short clips, YuE can make entire songs (up to 5 minutes) with vocals, melody, and instruments all working together. Letsss go!

m-a-p/YuE-s1-7B-anneal-en-cot
replied to dylanebert's post 9 days ago
view reply

I don’t really think it’s a side project.

reacted to nicolay-r's post with πŸ‘€ 10 days ago
view post
Post
1760
πŸ“’ For those who wish to apply DeepSeek-R1 for handling tabular / streaming data using schema of prompts (CoT), the OpenRouter AI hosts API for accessing:
https://openrouter.ai/deepseek/deepseek-r1

The no-string option to quick start with using DeepSeek-R1 includes three steps:
βœ… OpenRouter provider: https://github.com/nicolay-r/nlp-thirdgate/blob/master/llm/open_router.py
βœ… Bulk-chain for infering data: https://github.com/nicolay-r/bulk-chain
βœ… Json Schema for Chain-of-Though reasoning (see screenshot πŸ“· below)

πŸ“Ί below is a screenshot of how to quick start the demo, in which you can test your schema for LLM responses. It would ask to type all the parameters first for completing the requests (which is text within this example).

πŸ“ƒ To apply it for JSONL/CSV data, you can use --src shell parameter for passing the related file

⏳ As for time, OpenRouter finds me relatively slow with 30~40 seconds per request

Models:
deepseek-ai/DeepSeek-R1
  • 1 reply
Β·
reacted to lewtun's post with πŸ”₯ 10 days ago
view post
Post
9886
We are reproducing the full DeepSeek R1 data and training pipeline so everybody can use their recipe. Instead of doing it in secret we can do it together in the open!

πŸ§ͺ Step 1: replicate the R1-Distill models by distilling a high-quality reasoning corpus from DeepSeek-R1.

🧠 Step 2: replicate the pure RL pipeline that DeepSeek used to create R1-Zero. This will involve curating new, large-scale datasets for math, reasoning, and code.

πŸ”₯ Step 3: show we can go from base model -> SFT -> RL via multi-stage training.

Follow along: https://github.com/huggingface/open-r1
Β·