This is available today, in the open-source version of phospho. Still is 100% compatible with LeRobot.
The LeRobot dataset by HuggingFace and Remi Cadene is becoming a standard to create robotics datasets. But working with it can rapidly become a nightmare:
- you can't delete a faulty episode. Failed a demo? Finito. - you can't merge datasets - you can't split datasets
So we fixed it.
Now, in the dashboard or in Python, using phospho you can: - repair corrupted LeRobot datasets - delete episodes from a dataset - merge datasets - split datasets
How does it work ? - You give an URL - The AI assistant crawls the website content and embed it - Add it to your frontend in one line of code - People on your website can ask the assistant questions
📈 Increase the quality of your RAG with a simple Linear Layer! No need to change your embedding model (keep that old OpenAI API).
Introducing EmbeddingAlign RAG, a novel approach to improve Retrieval-Augmented Generation (RAG) systems.
Key highlights: - Uses a simple linear transformation on existing embeddings - Boosts hit rate from 89% to 95% on real-world examples - Minor increase on latency (less than 10ms) - Works on top of blackbox embedding models (Mistral AI, OpenAI, Cohere,...) - No dataset needed (just your documents) - Train easily on CPU