Towards Best Practices for Open Datasets for LLM Training Paper • 2501.08365 • Published 4 days ago • 40
Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks Paper • 2501.08326 • Published 4 days ago • 30
view post Post 5416 Google drops Gemini 2.0 Flash Thinkinga new experimental model that unlocks stronger reasoning capabilities and shows its thoughts. The model plans (with thoughts visible), can solve complex problems with Flash speeds, and morenow available in anychat, try it out: akhaliq/anychat See translation 1 reply · 🚀 6 6 🔥 4 4 👀 1 1 + Reply
view post Post 6527 QwQ-32B-Preview is now available in anychatA reasoning model that is competitive with OpenAI o1-mini and o1-previewtry it out: akhaliq/anychat See translation 1 reply · ❤️ 3 3 👀 2 2 + Reply
view post Post 3823 New model drop in anychatallenai/Llama-3.1-Tulu-3-8B is now availabletry it here: akhaliq/anychat See translation 🔥 4 4 👍 1 1 + Reply
view post Post 2792 anychatsupports chatgpt, gemini, perplexity, claude, meta llama, grok all in one apptry it out there: akhaliq/anychat ❤️ 7 7 🚀 3 3 🔥 2 2 + Reply
Diffusion Hyperfeatures: Searching Through Time and Space for Semantic Correspondence Paper • 2305.14334 • Published May 23, 2023 • 1
Readout Guidance: Learning Control from Diffusion Features Paper • 2312.02150 • Published Dec 4, 2023 • 3
A Benchmark of Domain-Adapted Large Language Models for Generating Brief Hospital Course Summaries Paper • 2403.05720 • Published Mar 8, 2024
GREEN: Generative Radiology Report Evaluation and Error Notation Paper • 2405.03595 • Published May 6, 2024
LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models Paper • 2311.18232 • Published Nov 30, 2023 • 1
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters Paper • 2408.03314 • Published Aug 6, 2024 • 54
view post Post 1300 Llama 3.1 405B Instruct beats GPT-4o on MixEval-HardJust ran MixEval for 405B, Sonnet-3.5 and 4o, with 405B landing right between the other two at 66.19The GPT-4o result of 64.7 replicated locally but Sonnet-3.5 actually scored 70.25/69.45 in my replications 🤔 Still well ahead of the other 2 though.Sammple of 1 of the eval calls here: https://wandb.ai/morgan/MixEval/weave/calls/07b05ae2-2ef5-4525-98a6-c59963b76fe1Quick auto-logging tracing for openai-compatible clients and many more here: https://wandb.github.io/weave/quickstart/ 👍 3 3 🔥 1 1 + Reply
Multimodal datasets: misogyny, pornography, and malignant stereotypes Paper • 2110.01963 • Published Oct 5, 2021
AutoGRAMS: Autonomous Graphical Agent Modeling Software Paper • 2407.10049 • Published Jul 14, 2024 • 1
view post Post 21592 New feature 🔥 Image models and LoRAs now have little previews 🤏If you don't know where to start to find them, I invite you to browse cool LoRAs in the profile of some amazing fine-tuners: @artificialguybr , @alvdansen , @DoctorDiffusion , @e-n-v-y , @KappaNeuro @ostris 2 replies · ❤️ 12 12 + Reply