Aurora-M/MDEL

community

https://aurora-lm.github.io/posts/about-us/

Activity Feed Request to join this org

AI & ML interests

Formerly, MDEL, we have renamed ourselves after the model we deployed, Aurora-M. Visit us here: https://huggingface.co/aurora-m

Recent Activity

huu-ontocord updated a Space 4 days ago

Multi-Domain-Expert-Learning/README

Ziyang authored a paper 6 days ago

ScratchEval: Are GPT-4o Smarter than My Child? Evaluating Large Multimodal Models with Visual Programming Challenges

gpucce authored a paper 6 days ago

Optimizing LLMs for Italian: Reducing Token Fertility and Enhancing Efficiency Through Vocabulary Adaptation

View all activity

Multi-Domain-Expert-Learning's activity

huu-ontocord

updated a Space 4 days ago

README

Ziyang

authored a paper 6 days ago

ScratchEval: Are GPT-4o Smarter than My Child? Evaluating Large Multimodal Models with Visual Programming Challenges

Paper • 2411.18932 • Published Nov 28, 2024 • 1

gpucce

authored a paper 6 days ago

Optimizing LLMs for Italian: Reducing Token Fertility and Enhancing Efficiency Through Vocabulary Adaptation

Paper • 2504.17025 • Published 12 days ago • 16

xu3kev

authored a paper 21 days ago

M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models

Paper • 2504.10449 • Published 21 days ago • 11

xzyao

authored a paper 25 days ago

DataPerf: Benchmarks for Data-Centric AI Development

Paper • 2207.10062 • Published Jul 20, 2022

ajibawa-2023

posted an update 26 days ago

Post

3963

Hi All, I recently released two Audio datasets which are generated using my earlier released dataset: ajibawa-2023/Children-Stories-Collection

First Audio Dataset:https://huggingface.co/datasets/ajibawa-2023/Audio-Children-Stories-Collection-Large has 5600++ stories in .mp3 format.

Second Audio Dataset:https://huggingface.co/datasets/ajibawa-2023/Audio-Children-Stories-Collection has 600 stories in .mp3 format.

3 replies

·

Taishi-N324

authored a paper about 1 month ago

Building Instruction-Tuning Datasets from Human-Written Instructions with Open-Weight Large Language Models

Paper • 2503.23714 • Published Mar 31

HoangHa

authored a paper about 2 months ago

Pensez: Less Data, Better Reasoning -- Rethinking French LLM

Paper • 2503.13661 • Published Mar 17 • 5

shannons

authored a paper about 2 months ago

SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models

Paper • 2502.09604 • Published Feb 13 • 36

Taishi-N324

authored 2 papers about 2 months ago

Balancing Speed and Stability: The Trade-offs of FP8 vs. BF16 Training in LLMs

Paper • 2411.08719 • Published Nov 10, 2024

Why We Build Local Large Language Models: An Observational Analysis from 35 Japanese and Multilingual LLMs

Paper • 2412.14471 • Published Dec 19, 2024

mayank-mishra

authored a paper about 2 months ago

Granite Vision: a lightweight, open-source multimodal model for enterprise Intelligence

Paper • 2502.09927 • Published Feb 14

Taishi-N324

authored a paper about 2 months ago

Wider or Deeper? Scaling LLM Inference-Time Compute with Adaptive Branching Tree Search

Paper • 2503.04412 • Published Mar 6 • 1

terryyz

authored a paper 2 months ago

CodeArena: A Collective Evaluation Platform for LLM Code Generation

Paper • 2503.01295 • Published Mar 3 • 8

huu-ontocord

authored 3 papers 2 months ago

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 56

LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps

Paper • 2412.15035 • Published Dec 19, 2024 • 4

Project Alexandria: Towards Freeing Scientific Knowledge from Copyright Burdens via LLMs

Paper • 2502.19413 • Published Feb 26 • 19

JJitsev

authored a paper 2 months ago

Project Alexandria: Towards Freeing Scientific Knowledge from Copyright Burdens via LLMs

Paper • 2502.19413 • Published Feb 26 • 19

Taishi-N324

authored a paper 2 months ago

Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization

Paper • 2502.19261 • Published Feb 26 • 7

bzantium

authored a paper 2 months ago

Kanana: Compute-efficient Bilingual Language Models

Paper • 2502.18934 • Published Feb 26 • 66