- Balancing Speed and Stability: The Trade-offs of FP8 vs. BF16 Training in LLMs — arXiv:2411.08719, published Nov 10, 2024
- Why We Build Local Large Language Models: An Observational Analysis from 35 Japanese and Multilingual LLMs — arXiv:2412.14471, published Dec 19, 2024
- Wider or Deeper? Scaling LLM Inference-Time Compute with Adaptive Branching Tree Search — arXiv:2503.04412, published Mar 2025
- Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization — arXiv:2502.19261, published Feb 2025