EvalLM: Interactive Evaluation of Large Language Model Prompts on User-Defined Criteria (arXiv:2309.13633, published Sep 24, 2023)
Prometheus: Inducing Fine-grained Evaluation Capability in Language Models (arXiv:2310.08491, published Oct 12, 2023)
Aligning Large Language Models through Synthetic Feedback (arXiv:2305.13735, published May 23, 2023)
The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning (arXiv:2305.14045, published May 23, 2023)
Who Wrote this Code? Watermarking for Code Generation (arXiv:2305.15060, published May 24, 2023)
Dialogue Summaries as Dialogue States (DS2), Template-Guided Summarization for Few-shot Dialogue State Tracking (arXiv:2203.01552, published Mar 3, 2022)
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models (arXiv:2405.01535, published May 2, 2024)
The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models (arXiv:2406.05761, published Jun 9, 2024)