Data Mining and Information Systems Lab

dmis-lab

https://dmis.korea.ac.kr

Recent Activity

upvoted a paper 2 days ago

Breaking Failure Cascades: Step-Aware Reinforcement Learning for Medical Multimodal Reasoning

published a model 3 days ago

dmis-lab/Qwen3-VL-8B-Instruct-MRPO

updated a model 3 days ago

dmis-lab/Qwen3-VL-8B-Instruct-MRPO

upvoted a paper 2 days ago

Breaking Failure Cascades: Step-Aware Reinforcement Learning for Medical Multimodal Reasoning

Paper • 2606.31825 • Published 6 days ago • 19

upvoted 2 papers 2 months ago

ToxReason: A Benchmark for Mechanistic Chemical Toxicity Reasoning via Adverse Outcome Pathway

Paper • 2604.06264 • Published Apr 7 • 4

Learning from Negative Samples in Generative Biomedical Entity Linking

Paper • 2408.16493 • Published Aug 29, 2024 • 1

upvoted a paper 3 months ago

ASGuard: Activation-Scaling Guard to Mitigate Targeted Jailbreaking Attack

Paper • 2509.25843 • Published Apr 14 • 20

upvoted a paper 7 months ago

The Curious Case of Analogies: Investigating Analogical Reasoning in Large Language Models

Paper • 2511.20344 • Published Nov 25, 2025 • 14

upvoted a paper 9 months ago

Thinking Sparks!: Emergent Attention Heads in Reasoning Models During Post Training

Paper • 2509.25758 • Published Sep 30, 2025 • 25

upvoted a paper 11 months ago

HierSearch: A Hierarchical Enterprise Deep Search Framework Integrating Local and Web Searches

Paper • 2508.08088 • Published Aug 11, 2025 • 29

upvoted a collection 11 months ago

Med-PRM

Collection

This collection hosts the Med-PRM series introduced in the paper Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards • 7 items • Updated Aug 16, 2025 • 4

upvoted a paper 11 months ago

CoTox: Chain-of-Thought-Based Molecular Toxicity Reasoning and Prediction

Paper • 2508.03159 • Published Aug 5, 2025 • 23

upvoted a paper about 1 year ago

Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large Language Models

Paper • 2506.19697 • Published Jun 24, 2025 • 45

upvoted a collection about 1 year ago

Outlier-Safe Pre-Training (OSP)

Collection

A collection of ablation and final models trained on the Outlier-Safe Pre-Training (OSP) framework. • 11 items • Updated Jun 26, 2025 • 4

upvoted a paper about 1 year ago

Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards

Paper • 2506.11474 • Published Jun 13, 2025 • 18

upvoted 6 papers over 1 year ago

Does Time Have Its Place? Temporal Heads: Where Language Models Recall Time-specific Information

Paper • 2502.14258 • Published Feb 20, 2025 • 26

System Message Generation for User Preferences using Open-Source Models

Paper • 2502.11330 • Published Feb 17, 2025 • 15

SimpleStrat: Diversifying Language Model Generation with Stratification

Paper • 2410.09038 • Published Oct 11, 2024 • 4

SuperCorrect: Supervising and Correcting Language Models with Error-Driven Insights

Paper • 2410.09008 • Published Oct 11, 2024 • 17

StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization

Paper • 2410.08815 • Published Oct 11, 2024 • 47

Baichuan-Omni Technical Report

Paper • 2410.08565 • Published Oct 11, 2024 • 88

upvoted a paper over 2 years ago

Gecko: Versatile Text Embeddings Distilled from Large Language Models

Paper • 2403.20327 • Published Mar 29, 2024 • 47
