Morioh-cho

university

https://ntu.edu.tw

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

zenyn authored a paper 4 days ago

DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment

zenyn authored a paper 4 days ago

Analyzing Mitigation Strategies for Catastrophic Forgetting in End-to-End Training of Spoken Language Models

dcml0714 authored a paper 4 days ago

DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment

View all activity

zenyn

authored 2 papers 4 days ago

DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment

Paper • 2507.02768 • Published 23 days ago • 3

Analyzing Mitigation Strategies for Catastrophic Forgetting in End-to-End Training of Spoken Language Models

Paper • 2505.17496 • Published May 23

dcml0714

authored 2 papers 4 days ago

DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment

Paper • 2507.02768 • Published 23 days ago • 3

STITCH: Simultaneous Thinking and Talking with Chunked Reasoning for Spoken Language Models

Paper • 2507.15375 • Published 5 days ago • 23

kehanlu

authored 2 papers 11 days ago

DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment

Paper • 2507.02768 • Published 23 days ago • 3

Analyzing Mitigation Strategies for Catastrophic Forgetting in End-to-End Training of Spoken Language Models

Paper • 2505.17496 • Published May 23

kehanlu

updated a dataset 28 days ago

Morioh/shelf

Updated Mar 16 • 23

dcml0714

authored a paper about 2 months ago

Audio-Aware Large Language Models as Judges for Speaking Styles

Paper • 2506.05984 • Published Jun 6 • 15

kehanlu

authored 4 papers about 2 months ago

Investigating Zero-Shot Generalizability on Mandarin-English Code-Switched ASR and Speech-to-text Translation of Recent Foundation Models with Self-Supervision and Weak Supervision

Paper • 2401.00273 • Published Dec 30, 2023

A context-aware knowledge transferring strategy for CTC-based ASR

Paper • 2210.06244 • Published Oct 12, 2022

Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data

Paper • 2409.20007 • Published Sep 30, 2024 • 1

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

Paper • 2411.05361 • Published Nov 8, 2024 • 1

zenyn

authored 2 papers 2 months ago

Investigating Zero-Shot Generalizability on Mandarin-English Code-Switched ASR and Speech-to-text Translation of Recent Foundation Models with Self-Supervision and Weak Supervision

Paper • 2401.00273 • Published Dec 30, 2023

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

Paper • 2411.05361 • Published Nov 8, 2024 • 1

kehanlu

updated a model 2 months ago

Morioh/toilet

wilzzzz

updated a dataset 3 months ago

Morioh/shelf

Updated Mar 16 • 23

kehanlu

updated a dataset 3 months ago

Morioh/livingroom

Updated Mar 4 • 548

wilzzzz

updated a dataset 3 months ago

Morioh/livingroom

Updated Mar 4 • 548

Allen172

updated a dataset 3 months ago

Morioh/shelf

Updated Mar 16 • 23

dcml0714

authored a paper 4 months ago

Can Large Language Models Be an Alternative to Human Evaluations?

Paper • 2305.01937 • Published May 3, 2023 • 2