Qingkai Fang

poeroz

https://fangqingkai.github.io/

AI & ML interests

Large Language Models, Speech-Language Models, Speech Translation

Recent Activity

updated a dataset 16 days ago

ICTNLP/InstructS2S-Eval

published a dataset 16 days ago

ICTNLP/InstructS2S-Eval

updated a model about 1 month ago

poeroz/st-7b

authored 5 papers about 2 months ago

Bridging the Gap between Synthetic and Authentic Images for Multimodal Machine Translation

Paper • 2310.13361 • Published Oct 20, 2023

BayLing: Bridging Cross-lingual Alignment and Instruction Following through Interactive Translation for Large Language Models

Paper • 2306.10968 • Published Jun 19, 2023 • 7

DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation

Paper • 2310.07403 • Published Oct 11, 2023

BayLing 2: A Multilingual Large Language Model with Efficient Language Alignment

Paper • 2411.16300 • Published Nov 25, 2024

LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis

Paper • 2505.02625 • Published May 5 • 22

authored a paper 6 months ago

LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token

Paper • 2501.03895 • Published Jan 7 • 53

authored 3 papers 10 months ago

StreamSpeech: Simultaneous Speech-to-Speech Translation with Multi-task Learning

Paper • 2406.03049 • Published Jun 5, 2024 • 1

Can We Achieve High-quality Direct Speech-to-Speech Translation without Parallel Speech Data?

Paper • 2406.07289 • Published Jun 11, 2024 • 1

LLaMA-Omni: Seamless Speech Interaction with Large Language Models

Paper • 2409.06666 • Published Sep 10, 2024 • 58