Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Qingkai Fang's picture
10 4 21

Qingkai Fang

poeroz
21world's profile picture
·
https://fangqingkai.github.io/
  • poeroz

AI & ML interests

Large Language Models, Speech-Language Models, Speech Translation

Recent Activity

authored a paper 3 days ago
Bridging the Gap between Synthetic and Authentic Images for Multimodal Machine Translation
authored a paper 3 days ago
BayLing: Bridging Cross-lingual Alignment and Instruction Following through Interactive Translation for Large Language Models
authored a paper 3 days ago
DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation
View all activity

Organizations

Natural Language Processing Group, Institute of Computing Technology, Chinese Academy of Science's profile picture

poeroz's activity

upvoted a paper 4 months ago

LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token

Paper • 2501.03895 • Published Jan 7 • 53
upvoted a paper 7 months ago

Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data

Paper • 2410.18558 • Published Oct 24, 2024 • 20
upvoted a paper 8 months ago

LLaMA-Omni: Seamless Speech Interaction with Large Language Models

Paper • 2409.06666 • Published Sep 10, 2024 • 58
upvoted a paper about 1 year ago

Finetuned Multimodal Language Models Are High-Quality Image-Text Data Filters

Paper • 2403.02677 • Published Mar 5, 2024 • 18
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs