Zhijian Liu's picture

3 8

Zhijian Liu

zhijianliu

·

https://zhijianliu.com

AI & ML interests

Efficient machine learning and systems

Recent Activity

authored a paper about 1 month ago

Scaling RL to Long Videos

upvoted a paper about 1 month ago

Scaling RL to Long Videos

authored a paper about 2 months ago

SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity

View all activity

Organizations

authored a paper about 1 month ago

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10 • 156

authored a paper about 2 months ago

SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity

Paper • 2506.16500 • Published Jun 19 • 17

authored 12 papers 3 months ago

SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer

Paper • 2303.17605 • Published Mar 30, 2023

Deep Leakage from Gradients

Paper • 1906.08935 • Published Jun 21, 2019

HAT: Hardware-Aware Transformers for Efficient Natural Language Processing

Paper • 2005.14187 • Published May 28, 2020 • 2

MapPrior: Bird's-Eye View Map Layout Estimation with Generative Models

Paper • 2308.12963 • Published Aug 24, 2023

TorchSparse: Efficient Point Cloud Inference Engine

Paper • 2204.10319 • Published Apr 21, 2022

FlatFormer: Flattened Window Attention for Efficient Point Cloud Transformer

Paper • 2301.08739 • Published Jan 20, 2023

LongVILA: Scaling Long-Context Visual Language Models for Long Videos

Paper • 2408.10188 • Published Aug 19, 2024 • 53

AMC: AutoML for Model Compression and Acceleration on Mobile Devices

Paper • 1802.03494 • Published Feb 10, 2018

APQ: Joint Search for Network Architecture, Pruning and Quantization Policy

Paper • 2006.08509 • Published Jun 15, 2020

NVILA: Efficient Frontier Visual Language Models

Paper • 2412.04468 • Published Dec 5, 2024 • 60

VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge

Paper • 2411.12915 • Published Nov 19, 2024

Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding

Paper • 2505.22618 • Published May 28 • 42

authored 2 papers 6 months ago

Token-Efficient Long Video Understanding for Multimodal LLMs

Paper • 2503.04130 • Published Mar 6 • 95

LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention

Paper • 2502.14866 • Published Feb 20 • 13

authored 2 papers over 1 year ago

StreamDiffusion: A Pipeline-level Solution for Real-time Interactive Generation

Paper • 2312.12491 • Published Dec 19, 2023 • 73

Point Transformer V3: Simpler, Faster, Stronger

Paper • 2312.10035 • Published Dec 15, 2023 • 21

authored 2 papers almost 2 years ago

LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models

Paper • 2309.12307 • Published Sep 21, 2023 • 89

BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation

Paper • 2205.13542 • Published May 26, 2022