shikras

community

https://github.com/shikras/shikra

zbrl

authored a paper 2 months ago

CreatiPoster: Towards Editable and Controllable Multi-Layer Graphic Design Generation

Paper • 2506.10890 • Published Jun 12 • 10

zbrl

authored a paper 5 months ago

Layton: Latent Consistency Tokenizer for 1024-pixel Image Reconstruction and Generation by 256 Tokens

Paper • 2503.08377 • Published Mar 11 • 2

chenkq

authored a paper 5 months ago

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26 • 165

chenkq

authored a paper 6 months ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 200

chenkq

authored a paper 11 months ago

Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

Paper • 2409.12191 • Published Sep 18, 2024 • 78

zbrl

authored 7 papers over 1 year ago

Link-Context Learning for Multimodal LLMs

Paper • 2308.07891 • Published Aug 15, 2023 • 16

Advancing Referring Expression Segmentation Beyond Single Image

Paper • 2305.12452 • Published May 21, 2023

Shikra: Unleashing Multimodal LLM's Referential Dialogue Magic

Paper • 2306.15195 • Published Jun 27, 2023

Graphic Design with Large Multimodal Model

Paper • 2404.14368 • Published Apr 22, 2024 • 2

Described Object Detection: Liberating Object Detection with Flexible Expressions

Paper • 2307.12813 • Published Jul 24, 2023 • 1

Co-Salient Object Detection with Co-Representation Purification

Paper • 2303.07670 • Published Mar 14, 2023

Gradient-Induced Co-Saliency Detection

Paper • 2004.13364 • Published Apr 28, 2020

chenkq

updated 2 models about 2 years ago

shikras/shikra-7b-delta-v1-0708

Text Generation • Updated Jul 11, 2023 • 11 • 3

shikras/shikra-7b-delta-v1

Text Generation • Updated Jul 2, 2023 • 19 • 7