Yunus Serhat Bıçakçı

yunusserhat

https://www.yunusserhat.com

AI & ML interests

None yet

Recent Activity

liked a dataset 1 day ago

omersaidd/tts_mazlum_kiper_tur

liked a dataset 2 days ago

blanchon/LEVIR_CDPlus

liked a dataset 2 days ago

blanchon/INRIA-Aerial-Image-Labeling

Organizations

upvoted a collection 2 days ago

🛰️🌍 Geospatial Datasets

A curated collection of diverse geospatial and satellite imagery datasets. • 56 items • Updated Mar 11 • 23

upvoted a paper about 2 months ago

Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces

Paper • 2506.00123 • Published May 30 • 34

upvoted an article about 2 months ago

Article

Vision Language Models (Better, Faster, Stronger)

By [name missing] and 4 others • May 12 • 488

upvoted a collection 3 months ago

D-FINE

State-of-the-art real-time object detection model with Apache 2.0 licence • 15 items • Updated May 5 • 55

upvoted a collection 5 months ago

Türkçe VLMler

11 items • Updated Mar 4 • 10

upvoted 2 articles 5 months ago

Article

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

By [name missing] and 3 others • Mar 4 • 75

Article

FastRTC: The Real-Time Communication Library for Python

By [name missing] and 1 other • Feb 25 • 171

upvoted a paper 5 months ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 197

upvoted a collection 5 months ago

SigLIP2

36 items • Updated 16 days ago • 79

upvoted a collection 6 months ago

Visual Document Retrieval

A collection of models, datasets, and spaces in the VDR series • 5 items • Updated Jan 10 • 8

upvoted an article 6 months ago

Article

🚀 Build a Qwen 2.5 VL API endpoint with Hugging Face spaces and Docker!

By [name missing] • Jan 29 • 19

upvoted a collection 7 months ago

Jan 10 Releases 🌨️

38 items • Updated Jan 10 • 12

upvoted a collection 8 months ago

AIMv2

A collection of AIMv2 vision encoders covering several fixed resolutions and native resolution, plus a distilled checkpoint. • 19 items • Updated Nov 22, 2024 • 80

upvoted 2 collections 10 months ago

Molmo

Artifacts for open multimodal language models. • 5 items • Updated Apr 30 • 305

Llama 3.2

This collection hosts the transformers and original repos of Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 626

upvoted a collection about 1 year ago

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated May 1 • 573

upvoted a paper about 1 year ago

PaliGemma: A versatile 3B VLM for transfer

Paper • 2407.07726 • Published Jul 10, 2024 • 72

upvoted 2 collections about 1 year ago

Florence

9 items • Updated May 1 • 172

Nemotron 4 340B

Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 4 days ago • 163

upvoted an article about 1 year ago

Article

Vision Language Models Explained

By [name missing] and 1 other • Apr 11, 2024 • 420