Describe Anything Org

community

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

longlian authored a paper 15 days ago

Learning Adaptive Parallel Reasoning with Language Models

longlian authored a paper 15 days ago

Describe Anything: Detailed Localized Image and Video Captioning

richardaecn authored a paper 15 days ago

Describe Anything: Detailed Localized Image and Video Captioning

View all activity

describe-anything-org's activity

longlian

authored 2 papers 15 days ago

Learning Adaptive Parallel Reasoning with Language Models

Paper • 2504.15466 • Published 16 days ago • 42

Describe Anything: Detailed Localized Image and Video Captioning

Paper • 2504.16072 • Published 15 days ago • 60

richardaecn

authored a paper 15 days ago

Describe Anything: Detailed Localized Image and Video Captioning

Paper • 2504.16072 • Published 15 days ago • 60

richardaecn

authored a paper 27 days ago

Cosmos World Foundation Model Platform for Physical AI

Paper • 2501.03575 • Published Jan 7 • 78

richardaecn

authored 13 papers about 1 month ago

Unified Visual Relationship Detection with Vision and Language Models

Paper • 2303.08998 • Published Mar 16, 2023

The iNaturalist Species Classification and Detection Dataset

Paper • 1707.06642 • Published Jul 20, 2017

Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception

Paper • 2305.06324 • Published May 10, 2023 • 1

Fashionpedia: Ontology, Segmentation, and an Attribute Localization Dataset

Paper • 2004.12276 • Published Apr 26, 2020 • 1

Spatiotemporal Contrastive Video Representation Learning

Paper • 2008.03800 • Published Aug 9, 2020

Simple Copy-Paste is a Strong Data Augmentation Method for Instance Segmentation

Paper • 2012.07177 • Published Dec 13, 2020

A Simple Zero-shot Prompt Weighting Technique to Improve Prompt Ensembling in Text-Image Models

Paper • 2302.06235 • Published Feb 13, 2023

Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models

Paper • 2411.07126 • Published Nov 11, 2024 • 31

Edify 3D: Scalable High-Quality 3D Asset Generation

Paper • 2411.07135 • Published Nov 11, 2024

Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning

Paper • 2503.15558 • Published Mar 18 • 46

longlian

authored a paper about 2 months ago

Atlas: Multi-Scale Attention Improves Long Context Image Modeling

Paper • 2503.12355 • Published Mar 16 • 11

richardaecn

authored a paper 9 months ago

Wolf: Captioning Everything with a World Summarization Framework

Paper • 2407.18908 • Published Jul 26, 2024 • 33

richardaecn

authored a paper about 1 year ago

Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation

Paper • 2404.19752 • Published Apr 30, 2024 • 25

AI & ML interests

Recent Activity

Team members 2

describe-anything-org's activity