Anthonny Olime's picture

Anthonny Olime

Aviv-anthonnyolime

·

AI & ML interests

None yet

Recent Activity

liked a model about 22 hours ago

alimama-creative/FLUX.1-dev-Controlnet-Inpainting-Beta

upvoted a collection 1 day ago

upvoted a collection 1 day ago

Papers - Google

View all activity

Organizations

Aviv-anthonnyolime's activity

upvoted 2 collections 1 day ago

image

229 items • Updated about 19 hours ago • 2

Papers - Google

53 items • Updated Nov 2, 2024 • 2

upvoted 2 papers 1 day ago

ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion

Paper • 2403.18818 • Published Mar 27, 2024 • 26

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 8 days ago • 265

upvoted an article 1 day ago

Article

PEFT: Parameter-Efficient Fine-Tuning Methods for LLMs

By

•

6 days ago

• 9

upvoted 2 papers 10 days ago

LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding

Paper • 2404.16710 • Published Apr 25, 2024 • 77

LaDiMo: Layer-wise Distillation Inspired MoEfier

Paper • 2408.04278 • Published Aug 8, 2024 • 1

upvoted 2 papers 14 days ago

MoH: Multi-Head Attention as Mixture-of-Head Attention

Paper • 2410.11842 • Published Oct 15, 2024 • 21

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 16 days ago • 270

upvoted a collection 22 days ago

Phi-4

Phi-4 small language model. • 2 items • Updated 22 days ago • 45

upvoted a collection 23 days ago

Cosmos

The collection of Cosmos models • 31 items • Updated 13 days ago • 251

upvoted a paper 23 days ago

TinyHelen's First Curriculum: Training and Evaluating Tiny Language Models in a Simpler Language Environment

Paper • 2501.00522 • Published 30 days ago • 1

upvoted 2 papers 24 days ago

Virgo: A Preliminary Exploration on Reproducing o1-like MLLM

Paper • 2501.01904 • Published 27 days ago • 31

VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Paper • 2501.01957 • Published 27 days ago • 42

upvoted an article 24 days ago

Article

Fine-tune SmolLM's on custom synthetic data

By

•

25 days ago

• 16

upvoted a paper 28 days ago

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Paper • 2306.07691 • Published Jun 13, 2023 • 6

upvoted a collection 28 days ago

Scaling Test-Time Compute with Open Models

Models and datasets used in our blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute • 10 items • Updated 24 days ago • 22

upvoted 3 papers about 1 month ago

Vision Transformers Need Registers

Paper • 2309.16588 • Published Sep 28, 2023 • 78

Deliberation in Latent Space via Differentiable Cache Augmentation

Paper • 2412.17747 • Published Dec 23, 2024 • 30

Are Transformers with One Layer Self-Attention Using Low-Rank Weight Matrices Universal Approximators?

Paper • 2307.14023 • Published Jul 26, 2023 • 1