- BitNet: Scaling 1-bit Transformers for Large Language Models
  Paper • 2310.11453 • Published • 103
- Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
  Paper • 2404.14219 • Published • 257
- LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding
  Paper • 2404.16710 • Published • 80
Mayor
Eric111
AI & ML interests
None yet
Recent Activity
liked a model 3 days ago
cerebras/btlm-3b-8k-base
liked a model 6 days ago
deepseek-ai/DeepSeek-R1-0528-Qwen3-8B
liked a model 6 days ago
deepseek-ai/DeepSeek-R1-0528
Organizations
None yet
Collections
1
models
50
Eric111/Pleias-350m-instruct • Updated
Eric111/vicuna-13b-v1.5-16k-gguf • Updated • 3
Eric111/vicuna-13b-v1.5-gguf • Updated
Eric111/gemma3-1b-thinking-GGUF • Updated
Eric111/CatunaMayo3B-DPO • Updated
Eric111/CatunaMayo3B • Text Generation • Updated • 19
Eric111/UltraCatunaMayo-DPO-GGUF • Updated • 8
Eric111/UltraCatunaMayo-DPO • Text Generation • Updated • 13
Eric111/UltraCatunaMayo-GGUF • Updated • 15
Eric111/UltraCatunaMayo • Text Generation • Updated • 16
datasets
0
None public yet