Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
65.1
TFLOPS
113
167
1616
Xi
xi0v
Follow
lunarflu's profile picture
woofwolfy's profile picture
GigaBoy's profile picture
35 followers
·
89 following
AI & ML interests
Reinforcement learning, Diffusion Model Merging, LLM Merging, Model Editing and Vision/Multimodal Model Fine-tuning.
Recent Activity
reacted
to
Kseniase
's
post
with 👀
about 3 hours ago
8 types of RoPE As we always use Transformers, it's helpful to understand RoPE—Rotary Position Embedding. Since token order matters, RoPE encodes it by rotating token embeddings based on their position, so the model knows how to interpret which token comes first, second, and so on. Here are 8 types of RoPE that can be implemented in different cases: 1. Original RoPE -> https://huggingface.co/papers/2104.09864 Encodes token positions by rotating token embeddings in the complex plane via a position-based rotation matrix, thereby providing the self-attention mechanism with relative positional info. 2. LongRoPE -> https://huggingface.co/papers/2402.13753 Extends the context window of pre-trained LLMs to 2048k tokens, leveraging non-uniformities in positional interpolation with an efficient search. 3. LongRoPE2 -> https://huggingface.co/papers/2502.20082 Extends the effective context window of pre-trained LLMs to the target! length, rescaling RoPE guided by “needle-driven” perplexity. 4. Multimodal RoPE (MRoPE) -> https://huggingface.co/papers/2502.13923 Decomposes positional embedding into 3 components: temporal, height and width, so that positional features are aligned across modalities: text, images and videos. 5. Directional RoPE (DRoPE) -> https://huggingface.co/papers/2503.15029 Adds an identity scalar, improving how angles are handled without extra complexity. It helps balance accuracy, speed, and memory usage. 6. VideoRoPE -> https://huggingface.co/papers/2502.05173 Adapts RoPE for video, featuring 3D structure, low-frequency temporal allocation, diagonal layout, and adjustable spacing. 7. VRoPE -> https://huggingface.co/papers/2502.11664 An another RoPE for video, which restructures positional indices and balances encoding for uniform spatial focus. 8. XPos (Extrapolatable Position Embedding) -> https://huggingface.co/papers/2212.10 Introduces an exponential decay factor into the rotation matrix, improving stability on long sequences.
liked
a model
about 3 hours ago
SUFE-AIFLM-Lab/Fin-R1
liked
a model
about 7 hours ago
OnomaAI/Log2char_Orion-14B
View all activity
Organizations
spaces
1
Paused
185
Stable Video Diffusion Img2Vid
✨
Animate Your Pictures With Stable VIdeo DIffusion
models
46
Sort: Recently updated
xi0v/Illu-v1-1-vpred
Text-to-Image
•
Updated
1 day ago
•
10
xi0v/IzumiXL-v22Vpred
Text-to-Image
•
Updated
2 days ago
•
11
xi0v/Ultra-7B
Text Generation
•
Updated
5 days ago
xi0v/T3Q-qwen2.5-14b-v1.0-e3-archive
Text Generation
•
Updated
6 days ago
xi0v/unaligned-1
Text Generation
•
Updated
6 days ago
xi0v/Linkbricks-Horizon-AI-Avengers-V1-32B-archive
Updated
9 days ago
xi0v/riku-14B
Updated
10 days ago
•
2
xi0v/GaLLM-14B-v0.1-archive
Updated
13 days ago
xi0v/Model-Dump
Updated
16 days ago
xi0v/ObsessionVpred11
Text-to-Image
•
Updated
18 days ago
•
9
Expand 46 models
datasets
1
xi0v/c-v
Updated
21 days ago
•
34