SeerAttention / SeerAttention-Decode-R1-Distill-Qwen-14B-AttnGates

Text Generation

1 contributor

History: 8 commits

Latest commit: Update README.md (eac07d3, verified, 2 months ago)

.gitattributes (1.52 kB): initial commit, 2 months ago

README.md (3.13 kB): Update README.md, 2 months ago

attn_gate_weights.pth (101 MB, LFS): Upload folder using huggingface_hub, 2 months ago
Detected Pickle imports (3)
- "torch._utils._rebuild_tensor_v2"
- "collections.OrderedDict"
- "torch.BFloat16Storage"

config.json (1.01 kB): Upload folder using huggingface_hub, 2 months ago
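
The "Detected Pickle imports" entry for attn_gate_weights.pth lists the globals that the checkpoint's pickle stream would import when loaded. A rough sketch of how such a list can be produced locally, without unpickling anything, is shown below using only the Python standard library. The function names `pickle_imports` and `checkpoint_imports` are hypothetical, and the sketch assumes the `.pth` file is a zip archive containing one or more `*.pkl` members, which is the layout `torch.save` writes by default. It is not the scanner the hosting site uses. The `STACK_GLOBAL` handling is a simplification that can misreport names when the pickle reuses memoized strings.

```python
import pickle
import pickletools
import zipfile
from collections import OrderedDict


def pickle_imports(data):
    """List the 'module.name' globals a pickle stream references, without unpickling it."""
    found = set()
    strings = []  # string arguments seen so far, used to resolve STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # pickletools reports the argument as "module name" joined by one space
            module, name = arg.split(" ", 1)
            found.add(module + "." + name)
        elif opcode.name == "STACK_GLOBAL" and len(strings) >= 2:
            # Simplification: assume the two most recent strings are module and name
            found.add(strings[-2] + "." + strings[-1])
        elif isinstance(arg, str):
            strings.append(arg)
    return found


def checkpoint_imports(path_or_file):
    """Scan every '*.pkl' member of a zip-format checkpoint (assumed layout)."""
    found = set()
    with zipfile.ZipFile(path_or_file) as archive:
        for member in archive.namelist():
            if member.endswith(".pkl"):
                found |= pickle_imports(archive.read(member))
    return found


# Self-contained demo on an in-memory pickle rather than the real checkpoint
demo = pickle.dumps(OrderedDict([("layer.weight", [0.0, 1.0])]), protocol=2)
print(sorted(pickle_imports(demo)))
```

Against a downloaded copy of the checkpoint, `checkpoint_imports("attn_gate_weights.pth")` would be the call to try. If the file was saved in a different format, `zipfile` raises `BadZipFile` and the raw bytes would need to be passed to `pickle_imports` instead.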