Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
142.3
TFLOPS
22
2
宋小猫
SongXiaoMao
Follow
0 followers
·
2 following
AI & ML interests
None yet
Recent Activity
liked
a model
19 days ago
Qwen/Qwen2.5-Omni-7B
new
activity
about 1 month ago
Qwen/QwQ-32B:
When will you fix the model replies missing</think>\n start tags
new
activity
about 1 month ago
Qwen/QwQ-32B:
When answering questions in Chinese, the model frequently terminates prematurely (outputs the end token). Is this a common problem?
View all activity
Organizations
None yet
SongXiaoMao
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a model
19 days ago
Qwen/Qwen2.5-Omni-7B
Any-to-Any
•
Updated
6 days ago
•
171k
•
1.45k
New activity in
Qwen/QwQ-32B
about 1 month ago
When will you fix the model replies missing</think>\n start tags
17
#19 opened about 2 months ago by
xldistance
When answering questions in Chinese, the model frequently terminates prematurely (outputs the end token). Is this a common problem?
1
#40 opened about 2 months ago by
zhangw355
missing opening <think>
18
#4 opened about 2 months ago by
getfit
New activity in
Valdemardi/DeepSeek-R1-Distill-Llama-70B-AWQ
about 2 months ago
AWQ q6
1
#1 opened 3 months ago by
D-r-e
New activity in
unsloth/DeepSeek-R1-GGUF
2 months ago
I tested dynamic 1.58bit and 2.22bit, All thoughts are empty?
9
#24 opened 2 months ago by
SongXiaoMao
No think tokens visible
6
#15 opened 3 months ago by
sudkamath
New activity in
PowerInfer/SmallThinker-3B-Preview
3 months ago
How to Pair with Larger Models
4
#7 opened 4 months ago by
windkkk
New activity in
Qwen/QwQ-32B-Preview
4 months ago
multi GPU inferencing
2
#18 opened 5 months ago by
cjj2003
Use sample code to start error reporting
1
#45 opened 4 months ago by
SongXiaoMao
vllm reply garbled
3
#29 opened 5 months ago by
SongXiaoMao
vllm has problems running this model
3
#46 opened 4 months ago by
SongXiaoMao
Can you officially support VLLM?
1
#48 opened 4 months ago by
SongXiaoMao
New activity in
TechxGenus/Mistral-Large-Instruct-2407-AWQ
9 months ago
The model can be started using vllm, but no dialogue is possible.
3
#2 opened 9 months ago by
SongXiaoMao
updated
a model
9 months ago
SongXiaoMao/testYI
Updated
Jul 13, 2024
•
21
liked
a model
12 months ago
deepseek-ai/DeepSeek-V2-Chat
Text Generation
•
Updated
Jun 8, 2024
•
1.38k
•
460
Load more