Stanisław Szymczyk
sszymczyk
Please test the QwQ-32B-Preview model
➕
2
#3 opened 11 months ago
by
sszymczyk
This model performs worse in complex problems compared to the DeepSeek R1
17
#254 opened 8 months ago
by
sszymczyk
Requesting Support for GGUF Quantization of MiniMax-Text-01 through llama.cpp
👍
24
7
#1 opened 9 months ago
by
Doctor-Chad-PhD
Missing tokenizer.json and tokenizer_config.json files
1
#2 opened 9 months ago
by
sszymczyk
Please add the "tokenizer.model" file
2
#3 opened 9 months ago
by
ken133
Hardware Requirements
6
#1 opened 11 months ago
by
Lightchain
Can you provide code for inference with MCTS?
👍
6
6
#3 opened 11 months ago
by
sszymczyk
Reason behind not using special tokens in the prompt format?
2
#2 opened 11 months ago
by
Doctor-Shotgun
The curse of the Consolidated Safetensors strikes again...
2
#4 opened 11 months ago
by
jukofyork
The model often enters infinite generation loops
👍
5
13
#32 opened about 1 year ago
by
sszymczyk
Calculation of _mscale during YARN RoPE scaling
1
#4 opened over 1 year ago
by
sszymczyk
Wrong BOS and EOS tokens in tokenizer.model file
👍
1
1
#12 opened over 1 year ago
by
sszymczyk
Problem with repeated generation of newline characters
2
#3 opened over 1 year ago
by
sszymczyk
Possible error in tokenizer.json
👍
🔥
4
6
#6 opened over 1 year ago
by
sszymczyk
Model is paraphrasing text instead of citing it verbatim
3
#7 opened over 1 year ago
by
sszymczyk