Stanisław Szymczyk's picture

23 2

Stanisław Szymczyk

sszymczyk

·

AI & ML interests

None yet

Organizations

None yet

New activity in allenai/ZebraLogic 6 months ago

Please test the QwQ-32B-Preview model

#3 opened 11 months ago by

New activity in perplexity-ai/r1-1776 7 months ago

This model performs worse in complex problems compared to the DeepSeek R1

#254 opened 8 months ago by

New activity in MiniMaxAI/MiniMax-Text-01 9 months ago

Requesting Support for GGUF Quantization of MiniMax-Text-01 through llama.cpp

#1 opened 9 months ago by

Doctor-Chad-PhD

New activity in RUC-AIBOX/Virgo-72B 9 months ago

Missing tokenizer.json and tokenizer_config.json files

#2 opened 9 months ago by

Please add the "tokenizer.model" file

#3 opened 9 months ago by

New activity in Qwen/QwQ-32B-preview 11 months ago

Hardware Requirements

#1 opened 11 months ago by

New activity in AIDC-AI/Marco-o1 11 months ago

Can you provide code for inference with MCTS?

#3 opened 11 months ago by

New activity in allenai/Llama-3.1-Tulu-3-70B 11 months ago

Reason behind not using special tokens in the prompt format?

#2 opened 11 months ago by

New activity in mistralai/Mistral-Large-Instruct-2411 11 months ago

The curse of the Consolidated Safetensors strikes again...

#4 opened 11 months ago by

New activity in meta-llama/Llama-3.1-8B-Instruct about 1 year ago

The model often enters infinite generation loops

#32 opened about 1 year ago by

New activity in nvidia/Nemotron-4-340B-Instruct over 1 year ago

Gguf

#5 opened over 1 year ago by

New activity in deepseek-ai/DeepSeek-V2 over 1 year ago

Calculation of _mscale during YARN RoPE scaling

#4 opened over 1 year ago by

New activity in Snowflake/snowflake-arctic-instruct over 1 year ago

Wrong BOS and EOS tokens in tokenizer.model file

#12 opened over 1 year ago by

New activity in abacusai/TheProfessor-155b over 1 year ago

Problem with repeated generation of newline characters

#3 opened over 1 year ago by

New activity in mistralai/Mixtral-8x22B-Instruct-v0.1 over 1 year ago

Possible error in tokenizer.json

#6 opened over 1 year ago by

Model is paraphrasing text instead of citing it verbatim

#7 opened over 1 year ago by