Undi PRO

Undi95

AI & ML interests

I search sleep

Recent Activity

published a model 18 days ago
Undi95/QwQ-RP-LoRA
published a model 18 days ago
Undi95/QwQ-RP

Organizations

Caldera AI, Pygmalion, unalignment, NeverSleep, OnlyThings, MinervaAI-Private, MinervaAI, Social Post Explorers, onlynow, NeverSleep - Historical, MergeFuel, He-He, MixtureMaxing, SillyTilly, Anthracite, OnlyNow2, 410

Undi95's activity

New activity in Undi95/QwQ-RP 16 days ago
New activity in Undi95/QwQ-RP-GGUF 18 days ago

Error in LM Studio

3
#1 opened 18 days ago by
Sirfrummel
New activity in Undi95/MistralThinker-v1.1 19 days ago
New activity in Undi95/Mistral-11B-v0.1 19 days ago

FYI

2
#5 opened 19 days ago by
yamatazen
replied to their post 21 days ago

That's what some of my datasets do, but then you're still stuck with only one reply trained, not an entire conversation.
I'm racking my brain over that, haha.

Edit: I misread. If you add multiple thinking blocks in the context, the model gets confused, because the chat template trims them out of the context so we don't waste tokens we no longer need.
So we can't train it like this either, because the bot would have multiple thinking processes in the conversation.
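
To make the trimming concrete, here is a minimal sketch of what such a chat template effectively does to the history: it drops the reasoning from every earlier assistant turn and keeps it only on the latest one. The `<think>...</think>` tag pair, the message layout, and the function name `strip_past_thinking` are assumptions for illustration, not the actual template shipped with any of these models.

```python
import re

# Assumption: reasoning is wrapped in <think>...</think> tags.
THINK_BLOCK = re.compile(r"<think>.*?</think>\s*", flags=re.DOTALL)

def strip_past_thinking(messages):
    """Return a copy of `messages` where every assistant turn except the
    last one has its <think> block removed (hypothetical helper)."""
    # Index of the most recent assistant turn, or -1 if there is none.
    last_assistant = max(
        (i for i, m in enumerate(messages) if m["role"] == "assistant"),
        default=-1,
    )
    cleaned = []
    for i, m in enumerate(messages):
        content = m["content"]
        if m["role"] == "assistant" and i != last_assistant:
            # Older reasoning is dropped to save context tokens.
            content = THINK_BLOCK.sub("", content)
        cleaned.append({"role": m["role"], "content": content})
    return cleaned
```

A training sample rendered this way only ever shows the model one thinking block per conversation, which is why a multi-turn sample with reasoning in every reply does not match what the model sees at inference time.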

replied to their post 21 days ago

You could do that, but in that case the bot will not use <think> because it's not trained to do it on every reply.

What I would ideally want is a model that applies the thinking itself, without a system prompt or prefilling.

posted an update 21 days ago
Post
5168
Hi there!

If you want to create your own thinking model or make a better MistralThinker, I just uploaded my entire dataset, made with Deepseek R1, along with the axolotl config (well, I made them public).

Axolotl config: Undi95/MistralThinker-v1.1

The dataset: Undi95/R1-RP-ShareGPT3

You can also read everything I did in those two Discord screenshots from two days ago; I'm a little too lazy to rewrite it all, kek.

Hope you will use them!
New activity in Undi95/MistralThinker-v1.1 22 days ago

Are you considering...

3
#4 opened 22 days ago by
BigHuggyD
New activity in Undi95/Phi4-abliterated 24 days ago

Abliterated

4
#3 opened 2 months ago by
Nexesenex
New activity in Undi95/MistralThinker-v1.1 26 days ago

This shit is fire

13
#2 opened 27 days ago by
Ainonake