Is this a new model?

#1
by sonicrules1234 - opened

I know that there was a dolphin venice mistral model before this. What happened to that one, and what is the difference between that one and this one?

Cognitive Computations org

This is an updated version based on feedback we received on v1.
It is the first time we have used RL to remove censorship from a model, and it also includes new datasets aimed at making the model less dry (the 24B is very STEM-oriented).

The older version can still be accessed privately here, with this key, for anyone who still wants to use it:

cognitivecomputations/Dolphin-Mistral-24B-Venice-Edition-Legacy
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

It can be run with

export HF_TOKEN="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"

before your startup command

or

--token "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" \

in the startup command
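For anyone who would rather fetch the legacy weights from a script than pass the key to a startup command, here is a minimal sketch. It assumes the `huggingface_hub` package is installed and that `HF_TOKEN` has already been exported as shown above. The helper names `resolve_token` and `download_legacy` are hypothetical and only for illustration.

```python
import os

# Repo name taken from the post above.
LEGACY_REPO = "cognitivecomputations/Dolphin-Mistral-24B-Venice-Edition-Legacy"


def resolve_token(env=None):
    """Return the access token from HF_TOKEN, or raise if it is missing."""
    env = os.environ if env is None else env
    token = env.get("HF_TOKEN", "").strip()
    if not token:
        raise RuntimeError("HF_TOKEN is not set; export it before running")
    return token


def download_legacy(local_dir=None):
    """Download the legacy repo and return the local path of the snapshot."""
    # Imported lazily so resolve_token() works without the package installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id=LEGACY_REPO,
        token=resolve_token(),
        local_dir=local_dir,  # None means the default Hugging Face cache
    )
```

Calling `download_legacy()` returns the directory holding the downloaded files, which can then be pointed at by whatever loader you normally use. Reading the key from the environment keeps it out of the script itself.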

If it is a new, updated version, can it be renamed to make that more obvious? With V2 in the name or something, it would be less confusing.

Yes, when a model is silently replaced in the same repo, there will be endless confusion about which model (once converted into other formats) is which. That situation sucks especially for end users: nobody will know which model is the real one.

Request DOI?
