🤗 Serve any model with Inference Endpoints + Custom Handlers
•
3
oh nice to see you here @bechavarren 👋🏻
model.similarity(embeddings1, embeddings2)
and you'll get your similarity scores immediately. Model authors can specify their desired similarity score, so you don't have to worry about it anymore!distilabel
, so we implemented PrometheusEval
.PrometheusEval
running their 7B variant with vLLM in a single L40 on top of
HuggingFaceH4/instruction-dataset, we got the 327 existing prompt-completion pairs evaluated and pushed to the Hub in less than 2 minutes!