The Meta Llama-3.1 model series can be used for distillation and fine-tuning, but this requires annotated preference data, so I created a Human Feedback Collector based on Gradio that logs data directly to the Hugging Face Hub (a minimal sketch follows the list below).
- Model: meta-llama/Meta-Llama-3.1-8B-Instruct
- Data: SFT, KTO, and DPO data
- Runs on free ZeroGPU hardware in Hugging Face Spaces
- Might need some human curation in Argilla
- Or provide some AI feedback with distilabel
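
The actual collector isn't reproduced here, but a minimal sketch of the general pattern could look like the following: a Gradio app that generates two candidate completions with meta-llama/Meta-Llama-3.1-8B-Instruct inside a ZeroGPU-decorated function, asks the user which one is better, and persists DPO-style (prompt, chosen, rejected) records to a dataset repo on the Hub via huggingface_hub.CommitScheduler. The repo id my-username/llama-human-feedback, the UI layout, and the generation settings are placeholders, not the implementation of the Space itself.

```python
import json
import uuid
from pathlib import Path

import gradio as gr
import spaces  # ZeroGPU helper, available inside Hugging Face Spaces
import torch
from huggingface_hub import CommitScheduler
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "meta-llama/Meta-Llama-3.1-8B-Instruct"

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
# On ZeroGPU the model is loaded at startup; the GPU is attached when a
# @spaces.GPU-decorated function runs.
model = AutoModelForCausalLM.from_pretrained(MODEL_ID, torch_dtype=torch.bfloat16).to("cuda")

# Local folder that CommitScheduler periodically pushes to a dataset repo on the Hub.
feedback_dir = Path("feedback")
feedback_dir.mkdir(exist_ok=True)
feedback_file = feedback_dir / f"{uuid.uuid4()}.jsonl"
scheduler = CommitScheduler(
    repo_id="my-username/llama-human-feedback",  # hypothetical dataset repo
    repo_type="dataset",
    folder_path=feedback_dir,
    every=5,  # commit every 5 minutes
)


@spaces.GPU  # allocate a ZeroGPU for the duration of the call
def generate(prompt: str) -> tuple[str, str]:
    """Generate two candidate completions so the user can pick the preferred one."""
    messages = [{"role": "user", "content": prompt}]
    inputs = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True, return_tensors="pt"
    ).to(model.device)
    outputs = model.generate(
        inputs, max_new_tokens=256, do_sample=True, temperature=0.8, num_return_sequences=2
    )
    texts = tokenizer.batch_decode(outputs[:, inputs.shape[-1]:], skip_special_tokens=True)
    return texts[0], texts[1]


def log_preference(prompt: str, completion_a: str, completion_b: str, choice: str) -> str:
    """Append one DPO-style record (prompt, chosen, rejected) to the JSONL file."""
    chosen, rejected = (completion_a, completion_b) if choice == "A" else (completion_b, completion_a)
    with scheduler.lock:  # avoid writing while a commit is in progress
        with feedback_file.open("a") as f:
            f.write(json.dumps({"prompt": prompt, "chosen": chosen, "rejected": rejected}) + "\n")
    return "Feedback saved, thanks!"


with gr.Blocks() as demo:
    prompt = gr.Textbox(label="Prompt")
    out_a = gr.Textbox(label="Completion A")
    out_b = gr.Textbox(label="Completion B")
    gr.Button("Generate").click(generate, inputs=prompt, outputs=[out_a, out_b])
    choice = gr.Radio(["A", "B"], label="Which completion is better?")
    status = gr.Textbox(label="Status")
    gr.Button("Submit preference").click(
        log_preference, inputs=[prompt, out_a, out_b, choice], outputs=status
    )

demo.launch()
```

The resulting JSONL records can be loaded as a preference dataset for DPO (or collapsed into per-completion labels for KTO), and the same Hub dataset can be pulled into Argilla for human curation or extended with AI feedback via distilabel.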