reciprocate
·
AI & ML interests
Reward models
Organizations
reciprocate/mistral-7b-gsm8k-code-rm
Text Classification
•
7B
•
Updated
•
14
•
3
reciprocate/mistral-7b-rm
Text Classification
•
Updated
•
22
•
2
reciprocate/rm_beluga-7b_hh-full
Text Classification
•
Updated
•
12
reciprocate/rm-llama2-7b-gsm8k
Text Generation
•
Updated
•
29
reciprocate/llama2-7b-gsm8k
Text Generation
•
Updated
•
27
•
1
reciprocate/shepherd-13b
Text Generation
•
Updated
•
19
•
1
reciprocate/tiny-llama
Text Generation
•
Updated
•
68
•
2
reciprocate/vicuna-13b_rm_oasst-hh
Text Classification
•
Updated
•
41
reciprocate/openllama-13b-rlhf-v0
Text Generation
•
Updated
•
20
reciprocate/openllama-13b_rm_oasst-hh
Text Classification
•
Updated
•
20
reciprocate/gpt-j_rm_format-oa
Text Classification
•
Updated
•
12
•
1
reciprocate/dahoas-gptj-rm-static
Text Classification
•
Updated
•
40
reciprocate/gpt2-simulacra
Text Generation
•
Updated
•
18
reciprocate/ppo_hh_gpt-j
Text Generation
•
Updated
•
44
•
6
reciprocate/ppo_hh_pythia-125M
Text Generation
•
Updated
•
21
reciprocate/ppo_hh_neox-20B
Text Generation
•
Updated
•
20
•
2
reciprocate/ppo_hh_pythia-1B
Text Generation
•
Updated
•
26
reciprocate/ppo_hh_pythia-6B
Text Generation
•
Updated
•
69
•
5