lomahony
AI & ML interests: PhD student
Organizations: None yet
lomahony/pythia-1.4b-helpful-sft • Text Generation • 1B • Updated • 5
lomahony/pythia-410m-helpful-sft • Text Generation • 0.4B • Updated • 12
lomahony/pythia-70m-helpful-sft • Text Generation • 0.1B • Updated • 8
lomahony/pythia-1b-helpful-sft • Text Generation • 1B • Updated • 10
lomahony/pythia-160m-helpful-sft • Text Generation • 0.2B • Updated • 5
lomahony/pythia-1b-helpful-dpo • Text Generation • Updated • 2
lomahony/pythia-70m-helpful-dpo • Text Generation • Updated • 4
lomahony/pythia-160m-helpful-dpo • Text Generation • Updated • 1
lomahony/pythia-1.4b-helpful-dpo • Text Generation • Updated • 3
lomahony/pythia-2.8b-helpful-dpo • Text Generation • Updated • 1
lomahony/pythia-410m-helpful-dpo • Text Generation • Updated
lomahony/pythia-2.8b-helpful-sft • Text Generation • 3B • Updated • 3
lomahony/pythia-2.8b-helpful-sfted1-dpo-3epochs • Updated
lomahony/pythia-2.8b-helpful-sfted0-dpo-3epochs • Updated
lomahony/pythia-2.8b-helpful-sfted3-dpo-3epochs • Updated
lomahony/pythia-2.8b-helpful-sfted2-dpo-3epochs • Updated
lomahony/pythia-2.8b-helpful-sfted2-dpo-3epochs-old • Updated
lomahony/pythia-2.8b-helpful-sfted1-dpo-3epochs-old • Text Generation • Updated
lomahony/pythia-2.8b-helpful-sfted0-dpo-3epochs-old • Updated
lomahony/pythia-2.8b-helpful-sft-3epochs • Text Generation • 3B • Updated • 3
lomahony/pythia-1.4b-helpful-sfted1-ppo-3epochs-old • Text Generation • 2B • Updated
lomahony/pythia-2.8b-helpful-sft-3epochs-old • Text Generation • 3B • Updated
lomahony/pythia-1b-helpful-sfted1-ppo-3epochs-old • Text Generation • 1B • Updated
lomahony/pythia-160m-helpful-sft-epoch2 • Text Generation • 0.2B • Updated
lomahony/pythia-70m-helpful-sft-epoch2 • Text Generation • 0.1B • Updated
lomahony/pythia-2.8b-helpful-sft-epoch2 • Text Generation • 3B • Updated
lomahony/pythia-1.4b-helpful-sft-epoch2 • Text Generation • 1B • Updated • 1
lomahony/pythia-1b-helpful-sft-epoch2 • Text Generation • 1B • Updated
lomahony/pythia-410m-helpful-sft-epoch2 • Text Generation • 0.4B • Updated • 4
lomahony/eleuther-pythia12b-hh-dpo • Text Generation • Updated • 2