Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Long Le's picture
1 1 2

Long Le

lole25
·
https://longtanle.github.io/
  • https://github.com/longtanle

AI & ML interests

None yet

Organizations

DUAL Group's profile picture DUAL-GPO-2's profile picture

Collections 1

LLM_Alignment
  • iREPO: implicit Reward Pairwise Difference based Empirical Preference Optimization

    Paper • 2405.15230 • Published May 24, 2024 • 3
LLM_Alignment
  • iREPO: implicit Reward Pairwise Difference based Empirical Preference Optimization

    Paper • 2405.15230 • Published May 24, 2024 • 3

models 27

lole25/zephyr-7b-irepo-i1

Text Generation • 7B • Updated May 11, 2024 • 35

lole25/zephyr-7b-gpo-v9-i1

Updated May 8, 2024 • 4

lole25/zephyr-7b-gpo-v7-i1

Updated May 8, 2024 • 28

lole25/zephyr-7b-gpo-v6-i1

Updated May 7, 2024 • 5

lole25/zephyr-7b-gpo-gen-i1

Updated Apr 25, 2024 • 74

lole25/phi-2-gpo-test-iter-1

Updated Mar 18, 2024 • 32

lole25/phi-2-gpo-test-iter-0

Updated Mar 18, 2024 • 4

lole25/phi-2-gpo-test-iter-2

Updated Mar 18, 2024 • 23

lole25/phi-2-gpo-lora-ultrafeedback-test-1

Updated Mar 18, 2024 • 18

lole25/phi-2-gpo-lora-ultrafeedback-test

Updated Mar 18, 2024 • 2
View 27 models

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs