Xueguang Ma PRO

MrLight

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 hours ago
Reinforcement Pre-Training
upvoted a collection 5 days ago
Qwen3-Embedding
liked a dataset 6 days ago
TIGER-Lab/One-Shot-CFT-Data

Organizations

Castorini, Tevatron, TIGER-Lab, Hugging Face Discord Community, SVRL, RLHN

MrLight's activity

New activity in MrLight/dse-qwen2-2b-mrl-v1 3 months ago

Fix task tag

1
#5 opened 4 months ago by merve
New activity in Tevatron/wiki-ss-nq 7 months ago
New activity in MrLight/dse-qwen2-2b-mrl-v1 8 months ago
New activity in castorini/repllama-v1.1-mrl-7b-lora-passage 8 months ago

Cannot download the model.

2
#2 opened 8 months ago by hbh234
New activity in MrLight/dse-phi35-vidore-ft 9 months ago

Onnx support

2
#1 opened 9 months ago by freedmand
New activity in castorini/repllama-v1.1-mrl-7b-lora-passage about 1 year ago
New activity in castorini/rankllama-v1-7b-lora-passage about 1 year ago

13B model

2
#5 opened about 1 year ago by cramraj8
New activity in castorini/rankllama-v1-7b-lora-doc over 1 year ago

Code for training the LLM

2
#2 opened over 1 year ago by cramraj8
New activity in castorini/rankllama-v1-7b-lora-passage over 1 year ago

GradCache implementation?

3
#4 opened over 1 year ago by serialcoder
New activity in castorini/rankllama-v1-7b-lora-passage over 1 year ago
New activity in castorini/ance-msmarco-passage over 2 years ago

model documentation

1
#2 opened over 2 years ago by nazneen