Wei Liu
PeterV09
AI & ML interests
Machine Learning, Natural Language Processing
Recent Activity
upvoted
a
paper
about 19 hours ago
Advancing Multimodal Reasoning: From Optimized Cold Start to Staged
Reinforcement Learning
Organizations
Collections
2
models
18
PeterV09/llava-1.6-alignmentv2
Text Generation
•
Updated
•
8
PeterV09/llava-1.6-beta-26
Updated
PeterV09/llava-1.6-asft
Updated
PeterV09/llava-1.6-4sftmse
Updated
•
2
PeterV09/llava-1.6-3sft0.5
Updated
•
2
PeterV09/llava-1.6-2sft
Updated
•
3
PeterV09/llava-1.6-sft
Text Generation
•
Updated
•
9
PeterV09/mistral-7b-300k-6k-a100-6e-valid-hkust_2-l4k
Text Generation
•
Updated
•
17
PeterV09/deita-6k-sft-fordpo
Text Generation
•
Updated
•
12
PeterV09/mistral-7b-300k-6k-a100-6e-valid-7
Text Generation
•
Updated
•
11
datasets
0
None public yet