Guang Liu
ZacLiu


AI & ML interests
None yet
Recent Activity
upvoted a paper 2 days ago
MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval
upvoted a collection 2 days ago
MegaPairs
new activity
6 days ago
BAAI/Infinity-MM: Coordinate scale
Organizations
ZacLiu's activity
stage2 data only has 21M
5
#2 opened 4 months ago by HensonLiu
Hope to Know Training Time for Aquila-VL-2B with Infinity-MM
1
#5 opened 4 months ago by SSSAMMMM
Is the dataset entirely single-image?
3
#6 opened 3 months ago by nan1248
ChartQA, DocVQA, InfoVQA, etc. are clearly lower than the reported results
13
#10 opened about 2 months ago by Ivy1997

Different dataset versions ? 3M / 0608 / 06012
9
#4 opened 9 months ago by philschmid

Language field
1
#5 opened 9 months ago by Tijmen2

About the use of the 3M data and the chat data
6
#10 opened 8 months ago by Spurslipu
Some conversations should begin with a system turn
1
#11 opened 8 months ago by VIPSP
Question about the dataset newly updated in August
7
#14 opened 7 months ago by Spurslipu
0625 Split Error: `pyarrow.lib.ArrowInvalid: Expected to read 538970747 metadata bytes, but only read 1072`
2
#15 opened 7 months ago by Avelina

the perspective of instruction compliance
1
#17 opened 7 months ago by YvanLee
Are there plans to open-source the 0729 chat dataset?
2
#16 opened 7 months ago by yixinsong

Category mapping issue in the instruction dataset
1
#19 opened 6 months ago by Alwin114
About `system_prompt` setting when fine-tuning by this dataset
2
#22 opened 6 months ago by Remixa

About the details of TF-IDF and pHash
6
#12 opened 11 days ago by Abracdabra-H

Did you dedupe this???
2
#3 opened 9 months ago by rombodawg

Decontamination?
1
#20 opened 6 months ago by alpayariyak

git lfs pull fails
1
#21 opened 6 months ago by zheong
What are the Differences and Overlap between 7M, 7M Domain, Gen and 0625?
2
#23 opened 6 months ago by alpayariyak
