Hieu Lam
lamhieu
94 followers · 9 following
lh0x00
AI & ML interests
.-.
Recent Activity
liked a Space about 23 hours ago: baohuynhbk14/Qwen3-VL-Demo
replied to their post 3 days ago:
Introducing the xLLMs Dataset Collection

The xLLMs project is a growing suite of multilingual and multimodal dialogue datasets designed to train and evaluate advanced conversational LLMs. Each dataset focuses on a specific capability, from long-context reasoning and factual grounding to STEM explanations, math Q&A, and polite multilingual interaction.

Explore the full collection on Hugging Face: https://huggingface.co/collections/lamhieu/xllms-66cdfe34307bb2edc8c6df7d

Highlight: xLLMs - Dialogue Pubs
A large-scale multilingual dataset built from document-guided synthetic dialogues (Wikipedia, WikiHow, and technical sources). It's ideal for training models on long-context reasoning, multi-turn coherence, and tool-augmented dialogue across 9 languages.
https://huggingface.co/datasets/lamhieu/xllms_dialogue_pubs

Designed for:
- Long-context and reasoning models
- Multilingual assistants
- Tool-calling and structured response learning

All datasets are open for research and development use: free, transparent, and carefully curated to improve dialogue model quality.
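For readers who want to inspect the highlighted dataset, here is a minimal sketch that previews a few records with the Hugging Face `datasets` library. The repo id comes from the post. The helper name `peek`, the `train` split, and the absence of a required configuration name are assumptions, since the post does not describe the dataset layout.

```python
from itertools import islice

DATASET_ID = "lamhieu/xllms_dialogue_pubs"  # repo id taken from the post


def peek(dataset_id: str = DATASET_ID, split: str = "train", n: int = 3) -> list:
    """Return the first n records of a Hub dataset without a full download.

    `peek` is a hypothetical helper; the "train" split is an assumption.
    """
    # Imported lazily so this module loads even if `datasets` is not installed.
    from datasets import load_dataset

    # streaming=True iterates over records remotely instead of fetching every file.
    stream = load_dataset(dataset_id, split=split, streaming=True)
    return list(islice(stream, n))


if __name__ == "__main__":
    for record in peek():
        # Column names are not documented in the post, so print whatever keys exist.
        print(sorted(record.keys()))
```

Streaming avoids downloading the full dataset just to look at its structure. If the repository defines several configurations, `load_dataset` will ask for one by name; the dataset card is the place to check.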
updated a Space 4 days ago: lamhieu/lightweight-embeddings
Organizations
lamhieu's datasets (37), sorted by recently updated
lamhieu/xllms_dialogue_greetings • Viewer • Updated 5 days ago • 41.3k • 15 • 1
lamhieu/xllms_dialogue_pubs • Viewer • Updated 5 days ago • 999k • 61 • 3
lamhieu/xllms_dialogue_wildchat • Viewer • Updated Sep 4, 2024 • 206k • 13 • 1
lamhieu/xllms_dialogue_stem • Viewer • Updated Sep 4, 2024 • 110k • 11
lamhieu/xllms_dialogue_mathqa • Viewer • Updated Sep 4, 2024 • 395k • 8
lamhieu/itorca_dpo_en • Viewer • Updated Jul 1, 2024 • 5.92k • 7 • 1
lamhieu/beyond_dpo_en • Viewer • Updated Jul 1, 2024 • 25k • 6
lamhieu/itorca_dpo_vi • Viewer • Updated Jul 1, 2024 • 12.9k • 4
lamhieu/beyond_dpo_vi • Viewer • Updated Jul 1, 2024 • 25k • 3
lamhieu/wikihow_summarize_dialogue_vi • Viewer • Updated May 17, 2024 • 6.62k • 11 • 1
lamhieu/mabrycodes_dialogue_en • Viewer • Updated May 17, 2024 • 599k • 19 • 1
lamhieu/mabrycodes_dialogue_vi • Viewer • Updated May 17, 2024 • 599k • 19 • 2
lamhieu/medical_mediqa_dialogue_en • Viewer • Updated May 17, 2024 • 2.21k • 11 • 1
lamhieu/medical_pubmed_dialogue_en • Viewer • Updated May 17, 2024 • 2.45k • 16
lamhieu/medical_advice_dialogue_en • Viewer • Updated May 17, 2024 • 8.68k • 16 • 1
lamhieu/medical_wikidoc_dialogue_en • Viewer • Updated May 17, 2024 • 10k • 6 • 3
lamhieu/medical_medqa_dialogue_en • Viewer • Updated May 17, 2024 • 10.2k • 7 • 2
lamhieu/medical_terms_dialogue_en • Viewer • Updated May 17, 2024 • 6.86k • 34 • 1
lamhieu/math_cc_dialogue_en • Viewer • Updated May 17, 2024 • 47.5k • 9
lamhieu/math_gs_dialogue_en • Viewer • Updated May 17, 2024 • 8.79k • 5
lamhieu/math_arxiv_dialogue_en • Viewer • Updated May 17, 2024 • 8.79k • 30
lamhieu/sharegpt_dialogue_base • Viewer • Updated May 17, 2024 • 112k • 16 • 2
lamhieu/math_metaqa_dialogue_en • Viewer • Updated May 17, 2024 • 40k • 15
lamhieu/beyond_dialogue_vi • Viewer • Updated May 17, 2024 • 25k • 6
lamhieu/tstory_dialogue_vi • Viewer • Updated May 17, 2024 • 5.44k • 3
lamhieu/slwiki_dialogue_vi • Viewer • Updated May 17, 2024 • 1.98k • 3
lamhieu/lima_dialogue_vi • Viewer • Updated May 17, 2024 • 1.03k • 4
lamhieu/slorca_dialogue_en • Viewer • Updated May 17, 2024 • 18k • 14
lamhieu/oasst_dialogue_base • Viewer • Updated May 17, 2024 • 9.85k • 3
lamhieu/oasst_dialogue_vi • Viewer • Updated May 17, 2024 • 3.6k • 4