Richard Lian
richardlian
AI & ML interests
None yet
Recent Activity
upvoted an article 2 days ago: Train 400x faster Static Embedding Models with Sentence Transformers
liked a Space 15 days ago: bigcode/bigcode-models-leaderboard
upvoted an article 19 days ago: Efficient LLM Pretraining: Packed Sequences and Masked Attention
Organizations
richardlian's activity
Discrepancy between Base and Instruct model eos_token (#119, opened 8 months ago by richardlian)
Discrepancy in vocab size · 2 (#1, opened 10 months ago by richardlian)
YiTokenizer doesn't exist · 2 (#13, opened about 1 year ago by Xyzzyxsfr)