Shuyue Jia (Bruce)
shuyuej
AI & ML interests
A Ph.D. student at @vkola-lab, Boston University. Passionate about Large Language Models (LLMs), Multimodal Foundation Models, Generative AI, and Medical AI.
shuyuej's activity
What Happens If the Prompt Exceeds 8,196 Tokens? And difference between input limit and context length limit?
👀
1
3
#36 opened 6 months ago by averyyu99
quant versions?
👍
5
1
#12 opened 6 months ago by apol

RecursionError: maximum recursion depth exceeded
1
#1 opened almost 2 years ago by WajihUllahBaig
missing model.safetensors.index.json
3
#1 opened 10 months ago by kresimirfijacko
Can you create gptq 8 bits quants?
1
#1 opened 10 months ago by rjmehta
Update quantize_config.json
1
#12 opened 10 months ago by shuyuej

Update config.json
1
#11 opened 10 months ago by shuyuej

Source codes to quantize the LLaMA 3.1 405B model
3
#10 opened 10 months ago by shuyuej

Request for Mistral Large Instruct GPTQ INT4
4
#2 opened 11 months ago by sparsh35
Missing config.json
➕
2
5
#6 opened 11 months ago by wxl2001
Learning Rate during pretraining
1
#58 opened 11 months ago by shuyuej

Model max_seq_length
7
#6 opened 11 months ago by shuyuej

Model max_seq_length
1
#4 opened 11 months ago by shuyuej

Where can we find `eval_medical_llm.py` and `main.py`
1
#15 opened about 1 year ago by shuyuej

Fine-Tune a gemma model for question answering
👍
1
17
#62 opened about 1 year ago by Iamexperimenting
Weird Performance Issue with Gemma-7b compared to Gemma-2b with Qlora
6
#91 opened about 1 year ago by UserDAN
What is the actual context size of mistralai/Mixtral-8x7B-Instruct-v0.1 model
4
#186 opened about 1 year ago by Pradeep1995

Very different results with float16. [Actually, gemma-7b-it does not work with float16]
👍
3
6
#33 opened over 1 year ago by EarthWorm001