Itay Levy
itlevy
itayoush
itay-levy-cs
AI & ML interests
None yet
itlevy's activity
New activity in nvidia/Llama-3_1-Nemotron-Ultra-253B-v1, 6 days ago
_prepare_generation_config bugfix (failed due to version update in transformers)
#14 opened 7 days ago by ishahaf
New activity in nvidia/Llama-3_1-Nemotron-51B-Instruct, 6 days ago
_prepare_generation_config bugfix (failed due to version update in transformers)
#25 opened 7 days ago by ishahaf
New activity in nvidia/Llama-3_1-Nemotron-Ultra-253B-CPT-v1, 6 days ago
_prepare_generation_config bugfix (failed due to version update in transformers)
#2 opened 7 days ago by ishahaf
New activity in nvidia/Llama-3_3-Nemotron-Super-49B-v1, 3 months ago
Nemotron 253B?
2
#10 opened 3 months ago by BoshiAI
New activity in nvidia/Llama-3_1-Nemotron-Ultra-253B-v1, 3 months ago
How come this pruned model has 162 layers?
5
#3 opened 3 months ago by ymcki
New activity in nvidia/Llama-3_1-Nemotron-Ultra-253B-CPT-v1, 3 months ago
add model card
#1 opened 3 months ago by itlevy
New activity in nvidia/Llama-3_1-Nemotron-51B-Instruct, 9 months ago
Patching hf bug that creates wrong cache length if only inputs_embeds are passed to the model
#19 opened 9 months ago by tomer-nv
New activity in nvidia/Llama-3_1-Nemotron-51B-Instruct, 10 months ago
DeciLMForCausalLM(DeciLMPreTrainedModel, GenerationMixin) for v4.50
#16 opened 10 months ago by itlevy
add batch_size attribute to VariableCache
#15 opened 10 months ago by itlevy
nvidia-open-model-license
#14 opened 10 months ago by itlevy
nvidia-open-model-license
#13 opened 10 months ago by itlevy
nvidia-open-model-license
#12 opened 10 months ago by itlevy
v4.46 support
#7 opened 10 months ago by itlevy
loading as llama model
1
#4 opened 10 months ago by KnutJaegersberg
v4.45 support
#6 opened 10 months ago by itlevy
fixed flash_attention backward_compat
#3 opened 10 months ago by itlevy
flash_attention_utils_backward_compat
#2 opened 10 months ago by itlevy