Itay Levy

itlevy

AI & ML interests

None yet

Organizations

New activity in nvidia/gpt-oss-puzzle-88B 3 months ago

Fix vLLM command

#10 opened 3 months ago by

New activity in nvidia/gpt-oss-puzzle-88B 4 months ago

Fix vLLM command

#8 opened 4 months ago by

Update README.md

#3 opened 4 months ago by

New activity in nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4 8 months ago

Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4/llama_nemotron_toolcall_parser_no_streaming.py missing

#1 opened 9 months ago by

Update README and toolcall_parser

#5 opened 8 months ago by

New activity in nvidia/Llama-3_1-Nemotron-Ultra-253B-v1 about 1 year ago

_prepare_generation_config bugfix (failed due to version update in transformers)

#14 opened about 1 year ago by

New activity in nvidia/Llama-3_1-Nemotron-51B-Instruct about 1 year ago

_prepare_generation_config bugfix (failed due to version update in transformers)

#25 opened about 1 year ago by

New activity in nvidia/Llama-3_1-Nemotron-Ultra-253B-CPT-v1 about 1 year ago

_prepare_generation_config bugfix (failed due to version update in transformers)

#2 opened about 1 year ago by

New activity in nvidia/Llama-3_3-Nemotron-Super-49B-v1 over 1 year ago

Nemotron 253B?

#10 opened over 1 year ago by

New activity in nvidia/Llama-3_1-Nemotron-Ultra-253B-v1 over 1 year ago

How come this pruned model has 162 layers

#3 opened over 1 year ago by

New activity in nvidia/Llama-3_1-Nemotron-Ultra-253B-CPT-v1 over 1 year ago

add model card

#1 opened over 1 year ago by

New activity in nvidia/Llama-3_1-Nemotron-51B-Instruct almost 2 years ago

Patching hf bug that creates wrong cache length if only inputs_embeds are passed to the model

#19 opened almost 2 years ago by

DeciLMForCausalLM(DeciLMPreTrainedModel, GenerationMixin) for v4.50

#16 opened almost 2 years ago by

add batch_size attribute to VariableCache

#15 opened almost 2 years ago by

nvidia-open-model-license

#14 opened almost 2 years ago by

nvidia-open-model-license

#13 opened almost 2 years ago by

nvidia-open-model-license

#12 opened almost 2 years ago by

v4.46 support

#7 opened almost 2 years ago by

loading as llama model

#4 opened almost 2 years ago by KnutJaegersberg

v4.45 support

#6 opened almost 2 years ago by