ssmits/Llama-3.1-Nemotron-92B-Instruct-HF-early
Text Generation • 92B
Strategic merging of language models through layer-level architecture optimization.