Collection of Quantized Models for MoE
Krishna Teja Chitty-Venkata
krishnateja95
AI & ML interests
LLM Optimization, Neural Architecture Search, Quantization, Pruning
Recent Activity
Updated a model 3 days ago: nm-testing/Llama-3.1-8B-Instruct-FP8-block
Authored a paper 5 days ago: MoE-Inference-Bench: Performance Evaluation of Mixture of Expert Large Language and Vision Models
Authored a paper 5 days ago: PagedEviction: Structured Block-wise KV Cache Pruning for Efficient Large Language Model Inference