Paper: Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling • arXiv:2502.06703 • Published Feb 10 • 153 upvotes
Article: KV Caching Explained: Optimizing Transformer Inference Efficiency • By not-lain • Jan 30 • 72 upvotes
Model: jiogenes/Llama-2-7b-hf-finetuned-open-korean-instructions • Text Generation • Updated Jan 16, 2024 • 17