Hey @shantanuagarwal, glad you enjoyed the article! I haven't tried it myself, but you should be able to use PyTorch's FlexAttention API for this. Have a look at the tutorial here: https://pytorch.org/blog/flexattention/. The section "Document Masking/Jagged Sequences" covers exactly these packed-sequence masks; a rough sketch of what it could look like is below.
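Again, I haven't run this exact snippet, but following that section of the tutorial it would look roughly like this. The shapes, document lengths, and the `document_ids` tensor are made up for illustration; it assumes PyTorch >= 2.5 and a CUDA device:

```python
import torch
from torch.nn.attention.flex_attention import flex_attention, create_block_mask

# Toy shapes, just for illustration
B, H, SEQ_LEN, HEAD_DIM = 1, 8, 16, 64

# document_ids[i] says which packed document token i belongs to,
# e.g. three documents of length 6, 6 and 4 packed into one sequence.
document_ids = torch.tensor([0] * 6 + [1] * 6 + [2] * 4, device="cuda")

def document_causal_mask(b, h, q_idx, kv_idx):
    # causal within the packed sequence AND no attention across document boundaries
    causal = q_idx >= kv_idx
    same_doc = document_ids[q_idx] == document_ids[kv_idx]
    return causal & same_doc

# Build the block mask once per packing layout; B=None / H=None broadcasts it
# over the batch and head dimensions.
block_mask = create_block_mask(
    document_causal_mask, B=None, H=None, Q_LEN=SEQ_LEN, KV_LEN=SEQ_LEN, device="cuda"
)

q, k, v = (torch.randn(B, H, SEQ_LEN, HEAD_DIM, device="cuda") for _ in range(3))
out = flex_attention(q, k, v, block_mask=block_mask)  # (B, H, SEQ_LEN, HEAD_DIM)
```

For real training you'd want to wrap `flex_attention` with `torch.compile` as the tutorial suggests, and rebuild the block mask whenever the packing layout of the batch changes.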
Lukas