Hugging Face
Mohammed Khalil
mohamed-khalil
5 followers · 5 following
https://v3xlrm1nOwo1.github.io
v3xlrm1nOwo1
AI & ML interests
ML Researcher || NLP || anime
Recent Activity
Reacted to Jaward's post with ❤️ (5 days ago):
nanoBLT: a simplified, lightweight implementation of a character-level Byte Latent Transformer model (under 500 lines of code). The model is 2x4x2 layers deep (n_layers_encoder, n_layers_latent, n_layers_decoder) and is trained on ~1M bytes of tiny Shakespeare with a patch size of 4. Code: https://github.com/Jaykef/ai-algorithms/blob/main/byte_latent_transformer.ipynb
Upvoted a paper (about 2 months ago): Large Language Diffusion Models
Liked a dataset (about 2 months ago): brando/small-c4-dataset
Organizations
Papers: 1
arxiv:2407.19835
Models: 0
None public yet
Datasets: 4
mohamed-khalil/ATHAR • Viewer • Updated Aug 4, 2024 • 66k • 80 • 9
mohamed-khalil/KaidanNihonbunka • Viewer • Updated Apr 15, 2024 • 8.56k • 30
mohamed-khalil/AnimeSongsLyrics • Viewer • Updated Mar 5, 2024 • 23.6k • 38 • 4
mohamed-khalil/AnimeQuotes • Viewer • Updated Feb 21, 2024 • 10.4k • 28 • 4