Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
82.6
TFLOPS
30
5
85
Mahwiz Khalil
mahwizzzz
Follow
Alefiah's profile picture
victor's profile picture
HuzaifaDev's profile picture
16 followers
·
37 following
https://topmate.io/mahwiz
mwzkhalil
mwzkhalil
mahwiz-khalil
AI & ML interests
Low-Resource NLP
Recent Activity
published
a dataset
about 6 hours ago
mahwizzzz/tiny-llama-urdu-ds
updated
a model
about 7 hours ago
mahwizzzz/CAT
reacted
to
Kseniase
's
post
with 👍
about 9 hours ago
12 Foundational AI Model Types Let’s refresh some fundamentals today to stay fluent in the what we all work with. Here are some of the most popular model types that shape the vast world of AI (with examples in the brackets): 1. LLM - Large Language Model (GPT, LLaMA) -> https://huggingface.co/papers/2402.06196 + history of LLMs: https://www.turingpost.com/t/The%20History%20of%20LLMs It's trained on massive text datasets to understand and generate human language. They are mostly build on Transformer architecture, predicting the next token. LLMs scale by increasing overall parameter count across all components (layers, attention heads, MLPs, etc.) 2. SLM - Small Language Model (TinyLLaMA, Phi models, SmolLM) https://huggingface.co/papers/2410.20011 Lightweight LM optimized for efficiency, low memory use, fast inference, and edge use. SLMs work using the same principles as LLMs 3. VLM - Vision-Language Model (CLIP, Flamingo) -> https://huggingface.co/papers/2405.17247 Processes and understands both images and text. VLMs map images and text into a shared embedding space or generate captions/descriptions from both 4. MLLM - Multimodal Large Language Model (Gemini) -> https://huggingface.co/papers/2306.13549 A large-scale model that can understand and process multiple types of data (modalities) — usually text + other formats, like images, videos, audio, structured data, 3D or spatial inputs. MLLMs can be LLMs extended with modality adapters or trained jointly across vision, text, audio, etc. 5. LAM - Large Action Model (InstructDiffusion, RT-2) -> https://huggingface.co/papers/2412.10047 Understands and generates action sequences by predicting action tokens (discrete/continuous instructions) that guide agents. Trained on behavior datasets, LAMs generalize across tasks, environments, and modalities - video, sensor data, etc. Read about LRM, MoE, SSM, RNN, CNN, SAM and LNN below👇 Also, subscribe to the Turing Post: https://www.turingpost.com/subscribe
View all activity
Organizations
Posts
1
view post
Post
2001
[ { "from": "human", "value": "First post 🤗" }]
spaces
1
Running
Local ChatGPT
😻
models
25
Sort: Recently updated
mahwizzzz/CAT
Updated
about 7 hours ago
•
1
mahwizzzz/asd
Updated
11 days ago
•
22
mahwizzzz/urdu-bitnet-medium
Updated
Apr 27
mahwizzzz/stable-tts-urdu
Updated
Apr 26
mahwizzzz/urdu-g2p
Updated
Apr 23
•
4
mahwizzzz/tiny-llama-urdu
Updated
Apr 18
mahwizzzz/ur-tts
Updated
Apr 5
•
5
mahwizzzz/orpheus-urdu-tts
Updated
Apr 5
•
37
•
2
mahwizzzz/ur_gpt_tts
Updated
Apr 3
mahwizzzz/urdu_text_correction
Text2Text Generation
•
Updated
Mar 21
•
9
Expand 25 models
datasets
18
Sort: Recently updated
mahwizzzz/tiny-llama-urdu-ds
Updated
about 6 hours ago
mahwizzzz/urdu-g2p
Viewer
•
Updated
Apr 19
•
30.7k
•
25
mahwizzzz/UAT_expresso
Updated
Apr 11
•
6
mahwizzzz/combined-chartqa
Updated
Mar 27
•
7
mahwizzzz/urdu_error_correction
Viewer
•
Updated
Mar 20
•
600k
•
25
mahwizzzz/Urdu_Rekhta
Viewer
•
Updated
Mar 8
•
1.31k
•
42
•
3
mahwizzzz/sindhi_alpaca_yc_filtered
Viewer
•
Updated
Mar 3
•
28.9k
•
19
mahwizzzz/UrduLegal
Viewer
•
Updated
Feb 13
•
1.07k
•
17
mahwizzzz/UAT
Viewer
•
Updated
Feb 1
•
20.4k
•
212
mahwizzzz/Cases-Instruct-pk
Viewer
•
Updated
Jun 24, 2024
•
1.41k
•
30
Expand 18 datasets