Open to Collab

Mohammed Hamdy

mmhamdy

hugging-science

AI & ML interests

TechBio | AI4Sci | NLP | Reinforcement Learning

Recent Activity

posted an update about 2 months ago

The new DeepSeek Engram paper is super fun! It also integrates mHC, and I suspect they're probably releasing all these papers to make the V4 report of reasonable length😄 Here's a nice short summary from Gemini

upvoted an article 3 months ago

Continuous batching from first principles

reacted to Kseniase's post with ❤️ 4 months ago

12 Types of JEPA

Since Yann LeCun, together with Randall Balestriero, released a new paper on JEPA (Joint-Embedding Predictive Architecture), laying out its theory and introducing an efficient practical version called LeJEPA, we figured you might need even more JEPA. Here are 7 recent JEPA variants plus 5 iconic ones:

1. LeJEPA → https://huggingface.co/papers/2511.08544 Explains a full theory for JEPAs, defining the “ideal” JEPA embedding as an isotropic Gaussian, and proposes the SIGReg objective to push JEPA toward this ideal, resulting in the practical LeJEPA
2. JEPA-T → https://huggingface.co/papers/2510.00974 A text-to-image model that tokenizes images and captions with a joint predictive Transformer, enhances fusion with cross-attention and text embeddings before training loss, and generates images by iteratively denoising visual tokens conditioned on text
3. Text-JEPA → https://huggingface.co/papers/2507.20491 Converts natural language into first-order logic, with a Z3 solver handling reasoning, enabling efficient, explainable QA with far lower compute than large LLMs
4. N-JEPA (Noise-based JEPA) → https://huggingface.co/papers/2507.15216 Connects self-supervised learning with diffusion-style noise by using noise-based masking and multi-level schedules, especially improving visual classification
5. SparseJEPA → https://huggingface.co/papers/2504.16140 Adds sparse representation learning to make embeddings more interpretable and efficient. It groups latent variables by shared semantic structure using a sparsity penalty while preserving accuracy
6. TS-JEPA (Time Series JEPA) → https://huggingface.co/papers/2509.25449 Adapts JEPA to time series by learning latent self-supervised representations and predicting future latents for robustness to noise and confounders

Read further below ↓

If you like it, also subscribe to the Turing Post: https://www.turingpost.com/subscribe

liked a Space 4 months ago

Unlocking On-Policy Distillation for Any Model Family

Visualize on-policy distillation for any model family

liked a dataset 5 months ago

transferable-samplers/many-peptides-md

Updated Dec 15, 2025 • 413k • 7

liked 2 Spaces 5 months ago

Science Release Heatmap

Explore AI4Science contributors by organization and tag

Maintain the unmaintainable

Explore the complex relationships between 400+ machine learning models

liked a Space 6 months ago

Transformers Timeline

Interactive timeline to explore the 🤗Transformers models

liked a model 7 months ago

rednote-hilab/dots.ocr

Image-Text-to-Text • Updated Oct 31, 2025 • 206k • 1.27k

liked a dataset 9 months ago

nvidia/Nemotron-Personas-USA

Viewer • Updated Dec 16, 2025 • 1M • 3.51k • 260

liked a model 9 months ago

PlayHT/PlayDiffusion

Updated Jul 29, 2025 • 109

liked a model 10 months ago

facebook/KernelLLM

Text Generation • Updated Jan 15 • 342 • 193

liked a model 12 months ago

sesame/csm-1b

Text-to-Speech • Updated Dec 1, 2025 • 139k • 2.34k

liked a Space 12 months ago

The Distill Template

Craft Beautiful Blogs

liked 2 models about 1 year ago

ElectricAlexis/NotaGen

Updated Feb 26, 2025 • 150

microsoft/wham

Updated Dec 17, 2025 • 77 • 268

liked a Space about 1 year ago

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

liked a model about 1 year ago

hexgrad/Kokoro-82M

Text-to-Speech • Updated Apr 10, 2025 • 9.16M • 5.81k

liked a dataset about 1 year ago

HuggingFaceH4/MATH-500

Viewer • Updated Dec 15, 2025 • 500 • 103k • 287

liked a model about 1 year ago

answerdotai/ModernBERT-base

Fill-Mask • 0.1B • Updated Jan 15, 2025 • 1.27M • 1.01k

liked a Space about 1 year ago

Scaling test-time compute

Boost LLM answers with search‑guided test‑time compute

liked a model about 1 year ago

CohereLabs/c4ai-command-r7b-12-2024

Text Generation • Updated Oct 30, 2025 • 11k • 409

liked a dataset over 1 year ago

CohereLabs/Global-MMLU

Viewer • Updated Aug 14, 2025 • 602k • 9.72k • 150