9 3 34

Gordon

GordonM

https://gordonmcd.com

AI & ML interests

Data Science for good

Recent Activity

reacted to MoritzLaurer's post with 👍 about 1 month ago

Quite excited by the ModernBERT release! 0.15/0.4B small, 2T modern pre-training data and tokenizer with code, 8k context window, great efficient model for embeddings & classification! This will probably be the basis for many future SOTA encoders! And I can finally stop using DeBERTav3 from 2021 :D Congrats @answerdotai, @LightOnIO and collaborators like @tomaarsen ! Paper and models here 👇https://huggingface.co/collections/answerdotai/modernbert-67627ad707a4acbf33c41deb

reacted to MoritzLaurer's post with 🔥 about 1 month ago

🚀 Releasing a new zeroshot-classifier based on ModernBERT! Some key takeaways: - ⚡ Speed & efficiency: It's multiple times faster and uses significantly less memory than DeBERTav3. You can use larger batch sizes and enabling bf16 (instead of fp16) gave me a ~2x speed boost as well - 📉 Performance tradeoff: It performs slightly worse than DeBERTav3 on average across my zeroshot classification task collection - 🧠 Use cases: I recommend using it for scenarios requiring speed and a larger context window (8k). - 💡 What’s next? I’m preparing a newer version trained on better + longer synthetic data to fully leverage the 8k context window and improve upon the training mix of my older zeroshot-v2.0 models. I also hope that there will be a multilingual variant in the future. Great work by https://huggingface.co/answerdotai ! If you’re looking for a high-speed zeroshot classifier, give it a try! 📄 Resources below: 👇 Base model: https://huggingface.co/MoritzLaurer/ModernBERT-base-zeroshot-v2.0 Large model: https://huggingface.co/MoritzLaurer/ModernBERT-large-zeroshot-v2.0 Updated zeroshot collection: https://huggingface.co/collections/MoritzLaurer/zeroshot-classifiers-6548b4ff407bb19ff5c3ad6f ModernBERT collection with paper: https://huggingface.co/collections/answerdotai/modernbert-67627ad707a4acbf33c41deb

liked a model 2 months ago

Varosa/SeamlessExpressive

View all activity

Organizations

GordonM's activity

reacted to MoritzLaurer's post with 👍 about 1 month ago

Post

2631

Quite excited by the ModernBERT release! 0.15/0.4B small, 2T modern pre-training data and tokenizer with code, 8k context window, great efficient model for embeddings & classification!

This will probably be the basis for many future SOTA encoders! And I can finally stop using DeBERTav3 from 2021 :D

Congrats @answerdotai , @LightOnIO and collaborators like @tomaarsen !

Paper and models here 👇https://huggingface.co/collections/answerdotai/modernbert-67627ad707a4acbf33c41deb

3 replies

reacted to MoritzLaurer's post with 🔥 about 1 month ago

Post

2258

answerdotai !

If you’re looking for a high-speed zeroshot classifier, give it a try!

📄 Resources below: 👇
Base model: MoritzLaurer/ModernBERT-base-zeroshot-v2.0
Large model: MoritzLaurer/ModernBERT-large-zeroshot-v2.0
Updated zeroshot collection: MoritzLaurer/zeroshot-classifiers-6548b4ff407bb19ff5c3ad6f
ModernBERT collection with paper: answerdotai/modernbert-67627ad707a4acbf33c41deb

liked a model 2 months ago

Varosa/SeamlessExpressive

Updated Jan 1, 2024 • 2

liked a Space 2 months ago

Real Time Transcriber

📊

real-time transcriber

upvoted a collection 7 months ago

Seamless Communication

Collection

A significant step towards removing language barriers through expressive, fast and high-quality AI translation. • 16 items • Updated Jan 16, 2024 • 154

updated a Space 8 months ago

README

📉

liked 2 Spaces 9 months ago

263

Latent Navigation

🪐

Travel through the model latent space

121

Candle Segment Anything Wasm

🕯

Segment Anything Model on the Browser with Candle/Rust/WASM

reacted to Xenova's post with ❤️😎🚀 10 months ago

Post

15144

I'm excited to announce that Transformers.js V3 is finally available on NPM! 🔥 State-of-the-art Machine Learning for the web, now with WebGPU support! 🤯⚡️

Install it from NPM with:
𝚗𝚙𝚖 𝚒 @𝚑𝚞𝚐𝚐𝚒𝚗𝚐𝚏𝚊𝚌𝚎/𝚝𝚛𝚊𝚗𝚜𝚏𝚘𝚛𝚖𝚎𝚛𝚜

or via CDN, for example: https://v2.scrimba.com/s0lmm0qh1q

Segment Anything demo: webml-community/segment-anything-webgpu

5 replies

liked a model 10 months ago

NovaSearch/stella_en_400M_v5

reacted to Xenova's post with 🚀 10 months ago

Post

14072

I can't believe this... Phi-3.5-mini (3.8B) running in-browser at ~90 tokens/second on WebGPU w/ Transformers.js and ONNX Runtime Web! 🤯 Since everything runs 100% locally, no messages are sent to a server — a huge win for privacy!
- 🤗 Demo: webml-community/phi-3.5-webgpu
- 🧑‍💻 Source code: https://github.com/huggingface/transformers.js-examples/tree/main/phi-3.5-webgpu

11 replies

upvoted a collection 10 months ago

InternVL2.0

Collection

Expanding Performance Boundaries of Open-Source MLLM • 15 items • Updated Apr 20 • 89

reacted to Xenova's post with 🔥 10 months ago

Post

15144

5 replies

liked a model 11 months ago

second-state/Mistral-Nemo-Instruct-2407-GGUF

Text Generation • Updated Jul 24, 2024 • 3.31k • 66

liked a Space 11 months ago

Convert to ONNX

☯

Convert a Hugging Face model to ONNX format

reacted to Xenova's post with ❤️🧠👀 11 months ago

Post

6901

Introducing Whisper Timestamped: Multilingual speech recognition with word-level timestamps, running 100% locally in your browser thanks to 🤗 Transformers.js! Check it out!
👉 Xenova/whisper-word-level-timestamps 👈

This unlocks a world of possibilities for in-browser video editing! 🤯 What will you build? 😍

Source code: https://github.com/xenova/transformers.js/tree/v3/examples/whisper-word-timestamps

1 reply