Shaij's picture

2 7 17

Shaij PRO

appoose

·

AI & ML interests

None yet

Recent Activity

upvoted an article about 2 months ago

EuroLLM-9B

liked a model 2 months ago

jinaai/jina-embeddings-v3

liked a model 3 months ago

mobiuslabsgmbh/faster-whisper-large-v3-turbo

View all activity

Organizations

appoose's activity

upvoted an article about 2 months ago

Article

EuroLLM-9B

By

•

Dec 2, 2024

• 105

liked a model 2 months ago

jinaai/jina-embeddings-v3

Feature Extraction • Updated 12 days ago • 1.02M • 669

liked a model 3 months ago

mobiuslabsgmbh/faster-whisper-large-v3-turbo

Updated Oct 8, 2024 • 74.6k • 16

liked 2 models 4 months ago

openai/whisper-large-v3-turbo

Automatic Speech Recognition • Updated Oct 4, 2024 • 2.13M • • 1.78k

Qwen/Qwen2.5-72B-Instruct

Text Generation • Updated 6 days ago • 118k • • 681

upvoted an article 4 months ago

Article

Unlocking Longer Generation with Key-Value Cache Quantization

May 16, 2024

• 38

liked a model 5 months ago

mobiuslabsgmbh/Hermes-3-Llama-3.1-70B_4bitgs64_hqq

Text Generation • Updated Aug 16, 2024 • 5 • 4

updated a model 5 months ago

mobiuslabsgmbh/Hermes-3-Llama-3.1-70B_4bitgs64_hqq

Text Generation • Updated Aug 16, 2024 • 5 • 4

posted an update 5 months ago

Post

2067

Releasing HQQ Llama-3.1-70b 4-bit quantized version! Check it out at mobiuslabsgmbh/Llama-3.1-70b-instruct_4bitgs64_hqq.

Achieves 99% of the base model performance across various benchmarks! Details in the model card.

liked a model 5 months ago

mobiuslabsgmbh/Llama-3.1-70b-instruct_4bitgs64_hqq

Text Generation • Updated Aug 16, 2024 • 21 • 31

liked a model 6 months ago

facebook/sam2-hiera-small

Mask Generation • Updated Aug 7, 2024 • 6.6k • 13

posted an update 6 months ago

Post

1790

Excited to announce the release of our high-quality Llama-3.1 8B 4-bit HQQ/calibrated quantized model! Achieving an impressive 99.3% relative performance to FP16, it also delivers the fastest inference speed for transformers.

mobiuslabsgmbh/Llama-3.1-8b-instruct_4bitgs64_hqq_calib

1 reply

·

liked 2 models 6 months ago

mobiuslabsgmbh/Llama-3.1-8b-instruct_4bitgs64_hqq_calib

Text Generation • Updated Aug 27, 2024 • 68 • 55

mobiuslabsgmbh/Llama-3-8b-instruct_2bitgs64_hqq

Text Generation • Updated Aug 16, 2024 • 32 • 10

updated a model 8 months ago

appoose/aana-soccer-no-pretraining-weight-balanced-2_2.0_720-combined-finetuned

Updated May 30, 2024

upvoted 2 articles 9 months ago

Article

Fine-tune Llama 3 with ORPO

By

•

Apr 22, 2024

• 231

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Apr 15, 2024

• 171

upvoted a collection 10 months ago

Llama3 HQQ

4 items • Updated Aug 13, 2024 • 18

updated a Space 10 months ago

README

Multimodal AI for the world's scale

reacted to osanseviero's post with 🔥 10 months ago

Post

2075

Diaries of Open Source. Part 11 🚀

🚀Databricks release DBRX, potentially the best open access model! A 132B Mixture of Experts with 36B active params and trained on 12 trillion tokens
Blog: https://www.databricks.com/blog/introducing-dbrx-new-state-art-open-llm
Base and instruct models: databricks/dbrx-6601c0852a0cdd3c59f71962
Demo: https://hf.co/spaces/databricks/dbrx-instruct

🤏1-bit and 2-bit quantization exploration using HQQ+
Blog post: https://mobiusml.github.io/1bit_blog/
Models: https://hf.co/collections/mobiuslabsgmbh/llama2-7b-hqq-6604257a96fc8b9c4e13e0fe
GitHub: https://github.com/mobiusml/hqq

📚Cosmopedia: a large-scale synthetic dataset for pre-training - it includes 25 billion tokens and 30 million files
Dataset: HuggingFaceTB/cosmopedia
Blog: https://hf.co/blog/cosmopedia

⭐Mini-Gemini: multi-modal VLMs, from 2B to 34B
Models: https://hf.co/collections/YanweiLi/mini-gemini-6603c50b9b43d044171d0854
Paper: Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models (2403.18814)
GitHub: https://github.com/dvlab-research/MiniGemini

🔥VILA - On Pre-training for VLMs
Models: Efficient-Large-Model/vila-on-pre-training-for-visual-language-models-65d8022a3a52cd9bcd62698e
Paper: VILA: On Pre-training for Visual Language Models (2312.07533)

Misc
👀 FeatUp: a framework for image features at any resolution: mhamilton723/FeatUp FeatUp: A Model-Agnostic Framework for Features at Any Resolution (2403.10516)
🍞ColBERTus Maxiums, a colbertialized embedding model mixedbread-ai/mxbai-colbert-large-v1
🖌️Semantic Palette, a new drawing paradigm ironjr/SemanticPalette
🧑‍⚕️HistoGPT, a vision model that generates accurate pathology reports marr-peng-lab/histogpt https://www.medrxiv.org/content/10.1101/2024.03.15.24304211v1

4 replies

·