Khalil Guetari

KhalilGuetari

AI & ML interests

None yet

Recent Activity

reacted to prithivMLmods's post with 🔥 8 days ago

Gemma-3-4B : Image and Video Inference 🖼️🎥 🧤Space: https://huggingface.co/spaces/prithivMLmods/Gemma-3-Multimodal 🥠Git : https://github.com/PRITHIVSAKTHIUR/Gemma-3-Multimodal @gemma3 : {Tag + Space_+ 'prompt'} @video-infer : {Tag + Space_+ 'prompt'} + Gemma3-4B : https://huggingface.co/google/gemma-3-4b-it + By default, it runs : https://huggingface.co/prithivMLmods/Qwen2-VL-OCR-2B-Instruct Gemma 3 Technical Report : https://storage.googleapis.com/deepmind-media/gemma/Gemma3Report.pdf

liked a Space 20 days ago

stabilityai/stable-diffusion-3.5-large

liked a dataset 27 days ago

momentslab/AstroCaptions

View all activity

Organizations

KhalilGuetari's activity

reacted to prithivMLmods's post with 🔥 8 days ago

Post

2450

Gemma-3-4B : Image and Video Inference 🖼️🎥

🧤Space: prithivMLmods/Gemma-3-Multimodal
🥠Git : https://github.com/PRITHIVSAKTHIUR/Gemma-3-Multimodal

@gemma3 : {Tag + Space_+ 'prompt'}
@video-infer : {Tag + Space_+ 'prompt'}

+ Gemma3-4B : google/gemma-3-4b-it
+ By default, it runs : prithivMLmods/Qwen2-VL-OCR-2B-Instruct

Gemma 3 Technical Report : https://storage.googleapis.com/deepmind-media/gemma/Gemma3Report.pdf

1 reply

liked a Space 20 days ago

1.84k

Stable Diffusion 3.5 Large

🏃

Generate images with SD3.5

liked a dataset 27 days ago

momentslab/AstroCaptions

Viewer • Updated May 20, 2024 • 44.1k • 103 • 4

updated a Space about 1 month ago

First Agent Template

⚡

Find current time in any timezone

reacted to cfahlgren1's post with 👀 5 months ago

Post

1164

If you are like me, I like to find up and coming datasets and spaces before everyone else.

I made a trending repo space cfahlgren1/trending-repos where it shows:

- New up and coming Spaces in the last day
- New up and coming Datasets in the last 2 weeks

It's a really good way to find some new gems before they become popular. For example, someone is working on a way to dynamically create assets inside a video game here: gptcall/AI-Game-Creator

reacted to fdaudens's post with 👍 5 months ago

Post

2363

🔍 NYT leveraged AI to investigate election interference by analyzing 400+ hours of recorded meetings - that's 5M words of data!

AI spotted patterns, humans verified facts. Every AI-flagged quote was manually verified against source recordings. Really appreciate that they published their full methodology - transparency matters when using AI in journalism.

A perfect blend of tech & journalism.

The future of journalism isn't robots replacing reporters - it's AI helping humans process massive datasets more efficiently. Sometimes the most powerful tech solutions are the least flashy ones.

Read the article: https://www.nytimes.com/interactive/2024/10/28/us/politics/inside-the-movement-behind-trumps-election-lies.html?unlocked_article_code=1.Vk4.ucv9.dbHVquTQaf0G&smid=nytcore-ios-share

upvoted an article 6 months ago

Article

FineVideo: behind the scenes

Sep 23, 2024

• 30

authored a paper 7 months ago

Multimodal Chaptering for Long-Form TV Newscast Video

Paper • 2406.17590 • Published Mar 20, 2024 • 2

upvoted a paper 7 months ago

Multimodal Chaptering for Long-Form TV Newscast Video

Paper • 2406.17590 • Published Mar 20, 2024 • 2

updated a collection 7 months ago

Moments Lab Research papers

Collection

All of Moments Lab Research papers available on Hugging Face • 3 items • Updated Sep 2, 2024 • 1

upvoted a collection 7 months ago

Moments Lab Research papers

Collection

All of Moments Lab Research papers available on Hugging Face • 3 items • Updated Sep 2, 2024 • 1

authored a paper 9 months ago

Towards Retrieval Augmented Generation over Large Video Libraries

Paper • 2406.14938 • Published Jun 21, 2024 • 21

upvoted a paper 9 months ago

Towards Retrieval Augmented Generation over Large Video Libraries

Paper • 2406.14938 • Published Jun 21, 2024 • 21

upvoted an article 10 months ago

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

May 14, 2024

• 243

liked a dataset 10 months ago

isidentical/moondream2-coyo-5M-captions

Viewer • Updated May 13, 2024 • 5.01M • 122 • 57

authored a paper 10 months ago

Inserting Faces inside Captions: Image Captioning with Attention Guided Merging

Paper • 2405.02305 • Published Mar 20, 2024 • 2

updated a dataset 10 months ago

momentslab/AstroCaptions

Viewer • Updated May 20, 2024 • 44.1k • 103 • 4

New activity in momentslab/AstroCaptions 10 months ago

[bot] Conversion to Parquet

#1 opened about 1 year ago by

parquet-converter

upvoted a paper 10 months ago

Inserting Faces inside Captions: Image Captioning with Attention Guided Merging

Paper • 2405.02305 • Published Mar 20, 2024 • 2

reacted to merve's post with 🔥 10 months ago

Post

1769

New open Vision Language Model by @Google : PaliGemma 💙🤍

📝 Comes in 3B, pretrained, mix and fine-tuned models in 224, 448 and 896 resolution
🧩 Combination of Gemma 2B LLM and SigLIP image encoder
🤗 Supported in transformers

PaliGemma can do..
🧩 Image segmentation and detection! 🤯
📑 Detailed document understanding and reasoning
🙋 Visual question answering, captioning and any other VLM task!

Read our blog 🔖 hf.co/blog/paligemma
Try the demo 🪀 hf.co/spaces/google/paligemma
Check out the Spaces and the models all in the collection 📚 google/paligemma-release-6643a9ffbf57de2ae0448dda
Collection of fine-tuned PaliGemma models google/paligemma-ft-models-6643b03efb769dad650d2dda

13 replies