I'm attempting to use a 7B LLM, Llama in this case, with an embedding head stuck on the end instead of the lm_head. I used an LLM to rank a ton of randomly selected pairs of papers based on whether they have good connections, then trained the embedding head on triplets mined from those ranked pairs.
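Roughly, the setup looks something like the sketch below. This is simplified and the specifics (model name, last-token pooling, projection size, triplet margin) are just placeholders for what I'm actually running, not the exact code:

```python
import torch
import torch.nn as nn
from transformers import AutoModel

class LlamaEmbedder(nn.Module):
    """Llama backbone with a small projection head in place of the lm_head."""
    def __init__(self, base_model="meta-llama/Llama-2-7b-hf", embed_dim=1024):
        super().__init__()
        # AutoModel loads the bare transformer (no lm_head)
        self.backbone = AutoModel.from_pretrained(base_model, torch_dtype=torch.bfloat16)
        hidden = self.backbone.config.hidden_size  # 4096 for the 7B model
        self.head = nn.Linear(hidden, embed_dim, dtype=torch.bfloat16)

    def forward(self, input_ids, attention_mask):
        out = self.backbone(input_ids=input_ids, attention_mask=attention_mask)
        # pool with the hidden state of the last non-padding token (assumes right padding)
        last_idx = attention_mask.sum(dim=1) - 1
        pooled = out.last_hidden_state[torch.arange(input_ids.size(0)), last_idx]
        return nn.functional.normalize(self.head(pooled), dim=-1)

# triplet loss over (anchor, positive, negative) abstracts mined from the ranked pairs
loss_fn = nn.TripletMarginLoss(margin=0.3)
```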
The idea is for the embedding head to learn to align features from paper abstracts that complement each other.
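At inference it's basically just cosine similarity between the learned embeddings of two abstracts, something like this (uses the class from the sketch above; `abstract_a` / `abstract_b` are placeholder strings):

```python
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-hf")
tok.pad_token = tok.eos_token   # Llama has no pad token by default
tok.padding_side = "right"      # pooling above assumes right padding
model = LlamaEmbedder().eval()

def embed(texts):
    batch = tok(texts, padding=True, truncation=True, max_length=512, return_tensors="pt")
    with torch.no_grad():
        return model(batch["input_ids"], batch["attention_mask"])

a, b = embed([abstract_a, abstract_b])
score = (a @ b).item()  # embeddings are normalized, so this is cosine similarity
```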
This is the first version and yeah, I'm not overly impressed. I'm seeing results that sometimes vibe with the concept, but I think the ranking criteria for the dataset were a bit loose. I'm going to build a new dataset with stricter, more nuanced criteria and train a second version of the model on that.