Sam Gillham's picture

Sam Gillham PRO

nomadicsynth

AI & ML interests

Reasoning, self-learning, AI-powered research

Recent Activity

Organizations

Neon Cortex's profile picture

nomadicsynth's activity

replied to their post 2 days ago
view reply

I'm attempting to use a 7B LLM, Llama in this case, with an embedding head stuck on the end instead of the lm_head. I used an LLM to rank a ton of randomly selected pairs of papers based on if they have good connections, and trained the embedding head on triplets mined from those ranked pairs.

The idea is for the embedding head to learn to align features from paper abstracts that complement each other.

this is the first version and yeah, I'm not overly impressed. I think I'm seeing results that kinda vibe with the concept sometimes, but I think the ranking criteria for the dataset were a bit loose. I'm going to try making a new dataset with better, more strict, more nuanced criteria and train a second version of the model from that.

replied to their post 3 days ago
view reply

Thanks for letting me know. I've fixed the issue. Feel free to try again.

posted an update 3 days ago
view post
Post
1989
I Did a Thing!

I made an embedding model to find answers in research papers. It goes deeper than plain "semantic search" by identifying deeply reasoned connections and interdisciplinary insights that might have been overlooked. The goal is to find the solutions that might have been missed and to uncover answers that are already out there.

I’ve set up a demo Space - nomadicsynth/inkling . It’s early days, and I’d love some feedback on the model’s results. Try it out and let me know what you think!

Oh, and if it finds your Nobel-winning answer, I want a cut! 😉
·
replied to their post 3 days ago
view reply

I think we can extract that Harvard knowledge and distribute it in the form of properly open models. Get them chatting with our LLMs and train on the collected knowledge. Mwahaha!

They do it to us, after all.

replied to their post 3 days ago
replied to their post 3 days ago
view reply

it's pretty exciting to see the newer, more powerful hardware coming down the pipeline. with better hardware becoming more prolific I hope we can see expansions of concepts like BOINC and Folding@Home and newer ideas as well. It's impossible for you or I to compete with BigAI, but with federated learning techniques and enough people I think it can be done, at least "good enough". It sounds like the kind of thing you might be interested in, yeah?

reacted to MonsterMMORPG's post with 😎 15 days ago
view post
Post
2301
30 seconds hard test on FramePack - [0] a man talking , [5] a man crying , [10] a man smiling , [15] a man frowning , [20] a man sleepy , [25] a man going crazy - i think result is excellent when we consider how hard this test is - Generated with SECourses FramePack App V40

App link and 1-click installers for Windows, RunPod and Massed Compute here : https://www.patreon.com/posts/126855226

I got the prompt using idea from this pull request : https://github.com/lllyasviel/FramePack/pull/218/files

Not exactly same implementation but i think pretty accurate when considering that it is a 30 second 30 fps video at 840p resolution
reacted to samihalawa's post with 👀 16 days ago
view post
Post
2412
SkyReels-V2 INFINITE VIDEO🔥♾️🎬 UNLIMITED duration video generation model by Skywork.

> “Finally is here. An Open-Source model that achieves what we all have waiting for: Infinite Length Videos.’’😮

Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought (2504.05599)

Model: Skywork/SkyReels-V2-T2V-14B-720P

✨ 1.3B & 14B
✨ Generates infinite length videos using Diffusion Forcing with diffusion models + autoregressive methods
reacted to clem's post with 🔥 16 days ago
view post
Post
3979
Energy is a massive constraint for AI but do you even know what energy your chatGPT convos are using?

We're trying to change this by releasing ChatUI-energy, the first interface where you see in real-time what energy your AI conversations consume. Great work from @jdelavande powered by spaces & TGI, available for a dozen of open-source models like Llama, Mistral, Qwen, Gemma and more.

jdelavande/chat-ui-energy

Should all chat interfaces have this? Just like ingredients have to be shown on products you buy, we need more transparency in AI for users!
  • 3 replies
·
reacted to JLouisBiz's post with 👍 27 days ago
view post
Post
3529
Article: https://huggingface.co/blog/JLouisBiz/semantical-website-links

You don't need to do the tedious work of finding all those links on your huge website.

Automating semantic links on websites using Large Language Models (LLMs) enhances user experience and efficiency. Here's a simplified workflow:

1. Store LLM embeddings in PostgreSQL: Use the vector data type to store text embeddings generated by an LLM.
2. Divide page texts into chunks for processing.
3. Generate embeddings using an LLM for each chunk of text.
4. Create template markup around specific terms needing links.

An automated program then:

- Converts marked-up terms to their corresponding LLMs' embeddings,
- Compares these with stored database embeddings (using cosine similarity),
- Identifies the most relevant page based on highest similarity score, and
- Automatically adds a link from the original content to this contextually related information.

This process improves navigation by directing users to highly contextual pages. It saves time as it automates creating semantic links while maintaining accuracy.
posted an update 29 days ago
view post
Post
2357
What if intelligence didn’t belong to the rich?
What if insight, planning, and innovation were available to everyone?
What if we actually leveled the playing field — not by force, but by toolset?
·
reacted to onekq's post with 🔥 about 2 months ago
view post
Post
3758
Folks, let's get ready.🥳 We will be busy soon. 😅🤗https://github.com/huggingface/transformers/pull/36878
reacted to clem's post with 🤗 about 2 months ago
view post
Post
4672
We just crossed 1,500,000 public models on Hugging Face (and 500k spaces, 330k datasets, 50k papers). One new repository is created every 15 seconds. Congratulations all!
·
reacted to samihalawa's post with 🧠 3 months ago
view post
Post
1782
✅BEST DEBUG PROMPT
Language: Any. 🌀Project Type: Any

What prompt if sent to you  will make you  detect and fix all the code crashing  issues in the COMPLETE codebase so I don't have to ask you to fix again and again ?
Step 1. Gimme such prompt
Step 2 Follow it yourself quietly and COMPLETELY
Step3: State  that if you are asked again about finding fatal bugs and logic issues and inconsistencies in the current codebase you would not be able  to  find more. (You cannot lie, so you must make all the code adjustments necessary prior to  such statement).

reacted to AdinaY's post with 🤗 4 months ago
reacted to prithivMLmods's post with ❤️ 5 months ago
reacted to fdaudens's post with 👀 5 months ago
view post
Post
1358
Did a fun experiment: What are the main themes emerging from the 100+ Nieman Journalism Lab predictions for 2025?

I used natural language processing to cluster and map them — really helps spot patterns that weren't obvious when reading predictions one by one. So what will shape journalism next year? A lot of AI and US politics (surprise!), but there's also this horizontal axis that spans from industry strategies to deep reflections on how to talk to the public.

Click any dot to explore the original prediction. What themes surprise/interest you the most?

👉 fdaudens/nieman_lab_2025_predictions_visualization

P.s.: I discovered that Nieman Lab's content is under Creative Commons license!