1 69 52

robbie

robb-0

AI & ML interests

None yet

Recent Activity

reacted to Akjava's post with 👍 43 minutes ago

Initial API-Based Smolagents and Linear.app Integration Example https://huggingface.co/spaces/Akjava/linear-app-api-smolagents In short,this example contain get_todo_issue() tool and add_comment(),change_state_reviewing() function to linear.app Large language models, like 70B parameter models, can often readily utilize tools such as add_comment or change_state, potentially handling multiple issues concurrently. However, smaller models may require repeated calls to a tool or even fail to utilize it entirely. Therefore, this initial example focuses on the get_todo_issue() tool.

reacted to merve's post with 😎 1 day ago

IBM released https://huggingface.co/ibm-granite/granite-vision-3.1-2b-preview, a small vision LM with impressive performance on different tasks 😮🔥 it comes with transformers and vLLM support from the get-go 💗 you can run it in Colab T4, so I built a notebook to put it to test, find it here: https://github.com/merveenoyan/smol-vision/blob/main/inference_gists/IBM_Granite_Vision.ipynb

upvoted a paper 1 day ago

Modular Training of Neural Networks aids Interpretability

View all activity

Organizations

robb-0's activity

upvoted 3 papers 1 day ago

Modular Training of Neural Networks aids Interpretability

Paper • 2502.02470 • Published Feb 4 • 1

The Empirical Impact of Reducing Symmetries on the Performance of Deep Ensembles and MoE

Paper • 2502.17391 • Published 13 days ago • 1

Efficient Language Modeling for Low-Resource Settings with Hybrid RNN-Transformer Architectures

Paper • 2502.00617 • Published Feb 2 • 1

upvoted an article 1 day ago

Article

The Open Arabic LLM Leaderboard 2

28 days ago

• 29

upvoted a paper 1 day ago

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published 25 days ago • 143

upvoted an article 3 days ago

Article

Welcome the Falcon 3 Family of Open Models!

Dec 17, 2024

• 121

upvoted a paper 6 days ago

Demystifying the Token Dynamics of Deep Selective State Space Models

Paper • 2410.03292 • Published Oct 4, 2024 • 1

upvoted 13 papers 7 days ago

Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Paper • 2401.09417 • Published Jan 17, 2024 • 61

LION: Linear Group RNN for 3D Object Detection in Point Clouds

Paper • 2407.18232 • Published Jul 25, 2024 • 2

xLSTM: Extended Long Short-Term Memory

Paper • 2405.04517 • Published May 7, 2024 • 13

ZigMa: Zigzag Mamba Diffusion Model

Paper • 2403.13802 • Published Mar 20, 2024 • 18

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published 17 days ago • 160

WebGames: Challenging General-Purpose Web-Browsing AI Agents

Paper • 2502.18356 • Published 12 days ago • 11

On Relation-Specific Neurons in Large Language Models

Paper • 2502.17355 • Published 13 days ago • 6

Adam: A Method for Stochastic Optimization

Paper • 1412.6980 • Published Dec 22, 2014 • 2

From Markov to Laplace: How Mamba In-Context Learns Markov Chains

Paper • 2502.10178 • Published 24 days ago • 1

Going Deeper with Convolutions

Paper • 1409.4842 • Published Sep 17, 2014 • 2