Bertoti
Giuliano
AI & ML interests
None yet
Organizations
Video Gen
Medicine
Agents
-
AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation Framework
Paper • 2308.08155 • Published • 8 -
If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents
Paper • 2401.00812 • Published • 11 -
DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines
Paper • 2310.03714 • Published • 35 -
ReAct: Synergizing Reasoning and Acting in Language Models
Paper • 2210.03629 • Published • 27
Agents GUI
-
ShowUI: One Vision-Language-Action Model for GUI Visual Agent
Paper • 2411.17465 • Published • 88 -
OmniParser for Pure Vision Based GUI Agent
Paper • 2408.00203 • Published • 26 -
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction
Paper • 2412.04454 • Published • 66 -
THUDM/cogagent-9b-20241220
Image-Text-to-Text • 14B • Updated • 1.25k • 53
Voice
text2sql
LLM Personalization
Agents SWE
-
OpenDevin: An Open Platform for AI Software Developers as Generalist Agents
Paper • 2407.16741 • Published • 73 -
CodeACT: Code Adaptive Compute-efficient Tuning Framework for Code LLMs
Paper • 2408.02193 • Published • 1 -
CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models
Paper • 2411.04329 • Published -
SWE-Gym/OpenHands-7B-Agent
Updated • 9
LLM Reasoning
-
STaR: Bootstrapping Reasoning With Reasoning
Paper • 2203.14465 • Published • 8 -
Let's Verify Step by Step
Paper • 2305.20050 • Published • 10 -
Training Large Language Models to Reason in a Continuous Latent Space
Paper • 2412.06769 • Published • 87 -
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions
Paper • 2411.14405 • Published • 62
Multimodal
Voice
Video Gen
text2sql
Medicine
LLM Personalization
Agents
-
AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation Framework
Paper • 2308.08155 • Published • 8 -
If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents
Paper • 2401.00812 • Published • 11 -
DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines
Paper • 2310.03714 • Published • 35 -
ReAct: Synergizing Reasoning and Acting in Language Models
Paper • 2210.03629 • Published • 27
Agents SWE
-
OpenDevin: An Open Platform for AI Software Developers as Generalist Agents
Paper • 2407.16741 • Published • 73 -
CodeACT: Code Adaptive Compute-efficient Tuning Framework for Code LLMs
Paper • 2408.02193 • Published • 1 -
CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models
Paper • 2411.04329 • Published -
SWE-Gym/OpenHands-7B-Agent
Updated • 9
Agents GUI
-
ShowUI: One Vision-Language-Action Model for GUI Visual Agent
Paper • 2411.17465 • Published • 88 -
OmniParser for Pure Vision Based GUI Agent
Paper • 2408.00203 • Published • 26 -
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction
Paper • 2412.04454 • Published • 66 -
THUDM/cogagent-9b-20241220
Image-Text-to-Text • 14B • Updated • 1.25k • 53
LLM Reasoning
-
STaR: Bootstrapping Reasoning With Reasoning
Paper • 2203.14465 • Published • 8 -
Let's Verify Step by Step
Paper • 2305.20050 • Published • 10 -
Training Large Language Models to Reason in a Continuous Latent Space
Paper • 2412.06769 • Published • 87 -
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions
Paper • 2411.14405 • Published • 62