Papers
arxiv:2502.10173

Agentic End-to-End De Novo Protein Design for Tailored Dynamics Using a Language Diffusion Model

Published on Feb 14
· Submitted by mjbuehler on Feb 17
Authors:
,

Abstract

Proteins are dynamic molecular machines whose biological functions, spanning enzymatic catalysis, signal transduction, and structural adaptation, are intrinsically linked to their motions. Designing proteins with targeted dynamic properties, however, remains a challenge due to the complex, degenerate relationships between sequence, structure, and molecular motion. Here, we introduce VibeGen, a generative AI framework that enables end-to-end de novo protein design conditioned on normal mode vibrations. VibeGen employs an agentic dual-model architecture, comprising a protein designer that generates sequence candidates based on specified vibrational modes and a protein predictor that evaluates their dynamic accuracy. This approach synergizes diversity, accuracy, and novelty during the design process. Via full-atom molecular simulations as direct validation, we demonstrate that the designed proteins accurately reproduce the prescribed normal mode amplitudes across the backbone while adopting various stable, functionally relevant structures. Notably, generated sequences are de novo, exhibiting no significant similarity to natural proteins, thereby expanding the accessible protein space beyond evolutionary constraints. Our work integrates protein dynamics into generative protein design, and establishes a direct, bidirectional link between sequence and vibrational behavior, unlocking new pathways for engineering biomolecules with tailored dynamical and functional properties. This framework holds broad implications for the rational design of flexible enzymes, dynamic scaffolds, and biomaterials, paving the way toward dynamics-informed AI-driven protein engineering.

Community

Paper submitter

VibeGen: Agentic End-to-End De Novo Protein Design for Tailored Dynamics Using a Language Diffusion Model

Proteins are not purely static but rely on dynamic motions across a range of timescales to perform critical biological functions. Existing computational tools for protein design predominantly focus on static structures and neglect low-frequency vibrational behaviors relevant for phenomena like allostery and conformational gating. Our end-to-end, agentic framework guides de novo sequence creation based on targeted vibrational modes. By harnessing these modes, fundamental signatures of protein flexibility, we achieve both accuracy and diversity in designed proteins, many of which exhibit novel sequences distinct from those found in nature. This approach lays a foundation for engineering proteins with bespoke dynamical properties, offering new opportunities for advances in enzymatic catalysis, biosensing, and materials development.

Code: https://github.com/lamm-mit/ModeShapeDiffusionDesign
Model weights: https://huggingface.co/lamm-mit/VibeGen

This is an automated message from the Librarian Bot. I found the following papers similar to this paper.

The following papers were recommended by the Semantic Scholar API

Please give a thumbs up to this comment if you found it helpful!

If you want recommendations for any Paper on Hugging Face checkout this Space

You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend

Sign up or log in to comment

Models citing this paper 1

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2502.10173 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2502.10173 in a Space README.md to link it from this page.

Collections including this paper 1