Agentic End-to-End De Novo Protein Design for Tailored Dynamics Using a Language Diffusion Model
Abstract
Proteins are dynamic molecular machines whose biological functions, spanning enzymatic catalysis, signal transduction, and structural adaptation, are intrinsically linked to their motions. Designing proteins with targeted dynamic properties, however, remains a challenge due to the complex, degenerate relationships between sequence, structure, and molecular motion. Here, we introduce VibeGen, a generative AI framework that enables end-to-end de novo protein design conditioned on normal mode vibrations. VibeGen employs an agentic dual-model architecture, comprising a protein designer that generates sequence candidates based on specified vibrational modes and a protein predictor that evaluates their dynamic accuracy. This approach synergizes diversity, accuracy, and novelty during the design process. Via full-atom molecular simulations as direct validation, we demonstrate that the designed proteins accurately reproduce the prescribed normal mode amplitudes across the backbone while adopting various stable, functionally relevant structures. Notably, generated sequences are de novo, exhibiting no significant similarity to natural proteins, thereby expanding the accessible protein space beyond evolutionary constraints. Our work integrates protein dynamics into generative protein design, and establishes a direct, bidirectional link between sequence and vibrational behavior, unlocking new pathways for engineering biomolecules with tailored dynamical and functional properties. This framework holds broad implications for the rational design of flexible enzymes, dynamic scaffolds, and biomaterials, paving the way toward dynamics-informed AI-driven protein engineering.
Community
VibeGen: Agentic End-to-End De Novo Protein Design for Tailored Dynamics Using a Language Diffusion Model
Proteins are not purely static but rely on dynamic motions across a range of timescales to perform critical biological functions. Existing computational tools for protein design predominantly focus on static structures and neglect low-frequency vibrational behaviors relevant for phenomena like allostery and conformational gating. Our end-to-end, agentic framework guides de novo sequence creation based on targeted vibrational modes. By harnessing these modes, fundamental signatures of protein flexibility, we achieve both accuracy and diversity in designed proteins, many of which exhibit novel sequences distinct from those found in nature. This approach lays a foundation for engineering proteins with bespoke dynamical properties, offering new opportunities for advances in enzymatic catalysis, biosensing, and materials development.
Code: https://github.com/lamm-mit/ModeShapeDiffusionDesign
Model weights: https://huggingface.co/lamm-mit/VibeGen
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- From thermodynamics to protein design: Diffusion models for biomolecule generation towards autonomous protein engineering (2025)
- Learning to engineer protein flexibility (2024)
- Prot2Chat: Protein LLM with Early Fusion of Sequence and Structure (2025)
- Steering Protein Family Design through Profile Bayesian Flow (2025)
- Docking-Aware Attention: Dynamic Protein Representations through Molecular Context Integration (2025)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Models citing this paper 1
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper