InVirtuoGen: Uniform-Source Discrete Flows for Drug Discovery
π Paper: Refine Drugs, Don't Complete Them: Uniform-Source Discrete Flows for Fragment-Based Drug Discovery
InVirtuoGen is a generative framework for molecule design and property optimization, built on discrete flow matching with a uniform source. It supports de novo generation, fragment-constrained generation, and reinforcement learningβbased property optimization.
π§© Available Checkpoints
invirtuo_gen.ckptStandard model for general use.invirtuo_gen_big.ckptLarger model variant for optimal performance.
π Quick Usage
from huggingface_hub import hf_hub_download
import torch
# Download a checkpoint
ckpt_path = hf_hub_download("invirtuolabs/InVirtuoGen", filename="invirtuo_gen.ckpt")
from in_virtuo_gen.models import InVirtuoFM
model = InVirtuoFM.load_from_checkpoint(ckpt_path, map_location="cpu")
python -m in_virtuo_gen.generate \
--ckpt_path checkpoints/invirtuo_gen.ckpt \
--num_samples 1000 \
--batch_size 200
π Tasks Supported
- De Novo Molecule Generation β high quality and diverse outputs
- Fragment-Constrained Generation β motif extension, linker design, scaffold decoration, superstructure generation
- Target Property Optimization β property-driven search with reinforcement learning
- Lead Optimization β structure-based optimization using docking
π Full Reproducibility
For environment setup, benchmarks, and reproduction of results from the paper, please see the GitHub repository.
π Citation
If you use this work, please cite:
@misc{kaech2025refinedrugsdontcomplete,
title={Refine Drugs, Don't Complete Them: Uniform-Source Discrete Flows for Fragment-Based Drug Discovery},
author={Benno Kaech and Luis Wyss and Karsten Borgwardt and Gianvito Grasso},
year={2025},
eprint={2509.26405},
archivePrefix={arXiv},
primaryClass={cs.LG},
url={https://arxiv.org/abs/2509.26405},
}
π§ Contact
π¬ Email: [email protected]