Derify/augmented_canonical_pubchem_13m
13.3M
•
36
A set of SMILES datasets canonicalized with RDKit, with 33% random augmentation, intended for robust, diverse molecular ML training.
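
The description does not say how the canonicalization and augmentation were carried out, so the following is only a minimal sketch of one plausible reading. It assumes that "33% randomly augmented" means each molecule has a 0.33 chance of contributing one extra, randomly ordered SMILES string alongside its canonical form. The helper names `canonicalize`, `randomize`, and `augment` are hypothetical and are not taken from the dataset's own tooling; only the RDKit calls (`Chem.MolFromSmiles`, `Chem.MolToSmiles`) are real.

```python
import random

from rdkit import Chem


def canonicalize(smiles):
    """Return RDKit's canonical SMILES, or None if the input does not parse."""
    mol = Chem.MolFromSmiles(smiles)
    if mol is None:
        return None
    return Chem.MolToSmiles(mol, canonical=True)


def randomize(smiles):
    """Return a randomly ordered (non-canonical) SMILES for the same molecule."""
    mol = Chem.MolFromSmiles(smiles)
    if mol is None:
        return None
    # doRandom=True picks a random atom ordering; RDKit uses its own RNG here,
    # so this output is not controlled by Python's random seed.
    return Chem.MolToSmiles(mol, canonical=False, doRandom=True)


def augment(smiles_list, fraction=0.33, seed=0):
    """Canonicalize every entry and, with probability `fraction`,
    append one randomized variant of it (assumed augmentation scheme)."""
    rng = random.Random(seed)  # seeds only the selection, not RDKit's ordering
    out = []
    for smiles in smiles_list:
        canon = canonicalize(smiles)
        if canon is None:
            continue  # skip unparseable input
        out.append(canon)
        if rng.random() < fraction:
            out.append(randomize(canon))
    return out


if __name__ == "__main__":
    for s in augment(["OCC", "c1ccccc1", "CC(=O)O"], fraction=0.33, seed=0):
        print(s)
```

A randomized string always maps back to the same canonical form, which is why this kind of augmentation can add surface diversity without changing the underlying molecule.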