Derify/ChemMRL
Sentence Similarity • 0.2B • Updated • 4.25k
SMILES Matryoshka Representation Learning Embedding Transformer
Note: A SMILES-pair dataset for training ChemMRL via knowledge distillation from GenMol. Each molecule is paired with a valid, similar variant and a similarity label, supporting molecular similarity, retrieval, clustering, and other cheminformatics tasks.
Search for similar molecules using SMILES or a canvas
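
As a rough illustration of how similarity search over Matryoshka-style embeddings can work, the sketch below truncates each embedding to its first `dim` components, renormalizes, and ranks candidates by cosine similarity. It is not ChemMRL's actual code or API: the function names are hypothetical, and the 4-dimensional vectors are made-up toy values, not model output.

```python
import math


def truncate_and_normalize(vec, dim):
    """Keep the first `dim` components and rescale them to unit length."""
    head = vec[:dim]
    norm = math.sqrt(sum(x * x for x in head))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in head]


def cosine(a, b):
    """Cosine similarity of two unit-length vectors (a plain dot product)."""
    return sum(x * y for x, y in zip(a, b))


def rank_by_similarity(query, candidates, dim):
    """Return (name, score) pairs sorted from most to least similar."""
    q = truncate_and_normalize(query, dim)
    scored = [
        (name, cosine(q, truncate_and_normalize(vec, dim)))
        for name, vec in candidates.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy embeddings keyed by SMILES string. The numbers are invented for
# illustration only; a real model would produce much longer vectors.
query = [0.9, 0.1, 0.3, 0.2]
candidates = {
    "CCO": [0.8, 0.2, 0.1, 0.4],
    "c1ccccc1": [0.1, 0.9, 0.5, 0.0],
}

# Compare using only the first 2 of 4 dimensions.
ranking = rank_by_similarity(query, candidates, dim=2)
for name, score in ranking:
    print(f"{name}\t{score:.3f}")
```

Using a smaller `dim` lowers storage and comparison cost, usually at some loss of accuracy. Renormalizing after truncation keeps the dot product equal to the cosine similarity.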