As a subject-matter novice, I found this paper's explanation of canonical SMILES notation and one way to turn it into a feature matrix for ML very helpful.
https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-018-2523-5
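
For anyone who wants to see the general idea in code, here is a minimal sketch of one simple way to turn a SMILES string into a feature matrix: one row per character, one column per symbol in the vocabulary, with a 1 marking which symbol sits at each position. This is plain character-level one-hot encoding and is my own illustration, not necessarily the scheme the paper uses, which may attach richer features to each symbol. The function names `build_vocab` and `smiles_to_matrix` and the three example molecules are made up for this example. Splitting by character is also a simplification, since two-letter atoms such as Cl or Br would be cut into two tokens.

```python
def build_vocab(smiles_list):
    """Map every distinct character seen in the SMILES strings to a column index."""
    chars = sorted({ch for smiles in smiles_list for ch in smiles})
    return {ch: idx for idx, ch in enumerate(chars)}


def smiles_to_matrix(smiles, vocab, max_len):
    """Return a max_len x len(vocab) one-hot matrix as a list of lists.

    Rows past the end of the string stay all-zero (padding).
    Assumption: every character of `smiles` is already in `vocab`;
    an unseen character raises KeyError.
    """
    if len(smiles) > max_len:
        raise ValueError("SMILES string is longer than max_len")
    matrix = [[0] * len(vocab) for _ in range(max_len)]
    for pos, ch in enumerate(smiles):
        matrix[pos][vocab[ch]] = 1
    return matrix


# Hypothetical toy data set: ethanol, formaldehyde, benzene.
smiles_examples = ["CCO", "C=O", "c1ccccc1"]
vocab = build_vocab(smiles_examples)
max_len = max(len(s) for s in smiles_examples)

ethanol = smiles_to_matrix("CCO", vocab, max_len)
for row in ethanol:
    print(row)
```

Padding every molecule to the same `max_len` gives matrices of a fixed shape, which is what most ML models expect as input. A real pipeline would likely canonicalize the SMILES first and tokenize multi-character atoms properly.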