With RCDK is very easy to get a good number of descriptors with few rows of code:
library(rcdk)
dn <- get.desc.names(dc[1])
mol <- parse.smiles("c1ccccc1")
descNames <- unique(unlist(sapply(get.desc.categories(), get.desc.names)))
descs <- eval.desc(mol, descNames)
Nice!, 288 descriptors (but 65 of them are "NA").
Matteo, I guess a lot of them are caused by the fact that the molecules do not have 3D coordinates.
ReplyDeletethanks for the suggestion, Egon!
ReplyDeleteI will check if all of the 65 "NA" require 3D coords...
A question: does the CDK generates 3D conformers with simple force fields (such as OpenBabel does)?