Training sets on cytotoxicity of chemical compounds built on experimental data from CHEMBLdb were used to train PASS for “structure-cell line cytotoxicity” relationship prediction. The average prediction accuracy calculated by leave-one-out cross-validation procedure is approximately 93% for cytotoxicity prediction for cancer cell lines and non-tumor cell lines.
The outcome of the collection of experimental data was the training set of 59,882 structures of compounds, which reflects current knowledge about the cytotoxic substances in relation to 943 human cell lines.