Click here for full text:
Incremental Machine Learning to Reduce Biochemistry Lab Costs in the Search for Drug Discovery
Keyword(s): supervised machine learning; support vector machines; reinforcement learning
Abstract: This paper promotes the use of supervised machine learning in laboratory settings where chemists have a large number of samples to test for some property, and are interested in identifying as many positive instances for the least laboratory testing effort. Rather than traditional supervised learning where the chemists would first develop a large training set and then train a clasifier, the paper promotes incrementally re-training from each lab test as it completes and then predicting the next best sample to test, as in the field of reinforcement learning. The method outperformed the 2001 KDD Cup thrombin competition winner, partly due to its reduced risk to concept drift from training set to test set.
Back to Index