The Learnehr program implements the Semi-Supervised Set Covering Machine (S3CM) algorithm
for Electronic Health Record (EHR) freetext classification. This program was developed by
Dr Zhuoran Wang initially in 2010, and further revised in 2011, while his was working in
the Centre for Statistics and Machine Learning at University College London. This work is
part of the Wellcome Trust and NIHR funded project CALIBER (http://www.caliberresearch.org.uk/).
A detailed introduction and empirical evaluations of the algorithm can be found in the
following manuscript.
Z. Wang, A. D. Shah, A. R. Tate, S. Denaxas, J. Shawe-Taylor and H. Hemingway. Extracting
diagnoses from unstructured text in electronic health records by semi-supervised machine
learning. 2011.