A biological continuum based approach for efficient clinical classification.

Pubmed ID: 24035745

Journal: Journal of biomedical informatics

Publication Date: Feb. 1, 2014

MeSH Terms: Humans, Aged, Cardiovascular Diseases, Risk Factors, Cohort Studies, Aging, Algorithms, Artificial Intelligence, Bayes Theorem, California, Data Collection, Maryland, Medical Informatics, Models, Theoretical, North Carolina, Pattern Recognition, Automated, Pennsylvania, Rural Population, Urban Population, Support Vector Machine

Authors: Tay D, Poh CL, Goh C, Kitney RI

Cite As: Tay D, Poh CL, Goh C, Kitney RI. A biological continuum based approach for efficient clinical classification. J Biomed Inform 2014 Feb;47:28-38. Epub 2013 Sep 12.

Studies:

Abstract

Clinical feature selection problem is the task of selecting and identifying a subset of informative clinical features that are useful for promoting accurate clinical diagnosis. This is a significant task of pragmatic value in the clinical settings as each clinical test is associated with a different financial cost, diagnostic value, and risk for obtaining the measurement. Moreover, with continual introduction of new clinical features, the need to repeat the feature selection task can be very time consuming. Therefore to address this issue, we propose a novel feature selection technique for diagnosis of myocardial infarction - one of the leading causes of morbidity and mortality in many high-income countries. This method adopts the conceptual framework of biological continuum, the optimization capability of genetic algorithm for performing feature selection and the classification ability of support vector machine. Together, a network of clinical risk factors, called the biological continuum based etiological network (BCEN), was constructed. Evaluation of the proposed methods was carried out using the cardiovascular heart study (CHS) dataset. Results demonstrate a significant speedup of 4.73-fold can be achieved for the development of MI classification model. The key advantage of this methodology is the provision of a reusable (feature subset) paradigm for efficient development of up-to-date and efficacious clinical classification models.