Optimized approach to decision fusion of heterogeneous data for breast cancer diagnosis.

As more diagnostic testing options become available to physicians, it becomes more difficult to combine various types of medical information together in order to optimize the overall diagnosis. To improve diagnostic performance, here we introduce an approach to optimize a decision-fusion technique to combine heterogeneous information, such as from different modalities, feature categories, or institutions. For classifier comparison we used two performance metrics: The receiving operator characteristic (ROC) area under the curve [area under the ROC curve (AUC)] and the normalized partial area under the curve (pAUC). This study used four classifiers: Linear discriminant analysis (LDA), artificial neural network (ANN), and two variants of our decision-fusion technique, AUC-optimized (DF-A) and pAUC-optimized (DF-P) decision fusion. We applied each of these classifiers with 100-fold cross-validation to two heterogeneous breast cancer data sets: One of mass lesion features and a much more challenging one of microcalcification lesion features. For the calcification data set, DF-A outperformed the other classifiers in terms of AUC (p < 0.02) and achieved AUC=0.85 +/- 0.01. The DF-P surpassed the other classifiers in terms of pAUC (p < 0.01) and reached pAUC=0.38 +/- 0.02. For the mass data set, DF-A outperformed both the ANN and the LDA (p < 0.04) and achieved AUC=0.94 +/- 0.01. Although for this data set there were no statistically significant differences among the classifiers' pAUC values (pAUC=0.57 +/- 0.07 to 0.67 +/- 0.05, p > 0.10), the DF-P did significantly improve specificity versus the LDA at both 98% and 100% sensitivity (p < 0.04). In conclusion, decision fusion directly optimized clinically significant performance measures, such as AUC and pAUC, and sometimes outperformed two well-known machine-learning techniques when applied to two different breast cancer data sets.

Type

Journal article

Department

Biomedical Engineering

Subjects

Algorithms, Artificial intelligence, Breast Neoplasms, Database Management Systems, Databases, Factual, Decision Support Systems, Clinical, Diagnosis, Computer-Assisted, Humans, Information Storage and Retrieval, Quality Control, Reproducibility of Results, Sensitivity and Specificity

Permalink

https://hdl.handle.net/10161/207

Rights

http://rightsstatements.org/vocab/InC/1.0/

Published Version (Please cite this version)

10.1118/1.2208934

Publication Info

Jesneck, Jonathan Lee (2006). Optimized approach to decision fusion of heterogeneous data for breast cancer diagnosis. Med Phys, 33(8). 10.1118/1.2208934 Retrieved from https://hdl.handle.net/10161/207.

This is constructed from limited available data and may be imprecise. To cite this article, please review & use the official citation provided by the journal.

Collections

Dissertations

Full item page

Dukes student scholarship is made available to the public using a Creative Commons Attribution / Non-commercial / No derivative (CC-BY-NC-ND) license.

Optimized approach to decision fusion of heterogeneous data for breast cancer diagnosis.

Date

Authors

Advisors

Journal Title

Journal ISSN

Volume Title

Repository Usage Stats

Citation Stats

Abstract