Benign and Malignant Lymph Nodes Classification in Non-Small Cell Lung Cancer via Machine Learning Model

Loading...
Thumbnail Image
Limited Access
This item is unavailable until:
2026-06-06

Date

2024

Journal Title

Journal ISSN

Volume Title

Repository Usage Stats

2
views
0
downloads

Abstract

Objective:To develop a machine learning model that integrates deep learning image features and radiomics features to classify lymph nodes as benign or malignant in Non-Small Cell Lung Cancer (NSCLC). Methods: The dataset comprises contrast-enhanced CT scans from 541 lung cancer patients before surgery, collected at a Shanghai Hospital between July 2015 and December 2017 under an IRB study. It includes 1,237 lymph nodes, identified from preoperative CT scans due to enlargement and confirmed as non-small cell lung cancer (NSCLC) via surgical pathology. Lymph node classification into malignant or benign categories utilized in postoperative pathological reports. Our method employs a dual radiomic feature extraction strategy. The deep image features (DIF) were derived from the final convolutional layer of a pre-trained VGG-16 encoder network to characterize the lymph node’s image texture. A total of nine 2D shape-based radiomic features (RF) are extracted based on the Py-radiomics calculation toolbox to characterize lymph node morphological information. And ninety-two handcrafted radiomic features (HRF) are extracted. The extracted DIF, RF, and HRF were combined and fed into a Random Forest classifier for the benign and malignant lymph node classification. The random forest classifier was trained following an 8:2 train/test split ratio and evaluated using Area Under the Curve (AUC), Receiver Operating Characteristic (ROC), and p-value, and 5-fold cross-validation was also employed to objectively evaluate model performance.

Results: The mean AUC for the Random Forest classifier using only 2D shape features is 0.691, while mean AUC for the classifier employing only DIF is 726. Utilizing both DIF and HRF for classification resulted in an average AUC of 0.724, whereas integrating RF with DIF achieved superior classification efficacy, boasting the highest average AUC of 0.746. All results were considered statistically significant with a p-value of less than 0.05. Conclusion: The combination of image texture analysis refers to DIF with morphological information offers an enhanced characterization ability to classify lymph nodes as benign or malignant from CT images for lung NSCLC patients.

Description

Provenance

Citation

Citation

Ge, Jingyu (2024). Benign and Malignant Lymph Nodes Classification in Non-Small Cell Lung Cancer via Machine Learning Model. Master's thesis, Duke University. Retrieved from https://hdl.handle.net/10161/31052.

Collections


Except where otherwise noted, student scholarship that was shared on DukeSpace after 2009 is made available to the public under a Creative Commons Attribution / Non-commercial / No derivatives (CC-BY-NC-ND) license. All rights in student work shared on DukeSpace before 2009 remain with the author and/or their designee, whose permission may be required for reuse.