TY - JOUR
T1 - Integration of an imbalance framework with novel high-generalizable classifiers for radiomics-based distant metastases prediction of advanced nasopharyngeal carcinoma[Formula presented]
AU - Zhang, Yuanpeng
AU - Lam, Saikit
AU - Yu, Tingting
AU - Teng, Xinzhi
AU - Zhang, Jiang
AU - Lee, Francis Kar ho
AU - Au, Kwok hung
AU - Yip, Celia Wai yi
AU - Wang, Shitong
AU - Cai, Jing
N1 - Funding Information:
This work was supported in part by the Hong Kong Scholars Program under Grant XJ2019056 and Innovation, Technology Fund ITS/080/19 , and the Key Projects of the National Natural Science Foundation of China under Grant U20A20228 , the Natural Science Foundation of Jiangsu Province under Grant under BK20201441 .
Publisher Copyright:
© 2021 Elsevier B.V.
PY - 2022/1/10
Y1 - 2022/1/10
N2 - Model overfitting and data imbalance are two main challenges in radiomics studies. In this study, we develop a high-generalizable classifier MERGE (Multi-kErnel Regression with Graph Embedding) and an imbalance framework SUS (Sensitivity-based Under-Sampling) to address these two challenges. First, we integrate the class compactness graph into multi-kernel regression to keep the samples from the same class close together when they are transformed to the label space. In the class compactness graph, each pair of samples from the same class are linked by an undirected weighted edge to capture the relationship between two samples. In such a way, samples from the same class can be kept as close as possible so that the model overfitting problem can be weakened to a great extent. Secondly, to utilize potentially informative data, we propose a sensitivity-based under-sampling imbalance ensemble framework SUS. In each ensemble procedure, the majority class is organized into different blocks through clustering according to the sensitivity of each sample computed by MERGE and then each block is randomly under-sampled in a concordant & self-paced manner. We collect 100 advanced nasopharyngeal carcinoma patients from Hong Kong Queen Elizabeth Hospital and use the proposed SUS-MERGE framework to predict distant metastasis using radiomics features extracted from tumor subregions of different image modalities. Experimental results show promising performance as compared with benchmarking methods.
AB - Model overfitting and data imbalance are two main challenges in radiomics studies. In this study, we develop a high-generalizable classifier MERGE (Multi-kErnel Regression with Graph Embedding) and an imbalance framework SUS (Sensitivity-based Under-Sampling) to address these two challenges. First, we integrate the class compactness graph into multi-kernel regression to keep the samples from the same class close together when they are transformed to the label space. In the class compactness graph, each pair of samples from the same class are linked by an undirected weighted edge to capture the relationship between two samples. In such a way, samples from the same class can be kept as close as possible so that the model overfitting problem can be weakened to a great extent. Secondly, to utilize potentially informative data, we propose a sensitivity-based under-sampling imbalance ensemble framework SUS. In each ensemble procedure, the majority class is organized into different blocks through clustering according to the sensitivity of each sample computed by MERGE and then each block is randomly under-sampled in a concordant & self-paced manner. We collect 100 advanced nasopharyngeal carcinoma patients from Hong Kong Queen Elizabeth Hospital and use the proposed SUS-MERGE framework to predict distant metastasis using radiomics features extracted from tumor subregions of different image modalities. Experimental results show promising performance as compared with benchmarking methods.
KW - Class compactness graph
KW - Imbalance classification
KW - Overfitting
KW - Radiomics-based prediction
KW - Sensitivity-based under-sampling
UR - http://www.scopus.com/inward/record.url?scp=85118494284&partnerID=8YFLogxK
U2 - 10.1016/j.knosys.2021.107649
DO - 10.1016/j.knosys.2021.107649
M3 - Journal article
AN - SCOPUS:85118494284
SN - 0950-7051
VL - 235
JO - Knowledge-Based Systems
JF - Knowledge-Based Systems
M1 - 107649
ER -