Aim:Accurately determining the aggressiveness of central nervous system tumors is critical to improving patient survival rates and planning effective treatment programs. Recently, molecular data have become increasingly valuable in tumor classification. In response, this study proposes a weighted vote-based ensemble classification method to classify paraganglioma/pheochromocytoma, low-grade glioma, and glioblastoma tumorsconditions that present with similar symptomsagainst other central nervous system tumors using clinical and molecular data.
Material and Methods:This study utilized clinical and molecular data from The Cancer Genome Atlas database of the US National Cancer Institute. Initially, categorical variables were transformed into numerical values, and class distribution imbalance was addressed through oversampling. The dataset was split, with 80% used for training across 10 different classical classification algorithms and the remaining 20% reserved for testing. A weighted vote-based ensemble classification algorithm was developed using six classifiers, artificial neural networks, logistic regression, extra trees, random forest, gradient boosting, and extreme gradient boosting, selected for their high classification accuracy. Additionally, feature importance analysis identified the most critical risk factors within the dataset.
Results:The proposed algorithm achieved an accuracy of 90.4% and an area under the receiver operating characteristic curve of 0.968, indicating strong classification performance.
Conclusion:The findings from this study suggest that the proposed method could be a valuable tool for supporting treatment planning in central nervous system tumor cases.