Evaluating Lung Cancer Classification Performance Using Multiple Feature Extraction Methods with SVM and KNN Classifiers
Abstract
Lung cancer is one of the most prevalent causes of mortality worldwide, making early detection essential for improving patient survival rates. Computed tomography (CT) imaging serves as a crucial diagnostic tool; however, the large volume of generated images poses challenges in precise interpretation by radiologists. This study evaluates the effectiveness of lung cancer classification by utilizing various feature extraction techniques in combination with support vector machine (SVM) and k-nearest neighbours (KNN) classifiers. By analysing different feature sets, the research aims to identify the most effective combination for enhanced classification accuracy. The findings indicate notable improvements in classification performance, facilitating more reliable lung cancer detection.
References
V. Krishnaiah, G. Narsimha, C. Subhash.( 2013), Diagnosis of lung cancer prediction system using data mining classification techniques, In International Journal of Computer Science and Information Technologies, 4(1): 39-45.
J. J. Dignam, L. Huang, L. Ries, M. Reichman, A. Mariotto, E. Feuer. (2009), Estimating cancer statistic and other-cause mortality in clinical trial and population-based cancer registry cohorts. Wiley InterScience [Online].
Disha Sharma, Gagandeep Jindal (2011), Identifying Lung Cancer Using Image Processing Techniques, International Conference on Computational Techniques and Artificial Intelligence.
B.N, Mithuna&Ravikumar, Pushpa& C.N, Arpitha (2018), A Quantitative Approach for Determining Lung Cancer Using CT scan Images, 1786-1790. 10.1109/ICECA.2018.8474670.
Fan, Jianqing; Han, Fang; Liu, Han (2014), Challenges of Big Data analysis, National Science Review. 1 (2): 293–314. ISSN 2095- 5138. PMC 4236847. PMID 25419469. doi:10.1093/nsr/nwt032.
Tina M. St. John M.D. (2005), With Every Breath: A Lung Cancer Guidebook, (1):75-82. ISBN 0-9760450-2- 8, www.lungcancerguidebook.org.
B. Sobolev, A. Levy, and S. Goring, Eds (2016), Health Services Data: Big Data Analytics for Deriving Predictive Healthcare Insights, in Data and Measures in Health Services Research, Springer US, 1(1)1–17.
http://www.news-medical.net/news/20120530/Insulinuse-linked-to-lung-cancer-risk-in-diabetes.aspx (2012)
Dutkowska, Adam Antczak (2016), Comorbidities in lung cancer , AgataEwa, Pneumonologiai Alergologia Polska, 84 ( 3): 186–192.
Xie X, Liu Q, Wu J, Wakui M.(2009), Impact of cigarette smoking in type 2 diabetes development,
Cta PharmacologicaSinica,30 (6):784- 787.
Chang SA (2012), Smoking and Type 2 Diabetes Mellitus. Diabetes & Metabolism, Journal,36(6):399-403
Maddatu, Judith et al.(2017), Smokingand the risk of type 2 diabetes, Translational Research:184(1),101-107.
Appari A, Eric Johnson M, Anthony DL.(2013), Meaningful use of electronic health record systems and
process quality of care: evidence from a panel data analysis of U.S. acute-care hospitals, Health
ServRes,48(1):354–75.
Fitzhenry F, Murff HJ, Matheny ME, et al.(2013), Exploring the frontier of electronic health record
surveillance: the case of postoperative complications, Med Care51:509–16.
J.R. Quinlan.(1994), C4.5 programs for machine learning, Morgan Kaufmann Publishers,(16):235-240.
Vapnik,V.(1995), Support-vector networks, .Machine Learning. 20 (3): 273–297.
G. Dimitoglou, J. A. Adams, and C. M. Jim.(2012), Comparison of the C4.5 and a Naive Bayes Classifier
for the Prediction of Lung Cancer Survivability, CoRR, 4(8):1–9.
Hamid KarimKhani Z and et.al.(2015), A comparative survey on data mining techniques for breast cancer
diagnosis and prediction Survey, Indian Journal of Fundamental and Applied Life Sciences.5 (S1): 4330-
.
S. G. Armato, III and W.F. Sensakovic(2004), Automated lung segmentation for thoracic CT: Impact on
computer-aided diagnosis, Acad. Radiol., 11(9): 1011-1021.
I. sluimer, M. Prokop, and B. van Ginneken (2005), Towards automated segmentation of the pathological
lung in CT, IEEE Trans. Medical Image, 24(8):1025-1038.
Y. Xu, M. Sonka, G. McLennan, J. Guo. And E.A. Hoffman (2006), MDCT-based 3-D texture
Classification of emphysema and early smoking related lung anthologies, IEEE Trans. Med. Imag.,
(4): 464-475.
Tanushree Sinha Roy, NeerajSirohi, ArtiPatle (2015), Classification of Lung Image and Nodule Detection
Using Fuzzy Inference System, International Conference on Computing, Communication and
Automation (ICCCA2015).
Ritika Agarwal, AnkitShankhadhar, Raj Kumar Sagar (2015), Detection of Lung Cancer Using Content
Based Medical Image Retrieval, Fifth International Conference On Advanced Computing And
Communication Technologies.
M. Mukherjee and P. K. Biswal (2018), Segmentation of lungs nodules by iterative thresholding method
and classification with Reduced Features, Second International Conference on Inventive Communication
and Computational Technologies (ICICCT), Coimbatore: 450-455.