Yayın:
Performance of Lung Cancer Prediction Methods Using Different Classification Algorithms

dc.contributor.authorGultepe, Yasemin
dc.date.accessioned2026-01-04T14:55:45Z
dc.date.issued2021-01-01
dc.description.abstractIn 2018, 1.76 million people worldwide died of lung cancer. Most of these deaths are due to late diagnosis, and early-stage diagnosis significantly increases the likelihood of a successful treatment for lung cancer. Machine learning is a branch of artificial intelligence that allows computers to quickly identify patterns within complex and large datasets by learning from existing data. Machine-learning techniques have been improving rapidly and are increasingly used by medical professionals for the successful classification and diagnosis of early-stage disease. They are widely used in cancer diagnosis. In particular, machine learning has been used in the diagnosis of lung cancer due to the benefits it offers doctors and patients. In this context, we per formed a study on machine-learning techniques to increase the classification accuracy of lung cancer with 32 x 56 sized numerical data from the Machine Learning Repository web site of the University of California, Irvine. In this study, the precision of the classification model was increased by the effective employment of pre-processing methods instead of direct use of classification algorithms. Nine datasets were derived with pre-processing methods and six machine-learning classification methods were used to achieve this improvement. The study results suggest that the accuracy of the k-nearest neighbors algorithm is superior to random forest, naive Bayes, logistic regression, decision tree, and support vector machines. The performance of pre-processing methods was assessed on the lung cancer dataset. The most successful pre-processing methods were Z-score (83% accuracy) for normalization methods, principal component analysis (87% accuracy) for dimensionality reduction methods, and information gain (71% accuracy) for feature selection methods.
dc.description.urihttps://doi.org/10.32604/cmc.2021.014631
dc.description.urihttps://dx.doi.org/10.32604/cmc.2021.014631
dc.description.urihttps://avesis.kocaeli.edu.tr/publication/details/98fcb814-529e-40f4-a8ee-996f6911a3fa/oai
dc.description.urihttps://avesis.atauni.edu.tr/publication/details/98fcb814-529e-40f4-a8ee-996f6911a3fa/oai
dc.description.urihttps://aperta.ulakbim.gov.tr/record/239114
dc.identifier.doi10.32604/cmc.2021.014631
dc.identifier.endpage2028
dc.identifier.issn1546-2226
dc.identifier.openairedoi_dedup___::a79e381e068c3464467f50b7885e2b29
dc.identifier.scopus2-s2.0-85102496464
dc.identifier.startpage2015
dc.identifier.urihttps://hdl.handle.net/20.500.12597/38478
dc.identifier.volume67
dc.identifier.wos000616713000006
dc.language.isoeng
dc.publisherTech Science Press
dc.relation.ispartofComputers, Materials & Continua
dc.rightsOPEN
dc.subject.sdg4. Education
dc.subject.sdg15. Life on land
dc.subject.sdg7. Clean energy
dc.subject.sdg3. Good health
dc.titlePerformance of Lung Cancer Prediction Methods Using Different Classification Algorithms
dc.typeArticle
dspace.entity.typePublication
local.api.response{"authors":[{"fullName":"Gultepe, Yasemin","name":"Yasemin","surname":"Gultepe","rank":1,"pid":null}],"openAccessColor":"gold","publiclyFunded":false,"type":"publication","language":{"code":"eng","label":"English"},"countries":null,"subjects":[{"subject":{"scheme":"FOS","value":"03 medical and health sciences"},"provenance":null},{"subject":{"scheme":"FOS","value":"0302 clinical medicine"},"provenance":null},{"subject":{"scheme":"SDG","value":"4. Education"},"provenance":null},{"subject":{"scheme":"FOS","value":"0202 electrical engineering, electronic engineering, information engineering"},"provenance":null},{"subject":{"scheme":"FOS","value":"02 engineering and technology"},"provenance":null},{"subject":{"scheme":"SDG","value":"15. Life on land"},"provenance":null},{"subject":{"scheme":"SDG","value":"7. Clean energy"},"provenance":null},{"subject":{"scheme":"SDG","value":"3. Good health"},"provenance":null}],"mainTitle":"Performance of Lung Cancer Prediction Methods Using Different Classification Algorithms","subTitle":null,"descriptions":["In 2018, 1.76 million people worldwide died of lung cancer. Most of these deaths are due to late diagnosis, and early-stage diagnosis significantly increases the likelihood of a successful treatment for lung cancer. Machine learning is a branch of artificial intelligence that allows computers to quickly identify patterns within complex and large datasets by learning from existing data. Machine-learning techniques have been improving rapidly and are increasingly used by medical professionals for the successful classification and diagnosis of early-stage disease. They are widely used in cancer diagnosis. In particular, machine learning has been used in the diagnosis of lung cancer due to the benefits it offers doctors and patients. In this context, we per formed a study on machine-learning techniques to increase the classification accuracy of lung cancer with 32 x 56 sized numerical data from the Machine Learning Repository web site of the University of California, Irvine. In this study, the precision of the classification model was increased by the effective employment of pre-processing methods instead of direct use of classification algorithms. Nine datasets were derived with pre-processing methods and six machine-learning classification methods were used to achieve this improvement. The study results suggest that the accuracy of the k-nearest neighbors algorithm is superior to random forest, naive Bayes, logistic regression, decision tree, and support vector machines. The performance of pre-processing methods was assessed on the lung cancer dataset. The most successful pre-processing methods were Z-score (83% accuracy) for normalization methods, principal component analysis (87% accuracy) for dimensionality reduction methods, and information gain (71% accuracy) for feature selection methods."],"publicationDate":"2021-01-01","publisher":"Tech Science Press","embargoEndDate":null,"sources":["Crossref"],"formats":null,"contributors":null,"coverages":null,"bestAccessRight":{"code":"c_abf2","label":"OPEN","scheme":"http://vocabularies.coar-repositories.org/documentation/access_rights/"},"container":{"name":"Computers, Materials & Continua","issnPrinted":"1546-2226","issnOnline":null,"issnLinking":null,"ep":"2028","iss":null,"sp":"2015","vol":"67","edition":null,"conferencePlace":null,"conferenceDate":null},"documentationUrls":null,"codeRepositoryUrl":null,"programmingLanguage":null,"contactPeople":null,"contactGroups":null,"tools":null,"size":null,"version":null,"geoLocations":null,"id":"doi_dedup___::a79e381e068c3464467f50b7885e2b29","originalIds":["10.32604/cmc.2021.014631","50|doiboost____|a79e381e068c3464467f50b7885e2b29","3132765352","98fcb814-529e-40f4-a8ee-996f6911a3fa","50|od_____10011::a2aa07518a31d6f308eb442fef197747","50|od______9933::a2aa07518a31d6f308eb442fef197747","oai:aperta.ulakbim.gov.tr:239114","50|r39c86a4b39b::5c64dfd98a45ea7a4eababe0e30f5d8a"],"pids":[{"scheme":"doi","value":"10.32604/cmc.2021.014631"}],"dateOfCollection":null,"lastUpdateTimeStamp":null,"indicators":{"citationImpact":{"citationCount":14,"influence":3.6420282e-9,"popularity":1.2985762e-8,"impulse":8,"citationClass":"C4","influenceClass":"C4","impulseClass":"C4","popularityClass":"C4"}},"instances":[{"pids":[{"scheme":"doi","value":"10.32604/cmc.2021.014631"}],"license":"https://doi.org/10.32604/TSP-CROSSMARKPOLICY","type":"Article","urls":["https://doi.org/10.32604/cmc.2021.014631"],"publicationDate":"2021-01-01","refereed":"peerReviewed"},{"pids":[{"scheme":"doi","value":"10.32604/cmc.2021.014631"}],"license":"CC BY","type":"Article","urls":["https://doi.org/10.32604/cmc.2021.014631"],"refereed":"nonPeerReviewed"},{"alternateIdentifiers":[{"scheme":"mag_id","value":"3132765352"},{"scheme":"doi","value":"10.32604/cmc.2021.014631"}],"type":"Article","urls":["https://dx.doi.org/10.32604/cmc.2021.014631"],"refereed":"nonPeerReviewed"},{"alternateIdentifiers":[{"scheme":"doi","value":"10.32604/cmc.2021.014631"}],"type":"Article","urls":["https://avesis.kocaeli.edu.tr/publication/details/98fcb814-529e-40f4-a8ee-996f6911a3fa/oai"],"publicationDate":"2021-01-01","refereed":"nonPeerReviewed"},{"alternateIdentifiers":[{"scheme":"doi","value":"10.32604/cmc.2021.014631"}],"type":"Article","urls":["https://avesis.atauni.edu.tr/publication/details/98fcb814-529e-40f4-a8ee-996f6911a3fa/oai"],"publicationDate":"2021-01-01","refereed":"nonPeerReviewed"},{"license":"CC BY","type":"Other literature type","urls":["https://aperta.ulakbim.gov.tr/record/239114"],"publicationDate":"2021-01-01","refereed":"nonPeerReviewed"}],"isGreen":true,"isInDiamondJournal":false}
local.import.sourceOpenAire
local.indexed.atWOS
local.indexed.atScopus

Dosyalar

Koleksiyonlar