Yayın:
Detection of Intracranial Hemorrhage from Computed Tomography Images: The Diagnostic Role and Efficacy of ChatGPT-4o

dc.contributor.authorKoyun, Mustafa
dc.contributor.authorCevval, Zeycan Kubra
dc.contributor.authorReis, Bahadir
dc.contributor.authorEce, Bunyamin
dc.date.accessioned2026-01-04T21:12:37Z
dc.date.issued2024-12-18
dc.description.abstractBackground/Objectives: The role of artificial intelligence (AI) in radiological image analysis is rapidly evolving. This study evaluates the diagnostic performance of Chat Generative Pre-trained Transformer (ChatGPT) in detecting intracranial hemorrhages (ICH) on non-contrast computed tomography (NCCT) images, along with its ability to classify hemorrhage type, stage, anatomical location, and associated findings. Methods: A retrospective study was conducted using 240 cases, comprising 120 ICH cases and 120 controls with normal findings. Five consecutive NCCT slices per case were selected by radiologists and analyzed by ChatGPT-4o using a standardized prompt with nine questions. Diagnostic accuracy, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were calculated by comparing the model’s results with radiologists’ assessments (gold standard). After a two-week interval, the same dataset was re-evaluated to assess intra-observer reliability and consistency. Results: ChatGPT-4o achieved 100% accuracy in identifying imaging modality type. For ICH detection, the model demonstrated 68.3% diagnostic accuracy, sensitivity of 79.2%, specificity of 57.5%, PPV of 65.1%, and NPV of 73.4%. It correctly classified 34.0% of hemorrhage types and 7.3% of localizations. All ICH-positive cases were identified as acute phase (100%). In the second evaluation, diagnostic accuracy improved to 73.3%, with sensitivity of 86.7% and specificity of 60%. The Cohen’s Kappa coefficient for intraobserver agreement in ICH detection indicated moderate agreement (κ=0.469). Conclusions: ChatGPT-4o shows promise in identifying imaging modalities and ICH presence but demonstrates limitations in localization and hemorrhage type classification. These findings highlight its potential for improvement through targeted training for medical applications.
dc.description.urihttps://doi.org/10.20944/preprints202412.1475.v1
dc.description.urihttps://doi.org/10.3390/diagnostics15020143
dc.description.urihttp://dx.doi.org/10.3390/diagnostics15020143
dc.description.urihttps://doaj.org/article/1fecd04fa0ea42f4ad099020a6996e5d
dc.identifier.doi10.20944/preprints202412.1475.v1
dc.identifier.eissn2075-4418
dc.identifier.openairedoi_dedup___::2704a9284bff7e32f724e1ed47a914ba
dc.identifier.orcid0000-0002-9811-4385
dc.identifier.orcid0000-0003-0523-057x
dc.identifier.orcid0009-0003-7260-6917
dc.identifier.orcid0000-0001-6288-8410
dc.identifier.startpage143
dc.identifier.urihttps://hdl.handle.net/20.500.12597/42315
dc.identifier.volume15
dc.publisherMDPI AG
dc.relation.ispartofDiagnostics
dc.rightsOPEN
dc.subjectMedicine (General)
dc.subjectChatGPT
dc.subjectR5-920
dc.subjectcomputed tomography
dc.subjectartificial intelligence
dc.subjectintracranial hemorrhage
dc.subjectradiology
dc.subjectArticle
dc.titleDetection of Intracranial Hemorrhage from Computed Tomography Images: The Diagnostic Role and Efficacy of ChatGPT-4o
dc.typeArticle
dspace.entity.typePublication
local.api.response{"authors":[{"fullName":"Mustafa Koyun","name":"Mustafa","surname":"Koyun","rank":1,"pid":{"id":{"scheme":"orcid","value":"0000-0002-9811-4385"},"provenance":null}},{"fullName":"Zeycan Kubra Cevval","name":"Zeycan Kubra","surname":"Cevval","rank":2,"pid":{"id":{"scheme":"orcid","value":"0000-0003-0523-057x"},"provenance":null}},{"fullName":"Bahadir Reis","name":"Bahadir","surname":"Reis","rank":3,"pid":{"id":{"scheme":"orcid_pending","value":"0009-0003-7260-6917"},"provenance":null}},{"fullName":"Bunyamin Ece","name":"Bunyamin","surname":"Ece","rank":4,"pid":{"id":{"scheme":"orcid","value":"0000-0001-6288-8410"},"provenance":null}}],"openAccessColor":"gold","publiclyFunded":false,"type":"publication","language":{"code":"und","label":"Undetermined"},"countries":null,"subjects":[{"subject":{"scheme":"keyword","value":"Medicine (General)"},"provenance":null},{"subject":{"scheme":"keyword","value":"ChatGPT"},"provenance":null},{"subject":{"scheme":"keyword","value":"R5-920"},"provenance":null},{"subject":{"scheme":"keyword","value":"computed tomography"},"provenance":null},{"subject":{"scheme":"keyword","value":"artificial intelligence"},"provenance":null},{"subject":{"scheme":"keyword","value":"intracranial hemorrhage"},"provenance":null},{"subject":{"scheme":"keyword","value":"radiology"},"provenance":null},{"subject":{"scheme":"keyword","value":"Article"},"provenance":null}],"mainTitle":"Detection of Intracranial Hemorrhage from Computed Tomography Images: The Diagnostic Role and Efficacy of ChatGPT-4o","subTitle":null,"descriptions":["<jats:p>Background/Objectives: The role of artificial intelligence (AI) in radiological image analysis is rapidly evolving. This study evaluates the diagnostic performance of Chat Generative Pre-trained Transformer (ChatGPT) in detecting intracranial hemorrhages (ICH) on non-contrast computed tomography (NCCT) images, along with its ability to classify hemorrhage type, stage, anatomical location, and associated findings. Methods: A retrospective study was conducted using 240 cases, comprising 120 ICH cases and 120 controls with normal findings. Five consecutive NCCT slices per case were selected by radiologists and analyzed by ChatGPT-4o using a standardized prompt with nine questions. Diagnostic accuracy, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were calculated by comparing the model&amp;rsquo;s results with radiologists&amp;rsquo; assessments (gold standard). After a two-week interval, the same dataset was re-evaluated to assess intra-observer reliability and consistency. Results: ChatGPT-4o achieved 100% accuracy in identifying imaging modality type. For ICH detection, the model demonstrated 68.3% diagnostic accuracy, sensitivity of 79.2%, specificity of 57.5%, PPV of 65.1%, and NPV of 73.4%. It correctly classified 34.0% of hemorrhage types and 7.3% of localizations. All ICH-positive cases were identified as acute phase (100%). In the second evaluation, diagnostic accuracy improved to 73.3%, with sensitivity of 86.7% and specificity of 60%. The Cohen&amp;rsquo;s Kappa coefficient for intraobserver agreement in ICH detection indicated moderate agreement (&amp;kappa;=0.469). Conclusions: ChatGPT-4o shows promise in identifying imaging modalities and ICH presence but demonstrates limitations in localization and hemorrhage type classification. These findings highlight its potential for improvement through targeted training for medical applications.</jats:p>"],"publicationDate":"2024-12-18","publisher":"MDPI AG","embargoEndDate":null,"sources":["Crossref","Diagnostics (Basel)","Diagnostics, Vol 15, Iss 2, p 143 (2025)"],"formats":null,"contributors":null,"coverages":null,"bestAccessRight":{"code":"c_abf2","label":"OPEN","scheme":"http://vocabularies.coar-repositories.org/documentation/access_rights/"},"container":{"name":"Diagnostics","issnPrinted":null,"issnOnline":"2075-4418","issnLinking":null,"ep":null,"iss":null,"sp":"143","vol":"15","edition":null,"conferencePlace":null,"conferenceDate":null},"documentationUrls":null,"codeRepositoryUrl":null,"programmingLanguage":null,"contactPeople":null,"contactGroups":null,"tools":null,"size":null,"version":null,"geoLocations":null,"id":"doi_dedup___::2704a9284bff7e32f724e1ed47a914ba","originalIds":["10.20944/preprints202412.1475.v1","50|doiboost____|2704a9284bff7e32f724e1ed47a914ba","diagnostics15020143","10.3390/diagnostics15020143","50|doiboost____|9d7383fdf7e497fcdd8348b61af06c0f","oai:pubmedcentral.nih.gov:11763562","50|od_______267::44b540137e2ca65b074910f3b1a8c9ca","50|doajarticles::c95a54856736c5f6d837c3b4d8e0cc9e","oai:doaj.org/article:1fecd04fa0ea42f4ad099020a6996e5d"],"pids":[{"scheme":"doi","value":"10.20944/preprints202412.1475.v1"},{"scheme":"doi","value":"10.3390/diagnostics15020143"}],"dateOfCollection":null,"lastUpdateTimeStamp":null,"indicators":{"citationImpact":{"citationCount":4,"influence":2.662858e-9,"popularity":5.552541e-9,"impulse":4,"citationClass":"C5","influenceClass":"C5","impulseClass":"C5","popularityClass":"C4"}},"instances":[{"pids":[{"scheme":"doi","value":"10.20944/preprints202412.1475.v1"}],"license":"CC BY","type":"Article","urls":["https://doi.org/10.20944/preprints202412.1475.v1"],"publicationDate":"2024-12-18","refereed":"peerReviewed"},{"pids":[{"scheme":"doi","value":"10.3390/diagnostics15020143"}],"license":"CC BY","type":"Article","urls":["https://doi.org/10.3390/diagnostics15020143"],"publicationDate":"2025-01-09","refereed":"peerReviewed"},{"alternateIdentifiers":[{"scheme":"doi","value":"10.3390/diagnostics15020143"}],"license":"CC BY","type":"Other literature type","urls":["http://dx.doi.org/10.3390/diagnostics15020143"],"publicationDate":"2025-01-09","refereed":"nonPeerReviewed"},{"alternateIdentifiers":[{"scheme":"doi","value":"10.3390/diagnostics15020143"}],"type":"Article","urls":["https://doaj.org/article/1fecd04fa0ea42f4ad099020a6996e5d"],"publicationDate":"2025-01-01","refereed":"nonPeerReviewed"}],"isGreen":true,"isInDiamondJournal":false}
local.import.sourceOpenAire

Dosyalar

Koleksiyonlar