Data Valuation with Shapley-based Methods for Medical Image Classification

dc.contributor.author: Akçelik, Zeliha Kaya
dc.contributor.author: Hoşavcı, Reyhan
dc.contributor.author: Dik, Sümeyye Zülal
dc.contributor.author: Aydın, Musa
dc.contributor.author: Kuş, Zeki
dc.date.accessioned: 2025-12-02T11:14:07Z
dc.date.available: 2025-12-02T11:14:07Z
dc.date.issued: 2025 [en_US]
dc.department: FSM Vakıf Üniversitesi, Faculty of Engineering, Department of Computer Engineering [en_US]
dc.description.abstract: This study introduces novel approaches to data valuation in medical image classification, focusing on the Gradient Shapley and Improved Gradient Shapley methods. These methods aim to reduce data selection costs while improving model performance, making them highly practical for training processes. The Gradient Shapley method evaluates the contributions of individual data samples to model performance based on a robust theoretical foundation. In addition, the Improved Gradient Shapley method enhances computational efficiency and demonstrates superior performance, particularly on noisy or imbalanced datasets. Experiments conducted on the MedMNIST dataset reveal that both methods achieve competitive accuracy and AUC values even with significantly reduced data. For instance, in the PathMNIST dataset, using only 10% of the data resulted in an AUC value of 96.6%, which is remarkably close to the baseline AUC value of 98.3% achieved with the full dataset. On some datasets, the Shapley-based methods showed better classification performance with ≤50% of the full data. This study significantly improves data valuation processes in medical image classification. The findings highlight the potential of Shapley value-based methods to optimize training processes without sacrificing performance. They offer a scalable and efficient method for real-world applications in critical domains like healthcare. Future research could explore integrating these methods with other data selection approaches to further enhance data valuation processes. [en_US]
dc.identifier.citation: AKÇELİK, Zeliha Kaya, Reyhan HOŞAVCI, Sümeyye Zülal DİK, Musa AYDIN & Zeki KUŞ. "Data Valuation with Shapley-based Methods for Medical Image Classification". 2025 10th International Conference on Machine Learning Technologies, (2025): 461-467. [en_US]
dc.identifier.doi: 10.1109/ICMLT65785.2025.11193393
dc.identifier.endpage: 467 [en_US]
dc.identifier.scopus: 2-s2.0-105022269087
dc.identifier.scopusquality: N/A
dc.identifier.startpage: 461 [en_US]
dc.identifier.uri: https://hdl.handle.net/11352/5750
dc.indekslendigikaynak: Scopus
dc.institutionauthor: Akçelik, Zeliha Kaya
dc.institutionauthor: Hoşavcı, Reyhan
dc.institutionauthor: Dik, Sümeyye Zülal
dc.institutionauthor: Aydın, Musa
dc.institutionauthor: Kuş, Zeki
dc.language.iso: en
dc.publisher: Institute of Electrical and Electronics Engineers Inc. [en_US]
dc.relation.ispartof: 2025 10th International Conference on Machine Learning Technologies
dc.relation.publicationcategory: Conference Item - International - Institutional Academic Staff [en_US]
dc.rights: info:eu-repo/semantics/embargoedAccess [en_US]
dc.subject: Shapley value [en_US]
dc.subject: Data valuation [en_US]
dc.subject: MedMNIST [en_US]
dc.subject: Gradient Shapley [en_US]
dc.subject: Improved Gradient Shapley [en_US]
dc.title: Data Valuation with Shapley-based Methods for Medical Image Classification [en_US]
dc.type: Conference Object
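
The abstract describes valuing each training sample by its contribution to model performance. As a rough illustration of that general idea only, the sketch below estimates Shapley values by averaging each point's marginal contribution over random orderings. It is not the paper's Gradient Shapley or Improved Gradient Shapley method, whose details this record does not give. The function name `monte_carlo_shapley`, the `toy_utility` function, and the per-point `quality` scores are hypothetical; in practice the utility would be something like validation accuracy or AUC of a model trained on the subset.

```python
import random
from typing import Callable, List, Sequence


def monte_carlo_shapley(
    n_points: int,
    utility: Callable[[Sequence[int]], float],
    n_permutations: int = 200,
    seed: int = 0,
) -> List[float]:
    """Estimate per-point Shapley values by permutation sampling.

    `utility` maps a list of point indices to a score for that subset.
    """
    rng = random.Random(seed)
    values = [0.0] * n_points
    indices = list(range(n_points))
    for _ in range(n_permutations):
        rng.shuffle(indices)          # draw a random ordering of the points
        subset: List[int] = []
        prev = utility(subset)        # score of the empty set
        for i in indices:
            subset.append(i)
            cur = utility(subset)
            values[i] += cur - prev   # marginal contribution of point i
            prev = cur
    return [v / n_permutations for v in values]


def toy_utility(subset: Sequence[int]) -> float:
    # Hypothetical additive utility: each point has a fixed "quality".
    # A negative score stands in for a noisy or mislabeled sample.
    quality = [0.5, 0.3, 0.2, -0.1]
    return sum(quality[i] for i in subset)


shapley_values = monte_carlo_shapley(4, toy_utility)
print(shapley_values)
```

Because the toy utility is additive, each estimated value matches that point's own quality score up to floating-point rounding, and the values sum to the utility of the full set. With a real model-based utility the estimates would vary with the number of sampled permutations, and low-valued points would be the candidates to drop when training on a reduced subset.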

Files

Original bundle

Name: Akçelik.pdf
Size: 1.45 MB
Format: Adobe Portable Document Format
Description: Conference Item

License bundle

Name: license.txt
Size: 1.44 KB
Format: Item-specific license agreed upon to submission