A Large-Scale Peripheral Blood Cell Dataset for Automated Hematological Analysis
| dc.contributor.author | Yarıkan, Atıf Eren | |
| dc.contributor.author | Örer, Can | |
| dc.contributor.author | Akyıldız, Volkan | |
| dc.contributor.author | Kuş, Zeki | |
| dc.contributor.author | Aydın, Musa | |
| dc.contributor.author | Palaoğlu, Kerim Erhan | |
| dc.contributor.author | İncir, Said | |
| dc.contributor.author | Baysal, Kemal | |
| dc.contributor.author | Özçelik, Cemal | |
| dc.contributor.author | Kiraz, Berna | |
| dc.contributor.author | Kiraz, Alper | |
| dc.date.accessioned | 2026-03-27T10:56:40Z | |
| dc.date.issued | 2026 | |
| dc.department | FSM Vakıf Üniversitesi, Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü | |
| dc.department | FSM Vakıf Üniversitesi, Mühendislik Fakültesi, Yapay Zeka ve Veri Mühendisliği Bölümü | |
| dc.description.abstract | White blood cell classification is fundamental to hematological diagnosis, yet existing datasets are limited in scale and class diversity. We present a comprehensive peripheral blood cell dataset comprising 31,489 high-resolution microscopic images across 13 distinct cell classes, representing the largest publicly available collection for automated blood cell analysis. Images are acquired using the Sysmex DI-60 system from May-Grünwald-Giemsa-stained blood smears at 100 × magnification under standardized laboratory conditions. Expert hematologists with over 10 years of experience performed manual annotation with high inter-rater agreement (Cohen’s kappa >0.85 for all classes). The dataset includes common cell types such as segmented neutrophils and lymphocytes, alongside diagnostically critical but rare subtypes, including myelocytes, blasts, and reactive lymphocytes. Images are organized into training, validation, and test splits (70:10:20 ratio) with consistent 368 × 368 pixel resolution. Baseline experiments using 14 deep learning architectures demonstrate the dataset’s utility, with DenseNet-121 achieving 95.23% accuracy. KU-Optofil PBC Dataset addresses critical gaps in medical image analysis datasets and supports the development of robust automated hematology systems for clinical applications. | |
| dc.identifier.citation | YARIKAN, Atıf Eren, Can ÖRER, Volkan AKYILDIZ, Zeki KUŞ, Musa AYDIN, Kerim Erhan PALAOĞLU, Said İNCİR, Kemal BAYSAL, Cemal ÖZÇELİK, Berna KİRAZ & Alper KİRAZ. "A Large-Scale Peripheral Blood Cell Dataset for Automated Hematological Analysis". Scientific Data, 13.1 (2026): 1-11. | |
| dc.identifier.doi | 10.1038/s41597-026-06761-y | |
| dc.identifier.endpage | 11 | |
| dc.identifier.issue | 1 | |
| dc.identifier.orcid | https://orcid.org/0000-0001-8762-7233 | |
| dc.identifier.startpage | 1 | |
| dc.identifier.uri | https://www.nature.com/articles/s41597-026-06761-y | |
| dc.identifier.uri | https://hdl.handle.net/11352/6065 | |
| dc.identifier.volume | 13 | |
| dc.identifier.wosquality | Q1 | |
| dc.indekslendigikaynak | Web of Science | |
| dc.language.iso | en | |
| dc.publisher | Nature | |
| dc.relation.ispartof | Scientific Data | |
| dc.relation.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | |
| dc.rights | info:eu-repo/semantics/openAccess | |
| dc.title | A Large-Scale Peripheral Blood Cell Dataset for Automated Hematological Analysis | |
| dc.type | Article |










