A Large-Scale Peripheral Blood Cell Dataset for Automated Hematological Analysis

dc.contributor.authorYarıkan, Atıf Eren
dc.contributor.authorÖrer, Can
dc.contributor.authorAkyıldız, Volkan
dc.contributor.authorKuş, Zeki
dc.contributor.authorAydın, Musa
dc.contributor.authorPalaoğlu, Kerim Erhan
dc.contributor.authorİncir, Said
dc.contributor.authorBaysal, Kemal
dc.contributor.authorÖzçelik, Cemal
dc.contributor.authorKiraz, Berna
dc.contributor.authorKiraz, Alper
dc.date.accessioned2026-03-27T10:56:40Z
dc.date.issued2026
dc.departmentFSM Vakıf Üniversitesi, Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü
dc.departmentFSM Vakıf Üniversitesi, Mühendislik Fakültesi, Yapay Zeka ve Veri Mühendisliği Bölümü
dc.description.abstractWhite blood cell classification is fundamental to hematological diagnosis, yet existing datasets are limited in scale and class diversity. We present a comprehensive peripheral blood cell dataset comprising 31,489 high-resolution microscopic images across 13 distinct cell classes, representing the largest publicly available collection for automated blood cell analysis. Images are acquired using the Sysmex DI-60 system from May-Grünwald-Giemsa-stained blood smears at 100 × magnification under standardized laboratory conditions. Expert hematologists with over 10 years of experience performed manual annotation with high inter-rater agreement (Cohen’s kappa >0.85 for all classes). The dataset includes common cell types such as segmented neutrophils and lymphocytes, alongside diagnostically critical but rare subtypes, including myelocytes, blasts, and reactive lymphocytes. Images are organized into training, validation, and test splits (70:10:20 ratio) with consistent 368 × 368 pixel resolution. Baseline experiments using 14 deep learning architectures demonstrate the dataset’s utility, with DenseNet-121 achieving 95.23% accuracy. KU-Optofil PBC Dataset addresses critical gaps in medical image analysis datasets and supports the development of robust automated hematology systems for clinical applications.
dc.identifier.citationYARIKAN, Atıf Eren, Can ÖRER, Volkan AKYILDIZ, Zeki KUŞ, Musa AYDIN, Kerim Erhan PALAOĞLU, Said İNCİR, Kemal BAYSAL, Cemal ÖZÇELİK, Berna KİRAZ & Alper KİRAZ. "A Large-Scale Peripheral Blood Cell Dataset for Automated Hematological Analysis". Scientific Data, 13.1 (2026): 1-11.
dc.identifier.doi10.1038/s41597-026-06761-y
dc.identifier.endpage11
dc.identifier.issue1
dc.identifier.orcidhttps://orcid.org/0000-0001-8762-7233
dc.identifier.startpage1
dc.identifier.urihttps://www.nature.com/articles/s41597-026-06761-y
dc.identifier.urihttps://hdl.handle.net/11352/6065
dc.identifier.volume13
dc.identifier.wosqualityQ1
dc.indekslendigikaynakWeb of Science
dc.language.isoen
dc.publisherNature
dc.relation.ispartofScientific Data
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
dc.rightsinfo:eu-repo/semantics/openAccess
dc.titleA Large-Scale Peripheral Blood Cell Dataset for Automated Hematological Analysis
dc.typeArticle

Dosyalar

Orijinal paket

Listeleniyor 1 - 1 / 1
Yükleniyor...
Küçük Resim
İsim:
Yarıkan.pdf
Boyut:
3.48 MB
Biçim:
Adobe Portable Document Format

Lisans paketi

Listeleniyor 1 - 1 / 1
Yükleniyor...
Küçük Resim
İsim:
license.txt
Boyut:
1.17 KB
Biçim:
Item-specific license agreed upon to submission
Açıklama: