Publicly available open-access OCT datasets for retinal layer segmentation have been limited in scope, often being small in size, specific to a single disease, or containing only one grading. This dataset improves upon this with multi-grader and multi-disease labels for training machine learning-based algorithms. The proposed dataset covers three subsets of scans (Age-related Macular Degeneration, Diabetic Macular Edema, and healthy) and annotations for two types of tasks (semantic segmentation and object detection). This dataset compiled 5016 pixel-wise manual labels for 1672 OCT scans featuring 5 layer boundaries for three different disease classes to support development of automatic techniques. A subset of data (566 scans across 9 classes of disease biomarkers) was subsequently labeled for disease features for 4698 bounding box annotations. To minimize bias, images were shuffled and distributed among graders. Retinal layers were corrected, and outliers identified using the interquartile range (IQR). This step was iterated three times, improving layer annotations’ quality iteratively, ensuring a reliable dataset for automated retinal image analysis.

OCT5k: A dataset of multi-disease and multi-graded annotations for retinal layers / Arikan, M.; Willoughby, J.; Ongun, S.; Sallo, F.; Montesel, A.; Ahmed, H.; Hagag, A.; Book, M.; Faatz, H.; Cicinelli, M. V.; Fawzi, A. A.; Podkowinski, D.; Cilkova, M.; De Almeida, D. M.; Zouache, M.; Ramsamy, G.; Lilaonitkul, W.; Dubis, A. M.. - In: SCIENTIFIC DATA. - ISSN 2052-4463. - 12:1(2025). [10.1038/s41597-024-04259-z]

OCT5k: A dataset of multi-disease and multi-graded annotations for retinal layers

Cicinelli M. V.;
2025-01-01

Abstract

Publicly available open-access OCT datasets for retinal layer segmentation have been limited in scope, often being small in size, specific to a single disease, or containing only one grading. This dataset improves upon this with multi-grader and multi-disease labels for training machine learning-based algorithms. The proposed dataset covers three subsets of scans (Age-related Macular Degeneration, Diabetic Macular Edema, and healthy) and annotations for two types of tasks (semantic segmentation and object detection). This dataset compiled 5016 pixel-wise manual labels for 1672 OCT scans featuring 5 layer boundaries for three different disease classes to support development of automatic techniques. A subset of data (566 scans across 9 classes of disease biomarkers) was subsequently labeled for disease features for 4698 bounding box annotations. To minimize bias, images were shuffled and distributed among graders. Retinal layers were corrected, and outliers identified using the interquartile range (IQR). This step was iterated three times, improving layer annotations’ quality iteratively, ensuring a reliable dataset for automated retinal image analysis.
File in questo prodotto:
File Dimensione Formato  
unpaywall-bitstream--329149434.pdf

accesso aperto

Tipologia: PDF editoriale (versione pubblicata dall'editore)
Licenza: Creative commons
Dimensione 2.68 MB
Formato Adobe PDF
2.68 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.11768/190783
Citazioni
  • ???jsp.display-item.citation.pmc??? 4
  • Scopus 5
  • ???jsp.display-item.citation.isi??? 5
social impact