
DeFungi
Donated on 1/29/2023
DeFungi is a dataset for direct mycological examination of microscopic fungi image. The images are from superficial fungal infections caused by yeasts, moulds, or dermatophyte fungi. The images have been manually labelled into five classes and curated with a subject matter expert assistance. The images have been cropped with automated algorithms to produce the final dataset.
Dataset Characteristics
Image
Subject Area
Computer Science
Associated Tasks
Classification
Attribute Type
Real
# Instances
9114
# Attributes
-
Information
For what purpose was the dataset created?
The dataset was created to develop a machine-learning algorithm for detecting and classifying Fungi images.
Who funded the creation of the dataset?
No funder.
What do the instances in this dataset represent?
Photos
Are there recommended data splits?
No
Does the dataset contain data that might be considered sensitive in any way?
No
Was there any data preprocessing performed?
Yes. The dataset has been pre-processed. All images have been cropped to the region of interest.
Has the dataset been used for any tasks already?
It has been used in this paper: https://arxiv.org/abs/2109.07322
Citation Requests/Acknowledgements
Please cite the below paper published in arXiv.org if you use the dataset. Paper link: https://arxiv.org/abs/2109.07322 @misc{sopo2021defungi, title={DeFungi: Direct Mycological Examination of Microscopic Fungi Images}, author={Camilo Javier Pineda Sopo, Farshid Hajati, and Soheila Gheisari}, year={2021}, eprint={2109.07322}, archivePrefix={arXiv}, primaryClass={eess.IV} }
Introduction Paper
-
Hajati,Farshid. (2023). DeFungi. UCI Machine Learning Repository. https://doi.org/10.48550/arXiv.2109.07322.
@misc{misc_defungi_773, author = {Hajati,Farshid}, title = {{DeFungi}}, year = {2023}, howpublished = {UCI Machine Learning Repository}, note = {{DOI}: \url{10.48550/arXiv.2109.07322}} }
Keywords
Creators
Farshid Hajati
hajati@gmail.com
License
This dataset is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.
This allows for the sharing and adaptation of the datasets for any purpose, provided that the appropriate credit is given.