LICENCE
Creative Commons CCZero (CC0-1.0), Open Data Commons Public Domain Dedication and Licence (PDDL-1.0)
DOMAIN
Health, Science and Technology
COVERAGE
EU-wide
FORMATS
ARFF, CSV, JSON, RDF, XLS / XLSX
PERSONAL DATA PROTECTION
No personal data
* Please note that the classification is taken from the original source
GEMLeR provides a collection of gene expression datasets for benchmarking gene-expression-oriented machine learning algorithms. The datasets can be used to estimate quality metrics (e.g. accuracy, precision, area under the ROC curve) for classification, feature selection, or clustering algorithms. The repository was inspired by a growing need in the machine learning and bioinformatics communities for a collection of microarray classification problems that different researchers could share. With a common set of problems, different classification and feature selection techniques can be compared to each other directly.
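As a rough illustration of the quality metrics mentioned above, the following Python sketch computes accuracy, precision, and area under the ROC curve for a binary classifier. It is not part of GEMLeR: the function names, the toy labels, and the scores are hypothetical, and in practice the labels would come from one of the datasets and the scores from the algorithm being benchmarked.

```python
# Minimal sketch, standard library only. All names and the toy data are
# illustrative assumptions, not part of the GEMLeR repository.

def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label equals the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def precision(y_true, y_pred, positive=1):
    """Fraction of samples predicted positive that are truly positive."""
    truths_of_predicted_pos = [t for t, p in zip(y_true, y_pred) if p == positive]
    if not truths_of_predicted_pos:
        return 0.0  # convention chosen here: no positive predictions gives 0
    hits = sum(t == positive for t in truths_of_predicted_pos)
    return hits / len(truths_of_predicted_pos)


def roc_auc(y_true, scores, positive=1):
    """Area under the ROC curve via pairwise comparison.

    Equals the probability that a randomly chosen positive sample gets a
    higher score than a randomly chosen negative one (ties count as 0.5).
    """
    pos = [s for t, s in zip(y_true, scores) if t == positive]
    neg = [s for t, s in zip(y_true, scores) if t != positive]
    if not pos or not neg:
        raise ValueError("both classes must be present to compute AUC")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Toy example: six samples with made-up class labels and classifier scores.
y_true = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.6, 0.3, 0.1]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # threshold of 0.5 is an assumption

print("accuracy: ", accuracy(y_true, y_pred))
print("precision:", precision(y_true, y_pred))
print("ROC AUC:  ", roc_auc(y_true, scores))
```

Reporting the same metrics on the same datasets is what makes results from different techniques comparable. The pairwise AUC computation is quadratic in the number of samples, which is acceptable for a sketch but not for large evaluations.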