A survey on datasets for fairness-aware machine learning

Download statistics - Document (COUNTER):

Le Quy, T.; Roy, A.; Iosifidis, V.; Zhang, W.; Ntoutsi, E.: A survey on datasets for fairness-aware machine learning. In: Wiley Interdisciplinary Reviews (WIREs). Data Mining and Knowledge Discovery 12 (2022), Nr. 3, e1452. DOI: https://doi.org/10.1002/widm.1452

Repository version

To cite the version in the repository, please use this identifier: https://doi.org/10.15488/14664

Selected time period:

year: 
month: 

Sum total of downloads: 59




Thumbnail
Abstract: 
As decision-making increasingly relies on machine learning (ML) and (big) data, the issue of fairness in data-driven artificial intelligence systems is receiving increasing attention from both research and industry. A large variety of fairness-aware ML solutions have been proposed which involve fairness-related interventions in the data, learning algorithms, and/or model outputs. However, a vital part of proposing new approaches is evaluating them empirically on benchmark datasets that represent realistic and diverse settings. Therefore, in this paper, we overview real-world datasets used for fairness-aware ML. We focus on tabular data as the most common data representation for fairness-aware ML. We start our analysis by identifying relationships between the different attributes, particularly with respect to protected attributes and class attribute, using a Bayesian network. For a deeper understanding of bias in the datasets, we investigate interesting relationships using exploratory analysis. This article is categorized under: Commercial, Legal, and Ethical Issues > Fairness in Data Mining Fundamental Concepts of Data and Knowledge > Data Concepts Technologies > Data Preprocessing.
License of this version: CC BY 4.0 Unported
Document Type: Article
Publishing status: publishedVersion
Issue Date: 2022
Appears in Collections:Forschungszentren

distribution of downloads over the selected time period:

downloads by country:

pos. country downloads
total perc.
1 image of flag of Germany Germany 26 44.07%
2 image of flag of Netherlands Netherlands 12 20.34%
3 image of flag of United States United States 7 11.86%
4 image of flag of China China 4 6.78%
5 image of flag of Sweden Sweden 3 5.08%
6 image of flag of Spain Spain 2 3.39%
7 image of flag of Vietnam Vietnam 1 1.69%
8 image of flag of Russian Federation Russian Federation 1 1.69%
9 image of flag of Indonesia Indonesia 1 1.69%
10 image of flag of France France 1 1.69%
    other countries 1 1.69%

Further download figures and rankings:


Hinweis

Zur Erhebung der Downloadstatistiken kommen entsprechend dem „COUNTER Code of Practice for e-Resources“ international anerkannte Regeln und Normen zur Anwendung. COUNTER ist eine internationale Non-Profit-Organisation, in der Bibliotheksverbände, Datenbankanbieter und Verlage gemeinsam an Standards zur Erhebung, Speicherung und Verarbeitung von Nutzungsdaten elektronischer Ressourcen arbeiten, welche so Objektivität und Vergleichbarkeit gewährleisten sollen. Es werden hierbei ausschließlich Zugriffe auf die entsprechenden Volltexte ausgewertet, keine Aufrufe der Website an sich.

Search the repository


Browse