Fair Dummies: Achieving Equalized Odds by Resampling Sensitive Attributes

This package implements "Fair Dummies": a flexible framework [1] for learning predictive models that approximately satisfy the equalized odds notion of fairness. This is achieved by introducing a general discrepancy function that rigorously quantifies violations of this criterion and by turning that discrepancy into a differentiable penalty that drives the model parameters toward equalized odds.

To rigorously evaluate fitted models, we also implement a formal hypothesis test that detects when a prediction rule violates the equalized odds property. Both the model fitting and the hypothesis test leverage a resampled version of the sensitive attribute that obeys equalized odds by construction.
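
The resampling step is simple to state: a "fair dummy" copy of the sensitive attribute is drawn from its distribution conditional on the response alone, so the copy is independent of the model's prediction given the label. The snippet below is a minimal, self-contained illustration of this idea for a binary attribute and a discrete response, written with numpy; it is a conceptual sketch with hypothetical names, not the API exposed by this package.

    import numpy as np

    def sample_fair_dummies(a, y, rng=None):
        """Resample a binary sensitive attribute A from the empirical P(A | Y).

        The returned copy is independent of any prediction given Y, so the
        triplet (prediction, dummy, Y) satisfies equalized odds by construction.
        """
        rng = np.random.default_rng() if rng is None else rng
        a, y = np.asarray(a), np.asarray(y)
        a_dummy = np.empty_like(a)
        for label in np.unique(y):
            idx = (y == label)
            p = a[idx].mean()  # empirical P(A = 1 | Y = label)
            a_dummy[idx] = rng.binomial(1, p, size=idx.sum())
        return a_dummy

In the framework of [1], both the training penalty and the test statistic compare quantities computed with the real attribute against the same quantities computed with such dummy copies; a persistent gap between the two indicates a violation of equalized odds.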

Lastly, we demonstrate how to incorporate techniques for equitable uncertainty quantification---unbiased for each protected group---to precisely communicate the results of the data analysis.
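
One simple way to obtain group-wise valid uncertainty estimates, in the spirit of equalized coverage [5], is to calibrate a split-conformal prediction interval separately within each protected group. The sketch below illustrates that recipe under stated assumptions (any fitted regressor with a predict method, every group present in the calibration set); it is not the interface of this package.

    import numpy as np

    def groupwise_conformal_intervals(model, X_cal, y_cal, a_cal,
                                      X_test, a_test, alpha=0.1):
        """Split-conformal prediction intervals calibrated separately per group."""
        lower = np.empty(len(X_test))
        upper = np.empty(len(X_test))
        for g in np.unique(a_test):
            cal_idx, test_idx = (a_cal == g), (a_test == g)
            # absolute residuals on this group's calibration points
            scores = np.sort(np.abs(y_cal[cal_idx] - model.predict(X_cal[cal_idx])))
            n = scores.size
            # finite-sample conformal quantile: ceil((n + 1)(1 - alpha))-th order statistic
            k = min(int(np.ceil((n + 1) * (1 - alpha))) - 1, n - 1)
            q = scores[k]
            preds = model.predict(X_test[test_idx])
            lower[test_idx], upper[test_idx] = preds - q, preds + q
        return lower, upper

Because each group gets its own calibration margin, the coverage guarantee holds within every protected group rather than only on average.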

[1] Y. Romano, S. Bates, and E. J. Candès, “Achieving Equalized Odds by Resampling Sensitive Attributes.” Advances in Neural Information Processing Systems (NeurIPS), 2020.

Getting Started

The implementation of [1] is self-contained and written in Python.

Usage

Please refer to synthetic_experiment.ipynb for basic usage. The notebooks real_classification_experiment.ipynb and real_regression_experiment.ipynb demonstrate how to use the software package on real data.

Comparisons to competing methods and additional usage examples of this package can be found in all_classification_experiments.py and all_regression_experiments.py.

Further information and dependencies

This package also implements:

[2] B. H. Zhang, B. Lemoine, and M. Mitchell, "Mitigating unwanted biases with adversarial learning." In Proceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society, pp. 335-340, 2018.

[3] J. Mary, C. Calauzènes, and N. El Karoui, "Fairness-aware learning for continuous attributes and treatments." International Conference on Machine Learning (ICML), 2019.

Dependencies:

[4] Y. Romano, E. Patterson, and E. J. Candès, “Conformalized quantile regression.” Advances in Neural Information Processing Systems (NeurIPS), 2019.

[5] Y. Romano, R. F. Barber, C. Sabatti, and E. J. Candès, “With malice toward none: Assessing uncertainty via equalized coverage.” HDSR, 2019.

Prerequisites

  • python
  • numpy
  • scipy
  • scikit-learn
  • scikit-garden
  • pytorch
  • pandas

Installing

The development version is available on GitHub:

git clone https://github.com/yromano/fair_dummies.git
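
The prerequisites listed above can typically be installed from PyPI; for example (assuming the standard PyPI package names, with PyTorch installed as torch):

pip install numpy scipy scikit-learn scikit-garden pandas torch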

Reproducible Research

The code available under synthetic_experiment.ipynb, all_classification_experiments.py, and all_regression_experiments.py in the repository replicates all experimental results in [1].

Publicly Available Datasets

Data subject to copyright/usage rules

The Medical Expenditure Panel Survey (MEPS) data can be downloaded by following this explanation (code provided by IBM's AIF360).

  • MEPS_21: Medical Expenditure Panel Survey, panel 21.

License

This project is licensed under the MIT License - see the LICENSE file for details.