Mitigating Subgroup Unfairness in Machine Learning Classifiers: A Data-Driven Approach

Imbalanced sample collection can lead to unfairness in learned models due to historical biases and a lack of control over data collection. Through the introduction of “Implicit Biased Set (IBS)", we propose an efficient pre-processing algorithm to identify IBS and then propose data remedy techniques to balance the data collection in IBS.

This repository contains code for the Identification Algorithms, Data Remedy Methods, Divexplorer metrics and ML model settings and a demo notebook to run both effectiveness and efficency experiments in the paper.

Prerequisites

To install the package and prepare for use, run:

git clone https://github.com/niceIrene/remedy.git

pip install -r requirements.txt

The following python packages are required to run the code: divexplorer, pandas, sklearn, numpy, sympy.

Demo

For a demonstration of our working code and results, use this: COMPAS Dataset Effectiveness Demo.

It is also possible to view the individual algorithms for:

Data Remedy Methods

Identification Algorithms

ML model settings and divexplorer

Datasets

Adult Dataset: https://archive.ics.uci.edu/ml/datasets/adult.

COMPAS Dataset: https://github.com/propublica/compas-analysis.

Law School Dataset: https://www.kaggle.com/datasets/danofer/law-school-admissions-bar-passage

Name		Name	Last commit message	Last commit date
Latest commit History 54 Commits
Time		Time
datasets		datasets
Demo_Adult.ipynb		Demo_Adult.ipynb
Demo_Bar.ipynb		Demo_Bar.ipynb
Demo_Effectiveness_Propublica.ipynb		Demo_Effectiveness_Propublica.ipynb
README.md		README.md
compare_gerrymandering_adult.ipynb		compare_gerrymandering_adult.ipynb
identification.py		identification.py
models.py		models.py
remedy.py		remedy.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Mitigating Subgroup Unfairness in Machine Learning Classifiers: A Data-Driven Approach

Prerequisites

Demo

Datasets

About

Releases

Packages

Contributors 2

Languages

niceIrene/remedy

Folders and files

Latest commit

History

Repository files navigation

Mitigating Subgroup Unfairness in Machine Learning Classifiers: A Data-Driven Approach

Prerequisites

Demo

Datasets

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages