Engineering Fair Machine Learning Pipelines

Martin Hirzel; Kiran Kate; Parikshit Ram

ICLR 2021

Workshop paper

03 May 2021

Engineering Fair Machine Learning Pipelines

Download paper

Abstract

Data splits and data preparation during fairness mitigation are known to influence the performance of output models. We propose including protected attributes in stratification when splitting a dataset. We also describe fairness patterns for assembling fair pipelines that include data preparation, estimators, and mitigators. This paper introduces an open-source Python library lale.lib.aif360 that offers sklearn compatible implementations of fair stratification and fairness patterns.

Workshop paper