Explaining models: An empirical study of how explanations impact fairness judgment

Jonathan Dodge, Q. Vera Liao, Yunfeng Zhang, Rachel K.E. Bellamy, Casey Dugan

Research output: Contribution to conference, Paper, peer-review

241 Scopus citations

Abstract

Ensuring fairness of machine learning systems is a human-in-the-loop process. It relies on developers, users, and the general public to identify fairness problems and make improvements. To facilitate the process, we need effective, unbiased, and user-friendly explanations that people can confidently rely on. Towards that end, we conducted an empirical study with four types of programmatically generated explanations to understand how they impact people's fairness judgments of ML systems. With an experiment involving more than 160 Mechanical Turk workers, we show that: 1) Certain explanations are considered inherently less fair, while others can enhance people's confidence in the fairness of the algorithm; 2) Different fairness problems, such as model-wide fairness issues versus case-specific fairness discrepancies, may be more effectively exposed through different styles of explanation; 3) Individual differences, including prior positions and judgment criteria of algorithmic fairness, impact how people react to different styles of explanation. We conclude with a discussion on providing personalized and adaptive explanations to support fairness judgments of ML systems.

Original language: English (US)
Pages: 275-285
Number of pages: 11
State: Published - 2019
Event: 24th ACM International Conference on Intelligent User Interfaces, IUI 2019 - Marina del Rey, United States
Duration: Mar 17 2019 - Mar 20 2019

Conference

Conference: 24th ACM International Conference on Intelligent User Interfaces, IUI 2019
Country/Territory: United States
City: Marina del Rey
Period: 3/17/19 - 3/20/19

All Science Journal Classification (ASJC) codes

  • Software
  • Human-Computer Interaction
