American Economic Association

Reducing Discrimination in Algorithmic Risk Assessment

David Arnold

,

University of California-San Diego

Will Dobbie

,

Harvard University

Peter Hull

,

Brown University

View Abstract

Abstract

Predictive risk scores increasingly guide many high-stakes decisions, with growing concerns that they are "unfair" to protected groups. We develop new quasi-experimental tools to assess the fairness of existing risk score algorithms, and to test conventional strategies for reducing their unfairness. In doing so we address a fundamental selection challenge: the algorithm's target (e.g. pretrial misconduct potential) may only be observed among some individuals (defendants) chosen non-randomly by a human decision-maker (bail judge). Leveraging quasi-random judge assignment in NYC, we find that a conventional race-blind algorithm for predicting pretrial misconduct yields systemically higher risk predictions for Black defendants than white defendants with the same true misconduct potential. A conventional adjustment of the algorithmic inputs to purge their correlation with race leads to an overcorrection, because true risk is correlated with race. Fixing this overcorrection with our quasi-experimental tools equalizes risk scores among equally risk white and Black defendants, while also improving the overall accuracy of risk assessment.

Counterfactual Risk Assessments under Unmeasured Confounding

Amanda Coston

,

Carnegie Mellon University

Ashesh Rambachan

,

Harvard University

Alexandra Chouldechova

,

Carnegie Mellon University

Abstract

Statistical risk assessments inform consequential decisions such as pretrial release in criminal justice, and loan approvals in consumer finance. Such risk assessments make counterfactual predictions, predicting the likelihood of an outcome under a proposed decision (e.g., what would happen if we approved this loan?). A central challenge, however, is that there may have been unmeasured confounders that jointly affected past decisions and outcomes in the historical data. This paper proposes a tractable mean outcome sensitivity model that bounds the extent to which unmeasured confounders could affect outcomes on average. The mean outcome sensitivity model partially identifies the conditional likelihood of the outcome under the proposed decision, popular predictive performance metrics (e.g., accuracy, calibration, TPR, FPR), and commonly-used predictive disparities. We derive their sharp identified sets, and we then solve three tasks that are essential to deploying statistical risk assessments in high-stakes settings. First, we propose a doubly-robust learning procedure for the bounds on the conditional likelihood of the outcome under the proposed decision. Second, we translate our estimated bounds on the conditional likelihood of the outcome under the proposed decision into a robust, plug-in decision-making policy. Third, we develop doubly-robust estimators of the bounds on the predictive performance of an existing risk assessment. We apply our methods to analyze a real-world credit-scoring task, illustrating how varying assumptions on unmeasured confounding leads to substantive changes in the credit score's predictions and evaluations of its predictive disparities.

The Challenge of Understanding What Users Want: Inconsistent Preferences and Engagement Optimization

Jon Kleinberg

,

Cornell University

Sendhil Mullainathan

,

University of Chicago

Manish Raghavan

,

Harvard University

Abstract

Online platforms have a wealth of data, run countless experiments and use industrial-scale algorithms to optimize user experience. Despite this, many users seem to regret the time they spend on these platforms. One possible explanation is that incentives are misaligned: platforms are not optimizing for user happiness. We suggest the problem runs deeper, transcending the specific incentives of any particular platform, and instead stems from a mistaken foundational assumption. To understand what users want, platforms look at what users do. This is a kind of revealed-preference assumption that is ubiquitous in the way user models are built. Yet research has demonstrated, and personal experience affirms, that we often make choices in the moment that are inconsistent with what we actually want. The behavioral economics and psychology literatures suggest, for example, that we can choose mindlessly or that we can be too myopic in our choices, behaviors that feel entirely familiar on online platforms. In this work, we develop a model of media consumption where users have inconsistent preferences. We consider an altruistic platform which simply wants to maximize user utility, but only observes behavioral data in the form of the user’s engagement. We show how our model of users’ preference inconsistencies produces phenomena that are familiar from everyday experience, but difficult to capture in traditional user interaction models. These phenomena include users who have long sessions on a platform but derive very little utility from it, and platform changes that steadily raise user engagement before abruptly causing users to go “cold turkey” and quit. A key ingredient in our model is a formulation for how platforms determine what to show users: they optimize over a large set of potential content (the content manifold) parametrized by underlying features of the content. Whether improving engagement improves user welfare depends on the direction of movement in the content manifold: for certain directions of change, increasing engagement makes users less happy, while in other directions on the same manifold, increasing engagement makes users happier. We provide a characterization of the structure of content manifolds for which increasing engagement fails to increase user utility. By linking these effects to abstractions of platform design choices, our model thus creates a theoretical framework and vocabulary in which to explore interactions between design, behavioral science, and social media.

Algorithmic Design: Fairness Versus Accuracy

Annie Liang

,

Northwestern University

Jay Lu

,

University of California-Los Angeles

Xiaosheng Mu

,

Princeton University

View Abstract

Abstract

Algorithms are increasingly used to guide consequential decisions, such as who should be granted bail or be approved for a loan. Motivated by growing empirical evidence, regulators are concerned about the possibility that the errors of these algorithms differ sharply across subgroups of the population. What are the tradeoffs between accuracy and fairness, and how do these tradeoffs depend on the inputs to the algorithm? We propose a model in which a designer chooses an algorithm that maps observed inputs into decisions, and introduce a fairness-accuracy Pareto frontier. We identify how the algorithm's inputs govern the shape of this frontier, showing (for example) that access to group identity reduces the error for the worse-off group everywhere along the frontier. We then apply these results to study an "input-design" problem where the designer controls the algorithm's inputs (for example, by legally banning an input), but the algorithm itself is chosen by another agent. We show that: (1) all designers strictly prefer to allow group identity if and only if the algorithm's other inputs satisfy a condition we call group-balance; (2) all designers strictly prefer to allow any input (including potentially biased inputs such as test scores) so long as group identity is permitted as an input, but may prefer to ban it when group identity is not.

Algorithmic Design for Social Decision Making

Saturday, Jan. 7, 2023 8:00 AM - 10:00 AM (CST)

Reducing Discrimination in Algorithmic Risk Assessment

Abstract

Counterfactual Risk Assessments under Unmeasured Confounding

Abstract

The Challenge of Understanding What Users Want: Inconsistent Preferences and Engagement Optimization

Abstract

Algorithmic Design: Fairness Versus Accuracy

Abstract

JEL Classifications

This website uses cookies.

Algorithmic Design for Social Decision Making

Saturday, Jan. 7, 2023 8:00 AM - 10:00 AM (CST)

Reducing Discrimination in Algorithmic Risk Assessment

Abstract

Counterfactual Risk Assessments under Unmeasured Confounding

Abstract

The Challenge of Understanding What Users Want: Inconsistent Preferences and Engagement Optimization

Abstract

Algorithmic Design: Fairness Versus Accuracy

Abstract

JEL Classifications