Fairness Through Computationally-Bounded Awareness

Michael P. Kim, Omer Reingold, Guy N. Rothblum

Introduction

More and more, machine learning systems are being used to make predictions about people. Algorithmic predictions are now being used to answer questions of significant personal consequence; for instance, Is this person likely to repay a loan? or Is this person likely to recommit a crime? . As these classification systems have become more ubiquitous, concerns have also grown that classifiers obtained via machine learning might discriminate based on sensitive attributes like race, gender, or sexual orientation. Indeed, machine-learned classifiers run the risk of perpetuating or amplifying historical biases present in the training data. Examples of discrimination in classification have been well-illustrated ; nevertheless, developing a systematic approach to fairness has been challenging. Often, it feels that the objectives of fair classification are at odds with obtaining high-utility predictions.

In an influential work, Dwork et al. proposed a framework to resolve the apparent conflict between utility and fairness, which they call “fairness through awareness." This framework takes the perspective that a fair classifier should treat similar individuals similarly. The work formalizes this abstract goal by assuming access to a task-specific similarity metric dd on pairs of individuals. The proposed notion of fairness requires that if the distance between two individuals is small, then the predictions of a fair classifier cannot be very different. More formally, for some small constant τ0\tau\geq 0, we say a hypothesis f:Xf:{\cal X}\to satisfies (d,τ)(d,\tau)-metric fairnessNote the definition given in is slightly different; in particular, they propose a more general Lipschitz condition, but fix τ=0\tau=0. if the following (approximate) Lipschitz condition holds for all pairs of individuals from the population X{\cal X}.

Subject to these intuitive similarity constraints, the classifier may be chosen to maximize utility. Note that, in general, the metric may be designed externally (say, by a regulatory agency) to address legal and ethical concerns, independent from the task of learning. In particular, in certain settings, the metric designers may have access to a different set of features than the learner. For instance, perhaps the metric designers have access to sensitive attributes, but for legal, social, or pragmatic reasons, the learner does not. In addition to its conceptual simplicity, the modularity of fairness through awareness makes it a very appealing framework. Currently, there are many (sometimes contradictory) notions of what it means for a classifier to be fair , and there is much debate on which definitions should be applied in a given context. Discrimination comes in many forms and classification is used in a variety of settings, so naturally, it is hard to imagine any universally-applicable definition of fairness. Basing fairness on a similarity metric offers a flexible approach for formalizing a variety of guarantees and protections from discrimination.

Still, a challenging aspect of this approach is the assumption that the similarity metric is known for all pairs of individuals.Indeed, identifies this assumption as “one of the most challenging aspects” of the framework. Deciding on an appropriate metric is itself a delicate matter and could require human input from sociologists, legal scholars, and specialists with domain expertise. For instance, in the loan repayment example, a simple, seemingly-objective metric might be a comparison of credit scores. A potential concern, however, is that these scores might themselves be biased (i.e. encode historical discriminations). In this case, a more nuanced metric requiring human input may be necessary. Further, if the metric depends on features that are latent to the learner (e.g. some missing sensitive feature) then the metric could appear arbitrarily complex to the learner. As such, in many realistic settings, the resulting metric will not be a simple function of the learner’s feature vectors of individuals.

In most machine learning applications, where the universe of individuals is assumed to be very large, even writing down an appropriate metric could be completely infeasible. In these cases, rather than require the metric value to be specified for all pairs of individuals, we could instead ask a panel of experts to provide similarity scores for a small sample of pairs of individuals. While it is information-theoretically impossible to guarantee metric fairness from a sampling-based approach, we still might hope to provide a strong, provable notion of fairness that maintains the theoretical appeal and practical modularity of the fairness through awareness framework.

In this work, we propose a new theoretical framework for fair classification based on fairness through awareness – which we dub “fairness through computationally-bounded awareness” – that eliminates the considerable issue of requiring the metric to be known exactly. Our approach maintains the simplicity and flexibility of fairness through awareness, but provably only requires a small number of random samples from the underlying metric, even though we make no structural assumptions about the metric. In particular, our approach works even if the metric provably cannot be learned. Specifically, our notion will require that a fair classifier treat similar subpopulations of individuals similarly, in a sense we will make formal next. While our definition relaxes fairness through awareness, we argue that it still protects against important forms of discrimination that the original work aimed to combat; further, we show that stronger notions necessarily require a larger sample complexity from the metric. As in , we investigate how to learn a classifier that achieves optimal utility under similarity-based fairness constraints, assuming a weaker model of limited access to the metric. We give positive and negative results that show connections between achieving our fairness notion and learning.

Setting and metric multifairness

The focus on linear functions is not too restrictive; in particular, by increasing the dimension to n=O(nk)n^{\prime}=O(n^{k}), we can learn any degree-kk polynomial function of the original features. By increasing kk, we can approximate increasingly complex functions.

1 Metric multifairness

We define our relaxation of metric fairness with respect to a rich class of statistical tests on the pairs of individuals. Let a comparison be any subset of the pairs of X×X{\cal X}\times{\cal X}. Our definition, which we call metric multifairness, is parameterized by a collection of comparisons C2X×X{\cal C}\subseteq 2^{{\cal X}\times{\cal X}} and requires that a hypothesis appear Lipschitz according to all of the statistical tests defined by the comparisons SCS\in{\cal C}.

Let C2X×X{\cal C}\subseteq 2^{{\cal X}\times{\cal X}} be a collection of comparisons and let d:X×Xd:{\cal X}\times{\cal X}\to be a metric. For some constants τ0\tau\geq 0, a hypothesis ff is (C,d,τ)({\cal C},d,\tau)-metric multifair if for all SCS\in{\cal C},

To begin, note that metric multifairness is indeed a relaxation of metric fairness; if we take the collection C={{(x,x)}:x,xX×X}{\cal C}=\left\{\left\{(x,x^{\prime})\right\}:x,x^{\prime}\in{\cal X}\times{\cal X}\right\} to be the collection of all pairwise comparisons, then (C,d,τ)({\cal C},d,\tau)-metric multifairness is equivalent to (d,τ)(d,\tau)-metric fairness.

In order to achieve metric multifairness from a small sample from the metric, however, we need a lower bound on the density of each comparison in C{\cal C}; in particular, we can’t hope to enforce metric fairness from a small sample. For some γ>0\gamma>0, we say that a collection of comparisons C{\cal C} is γ\gamma-large if for all SCS\in{\cal C}, Pr(x,x)M[(x,x)S]γ\Pr_{(x,x^{\prime})\sim{\cal M}}[(x,x^{\prime})\in S]\geq\gamma. A natural next choice for C{\cal C} would be a collection of comparisons that represent the Cartesian products between traditionally-protected groups, defined by race, gender, etc. In this case, as long as the minority populations are not too small, then a random sample from the metric will concentrate around the true expectation, and we could hope to enforce this statistical relaxation of metric fairness. While this approach is information-theoretically feasible, its protections are very weak.

To highlight this weakness, suppose we want to predict the probability individuals will repay a loan, and our metric is an adjusted credit score. Even after adjusting scores, two populations P,QXP,Q\subseteq{\cal X} (say, defined by race) may have large average distance because overall PP has better credit than QQ; still, within PP and QQ, there may be significant subpopulations PPP^{\prime}\subseteq P and QQQ^{\prime}\subseteq Q that should be treated similarly (possibly representing the qualified members of each group). In this case, a coarse statistical relaxation of metric fairness will not require that a classifier treat PP^{\prime} and QQ^{\prime} similarly; instead, the classifier could treat everyone in PP better than everyone in QQ – including treating unqualified members of PP better than qualified members of QQ. Indeed, the weaknesses of broad-strokes statistical definitions served as motivation for the original work of . We would like to choose a class C{\cal C} that strengthens the fairness guarantees of metric multifairness, but maintains its efficient sample complexity.

While we can define metric multifairness with respect to any collection C{\cal C}, typically, we will think of C{\cal C} as a rich class of overlapping subsets; equivalently, we can think of the collection C{\cal C} as an expressive class of boolean functions, where for SCS\in{\cal C}, cS(x,x)=1c_{S}(x,x^{\prime})=1 if and only if (x,x)S(x,x^{\prime})\in S. In particular, C{\cal C} should be much more expressive than simply defining comparisons across traditionally-protected groups. The motivation for choosing such an expressive class C{\cal C} is exemplified in the following proposition.

Suppose there is some SCS\in{\cal C}, such that E(x,x)S[d(x,x)]ε\operatorname*{\textnormal{{E}}}_{(x,x^{\prime})\sim S}[d(x,x^{\prime})]\leq\varepsilon. Then if ff is (C,d,τ)({\cal C},d,\tau)-metric multifair, then ff satisfies (d,(ε+τ)/p)(d,(\varepsilon+\tau)/p)-metric fairness for at least a (1p)(1-p)-fraction of the pairs in SS.

That is, if there is some subset SCS\in{\cal C} that identifies a set of pairs whose metric distance is small, then any metric multifair hypothesis must also satisfy the stronger individual metric fairness notion on many pairs from SS. This effect will compound if many different (possibly overlapping) comparisons are identified that have small average distance. We emphasize that these small-distance comparisons are not known before sampling from the metric; indeed, this would imply the metric was (approximately) known a priori. Still, if the class C{\cal C} is rich enough to correlate well with various comparisons that reveal significant information about the metric, then any metric multifair hypothesis will satisfy individual-level fairness on a significant fraction of the population!

While increasing the expressiveness of C{\cal C} increases the strength of the fairness guarantee, in order to learn from a small sample, we cannot choose C{\cal C} to be arbitrarily complex. Thus, in choosing C{\cal C} we must balance the strength of the fairness guarantee with the information bottleneck in accessing dd through random samples. Our resolution to these competing needs is complexity-theoretic: while information-theoretically, we can’t hope to ensure fair treatment across all subpopulations, we can hope ensure fair treatment across efficiently-identifiable subpopulations. For instance, if we take C{\cal C} to be a family defined according to some class of computations of bounded dimension – think, the set of conjunctions of a constant number of boolean features or short decision trees – then we can hope to accurately estimate and enforce the metric multifairness conditions. Taking such a bounded C{\cal C} ensures that a hypothesis will be fair on all comparisons identifiable within this computational bound. This is the sense in which metric multifairness provides fairness through computationally-bounded awareness.

2 Learning model

Throughout, our goal is to learn a hypothesis from noisy samples from the metric that satisfies multifairness. Specifically, we assume an algorithm can obtain a small number of independent random metric samples (x,x,Δ(x,x))X×X×(x,x^{\prime},\Delta(x,x^{\prime}))\in{\cal X}\times{\cal X}\times where (x,x)M(x,x^{\prime})\sim{\cal M} is drawn at random over the distribution of pairs of individuals, and Δ(x,x)\Delta(x,x^{\prime}) is a random variable of bounded magnitude with E[Δ(x,x)]=d(x,x)\operatorname*{\textnormal{{E}}}[\Delta(x,x^{\prime})]=d(x,x^{\prime}).

We emphasize that this is a very limited access model. As Theorem 2 shows we achieve (C,d,τ)({\cal C},d,\tau)-metric multifairness from a number of samples that depends logarithmically on the size of C{\cal C} independent of the complexity of the similarity metric.Alternatively, for continuous classes of C{\cal C}, we can replace log(C)\log(\left|{\cal C}\right|) with some notion of dimension (VC-dimension, metric entropy, etc.) through a uniform convergence argument. Recall that d:X×Xd:{\cal X}\times{\cal X}\to can be an arbitrary symmetric function; thus, the learner does not necessarily have enough information to learn dd. Still, for exponentially-sized C{\cal C}, the learner can guarantee metric multifairness from a polynomial-sized sample, and the strength of the guarantee will scale up with the complexity of C{\cal C} (as per Propsition 1).

In order to ensure a strong notion of fairness, we assume that the subpopulations we wish to protect are well-represented in the pairs drawn from M{\cal M}. This assumption, while important, is not especially restrictive, as we think of the metric samples as coming from a regulatory committee or ethically-motivated party; in other words, in practical settings, it is reasonable to assume that one can choose the metric sampling distribution based on the notion of fairness one wishes to enforce.

Label access.

When we learn linear families, our goal will be to learn from a sample of labeled examples. We assume the algorithm can ask for independent random samples x,yDx,y\sim{\cal D}.

Measuring optimality.

To evaluate the utility guarantees of our learned predictions, we take a comparative approach. Suppose H2X×X{\cal H}\subseteq 2^{{\cal X}\times{\cal X}} is a collection of comparisons. For ε0\varepsilon\geq 0, we say a hypothesis ff is (H,ε)({\cal H},\varepsilon)-optimal with respect to F{\cal F}, if

where fFf^{*}\in{\cal F} is an optimal (H,d,0)({\cal H},d,0)-metric multifair hypothesis.

Learning a metric multifair hypothesis

As in , we formulate the problem of learning a fair set of predictions as a convex program. Our objective is to minimize the expected loss Ex,yD[L(f(x),y)]\operatorname*{\textnormal{{E}}}_{x,y\sim{\cal D}}[L(f(x),y)], subject to the multifairness constraints defined by C{\cal C}.For the sake of presentation, throughout the theorem statements, we will assume that LL is O(1)O(1)-Lipschitz on the domain of legal predictions/labels to guarantee bounded error; our results are proved more generally. Specifically, we show that a simple variant of stochastic gradient descent due to learns such linear families efficiently.

Note that the metric sample complexity depends logarithmically on C\left|{\cal C}\right|. Thus, information-theoretically, we can hope to enforce metric mutlifairness with a class C{\cal C} that grows exponentially and still be efficient. While the running time of each iteration depends on C\left|{\cal C}\right|, note that the number of iterations is independent of C\left|{\cal C}\right|. In Section 4, we show conditions on C\left|{\cal C}\right| under which we can speed up the running time of each iteration to depend logarithmically on C\left|{\cal C}\right|.

We give a description of the switching subgradient method in Algorithm 3. At a high level, at each iteration, the procedure checks to see if any constraint is significantly violated. If it finds a violation, it takes a (stochastic) step towards feasibility. Otherwise, it steps according a stochastic subgradient for the objective.

For convenience of analysis, we define the residual on the constraint defined by SS as follows.

Note that RS(w)R_{S}(w) is convex in the predctions fw(x)f_{w}(x) and thus, for linear families is convex in ww. We describe the algorithm assuming access to the following estimators, which we can implement efficiently (in terms of time and samples). First, we assume we can estimate the residual R^S(w)\hat{R}_{S}(w) on each SCS\in{\cal C} with tolerance τ\tau such that for all wFw\in{\cal F}, RS(w)R^S(w)τ\left|R_{S}(w)-\hat{R}_{S}(w)\right|\leq\tau. Next, we assume access to a stochastic subgradient oracle for the constraints and the objective. For a function ϕ(w)\phi(w), let ϕ(w)\partial\phi(w) denote the set of subgradients of ϕ\phi at ww. We abuse notation, and let ϕ(w)\nabla\phi(w) refer to a vector-valued random variable where \operatorname*{\textnormal{{E}}}[\nabla\phi(w)\big{|}w]\in\partial\phi(w). We assume access to stochastic subgradients for RS(w)\partial R_{S}(w) for all SCS\in{\cal C} and L(w)\partial L(w). We include a full analysis of the algorithm and proof of Theorem 2 in the Appendix.

One specific application of this result is as a way post-process learned predictions to ensure fairness. In particular, suppose we are given the predictions from some pre-trained model for NN individuals, but are concerned that these predictions may not be fair. We can use Algorithm 3 to post-process these labels to select near-optimal metric multifair predictions. Note these predictions will be optimal with respect to the unconstrained family of predictions – not just predictions that come from a specific hypothesis class (like linear functions).

Specifically, in this setting we can represent an unconstrained set of predictions as a linear hypothesis in NN dimensions: take B=1B=1, and let the feature vector for xiXx_{i}\in{\cal X} be the iith standard basis vector. Then, we can think of the input labels to Algorithm 3 to be the output of any predictor that was trained separately.Nothing in our analysis required labels y{1,1}y\in\left\{-1,1\right\}; we can instead take the labels yy\in. For instance, if we have learned a highly-accurate model, but are unsure of its fairness, we can instantiate our framework with, say, the squared loss between the original predictions and the returned predictions; then, we can view the program as a procedure to project the highly-accurate predictions onto the set of metric multifair predictions. Importantly, our procedure only needs a small set of samples from the metric and not the original data used to train the model.

Post-processing prediction models for fairness has been studied in a few contexts . This post-processing setting should be contrasted to these settings. In our setting, the predictions are not required to generalize out of sample (in terms of loss or fairness). On the one hand, this means the metric multifairness guarantee does not generalize outside the NN individuals; on the other hand, because the predictions need not come from a bounded hypothesis class, their utility can only improve compared to learning a metric multifair hypothesis directly.

In addition to preserving the utility of previously-trained classifiers, separating the tasks of training for utility and enforcing fairness may be desirable when intentional malicious discrimination may be anticipated. For instance, when addressing the forms of racial profiling that can occur through targeted advertising (as described in ), we may not expect self-interested advertisers to adhere to classification under strict fairness constraints, but it stands to reason that prominent advertising platforms might want to prevent such blatant abuses of their platform. In this setting, the platform could impose metric multifairness after the advertisers specify their ideal policy.

Reducing search to agnostic learning

As presented above, the switching subgradient descent method converges to a nearly-optimal point in a bounded number of subgradient steps, independent of C\left|{\cal C}\right|. The catch is that at the beginning of each iteration, we need to search over C{\cal C} to determine if there is a significantly violated multifairness constraint. As we generally want to take C{\cal C} to be a rich class of comparisons, in many cases C\left|{\cal C}\right| will be prohibitive. As such, we would hope to find violated constraints in sublinear time, preferably even poly-logarithmic in C\left|C\right|. Indeed, we show that if a concept class C{\cal C} admits an efficient agnostic learner, then we can solve the violated constraint search problem over the corresponding collection of comparisons efficiently.

Agnostic learning can be phrased as a problem of detecting correlations. Suppose g,h:Ug,h:{\cal U}\to, and let D{\cal D} be some distribution supported on U{\cal U}. We denote the correlation between gg and hh on D{\cal D} as g,h=EiD[g(i)h(i)]\langle g,h\rangle=\operatorname*{\textnormal{{E}}}_{i\sim{\cal D}}[g(i)\cdot h(i)]. We let CU{\cal C}\subseteq^{\cal U} denote the concept class and HU{\cal H}\subseteq^{\cal U} denote the hypothesis class. The task of agnostic learning can be stated as follows: given sample access over some distribution (i,g(i))D×(i,g(i))\sim{\cal D}\times to some function gNg\in^{N}, find some hypothesis hHh\in{\cal H} that is comparably correlated with gg as the best cCc\in{\cal C}. That is, given access to gg, an agnostic learner with accuracy ε\varepsilon for concept class C{\cal C} returns some hh from the hypothesis class H{\cal H} such that

An agnostic learner is typically considered efficient if it runs in polynomial time in log(C)\log(\left|{\cal C}\right|) (or an appropriate notion of dimension of C{\cal C}), 1/ε1/\varepsilon, and log(1/d)\log(1/d). Additionally, distribution-specific learners and learners with query access to the function have been studied . In particular, membership queries tend to make agnostic learning easier. Our reduction does not use any metric samples other than those that the agnostic learner requests. Thus, if we are able to query a panel of experts according to the learner, rather than randomly, then an agnostic learner that uses queries could be used to speed up our learning procedure.

When we solve the convex program with switching subgradient descent, at the beginning of each iteration, we check if there is any SCS\in{\cal C} such that the residual quantity RS(w)R_{S}(w) is greater than some threshold. If we find such an SS, we step according to the subgradient of RS(w)R_{S}(w). In fact, the proof of the convergence of switching subgradient descent reveals that as long as when there is some SCS\in{\cal C} where RS(w)R_{S}(w) is in violation, we can find some RS(w)>ρR_{S^{\prime}}(w)>\rho for some constant ρ\rho, where SHX×XS^{\prime}\in{\cal H}\subseteq^{{\cal X}\times{\cal X}}, then we can argue that the learned hypothesis will be (C,d,τ)({\cal C},d,\tau)-metric multifair and achieve utility commensurate with the best (H,d,0)({\cal H},d,0)-metric multifair hypothesis.

We show a general reduction from the problem of searching for a violated comparison SCS\in{\cal C} to the problem of agnostic learning over the corresponding family of boolean functions. In particular, recall that for a collection of comparisons C2X×X{\cal C}\subseteq 2^{{\cal X}\times{\cal X}}, we can also think of C{\cal C} as a family of boolean concepts, where for each SCS\in{\cal C}, there is an associated boolean function cS:X×X{1,1}c_{S}:{\cal X}\times{\cal X}\to\left\{-1,1\right\} where cS(xi,xj)=1c_{S}(x_{i},x_{j})=1 if and only if (xi,xj)S(x_{i},x_{j})\in S. We frame this search problem as an agnostic learning problem, where we design a set of “labels” for each pair (x,x)X×X(x,x^{\prime})\in{\cal X}\times{\cal X} such that any function that is highly correlated with these labels encodes a way to update the parameters towards feasibility.

Recall the search problem: given a current hypothesis fwf_{w}, is there some SCS\in{\cal C} such that RS(w)=E(x,x)S[fw(xi)fw(xj)d(xi,xj)]>τR_{S}(w)=\operatorname*{\textnormal{{E}}}_{(x,x^{\prime})\sim S}[\left|f_{w}(x_{i})-f_{w}(x_{j})\right|-d(x_{i},x_{j})]>\tau? Consider the labeling each pair (xi,xj)(x_{i},x_{j}) with v(xi,xj)=fw(xi)fw(xj)d(xi,xj)v(x_{i},x_{j})=\left|f_{w}(x_{i})-f_{w}(x_{j})\right|-d(x_{i},x_{j}). Let ρ=RX×X(w)\rho=R_{{\cal X}\times{\cal X}}(w); note that we can treat ρ\rho as a constant for all SCS\in{\cal C}. Further, suppose SCS\in{\cal C} is such that RS(w)>τR_{S}(w)>\tau, or equivalently, E(xi,xj)S[v(xi,xj)]>τ\operatorname*{\textnormal{{E}}}_{(x_{i},x_{j})\sim S}[v(x_{i},x_{j})]>\tau. Then, by the assumption that C{\cal C} is γ\gamma-large, the correlation between the corresponding boolean function cSc_{S} and labels vv can be lower bounded as cS,v>2γτρ\langle c_{S},v\rangle>2\gamma\tau-\rho. Suppose we have an agnostic learner that returns a hypothesis hh with ε<γτ\varepsilon<\gamma\tau accuracy. Then, we know that h,vγτρ\langle h,v\rangle\geq\gamma\tau-\rho by the lower bound on the optimal cSCc_{S}\in{\cal C}. Then, consider the function Rh(w)R_{h}(w) defined as follows.

Thus, given that there exists some SCS\in{\cal C} where RS(w)>τR_{S}(w)>\tau, we can find some real-valued comparison Sh(x,x)=h(x,x)+12S_{h}(x,x^{\prime})=\frac{h(x,x^{\prime})+1}{2}, such that RSh(w)=Ω(h,v+ρ)Ω(γτ)R_{S_{h}}(w)=\Omega(\langle h,v\rangle+\rho)\geq\Omega(\gamma\tau). ∎

Discovering a violation of at least Ω(γτ)\Omega(\gamma\tau) guarantees Ω(γ2τ2)\Omega(\gamma^{2}\tau^{2}) progress in the duality gap at each step, so the theorem follows from the analysis of Algorithm 3.

Hardness of learning metric multifair hypotheses

In this section, we show that our algorithmic results cannot be improved significantly. In particular, we focus on the post-processing setting of Section 3.1. We show that the metric sample complexity is tight up to a Ω(loglog(C))\Omega(\log\log(\left|{\cal C}\right|)) factor unconditionally. We also show that some learnability assumption on C{\cal C} is necessary in order to achieve a high-utility (C,d,τ)({\cal C},d,\tau)-metric multifair predictions efficiently. In particular, we give a reduction from inverting a boolean concept cCc\in{\cal C} to learning a hypothesis ff that is metric multifair on a collection H{\cal H} derived from C{\cal C}, where the metric samples from dd encode information about the concept cc. Recall, that for any H{\cal H} and dd, we can always output a trivial (H,d,0)({\cal H},d,0)-metric multifair hypothesis by outputting a constant hypothesis. This leads to a subtlety in our reductions, where we need to leverage the learner’s ability to simultaneously satisfy the metric multifairness constraints and achieve high utility.

Both lower bounds follow the same general construction. Suppose we have some boolean concept class C{1,1}X0{\cal C}\subseteq\left\{-1,1\right\}^{{\cal X}_{0}} for some universe X0{\cal X}_{0}. We will construct a new universe X=X0X1{\cal X}={\cal X}_{0}\cup{\cal X}_{1} and define a collection of “bipartite” comparisons over subsets of X0×X1{{\cal X}_{0}\times{\cal X}_{1}}. Then, given samples from (x0,c(x0))(x_{0},c(x_{0})), we define corresponding metric values where d(x0,x1)d(x_{0},x_{1}) is some function of c(x0)c(x_{0}) for all x1X1x_{1}\in{\cal X}_{1}. Finally, we need to additionally encode the objective of inverting cc into labels for x0X0x_{0}\in{\cal X}_{0}, such that to obtain good loss, the post-processor must invert cc on X0{\cal X}_{0}. We give a full description of the reduction in the Appendix.

While we argued earlier that some relaxation of metric fairness is necessary if we want to learn from a small set of metric samples, it is not clear that multifairness with respect to C{\cal C} is the strongest relaxation we can obtain. In particular, we might hope to guarantee fairness on all large comparisons, rather than just a finite class C{\cal C}. The following theorem shows that such a hope is misplaced: in order for an algorithm to guarantee that the Lipschitz condition holds in expectation over a finite collection of large comparisons C{\cal C}, then either the algorithm takes Ω(logC)\Omega(\log\left|{\cal C}\right|) random metric samples, or the algorithm outputs a set of nearly useless predictions. For concreteness, we state the theorem in the post-processing setting of Section 3.1; the construction can be made to work in the learning setting as well.

Let γ,τ>0\gamma,\tau>0 be constants and suppose A{\cal A} is an algorithm that has random sample access to dd and outputs a (C,d,τ)({\cal C},d,\tau)-metric multifair set of predictions for γ\gamma-large C{\cal C}. Then, A{\cal A} takes Ω(logC)\Omega(\log\left|{\cal C}\right|) random samples from dd or outputs a set of predictions with loss that approaches the loss achievable with no metric queries.

The construction uses a reduction from the problem of learning a linear function; we then appeal to a lower bound from linear algebra on the number of random queries needed to span a basis.

Hardness from pseudorandom functions.

Our reduction implies that a post-processing algorithm for (C,d,τ)({\cal C},d,\tau)-metric multifairness with respect to an arbitrary metric dd gives us a way of distinguishing functions in C{\cal C} from random.

Assuming one-way functions exist, there is no efficient algorithm for computing (C,τ)({\cal C},\tau)-optimal (C,d,τ)({\cal C},d,\tau)-metric multifair predictions for general C,d,{\cal C},d, and constant τ\tau.

Essentially, without assumptions that C{\cal C} is a learnable class of boolean functions, some nontrivial running time dependence on C\left|{\cal C}\right| is necessary. The connection between learning and pseudorandom functions is well-established; under stronger cryptographic assumptions as in , the reduction implies that a running time of Ω(Cα)\Omega(\left|{\cal C}\right|^{\alpha}) is necessary for some constant α>0\alpha>0.

Related works and discussion

Many exciting recent works have investigated fairness in machine learning. In particular, there is much debate on the very definitions of what it means for a classifier to be fair . Beyond the work of Dwork et al. , our work bears most similarity to two recent works of Hébert-Johnson et al. and Kearns et al. . As in this work, both of these papers investigate notions of fairness that aim to strengthen the guarantees of statistical notions, while maintaining their practicality. These works also both draw connections between achieving notions of fairness and efficient agnostic learning. In general, agnostic learning is considered a notoriously hard computational problem ; that said, in the context of fairness in machine learning, show that using heuristic methods to agnostically learn linear hypotheses seems to work well in practice.

Metric multifairness does not directly generalize either or , but we argue that it provides a more flexible alternative to these approaches for subpopulation fairness. In particular, these works aim to achieve specific notions of fairness – either calibration or equalized error rates – across a rich class of subpopulations. As has been well-documented , calibration and equalized error rates, in general, cannot be simultaneously satisfied. Often, researchers frame this incompatibility as a choice: either you satisfy calibration or you satisfy equalized error rates; nevertheless, there are many applications where some interpolation between accuracy (à la calibration) and corrective treatment (à la equalized error rates) seems appropriate.

Metric-based fairness offers a way to balance these conflicting fairness desiderata. In particular, one could design a similarity metric that preserves accuracy in predictions and separately a metric that performs corrective treatment, and then enforce metric multifairness on an appropriate combination of the metrics. For instance, returning to the loan repayment example, an ideal metric might be a combination of credit scores (which tend to be calibrated) and a metric that aims to increase the loans given to historically underrepresented populations (by, say, requiring the top percentiles of each subpopulation be treated similarly). Different combinations of the two metrics would place different weights on the degree of calibration and corrective discrimination in the resulting predictor. Of course, one could equally apply this metric in the framework of , but the big advantage with metric multifairness is that we only need a small sample from the metric to provide a relaxed, but still strong guarantee of fairness.

We are optimistic that metric multifairness will provide an avenue towards implementing metric-based fairness notions. At present, the results are theoretical, but we hope this work can open the door to empirical studies across diverse domains, especially since one of the strengths of the framework is its generality. We view testing the empirical performance of metric multifairness with various choices of metric dd and collection C{\cal C} as an exciting direction for future research.

Finally, two recent theoretical works also investigate extensions to the fairness through awareness framework of . Gillen et al. study metric-based individually fair online decision-making in the presence of an unknown fairness metric. In their setting, every day, a decision maker must choose between candidates available on that day; the goal is to have the decision maker’s choices appear metric fair on each day (but not across days). Their work makes a strong learnability assumption about the underlying metric; in particular, they assume that the unknown metric is a Mahalanobis metric, whereas our focus is on fair classification when the metric is unknown and unrestricted. Rothblum and Yona study fair machine learning under a different relaxation of metric fairness, which they call approximate metric fairness. They assume that the metric is fully specified and known to the learning algorithm, whereas our focus is on addressing the challenge of an unknown metric. Their notion of approximate metric fairness aims to protect all (large enough) groups, and thus, is more strict than metric multifairness.

The authors thank Cynthia Dwork, Roy Frostig, Fereshte Khani, Vatsal Sharan, Paris Siminelakis, and Gregory Valiant for helpful conversations and feedback on earlier drafts of this work. We thank the anonymous reviewers for their careful reading and suggestions on how to improve the clarity of the presentation.

References

Appendix A Analysis of Algorithm 3

Here, we give a high-level overview of the analysis of Algorithm 3. We defer some technical lemmas to Appendix A.3. We refer to KfK_{f} as the set of “feasible iterations” where we step according to the objective; that is,

Fairness analysis

We begin by showing that the hypothesis wˉ\bar{w} that Algorithm 3 returns satisfies metric multifairness.

Suppose for all SCS\in{\cal C}, the residual oracle R^S\hat{R}_{S} has tolerance τ/5\tau/5. Then, wˉ\bar{w} is (C,d,τ)({\cal C},d,\tau)-metric multifair.

We choose our final hypothesis wˉ\bar{w} to be the weighted average of the feasible iterates. Note that the update rules for KfK_{f} and WW imply that wˉ\bar{w} is a convex combination of hypotheses where no constraint appears significantly violated, wˉ=1KfkKfwk\bar{w}=\frac{1}{\left|K_{f}\right|}\cdot\sum_{k\in K_{f}}w_{k}. By convexity of RSR_{S} we have the following inequality for all SCS\in{\cal C}.

Further, for all SCS\in{\cal C} and all k[T]k\in[T], by the assumed tolerance of RSR_{S}, we know that

Given that for all kKfk\in K_{f}, R^Sk(wk)4τ/5\hat{R}_{S_{k}}(w_{k})\leq 4\tau/5, then applying the triangle inequality, we conclude that for each comparison SCS\in{\cal C},

Hence, wˉ\bar{w} is (C,d,τ)({\cal C},d,\tau)-metric multifair. ∎

Utility and runtime analysis

We analyze the utility of Algorithm 3 using a duality argument. For notational convenience, denote L(w)=ExiD[L(fw(xi),yi)]L(w)=\operatorname*{\textnormal{{E}}}_{x_{i}\sim{\cal D}}[L(f_{w}(x_{i}),y_{i})]. In addition to the assumptions in the main body, throughout, we assume the following bounds on the subgradients for all wFw\in{\cal F}.

Let τ,δ>0\tau,\delta>0 and F=[B,B]n{\cal F}=[-B,B]^{n}. After running Algorithm 3 for T>302M2B2nlog(n/δ)τ2T>\frac{30^{2}M^{2}B^{2}n\log(n/\delta)}{\tau^{2}} iterations, then with probability at least 18δ1-8\delta (over the stochastic subgradients)

We give the full proof of Lemma 7 in Appendix A.3.

A.1 Answering residual queries

Next, we describe how to answer residual queries RS(w)R_{S}(w) efficiently, in terms of time and samples.

For τ,γ>0\tau,\gamma>0, for a γ\gamma-large collection of comparisons C2X×X{\cal C}\subseteq 2^{{\cal X}\times{\cal X}}, with probability 1δ1-\delta, given access to nn metric samples, every residual query RS(w)R_{S}(w) can be answered correctly with tolerance τ\tau provided

Proposition 9 shows that ES[d(x,x)]\operatorname*{\textnormal{{E}}}_{S}[d(x,x^{\prime})] can be estimated for all SCS\in{\cal C} from a small number of metric samples. The proof follows a standard Chernoff plus union bound argument. For completeness, we give a full proof next. Thus, Lemma 8 follows by showing that at each iteration ES[fw(x)fw(x)]\operatorname*{\textnormal{{E}}}_{S}[\left|f_{w}(x)-f_{w}(x^{\prime})\right|] can be estimated from a small number of evaluations of the current hypothesis fwf_{w}.

We can estimate the expected value of the deviation on ff over SCS\in{\cal C} with a small set of unlabeled samples from X×X{\cal X}\times{\cal X}; we will evaluate the hypothesis ff for each of these samples. Using an identical argument as in the case of the expected metric value, we can prove the following bound on how many comparisons we need to make, which shows the lemma.

Suppose C{\cal C} is γ\gamma-large. Then with probability at least 1δ1-\delta, for all SCS\in{\cal C}, the empirical estimate for ES[f(x)f(x)]\operatorname*{\textnormal{{E}}}_{S}[\left|f(x)-f(x^{\prime})\right|] of nn samples (x,x)M(x,x^{\prime})\sim{\cal M} deviates from the true expected value by at most τ\tau provided

Here, we show that a small number of samples from the metric suffices to estimate the expected metric distance over all SCS\in{\cal C}. Suppose C{\cal C} is γ\gamma-large. Then with probability at least 1δ1-\delta, for all SCS\in{\cal C}, the empirical estimates for ES[d(xi,xj)]\operatorname*{\textnormal{{E}}}_{S}[d(x_{i},x_{j})] of nn metric samples deviate from their true expected value by at most τ\tau provided

Let (xi,xj,Δij)(x_{i},x_{j},\Delta_{ij}) represent a random metric sample. Suppose for each SCS\in{\cal C}, we obtain mm such samples where (xi,xj)S(x_{i},x_{j})\sim S, and let d(S)=ijXSΔijd(S)=\sum_{ij\in X_{S}}\Delta_{ij} be the empirical average over the sample. Then, by Hoeffding’s inequality, we know

If mΩ(log(C/δ)τ2)m\geq\Omega\left(\frac{\log(\left|C\right|/\delta)}{\tau^{2}}\right), then the probability that the estimate d(S)d(S) is not within τ\tau of its true value is less than δC\frac{\delta}{\left|C\right|}. Union bounding over C{\cal C}, the probability that every estimate has tolerance τ\tau will be at least 1δ1-\delta.

Because C{\cal C} is γ\gamma-large, for every SCS\in{\cal C}, the probability a random metric sample (xi,xj)M(x_{i},x_{j})\sim{\cal M} is in SS is at least γ\gamma. If we take log(m)γ\frac{\log(m)}{\gamma} samples, then with probability at least 11/m1-1/m, one of the samples will be in SS. Thus, to guarantee d(S)d(S) has tolerance τ\tau for all SCS\in{\cal C} with probability 1δ1-\delta,

A.2 Answering subgradient queries

Next, we argue that the subgradient oracles can be implemented efficiently without accessing any metric samples. First, suppose we want to take a step according to RS(w)R_{S}(w); while RS(w)R_{S}(w) is not differentiable, we can compute a legal subgradient defined by partial subderivatives given as follows.

The subgradient does not depend on dd, so no samples from the metric are necessary. Further, Algorithm 3 only assumes access to stochastic subgradient oracle with bounded entries. If we sample a single (xi,xj)M(x_{i},x_{j})\sim{\cal M}, then sgn(w,xixj)(xilxjl)\textnormal{{sgn}}(\langle w,x_{i}-x_{j}\rangle)\cdot(x_{il}-x_{jl}) will be an unbiased estimate of a subgradient of RS(w)R_{S}(w); we claim, the entries will also be bounded. In particular, assuming xi11\left\|x_{i}\right\|_{1}\leq 1 implies each partial is bounded by 22, so that we can take M2=4nM^{2}=4n.

A.3 Utility analysis of Algorithm 3

In this appendix, we give a full proof of Lemma 7. We defer the proof of certain technical lemmas to Appendix A.4 for the sake of presentation.

with probability at least 18δ1-8\delta over the randomness of the algorithm.

As before, we refer to Kf[T]K_{f}\subseteq[T] as the set of feasible iterations, where we step according to the objective, and [T]Kf[T]\setminus K_{f} as the set of infeasible iterations, where we step according to the violated constraints. Recall, we denote the set of subgradients of a function LL (or RR) at ww by L(w)\partial L(w) and denote by L(w)\nabla L(w) a stochastic subgradient, where \operatorname*{\textnormal{{E}}}[\nabla L(w)\big{|}w]\in\partial L(w).

When we do not step according to the objective, we step according to the subgradient of some violated comparison constraint. In fact, we show that stepping according to any convex combination of such subgradients suffices to guarantee progress in the duality gap. In the case wher t∉Kft\not\in K_{f}, we assume that we can find some convex combination SCαk,SR^S(wk)>4τ/5\sum_{S\in{\cal C}}\alpha_{k,S}\hat{R}_{S}(w_{k})>4\tau/5 where for all SCS\in{\cal C}, αk,SΔC1\alpha_{k,S}\in\Delta_{\left|{\cal C}\right|-1}. We show that if we step according to the corresponding combination of the subgradients of RS(wk)R_{S}(w_{k}), we can bound the duality gap. Specifically, for k∉Kfk\not\in K_{f}, let the algorithm’s step be given by

where for each SCS\in{\cal C}, we have \operatorname*{\textnormal{{E}}}\left[\nabla R_{S}(w_{k})\big{|}w_{k}\right]\in\partial R_{S}(w_{k}). Let ηL=τGM\eta_{L}=\frac{\tau}{GM} and ηR=τM2\eta_{R}=\frac{\tau}{M^{2}} denote the step size for the objective and residual steps, respectively. Then, consider the following choice of dual multipliers for each SCS\in{\cal C}.

Expanding the definition of wˉ\bar{w} and applying convexity, we can bound the duality gap as follows

where (16) follows from expanding wˉ\bar{w} then applying convexity of LL and the definition of d(λˉ)d(\bar{\lambda}) and (18) follows by our choice of λˉS\bar{\lambda}_{S} for each SCS\in{\cal C}.

With the duality gap expanded into one sum over the feasible iterates and one sum over the infeasible iterates, we can analyze these iterates separately. The following lemmas show how to track the contribution of each term to the duality gap in terms of a potential function uku_{k} defined as

For notational convenience, for each kKfk\in K_{f}, let e(w_{k})=\operatorname*{\textnormal{{E}}}[\nabla L(w_{k})\big{|}w_{k}]-\nabla L(w_{k}) be the noise in the subgradient computation.

For all wFw\in{\cal F} and for all kKfk\in K_{f},

Again, for notational convenience, for each k[T]Kfk\in[T]\setminus K_{f}, let e(w_{k})=\sum_{S\in{\cal C}}\alpha_{k,S}\left(\operatorname*{\textnormal{{E}}}[\nabla R_{S}(w_{k})\big{|}w_{k}]-\nabla R_{S}(w_{k})\right) be the noise in the subgradient computation.

For all wFw\in{\cal F} and for all k[T]Kfk\in[T]\setminus K_{f},

We defer the proofs of Lemmas 10 and 11 to Appendix A.4. Assuming Lemmas 10 and 11, we bound the duality gap as follows.

by rearranging. Noting that ()(**) can be bounded by 3G5Mτ\frac{3G}{5M}\tau, it remains to bound ()(*). We show that for a sufficiently large TT, then ()(*) cannot be positive.

Consider the terms in the supremum over wFw\in{\cal F}. Note that we can upper bound sup{u0(w)}2B2n\sup\left\{u_{0}(w)\right\}\leq 2B^{2}n. Additionally, we upper bound the error incurred due to the objective subgradient noise with the following lemma, which we prove in Appendix A.4.

With probability at least 14δ1-4\delta, the contribution of the noisy subgradient computation to the duality gap can be bounded as follows.

Assuming the lemma and that T>302M2B2nlog(n/δ)τ2T>\frac{30^{2}M^{2}B^{2}n\log(n/\delta)}{\tau^{2}}, then, we can bound ()(*) by splitting the negative term involving TT to balance both positive terms.

Thus, the sum of ()(*) and ()(**) is at most 3G5Mτ\frac{3G}{5M}\tau. ∎

A.4 Deferred proofs from analysis of Algorithm 3

We show that the update rule wk+1πF(wkηkgk)w_{k+1}\leftarrow\pi_{{\cal F}}(w_{k}-\eta_{k}g_{k}) implies the following inequality in terms of ηk,gk,uk(w)\eta_{k},g_{k},u_{k}(w), and uk+1(w)u_{k+1}(w).

Suppose wk+1=πF(wkηkgk)w_{k+1}=\pi_{{\cal F}}(w_{k}-\eta_{k}g_{k}). Then, for all wFw\in{\cal F},

where (29) follows by substituting the definition of wk+1w_{k+1} twice; and (30) follows from the fact that for any closed convex set F{\cal F} and w0∉Fw_{0}\not\in{\cal F},

Rearranging (28) implies the following inequality holds for all wFw\in{\cal F}.

We will use the following technical identity to prove the lemma.

Finally, we can show the inequality stated in the lemma.

where (34) follows from (32); (35) follows by using Proposition 14 to write the expression in terms of uku_{k}’s; and (36) follows by the gradient step wk+1wk=ηkgkw_{k+1}-w_{k}=\eta_{k}g_{k}. ∎

Proof of Lemma 10

Here, we bound the contribution to the duality gap of each of the feasible iterations kKfk\in K_{f} as follows.

Let eL(wk)=gL(wk)L(wk)e_{L}(w_{k})=g_{L}(w_{k})-\nabla L(w_{k}) where gL(wk)=E[L(wk)]L(wk)g_{L}(w_{k})=\operatorname*{\textnormal{{E}}}[\nabla L(w_{k})]\in\partial L(w_{k}).

where (39) follows by substituting gLg_{L}; (40) follows by expanding the inner product and applying Lemma 13 to the first term; (41) follows by our choice of ηL=τ/GM\eta_{L}=\tau/GM. ∎

Proof of Lemma 11

Here, we bound the contribution to the duality gap of each of the infeasible iterates k[T]Kfk\in[T]\setminus K_{f}. We assume R^Sk(wk)\hat{R}_{S_{k}}(w_{k}) has tolerance τ/5\tau/5. Then we show

Recall, we let e(w_{k})=\sum_{S\in{\cal C}}\alpha_{k,S}\left(\operatorname*{\textnormal{{E}}}[\nabla R_{S}(w_{k})\big{|}w_{k}]-\nabla R_{S}(w_{k})\right). For each SCS\in{\cal C}, for any gS(wk)RS(wk)g_{S}(w_{k})\in\partial R_{S}(w_{k}), we can rewrite RS(w)-R_{S}(w) as follows.

Multiplying by ηR\eta_{R} and taking the convex combination of SCS\in{\cal C} according to αk\alpha_{k}, we apply Lemma 13 to obtain the following inequality.

where (43) follows by substituting RS(wk)\nabla R_{S}(w_{k}) for each gS(wk)g_{S}(w_{k}) and the definition of eR(wk)e_{R}(w_{k}); (44) follows by expanding the inner product and applying Lemma 13; (45) follows by our choice of ηk=τ2/M2\eta_{k}=\tau^{2}/M^{2}; (46) follows by the fact that when we update according to a constraint, we know SCαk,SR^S(wk)4τ/5\sum_{S\in{\cal C}}\alpha_{k,S}\hat{R}_{S}(w_{k})\geq 4\tau/5 with tolerance τ/5\tau/5, so SCαk,SRS(wk)3τ/5\sum_{S\in{\cal C}}\alpha_{k,S}R_{S}(w_{k})\geq 3\tau/5. ∎

Proof of Lemma 12

Here, we show that with probability at least 14δ1-4\delta, the contribution of the noisy subgradient computation to the duality gap can be bounded as follows.

Let ε=ηLg=ηRm=τMn\varepsilon=\eta_{L}\cdot g=\eta_{R}\cdot m=\frac{\tau}{M\sqrt{n}}. Further, let ηk=ηL\eta_{k}=\eta_{L} for kKfk\in K_{f} and ηR\eta_{R} for k∉Kfk\not\in K_{f}. Then, we can rewrite the expression to bound using ηk\eta_{k} and expand as follows.

Taking this probability to be at most 2δ/n2\delta/n, we can upper bound ZZ by ε2Tlog(n/δ)=τM2Tnlog(n/δ)\varepsilon\sqrt{2T\log(n/\delta)}=\frac{\tau}{M}\sqrt{2Tn\log(n/\delta)}. Then, noting that (w)l<B\left|(w)_{l}\right|<B for any wFw\in{\cal F}, we can take a union bound to conclude with probability at least 12δ1-2\delta the following inequalities hold.

Further, we note that the first summation concentrates at least as quickly as the second, so by union bounding again,

Appendix B Hardness for metric multifair predictions

The lower bounds follow the same general construction. Suppose we have some boolean concept class C{1,1}X{\cal C}\subseteq\left\{-1,1\right\}^{{\cal X}} for some universe X{\cal X}. We will construct a new universe X01=X0X1{\cal X}_{01}={\cal X}_{0}\cup{\cal X}_{1} where X0X{\cal X}_{0}\subseteq{\cal X} and define a collection of “bipartite” comparisons over subsets of X0×X1{{\cal X}_{0}\times{\cal X}_{1}}. We will assume access to random samples (x0,y(x0))(x_{0},y(x_{0})) for some function y:X{1,1}y:{\cal X}\to\left\{-1,1\right\}; we define corresponding metric values where d(x0,x1)d(x_{0},x_{1}) is some function of y(x0)y(x_{0}) for all x1X1x_{1}\in{\cal X}_{1}. Finally, we need to label X01{\cal X}_{01} such that such that to obtain good loss, the post-processing algorithm must learn something non-trivial about yy (which will differ across the two lower bounds we prove).

Consider X01=X0X1{\cal X}_{01}={\cal X}_{0}\cup{\cal X}_{1} with ideal labels given as (x0,0)(x_{0},0) for x0X0x_{0}\in{\cal X}_{0} and (x1,1)(x_{1},1) for x1X1x_{1}\in{\cal X}_{1}, and let LL be the hinge loss. We encode the original boolean concept in the distance metric, where for x0X0x_{0}\in{\cal X}_{0},

Then, consider the collection of comparisons given by H={Sc:cC}{\cal H}=\left\{S_{c}:c\in{\cal C}\right\} where we take Sc={(x0,x1)X0×X1:c(x0)=1}S_{c}=\left\{(x_{0},x_{1})\in{\cal X}_{0}\times{\cal X}_{1}:c(x_{0})=1\right\}. We take X1=2X0\left|{\cal X}_{1}\right|=2\left|{\cal X}_{0}\right|, large enough that the average prediction for x1X1x_{1}\in{\cal X}_{1} is at least 1τ1-\tau in the optimal utility set of multifair predictions. In particular, any average deviation by more than τ\tau in X1{\cal X}_{1} would result in larger loss than setting all of f(x0)=f(x1)=1τf(x_{0})=f(x_{1})=1-\tau for x1X1x_{1}\in{\cal X}_{1} and all x0X0x_{0}\in{\cal X}_{0} where y(x0)=1y(x_{0})=1.

Let γ,τ>0\gamma,\tau>0 be constants and suppose A{\cal A} is an algorithm that has random sample access to dd and outputs a (C,d,τ)({\cal C},d,\tau)-metric multifair set of predictions for γ\gamma-large C{\cal C}. Then, A{\cal A} takes Ω(logC)\Omega(\log\left|{\cal C}\right|) random samples from dd or outputs a set of predictions with loss that approaches the loss achievable with no metric queries.

Further note, if we only take nkn-k queries, the incurred loss approaches the trivial loss exponentially in kk. Appealing to lower bounds on the number of random queries needed to span a basis, the theorem follows.

Outline of hardness from pseudorandom functions

We use the construction to demonstrate that under weak complexity assumptions, there are algorithmic barriers to generally efficient algorithms for metric multifairness. In particular, we will assume that C{\cal C} defines a pseudorandom function family. The existence of one-way functions implies the existence of pseudorandom functions , so the proposition follows.

Assuming one-way functions exist, any algorithm for computing (H,τ)({\cal H},\tau)-optimal (H,d,τ)({\cal H},d,\tau)-metric multifair predictions for any H,d,{\cal H},d, and τ>0\tau>0 requires time (logH)ω(1)(\log\left|{\cal H}\right|)^{\omega(1)}.

Suppose we can post-process predictions to achieve (C,τ)({\cal C},\tau)-optimal (C,d,τ)({\cal C},d,\tau)-metric multifair predictions for any C{\cal C} and dd and some small constant τ>0\tau>0. Let C{\cal C} define a pseudorandom function family. We will show that we can distinguish between a function y=cRCy=c\leftarrow_{R}{\cal C} drawn uniformly at random from the function family and a truly random function y:X{1,1}y:{\cal X}\to\left\{-1,1\right\}.

We’re given some samples of the form (x,y(x))D×{1,1}(x,y(x))\sim{\cal D}\times\left\{-1,1\right\}. Returning to the proposed construction, in the case yy is a truly random function, then with high probability, ES[d(x0,x1)]1o(1)\operatorname*{\textnormal{{E}}}_{S}[d(x_{0},x_{1})]\geq 1-o(1) for all SHS\in{\cal H}. Thus, any τ\tau-optimal set of predictions will achieve loss O(τ)O(\tau).

In the case, where y=cy=c for some cCc\in{\cal C}, note that labeling x0X0x_{0}\in{\cal X}_{0} according to c(x0)c(x_{0}) and all x1X1x_{1}\in{\cal X}_{1} as 11 is a feasible point that obtains loss

This feasible quantity upper bounds the optimal loss. Then, we can express the expectation of difference in the predictions across the set defined by cc as follows.

We assume the set of predictions ff is (H,d,τ)({\cal H},d,\tau)-metric multifair. Thus, because ScHS_{c}\in{\cal H}, we can upper bound this term by τ\tau. In total then, Ex0:c(x0)=1[f(x0)f(x1)]τ\operatorname*{\textnormal{{E}}}_{x_{0}:c(x_{0})=1}[\left|f(x_{0})-f(x_{1})\right|]\leq\tau, and by the fact that Ex1X1[f(x1)]1τ\operatorname*{\textnormal{{E}}}_{x_{1}\sim{\cal X}_{1}}[f(x_{1})]\geq 1-\tau, then Ex0:c(x0)=1[f(x0)]12τ\operatorname*{\textnormal{{E}}}_{x_{0}:c(x_{0})=1}[f(x_{0})]\geq 1-2\tau. Further, consider the following lower bound on the loss on ff.

If C{\cal C} is a pseudorandom function family the Prx0X0[c(x0)=1]1/2\Pr_{x_{0}\sim{\cal X}_{0}}[c(x_{0})=1]\approx 1/2 must be bounded away from . Thus, in the case where yCy\in{\cal C}, we have a non-trivial lower bound on the achievable loss under (C,d,τ)({\cal C},d,\tau)-metric multifairness. Thus, we can distinguish when yCy\in{\cal C} and when yy is truly random. ∎