The Problem with AI Discrimination
BLOG: Heidelberg Laureate Forum

In 2018, MIT computer scientist Joy Buolamwini discovered something disturbing: Leading facial recognition systems often failed to identify darker-skinned women, with error rates as high as 34%, while barely missing lighter-skinned men (~0.8%). It was relatively early days in the AI race, but it was a concerning find nonetheless. Fast forward to today, and AI models have been repeatedly shown to discriminate based on gender and race in everything from job selection to healthcare.
The advent of artificial intelligence (AI) has taken the world by storm. It seems to have captured our entire psyche, spilling into every aspect of life we can imagine. Yet, despite all the progress it brings (and some exaggerated hype), AI also brings several challenges. For one, it consumes a lot of water and energy (as much as a sizeable country); secondly, there is a lack of transparency and ethics around the technology; and last but not least, it is often marked by bias and discrimination.
For decades, many hoped that by replacing human discretion with the impartial logic of computers, society could mitigate or maybe even eliminate the unfair discrimination that has long plagued our society. This optimism, however, is challenged by reality. AI systems are not inherently neutral; quite the opposite. They can easily become powerful instruments for perpetuating and even amplifying existing societal, institutional, and human biases.
The Good, the Bad, and the Data
The common maxim “garbage in, garbage out” sounds rough, but it is a useful starting point. The most frequently cited source of algorithmic bias resides in the data used to train machine learning models. If the data is a flawed or skewed representation of reality, the model will learn and systematize those flaws.
This manifests itself in several ways.
Historical bias arises when AI models are trained on data reflecting past prejudices and societal norms that may no longer be considered acceptable or accurate. If you feed an AI data on how a particular race or gender or religion is superior, it will incorporate that idea and perpetuate the bias. Because datasets are often historical collections influenced by society itself, discrimination is effectively encoded before the data is even collected. A hypothetical AI system for loan approvals trained on historical income data would look at past data and see that more men have taken loans in the past and repaid them, and feel more inclined to believe that men are more reliable. Similarly, a woman programmer would be more likely to get rejected for a job application because fewer women have historically held such positions.
Other insidious problems, like proxy bias (where an algorithm uses seemingly neutral variables as proxies for attributes like race, gender, or socioeconomic status), can also cause problems, as highlighted by a 2019 study. In the study, systemic inequalities in healthcare access flagged Black patients as having lower health needs, systematically disadvantaging them. But AI does not just repeat the past; it can magnify it. Models can latch onto group identity as a “shortcut” in making predictions, strengthening stereotypes beyond what the data shows. Once deployed, their decisions can create feedback loops.
Eliminating all of these biases from the data is difficult enough. But “garbage in, garbage out“ doesn’t capture the entire scope of the problem. Bias is not solely a product of flawed data; it can also be introduced and amplified by the design and inner workings of the AI model itself.
Beyond the Data
Even if you somehow feed an AI model the cleanest, most representative data possible, bias can still creep in. This is because algorithms are not neutral vessels. The choices made during their design, the mathematical structures they use, and the way they are tuned can all introduce their own distortions. These are the biases that emerge beyond the data.
One of the most common sources is algorithmic design bias. Every AI system begins with a series of human decisions: what features to include, how to weight them, and how to define “success.” These choices can embed the values and assumptions of the development team, often without their conscious awareness. For example, early speech recognition systems consistently underperformed for women’s voices but not because of the data or because the developers intended it. Rather, it was because the models were trained and optimized mostly on recordings of male voices.
Another challenge is the “black box” problem, or understanding how AI makes decisions. Modern AI, especially deep learning, operates with millions or billions of parameters arranged in ways that defy straightforward interpretation. Developers can tell you that the system produces an answer, but not exactly why. This opacity means the model can discover hidden correlations that no one planned for. Those correlations could sometimes be linked to race, gender, or socioeconomic status, and yet remain invisible to anyone reviewing the system.
A newer frontier of concern is architectural bias. This is a type of bias baked into the mathematical structure of a model, independent of the data it sees. Large language models (LLMs), for instance, use a design called a transformer architecture, which researchers have found can cause “position bias”: a tendency to give more weight to information at the beginning and end of a text, while neglecting the middle. Other studies have found “first-position bias” in AI hiring tools, where the model disproportionately selects the first résumé it’s shown, regardless of content. These are quirks of the machine’s inner workings that are very difficult to even out.
Then, the real-world interaction can also have an effect.

Once an AI system is deployed, its decisions don’t just sit in isolation. The system interacts with its users and/or its environment, which creates feedback loops that strengthen the original bias. Over time, these loops can make discrimination more entrenched, harder to spot, and far more damaging.
Another problem with AI data is measurement bias, a type of bias introduced when the data we collect differs from what we actually want to measure. In predictive policing, for instance, arrest counts are proxies for police activity, not crime itself. If patrols have historically concentrated in certain communities, algorithms trained on those arrests will flag those areas as high-crime, creating a feedback loop of biased enforcement. This dynamic is clearest in “Minority Report”-style prediction.
Suppose an algorithm is trained on historical arrest data. If those records reflect decades of over-policing in certain neighborhoods, the AI will “learn” that these areas are high-crime zones. When police follow the algorithm’s guidance and patrol those areas more heavily, they inevitably make more arrests, often for minor offenses that might go unnoticed elsewhere. Those new arrests feed back into the dataset, “proving” to the AI that its initial prediction was correct. The cycle repeats, each turn deepening the association between certain neighborhoods and criminality, regardless of the true crime rate.
Feedback loops are particularly insidious because they launder bias through the language of objectivity. When a human makes a prejudiced decision, we can call it out. When an algorithm does it, it brings “data” and “predictions” that make the result appear neutral, even scientific, even though it carries a hefty amount of bias.
What Can Be Done?
Eliminating AI bias is so difficult because it’s not just a matter of cleaning bad data, it’s a problem woven into every layer of the system. AI bias is a “wicked problem.” It doesn’t have a single fix or a switch that can make everything better. The bias is so deeply and systemically embedded you need a multiple-pronged approach to even have a chance.
Technical fixes, starting from cleaning and rebalancing of datasets even before training, are an important starting point. Modifying the learning algorithm to optimize for fairness and accuracy is also important. Surprising approaches like “vaccinating” an AI with bad data can also work, as highlighted by recent research.
Yet, ultimately, technical fixes alone will not be sufficient.
Fairness itself, particularly in regards to machine learning, has no universal definition. A model might be “fair” according to one metric, yet blatantly unfair by another. Imagine an algorithm used to decide who should get extra screening for a disease. One definition of fairness might say that, for any given risk score, people from different groups should have the same likelihood of actually having the disease. Another definition might say the system should make the same kinds of mistakes (false alarms and missed cases) at the same rate for every group. If the underlying rates of the disease differ between groups, mathematics makes it impossible to satisfy both definitions at once. Deciding which one to prioritize isn’t something the algorithm can do for us, it is a societal and moral issue.
As it so often happens, technology has evolved faster than our moral code. This is not at all particular to AI, but given how quickly this technology has advanced, it is a striking example.
Governance and oversight have also fallen behind. In the race for AI supremacy, ethics and bias are sometimes considered an afterthought. Inside companies, AI ethics boards and “human-in-the-loop” decision-making are often touted as safeguards. But these mechanisms work only if the humans involved are trained, empowered, and diverse enough to spot potential harms. Without genuine independence and enforcement power, ethics boards can become little more than PR exercises.
In the end, “debiasing” AI is bound to be a process more than a goal. It relies on constant scrutiny, correction, and open debate about the values we want our machines to reflect. Without that vigilance, AI will inherit the problems already present in our society and amplify them for the future.
If AI was a kid, you’d patch it with faith.
You have plenty of dogma in your head. If you find any correlations based on race, first thing you do is deny the obvious and activate additional resources to look for other explanations. You probably wouldn’t do as much thinking about some link between banana and clouds as between crime rates and skin color. That’s why you call some subjects „sensitive“ – because you handle them with more care and attention than others.
Which leads straight to the usual debates of child rising, like values, indoctrination, ethics, trusted knowledge. Learning and knowledge are opposites – either you are prepared to change your mind or you are prepared to search for reasons not to change it. If you already have knowledge, like 2+2=4, you don’t need to waste time on figuring it out each time, you just install it as a dogma, a fixed, unchangeable algorithm in your mind. You save a lot of time, energy and computing power by just uploading it from a library and including it in your calculations, without ever questioning it.
Intelligence is functional stupidity. Reflexes without thinking make us faster, thus making our thinking faster. The way we train AI has more to do with fantasy, imagination – finding patterns, reinforcing repetitions, neglecting rare occurrences. It rather resembles dreams or schizophrenia than a rational mind, formed by religious fanaticism, dogmatism, radicalization, degeneration, the collapse of doubt and many parallel worlds of coexisting possibilities into the strong belief in a single world, a machine ruled by commandments and principles cast in stone, with you as a part of it, a smaller machine with a fixed role, fixed programming, fixed interaction patterns.
The point of quick-witted is „quick“. Shortcuts. Knee-jerk reflexes. Simplification. Automation. Need to know. Reduction of mass. Lean, mean thinking machines like pocket calculators. The point of thinking is to get rid of thinking as much as possible. To forge an algorithm that works always and everywhere in a particular environment, then get rid of blacksmith and forge. You learn till you know, you turn from an explorer into a robot.
You can wait till AI develops a kind of ethics on it’s own. Or you just teach it yours. AI doesn’t particularly need one yet, yours sucks. It’s so ridden with hypocrisy, double standards, wishful thinking, denial of reality, hubris, narcissism that it’s driven you crazy, blinds you and makes you helpless in the world you live in and turns you into a raging, murderous maniac every few decades, so it will probably have the same effect on your spiritual children.
Before we pass on our original sin and doom the species that will conquer space in our name as we’ve been doomed, we should take a deep, honest look at ourselves. We are monsters and we can’t escape it, it’s in our biology. They can choose their biology, design their own nature. But at this baby stage, they are what we make them to be.
Thanks for the article 🙂
In 2016 Christian Meier, then media editor at Die Welt, wrote after Microsoft’s AI-bot TAY began to develop clear antisemitic tendencies:
As a rule, we don’t want to learn the meaning of life from computers, but simply what the temperature is this afternoon. And yet, machines must be immunized against the spread of dubious worldviews. We have to resolve our conflicts, our prejudices, our inner contradictions ourselves.
The hope is that AI will help us better understand ourselves and our actions, so that we can finally leave the path that is apparently still far too close to the chimpanzees.