AI systems failing many global users have a data problem, not a technology problem - Analyst

Lisa Udechukwu, a data quality analyst whose work focuses on AI annotation, data integrity, and trustworthy AI evaluation, says the AI systems failing many global users have a data problem, not a technology problem.

Udechukwu, also an executive member of Africa Privacy Roundup, an African-led organization focused on data protection and AI governance, told BusinessDay that when the data used to train AI ignores most of the world, the failures aren’t bugs. They’re built in.

“In 2015, Google Photos automatically tagged photos of two Black people as ‘gorillas.’ The company’s fix, years later, was to remove the categories ‘gorilla,’ ‘chimp,’ and ‘monkey’ from its image classifier entirely, not to fix the underlying data. The problem wasn’t a glitch. It was a symptom.

“I’ve spent years working inside annotation pipelines at companies like Pinterest and Meta — the unglamorous infrastructure layer where human judgment gets converted into training signals for machine learning models.

“And what I’ve seen, consistently, is that the data problem in AI is not a resource problem. It’s a representation problem. And it’s more systematic than most people building these systems want to admit,” she explained.

She stressed that most major AI datasets are heavily concentrated in Western, English-language contexts.

The ImageNet dataset — foundational to a decade of computer vision — drew over 40% of its images from the United States alone. African countries, which account for 17% of the global population, collectively contributed less than 1%.

Language model benchmarks follow the same pattern: English dominates, followed by a handful of European languages, with vast multilingual regions effectively absent. This matters in ways that go far beyond misclassified photos.

She noted that when AI systems trained on unrepresentative data are deployed into global markets, which they routinely are, they carry their blind spots with them: hiring algorithms that misread non-Western names, medical AI that underperforms on skin tones that weren’t in the training set, and search and recommendation systems that misinterpret user intent because cultural context wasn’t part of the label design.

These issues are often treated as edge cases, but many are actually predictable outcomes of incomplete and unrepresentative training data. And yet they rarely appear in model performance reports, because the benchmarks used to evaluate ‘accuracy’ are themselves built on the same skewed data foundations.
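
To illustrate how a single headline accuracy figure can hide this kind of gap, here is a minimal Python sketch. It is not drawn from Udechukwu's work or from any real benchmark: the function name `accuracy_by_group`, the region labels, and the counts are all hypothetical, chosen only to show the arithmetic.

```python
from collections import defaultdict


def accuracy_by_group(records):
    """Return (overall_accuracy, {group: accuracy}).

    `records` is an iterable of (group, true_label, predicted_label) tuples.
    """
    totals = defaultdict(int)
    correct = defaultdict(int)
    for group, label, prediction in records:
        totals[group] += 1
        if label == prediction:
            correct[group] += 1
    if not totals:
        raise ValueError("records must not be empty")
    overall = sum(correct.values()) / sum(totals.values())
    per_group = {group: correct[group] / totals[group] for group in totals}
    return overall, per_group


# Hypothetical evaluation set: 90 examples from a well-represented region
# and 10 from an under-represented one. All numbers are made up.
records = (
    [("region_a", 1, 1)] * 88 + [("region_a", 1, 0)] * 2
    + [("region_b", 1, 1)] * 5 + [("region_b", 1, 0)] * 5
)

overall, per_group = accuracy_by_group(records)
print(f"overall: {overall:.2f}")
for group, accuracy in sorted(per_group.items()):
    print(f"{group}: {accuracy:.2f}")
```

In this made-up example the overall accuracy is 0.93, while the smaller group sits at 0.50. Because that group makes up only a tenth of the test set, its failures barely move the headline number.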

“In my work managing annotation pipelines — overseeing quality across thousands of human-labeled data points — I’ve seen how cultural blind spots enter training data not through malice, but through the quiet assumptions embedded in labeling guidelines.

“When an annotation task asks workers to judge whether a search result is ‘relevant,’ the definition of relevance is written by someone. That someone is almost always located in a high-income, English-speaking country.

“The resulting guidelines can work reasonably well for users who look, speak, and search like the guideline-writer. For everyone else, the signal degrades — and that degradation rarely surfaces until a model fails loudly in a market the company cares about.

“Three patterns repeat across nearly every pipeline I’ve worked in: inconsistent labeling when cultural context is ambiguous and guidelines don’t account for it; systematic misclassification in product categories or content types that are common outside the West but were never included in taxonomy design; and relevance scoring that quietly penalizes regional expression, local idiom, and non-standard syntax,” she added.
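
The first of those patterns, inconsistent labeling, is one that a pipeline could in principle measure. The sketch below is an assumption about how such a check might look, not a description of any tool used at the companies named above. It computes Cohen's kappa, a standard chance-corrected agreement score between two annotators, separately for each locale; the locale names and labels are hypothetical.

```python
from collections import Counter, defaultdict


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators labeling the same items."""
    n = len(labels_a)
    if n == 0 or n != len(labels_b):
        raise ValueError("need two non-empty label lists of equal length")
    # Fraction of items on which the two annotators agree.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Agreement expected by chance, from each annotator's label frequencies.
    count_a, count_b = Counter(labels_a), Counter(labels_b)
    expected = sum(count_a[k] * count_b[k] for k in count_a) / (n * n)
    if expected == 1.0:
        # Both annotators used one identical label throughout; kappa is
        # undefined here, so this sketch reports full agreement by convention.
        return 1.0
    return (observed - expected) / (1 - expected)


def kappa_by_locale(items):
    """Group (locale, label_a, label_b) tuples by locale and score each group."""
    grouped = defaultdict(lambda: ([], []))
    for locale, label_a, label_b in items:
        grouped[locale][0].append(label_a)
        grouped[locale][1].append(label_b)
    return {locale: cohen_kappa(a, b) for locale, (a, b) in grouped.items()}


# Hypothetical relevance judgments (1 = relevant, 0 = not relevant).
first = [1, 1, 1, 1, 1, 0, 0, 0, 0, 0]
items = (
    [("locale_x", a, b) for a, b in zip(first, [1, 1, 1, 1, 1, 0, 0, 0, 0, 1])]
    + [("locale_y", a, b) for a, b in zip(first, [1, 1, 1, 0, 0, 1, 1, 0, 0, 0])]
)

scores = kappa_by_locale(items)
for locale, kappa in sorted(scores.items()):
    print(f"{locale}: kappa = {kappa:.2f}")
```

With these invented labels, agreement is about 0.8 for the first locale and about 0.2 for the second. A gap of that kind would suggest the guidelines are ambiguous for the second locale's content, and it would be visible before any model is trained on the labels.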
