
Can Facebook really rely on artificial intelligence to spot abuse?

Facebook faces a monumental challenge: how can its 35,000 moderators watch over billions of posts and comments every day to sift out abusive and dangerous content?

Just 18 months ago, Mark Zuckerberg, Facebook’s founder, was confident that rapid advances in artificial intelligence would solve the problem. Computers would spot and stop bullying, hate speech and other violations of Facebook’s policies before they could spread.

But while the company has made significant advances, the promise of AI still seems distant. In recent months, Facebook has suffered high-profile failures to prevent illegal content, such as live footage from terrorist shootings, and Mr Zuckerberg has conceded that the company still needs to spend heavily on humans to spot problems.

“There’s just so much content flowing through the system that we do need a lot of people looking at this,” he said.


In interviews, Facebook’s executives in charge of developing moderation software and outside experts said that there are persistent, and perhaps insurmountable, challenges.

These include finding the right data to train artificial intelligence algorithms, developing programs that understand enough nuance and context to spot hate speech, and outsmarting human adversaries who keep learning how to game the system.

“We’re pushing the frontier,” said Mike Schroepfer, Facebook’s chief technology officer. But where there have been grave mistakes, “the technology was just not up to what we do”.

From reactive to proactive

In its earlier days, Facebook relied on its users to report objectionable content, which human moderators would then review and decide whether to take down.

But over the past five years or so, Facebook has built a team of “hundreds” of machine learning experts, engineers and data scientists to develop algorithms that can automatically flag unwanted content.

According to Mr Schroepfer, technologies for image recognition — which were unreliable before 2014 — are now “stunningly good”. Language understanding, introduced for hate speech detection in 2017, is improving but remains fairly nascent, as algorithms struggle to account for context.

“If you have to sit and stare at a problem and do a bunch of internet research . . . and it’s going to take you 10 minutes, I don’t have a lot of hope that AI is going to understand that in the next 12 months,” he said. “But if you could sit there and do it in 5 to 10 seconds — we’re getting to the point where AI systems are probably going to be better than you at that.”

The use of these algorithms comes as a spate of media reports has highlighted the devastating effect on the mental health of content moderators, many of them low-paid contractors, of having to sift through disturbing content in order to remove it.

Training the machine

But the system needs to be trained. The more data that is fed into it — whether images of terrorist insignia or harmful keywords — the more the machine learning technology learns and improves.

Without enough training data, the system does not know what to look for.

A recent example was when Facebook said it did not have enough first-person shooter video footage for its algorithms to recognise and take down the videos of the attacks on two mosques in New Zealand earlier this year.

Facebook has now equipped London police with body cameras during terrorist training exercises to get more footage, having eschewed using footage of video game shoot-outs or paintballing.

According to Mr Schroepfer, its numerous data sets will typically be made up of tens of thousands — or even millions — of examples to learn from. These should include not just precise examples of what an algorithm should detect, but also “hard negatives” and “near positives”: items that are close to a match but should not count. For image recognition of a water bottle, for example, the system should learn to classify hand sanitiser as a near positive rather than a match.
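
A minimal sketch can make the role of near positives concrete. The example below is purely illustrative and assumes nothing about Facebook’s actual pipeline: random vectors stand in for image embeddings, and the hand-sanitiser-style “near positives” are deliberately labelled as negatives so a simple classifier learns the boundary between close and correct.

    # Illustrative sketch only: how a training set might mix true positives,
    # near positives and hard negatives. The vectors and labels are toy
    # stand-ins, not Facebook's data or model.
    import numpy as np
    from sklearn.linear_model import LogisticRegression

    rng = np.random.default_rng(0)

    positives      = rng.normal(loc=1.0, scale=0.3, size=(200, 16))  # e.g. water bottles
    near_positives = rng.normal(loc=0.8, scale=0.3, size=(200, 16))  # e.g. hand sanitiser
    hard_negatives = rng.normal(loc=0.0, scale=0.3, size=(200, 16))  # clearly unrelated items

    # Only true positives get label 1; near positives are labelled 0 so the
    # model learns not to over-match things that merely look similar.
    X = np.vstack([positives, near_positives, hard_negatives])
    y = np.concatenate([np.ones(200), np.zeros(200), np.zeros(200)])

    clf = LogisticRegression(max_iter=1000).fit(X, y)
    print("training accuracy:", clf.score(X, y))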

Facebook will typically train its AI on content posted by its users, as well as publicly available data sets. When it comes to images and memes, data sets can be created to take into account the fact that some people will doctor an original in order to evade detection.

The company has regional human moderators who are told to stay alert for new tricks, and it also works with external partners. The University of Alabama at Birmingham, for example, is helping Facebook keep abreast of newly emerging street names for drugs.

“In a lot of cases, this is an adversarial game,” Mr Schroepfer said. “[Adversaries] are trading tips and tricks like, hey, if you just cut the video like this, put a border around it, you can repost it without detection,” he added.
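
One way to blunt such tricks, in principle, is to generate the doctored variants yourself and add them to the training data. The snippet below is a hypothetical sketch of that idea using the Pillow imaging library, not a description of Facebook’s tooling: it produces cropped, bordered and mirrored copies of an image, mimicking the edits Mr Schroepfer describes.

    # Illustrative sketch only: generating edited copies of an image (crop,
    # border, mirror) so a detector can be trained to recognise doctored
    # reposts. Pillow is used purely for the example.
    from PIL import Image, ImageOps

    def adversarial_variants(img):
        w, h = img.size
        cropped = img.crop((int(0.1 * w), int(0.1 * h), int(0.9 * w), int(0.9 * h)))
        bordered = ImageOps.expand(img, border=max(4, w // 20), fill="black")
        mirrored = ImageOps.mirror(img)
        return [cropped, bordered, mirrored]

    # A synthetic image keeps the snippet self-contained.
    original = Image.new("RGB", (256, 256), color=(200, 30, 30))
    print([v.size for v in adversarial_variants(original)])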

Language barriers

For text, there are multiple languages to account for, and those that are less common are harder for the computer to understand.

“Myanmar — we know we need to do a better job there,” said Guy Rosen, Facebook’s vice-president of integrity. Last year the company faced harsh criticism for being too slow to clamp down on groups inciting violence. “There’s not a lot of content in the world in Burmese, which means there’s not a lot of training data.”

Facebook is now translating watchwords across multiple languages but the system is better at spotting the sort of language used by groups designated as terrorists by the UN, such as Isis or al-Qaeda, according to Sasha Havlicek, chief executive of the Institute for Strategic Dialogue, a London-based think-tank that specialises in violent extremism and terrorism.

This means that “the internet companies haven’t quite caught up to the far-right challenge yet”, she said.

Text and context

Experts warn that AI still falls dramatically short when it comes to policing “grey area” content, particularly hate speech or harassment, that requires understanding of nuance or knowledge of the latest slang.

Already it is a divisive area — Facebook is in the middle of creating an independent content moderation “Supreme Court”, where users can challenge an individual content decision if they believe it to be unfair.

One attendee at Facebook’s annual shareholder meeting complained, for example, that the company had banned her from selling T-shirts on the site with slogans such as “Men are Trash”, which were deemed dehumanising under Facebook’s current rules.

Meanwhile, it is close to impossible for current algorithms to detect some of the wider context around slurs: whether, for example, they are said in jest, as reclamation or as condemnation.

“When the level of subtlety goes up, or context goes up, the technical challenges go up dramatically,” Mr Schroepfer said.

One solution is to assess other signals, such as a user’s behavioural patterns on the platform, or the comments in response to a post, as part of making a judgment call.
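
As a rough illustration of what assessing other signals could look like, the sketch below blends a text model’s score with two contextual signals before deciding whether to escalate a post. The signal names, weights and threshold are invented for the example; they are not Facebook’s.

    # Illustrative sketch only: combining a text score with contextual signals.
    # All names, weights and the threshold here are hypothetical.
    from dataclasses import dataclass

    @dataclass
    class PostSignals:
        text_score: float       # 0..1 score from a hate-speech text model
        report_rate: float      # 0..1 share of viewers who reported the post
        hostile_replies: float  # 0..1 share of replies judged hostile

    def needs_human_review(s, threshold=0.6):
        # Weighted blend: the text score dominates, context signals nudge it.
        combined = 0.6 * s.text_score + 0.25 * s.report_rate + 0.15 * s.hostile_replies
        return combined >= threshold

    print(needs_human_review(PostSignals(text_score=0.55, report_rate=0.7, hostile_replies=0.4)))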

But the company argues that AI will always need humans for the labelling and review of this type of borderline content. “People often pose this as an oppositional thing, like people versus the machines,” said Mr Schroepfer. “I think of it as human augmented.”

Wrong approach?

Some researchers argue that Facebook’s entire strategy is misguided. Instead, it should focus on how its news feed algorithms serve content to users.

“The algorithms are designed to show you things it thinks are of interest, designed to keep you on the platform longer,” said Joan Donovan, director of the Technology and Social Change research project at the Harvard Kennedy School, who specialises in online extremism and media manipulation. “In doing that, they tend to move closer to content that is outrageous, that is novel.”

Ms Havlicek adds: “From the outset we have said the playing field is not level. It’s meaningless if there is a structural imbalance in relation to amplification of extreme messaging. If you don’t address the underlying tech architecture that amplifies extremism through the algorithmic design, then there is no way to outcompete this.”

Hannah Murphy and Madhumita Murgia
