Sheraa is Sharjah’s initiative to build a start-up ecosystem in the emirate. It has run extensive programming to educate its community about machine learning and is partnered with the University of Sharjah to find commercial applications for the technology. Courtesy Sheraa

Lost in translation: Why machine learning finds Arabic challenging


Kelsey Warner

Machine learning is one of the fastest-growing and most transformative technologies in the world. Yet as it increasingly caters to English and Chinese speakers, it leaves other people and economies to play catch-up at a critical time in its development.

"Arabic is falling behind," Professor Ashraf Elnagar, the coordinator of the machine learning and Arabic language processing research group at the University of Sharjah, told The National.

The global machine learning market is projected to grow from $7.3 billion (Dh26.7bn) in 2020 to $30.6bn in 2024, according to a 2019 report by Market Research Future. The pandemic has dented these outlays, but data scientists are still in demand.

LinkedIn currently has about 75,000 active job listings worldwide for those with machine learning skills, the majority of them in the US, Asia and Western Europe.

There are numerous examples of machine learning popping up in everyday life: Netflix's recommendations or the playlists generated by Spotify; Siri or Alexa responding to a spoken request for the local headlines; or a credit card company sending a text alert about potentially fraudulent activity.

Machine learning is also being used by businesses to generate consumer insights, improve customer service, reduce costs and automate processes.

But chances are, the data being used for any of these activities - especially as they become more advanced - is in Chinese or English, or possibly Spanish or French, the most popular languages fueling this artificial intelligence boom, according to Prof Elnagar.

The challenges for Arabic are twofold: the complexity of the language, and the amount of resources and research being put into developing machine learning for it.

Arabic is "structurally ambiguous", according to Prof Elnagar, lacking capital letters to indicate proper nouns or the start of sentences, for example.

There are also three recognised varieties of Arabic: classical, as in the Quran; modern standard, which is conversational and seen on TV; and colloquial, which is "gaining traction on social media. It has its own population, its own customers and it is on the rise" and has 20 dialects, Prof Elnagar said.

This variability and ambiguity in meaning make it very challenging to train machines to make human-like decisions when they are reading Arabic.

John Lillywhite, the digital transition lead at Al Bawaba, a Jordanian media company, recently helped the company win a grant from Google to tackle this challenge.

The title of his pitch: 'Why Can’t Machines Read Arabic at Scale?'

With financial support from Google, his team will work to make one of the largest Arabic language news archives in the Middle East searchable.

There are commercial and scholarly upsides to making this database searchable: it opens the archive up to third-party publishers and researchers who can extract meaning and insight from it. Once the tool is developed, which is expected to take around 18 months, it can be used by other publishing platforms and websites to improve the filtering and finding of Arabic content.

This would be a major milestone for Arabic media - and is a facet of news reading that native English speakers take for granted.

"It would be great if machines could tell the difference between a peace process story and a sporting event in Arabic," Mr Lillywhite told The National. "We're not there yet."

But his work is a sign of progress in the field of machine learning and Arabic, among others. The UAE's dedicated AI university, the Mohamed bin Zayed University of Artificial Intelligence, will swing open its doors next January, and the University of Sharjah is seeing greater interest from entrepreneurs and the private sector to find commercial applications for its research.

"I think we will see huge strides in the next decade," Prof Elnagar said. "It is extremely promising."
