Why every Arab country is racing to build its own large language model

Arabic is spoken by more than 450 million people, yet artificial intelligence has never truly understood it. Global models stumble over dialects, flatten nuance and miss cultural context.

That gap is pushing countries across the region to build their own large language models, from the UAE’s Falcon, developed by the Technology Innovation Institute in Abu Dhabi, to Egypt’s Intella, and Saudi Arabia’s recently announced Humain Chat, created by Humain with backing from the Public Investment Fund. Each is a contender a race to ensure the future of AI reflects Arab voices and identities.

Humain Chat, launched last month and currently accessible to Saudi users in beta mode, is the kingdom’s first home-grown Arabic LLM.

Gulf companies can learn from China’s lean AI without spending billions

AI without the billions: What Gulf firms can learn from China’s lean approach

Developed with support from the Saudi sovereign wealth fund PIF, it is positioned as a secure, Arabic-first alternative to global systems, aimed at sectors such as government, education and business services.

The platform will be rolled out across the world in phases, according to the company.

A language AI has never mastered

Arabic language functions differently to more uniform languages such as English. Nour Al Hassan, founder of Arabic.ai, explained that Arabic isn’t just one language; it’s a family of dialects layered over a deep, classical base. This means each dialect could have a different word to express the same thing.

“The morphology is complex: one root can produce dozens of forms, and words often bundle multiple meanings into a single token,” she told The National. She added that this complexity is compounded by “diversity of dialects, code switching between English, French and Arabizi audiences, En, and the lack of standardised spelling”.

For example, the word “بس” or “bas” can mean “only” in Egypt, “but” in the Levant, or “enough” in the Gulf, differences that can completely change a sentence. Arabizi, meanwhile, is the informal practice of writing Arabic with Latin letters and numbers, such as “3” for ع or “7” for ح, which adds another layer of inconsistency for AI systems to process.

For AI to truly understand Arabic, Ms Al Hassan said, it must learn “the rhythm and nuance of how people actually speak and write across the region, not just formal Arabic in textbooks”. That challenge is what Egypt’s Intella was founded to address.

Chief executive Nour Taher told The National that Arabic’s difficulty for AI “isn’t just its complexity, but its duality”. She explained that Arabic takes many forms: the formal, written Modern Standard Arabic, and then the way people actually speak, which she described as “a rich, diverse spectrum of dialects”.

Listen more about different LLMs:

Most global models fail, she explained, because they rely on labelled data sets, which she said “don't exist in the case of dialectal Arabic”. Instead, Intella spent 18 months building one of the most diverse data sets in the world, curated and annotated by native speakers. Its conversational agent Ziila is already being used in banks, telecoms and government services.

Ms Taher said the company focuses on the application layer, building industry-specific small language models or fine-tuning existing ones, with a particular strength in its proprietary dialectal text-to-speech and speech-to-text engines. “We win by being the most accurate and effective solution for specific business problems, not by trying to be a generalist tool,” she said.

Missing ingredient: real-life Arabic data

If language complexity is the first barrier, data scarcity is the second. Ms Al Hassan called it “the single biggest bottleneck”. The problem, she said, is not just volume. “It’s about quality, balance and rights,” she explained.

“Too much of our Arabic data is either scraped news or religious text. What’s missing are everyday conversations, dialect-rich speech, and domain-specific content.”

She argued that progress depends on sovereign rights, cleared data sets and large-scale Arabic preference training with native raters, people who are proficient in a language and are tasked with evaluating, or rating, language use. “That’s how we close the gap between models that can translate and models that can actually reason and engage in Arabic,” Ms Al Hassan said.

AI as sovereignty and strategy

In the UAE, the motivation for developing Falcon goes beyond language. Dr Hakim Hacid, chief Researcher at the Artificial Intelligence and Digital Science Research Centre at the Technology Innovation Institute, said open sourcing Falcon was a deliberate choice “to accelerate innovation, build trust and ensure broad accessibility”.

Dr Hakim Hacid, chief researcher of the Technology Innovation Institute's AI and digital science research centre unit. Photo: TII

“We didn’t open source because we had to,” he added. “We did it because it works – technically, strategically and ethically,” he told The National. Falcon Arabic was trained on high-quality native Arabic data, covering both Modern Standard Arabic and regional dialects.

Dr Hacid said this allowed the model “to capture not only the structure of the language but also the nuance, tone, and cultural context that are often missing in generic multilingual models”. Ensuring AI reflects the richness of Arabic, he added, is “not just a technical goal, it is essential for inclusion and cultural relevance”.

On the UAE’s push for AI sovereignty, Dr Hacid explained that it isn't just about building models. “It involves having visibility into and ownership over the entire stack: data, infrastructure, algorithm, training and deployment,” he said.

Falcon, he said, gave the UAE hands-on experience in building a high-performance model from the ground up. “Falcon shows that this region can lead technically and contribute meaningfully to the global AI ecosystem,” he said.

While Falcon has performed strongly on global benchmarks, Dr Hacid said the priority is real-world application. “Our focus is on building models that are not only globally competitive, but also efficient, adaptable, and relevant to real-world use,” he said.

He added that if a model performs well in a lab but cannot be deployed responsibly or efficiently, “then it does not serve its purpose”.

Billions fuelling the Arabic AI race

The push is also being driven by money. Prosus Ventures, which recently led a $12.5 million Series A round in Intella, sees Arabic AI as a major opportunity. Robin Voogd, head of Middle East investments at the firm, said Arabic is the fifth-most spoken language in the world, yet Arabic AI models “severely underperform, particularly across dialects”.

This, he said, creates both “a huge gap and a major opportunity: whoever builds the best models for Arabic will gain a strategic data advantage in a massive underserved market”, he told The National.

Fadi Ghandour, executive chairman of the investment company Wamda, said investor appetite is immense.

Fadi Ghandour, executive chairman of Wamda Group. Pawan Singh / The National

“Sovereign wealth funds and government-backed entities have already committed billions to AI infrastructure, particularly in the UAE and Saudi Arabia,” he told The National. “These investments include large-scale data centres and strategic partnerships with companies like Nvidia, because without computer power, AI doesn’t happen.”

The business stakes are clear. According to Grand View Research's January 2024 report, the Mena AI market was valued at $11.9 billion in 2023 and is projected to reach $166.3 billion by 2030, growing at nearly 45 per cent annually.

In the UAE alone, the market is expected to grow from $3.5 billion in 2023 to $46.3 billion by 2030, according to a February report by Trends Research & Advisory, an independent research institution. Most of the momentum is in the Gulf, while the Levant plays a quieter role.

Mr Ghandour described Jordan and Lebanon as important sources of talent. “Jordan and Lebanon have exceptional AI engineers and data scientists, many of whom are already contributing to Arabic LLMs,” he said.

He noted that many are being recruited into Gulf companies or working in hubs in Amman and Irbid. This reflects how the Levant supports the growth of Arabic AI indirectly, even if the flagship projects have their headquarters elsewhere.

Real or hyped?

As with any emerging technology, the risk of hype is ever-present. Mr Ghandour acknowledged it, but said the region was at a turning point. “There’s always hype with new technology. But hype fades – and the serious players remain,” he said.

Ms Al Hassan stressed that Arabic LLMs are not hype if they are built on the right foundations. “They’re only as strong as the data and fine-tuning behind them,” she said.

Without curated corpora and alignment with cultural nuance, she warned, “Arabic LLMs risk being generic imitations.” But with the right investment in data and real use cases, “they become genuine breakthroughs”.

Ms Taher at Intella agreed that enterprises were already pushing beyond experimentation. She said her client “is leapfrogging the chatbot phase and moving directly to sophisticated conversational intelligence. This demonstrates a clear, top-down mandate to use AI as a core pillar of business strategy.”

The rise of Arabic LLMs is not just about catching up with Silicon Valley. It is about cultural relevance, digital sovereignty and economic opportunity.

Falcon, Intella and Humain each represent different answers to the same question: why should the region depend on others to build its technological future?

As Mr Ghandour put it, Arabic-focused LLMs are “not just about language – they’re about identity. The age of one-size-fits-all tech is behind us.”

On Women's Day

Dr Nawal Al-Hosany: Why more women should be on the frontlines of climate action

Shelina Janmohamed: Why shouldn't a spouse be compensated fairly for housework?

Justin Thomas: Challenge the notion that 'men are from Mars, women are from Venus'

The National Editorial: Is there much to celebrate on International Women's Day 2021?

Traits of Chinese zodiac animals

Tiger:independent, successful, volatile
Rat:witty, creative, charming
Ox:diligent, perseverent, conservative
Rabbit:gracious, considerate, sensitive
Dragon:prosperous, brave, rash
Snake:calm, thoughtful, stubborn
Horse:faithful, energetic, carefree
Sheep:easy-going, peacemaker, curious
Monkey:family-orientated, clever, playful
Rooster:honest, confident, pompous
Dog:loyal, kind, perfectionist
Boar:loving, tolerant, indulgent

How our nine-month-old has made the Canadian winter a little bit warmer for us

Canadians rallying around a targeted Syrian restaurant is telling about attitudes to migrants

Don Cherry and the real issue with being one of 'you people'

Key findings of Jenkins report

Founder of the Muslim Brotherhood, Hassan al Banna, "accepted the political utility of violence"
Views of key Muslim Brotherhood ideologue, Sayyid Qutb, have “consistently been understood” as permitting “the use of extreme violence in the pursuit of the perfect Islamic society” and “never been institutionally disowned” by the movement.
Muslim Brotherhood at all levels has repeatedly defended Hamas attacks against Israel, including the use of suicide bombers and the killing of civilians.
Laying out the report in the House of Commons, David Cameron told MPs: "The main findings of the review support the conclusion that membership of, association with, or influence by the Muslim Brotherhood should be considered as a possible indicator of extremism."

SPEC%20SHEET%3A%20APPLE%20IPAD%20PRO%20(12.9%22%2C%202022)

%3Cp%3E%3Cstrong%3EDisplay%3A%3C%2Fstrong%3E%2012.9-inch%20Liquid%20Retina%20XDR%2C%202%2C732%20x%202%2C048%2C%20264ppi%2C%20wide%20colour%2C%20True%20Tone%2C%20ProMotion%2C%201%2C600%20nits%20max%2C%20Apple%20Pencil%20hover%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EChip%3A%3C%2Fstrong%3E%20Apple%20M2%2C%208-core%20CPU%2C%2010-core%20GPU%2C%2016-core%20Neural%20Engine%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EMemory%3A%3C%2Fstrong%3E%20Storage%20%E2%80%93%20128GB%2F256GB%2F512GB%20%2F%201TB%2F2TB%3B%20RAM%20%E2%80%93%208GB%2F16GB%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EPlatform%3A%3C%2Fstrong%3E%20iPadOS%2016%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EMain%20camera%3A%3C%2Fstrong%3E%20Dual%2012MP%20wide%20(f%2F1.8)%20%2B%2010MP%20ultra-wide%20(f%2F2.4)%2C%202x%20optical%2F5x%20digital%2C%20Smart%20HDR%204%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EVideo%3A%3C%2Fstrong%3E%20ProRes%204K%20%40%2030fps%2C%204K%20%40%2024%2F25%2F30%2F60fps%2C%20full%20HD%20%40%2025%2F30%2F60fps%2C%20slo-mo%20%40%20120%2F240fps%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EFront%20camera%3A%3C%2Fstrong%3E%20TrueDepth%2012MP%20ultra-wide%20(f%2F2.4)%2C%202x%2C%20Smart%20HDR%204%2C%20Centre%20Stage%2C%20Portrait%2C%20Animoji%2C%20Memoji%3B%20full%20HD%20%40%2025%2F30%2F60fps%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EAudio%3A%3C%2Fstrong%3E%20Four-speaker%20stereo%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EBiometrics%3A%3C%2Fstrong%3E%20Face%20ID%2C%20Touch%20ID%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EI%2FO%3A%3C%2Fstrong%3E%20USB-C%2C%20smart%20connector%20(for%20folio%2Fkeyboard)%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EBattery%3A%3C%2Fstrong%3E%20Up%20to%2010%20hours%20on%20Wi-Fi%3B%20up%20to%20nine%20hours%20on%20cellular%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EFinish%3A%3C%2Fstrong%3E%20Silver%2C%20space%20grey%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EIn%20the%20box%3A%3C%2Fstrong%3E%20iPad%2C%20USB-C-to-USB-C%20cable%2C%2020-watt%20power%20adapter%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EPrice%3A%3C%2Fstrong%3E%20WiFi%20%E2%80%93%20Dh4%2C599%20(128GB)%20%2F%20Dh4%2C999%20(256GB)%20%2F%20Dh5%2C799%20(512GB)%20%2F%20Dh7%2C399%20(1TB)%20%2F%20Dh8%2C999%20(2TB)%3B%20cellular%20%E2%80%93%20Dh5%2C199%20%2F%20Dh5%2C599%20%2F%20Dh6%2C399%20%2F%20Dh7%2C999%20%2F%20Dh9%2C599%3C%2Fp%3E%0A

Dust and sand storms compared

Sand storm

Particle size: Larger, heavier sand grains
Visibility: Often dramatic with thick "walls" of sand
Duration: Short-lived, typically localised
Travel distance: Limited
Source: Open desert areas with strong winds

Dust storm

Particle size: Much finer, lightweight particles
Visibility: Hazy skies but less intense
Duration: Can linger for days
Travel distance: Long-range, up to thousands of kilometres
Source: Can be carried from distant regions

ATP RANKINGS (NOVEMBER 4)

1. Rafael Nadal (ESP) 9,585 pts ( 1)
2. Novak Djokovic (SRB) 8,945 (-1)
3. Roger Federer (SUI) 6,190
4. Daniil Medvedev (RUS) 5,705
5. Dominic Thiem (AUT) 5,025
6. Stefanos Tsitsipas (GRE) 4,000 ( 1)
7. Alexander Zverev (GER) 2,945 (-1)
8. Matteo Berrettini (ITA) 2,670 ( 1)
9. Roberto Bautista (ESP) 2,540 ( 1)
10. Gaël Monfils (FRA) 2,530 ( 3)
11. David Goffin (BEL) 2,335 ( 3)
12. Fabio Fognini (ITA) 2,290
13. Kei Nishikori (JPN) 2,180 (-2)
14. Diego Schwartzman (ARG) 2,125 ( 1)
15. Denis Shapovalov (CAN) 2,050 ( 13)
16. Stan Wawrinka (SUI) 2,000
17. Karen Khachanov (RUS) 1,840 (-9)
18. Alex De Minaur (AUS) 1,775
19. John Isner (USA) 1,770 (-2)
20. Grigor Dimitrov (BUL) 1,747 ( 7)

If you go

The Flights

Emirates and Etihad fly direct to Johannesburg from Dubai and Abu Dhabi respectively. Economy return tickets cost from Dh2,650, including taxes.

The trip

Worldwide Motorhoming Holidays (worldwidemotorhomingholidays.co.uk) operates fly-drive motorhome holidays in eight destinations, including South Africa. Its 14-day Kruger and the Battlefields itinerary starts from Dh17,500, including campgrounds, excursions, unit hire and flights. Bobo Campers has a range of RVs for hire, including the 4-berth Discoverer 4 from Dh600 per day.

%20Ramez%20Gab%20Min%20El%20Akher

%3Cp%3E%3Cstrong%3ECreator%3A%3C%2Fstrong%3E%20Ramez%20Galal%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EStarring%3A%3C%2Fstrong%3E%20Ramez%20Galal%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EStreaming%20on%3A%20%3C%2Fstrong%3EMBC%20Shahid%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3ERating%3A%20%3C%2Fstrong%3E2.5%2F5%3C%2Fp%3E%0A

Mental%20health%20support%20in%20the%20UAE

%3Cp%3E%E2%97%8F%20Estijaba%20helpline%3A%208001717%3Cbr%3E%E2%97%8F%20UAE%20Ministry%20of%20Health%20and%20Prevention%20hotline%3A%20045192519%3Cbr%3E%E2%97%8F%20UAE%20Mental%20health%20support%20line%3A%20800%204673%20(Hope)%3Cbr%3EMore%20information%20at%20hope.hw.gov.ae%3C%2Fp%3E%0A

David Haye record

Total fights: 32
Wins: 28
Wins by KO: 26
Losses: 4

The Details

Article 15
Produced by: Carnival Cinemas, Zee Studios
Directed by: Anubhav Sinha
Starring: Ayushmann Khurrana, Kumud Mishra, Manoj Pahwa, Sayani Gupta, Zeeshan Ayyub
Our rating: 4/5

MATCH INFO

Barcelona 5 (Lenglet 2', Vidal 29', Messi 34', 75', Suarez 77')

Valladolid 1 (Kiko 15')

Expert advice

“Join in with a group like Cycle Safe Dubai or TrainYAS, where you’ll meet like-minded people and always have support on hand.”

Stewart Howison, co-founder of Cycle Safe Dubai and owner of Revolution Cycles

“When you sweat a lot, you lose a lot of salt and other electrolytes from your body. If your electrolytes drop enough, you will be at risk of cramping. To prevent salt deficiency, simply add an electrolyte mix to your water.”

Cornelia Gloor, head of RAK Hospital’s Rehabilitation and Physiotherapy Centre

“Don’t make the mistake of thinking you can ride as fast or as far during the summer as you do in cooler weather. The heat will make you expend more energy to maintain a speed that might normally be comfortable, so pace yourself when riding during the hotter parts of the day.”

Chandrashekar Nandi, physiotherapist at Burjeel Hospital in Dubai

New Zealand 15 British & Irish Lions 15

New Zealand 15
Tries: Laumape, J Barrett
Conversions: B Barrett
Penalties: B Barrett

British & Irish Lions 15
Penalties: Farrell (4), Daly

Like a Fading Shadow

Antonio Muñoz Molina

Translated from the Spanish by Camilo A. Ramirez

Tuskar Rock Press (pp. 310)

Bahrain%20GP

%3Cp%3EFriday%20qualifying%3A%207pm%20(8pm%20UAE)%3C%2Fp%3E%0A%3Cp%3ESaturday%20race%3A%207pm%20(UAE)%3C%2Fp%3E%0A%3Cp%3ETV%3A%20BeIN%20Sports%3C%2Fp%3E%0A

SPEC%20SHEET%3A%20APPLE%20M3%20MACBOOK%20AIR%20(13%22)

%3Cp%3E%3Cstrong%3EProcessor%3A%3C%2Fstrong%3E%20Apple%20M3%2C%208-core%20CPU%2C%20up%20to%2010-core%20CPU%2C%2016-core%20Neural%20Engine%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EDisplay%3A%3C%2Fstrong%3E%2013.6-inch%20Liquid%20Retina%2C%202560%20x%201664%2C%20224ppi%2C%20500%20nits%2C%20True%20Tone%2C%20wide%20colour%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EMemory%3A%3C%2Fstrong%3E%208%2F16%2F24GB%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EStorage%3A%3C%2Fstrong%3E%20256%2F512GB%20%2F%201%2F2TB%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EI%2FO%3A%3C%2Fstrong%3E%20Thunderbolt%203%2FUSB-4%20(2)%2C%203.5mm%20audio%2C%20Touch%20ID%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EConnectivity%3A%3C%2Fstrong%3E%20Wi-Fi%206E%2C%20Bluetooth%205.3%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EBattery%3A%3C%2Fstrong%3E%2052.6Wh%20lithium-polymer%2C%20up%20to%2018%20hours%2C%20MagSafe%20charging%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3ECamera%3A%3C%2Fstrong%3E%201080p%20FaceTime%20HD%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EVideo%3A%3C%2Fstrong%3E%20Support%20for%20Apple%20ProRes%2C%20HDR%20with%20Dolby%20Vision%2C%20HDR10%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EAudio%3A%3C%2Fstrong%3E%204-speaker%20system%2C%20wide%20stereo%2C%20support%20for%20Dolby%20Atmos%2C%20Spatial%20Audio%20and%20dynamic%20head%20tracking%20(with%20AirPods)%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EColours%3A%3C%2Fstrong%3E%20Midnight%2C%20silver%2C%20space%20grey%2C%20starlight%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EIn%20the%20box%3A%3C%2Fstrong%3E%20MacBook%20Air%2C%2030W%2F35W%20dual-port%2F70w%20power%20adapter%2C%20USB-C-to-MagSafe%20cable%2C%202%20Apple%20stickers%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EPrice%3A%3C%2Fstrong%3E%20From%20Dh4%2C599%3C%2Fp%3E%0A

Why every Arab country is racing to build its own large language model

Regional investment in Arabic AI is soaring, with the market set to exceed $160 billion by 2030