Computer program puts pen to the sword


David Lepeska
  • English
  • Arabic

At some point in the recent past, humanity slipped imperceptibly from the Information Age into the Era of Big Data. We are today in the midst of a statistical revolution that's changing our world.

The shift is less about election polls and jobs numbers, share prices and box office than the unseen tools monitoring our virtual and real-world location, our shopping habits, the scale of rush-hour traffic jams in Mumbai and the precise placement of the home team's fielders in the eighth inning of a Major League baseball game.

Experts estimate the world's store of data is growing at a rate of about five trillion bits per second, doubling every two years. The data created each day could fill a billion books of 10 million pages each, according to Nate Silver, author of the just-published The Signal and the Noise. Yet our ability to tease out the lessons in all those numbers - to find the signal in the noise - has lagged behind. That's where Narrative Science comes in.

"People haven't been thinking about using data to communicate," says Kristian Hammond, the company's co-founder and chief technology officer, during a recent interview at the two-year-old company's Chicago headquarters. "They've been thinking about how do I show you the data, how do I expose data. We're all about taking the next step, the very human step of examining the data, drawing insight out of it and then crafting reporting out of that insight."

As the demand for teasing out and relaying the value and meaning of numbers increases exponentially, man's monopoly on writing is nearing its end. Narrative Science and a few other firms have devised computerised tools that transform data into relatively sophisticated narratives - raising the possibility, yet again, that the news reporter as we know him may soon become extinct.

"You're going to see more tools like this," says Matt Waite, a journalism professor at the University of Nebraska. "I don't see the trend shifting anytime soon, and inevitably they're going to get better."

The great irony of Narrative Science is that it was incubated in part at Northwestern University's Medill School of Journalism, one of America's better training grounds for reporters. While teaching a 2009 course on journalism and programming, co-instructors Hammond and Larry Birnbaum (now Narrative Science's chief scientific adviser) urged their students to create a system that could turn data into reporting.

One student group came up with a tool called Stats Monkey, which could transform baseball statistics into game recaps (with the byline, "The Machine"). "Once we did baseball we knew exactly what we had," says Hammond, referring to the program's ability to do all variety of data-heavy narratives.

Though its media output gets most of the attention, about 80 per cent of Narrative Science's total revenue - which has jumped four-fold in the past year - comes from big data clients. "We're not moving away from media but we're growing the rest of the company way faster," says Hammond.

One fast-food client, for example, receives detailed weekly reports tailored to sales and operations at each of its 14,000 franchises. Narrative Science's chief executive is Stuart Frankel, a former DoubleClick executive who brings his tech industry experience to bear. His company is now moving into medical reports and student testing and recently began creating personalised reports for business conference attendees based on Twitter comments.

Narrative Science is not the only player in this field. The US Department of Defense's Defense Advanced Research Projects Agency, or Darpa, has a team of scientists at the Massachusetts Institute of Technology working on a computer tool to transform raw data into clear, concise writing. Wikipedia uses bots to troll for errors in its millions of entries.

Yet Narrative Science goes much further. Its newswriting system, called Quill, begins with a set of tools governing each topic and the relationships between them. Input is routed into the appropriate category and Quill makes deductions based on the numbers, which it then turns into prose according to a set of templates and topic-specific vocabulary devised by Narrative Science programmers.

"By the time it hits the point where it's picking phrases, it usually has at least a half-dozen structural ways it's going to say what it wants to say," says Hammond. "At the micro-level, it knows how to pull words in and out to put in variable details, and beyond that it knows what it's already said earlier in the story."

The final product - US$10 (Dh36) for each 500-word article - reads something like this: "Analysts have become increasingly bullish on Discover Financial Services (DFS) in the month leading up to the company's third quarter earnings announcement scheduled for Thursday, September 27, 2012. The consensus earnings per share estimate has moved up from $1.02 a share to the current expectation of earnings of $1.04 a share."

Not exactly Hemingway, but as a market report it's perfectly acceptable, human even. In addition to Forbes.com, for which the above was written, Narrative Science media clients include the Big Ten Network, the financial information firm Markit, and GameChanger, which expects to produce 1.5 million recaps of children's baseball games this year, via Quill.

Since The Big Ten Network began using StatsMonkey in 2010, the sports-focused website has since seen its traffic increase by more than 30 per cent. Narrative Science's main competitor, Automated Insights (AI), also started with sports reporting and has moved on to other media areas. The company just began a partnership with Yahoo! under which it expects to produce 50 million personalised recaps of American football games over the next few months. AI also started a partnership this summer to produce as many as a million stories a year for a national online estate agent. As the number of computer-generated stories reaches into the tens of millions, these tools may start to reshape the news landscape. Clients of Narrative Science and Automated Insights are already able to customise a story's tone, from a wry, seen-it-all veteran Premier League football reporter to an overenthusiastic financial correspondent. Soon they'll start to personalise the news, tailoring stories to their neighbourhood, their profession, their politics.

A recent Gallup poll found that more than 60 per cent of Americans distrust their news sources, which suggests an opening for computer-generated content. This could lead to countless millions of personalised stories with built-in bias - and a world of people reading only the news they like.

Analysts argue this trend could push opposing groups further out of touch, reinforcing their own views and marginalising those outside the "news filter bubble". "Yes, it's possible to insert bias into our stuff," explains Hammond.

"Machines are scary, and part of our job and the content we produce is to make them less scary, and you make them less scary by making them more human."

But in making his machines more human, they may also become more scary - at least for media professionals. Hammond says he could adjust Quill to add the "human element" in stories, focusing on a single victim of a factory closing, for instance. Even so, it may be a long time before a computer understands complicated human ideas and expressions.

"Sarcasm, empathy, that sort of thing is really difficult for a computer to grasp," says Waite, the University of Nebraska journalism professor, who co-founded the Pulitzer Prize-winning news site Politifact. "Until computers are able to sense the subtle differences in the way we express ourselves, I'm really not afraid of Narrative Science taking away our jobs."

With smaller editorial staffs, Waite sees Narrative Science as a positive development for newsrooms. Hammond predicts that within 15 years tools like Quill will be creating as much as 90 per cent of all news stories.

It's hard to see how a string of algorithms could report from a natural or man-made disaster, so that number may be a bit high. And it will be a long time before Quill can punch out the closely-reported, long-form journalism that often wins awards. But most analysts estimate that by early 2014 every media company will have incorporated automated content into its newsroom in some way.

And Hammond has repeatedly predicted that a computerised reporting tool - able to comb oceans of data with much greater speed and efficiency than humans and potentially uncover powerful, previously-unattainable stories - will win a Pulitzer Prize by 2015. "That's three years away," he says with a grin, making the sound of a ticking clock.

The name Quill fits, as its arrival echoes Johannes Gutenberg's invention of the printing press. In advancing the technology that put words to paper, Gutenberg's press nearly made feather-and-ink scribes obsolete - though that was not his intention.

Certainly, the reporter will live on, in one form or another, but journalists of the future will need to be more disciplined and better trained. They'll need their own style, their own brand, to keep from being automated.

After all, a computer is infinitely faster and more responsive to request. It does not make mistakes, tire or grow lazy, whine about a dull assignment, take holidays or become frustrated with office politics. The Machine is coming.

David Lepeska is a freelance writer who contributes to The New York Times and Financial Times, and previously served as The National's Qatar correspondent.

Expo details

Expo 2020 Dubai will be the first World Expo to be held in the Middle East, Africa and South Asia

The world fair will run for six months from October 20, 2020 to April 10, 2021.

It is expected to attract 25 million visits

Some 70 per cent visitors are projected to come from outside the UAE, the largest proportion of international visitors in the 167-year history of World Expos.

More than 30,000 volunteers are required for Expo 2020

The site covers a total of 4.38 sqkm, including a 2 sqkm gated area

It is located adjacent to Al Maktoum International Airport in Dubai South

FIGHT CARD

Sara El Bakkali v Anisha Kadka (Lightweight, female)
Mohammed Adil Al Debi v Moaz Abdelgawad (Bantamweight)
Amir Boureslan v Mahmoud Zanouny (Welterweight)
Abrorbek Madaminbekov v Mohammed Al Katheeri (Featherweight)
Ibrahem Bilal v Emad Arafa (Super featherweight)
Ahmed Abdolaziz v Imad Essassi (Middleweight)
Milena Martinou v Ilham Bourakkadi (Bantamweight, female)
Noureddine El Agouti v Mohamed Mardi (Welterweight)
Nabil Ouach v Ymad Atrous (Middleweight)
Nouredin Samir v Zainalabid Dadachev (Lightweight)
Marlon Ribeiro v Mehdi Oubahammou (Welterweight)
Brad Stanton v Mohamed El Boukhari (Super welterweight

The specs

Engine: 1.5-litre turbo

Power: 181hp

Torque: 230Nm

Transmission: 6-speed automatic

Starting price: Dh79,000

On sale: Now

The specs: 2018 Ford Mustang GT

Price, base / as tested: Dh204,750 / Dh241,500
Engine: 5.0-litre V8
Gearbox: 10-speed automatic
Power: 460hp @ 7,000rpm
Torque: 569Nm @ 4,600rpm​​​​​​​
​​​​​​​Fuel economy, combined: 10.3L / 100km

The%20specs
%3Cp%3E%3Cstrong%3EEngine%3A%3C%2Fstrong%3E%203.0-litre%20six-cylinder%20turbo%20(BMW%20B58)%3Cbr%3E%3Cstrong%3EPower%3A%3C%2Fstrong%3E%20340hp%20at%206%2C500rpm%3Cbr%3E%3Cstrong%3ETorque%3A%3C%2Fstrong%3E%20500Nm%20from%201%2C600-4%2C500rpm%3Cbr%3E%3Cstrong%3ETransmission%3A%3C%2Fstrong%3E%20ZF%208-speed%20auto%3Cbr%3E%3Cstrong%3E0-100kph%3A%3C%2Fstrong%3E%204.2sec%3Cbr%3E%3Cstrong%3ETop%20speed%3A%3C%2Fstrong%3E%20267kph%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EOn%20sale%3A%3C%2Fstrong%3E%20Now%3Cbr%3E%3Cstrong%3EPrice%3A%3C%2Fstrong%3E%20From%20Dh462%2C189%3Cbr%3E%3Cstrong%3EWarranty%3A%3C%2Fstrong%3E%2030-month%2F48%2C000k%3C%2Fp%3E%0A
The%20specs
%3Cp%3E%3Cstrong%3EEngine%3A%20%3C%2Fstrong%3E6.5-litre%20V12%3Cbr%3E%3Cstrong%3EPower%3A%20%3C%2Fstrong%3E725hp%20at%207%2C750rpm%3Cbr%3E%3Cstrong%3ETorque%3A%20%3C%2Fstrong%3E716Nm%20at%206%2C250rpm%3Cbr%3E%3Cstrong%3ETransmission%3A%20%3C%2Fstrong%3E8-speed%20dual-clutch%20auto%3Cbr%3E%3Cstrong%3EOn%20sale%3A%20%3C%2Fstrong%3EQ4%202023%3Cbr%3E%3Cstrong%3EPrice%3A%20%3C%2Fstrong%3EFrom%20Dh1%2C650%2C000%3C%2Fp%3E%0A
FA Cup quarter-final draw

The matches will be played across the weekend of 21 and 22 March

Sheffield United v Arsenal

Newcastle v Manchester City

Norwich v Derby/Manchester United

Leicester City v Chelsea

Mina Cup winners

Under 12 – Minerva Academy

Under 14 – Unam Pumas

Under 16 – Fursan Hispania

Under 18 – Madenat

About Proto21

Date started: May 2018
Founder: Pir Arkam
Based: Dubai
Sector: Additive manufacturing (aka, 3D printing)
Staff: 18
Funding: Invested, supported and partnered by Joseph Group

Ferrari 12Cilindri specs

Engine: naturally aspirated 6.5-liter V12

Power: 819hp

Torque: 678Nm at 7,250rpm

Price: From Dh1,700,000

Available: Now

'My Son'

Director: Christian Carion

Starring: James McAvoy, Claire Foy, Tom Cullen, Gary Lewis

Rating: 2/5

The Bio

Hometown: Bogota, Colombia
Favourite place to relax in UAE: the desert around Al Mleiha in Sharjah or the eastern mangroves in Abu Dhabi
The one book everyone should read: 100 Years of Solitude by Gabriel Garcia Marquez. It will make your mind fly
Favourite documentary: Chasing Coral by Jeff Orlowski. It's a good reality check about one of the most valued ecosystems for humanity

Match info

Bournemouth 1 (King 45 1')
Arsenal 2 (Lerma 30' og, Aubameyang 67')

Man of the Match: Sead Kolasinac (Arsenal)

Best Academy: Ajax and Benfica

Best Agent: Jorge Mendes

Best Club : Liverpool   

 Best Coach: Jurgen Klopp (Liverpool)  

 Best Goalkeeper: Alisson Becker

 Best Men’s Player: Cristiano Ronaldo

 Best Partnership of the Year Award by SportBusiness: Manchester City and SAP

 Best Referee: Stephanie Frappart

Best Revelation Player: Joao Felix (Atletico Madrid and Portugal)

Best Sporting Director: Andrea Berta (Atletico Madrid)

Best Women's Player:  Lucy Bronze

Best Young Arab Player: Achraf Hakimi

 Kooora – Best Arab Club: Al Hilal (Saudi Arabia)

 Kooora – Best Arab Player: Abderrazak Hamdallah (Al-Nassr FC, Saudi Arabia)

 Player Career Award: Miralem Pjanic and Ryan Giggs

Water waste

In the UAE’s arid climate, small shrubs, bushes and flower beds usually require about six litres of water per square metre, daily. That increases to 12 litres per square metre a day for small trees, and 300 litres for palm trees.

Horticulturists suggest the best time for watering is before 8am or after 6pm, when water won't be dried up by the sun.

A global report published by the Water Resources Institute in August, ranked the UAE 10th out of 164 nations where water supplies are most stretched.

The Emirates is the world’s third largest per capita water consumer after the US and Canada.

Famous left-handers

- Marie Curie

- Jimi Hendrix

- Leonardo Di Vinci

- David Bowie

- Paul McCartney

- Albert Einstein

- Jack the Ripper

- Barack Obama

- Helen Keller

- Joan of Arc

The Cockroach

 (Vintage)

Ian McEwan 
 

Formula%204%20Italian%20Championship%202023%20calendar
%3Cp%3EApril%2021-23%3A%20Imola%3Cbr%3EMay%205-7%3A%20Misano%3Cbr%3EMay%2026-28%3A%20SPA-Francorchamps%3Cbr%3EJune%2023-25%3A%20Monza%3Cbr%3EJuly%2021-23%3A%20Paul%20Ricard%3Cbr%3ESept%2029-Oct%201%3A%20Mugello%3Cbr%3EOct%2013-15%3A%20Vallelunga%3C%2Fp%3E%0A
Guardians%20of%20the%20Galaxy%20Vol%203
%3Cp%3E%3Cstrong%3EDirector%3A%20%3C%2Fstrong%3EJames%20Gunn%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EStars%3A%3C%2Fstrong%3E%20Chris%20Pratt%2C%20Zoe%20Saldana%2C%20Dave%20Bautista%2C%20Vin%20Diesel%2C%20Bradley%20Cooper%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3ERating%3A%3C%2Fstrong%3E%204%2F5%3C%2Fp%3E%0A
NO OTHER LAND

Director: Basel Adra, Yuval Abraham, Rachel Szor, Hamdan Ballal

Stars: Basel Adra, Yuval Abraham

Rating: 3.5/5

BUNDESLIGA FIXTURES

(All games 4-3pm kick UAE time) Bayern Munich v Augsburg, Borussia Dortmund v Bayer Leverkusen, Hoffenheim v Hertha Berlin, Wolfsburg v Mainz , Eintracht Frankfurt v Freiburg, Union Berlin v RB Leipzig, Cologne v Schalke , Werder Bremen v Borussia Monchengladbach, Stuttgart v Arminia Bielefeld

Vikram%20Vedha
%3Cp%3E%3Cstrong%3EDirectors%3A%3C%2Fstrong%3E%20Gayatri%2C%20Pushkar%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3EStars%3A%3C%2Fstrong%3E%20Hrithik%20Roshan%2C%20Saif%20Ali%20Khan%2C%20Radhika%20Apte%3C%2Fp%3E%0A%3Cp%3E%3Cstrong%3ERating%3A%C2%A0%3C%2Fstrong%3E3.5%2F5%3C%2Fp%3E%0A
The specs

Engine: Four electric motors, one at each wheel

Power: 579hp

Torque: 859Nm

Transmission: Single-speed automatic

Price: From Dh825,900

On sale: Now