AI challenge on SemEval2018

How to detect events in the news and count them?

We are running a new AI challenge on SemEval2018:

Task 5: Counting events and event participants in news articles with a very ‘long tail’. If you are interested and want to give it a try, check out:…/

AI truly tested!

More information about this referential quantification task can be found in this handout or at the task website.

Video release explaining NewsReader’s Reading Machine:

NewsReader releases the Storyteller demo






NewsReader released a new way of representing events on timelines to approximate news stories. Try out the demo here!




NewsReader workshop/hackathon announcement on VU Faculty of Humanities website (Dutch)

See the announcement on the VU Faculty of Humanities website (Dutch).


NewsReader at European Data Forum

Come check out the NewsReader stand at European Data Forum today and tomorrow.

At the stand you can see our demos, pick up a brochure, find out more about our upcoming events and grab a bag of our limited edition NewsReader winegums!



Workshop and Hackathon November 2015: Car Wars – Industrial Heroes Going Down Fighting

On 24 and 25 November 2015, we will showcase the NewsReader project and invite you to come explore our technology and its results yourself during our NewsReader Workshop and Hackathon. 

Our dataset encompasses 12 years of news charting the struggle of automotive players to rule the global market, to satisfy the expectations of the shareholders, and their suffering from the financial crisis and new economies: industrial heroes going down!

The Workshop

Tuesday 24 November 2015, 14:00 – 18:00 Amsterdam Public Library 

In this workshop, we will bring together start-ups, companies, researchers and developers to present and discuss the NewsReader project, the technological domains it draws from and future applications for these technologies.

This afternoon will feature invited talks, demos, a panel discussion and a networking reception.

Confirmed Speakers:

  • Prof. dr. Frank van Harmelen, Vrije Universiteit Amsterdam.  Frank van Harmelen is a professor in Knowledge Representation & Reasoning in the AI department (Faculty of Science) at the Vrije Universiteit Amsterdam. After studying mathematics and computer science in Amsterdam, he moved to the Department of AI in Edinburgh, where he was awarded a PhD in 1989 for his research on meta-level reasoning.
  • Bernardo Magnini, FBK. Bernardo Magnini is a senior researcher at FBK, where he is the scientific coordinator of the Cognitive Computing research line. His interests are in the field of Natural Language Processing, particularly lexical semantics, question answering and textual entailment. He launched EVALITA, the evaluation campaign for both NLP and speech tools for Italian, and co-chaired CLIC-it 2014, the first Italian conference on Computational Linguistics. He currently serves as President of the Italian Association for Computational Linguistics.
  • Sybren Kooistra, De Volkskrant and Yournalism. Sybren Kooistra is a data journalist at De Volkskrant and co-founder and editor-in-chief of Yournalism, a platform for investigative journalism. In 2008, he went to the United States as an aide to Obama’s presidential campaign. In 2013 he won a prize at an international press innovation contest for the news website of the future. He studied sociology, political science and social geography at Radboud University Nijmegen.

The Hackathon 

Wednesday 25 November 2015, 10:00 – 18:00 Amsterdam Public Library 

In June 2014 and January 2015 we ran several hackathons in both London and Amsterdam in which NewsReader enabled the attendees to pull out networks of interactions between entrepreneurs, politicians, companies and thoroughly test drive our technology. This November, we’re releasing a new version of our processing pipeline and we’re scaling up to 10 million processed news articles from sources about the automotive industry to obtain a searchable database of the news. At the hackathon, you can play with this dataset and explore the processing pipeline.

The global automotive industry has a value in the order of $1 trillion annually. The industry comprises a massive network of suppliers, manufacturers, advertisers, marketeers and journalists. Each of these players has his/her own story, often with unexpected origins or endings; one day you may be CEO of a big car company, the next you are out and making pizzas. With NewsReader, you can uncover these stories to reconstruct the past.

This event may be of interest to you if:

  • You’re interested in natural language processing and/or semantic web technology;
  • You’re a data journalist on an automotive desk;
  • You’re an analyst sifting daily news looking for information on your company or on competitors;
  • You’re a data analyst looking to understand how your customers operate their supply chain;
  • You’re an analyst trying to find secondary events that could influence an investment decision;
  • You’re interested in visualising big data.

Attendance is free but please register by 22 November 17:00 CET. 

NewsReader at ISWC

With Semantic Web technology being a huge part of NewsReader, it is no wonder that we will showcase some of our technology at the 14th International Semantic Web Conference (ISWC) next week in Bethlehem, Pennsylvania.

Here’s a roundup of the sessions in which NewsReader is involved.

Sunday 11 October

The third NLP&DBpedia workshop (location: RBC 91): This workshop combines the two main themes in NewsReader, namely Natural Language Processing and Semantic Web. NewsReader team member Marieke van Erp is a co-organiser of this workshop, and she will present the position paper “Missing Mr. Brown and buying an Abraham Lincoln – Dark Entities and DBpedia” (Marieke van Erp, Filip Ilievski, Marco Rospocher and Piek Vossen).

Monday 12 October

Filip Ilievski will present the paper “LOTUS: Linked Open Text UnleaShed” (Filip Ilievski, Wouter Beek, Marieke van Erp, Laurens Rietveld and Stefan Schlobach) at the Sixth Consuming Linked Data Workshop. 12:00 – 12:20 in room RBC 91.

In the afternoon, Marieke van Erp will be co-organising the Linked Science workshop, which is on the use of linked data for publishing, sharing and interlinking scientific resources, data and complete experiments. 14:00 – 17:30 in room RBC 241.

Tuesday 13 October

Marco Rospocher will present two demos at the Poster and Demos session (18:30 – 21:00, location: Vision Bar and Bucks).



KnowledgeStore Demonstration Video (2015 Version)

A new demonstration video of the KnowledgeStore “in action”, with voice comments, has been released.

You can access it directly here or from the Demos section.

London Hackathon

Which cars crash most? Which automobile companies had to recall their cars over the last ten years and how does current news relate to news in the financial automobile industry domain from the last ten years?

Hacking at RICS in London

On Friday 30 January 2015, three teams tried to answer these questions during the NewsReader Hack Day in London. At the foot of Big Ben (to be precise, at the Royal Institution of Chartered Surveyors), participants explored NewsReader’s analyses of 1.3 million articles about the automobile industry from the last decade. Despite the complexity of the data, the crash investigation team managed to analyse which cars may be the most dangerous around,* the recall team revealed how we can make our data even more useful and, finally, the NewsReader word cloud team built a neat little visualisation that lets users select input from current news and produces a word cloud of the most related terms in the NewsReader data.

Crash team’s winning car

Overall, the day produced a nice variety of applications for the NewsReader data and gave the NewsReader team new insights into how our data may be used and improved. The outcomes of both hackathons are nicely illustrated by the words of Pim Stouten, Strategy Director for LexisNexis: “I’m positively excited to see two years of research and development coming to life, with NewsReader being used for real cases, and with real data.”

*We will keep the outcome to ourselves to avoid frightening you (and to avoid lawsuits).

Amsterdam Hackathon Recap


On January 21st, the second NewsReader hackathon and the first part of our Y2 user evaluation took place on the 6th floor of the Amsterdam Public Library. Around 30 participants came to the hackathon, from research groups, companies and public institutions, along with some students. The participants formed teams of varying size, resulting in 8 presentations at the end of the day. The NewsReader team was super happy to see so many different ideas come out of this, such as in-depth analyses of the age of CEOs when they get hired or fired from companies, integration with annotation tools to provide enrichments from the NewsReader dataset, and recommender systems. These are ideas we hadn’t thought of ourselves and that we think could lead to interesting applications for the project.


We know that the NewsReader dataset is quite complex, due to its size and the many different layers of information embedded in it. Fortunately, the hackathon participants were up for the challenge, dug in and got some cool visualisations and analyses out. In the course of this process, the NewsReader team got lots of feedback on how to improve the API, and several bugs were reported (it’s still research). We are now working on analysing our query log (100,000 queries were fired during the day, resulting in a 371MB log) and prepping for our second hackathon of this year, which takes place in London this Friday.

Check out some content from hackathon participants:

NewsReader Hackathon – blog post by Jaap Blom (Netherlands Institute for Sound and Vision)

NewsReader Hackathon – blog post by Paul Groth (Elsevier)

Selfdriving cars and sentiment analysis – presentation by Anca Dumitrache and her team members at the hackathon