DCU researchers analyse World Cup tweets to improve machine translation

Source: Silicon Republic
Story flagged by: Zsofia Koszegi-Nagy

Dublin City University (DCU) researchers, in collaboration with Microsoft Research at the CNGL Centre for Global Intelligent Content in Dublin, have developed a live tweet translation streaming service in time for the 2014 World Cup’s final stages.

The 20-strong research team calls its live FIFA World Cup translation service ‘Brazilator’ and, with it, fans can follow what’s being said by supporters in 24 of the 32 original competing countries.

Brazilator tracks tweets in Irish, German, French, Spanish, Italian, Portuguese, Croatian, Greek, Japanese, Korean, Chinese and Farsi that include hashtags such as #WorldCup and #WC2014. It then publishes each tweet in its original form along with translations in English, French, Portuguese, Spanish and German.
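The selection step described above can be sketched in a few lines of Python. This is only an illustration of the filtering logic, not the Brazilator team's code: the `Tweet` class, the function names and the ISO 639-1 language codes are assumptions made for the example, and skipping the tweet's own language when choosing translation targets is likewise an assumption.

```python
import re
from dataclasses import dataclass

# Languages and hashtags named in the article, written as ISO 639-1 codes
# (the codes and all names below are assumptions for this sketch).
TRACKED_LANGS = {"ga", "de", "fr", "es", "it", "pt", "hr", "el", "ja", "ko", "zh", "fa"}
TRACKED_TAGS = {"worldcup", "wc2014"}
TARGET_LANGS = ["en", "fr", "pt", "es", "de"]

HASHTAG_RE = re.compile(r"#(\w+)")


@dataclass
class Tweet:
    """Hypothetical minimal tweet record: raw text plus a language code."""
    text: str
    lang: str


def is_tracked(tweet: Tweet) -> bool:
    """True if the tweet is in a tracked language and carries a tracked hashtag."""
    tags = {tag.lower() for tag in HASHTAG_RE.findall(tweet.text)}
    return tweet.lang in TRACKED_LANGS and bool(tags & TRACKED_TAGS)


def targets_for(tweet: Tweet) -> list[str]:
    """Target languages for translation, skipping the tweet's own language (assumed)."""
    return [lang for lang in TARGET_LANGS if lang != tweet.lang]
```

A Portuguese tweet tagged #WorldCup would pass the filter and be queued for translation into the four remaining target languages, while an untagged tweet would be dropped.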

The stream also provides sentiment analysis for each team during matches, and visitors can look back at previous games to see whether the Twitterverse reacted positively or negatively to a team’s performance.

Image: Brazilator’s sentiment analysis of Portugal vs USA

While translating and analysing 140-character tweets may seem manageable, the nature of language used on social media presents a significant technical challenge.

“Tweets typically contain noisy, diverse and unstructured language, such as incomplete sentences, misspellings, abbreviations, web links, emoticons and hashtags,” explains Dr Lamia Tounsi, research integration officer with CNGL and co-leader of the Brazilator project team.
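To make the kind of noise Dr Tounsi describes concrete, here is a minimal Python sketch of one common pre-processing step: pulling web links, hashtags and simple emoticons out of a tweet so that only the running text is left. It is not taken from the Brazilator system; the function name `split_tweet`, the regular expressions and the small emoticon pattern are assumptions made for illustration, and misspellings and abbreviations are left untouched.

```python
import re

# Deliberately simple patterns (assumptions for this sketch, not a full tokenizer).
URL_RE = re.compile(r"https?://\S+")
HASHTAG_RE = re.compile(r"#\w+")
EMOTICON_RE = re.compile(r"[:;=][-']?[)(DPp]")  # e.g. :) ;-) =D :P


def split_tweet(text: str) -> dict:
    """Separate links, hashtags and emoticons from the remaining tweet text."""
    links = URL_RE.findall(text)
    rest = URL_RE.sub(" ", text)          # remove links first so their contents are not re-matched
    hashtags = HASHTAG_RE.findall(rest)
    rest = HASHTAG_RE.sub(" ", rest)
    emoticons = EMOTICON_RE.findall(rest)
    rest = EMOTICON_RE.sub(" ", rest)
    return {
        "text": " ".join(rest.split()),   # collapse leftover whitespace
        "links": links,
        "hashtags": hashtags,
        "emoticons": emoticons,
    }
```

Keeping the extracted hashtags and emoticons rather than discarding them is a design choice in this sketch: they would not normally be translated, but they may still be useful signals for sentiment analysis.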
