Jordi Atserias Batalla
  • Posts
  • Demos
    • Euro Love Map
    • Yahoo! Correlator
    • Yahoo! Quest
    • VERTa
  • NLP resources
    • Catalan and Spanish Wordnets
    • Semantically Annotated English Wikipedia
Hero Image
VERTa

VERTa is a linguistically motivated Machine Translation metric. It allows you to compare (evaluate) a translation (translated text) with (against) one or more reference translation. and evaluate adequacy, fluency and ranking of sentences. In Machine Translation (MT) evaluation plays a key role both in their development and improvement of the systems. BLEU is one of the most well-known and widely used but BLEU has its weaknesses regarding translation quality and its tendency to favour statistically-based MT systems.

June 8, 2020 Read
Hero Image
Yahoo! Semantically Annotated Snapshot of the English Wikipedia

Yahoo! Semantically Annotated Snapshot of the English Wikipedia, version 1.0 This SW1 dataset contains a snapshot of the English Wikipedia dated from 2006-11-04 processed with a number of publicly-available NLP tools. In order to build SW1, we started from the XML-ized Wikipedia dump distributed by the University of Amsterdam. This snapshot of the English Wikipedia contains 1,490,688 entries (excluding redirects). First, the text is extracted from the XML entry and split into sentences using simple heuristics.

June 8, 2020 Read
Hero Image
Euro Love Map

EuroLoveMap (2013 opeNER hackathon): Based on topics that relate to a specific country (eg. Berlusconi, Amsterdam) we want to see if the sentiment per country (language) differs. This demo was developed as part of the OpeNER’s Hackathon that took place 1st and 2nd of July. Using OpeNER APIs that the different project partners made available for the hackathon the differnt teams developed small but original applications along the day. More than 50 people spent the day programming, interchanging ideas and having fun.

July 23, 2013 Read
Hero Image
Yahoo! Quest

Yahoo! Quest (2013) Yahoo! Answers was one of the most popular Yahoo! properties were people ask and answers any type of questions. Although most of the time it was difficult to find related/similar questions. Yahoo! Quest was a smarter way to search and navigate through yahoo! answers. See this blog Yahoo answers text processing Processing the text from Yahoo! Answers to obtain dependency tree. Using syntactic information (dependencies) The syntactic dependencies were used to extract meaningful phrases

June 8, 2013 Read
Hero Image
Yahoo! Correlator

Yahoo! Correlator (2008) was the first yahoo! sandbox demo developed in Yahoo! Labs Barcelona. It allowed smart searches over the English Wikipedia exploding shallow NLP tools to identify Named entities (places, dates, people). At that time this demo was pushing what search engine were able to do, see this blog. This demo is no longer available since Yahoo! Sandbox demos were closed around 2014. Related papers: El paper de la lingüística en la cerca d’informació (In Catalan).

March 15, 2008 Read
Hero Image
Catalan & Spanish Wordnets

I was part of the team developing the Spanish Wordnet inside the EuroWordNet European funded project during 1996-1999.The original EuroWordNet project dealt with Dutch, Italian, Spanish, German, French, Czech, and Estonian. EuroWordNet is a system of semantic networks for European languages, based on WordNet. Each language develops its own wordnet but they are interconnected with interlingual links stored in the Interlingual Index (ILI). Spanish and Catalan Wordnets follow the EuroWordNet framework and are structured in the same way as the American wordnet for English (Princeton WordNet) trhough synsets (sets of synonymous words) with basic semantic relations between them.

June 8, 1999 Read
Navigation
  • About
  • Experiences
  • Education
  • Projects
  • Publications
  • Talks & Courses
Contact me:
  • jatserias
  • Jordi Atserias Batalla

Stay up to date with email notification

By entering your email address, you agree to receive the newsletter of this website.


Toha Theme Logo Toha
© Copyright 2022 Jordi Atserias Batalla. All Rights Reserved.
Powered by Hugo Logo