Research highlight: ENTITY DISCOVERY AND ANNOTATION IN TABLES
7 January 2013

International Conference on Extending Database Technology (EDBT/ICDT 2013 Joint Conference), March 18-22, 2013 - Genoa, Italy
The Web is rich in tables (e.g., HTML tables, spreadsheets, Google Fusion Tables) that host a considerable wealth of high-quality relational data. Not surprisingly, they have increasingly drawn the attention of researchers, especially in the information retrieval and extraction community: unlike unstructured text, tables lend themselves to automatic data extraction because of their regular structure and properties. Data extraction is usually complemented by annotation of the table, which determines its semantics by identifying a type for each column, the relations between columns, if any, and the entities that occur in each cell. In this paper, we focus on the problem of discovering and annotating entities in tables. More specifically, entity annotation refers to the task of assigning a label (e.g., "restaurant", "museum") to a phrase (e.g., "T.G.I. Friday's", "Metropolitan Museum of Art") that denotes an entity.
Compared to existing approaches, we tackle this problem in a pragmatic way, motivated by specific application needs; in particular, we focus on Google Fusion Tables, a rapidly growing collection of tables with rich, high-quality data. The main novelty of our approach is that it does not rely on a pre-compiled reference catalogue of annotated entities, typically extracted from ontologies such as Yago and DBpedia, which would limit the annotations to the entities that belong to the catalogue. Instead, we train an algorithm to look for information on previously unseen entities on the Web so as to annotate them with the correct type.
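
To make the idea of typing a previously unseen entity from Web text more concrete, here is a toy sketch. It is not the method of the paper: the training snippets, the function names (`tokenize`, `train`, `annotate`) and the word-overlap scoring are all assumptions chosen for illustration, and the "Web search" step is replaced by hard-coded text snippets.

```python
import re
from collections import Counter

# Hypothetical training data: type label -> text snippets standing in for
# Web text retrieved about entities whose type is already known.
TRAINING_SNIPPETS = {
    "restaurant": [
        "casual dining restaurant chain serving burgers and cocktails",
        "menu reservations dinner restaurant bar",
    ],
    "museum": [
        "art museum with exhibitions and permanent collections",
        "museum galleries exhibitions visitors collection",
    ],
}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def train(snippets_by_type):
    """Build one word-frequency profile per type label."""
    return {
        label: Counter(tok for s in snippets for tok in tokenize(s))
        for label, snippets in snippets_by_type.items()
    }


def annotate(cell_snippets, profiles):
    """Return the type whose profile overlaps most with the snippets
    found for a table cell, or None when nothing overlaps."""
    words = Counter(tok for s in cell_snippets for tok in tokenize(s))
    scores = {
        label: sum(min(count, profile[w]) for w, count in words.items())
        for label, profile in profiles.items()
    }
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


profiles = train(TRAINING_SNIPPETS)
# Snippets that a Web search for the cell "Metropolitan Museum of Art"
# might return (hard-coded here; no network access is performed).
label = annotate(
    ["The Met is an art museum in New York with vast collections"],
    profiles,
)
```

The point of the sketch is only that the entity itself never needs to appear in a reference catalogue: the label is inferred from text describing it. A realistic system would need a real retrieval step and a properly trained classifier.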



Keywords
  ° Databases
  ° Information integration

Group
  ° Artificial Intelligence and Inference Systems
