An Entity Relatedness Test Dataset

HERRERA, J.; CASANOVA, M. A.; NUNES, B. P.; LEME, L. A. P. P.; LOPES, G. R. An Entity Relatedness Test Dataset. In: 16th International Semantic Web Conference, 2017, Vienna. Proc. of the 16th International Semantic Web Conference (ISWC). Switzerland: Springer, 2017. v. 10588. p. 193-201. doi: 10.1007/978-3-319-68204-4_20


Authors

José Eduardo Talavera Herrera (PUC-Rio)
Marco Antonio Casanova (PUC-Rio)
Bernardo Pereira Nunes (PUC-Rio)
Luiz André P. Paes Leme (UFF)
Giseli Rabello Lopes (UFRJ)

Abstract

A knowledge base stores descriptions of entities and their relationships, often in the form of a very large RDF graph, such as DBpedia or Wikidata. The entity relatedness problem is the problem of computing the relationship paths that best capture the connectivity between a given pair of entities. This paper describes a dataset created to support the evaluation of approaches that address the entity relatedness problem. The dataset covers two familiar domains, music and movies, and uses data available in IMDb and last.fm, which are popular reference datasets in these domains. The paper describes in detail how sets of entity pairs were selected from each of these domains and how a ranked list of relationship paths was obtained for each entity pair.
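
To make the notion of a relationship path concrete, the following is a minimal Python sketch, not the method used in the paper. It enumerates the simple paths between two entities in a tiny graph of triples and orders them by length. The triples, the function names `build_index` and `relationship_paths`, the `max_edges` bound, and the shortest-first ordering are all assumptions made for illustration; the dataset itself relies on DBpedia, IMDb, and last.fm data and on its own ranking procedure.

```python
from collections import defaultdict

# Hypothetical toy triples (subject, predicate, object); not taken from the dataset.
TRIPLES = [
    ("Artist_A", "memberOf", "Band_X"),
    ("Artist_B", "memberOf", "Band_X"),
    ("Artist_A", "genre", "Rock"),
    ("Artist_B", "genre", "Rock"),
    ("Band_X", "genre", "Rock"),
    ("Artist_B", "bornIn", "City_Y"),
]


def build_index(triples):
    """Index triples so that each edge can be followed in both directions."""
    adj = defaultdict(list)
    for s, p, o in triples:
        adj[s].append((p, o))
        adj[o].append(("^" + p, s))  # "^" marks an edge traversed backwards
    return adj


def relationship_paths(adj, source, target, max_edges=3):
    """Return all simple paths from source to target with at most max_edges edges.

    A path is a tuple alternating entities and predicates:
    (entity, predicate, entity, ..., entity).
    Paths are sorted shortest first, an illustrative stand-in for a real ranking.
    """
    paths = []

    def dfs(node, visited, path):
        if node == target:
            paths.append(tuple(path))
            return
        if (len(path) - 1) // 2 >= max_edges:
            return  # edge budget exhausted
        for pred, nxt in adj[node]:
            if nxt not in visited:
                dfs(nxt, visited | {nxt}, path + [pred, nxt])

    dfs(source, {source}, [source])
    return sorted(paths, key=len)


if __name__ == "__main__":
    index = build_index(TRIPLES)
    for path in relationship_paths(index, "Artist_A", "Artist_B"):
        print(" -> ".join(path))
```

On a graph the size of DBpedia, exhaustive enumeration of this kind is impractical, which is one reason the length bound is included in the sketch.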

Keywords:

Entity relatedness, Relationship path, Path ranking, Linked data, Knowledge bases

 
