Resource: EVALution

Reference EVALution
Date of Submission March 23, 2020, 3:15 p.m.
Status accepted
ISLRN 490-239-801-102-1
Resource Type Primary Text
Media Type Text
Source
Language English, Mandarin Chinese
Format/MIME Type text/plain
Size 1976 KB
Access Medium Web Download
Description

*Introduction*

EVALution was developed by The Hong Kong Polytechnic University. It is comprised of English and Mandarin Chinese data sets -- EVALution 1.0 and EVALution-Man, respectively -- that contain semantic relations and metadata for training and evaluating distributional semantic models.

*Data*

EVALution 1.0 consists of approximately 7500 English tuples extracted from ConceptNet 5.0 and WordNet 4.0 and filtered through automatic methods and crowd-sourcing. Several semantic relations between word pairs were instantiated, including hypernymy, synonymy, antonymy and meronymy. The corpus also includes additional information that can be used to filter the pairs or to analyze the results, such as relation domain, word frequency, word part-of-speech and word semantic field.

EVALution-MAN consists of Chinese word pairs from two sources: Chinese Wordnet and humans who completed an elicitation task by supplying missing words to sentences. The human-supplied sentence word pairs were then judged by human raters for reliability.

All text data is presented as UTF-8 encoded tab separated plain text.

Version 1.0
Creator Chu-Ren Huang , Enrico Santus , Hongchao Liu
Distributor Linguistic Data Consortium
Rights Holder Portions © 2020 The Hong Kong Polytechnic University, © 2020 Trustees of the University of Pennsylvania