ISLRN

SemTransCNC

Full Official Name: SemTransCNC

Submission date: June 23, 2020, 4:40 p.m.

*Introduction* SemTransCNC was developed by The Hong Kong Polytechnic University. It is a semantic transparency dataset of Chinese nominal compounds built through a series of crowd-based experiments. Nominal compounds were selected from the Sinica Corpus and a modern Chinese lexicon. Crowd workers answered questionnaires that collected demographic information and included questions about the Chinese language. To assess the overall semantic transparency (OST) of the selected compounds, they answered the question: "How is the sum of the meanings of A and B similar to the meaning of AB?" To assess constituent semantic transparency (CST), they were asked to describe the similarity of the meaning of A alone to its meaning in AB, and of the meaning of B alone to its meaning in AB.

*Data* SemTransCNC contains OST and CST data for 1,176 dimorphemic Chinese nominal compounds, all of which consist of free morphemes and have mid-range frequencies. The text data is presented as a UTF-8 encoded, comma-separated text file.
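Because the data is distributed as a UTF-8 encoded comma-separated file, it can likely be read with any standard CSV parser. The sketch below uses Python's built-in `csv` module. This record does not list the file name or the column headers, so the names `compound` and `ost` and the sample row are hypothetical placeholders, not the dataset's actual schema or values.

```python
import csv
import io


def read_rows(stream):
    """Return the rows of a comma-separated stream as dicts keyed by the header row."""
    return list(csv.DictReader(stream))


def load_file(path):
    """Open a UTF-8 CSV file and return its rows."""
    # encoding="utf-8" matches the encoding stated in the record;
    # newline="" is the setting the csv module documentation recommends.
    with open(path, encoding="utf-8", newline="") as f:
        return read_rows(f)


# Hypothetical sample standing in for the real file; the column names
# and the value are invented for illustration only.
sample = "compound,ost\n白菜,0.5\n"
rows = read_rows(io.StringIO(sample))
```

Every value is returned as a string, so scores would need an explicit conversion (for example `float(row["ost"])`) once the real column names are known.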

Creator(s)

Yao Yao

Angel Chan

Chu-Ren Huang

Shichang Wang

Distributor(s)

Linguistic Data Consortium

Right Holder(s)

Status : Accepted

ISLRN :

835-247-023-332-5

Version

1.0

Source

https://catalog.ldc.upenn.edu/LDC2020T12

Resource Type

Primary Text

Media Type

Text

Language(s)

Mandarin

Access Medium

Web Download