NoReC

Full Official Name: Norwegian Review Corpus
Submission date: Oct. 3, 2017, 2:47 p.m.

The Norwegian Review Corpus (NoReC) was created for the purpose of training and evaluating models for document-level sentiment analysis. More than 35,000 full-text reviews (approx. 15 million tokens) have been collected from several major Norwegian news sources and cover a range of different domains, including literature, movies, video games, restaurants, music and theater, in addition to product reviews across a range of categories. Each review is labeled with a manually assigned score of 1–6, as provided by the rating of the original author. The reviews are pre-processed using UDPipe and distributed in the CoNLL-U format.

Creator(s)
Distributor(s)
Right Holder(s)