Resource: ksfpc

Reference KOTUS Swedish-Finnish Parallel Corpus
Date of Submission Sept. 2, 2014, 10:33 a.m.
Status accepted
ISLRN 430-116-345-758-1
Resource Type Primary Text
Media Type Text
Language Finnish, Swedish
Size 1 other
Access Medium accessibleThroughInterface

The corpus contains parallel texts in Finnish and Swedish. The texts have been linked semiautomatically at the sentence level. The translations have usually been made from Finnish into Swedish in Finland. Some texts may have been translated from Swedish into Finnish. For the most part the sentences in the text files run in their original order, but not always, and some sentences in the original texts may be missing from this corpus. The corpus is therefore primarily to be seen as a collection of sentence pairs without context. The paragraph division of the source texts has not been preserved. All types of text elements (headers, bylines, captions) are tagged as sentences. Some sentence elements contain more than one orthographic sentence. Texts have been extracted form the Finnish legislation ( and Texts containing economic reports and press releases have also been gathered from various companies in Finland. The quality of the texts varies.The purpose of the resource use must be outlined in a research plan.

Version 2.0
Distributor Kotimaisten kielten keskusThe Institute for the Languages of Finland , CSC - Tieteen tietotekniikan keskus Oy CSC — IT Center for Science Ltd
Rights Holder Kotimaisten kielten keskusThe Institute for the Languages of Finland