These treebanks and parsebanks for Finnish were created by the FinnTreeBank project. FinnTreeBank 1 is based on the model sentences in Iso suomen kielioppi (The Large Grammar of Finnish), which were manually annotated with dependency-syntactic descriptions (see the tagset and the annotation manual). It was built as a Grammar Definition Corpus and is intended as a model for further automatic analysis of Finnish. FinnTreeBank 2 is a small extension to FinnTreeBank 1, manually annotated in the same way as the first treebank. FinnTreeBank 3 is a large treebank that was annotated only automatically, using an experimental method. As a result, its annotations are of much lower quality than those of the manually annotated treebanks.
The UD version of FinnTreeBank 1 was derived from FinnTreeBank 1 2014 by a scripted mapping of labels and some restructuring in an attempt to conform approximately to the UD Finnish model.
More information on UD Finnish FTB
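UD treebanks are distributed in the CoNLL-U format (ten tab-separated columns per token, blank lines between sentences), so the UD version can be inspected with a few lines of Python. The following is a minimal sketch, not part of the resource itself: the function names `read_conllu` and `count_deprels` are hypothetical, and the sample sentence was written for this illustration and is not taken from FinnTreeBank.

```python
from collections import Counter

# The ten tab-separated CoNLL-U columns, in order.
CONLLU_FIELDS = ["id", "form", "lemma", "upos", "xpos",
                 "feats", "head", "deprel", "deps", "misc"]

# Illustrative sentence written for this sketch; NOT taken from FinnTreeBank.
SAMPLE = (
    "# text = Koira haukkuu.\n"
    "1\tKoira\tkoira\tNOUN\t_\tCase=Nom|Number=Sing\t2\tnsubj\t_\t_\n"
    "2\thaukkuu\thaukkua\tVERB\t_\t_\t0\troot\t_\tSpaceAfter=No\n"
    "3\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_\n"
    "\n"
)


def read_conllu(text):
    """Yield each sentence in a CoNLL-U string as a list of token dicts."""
    sentence = []
    for line in text.splitlines():
        if not line.strip():
            # A blank line closes the current sentence.
            if sentence:
                yield sentence
                sentence = []
            continue
        if line.startswith("#"):
            # Sentence-level comment lines carry no token data.
            continue
        cols = line.split("\t")
        if len(cols) != len(CONLLU_FIELDS):
            raise ValueError(f"expected 10 columns, got {len(cols)}: {line!r}")
        sentence.append(dict(zip(CONLLU_FIELDS, cols)))
    if sentence:
        # Handle input that lacks a trailing blank line.
        yield sentence


def count_deprels(text):
    """Count dependency relation labels over all ordinary tokens."""
    counts = Counter()
    for sentence in read_conllu(text):
        for token in sentence:
            # Skip multiword-token ranges ("1-2") and empty nodes ("1.1").
            if token["id"].isdigit():
                counts[token["deprel"]] += 1
    return counts
```

To apply this to a downloaded file, read its contents as UTF-8 text and pass the string to `count_deprels`; the resulting counts give a quick overview of which UD relation labels the mapping produced.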
UD versions:
UD Finnish-FTB: The UD version of FinnTreeBank 1 | Metadata and license | Attribution instructions | Download the resource
Search for these versions in META-SHARE
Latest versions/subcorpora:
The Downloadable Version of the Finnish TreeBank 1 | Metadata and license | Attribution instructions | Download the resource
The Helsinki Korp Version of the Finnish TreeBank 1 | Metadata and license | Attribution instructions | Select the corpus in Korp (as part of FTB2)
The Downloadable Version of the Finnish TreeBank 2 | Metadata and license | Attribution instructions | Download the resource
The Helsinki Korp Version of the Finnish TreeBank 2 | Metadata and license | Attribution instructions | Select the corpus in Korp
The Downloadable Version of the Finnish TreeBank 3 | Metadata and license | Attribution instructions | Download the resource
The Helsinki Korp Version of the Finnish TreeBank 3 | Metadata and license | Attribution instructions | Select the corpus in Korp
Search for these versions in META-SHARE
Several versions of these resources are published in the Language Bank of Finland. They are available through the Language Bank Download Service, the Korp concordance tool, or both. Links to the individual versions are in the list above. Details on the content and license of each version are available in its metadata record.
This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2021031604