Lataukset
Downloads
home
up
Location:
Logged in as: (null)
Name
Size
Description
acquis-ftb3/
-
The Finnish Sub-corpus of the JRC-Acquis Multilingual Parallel Corpus
AI2D-RST/
-
A multimodal corpus of 1000 primary school science diagrams
aku-egg/
-
Puheen ja EGG:n samanaikaiset tallenteet
AMPH/
-
amph Corpus
avoid/
-
Corpus of Age-related Voice Disguise
ccmh-src/
-
Corpus of Old Church Slavonic Texts, source
CEAL/
-
CEAL corpus
cfinsl/
-
Sign Language Corpora (in Finland)
coca/
-
Corpus of Contemporary American English
coha/
-
Corpus of Historical American English
coronavirus-ecorg/
-
The Coronavirus Corpus
Digilib/
-
digitala/
-
DigiTala (2019–2023)
DSPCON/
-
Aalto University DSP Course Conversation Corpus
eduskunta/
-
Plenary Sessions of the Parliament of Finland
ELFA/
-
ELFA corpus
FBC/
-
Finnish Broadcast Corpus
Fenno-Ugrica/
-
Fenno-Ugrica
fi-parliament-asr/
-
Aalto Finnish Parliament ASR Corpus 2008-2020
finchat/
-
Finnish conversational chat corpus
finestbert/
-
FinEst BERT
finka/
-
Raja-Karjalan korpus
finnish-tagtools/
-
Finnish Tagtools
finntreebank/
-
Finnish TreeBank (FTB)
FinnWordNet/
-
FinnWordNet
finsen/
-
FinnSentiment
FNC1/
-
Kansalliskirjaston lehtikokoelman suomenkieliset n-grammit 1820-2000
FTC-B/
-
Finnish Text Collection
FTC/
-
Finnish Text Collection
fvcc_v1/
-
Finnish Verbal Colorative Constructions
GeM-HTB/
-
A Multimodal Corpus of Tourist Brochures
giellagas-north/
-
Pohjoissaamen näytekorpus
glowbe/
-
Corpus of Global Web-Based English
hallituskausi/
-
Hallituskausi Translation Memories
hc/
-
Helsinki Corpus of English Texts
HCS/
-
Helsinki Corpus of Swahili 2.0
helpuhe/
-
The Longitudinal Corpus of Finnish Spoken in Helsinki
hfst-morphologies/
-
HFST morphologies for various languages
HFST-SweNER/
-
iijoki/
-
Iijoki Collection
italian-letters/
-
Italian Letters from the 16. Century
kipo/
-
Suomen viittomakielten kielipoliittinen ohjelma
KKS/
-
Karjalan kielen sanakirja (XML)
klk/
-
Kansalliskirjaston lehtikokoelma
la-murre/
-
The Finnish Dialect Syntax Archive
LAS2/
-
Advanced Finnish Learners’ Corpus
lehdet90ff/
-
Finnish Magazines and Newspapers from the 1990s and 2000s
lonnrot/
-
Elias Lönnrot's letters
montint-src/
-
Yves Montand in the USSR interviews, source
movie-ecorg/
-
The Movie Corpus
nlfcl/
-
Classics Library of the National Library of Finland
opensubtitles-fi/
-
Finnish OpenSubtitles
opusparcus/
-
Open Subtitles Paraphrase Corpus for Six Languages
ORACC/
-
Open Richly Annotated Cuneiform Corpus
psychlingdesc/
-
Psycholinguistic Descriptives
puhelahjat/
-
Donate Speech Corpus
reittidemo/
-
Reitti A-siipeen
rel-freq-fi-lit/
-
Relative frequencies in native and translated Finnish literary prose
ScotsCorr/
-
Helsinki Corpus of Scottish Correspondence
semfinlex/
-
Finnish parliament and court documents
seuruu/
-
Follow-up Study of Dialects of Finnish, downloadable version
SFNET/
-
SFNET Corpus
SKN/
-
Samples of spoken Finnish
SNC1/
-
Kansalliskirjaston lehtikokoelman ruotsinkieliset n-grammit 1770-1940
snowfrog/
-
SSDC/
-
Skolt Saami Documentation Corpus
STT/
-
Finnish News Agency
Suomi24/
-
The Suomi 24 Corpus
taajuussanasto9996/
-
Frequency Lexicon of the Finnish Newspaper Language
tallvocabl2fi/
-
Measurements of 15 L2 Finnish learners' vocabularies
tboneslim/
-
T-Bone Slim Corpus
termforum-src/
-
Terminology Forum Glossaries (selection), source
textreuse-sv-src/
-
Text reuse clusters in the Swedish-language press 1645-1918
TSK/
-
Sanastokeskus TSK
UHLCS/
-
U Helsinki Language Corpus Server
uspenskij-4bat-par/
-
Parallel Corpus of the book "Four Battles", written by L. Uspenskij
Wanca/
-
Collection of Uralic Languages
wikipedia-fi/
-
Finnish Wikipedia
wordvec/
-
Word embeddings trained with word2vec
xmas-gospel-tts/
-
Christmas Gospel text-to-speech in four Uralic languages
YLE/
-
Archives of the Finnish Broadcasting Company
Ylilauta/
-
Ylilauta Corpus