Lataukset
Downloads
home
up
Location:
Logged in as: (null)
Name
Size
Description
Ylilauta/
-
Ylilauta Corpus
YLE/
-
Archives of the Finnish Broadcasting Company
xmas-gospel-tts/
-
Christmas Gospel text-to-speech in four Uralic languages
wordvec/
-
Word embeddings trained with word2vec
wikipedia-fi/
-
Finnish Wikipedia
Wanca/
-
Collection of Uralic Languages
uspenskij-4bat-par/
-
Parallel Corpus of the book "Four Battles", written by L. Uspenskij
UHLCS/
-
U Helsinki Language Corpus Server
TSK/
-
Sanastokeskus TSK
textreuse-sv/
-
Text reuse clusters in the Swedish-language press 1645-1918
termforum-src/
-
Terminology Forum Glossaries (selection), source
tboneslim/
-
T-Bone Slim Corpus
tallvocabl2fi/
-
Measurements of 15 L2 Finnish learners' vocabularies
taajuussanasto9996/
-
Frequency Lexicon of the Finnish Newspaper Language
Suomi24/
-
The Suomi 24 Corpus
STT/
-
Finnish News Agency
SSDC/
-
Skolt Saami Documentation Corpus
snowfrog/
-
SNC1/
-
Kansalliskirjaston lehtikokoelman ruotsinkieliset n-grammit 1770-1940
SKN/
-
Samples of spoken Finnish
SFNET/
-
SFNET Corpus
semfinlex/
-
Finnish parliament and court documents
ScotsCorr/
-
Helsinki Corpus of Scottish Correspondence
rel-freq-fi-lit/
-
Relative frequencies in native and translated Finnish literary prose
reittidemo/
-
Reitti A-siipeen
puhelahjat/
-
Donate Speech Corpus
psychlingdesc/
-
Psycholinguistic Descriptives
ORACC/
-
Open Richly Annotated Cuneiform Corpus
opusparcus/
-
Open Subtitles Paraphrase Corpus for Six Languages
opensubtitles-fi/
-
Finnish OpenSubtitles
nlfcl/
-
Classics Library of the National Library of Finland
movie-ecorg/
-
The Movie Corpus
montint-src/
-
Yves Montand in the USSR interviews, source
lonnrot/
-
Elias Lönnrot's letters
lehdet90ff/
-
Finnish Magazines and Newspapers from the 1990s and 2000s
LAS2/
-
Advanced Finnish Learners’ Corpus
la-murre/
-
The Finnish Dialect Syntax Archive
klk/
-
Kansalliskirjaston lehtikokoelma
KKS/
-
Karjalan kielen sanakirja (XML)
kipo/
-
Suomen viittomakielten kielipoliittinen ohjelma
italian-letters/
-
Italian Letters from the 16. Century
iijoki/
-
Iijoki Collection
HFST-SweNER/
-
hfst-morphologies/
-
HFST morphologies for various languages
helpuhe/
-
The Longitudinal Corpus of Finnish Spoken in Helsinki
HCS/
-
Helsinki Corpus of Swahili 2.0
hc/
-
Helsinki Corpus of English Texts
hallituskausi/
-
Hallituskausi Translation Memories
glowbe/
-
Corpus of Global Web-Based English
giellagas-north/
-
Pohjoissaamen näytekorpus
GeM-HTB/
-
A Multimodal Corpus of Tourist Brochures
fvcc_v1/
-
Finnish Verbal Colorative Constructions
FTC/
-
Finnish Text Collection
FTC-B/
-
Finnish Text Collection
FNC1/
-
Kansalliskirjaston lehtikokoelman suomenkieliset n-grammit 1820-2000
finsen/
-
FinnSentiment
FinnWordNet/
-
FinnWordNet
finntreebank/
-
Finnish TreeBank (FTB)
finnish-tagtools/
-
Finnish Tagtools
finka/
-
Raja-Karjalan korpus
finestbert/
-
FinEst BERT
finchat/
-
Finnish conversational chat corpus
fi-parliament-asr/
-
Aalto Finnish Parliament ASR Corpus 2008-2020
Fenno-Ugrica/
-
Fenno-Ugrica
FBC/
-
Finnish Broadcast Corpus
ELFA/
-
ELFA corpus
eduskunta/
-
Plenary Sessions of the Parliament of Finland
DSPCON/
-
Aalto University DSP Course Conversation Corpus
digitala/
-
DigiTala (2019–2023)
Digilib/
-
coronavirus-ecorg/
-
The Coronavirus Corpus
coha/
-
Corpus of Historical American English
coca/
-
Corpus of Contemporary American English
cfinsl/
-
Sign Language Corpora (in Finland)
CEAL/
-
CEAL corpus
ccmh-src/
-
Corpus of Old Church Slavonic Texts, source
avoid/
-
Corpus of Age-related Voice Disguise
AMPH/
-
amph Corpus
aku-egg/
-
Puheen ja EGG:n samanaikaiset tallenteet
AI2D-RST/
-
A multimodal corpus of 1000 primary school science diagrams
acquis-ftb3/
-
The Finnish Sub-corpus of the JRC-Acquis Multilingual Parallel Corpus