Lataukset
Downloads
home
up
Location:
Logged in as: (null)
Name
Size
Description
cfinsl/
-
Sign Language Corpora (in Finland)
TSK/
-
Sanastokeskus TSK
eduskunta/
-
Plenary Sessions of the Parliament of Finland
klk/
-
Kansalliskirjaston lehtikokoelma
hallituskausi/
-
Hallituskausi Translation Memories
hfst-morphologies/
-
HFST morphologies for various languages
semfinlex/
-
Finnish parliament and court documents
finchat/
-
Finnish conversational chat corpus
finnish-tagtools/
-
Finnish Tagtools
STT/
-
Finnish News Agency
lehdet90ff/
-
Finnish Magazines and Newspapers from the 1990s and 2000s
finsen/
-
FinnSentiment
lonnrot/
-
Elias Lönnrot's letters
Wanca/
-
Collection of Uralic Languages
YLE/
-
Archives of the Finnish Broadcasting Company
iijoki/
-
Iijoki Collection
KKS/
-
Karjalan kielen sanakirja (XML)
ccmh-src/
-
Corpus of Old Church Slavonic Texts, source
digitala/
-
DigiTala (2019–2023)
tboneslim/
-
T-Bone Slim Corpus
xmas-gospel-tts/
-
Christmas Gospel text-to-speech in four Uralic languages
textreuse-sv/
-
Text reuse clusters in the Swedish-language press 1645-1918
uspenskij-4bat-par/
-
Parallel Corpus of the book "Four Battles", written by L. Uspenskij
movie-ecorg/
-
The Movie Corpus
coronavirus-ecorg/
-
The Coronavirus Corpus
puhelahjat/
-
Donate Speech Corpus
tallvocabl2fi/
-
Measurements of 15 L2 Finnish learners' vocabularies
wordvec/
-
Word embeddings trained with word2vec
fi-parliament-asr/
-
Aalto Finnish Parliament ASR Corpus 2008-2020
termforum-src/
-
Terminology Forum Glossaries (selection), source
hc/
-
Helsinki Corpus of English Texts
finntreebank/
-
Finnish TreeBank (FTB)
aku-egg/
-
Puheen ja EGG:n samanaikaiset tallenteet
kipo/
-
Suomen viittomakielten kielipoliittinen ohjelma
montint-src/
-
Yves Montand in the USSR interviews, source
finestbert/
-
FinEst BERT
AI2D-RST/
-
A multimodal corpus of 1000 primary school science diagrams
ORACC/
-
Open Richly Annotated Cuneiform Corpus
wikipedia-fi/
-
Finnish Wikipedia
opensubtitles-fi/
-
Finnish OpenSubtitles
rel-freq-fi-lit/
-
Relative frequencies in native and translated Finnish literary prose
psychlingdesc/
-
Psycholinguistic Descriptives
avoid/
-
Corpus of Age-related Voice Disguise
nlfcl/
-
Classics Library of the National Library of Finland
opusparcus/
-
Open Subtitles Paraphrase Corpus for Six Languages
finka/
-
Raja-Karjalan korpus
fvcc_v1/
-
Finnish Verbal Colorative Constructions
glowbe/
-
Corpus of Global Web-Based English
coha/
-
Corpus of Historical American English
coca/
-
Corpus of Contemporary American English
Suomi24/
-
The Suomi 24 Corpus
acquis-ftb3/
-
The Finnish Sub-corpus of the JRC-Acquis Multilingual Parallel Corpus
CEAL/
-
CEAL corpus
Ylilauta/
-
Ylilauta Corpus
SKN/
-
Samples of spoken Finnish
italian-letters/
-
Italian Letters from the 16. Century
DSPCON/
-
Aalto University DSP Course Conversation Corpus
AMPH/
-
amph Corpus
reittidemo/
-
Reitti A-siipeen
FNC1/
-
Kansalliskirjaston lehtikokoelman suomenkieliset n-grammit 1820-2000
SNC1/
-
Kansalliskirjaston lehtikokoelman ruotsinkieliset n-grammit 1770-1940
SSDC/
-
Skolt Saami Documentation Corpus
la-murre/
-
The Finnish Dialect Syntax Archive
FinnWordNet/
-
FinnWordNet
HCS/
-
Helsinki Corpus of Swahili 2.0
Fenno-Ugrica/
-
Fenno-Ugrica
helpuhe/
-
The Longitudinal Corpus of Finnish Spoken in Helsinki
GeM-HTB/
-
A Multimodal Corpus of Tourist Brochures
ScotsCorr/
-
Helsinki Corpus of Scottish Correspondence
giellagas-north/
-
Pohjoissaamen näytekorpus
LAS2/
-
Advanced Finnish Learners’ Corpus
taajuussanasto9996/
-
Frequency Lexicon of the Finnish Newspaper Language
UHLCS/
-
U Helsinki Language Corpus Server
FTC/
-
Finnish Text Collection
FTC-B/
-
Finnish Text Collection
FBC/
-
Finnish Broadcast Corpus
ELFA/
-
ELFA corpus
SFNET/
-
SFNET Corpus
snowfrog/
-
HFST-SweNER/
-
Digilib/
-