All previous researchers of the month can be found in the archive.
Do you know researchers who use the Language Bank of Finland and who might be good candidates for Researcher of the Month? Would you be one of them? Inform us: https://www.kielipankki.fi/support/contact-us/
Submit the basic details about your own resource to the Language Bank of Finland: http://urn.fi/urn:nbn:fi:lb-2021121421
The first version of the complete dataset includes the speech samples that were donated during 16.6.2020-14.9.2021. The total duration of the recordings in this version is approximately 3200 hours, out of which approximately 1,600 hours have been manually transcribed.
Researchers may already apply for access to Puhelahjat data. Research use in academic organizations is free of charge. Read more about using the data for research
Companies and other non-academic organizations may acquire a paid license for using one of the Puhelahjat datasets. Some of the data packages intended for commercial use are still in preparation. For further details, organizations and companies interested in using the data may already contact us by email at lahjoita-puhetta@helsinki.fi. Read more about commercial use of the data
The Donate Speech campaign is still on and you can still donate your speech in Finnish or in Swedish at https://lahjoitapuhetta.fi/. When the campaign ends, all of the data will be made available via the Language Bank.
If using a resource obtained via Kielipankki that contains personal data (the license includes a ”+PRIV” tag), you are required to submit the title of your project and a public link to the Privacy Notice regarding the purpose for which you are using the resource. Submit the information via this e-form.
Write the Privacy Notice according to the instructions given by your home organization. It is a good idea to store the document in a place where you are able to update the information when needed.
See also the guidelines for processing corpora stored in the Language Bank of Finland that contain personal data.
A new automatic speech recognition service, Tekstiks, is now up and running for test users. The automated system can recognise spoken Estonian and Finnish and produce a transcript of the recording. The Tekstiks service is the result of a collaboration between the Tallinn University of Technology, the Language Bank of Finland and Aalto University. Read more about Tekstiks and try it out!
Kielipankki – Language Bank of Finland has joined the open-source social network Mastodon. Welcome to follow us! @kielipankki@toot.community
FIN-CLARIAH, the national research infrastructure for Social Sciences and Humanities in Finland, received funding from the Academy of Finland for the years 2022-23. FIN-CLARIAH consists of two components, FIN-CLARIN and DARIAH-FI. The FIN-CLARIAH people gathered together in a workshop day held in Jyväskylä on 18th November. On the Language Bank website, you can find the presentation materials and a number of deliverables produced in the FIN-CLARIAH project.
Did you know that CLARIN offers grants for, e.g., researcher and teacher mobility, events and training activities? Check out the funding opportunities and current calls: https://www.clarin.eu/funding
We wish you a relaxing holiday season!
Mietta Lennes
Projet Planning Officer
fin-clarin@helsinki.fi
Subscribe/unsubscribe to this newsletter: https://www.kielipankki.fi/language-bank/newsletter-subscription/
See also the CLARIN Newsflash: https://www.clarin.eu/content/newsflash