BerryBERT - text classification for Finnish OCR texts
This is a BERT text classification for Finnish OCR texts, originally used for research on the commodification of wild lingon berries. This work is a part of the Centre for Digital Humanities' Pilot Projects 2021-2022, with a project titled "Text Mining Commodification: The Geography Of the Nordic Lingonberry Rush, 1860-1910". Source code is available at Github. This work is a part of the Centre for Digital Humanities' Pilot Projects 2021-2022, with project titled "Text Mining Commodification: The Geography Of the Nordic Lingonberry Rush, 1860-1910". More information about the pilot projects can be found here.
The SSH Open Marketplace is maintained and will be further developed by three European Research Infrastructures - DARIAH, CLARIN and CESSDA - and their national partners. It was developed as part of the "Social Sciences and Humanities Open Cloud" SSHOC project, European Union's Horizon 2020 project call H2020-INFRAEOSC-04-2018, grant agreement #823782.