Archiving Textual Corpora
This workflow assumes that you have already created a textual corpus, together with a data management plan and supporting documentation, and that you are ready to deposit it in a data repository. It explains how to deposit your corpus data, metadata and documentation in a CLARIN-B data centre, in your university’s institutional research data repository, or in a general-purpose repository that is open to everyone, such as Zenodo.
This workflow was designed during the workshop "Creating, Managing and Archiving Textual Corpora in Under-resourced Languages". The workshop was conceived by the DARIAH Working Groups Research Data Management and Multilingual DH, financed by the DARIAH-EU Funding Scheme for Working Group Activities 2023-25, and hosted by the University of Hamburg from 28 to 30 August 2024.
Workflow steps (5)
1 Decide where to deposit your corpus
2 Get requirements concerning file formats
3 Migrate your data to accepted, open file format(s)
4 Check and improve your metadata
5 Deposit data and metadata
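As an illustration of step 4, the following is a minimal sketch of a metadata completeness check that could be run before deposit. The list of required fields and the example record are hypothetical and invented for this sketch; the fields a repository actually requires depend on the data centre you choose in step 1, so replace them with that repository's own list.

```python
# Hypothetical set of required fields (an assumption for this sketch);
# replace it with the list published by your chosen repository.
REQUIRED_FIELDS = ("title", "creators", "description", "language", "license")


def missing_fields(record, required=REQUIRED_FIELDS):
    """Return the required fields that are absent or empty in a metadata record."""
    missing = []
    for field in required:
        value = record.get(field)
        # Treat None, blank strings and empty containers as missing.
        if value is None:
            missing.append(field)
        elif isinstance(value, str) and not value.strip():
            missing.append(field)
        elif isinstance(value, (list, tuple, dict)) and len(value) == 0:
            missing.append(field)
    return missing


# Illustrative record; every value here is invented for the example.
example_record = {
    "title": "Example Corpus of an Under-resourced Language",
    "creators": ["Doe, Jane"],
    "description": "",          # left empty on purpose to show a failing field
    "language": "und",          # placeholder language code
    "license": "CC-BY-4.0",
}

if __name__ == "__main__":
    print(missing_fields(example_record))
```

A check like this only confirms that fields are present; whether the values are accurate and follow the repository's controlled vocabularies still has to be reviewed by hand.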
The SSH Open Marketplace is maintained and will be further developed by three European Research Infrastructures - DARIAH, CLARIN and CESSDA - and their national partners. It was developed as part of the "Social Sciences and Humanities Open Cloud" SSHOC project, European Union's Horizon 2020 project call H2020-INFRAEOSC-04-2018, grant agreement #823782.