Workflow

Archiving Textual Corpora

This workflow assumes that you have already created a textual corpus, together with a data management plan and supporting documentation, and that you are ready to deposit it in a data repository. It explains how to deposit your corpus data, metadata and documentation in one of three places: a CLARIN-B data centre, your university’s institutional research data repository, or a free, general-purpose repository such as Zenodo.
This workflow was designed during the workshop "Creating, Managing and Archiving Textual Corpora in Under-resourced Languages". The workshop was conceived by the DARIAH Working Groups Research Data Management and Multilingual DH, financed by the DARIAH-EU Funding Scheme for Working Group Activities 2023-25, and hosted by the University of Hamburg from 28 to 30 August 2024.

Workflow steps (5)

  1. Decide where to deposit your corpus

  2. Get requirements concerning file formats

  3. Migrate your data to accepted, open-source file format(s)

  4. Check and improve your metadata

  5. Deposit data and metadata
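
Before the final deposit step, the format and metadata checks in steps 2 to 4 can be partly automated. The following is a minimal sketch in Python, not a tool prescribed by this workflow. The accepted extensions, the required metadata fields and the function names are all hypothetical placeholders; replace them with the requirements published by the repository you chose in step 1.

```python
from pathlib import Path

# Hypothetical values for illustration only: take the real lists from the
# documentation of the repository you chose in step 1.
ACCEPTED_EXTENSIONS = {".txt", ".xml", ".csv", ".json"}
REQUIRED_METADATA = ("title", "creators", "description", "license", "language")


def find_unaccepted_files(filenames, accepted=ACCEPTED_EXTENSIONS):
    """Return the file names whose extension is not in the accepted set."""
    return [name for name in filenames
            if Path(name).suffix.lower() not in accepted]


def find_missing_metadata(metadata, required=REQUIRED_METADATA):
    """Return the required fields that are absent or empty in the record."""
    return [field for field in required if not metadata.get(field)]


def predeposit_report(filenames, metadata):
    """Combine both checks into one report to review before depositing."""
    return {
        "unaccepted_files": find_unaccepted_files(filenames),
        "missing_metadata": find_missing_metadata(metadata),
    }


if __name__ == "__main__":
    # Made-up example input: three corpus files and a partial metadata record.
    files = ["corpus.xml", "notes.docx", "README.TXT"]
    record = {"title": "Example corpus", "creators": ["A. Author"],
              "description": "", "license": "CC-BY-4.0"}
    print(predeposit_report(files, record))
```

An empty list under both keys suggests the corpus is ready for step 5. Any file listed under `unaccepted_files` still needs to be migrated (step 3), and any field listed under `missing_metadata` still needs to be filled in (step 4).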


The SSH Open Marketplace is maintained and will be further developed by three European Research Infrastructures (DARIAH, CLARIN and CESSDA) and their national partners. It was developed as part of the "Social Sciences and Humanities Open Cloud" (SSHOC) project, funded under the European Union's Horizon 2020 call H2020-INFRAEOSC-04-2018, grant agreement #823782.
