Named entity recognition from transcribed spoken data

This workflow extracts structured information from unstructured text. It takes as input transcribed free-text descriptions of archaeological findings, taken from archaeological context sheets, and identifies named entities relevant to the archaeology domain. It returns structured metadata that can be used for indexing an archaeological record. The principal objective is to enrich archaeological records by converting unstructured descriptive text into structured, machine-readable metadata.

Workflow steps (2)

  1. Text segmentation

  2. Entity extraction
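
The two steps above can be sketched roughly as follows. This is a minimal illustration only, not the tooling the workflow actually uses: the source does not name its segmentation or extraction method, so a regex sentence splitter and a small dictionary lookup stand in for them here. The `GAZETTEER` terms, the entity labels (`MATERIAL`, `ARTEFACT`, `PERIOD`), and all function names are hypothetical.

```python
import re
from dataclasses import dataclass

# Hypothetical gazetteer: labels and terms are illustrative assumptions,
# not the vocabulary used by the actual workflow.
GAZETTEER = {
    "MATERIAL": ["flint", "bronze", "charcoal"],
    "ARTEFACT": ["pottery sherd", "arrowhead", "brooch"],
    "PERIOD": ["iron age", "roman", "medieval"],
}


@dataclass(frozen=True)
class Entity:
    text: str     # surface form as it appears in the transcript
    label: str    # entity type from the gazetteer
    segment: int  # index of the segment the entity was found in
    start: int    # character offset within that segment
    end: int


def segment_text(text):
    """Step 1: split a transcript into sentence-like segments.

    Assumes sentences end in '.', '!' or '?' followed by whitespace.
    """
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def extract_entities(segments):
    """Step 2: case-insensitive whole-word gazetteer lookup per segment."""
    entities = []
    for i, seg in enumerate(segments):
        for label, terms in GAZETTEER.items():
            for term in terms:
                pattern = r"\b" + re.escape(term) + r"\b"
                for m in re.finditer(pattern, seg, flags=re.IGNORECASE):
                    entities.append(
                        Entity(m.group(0), label, i, m.start(), m.end())
                    )
    # Return entities in reading order.
    return sorted(entities, key=lambda e: (e.segment, e.start))


def run_workflow(text):
    """Chain both steps: transcript in, structured entity records out."""
    return extract_entities(segment_text(text))
```

Calling `run_workflow` on a short transcript returns a list of `Entity` records that could be serialised as indexing metadata. A production workflow would more plausibly use a trained named entity recognition model than a fixed term list; the lookup is used here only to keep the sketch self-contained.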

The SSH Open Marketplace is maintained and will be further developed by three European Research Infrastructures (DARIAH, CLARIN and CESSDA) and their national partners. It was developed as part of the "Social Sciences and Humanities Open Cloud" (SSHOC) project, under the European Union's Horizon 2020 project call H2020-INFRAEOSC-04-2018, grant agreement #823782.