Epub text-extraction tool
This tool extracts text from EPUB books. The scripts convert EPUB files into plain-text (.txt) files and compute basic statistics, such as the word count and the most frequent words. The project is aimed at analysing EPUBs in Swedish.
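The conversion and counting steps described above might look roughly like the following Python sketch. It is not the tool's actual code: the function names `epub_to_text` and `word_stats` are hypothetical. The sketch relies only on the fact that an EPUB is a ZIP archive of (X)HTML content documents, and it makes a simplifying assumption by reading those documents in file-name order rather than in the reading order declared in the package's spine.

```python
import re
import zipfile
from collections import Counter
from html.parser import HTMLParser


class _TextCollector(HTMLParser):
    """Collects text nodes, skipping the contents of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth:
            self.parts.append(data)


def epub_to_text(path):
    """Return the plain text of every (X)HTML document in the EPUB archive.

    Assumption: documents are read in file-name order, not spine order.
    """
    chunks = []
    with zipfile.ZipFile(path) as archive:
        for name in sorted(archive.namelist()):
            if name.lower().endswith((".xhtml", ".html", ".htm")):
                parser = _TextCollector()
                parser.feed(archive.read(name).decode("utf-8", errors="replace"))
                parser.close()
                chunks.append(" ".join(parser.parts))
    return "\n".join(chunks)


def word_stats(text, top_n=10):
    """Return (total word count, list of the top_n most frequent words)."""
    # [^\W\d_]+ matches runs of Unicode letters, so Swedish å, ä and ö are kept.
    words = re.findall(r"[^\W\d_]+", text.lower())
    return len(words), Counter(words).most_common(top_n)
```

For a file named, say, `book.epub` (a placeholder), calling `word_stats(epub_to_text("book.epub"))` would give the total number of words and the ten most frequent ones. No stop-word filtering is applied here, so function words will usually dominate the list.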