Skip to main content
Dataset

CoFiF-Corpus for Finance

CoFiF is the first corpus comprising company reports in the French language. It contains over 188 million tokens in 2655 reports, covering four types of documents: Reference documents (documents de référence) published annually, usually in the months following the end of the calendar year, and contain information regarding the financial situation and perspectives of a company; Annual report (résultats annuels) which summarises a company’s business and activities throughout the previous year. Semestrial (résultats semestriels): similar to annual reports in content but published every 6 months; Trimestrial reports (résultats trimestriels): similar to annual reports but published every 3 months; These documents are collected from the 60 largest French companies listed in France’s main stock indices CAC40 and CAC Next 20. The corpus spans over 20 years, ranging from 1995 to 2018.

European Union flag

The SSH Open Marketplace is maintained and will be further developed by three European Research Infrastructures - DARIAH, CLARIN and CESSDA - and their national partners. It was developed as part of the "Social Sciences and Humanities Open Cloud" SSHOC project, European Union's Horizon 2020 project call H2020-INFRAEOSC-04-2018, grant agreement #823782.

CESSDACLARINDARIAH-EU