Monitor Corpora. PressCoronaVírus

Identification

Project identification: Monitor Corpora. PressCoronaVírus
Coordination: Raquel Silva and Margarida Ramos (Lexicology, Lexicography and Terminology research group)
Start date: August 2021
Funding: Verão com Ciência, FCT (Fundação para a Ciência e a Tecnologia, the Portuguese national foundation for science and technology)

Description

The Monitor Corpora project falls within the scope of the COVID-19 Collaborative Glossary, a resource that gathers the terminology used by official health bodies, by health professionals and scientists, and by the media and social networks.

The Monitor Corpora project aims to monitor a web corpus, PressCoronaVirus, composed specifically of media texts collected in the context of the coronavirus pandemic. The corpus gathers texts written in European Portuguese for the consultation, collection and processing of linguistic data. The texts were extracted from reliable journalistic sources whose output is kept up to date, which is of the utmost importance for current studies on neology in contemporary Portuguese. The corpus also remains open to the inclusion of new texts, so that the material available for analysis can continue to grow.

PressCoronaVirus comprises texts published between November 2020 and July 2021. It was processed with Sketch Engine in order to identify neological lexical constructions associated with non-specialized communication about the emergence of the pandemic. This makes it possible to observe, both in real time (synchrony) and over a given time span (diachrony), the lexical and neological productivity resulting from the social, cultural and economic context created by COVID-19. We intend the results obtained with this methodology to allow us to: (i) keep the corpus updated; (ii) feed the COVID-19 Collaborative Glossary; and (iii) make the corpus available in open access. The corpus files and their documentation (metadata files) will be made available under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) license.

Team

Raquel Silva
Margarida Ramos
Sandro Loupa

CLUNL