HEREDITermCorpus_pt (V0.1)

Description

As part of the HEREDITARY project (HetERogeneous sEmantic Data integratIon for the guT-brAin interplay), dedicated multilingual corpora are being created. HEREDITermCorpus_pt_V0.1 is a curated selection of texts on the microbiota-gut-brain axis (MGBA) and its emerging role in neurodegenerative disorders. The collection is intended as a resource for researchers, clinicians, and students interested in exploring how intestinal microorganisms influence brain health and disease mechanisms. The dataset comprises 126 documents, 100,610 sentences, 1,999,301 words, and 2,665,436 tokens. All documents are written in European Portuguese and were selected to capture a wide range of perspectives on the MGBA.
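
For readers who want a quick sense of scale, the reported totals can be turned into simple averages. The following is a minimal sketch in Python: the counts are copied from the description above, while the names CORPUS_COUNTS and derived_ratios are hypothetical and not part of the dataset or any accompanying tooling. The ratios are only rough indicators, since the description does not state how words and tokens were counted.

```python
# Totals as reported in the corpus description (V0.1).
# CORPUS_COUNTS and derived_ratios are illustrative names, not part of the dataset.
CORPUS_COUNTS = {
    "documents": 126,
    "sentences": 100_610,
    "words": 1_999_301,
    "tokens": 2_665_436,
}


def derived_ratios(counts):
    """Return simple averages derived from the reported totals."""
    return {
        "sentences_per_document": counts["sentences"] / counts["documents"],
        "words_per_sentence": counts["words"] / counts["sentences"],
        "tokens_per_word": counts["tokens"] / counts["words"],
    }


if __name__ == "__main__":
    # Print each derived average rounded to two decimal places.
    for name, value in derived_ratios(CORPUS_COUNTS).items():
        print(f"{name}: {value:.2f}")
```

These averages describe the corpus as a whole; individual documents may differ considerably in length.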

Identifier

https://doi.org/10.5281/zenodo.16969241

Team

Rute Costa
Margarida Ramos
Matilde Canelas
Ana Mouro