G&T_COMMENTARY_TD

G&T_COMMENTARY_TD is a FAIR-compliant, manually annotated corpus of Portuguese commentary texts, developed by the G&T – Gramática & Texto research group (CLUNL, NOVA FCSH).

The corpus comprises 82 commentary texts published in Portuguese newspapers and magazines between 2005 and 2016, segmented into 373 Discourse Type (TD) units, following the theoretical framework of Sociodiscursive Interactionism (SDI).

This public release provides only the structural segmentation and discourse-type annotation (DI, DT, RI, N, including citation contexts), with all original textual content removed due to copyright restrictions.

The dataset is distributed as a single XML file and is intended for discourse analysis, sequential modelling, graph-based approaches (including directed and multiplex networks), and quantitative–qualitative studies of discourse organisation and genre tendencies.

Identifier: https://zenodo.org/records/18084593