NexusLinguarum – European network for Web-centred linguistic data science
Identification
- Project identification: NexusLinguarum – European network for Web-centred linguistic data science [COST Action CA18209]
- Coordinator: Jorge Garcia (Universidad Zaragoza, Spain)
- Duration: Oct.2019 – Oct.2023
- Funding entity: European Science Foundation
- Website: https://nexuslinguarum.eu/
Description
The main aim of this Action is to promote synergies across Europe between linguists, computer scientists, terminologists, and other stakeholders in industry and society, in order to investigate and extend the area of linguistic data science. We understand linguistic data science as a subfield of the emerging “data science”, which focuses on the systematic analysis and study of the structure and properties of data at a large scale, along with methods and techniques to extract new knowledge and insights from it. Linguistic data science is a specific case, which is concerned with providing a formal basis to the analysis, representation, integration and exploitation of language data (syntax, morphology, lexicon, etc.). In fact, the specificities of linguistic data are an aspect largely unexplored so far in a big data context.
In order to support the study of linguistic data science in the most efficient and productive way, the construction of a mature holistic ecosystem of multilingual and semantically interoperable linguistic data is required at Web scale. Such an ecosystem, unavailable today, is needed to foster the systematic cross-lingual discovery, exploration, exploitation, extension, curation and quality control of linguistic data. We argue that linked data (LD) technologies, in combination with natural language processing (NLP) techniques and multilingual language resources (LRs) (bilingual dictionaries, multilingual corpora, terminologies, etc.), have the potential to enable such an ecosystem that will allow for transparent information flow across linguistic data sources in multiple languages, by addressing the semantic interoperability problem.
(Retrieved from the COST Actions website.)
Team of CLUNL
Rute Costa
Ana Salgado
Bruno Almeida
Margarida Ramos
Sara Carvalho
Participating Entities
Full information at: https://www.cost.eu/actions/CA18209/#tabs|Name:parties
Menu < back
- Projects
- Ongoing projects
- MultiPoD – Multilingual and Multicultural Spaces for Political Deliberation
- HEREDITARY – HetERogeneous sEmantic Data Integration for guT-brAin interplay
- TTC-CPLP – Terminologias Técnicas e Científicas para a CPLP
- CHAMUÇA – Portuguese and South Asian Lexicon Archive
- e-Term ANCV – Recurso terminológico jurídico-parlamentar digital Assembleia Nacional de Cabo Verde
- NObarriers2Health: Reducing language and cultural barriers through machine translation literacy for inclusive multilingual health communication
- EPISTRAN – Epistemic Translation: Towards an Ecology of Knowledges
- DiTo – Didática do Texto
- REDGRAM – Digital Resources for Education – Grammatical Pathways
- iRead4Skills – Intelligent Reading Improvement System for Fundamental and Transversal Skills Development
- Active Citizenship Through Dialogue in Virtual teacher communities
- Heritage Languages go to School: The interplay of (extra)linguistic factors in successful language development
- Investigating the impact of implicit and explicit instruction on phonological acquisition in a second language
- LAUA – Language Attrition and Ultimate Attainment
- CORRELATE – Corpora and Lexical and Terminological Resources
- ANACOREX – Anafora y expresiones referenciales en el bilinguismo: triangulando enfoques de corpus y experimentales
- Caring Communication: gene therapy in the context of hemophilia
- CoRaLHis – Comparing Romance Languages through History: building a multilingual parallel diachronic corpus (13th-18th C.)
- MorDigital – Digitisation of Diccionario da Lingua Portugueza by António de Morais Silva
- EXPRIMI
- Language and literacy at school – the contribution of metasyntactic abilities to reading comprehension development
- G&T.Comenta
- COVID-19 Collaborative Glossary
- TERMVEST – The Clothing Terminology: European Portuguese version
- Digital Edition of the “Vocabulário Ortográfico da Língua Portuguesa” (VOLP-1940)
- PIPALE – Preventive Intervention Project for Learning to Read and Write
- POR Nível – Design and validation of a placement test to PFL
- Cultural Heritage Lexicon
- Concluded projects
- ProPerL2 – Production and Perception in L2 speech learning
- Western Sephardic Diaspora Roadmap
- ELEXIS – European Lexicographic Infrastructure
- Humanities Going Digital (HUGOD)
- LL2DS – Linking Linguistics to Data Science
- QuILL – Quality in Language Learning
- Corpus Linguístico & Avatar para a Língua Gestual Portuguesa
- Monitor Corpora. PressCoronaVírus
- Com@Rehab – Communication for interactive rehabilitation in virtual reality
- Read4Succeed: Improving migrant, refugee and from deprived neighbourhood children reading skills through an Animal Assisted Reading program
- Project GiroFLE
- ANACOR: A corpus-based approach to anaphora resolution in second language acquisition: beyond the interfaces
- OrthoDef
- European Portuguese-Standard Arab Dictionary
- MOCOLANG-O – MOdélisation COnceptuelle des troubles (du LANGage et de la communication) en Orthophonie
- Romance clitics in diachrony. An integrated approach
- Portuguese Literature Corpus for Distant Reading
- ALPROF – Automatic Assessment of Language Proficiency for Migrant Integration
- CLARIN CLUNL
- Utopia, Food and the Future
- Development of syntactic structures in Portuguese and French monolingual and bilingual acquisition
- The Case of Grammatical Relations
- BlackBox – a Collaborative Platform to Document Performance Composition: from conceptual structures in the backstage to customizable visualizations in the front-end
- Promotion of scientific literacy
- PerGRam – Percursos para o ensino da gramática nos primeiros anos de escolaridade
- Knowledge Organisation Proposal within the scope of infertility: the role of Terminology
- Subordination in Medieval Portuguese
- Crosslinguistic and Crosspopulation approaches to the Acquisition of Dependencies
- Syntactic and lexical factors in processing complexity
- SIERA – Integrating Sina Institute into the European Research Area
- Syntactic Dependencies from 3 to 10
- Events and subevents in Capeverdean
- TKB – Transmedia Knowledge Base for Contemporary Dance
- Research network projects
- ELEXIS Association
- PhraConRep – A Multilingual Repository of Phraseme Constructions in Central and Eastern European Languages
- Y-JustLang – Justice to youth language needs
- ENEOLI – European Network On Lexical Innovation
- Consortium Huma-Num ARIANE
- GRAFE’Maire
- UniDive – Universality, diversity and idiosyncrasy in language technology
- Metalex – International Metalexicography Network
- @ Cientista Regressa à Escola
- CLIL in Languages Other Than English
- NexusLinguarum – European network for Web-centred linguistic data science
- Distant Reading for European Literary History
- HL2C – Heritage Language Consortium
- KEYSTONE – Semantic Keyword-Based Search on Structures Data Sources
- ARLE – International Association for Research in L1 Education
- ENeL – European Network of e-Lexicography
- GraMaLL – Grasping Meaning Across Languages and Learners
- Language Impairment in a Multilingual Society: Linguistic Patterns and the Road to Assessment
- GIRTraduvino – Grupo de Investigación Reconocido sobre la Lengua de la Vid Y el Vino y su Traducción
- Value for Health CoLAB
- Infrastructures
- Services provision