Skip to main navigation Skip to search Skip to main content

Toward a corpus-based multilingual terminology database for Intercultural Communication

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

Abstract

This contribution focuses on the methodological aspects of the ICoMuTe project aiming to
design a corpus-based multilingual terminology database for Intercultural Communication
(ICC). The project seeks to explore how ICC terms relate to each other within six European
languages (Dutch, English, German, French, Italian, Spanish), how these terms are connected
to their scientific and cultural contexts, and how they can be translated across different
languages and cultures while preserving meaning.
The selected approach is corpus-based, using comparable corpora of ICC handbooks and a
parallel corpus of texts produced by the European Parliament dealing with key questions
related to ICC. Using text recognition and data mining tools (e.g., Sketch Engine), the most
frequent ICC terms per language are extracted and analysed in context. To account for the
culturally specific aspects of terms while achieving a high degree of cultural neutrality, a
semantic model based on tags has been developed for comparing and linking terms across
languages in a neutral manner, but natural language corpus-based definitions are also provided
that reflect the cultural load of each term.
The main findings suggest that semantic tags are relevant to balance the cultural specificity and
neutrality of ICC terms, and that English acts as a reference linguistic and cultural framework
for the emergence and development of terms in other languages.
Original languageEnglish
Title of host publicationElectronic lexicography in the 21st century (eLex 2025): Intelligent lexicography. Proceedings of the eLex 2025 conference.
EditorsI. Kosem, M. Jakubíček, M. Medveď, K. Zgaga, Š. Arhar Holdt, T. Munda, A. Salgado
PublisherLexical Computing CZ s.r.o.
Pages435-452
Publication statusPublished - 2025

Keywords

  • intercultural communication
  • multilingual terminology
  • corpus-basedlexicography
  • lexical functions
  • semantic primes

Fingerprint

Dive into the research topics of 'Toward a corpus-based multilingual terminology database for Intercultural Communication'. Together they form a unique fingerprint.

Cite this