Abstract
The paper presents a methodology for automatic construction of lexical typological questionnaires for qualitative semantic domains (e.g. sharp, straight, thick, or smooth). Our algorithm is based on data from a monolingual corpus; it constructs a list of collocations for the corresponding lexemes, computes a vector representation for every collocation, clusters the vector space into semantically homogeneous groups and extracts the three central elements from every cluster. We compare the resulting questionnaires against test data from the semantic domains that are already well studied manually. The algorithm demonstrates high quality results and can be used in the practice of lexical typological research.
Original language | English |
---|---|
Title of host publication | The Typology of Physical Qualities |
Editors | Ekaterina Rakhilina, Tatiana Reznikova, Daria Ryzhova |
Publisher | John Benjamins |
Chapter | 11 |
Pages | 309-328 |
ISBN (Electronic) | 9789027257918 |
ISBN (Print) | 9789027210920 |
DOIs | |
Publication status | Published - 2022 |