Discontinuous Constituency and BERT: A Case Study of Dutch

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

Abstract

In this paper, we set out to quantify the syntactic capacity of BERT in the evaluation regime of non-context free patterns, as occurring in Dutch. We devise a test suite based on a mildly context-sensitive formalism, from which we derive grammars that capture the linguistic phenomena of control verb nesting and verb raising. The grammars, paired with a small lexicon, provide us with a large collection of naturalistic utterances, annotated with verb-subject pairings, that serve as the evaluation test bed for an attention-based span selection probe. Our results, backed by extensive analysis, suggest that the models investigated fail in the implicit acquisition of the dependencies examined.
Original languageEnglish
Title of host publicationFindings of the Association for Computational Linguistics: ACL 2022
PublisherAssociation for Computational Linguistics
Pages3776–3785
DOIs
Publication statusPublished - May 2022

Fingerprint

Dive into the research topics of 'Discontinuous Constituency and BERT: A Case Study of Dutch'. Together they form a unique fingerprint.

Cite this