Label distributions help implicit discourse relation classification

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Academic › peer-review

Abstract

Implicit discourse relations can convey more than one relation sense, but much of the research on discourse relations has focused on single relation senses. Recently, DiscoGeM, a novel multi-domain corpus that contains 10 crowd-sourced labels per relational instance, has become available. In this paper, we analyse the co-occurrences of relations in DiscoGeM and show that they are systematic and characteristic of text genre. We then test whether information on multi-label distributions in the data can help implicit relation classifiers. Our results show that incorporating multiple labels in parser training can improve the parser's performance and yield label distributions that are more similar to human label distributions, compared to a parser trained on only the single most frequent label per instance.
Original language: English
Title of host publication: Proceedings of the 3rd Workshop on Computational Approaches to Discourse
Publisher: Association for Computational Linguistics
Pages: 48–53
Publication status: Published - 2022
