Abstract
Discourse segmentation is an important step in the process of annotating coherence relations. Ideally, implementing segmentation rules results in text segments that correspond to the units of thought related to each other. This paper demonstrates that accurate segmentation is in part dependent on the propositional content of text fragments, and that completely separating segmentation and annotation does not always yield text segments that correspond to the text units between which a conceptual relationship holds. In addition, it argues that elements belonging to the propositional content of the discourse should necessarily be included in the segmentation, but that inclusion of other text elements, for instance stance markers, should be optional.
Original language | English |
---|---|
Pages (from-to) | 357-386 |
Journal | Corpus linguistics and Linguistic theory |
Volume | 14 |
Issue number | 2 |
DOIs | |
Publication status | Published - 31 Aug 2018 |
Keywords
- segmentation
- discourse structure
- coherence relations
- corpus annotation
- stance marking