Linguistic and Sociolinguistic Annotation of 17th Century Dutch Letters

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

Abstract

Developments in the Dutch language during the 17th century, part of the Early Modern period, form an active research topic in historical linguistics and literature. To enable automatic quantitative analysis, a corpus of letters by the 17th century Dutch author and politician P.C. Hooft is manually annotated with parts-of-speech, document segmentation and sociolinguistic metadata. The corpus is developed as part of the Nederlab online research portal, which is available through the CLARIN ERIC European research infrastructure. This paper discusses the design and evaluation of the annotation effort, as well as adding new annotations to an existing annotated corpus.
Original languageEnglish
Title of host publicationProceedings of the Eleventh International Conference on Language Resources and Evaluation
EditorsNicoletta Calzolari
Place of PublicationMiyazaki,Japan
PublisherEuropean Language Resources Association (ELRA)
Pages1146-1152
ISBN (Electronic)979-10-95546-00-9
Publication statusPublished - 7 May 2018
EventLanguage Resources and Evaluation Conference (LREC 2018): LREC -
Duration: 7 May 2018 → …

Conference

ConferenceLanguage Resources and Evaluation Conference (LREC 2018)
Period7/05/18 → …

Keywords

  • Early Modern Dutch
  • POS tagging
  • sociolinguistic annotation
  • data integration

Fingerprint

Dive into the research topics of 'Linguistic and Sociolinguistic Annotation of 17th Century Dutch Letters'. Together they form a unique fingerprint.

Cite this