Abstract
This study explores the use of ChatGPT to simplify Dutch government letters, aiming to improve their comprehensibility while preserving legal accuracy. We used a three-stage mixed-methods evaluation to assess how well a naive baseline, RoBERTa, and ChatGPT simplify six of the most complex letters selected from a corpus of 200. The evaluation first compared the outputs using four metrics (ROUGE, BLEU, BLEURT, and LiNT), then gathered reviews from legal and linguistic experts, and finally ran a randomized controlled trial with 72 participants to test comprehension. Our results indicate that ChatGPT substantially improves comprehension of government letters, as evidenced by an increase of more than 20% in comprehensibility scores and a 19% improvement in participants’ ability to correctly answer questions about follow-up actions based on the simplified texts. Additionally, our study underscores the importance of a thorough evaluation framework and advises against relying solely on automated metrics when assessing text simplification.
Original language | English |
---|---|
Title of host publication | DeTermIt! Evaluating Text Difficulty in a Multilingual Context, DeTermIt! 2024 at LREC-COLING 2024 - Workshop Proceedings |
Editors | Giorgio Maria Di Nunzio, Federica Vezzani, Liana Ermakova, Hosein Azarbonyad, Jaap Kamps |
Publisher | European Language Resources Association (ELRA) |
Pages | 152-178 |
Number of pages | 27 |
ISBN (Electronic) | 9782493814159 |
Publication status | Published - 21 May 2024 |
Event | 1st DeTermIt! Evaluating Text Difficulty in a Multilingual Context, DeTermIt! 2024 - Torino, Italy; Duration: 21 May 2024 → … |
Publication series
Name | DeTermIt! Evaluating Text Difficulty in a Multilingual Context, DeTermIt! 2024 at LREC-COLING 2024 - Workshop Proceedings |
---|---|
Conference
Conference | 1st DeTermIt! Evaluating Text Difficulty in a Multilingual Context, DeTermIt! 2024 |
---|---|
Country/Territory | Italy |
City | Torino |
Period | 21/05/24 → … |
Bibliographical note
Publisher Copyright: © 2024 ELRA Language Resource Association.
Keywords
- ChatGPT 3.5
- human evaluation
- legal documents
- natural language generation
- prompt engineering
- real-life task
- text simplification