Text Simplification for Legal Domain: Insights and Challenges

Abstract

Legal documents such as contracts contain complex, domain-specific jargon and long, nested sentences, and often include many details that are difficult for laypeople without domain expertise to understand. In this paper, we explore the problem of text simplification (TS) in the legal domain. The main challenge is the lack of complex-simple parallel datasets for the legal domain. We investigate some of the existing datasets, methods, and metrics in the TS literature for simplifying legal texts, and perform a human evaluation to analyze the gaps. We present some of the challenges involved and outline a few open questions that need to be addressed in future research in this direction.

Publication
NLLP@EMNLP, 2022.