Ardi: a new dataset for automatic advocate recommendation in the Indian Legal System

Upal Bhattacharya, Aniket Deroy, Ayan Bandyopadhyay, Gourish Majumdar*, Shouvik Kumar Guha, Koustav Rudra, Saptarshi Ghosh, Kripabandhu Ghosh

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

We introduce a realistic expert recommendation problem called advocate recommendation. To facilitate investigation of the problem, we develop a rich dataset of 25k documents called the Automatic Advocate Recommendation Dataset in the Indian Legal System (ARDI), which also contains additional attributes. Extra information about areas is generated through an expert annotation process that we incorporate into our experimentation. Treating the problem as a multi-label classification task and carrying out extensive experimentation with various strategies, including using area-based representations, summarization, ensembling methods and multi-task learning, we find the advocate recommendation task quite challenging. Our results suggest that using longer contexts and combining information from different models are beneficial in reaching a viable solution.

Original languageEnglish
Number of pages29
JournalArtificial Intelligence and Law
DOIs
Publication statusE-pub ahead of print - 9 Oct 2025

Bibliographical note

Publisher Copyright:
© The Author(s) 2025.

Keywords

  • Advocate prediction
  • Dataset

Fingerprint

Dive into the research topics of 'Ardi: a new dataset for automatic advocate recommendation in the Indian Legal System'. Together they form a unique fingerprint.

Cite this