Penalty Estimation And Violation Detection

Swati Kanwal, Pulkit Parikh, Kamal Karlapalem

We propose to use a two step approach that leverages machine learning based methods to first, semantically segment legal text into eleven legally relevant categories by training a multi-class classifier. And then, train models to perform specialized tasks like penalty estimation and violation detection based on semantically relevant information required for each of the specialized tasks.

Step1: Semantic Segmentation
Eleven labels are defined for sentence level semantic segmentation of legal text, namely, procedural fact, material fact, related fact, statutory fact, related fact, issues framed, allegation, defendant claim, violation, penalty, subjective observation and others. Sentence level annotations of adjudication orders from the SEBI website serve as training data for the sentence classifier. This labelling is used to improve the efficiency of annotation and dataset creation to facilitate tasks like penalty estimation and violation detection.

Label Definitions:

Step 2: Specialized tasks
There are two main use cases that the final model aims to handle, the first is the Automated Legal Advice use case (Fig. 1) and the second is the Automated Legal Judgment use case (Fig. 2). Essentially, the only difference between these two use cases is the input that it considers for making an evaluation. In the first use case, only facts of the case are used to estimate the violation and the penalty whereas in the second use case facts along with allegations and defendant's claims are used to make the judgement. Automated Legal Advice use case takes into account the point of view of a lawyer or a legal advisor whereas the Automated Legal Judgment use case makes a decision from the point of view of a judge or an adjudicating officer.

Data: HERE

DEMO: HERE

Slides: Slides 1 Slides 2