Filtz, Erwin, Kirrane, Sabrina, Polleres, Axel, Wohlgenannt, Gerhard. 2019. Exploiting EuroVoc’s Hierarchical Structure for Classifying Legal Documents. In On the Move to Meaningful Internet Systems: OTM 2019 Conferences, Hrsg. Hervé Panetto, Christophe Debruyne, Martin Hepp, Dave Lewis, Claudio Agostino Ardagna, Robert Meersman, 164-181. Greece: Lecture Notes in Computer Science.
BibTeX
Abstract
Multi-label document classification is a challenging problem because of the potentially huge number of classes. Furthermore, real-world datasets often exhibit a strongly varying number of labels per document, and a power-law distribution of those class labels. Multi-label classification of legal documents is additionally complicated by long document texts and domain-specific use of language. In this paper we use different approaches to compare the performance of text classification algorithms on existing datasets and corpora of legal documents, and contrast the results of our experiments with results on general-purpose multi-label text classification datasets. Moreover, for the EUR-Lex legal datasets, we show that exploiting the hierarchy of the EuroVoc thesaurus helps to improve classification performance by reducing the number of potential classes while retaining the informative value of the classification itself.
Tags
Press 'enter' for creating the tagPublication's profile
Status of publication | Published |
---|---|
Affiliation | WU |
Type of publication | Contribution to conference proceedings |
Language | English |
Title | Exploiting EuroVoc’s Hierarchical Structure for Classifying Legal Documents |
Title of whole publication | On the Move to Meaningful Internet Systems: OTM 2019 Conferences |
Editor | Hervé Panetto, Christophe Debruyne, Martin Hepp, Dave Lewis, Claudio Agostino Ardagna, Robert Meersman |
Page from | 164 |
Page to | 181 |
Location | Greece |
Publisher | Lecture Notes in Computer Science |
Year | 2019 |
ISBN | 978-3-030-33246-4 |
URL | https://www.springer.com/gp/book/9783030332457 |
Open Access | N |
Associations
- People
- Filtz, Erwin (Former researcher)
- Kirrane, Sabrina (Details)
- Polleres, Axel (Details)
- External
- Wohlgenannt, Gerhard (ITMO University, Russian Federation)
- Organization
- Institute for Data, Process and Knowledge Management (AE Polleres) (Details)