Exploiting Rationale Data for Explainable NLP Models

Download statistics - Document (COUNTER):

Reimer, Maximilian: Exploiting Rationale Data for Explainable NLP Models. Hannover : Gottfried Wilhelm Leibniz Universität, Master Thesis, xvii, 95 S. DOI: https://doi.org/10.15488/11526


Sum total of downloads: 424




Abstract:
In recent years, deep learning models have become very powerful, even outperforming humans on a variety of tasks. This enables more real-world applications, including in sensitive fields such as medical diagnoses or jurisdiction. Besides achieving sufficiently good performance, the requirement to justify and explain the models' decisions is becoming increasingly important.

This work aims to enable a broader application of a specific model class that is inherently interpretable, namely explain-then-predict models, by reducing the annotation cost of the explanations. We focus on the ExPred model as a representative of explain-then-predict models.

We investigate its dependency on rationale annotations, a special kind of explanation, through training using gradually fewer rationale-labeled instances. Furthermore, we explore different approaches that aim to reduce the number of human-labeled instances required during training, such as active learning and weak supervision.

Our results show that even with only a fraction of instances annotated with rationales from the original dataset, ExPred still achieves good performance (within 95% of the performance when using 100% annotation). Depending on the dataset, only a few thousand annotated rationales are required. Using weak supervision, this can be further reduced, at least in specific settings. On the Movie Reviews dataset, we achieve good performance with only 5% of the original rationale labels. The tested off-the-shelf active learning methods do not provide any benefit over randomly selecting instances to label. However, the extensive behavior analysis enables the future design of active learning methods that are tailored to explain-then-predict models. We start by proposing an active learning method that outperforms the random baseline on the Movie Reviews dataset.
License of this version: CC BY 3.0 DE
Document Type: MasterThesis
Publishing status: publishedVersion
Issue Date: 2021-09-15
Appears in Collections: Fakultät für Elektrotechnik und Informatik


Downloads by country:

Pos.  Country                        Downloads  Share of total
1     Germany                        120        28.30%
2     United States                  78         18.40%
3     Norway                         21         4.95%
4     United Kingdom                 16         3.77%
5     China                          16         3.77%
6     No geo information available   14         3.30%
7     Turkey                         13         3.07%
8     Russian Federation             13         3.07%
9     Netherlands                    12         2.83%
10    France                         9          2.12%
      Other countries                112        26.42%



Note

Download statistics are collected in accordance with the "COUNTER Code of Practice for e-Resources", which applies internationally recognized rules and standards. COUNTER is an international non-profit organization in which library associations, database providers, and publishers work together on standards for collecting, storing, and processing usage data for electronic resources, with the aim of ensuring objectivity and comparability. Only accesses to the corresponding full texts are counted, not visits to the website itself.
