Exploiting Rationale Data for Explainable NLP Models

Reimer, Maximilian

Exploiting Rationale Data for Explainable NLP Models

Services

Deutsch English

About the Repository Search and Browse Publish

Home
→
Fakultäten
→
Fakultät für Elektrotechnik und Informatik
→
View Item

Download statistics - Document (COUNTER):

Reimer, Maximilian: Exploiting Rationale Data for Explainable NLP Models. Hannover : Gottfried Wilhelm Leibniz Universität, Master Thesis, xvii, 95 S. DOI: https://doi.org/10.15488/11526

Selected time period:

Sum total of downloads: 424

distribution of downloads over the selected time period
downloads by country

back to single item view (close usage statistics)

FileExploiting_Ration ...

Size11.65 MB

FormatAdobe PDF

View

Abstract:
In recent years, deep learning models have become very powerful – even outperforminghumans on a variety of tasks. This enables more real-world applications, including alsosensitive fields such as medical diagnoses or jurisdiction. Besides achieving sufficientlygood performance, the requirement to justify and explain the models’ decisions is becomingincreasingly important.This work aims to enable a broader application of a specific model class that is inherentlyinterpretable, namely explain-then-predict models, by reducing the annotation cost of theexplanations. We focus on the ExPred model as a representative of explain-then-predictmodels.We investigate its dependency on rationale annotations, a special kind of explanation,through training using gradually fewer rationale-labeled instances. Furthermore, we ex-plore different approaches that aim to reduce the number of human-labeled instancesrequired during training, such as active learning and weak supervision.Our results show that even with only a fraction of instances annotated with rationalesfrom the original dataset, ExPred still achieves good performance (within 95% of theperformance when using 100% annotation). Depending on the dataset, only a few thousandannotated rationales are required. Using weak supervision, this can be further reduced, atleast in specific settings. On the Movie Reviews dataset, we achieve good performance withonly 5% of the original rational labels. The tested off-the-shelf active learning methods donot provide any benefit over randomly selecting instances to label. However, the extensivebehavior analysis enables the future design of active learning methods that are tailoredto explain-then-predict models. We start by proposing an active learning method thatoutperforms the random baseline on the Movie Reviews dataset.
License of this version:	CC BY 3.0 DE
Document Type:	MasterThesis
Publishing status:	publishedVersion
Issue Date:	2021-09-15
Appears in Collections:	Fakultät für Elektrotechnik und Informatik

distribution of downloads over the selected time period:

downloads by country:

pos.	country		downloads
pos.	country		total	perc.
1		Germany	120	28.30%
2		United States	78	18.40%
3		Norway	21	4.95%
4		United Kingdom	16	3.77%
5		China	16	3.77%
6		No geo information available	14	3.30%
7		Turkey	13	3.07%
8		Russian Federation	13	3.07%
9		Netherlands	12	2.83%
10		France	9	2.12%
		other countries	112	26.42%

Further download figures and rankings:

Hinweis

Zur Erhebung der Downloadstatistiken kommen entsprechend dem „COUNTER Code of Practice for e-Resources“ international anerkannte Regeln und Normen zur Anwendung. COUNTER ist eine internationale Non-Profit-Organisation, in der Bibliotheksverbände, Datenbankanbieter und Verlage gemeinsam an Standards zur Erhebung, Speicherung und Verarbeitung von Nutzungsdaten elektronischer Ressourcen arbeiten, welche so Objektivität und Vergleichbarkeit gewährleisten sollen. Es werden hierbei ausschließlich Zugriffe auf die entsprechenden Volltexte ausgewertet, keine Aufrufe der Website an sich.

Search the repository

Browse

All content
- Communities & Collections
- By Issue Date
- Authors
- Titles
- Subjects
- Subjects (GND)
- DDC
- License
- Type
This Collection
- By Issue Date
- Authors
- Titles
- Subjects
- Subjects (GND)
- DDC
- License
- Type

Exploiting Rationale Data for Explainable NLP Models

Download statistics - Document (COUNTER):

Selected time period:

Sum total of downloads: 424

distribution of downloads over the selected time period:

downloads by country:

Further download figures and rankings:

Search the repository

Browse

All content

This Collection