Information extraction from articles on the impacts of COVID-19 lockdowns on air quality

Münch, Quentin

Information extraction from articles on the impacts of COVID-19 lockdowns on air quality

Services

Deutsch English

About the Repository Search and Browse Publish

Home
→
Fakultäten
→
Fakultät für Elektrotechnik und Informatik
→
View Item

Download statistics - Document (COUNTER):

Münch, Quentin: Information extraction from articles on the impacts of COVID-19 lockdowns on air quality. Hannover : Gottfried Wilhelm Leibniz Universität, Bachelor Thesis, 2022, IX, 36 S. DOI: https://doi.org/10.15488/13114

Selected time period:

Sum total of downloads: 218

distribution of downloads over the selected time period
downloads by country

back to single item view (close usage statistics)

FileInformation_Extra ...

Size1.14 MB

FormatAdobe PDF

View

Abstract:
In response to the COVID-19 pandemic, cities worldwide imposed lockdowns to combat the spread of the virus. Governments ordered people to stay at home. Therefore, vehicle and industrial emissions changed drastically. Several researchers studied the impact of such lockdowns on air quality. The research centre Jülich accumulated various articles to gather all information. They manually searched each article to extract the relevant information and created a database containing their findings. Using the gathered data, they developed a website to illustrate their findings to the community. Moreover, they published the data set for other researchers to use freely. However, searching the articles by hand takes significant time and resources. Since the number of articles in the database will continuously increase in the future, developing models for automated extraction of such data can be beneficial. Here, we present a script that utilises a rule-based matching approach to extract pollution data from articles automatically. Around 150 reviewed articles were split into 80% training and 20% test data. We utilised the training data to manually find rules for extracting pollutants, whereas the test data did not influence the creation of patterns. It only serves as a test data set for the evaluation of the model. By feeding the defined rules to the model, it learns to detect various patterns in sentences and how to extract relevant information from them. A significant problem for the automated extraction present tables. They contain a plethora of data. However, extracting information from one does not work appropriately, let alone detecting a table. After the training finishes, the program gets tested using the test data. It achieves a 22% recall and 43% precision value when executed. Compared to manual extraction by experts, this result is significantly worse. Nevertheless, by highlighting relevant text passages, the program offers a great starting point for manual extraction.
License of this version:	CC BY 3.0 DE
Document Type:	BachelorThesis
Publishing status:	publishedVersion
Issue Date:	2022-08-23
Appears in Collections:	Fakultät für Elektrotechnik und Informatik

distribution of downloads over the selected time period:

downloads by country:

pos.	country		downloads
pos.	country		total	perc.
1		Germany	116	53.21%
2		United States	32	14.68%
3		India	17	7.80%
4		Vietnam	7	3.21%
5		Russian Federation	7	3.21%
6		Sweden	6	2.75%
7		Czech Republic	6	2.75%
8		Austria	6	2.75%
9		China	5	2.29%
10		Netherlands	3	1.38%
		other countries	13	5.96%

Further download figures and rankings:

Hinweis

Zur Erhebung der Downloadstatistiken kommen entsprechend dem „COUNTER Code of Practice for e-Resources“ international anerkannte Regeln und Normen zur Anwendung. COUNTER ist eine internationale Non-Profit-Organisation, in der Bibliotheksverbände, Datenbankanbieter und Verlage gemeinsam an Standards zur Erhebung, Speicherung und Verarbeitung von Nutzungsdaten elektronischer Ressourcen arbeiten, welche so Objektivität und Vergleichbarkeit gewährleisten sollen. Es werden hierbei ausschließlich Zugriffe auf die entsprechenden Volltexte ausgewertet, keine Aufrufe der Website an sich.

Search the repository

Browse

All content
- Communities & Collections
- By Issue Date
- Authors
- Titles
- Subjects
- Subjects (GND)
- DDC
- License
- Type
This Collection
- By Issue Date
- Authors
- Titles
- Subjects
- Subjects (GND)
- DDC
- License
- Type

Information extraction from articles on the impacts of COVID-19 lockdowns on air quality

Download statistics - Document (COUNTER):

Selected time period:

Sum total of downloads: 218

distribution of downloads over the selected time period:

downloads by country:

Further download figures and rankings:

Search the repository

Browse

All content

This Collection