Institut für Informationswissenschaft der TH Köln
Refine
Year of publication
- 2022 (1)
Document Type
- Bachelor Thesis (1)
Language
- English (1)
Has Fulltext
- yes (1)
Keywords
- Natural Language Processing (1)
- Open Research Knowledge Graph (1)
- Text Mining (1)
- Tollwut (1)
- spaCy (1)
With the growing scientific output that is produced, its getting more important to automate the extraction of knowledge from articles. This bachelor thesis will describe an approach doing exactly this. Scientific articles will be obtained from a database.
These articles will be preprocessed to gain a set of training data, to update a language model that already exists for Python library spaCy. The model will be trained to recognize different sorts of entities regarding to the virus rabies. After this process the model will be used for ten articles and the extracted knowledge will be used to extend the Open Research Knowledge Graph.