Mining and Modeling Text
Interdisciplinary applications, informational development, legal perspectives (MiMo Text)
Extracting knowledge from volumes of text and data that are too large for any individual to handle is becoming increasingly important as digitisation opens up new possibilities. For the humanities in particular, this means that digital full texts and rich metadata must not only be available, but must be available in a form that supports the production of knowledge in the humanities.
The aim of the MiMoText project is therefore to establish an information network for the humanities that is fed from various sources. Published as Linked Open Data, the network is freely available, can be linked to other knowledge resources of the Semantic Web, and offers innovative and efficient ways of accessing scholarly information.
In the first project phase, the focus is on sources for the history of the French novel from 1750 to 1799; in the second phase, the approach will be transferred to a parallel period of German literary history. Both phases can draw on existing digital full texts from Gallica, TextGrid and VD18.
Bibliographic directories, specialist literature and primary texts serve as sources of information. From these, metadata, concrete text properties and descriptive or evaluative statements about relevant entities, for example authors and works, are extracted. For this purpose, quantitative methods for automatic text analysis and for the extraction and modelling of data from extensive text collections must be refined and, in part, newly developed. The information is then converted into a Linked Open Data format, so that individual items can be linked to one another and to external resources. From the start of the project, the legal framework will also be analysed in order to ensure that the knowledge network is set up and made available in accordance with copyright and data protection laws.
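The conversion step can be illustrated with a minimal sketch. It assumes that extracted statements take the form of subject, predicate and value, and it writes them out as N-Triples, one common Linked Open Data serialisation. The namespace, the property names (`hasAuthor`, `title`, `publicationYear`) and the sample values are hypothetical placeholders, not the project's actual data model or data.

```python
from dataclasses import dataclass

# Hypothetical namespace, used only for this illustration.
BASE = "http://example.org/mimotext-sketch/"


@dataclass(frozen=True)
class Statement:
    subject: str             # local identifier of the described entity
    predicate: str           # local name of the property
    value: str               # another entity's identifier, or a literal
    is_entity: bool = False  # True if value refers to another entity


def escape_literal(text: str) -> str:
    """Escape the characters that N-Triples requires to be escaped in literals."""
    return (
        text.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )


def to_ntriples(statements) -> str:
    """Serialise statements as N-Triples, one triple per line."""
    lines = []
    for s in statements:
        subj = f"<{BASE}{s.subject}>"
        pred = f"<{BASE}{s.predicate}>"
        if s.is_entity:
            obj = f"<{BASE}{s.value}>"
        else:
            obj = f'"{escape_literal(s.value)}"'
        lines.append(f"{subj} {pred} {obj} .")
    return "\n".join(lines)


# Placeholder statements about an invented work and author.
statements = [
    Statement("work/1", "hasAuthor", "author/1", is_entity=True),
    Statement("work/1", "title", 'A "placeholder" title'),
    Statement("work/1", "publicationYear", "1750"),
]

ntriples = to_ntriples(statements)
print(ntriples)
```

Because every entity receives its own URI, statements drawn from different sources about the same author or work can be combined, and the same URIs can later be matched to identifiers in external resources.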
Project spokesperson: Prof Dr Christof Schöch (firstname.lastname@example.org)
Deputy spokesperson: Prof Dr Claudine Moulin (email@example.com)