PhD graduated
Team : BD
Departure date : 06/21/2021

Supervision : Bernd AMANN

Co-supervision : NAACKE Hubert

Exploring Topic Evolution in Large Scientific Archives with Pivot Graphs

There is an increasing demand for practical tools to explore the evolution of scientific research published in bibliographic archives such as the Web of Science (WoS), arXiv, PubMed or ISTEX. Revealing meaningful evolution patterns from these document archives has many applications and can be extended to synthesize narratives from datasets across multiple domains, including news archives, legal document archives and works of literature.In this thesis, we propose a data model and query language for the visualization and exploration of topic evolution graphs. Our model is independent of a particular topic extraction and alignment method and proposes a set of semantic and structural metrics for characterizing and filtering meaningful topic evolution patterns. These metrics are particularly useful for the visualization and the exploration of large topic evolution graphs. We also present a prototype implementation of our model on top of Apache Spark and experimental results obtained for four real-world document archives.

Defence : 06/22/2021

Jury members :

Mirian Halfeld Ferrari, Professeure, Université d’Orléans, LIFO [Rapporteur]
Nicolas Travers, Maître de conférences HDR, ESILV, De Vinci Research Center [Rapporteur]
‪Nathalie Aussenac-Gilles, Directrice de Recherche CNRS, Université de Toulouse, IRIT
Clémence Magnien, Directrice de Recherche CNRS, Sorbonne Université, LIP6
Bernd Amann, Professeur, Sorbonne Université, LIP6
Hubert Naacke, Maître de conférences, Sorbonne Université, LIP6

Departure date : 06/21/2021

2017-2021 Publications