Ajouter à votre agenda PhD student (Teaching assistant, ANR)
Team : BD
    Sorbonne Université - LIP6
    Boîte courrier 169
    Couloir 25-26, Étage 5, Bureau 502
    4 place Jussieu
    75252 PARIS CEDEX 05

Tel: +33 1 44 27 87 56, Ke.Li (at)
There is an increasing demand for practical tools to explore the evolution of scientific research published in bibliographic archives such as the Web of Science (WoS), arXiv, PubMed or ISTEX. Revealing meaningful evolution patterns from these document archives has many applications and can be extended to synthesize narratives from datasets across multiple domains, including news archives, legal document archives and works of literature.In this thesis, we propose a data model and query language for the visualization and exploration of topic evolution graphs. Our model is independent of a particular topic extraction and alignment method and proposes a set of semantic and structural metrics for characterizing and filtering meaningful topic evolution patterns. These metrics are particularly useful for the visualization and the exploration of large topic evolution graphs. We also present a prototype implementation of our model on top of Apache Spark and experimental results obtained for four real-world document archives.

Defence : 06/22/2021 - 14h - Campus Jussieu, Salle Jacques Pitrat (25-26/105)

Jury members :

Mirian Halfeld Ferrari, Professeure, Université d’Orléans, LIFO [Rapporteur]
Nicolas Travers, Maître de conférences HDR, ESILV, De Vinci Research Center [Rapporteur]
‪Nathalie Aussenac-Gilles, Directrice de Recherche CNRS, Université de Toulouse, IRIT
Clémence Magnien, Directrice de Recherche CNRS, Sorbonne Université, LIP6
Bernd Amann, Professeur, Sorbonne Université, LIP6
Hubert Naacke, Maître de conférences, Sorbonne Université, LIP6

2017-2020 Publications