BOUTALEB Allaa

PhD graduated (Teaching assistant, Bourse EDITE)
Team : BD
Arrival date : 10/01/2024
    Sorbonne Université - LIP6
    Boîte courrier 169
    Couloir 25-26, Étage 5, Bureau 520
    4 place Jussieu
    75252 PARIS CEDEX 05
    FRANCE

Tel: +33 1 44 27 75 13, Allaa.Boutaleb (at) nulllip6.fr
https://lip6.fr/Allaa.Boutaleb

Supervision : Bernd AMANN

Co-supervision : Rafael ANGARITA, Hubert Naacke

Table representation learning for data set discovery and data integration in datalakes

The aim of this thesis proposal is to define and develop new solutions for structured tabular data discovery by learning table representations using Large Language Models (LLMs) and Graph Neural Networks (GNNs). The proposed approach suggests that the underlying transfer learning capabilities and the ability to handle graph-based data provide a robust framework for the challenges of modern data integration, enabling deeper analysis and accurate models for discovering and integrating heterogeneous datasets in a data lake. The scientific approach requires theoretical and practical experience in structured data processing and deep learning.