BOUTALEB Allaa
PhD graduated (Teaching assistant, Bourse EDITE)
Team : BD
Arrival date : 10/01/2024
Tel: +33 1 44 27 75 13, Allaa.Boutaleb (at) nulllip6.fr
https://lip6.fr/Allaa.Boutaleb
Team : BD
Arrival date : 10/01/2024
- Sorbonne Université - LIP6
Boîte courrier 169
Couloir 25-26, Étage 5, Bureau 520
4 place Jussieu
75252 PARIS CEDEX 05
FRANCE
Tel: +33 1 44 27 75 13, Allaa.Boutaleb (at) nulllip6.fr
https://lip6.fr/Allaa.Boutaleb
Supervision : Bernd AMANN
Co-supervision : Rafael ANGARITA, Hubert Naacke
Table representation learning for data set discovery and data integration in datalakes
The aim of this thesis proposal is to define and develop new solutions for structured tabular data discovery by learning table representations using Large Language Models (LLMs) and Graph Neural Networks (GNNs). The proposed approach suggests that the underlying transfer learning capabilities and the ability to handle graph-based data provide a robust framework for the challenges of modern data integration, enabling deeper analysis and accurate models for discovering and integrating heterogeneous datasets in a data lake. The scientific approach requires theoretical and practical experience in structured data processing and deep learning.