Справочник

ENGILBERGE Martin

Аспирант at Sorbonne University
Подразделение : MLIA
https://martin.engilberge.io

Научны(е)й руководител(и)ь : Matthieu CORD

Deep multimodal embeddings and grounding

Nowadays Artificial Intelligence (AI) is omnipresent in our society. The recent development of learning methods based on deep neural networks also called "Deep Learning" has led to a significant improvement in visual and textual representation models. In this thesis, we aim to further advance image representation and understanding. Revolving around Visual Semantic Embedding (VSE) approaches, we explore different directions: We present relevant background covering images and textual representation and existing multimodal approaches. We propose novel architectures further improving retrieval capability of VSE and we extend VSE models to novel applications and leverage embedding models to visually ground semantic concept. Finally, we delve into the learning process and in particular the loss function by learning differentiable approximation of ranking based metric.

Защита диссертаций : 12.06.2020

Члены жюри :

M. AVRITHIS Yannis, Senior Researcher, INRIA Rennes [Rapporteur]
M. THOME Nicolas, Professeur, CNAM [Rapporteur]
Mme LARLUS Diane, Senior Research Scientist, NAVER Labs
M. PONCE Jean, Directeur de Recherche, INRIA - ENS
M. GALLINARI Patrick, Professeur, Sorbonne Université
M. PEREZ Patrick, Directeur de Recherche, Valeo.ai
M. CORD Matthieu, Professeur, Sorbonne Université

Окончание контракта : 12.06.2020

Публикации 2018-2020

All Информационная поддержка Диссертация

2020
- M. Engilberge : “Deep multimodal embeddings and grounding”, диссертация, Защита диссертаций 12.06.2020, Научны(е)й руководител(и)ь Cord, Matthieu (2020)
2019
- M. Engilberge, L. Chevallier, P. Pérez, M. Cord : “SoDeep: a Sorting Deep net to learn ranking loss surrogates”, CVPR 2019 - 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, United States (2019)
2018
- M. Engilberge, L. Chevallier, P. Pérez, M. Cord : “Deep semantic-visual embedding with localization”, RFIAP 2018 - Congrès Reconnaissance des Formes, Image, Apprentissage et Perception, Marne-la-Vallée, France (2018)
- M. Engilberge, L. Chevallier, P. Pérez, M. Cord : “Finding beans in burgers: Deep semantic-visual embedding with localization”, CVPR 2018 - 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, United States, pp. 3984-3993, (IEEE) (2018)