Forschungsleitung (Direction de recherche) : Matthieu CORD
Co-Betreuung : THOME Nicolas
Study on training methods and generalization performance of deep learning for image classification.
Artificial intelligence has gained a lot of interest in recent years, particularly with the impressive advances in computer vision. More specifically, deep neural networks today enable us to automate many tasks in the processing of visual information. This family of machine learning algorithms demonstrates a remarkable facility to learn large datasets. Despite their impressive performance, their ability to generalize remains largely misunderstood. From a research and application point of view, deep learning performance and explicability are more and more demanding. This is the purpose of our research whose contributions are presented in this thesis.
We first worked on accelerating the training of deep networks via distributed computing methods among several GPUs. Decreasing the optimization time of neural networks makes it easier to test new architectures as well as new learning methods. We then studied the convolution neural network architectures in order to improve them, without increasing their complexity. We have proposed a new activation function, which filters less information than the standard ReLU function, allowing better processing of semantic information. Finally, we considered regularizing the training of networks. In particular, we have studied a criterion of regularization based on information theory, which we have deployed in two different ways. The first instantiation models the criterion layer by layer for more efficient regularization. The second uses a variational to bound the criterion and optimize it. Both methods have been rigorously validated by comparison with the state of the art on many image classification reference databases, such as ImageNet.
Verteidigung einer Doktorarbeit : 19.11.2018 - 14h - Site Jussieu 25-26/105
M. Liva Ralaivola, Aix-Marseille Université - LIF
Mme Elisa Fromont, INRIA de Rennes - IRISA
M. Aurelien Bellet, INRIA de Lille
M. Alain Rakotomamonjy, INSA de Rouen - LITIS [Rapporteur]
M. Christian Wolf, INSA de Lyon - LIRIS [Rapporteur]
M. Matthieu Cord, Sorbonne Université - LIP6
M. Nicolas Thome, CNAM - CEDRIC