PhD graduated
Team : SMA
    Sorbonne Université - LIP6
    Boîte courrier 169
    Couloir 25-26, Étage 4, Bureau 403
    4 place Jussieu
    75252 PARIS CEDEX 05

Tel: +33 1 44 27 87 94, Marc.Vincent (at)


Co-supervision : Vincent CORRUBLE

Reinforcement Learning for Multi-Function Radar Resource Management

In the wake of recent advances in the field of machine learning, much progress has been accomplished in one of its sub-fields, reinforcement learning, whose aim is to solve sequential decision problems under uncertainty. Radar resource management seems to represent an ideal application case for this type of technique. Indeed, a radar emits signals, called dwells, whose echoes are used to measure the state of surrounding objects; these dwells vary according to numerous parameters (duration, beam width...) and must be executed sequentially. The surveillance strategy of a multi-function radar thus consists in continuously selecting the dwells to perform, with the aim of searching the surrounding space while tracking already detected targets. The methods currently used to address this problem are largely heuristic, and are likely to run into difficulties in a range of complex situations involving hyper-velocity or hyper-maneuvering targets.
First, we propose applications of reinforcement learning techniques adapted to the current architecture of multi-function radars. These contributions focus on two aspects: dwell scheduling on the antenna using model-based methods, and active tracking dwell optimization using model-free methods. Secondly, we highlight the limitations of current resource management architectures, which leads us to consider an alternative architecture for which we propose new reinforcement learning algorithms designed to address the problems it raises. These contributions focus both on the multi-objective aspect, which is useful in multi-function radars to reflect the trade-offs to be made between different functions, and on the combinatorial aspect, which is due to the large number of tasks that the radar must carry out in parallel.

Defence : 09/08/2023 - 14h30 - Campus Pierre et Marie Curie, salle 55-65/211

Jury members :

Tristan CAZENAVE, Professeur des universités, Université Paris-Dauphine [Rapporteur]
Ann NOWÉ, Professeure des universités, Vrije Universiteit Brussel [Rapporteur]
Gauthier PICARD, Directeur de Recherche, ONERA
Olivier SIGAUD, Professeur des universités, Sorbonne Université
Amal EL FALLAH SEGHROUCHNI, Professeure des universités, Sorbonne Université
Vincent CORRUBLE, Maître de conférences, Sorbonne Université
Frédéric BARBARESCO, Segment Leader, Thales LAS France

2021-2023 Publications

  • 2023
  • 2021
    • A. Dey, B. Costé, É. Totel, A. Bécue, E. Aguas, A. Lambert, G. Blanc, H. Debar, Y. Chevalier, A. Medad, B. Gregorutti, E. Genetay, A. Nguema, G. Menguy, S. Bardin, R. Bonichon, C. De de Souza Lima, R. Charayron, Th. Lefèvre, N. Bartoli, J. Morlier, Z. Chihani, B. Carron, S. Brunessaux, L. Caquot, T. Charrier, B. El Bezzaz Semlali, O. Pasquero, A. Bazin, P.‑E. Flory, K. Kapusta, O. Stan, V. Thouvenot, K. Hynek, R. Ferrari, A. Boudguiga, R. Sirdey, A. Héliou, Th. Cejka, D. La Rocca, M. Zuber, G. Vardoulias, I. Papaioannou, A. Vekinis, G. Papadopoulou, M. Vincent, A. El Fallah‑Seghrouchni, V. Corruble, N. Bernardin, R. Kassab, F. Barbaresco, K. Tit, T. Furon, M. Rousset, L.‑M. Traonouez, P.‑Y. Lagrave, V. Vidal, M.‑C. Corbineau, T. Ceillier, A. IANCHERUK, Ah. Allali, J. Rodriguez, T. Bhor, R. Garcia, J.‑E. Guilhot‑Gaudeffroy, R. Plana : “Actes de la conférence CAID 2021 (Conference on Artificial Intelligence for Defense)”, pp. 1-152 (2021)
    • M. Vincent, A. El Fallah‑Seghrouchni, V. Corruble, N. Bernardin, R. Kassab, F. Barbaresco : “Monte Carlo Tree Search for Multi-function Radar Task Scheduling”, Conference on Artificial Intelligence for Defense, Rennes, France (2021)