Reinforcement Learning / Adaptive Optimization

Many adaptive and learning control problems can be formulated as optimization problems aiming at minimizing an objective function related to system performance: in this context, several research directions arise such as optimal control formulations for uncertain systems via the Bellman’s optimality principle, or solving the exploitation-exploration dilemma of controlling the system as well as possible based on current system knowledge (exploitation) or experimenting with the system so as to learn about its behavior and control it better in the future (exploration).

Representative Themes

  • Adaptive dynamic programming / Approximate dynamic programming
  • Distributed learning and distributed optimization
  • Cognitive adaptive control, adaptive optimal control
  • Adaptive dual control

Representative Applications

  • Self-optimizing energy efficient systems (e.g. of energy systems)
  • Self-optimizing formations (e.g. of unmanned vehicles)
  • Learning-based robots

Representative Results

  • Yue D., Baldi S., Cao J., and De Schutter B., “Distributed adaptive optimization with weight-balancing”, IEEE Transactions on Automatic Control, scheduled for publication, 2021. doi:10.1109/TAC.2021.3071651
  • Dai P., Yu W., Wang H., and Baldi S., “Distributed actor-critic algorithms for multi-agent reinforcement learning over directed graphs”, IEEE Transactions on Neural Networks and Learning Systems, scheduled for publication, 2022. doi:10.1109/TNNLS.2021.3139138
  • Michailidis I., Baldi S., Kosmatopoulos E. B., and Ioannou P. A., “Adaptive optimal control for large-scale nonlinear systems”. IEEE Transactions on Automatic Control, Vol. 62(11), pp. 5567-5577, 2017. doi:10.1109/TAC.2017.2684458