Online and Reinforcement Learning

My work in online and reinforcement learning is about models that must keep improving while decisions are being made. These settings are often constrained, dynamic, and expensive to simulate: orbital systems, edge-computing task offloading, target localization, and online convex objectives all require algorithms that learn from sequential feedback without assuming a static, fully observed world.

The projects in this topic combine algorithmic guarantees with benchmark design and applied decision-support systems. I am particularly interested in how online learning and reinforcement learning can be made reliable enough for infrastructure-like settings, where exploration, uncertainty, and operational cost have to be handled carefully.

Publications in this topic

CLASP: Online learning algorithms for Convex Losses And Squared Penalties, International Conference on Machine Learning, 2026.
OrbitZoo: Real Orbital Systems Challenges for Reinforcement Learning, NeurIPS, 2025.
PeersimGym: An Environment for Solving the Task Offloading Problem with Reinforcement Learning, ECML PKDD Applied Data Science Track, 2024.
ISEE.U: Distributed online estimation and control for improved target localization accuracy, 2021.

Share on

Twitter Facebook LinkedIn

Claudia Soares

Publications in this topic

Share on