Home Research Publications CV

Robust Distributed Optimization

Convergence errors of distributed optimization under parameter variations and disturbances
Convergence errors across all computing nodes under parameter variations and external disturbances.

Distributed optimization enables a network of agents to solve a global objective using only local data and neighbor-to-neighbor communication. A representative case is federated learning, where each agent retains its own dataset while jointly training a global model that minimizes the aggregate loss.

My work develops robust algorithms that provably converge despite complex agent dynamics, parametric uncertainties, and exogenous disturbances. We co-design the optimizer and controller—leveraging output regulation, extremum seeking, and reinforcement learning—and certify stability and performance using the small-gain theorem and singular perturbation theory. Applications include distributed computation, sensor networks, robotic swarms, and smart grids.


Data-Driven Traffic Signal Optimization

Performance comparison
Performance comparison.

Urban traffic signals are often fixed or heuristically actuated, and they struggle to handle uncertain and time-varying demand. I use a data-driven approach that learns traffic dynamics from sensor data and optimizes signal timings in real time via various data-based control strategies. I built and calibrated a 42-intersection SUMO model from field data; learned the traffic flow dynamics with linear regression, multi-layer perceptrons, and deep neural networks ; engineered an optimization pipeline to meet real-time constraints, integrating linear quadratic regulator, gain scheduling, reinforcement learning, and model predictive control (MPC) .

Deployed across 24 intersections with computer-vision sensors, the pipeline reduced average travel time by 19.4% and vehicle queue lengths by 15.6% in field experiments. The figures left illustrate the queue-length comparison between the proposed MPC controller and the standard NEMA controller.


Reinforcement Learning for Autonomous Driving

Mixed traffic on a ring road
Mixed traffic on a ring road.
Convergence of the controller parameters
Convergence of the controller parameters.

In mixed traffic, the parameters of human-driven vehicles (HDVs) are unknown and unmeasurable, making it difficult to regulate the overall platoon speed and alleviate congestion on freeways and ring roads. I developed a unified reinforcement learning approach that learns directly from vehicle input–output data and uses connected and autonomous vehicles (CAVs) to regulate the speed of the entire platoon in both settings. I first analyzed the stabilizability of mixed CAV–HDV strings for ring-road and freeway topologies, then—relying only on driving data and no model-parameter knowledge—designed RL controllers that (i) stabilize the system and (ii) attenuate external disturbances, with provable convergence of the optimal controller/parameters. The method improves platoon-level speed regulation and congestion mitigation; the figures above show a ring-road experiment (left) and the convergence of the learned parameters during training (right).


Reinforcement Learning for Energy-Efficient Train Operation

Optimal speed curve under speed limit
Optimal speed curve under speed limit.

As line conditions and operating scenarios grow more complex, classical optimal control struggles to compute energy-efficient train speed profiles. Dynamic programming can handle this numerically, but the standard approach suffers from the curse of dimensionality. To address this, I develop reinforcement learning methods that efficiently generate the approximate optimal speed profiles.

Concretely, I use three complementary approximators for value functions: a rollout estimator, an interpolation-based approximator, and a neural-network approximator. The resultant strategies can closely match solutions from the classical maximum principle. To make the comparison, I resolve an open question by proving the uniqueness of the optimal trajectory under speed limit constraints (see left figure). The proposed approach yields energy-efficient driving strategies in both single-train and multi-train operations when exploiting regenerative energy .


Last update on | Design and source code from Jon Barron's website