Tong Liu - Research

Home

Research

Publications

Robust Distributed Optimization

Convergence errors of distributed optimization under parameter variations and disturbances — Convergence errors across all computing nodes under parameter variations and external disturbances.

Distributed optimization enables a network of agents to solve a global objective using only local data and neighbor-to-neighbor communication. A representative case is federated learning, where each agent retains its own dataset while jointly training a global model that minimizes the aggregate loss.

My work develops robust algorithms that provably converge despite complex agent dynamics, parametric uncertainties, and exogenous disturbances. We co-design the optimizer and controller—leveraging output regulation, extremum seeking, and reinforcement learning—and certify stability and performance using the small-gain theorem and singular perturbation theory. Applications include distributed computation, sensor networks, robotic swarms, and smart grids.

Data-Driven Traffic Signal Optimization

Urban traffic signals are often fixed or heuristically actuated, and they struggle to handle uncertain and time-varying demand. I use a data-driven approach that learns traffic dynamics from sensor data and optimizes signal timings in real time via various data-based control strategies. I built and calibrated a 42-intersection SUMO model from field data; learned the traffic flow dynamics with linear regression, multi-layer perceptrons, and deep neural networks ; engineered an optimization pipeline to meet real-time constraints, integrating linear quadratic regulator, gain scheduling, reinforcement learning, and model predictive control (MPC) .

Deployed across 24 intersections with computer-vision sensors, the pipeline reduced average travel time by 19.4% and vehicle queue lengths by 15.6% in field experiments. The figures left illustrate the queue-length comparison between the proposed MPC controller and the standard NEMA controller.

Reinforcement Learning for Autonomous Driving

Convergence of the controller parameters.

In mixed traffic, the parameters of human-driven vehicles (HDVs) are unknown and unmeasurable, making it difficult to regulate the overall platoon speed and alleviate congestion on freeways and ring roads. I developed a unified reinforcement learning approach that learns directly from vehicle input–output data and uses connected and autonomous vehicles (CAVs) to regulate the speed of the entire platoon in both settings. I first analyzed the stabilizability of mixed CAV–HDV strings for ring-road and freeway topologies, then—relying only on driving data and no model-parameter knowledge—designed RL controllers that (i) stabilize the system and (ii) attenuate external disturbances, with provable convergence of the optimal controller/parameters. The method improves platoon-level speed regulation and congestion mitigation; the figures above show a ring-road experiment (left) and the convergence of the learned parameters during training (right).

Reinforcement Learning for Energy-Efficient Train Operation

As line conditions and operating scenarios grow more complex, classical optimal control struggles to compute energy-efficient train speed profiles. Dynamic programming can handle this numerically, but the standard approach suffers from the curse of dimensionality. To address this, I develop reinforcement learning methods that efficiently generate the approximate optimal speed profiles.

Concretely, I use three complementary approximators for value functions: a rollout estimator, an interpolation-based approximator, and a neural-network approximator. The resultant strategies can closely match solutions from the classical maximum principle. To make the comparison, I resolve an open question by proving the uniqueness of the optimal trajectory under speed limit constraints (see left figure). The proposed approach yields energy-efficient driving strategies in both single-train and multi-train operations when exploiting regenerative energy .

Last update on | Design and source code from Jon Barron's website