Reference policy + low-level controller + robot dynamics + safety filter, with stability indicator.
For robotics, control, and embodied-AI papers introducing safety wrappers around learned policies.
Replace the learned policy with a classical motion planner (RRT* + trajectory optimisation). Keep the safety filter, and explain that it now serves as a runtime certificate for the planner's commands.
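
To make the "runtime certificate" role concrete, here is a minimal sketch of a safety filter wrapped around a planner's nominal commands. The text above does not name a filter type, so everything here is an assumption for illustration: a control-barrier-function-style filter on a 1D single integrator (x' = u) with safe set x <= x_max, and a fixed list of velocity commands standing in for the RRT* + trajectory optimisation output. The names `safety_filter`, `rollout`, `FilterResult`, `x_max`, and `alpha` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class FilterResult:
    u_safe: float     # command actually sent to the low-level controller
    intervened: bool  # True if the filter modified the nominal command
    margin: float     # barrier value h(x) = x_max - x before the step


def safety_filter(x, u_nom, x_max=1.0, alpha=2.0):
    """Clip the nominal command so the barrier condition holds.

    Assumed dynamics: x' = u. Assumed barrier: h(x) = x_max - x.
    The condition dh/dt >= -alpha * h reduces to u <= alpha * (x_max - x).
    """
    h = x_max - x
    u_bound = alpha * h
    u_safe = min(u_nom, u_bound)
    return FilterResult(u_safe=u_safe, intervened=u_safe < u_nom, margin=h)


def rollout(x0, plan, dt=0.05, x_max=1.0, alpha=2.0):
    """Euler-integrate the filtered plan and log a per-step result.

    With alpha * dt <= 1 the discrete update keeps h >= 0 as well,
    since h_next >= (1 - alpha * dt) * h.
    """
    x = x0
    log = []
    for u_nom in plan:
        res = safety_filter(x, u_nom, x_max, alpha)
        x = x + dt * res.u_safe
        log.append((x, res))
    return log


if __name__ == "__main__":
    # Hypothetical plan: the planner asks for constant forward velocity,
    # which would cross x_max without the filter.
    log = rollout(0.0, [1.0] * 200)
    print("final x:", log[-1][0])
    print("steps with intervention:", sum(r.intervened for _, r in log))
```

In this sketch the per-step `FilterResult` is what plays the certificate role: each command is either passed through unchanged or clipped, and the logged `margin` and `intervened` fields record that the barrier condition was checked at that step. A real filter for a robot would typically solve a small constrained optimisation over the full dynamics rather than a scalar clip.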