Keynote

(Replay) Generating robot constitutions and aligning robot behavior

  • Date
    7 April 2026
    Timeframe
    16:00 - 16:20 CET
    Duration
    20 minutes

    This is a replay of the session that took place on the Frontier Stage during the AI for Good Global Summit, held in Geneva, Switzerland, from 8 to 11 July 2025. As robots become increasingly integrated into society, ensuring their safe and ethical behavior is paramount. We explore the use of “robot constitutions”, sets of rules designed to govern robot actions, generated directly from real-world images, hospital injury reports and science fiction. We demonstrate how these constitutions can be automatically tailored to specific environments and even optimized through counterfactual reasoning. By evaluating robot responses against generated “undesirable” scenarios, we can assess and improve constitution effectiveness. Notably, AI equipped with these constitutions shows significantly higher alignment with human values than AI without them. Reassuringly, we find that the alignment of modern AI with humans is very high (96%) compared to that of the AIs and robots depicted in science fiction (21%).
