Physics-enhanced Reinforcement Learning with Human Feedback (PE-RLHF) is designed for safe and trustworthy autonomous driving.
- PE-RLHF is a novel framework that synergistically integrates human feedback with physics knowledge into the RL training loop.
- PE-RLHF introduces a new human-AI collaborative paradigm that ensures a trustworthy safety performance lower bound.
- PE-RLHF employs a reward-free approach with a proxy value function to represent human preferences and guide training process.
In extensive experiments across various driving scenarios:
- PE-RLHF significantly reduces safety violations, demonstrating superior performance compared to traditional methods.
- PE-RLHF significantly improves learning efficiency, enabling faster policy development compared to traditional RL methods.
- PE-RLHF adapts to different levels of human feedback quality, ensuring robust performance.