基于物理信息强化学习的混合交通合流区控制
DOI:
CSTR:
作者:
作者单位:

长沙理工大学 智能道路与车路协同湖南省重点实验室

作者简介:

通讯作者:

中图分类号:

基金项目:

国家自然科学基金(52402371,52572338);湖南省自然科学基金(2024JJ6037);同济大学道路与交通工程教育部重点实验室开放基金(K202408)


Physics-informed reinforcement learning for merging control in mixed traffic
Author:
Affiliation:

Fund Project:

National Natural Science Foundation of China (Grant Nos. 52402371 and 52572338); the Natural Science Foundation of Hunan Province (Grant No. 2024JJ6037); and the Open Fund of the Key Laboratory of Road and Traffic Engineering, Ministry of Education, Tongji University (Grant No. K202408).

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    【目的】为提升混合交通场景下车辆合流效率,本文提出一种基于物理信息强化学习的合流控制方法。【方法】基于Highway-env仿真平台构建混合交通快速路合流环境,将合流区建模为多智能体系统,设计融合物理信息约束的多智能体近端策略优化算法(Multi-Agent Proximal Policy Optimization, MAPPO),构建静态-动态组合动作掩码机制过滤违背交通安全规则的无效动作,并通过在奖励函数中加入物理合理性惩罚项,引导智能体遵循车辆运行规律,减少异常行为发生,提高训练效率和可解释性。【结果】该方法在合流成功率、车辆平均速度及运行稳定性等关键指标上均取得显著提升,与无控制方案相比,车辆平均速度最高提升12.51%,合流成功率最高提升9.44%;与先进先出控制策略相比,车辆平均速度最高提升7.96%,合流成功率最高提升5.85%;与无物理信息约束强化学习方法相比,该方法不仅加快了模型训练的收敛速度,还可以有效降低陷入局部最优的风险。【结论】所提出的控制方法在不同交通密度场景下都可以显著提升合流区车辆通行效率,为混合交通快速路合流控制提供一种新的解决方案。

    Abstract:

    [Purposes] To improve vehicle merging efficiency under mixed traffic conditions, this study proposes a merging control method based on physics-informed reinforcement learning. [Methods] A mixed traffic freeway merging environment is constructed on the Highway-env simulation platform, and the merging area is modeled as a multi-agent system. A multi-agent proximal policy optimization (MAPPO) algorithm with embedded physical-information constraints is designed. A static–dynamic hybrid action-masking mechanism is developed to filter out invalid actions that violate traffic safety rules. In addition, physical rationality penalty terms are incorporated into the reward function to guide agents to comply with vehicle motion laws, reduce abnormal behaviors, and improve training efficiency and interpretability. [Findings] Compared with the uncontrolled scheme, the proposed method achieves significant improvements in merging success rate, average vehicle speed, and operational stability. Specifically, the average vehicle speed increases by up to 12.89%, and the merging success rate improves by up to 9.44%. Compared with reinforcement learning methods without physical-information constraints, the proposed approach accelerates policy convergence and effectively reduces the risk of falling into local optima. [Conclusions] The proposed control strategy significantly enhances traffic efficiency in merging areas across various traffic density scenarios and provides a novel and effective solution for mixed-traffic freeway merging control.

    参考文献
    相似文献
    引证文献
引用本文
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-12-04
  • 最后修改日期:2026-01-13
  • 录用日期:2026-01-13
  • 在线发布日期:
  • 出版日期:
文章二维码