Welcome to Metal Mine! Today is Share:
×

扫码分享

Metal Mine ›› 2025, Vol. 54 ›› Issue (10): 191-200.

Previous Articles     Next Articles

Research on Lane-Change Decision-Making for Intelligent Mining Trucks Using Integrated Reinforcement Learning and State Machines#br#

CHENG Yu1 XIE Lirong1 BIAN Yifan1 YANG Zhiyong2 HU Guilin2 YAN Zhuang1   

  1. 1. School of Electrical Engineering,Xinjiang University,Urumqi 830017,China;
    2. Xinjiang Tianchi Energy Company Limited,Changji 831100,China
  • Online:2025-10-15 Published:2025-11-07

Abstract: In order to improve the performance of lane-changing decision-making for intelligent network-connected mining
trucks in surface coal mines,this paper proposes a lane-changing decision-making method that integrates deep reinforcement
learning and finite state machines. First,a two-layer decision-making framework is constructed,where the upper layer utilizes
deep Q-networks to generate preliminary lane-changing decisions,and the lower layer performs security constraints through finite
state machines. Second,the dual network and competitive network structure are introduced to optimize the DQN performance,
which effectively alleviates the Q-value over-estimation problem. Then,a state transfer rule is designed based on the Gipps
security model to dynamically evaluate the security of the lane-changing gap. Finally,a multi-objective reward function is
designed to comprehensively evaluate and guide the lane changing behavior. Experiments are conducted on the Highway-env
platform,and the results show that the success rate of the fusion method for lane changing reaches 81. 36% in high traffic density
scenarios,which is significantly improved compared to a single DuDQN(50. 84%),with a reduced number of collisions
and enhanced driving stability. This framework can effectively improve the safety and efficiency of decision-making,and has
certain reference significance for the decision-making of open-pit mine transportation lane-changing.

Key words: smart grid-connected mining truck,deep reinforcement learning,finite state machine,lane change decision,
multi-objective reward function

CLC Number: