Welcome to Metal Mine! Today is Share:
×

扫码分享

Metal Mine ›› 2025, Vol. 54 ›› Issue (10): 175-181.

Previous Articles     Next Articles

Lane Keeping Algorithm for Mining Trucks Relying on Retraining Reinforcement Learning

LIU Jinyao1 XIE Lirong1 BIAN Yifan1 AN Yi1,2 YANG Zhiyong3,4 HUANG Deqi1   

  1. 1. School of Electrical Engineering,Xinjiang University,Urumqi 830017,China;
    2. School of Control Science and Engineering,Dalian University of Technology,Dalian 116024,China;
    3. Xinjiang Tianchi Energy Company Limited,Changji 831100,China;
    4. Xinjiang Key Laboratory of Intelligent Production and Control of Open Pit Mines,Changji 831100,China
  • Online:2025-10-15 Published:2025-11-07

Abstract: In order to solve the problem that the self-driving mining truck is easy to lose the ability to adapt to the previous
strategy in the complex environment of the mine,a deep reinforcement learning lane keeping control algorithm considering
sample retraining is proposed. Firstly,by considering the characteristics of the target network update parameters,a periodic experience
extraction retraining model is derived,and the retraining round interval is incorporated into the traditional target network
update parameter model. Then,in order to avoid the influence of noise on the model,the experience playback buffer is set
in a smaller sampling range. The influence of noise and unrelated experience on the model will be reduced,and the system robustness
under extreme operating conditions will be enhanced. Finally,considering the typical cross-shaped road of open-pit
mine,the vehicle position is set at the crossroads in CARLA,and the average reward obtained under the fixed number of rounds
is used as the key performance index of the simulation. The experimental results show that the proposed periodic retraining deep
Q network (PR-DQN) strategy effectively reduces the fluctuation in the training process,makes the model converge faster,effectively
improves the performance of the model in non-stationary environment tasks,and shows significant advantages in stability
and generalization ability.

CLC Number: