Narrowband Active Noise Control with DDPG Based on Reinforcement Learning

被引:0
作者
Ryu, Seokhoon [1 ]
Lim, Jihea [1 ]
Lee, Young-Sup [1 ]
机构
[1] Incheon Natl Univ, Dept Embedded Syst Engn, Incheon 22012, South Korea
关键词
Reinforcement learning; Active noise control; Secondary path model; Deterministic policy gradient; Control stability; SECONDARY PATH IDENTIFICATION; ALGORITHM;
D O I
10.1007/s12239-024-00102-x
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
This study investigates the use of deep reinforcement learning for active noise control (DRL-ANC) to cancel narrowband noise. The filtered-x least mean square algorithm for ANC, which includes the secondary path model in itself, has been widely used in various applications. If the path model is inaccurate due to the variations of the actual path, control performance and stability of the algorithm can be restricted. To eliminate the effect by the model inaccuracy, it is considered to remove the path model in the novel DRL-ANC strategy. A DRL approach using the deep deterministic policy gradient without any path model is adopted to learn the behavior of a physical environment including the effect of the actual secondary path in real time. However, a temporal credit assignment problem arises due to the time-delayed reward inherent in the secondary path, which means that the current action could not be evaluated by its true. To address this problem, this study proposes a novel definitions of the state and action of the RL agent, specialized in narrowband noise suppression. Additionally, a novel exploration noise is also suggested to enhance effectiveness and practicality of the learning process. Computer simulations and real-time control experiments were conducted, and the results demonstrated that the proposed DRL-ANC algorithm can robustly cope with changes in the secondary path.
引用
收藏
页码:1389 / 1397
页数:9
相关论文
共 22 条
  • [1] [Anonymous], 1995, Active noise control systems: Algorithms and DSP implementations
  • [2] Optimal Variable Step-Size NLMS Algorithms With Auxiliary Noise Power Scheduling for Feedforward Active Noise Control
    Carini, Alberto
    Malatini, Silvia
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (08): : 1383 - 1395
  • [3] Active Noise Cancellation Without Secondary Path Identification by Using an Adaptive Genetic Algorithm
    Chang, Cheng-Yuan
    Chen, Deng-Rui
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2010, 59 (09) : 2315 - 2327
  • [4] PERFORMANCE SURFACES OF MINIMUM EFFORT ESTIMATORS AND CONTROLLERS
    DARLINGTON, P
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1995, 43 (02) : 536 - 539
  • [5] Dozat T., 2016, INCORPORATING NESTER, P2013, DOI DOI 10.1007/978-3-642-33483-2
  • [6] Elliott S. J., 2001, SIGNAL PROCESSING AC
  • [7] Increasing the robustness of a preconditioned filtered-X LMS algorithm
    Fraanje, R
    Verhaegen, M
    Doelman, N
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2004, 11 (02) : 285 - 288
  • [8] An Introduction to Deep Reinforcement Learning
    Francois-Lavet, Vincent
    Henderson, Peter
    Islam, Riashat
    Bellemare, Marc G.
    Pineau, Joelle
    [J]. FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2018, 11 (3-4): : 219 - 354
  • [9] Fujimoto S, 2018, PR MACH LEARN RES, V80
  • [10] Recent advances on active noise control: open issues and innovative applications
    Kajikawa, Yoshinobu
    Gan, Woon-Seng
    Kuo, Sen M.
    [J]. APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2012, 1 (01)