Abstract
We present the development of a bias-compensating reinforcement learning (RL) algorithm that optimizes thermal comfort (by minimizing tracking error) and control utilization (by penalizing setpoint deviations) in a multi-zone heating, ventilation, and air-conditioning (HVAC) lab facility subject to unmeasurable disturbances and unknown dynamics. We show that the presence of an unmeasurable disturbance renders the learning equation of traditional RL controllers inconsistent, leading to parameter estimation bias (even with integral action support) and, in the extreme case, divergence of the learning algorithm. We demonstrate this issue by applying the popular Q-learning algorithm to linear quadratic regulation (LQR) of a multi-zone HVAC environment and showing that, even with integral support, the algorithm exhibits estimation bias during the learning phase when the HVAC disturbance is unmeasurable due to unknown heat gains, occupancy variations, light sources, and outside weather changes. To address this difficulty, we present a bias-compensating learning equation that learns a lumped bias term, arising from disturbances (and possibly other sources), in conjunction with the optimal control parameters. Experimental results show that the proposed scheme not only recovers the bias-free optimal control parameters but does so without explicitly learning the dynamic model or estimating the disturbances, demonstrating the effectiveness of the algorithm in addressing the above challenges.
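To make the idea concrete, the following is a minimal Python sketch, not the authors' implementation, of how a lumped bias term can be learned alongside the Q-function parameters in Q-learning for LQR. The two-zone model matrices A and B, the disturbance d, the cost weights, and the discount factor are illustrative assumptions, not the paper's HVAC model. The quadratic regressor is augmented with linear and constant features so that the disturbance-induced offset is absorbed into separately learned terms rather than biasing the control parameters.

    import numpy as np

    rng = np.random.default_rng(0)

    # Hypothetical two-zone linear model: x_{k+1} = A x + B u + d,
    # with d an unmeasured, lumped constant disturbance (assumed values).
    A = np.array([[0.90, 0.05],
                  [0.05, 0.90]])
    B = np.array([[0.5, 0.0],
                  [0.0, 0.5]])
    d = np.array([0.3, -0.2])
    Qc = np.eye(2)            # tracking-error weight
    R = 0.1 * np.eye(2)       # control-effort weight
    gamma = 0.95              # discount factor (keeps values finite under d)
    n, m = 2, 2

    def features(x, u):
        # Quadratic terms of z = [x; u] plus linear and constant terms;
        # the linear/constant entries absorb the disturbance-induced bias.
        z = np.concatenate([x, u])
        quad = np.outer(z, z)[np.triu_indices(n + m)]
        return np.concatenate([quad, z, [1.0]])

    K = np.zeros((m, n))      # initial gain (A is stable, so u = 0 is safe)
    k0 = np.zeros(m)          # affine offset recovered from the bias terms

    for it in range(10):                       # policy iteration
        Phi, y = [], []
        x = rng.standard_normal(n)
        for k in range(500):                   # on-policy TD regression data
            u = -K @ x - k0 + 0.5 * rng.standard_normal(m)   # exploration
            cost = x @ Qc @ x + u @ R @ u
            x_next = A @ x + B @ u + d
            u_next = -K @ x_next - k0
            Phi.append(features(x, u) - gamma * features(x_next, u_next))
            y.append(cost)
            x = x_next
        theta, *_ = np.linalg.lstsq(np.array(Phi), np.array(y), rcond=None)
        nq = (n + m) * (n + m + 1) // 2
        H = np.zeros((n + m, n + m))
        H[np.triu_indices(n + m)] = theta[:nq]  # rebuild symmetric Q-matrix
        H = (H + H.T) / 2
        g = theta[nq:nq + n + m]                # learned linear (bias) terms
        Huu, Hux, gu = H[n:, n:], H[n:, :n], g[n:]
        K = np.linalg.solve(Huu, Hux)           # improved feedback gain
        k0 = np.linalg.solve(Huu, gu / 2)       # improved disturbance offset

    print("learned feedback gain K:\n", K)
    print("learned offset k0:", k0)

In this sketch the learned linear term yields an affine offset k0 that compensates the constant disturbance, playing a role analogous to the integral support discussed in the abstract, while K converges to the bias-free feedback gain without any model identification or disturbance estimation; the paper's actual learning equation may differ in its details.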
Funding
Supported in part by NIST (70NANB18H161).