Abstract
Occupant-centric control (OcC) is an indoor climate control approach in which occupant feedback is incorporated into the sequence of operation of building energy systems. While OcC has been used in a wide range of building applications, one category that has received considerable research interest is learning occupants' thermal preferences from their thermostat interactions and adapting temperature setpoints accordingly. Many recent studies have used reinforcement learning (RL) agents for OcC to optimize energy use and occupant comfort. These studies relied on predicted mean vote (PMV) models or constant comfort ranges to represent comfort, while only a few used thermostat interactions. This paper addresses this gap by introducing a new off-policy RL algorithm that imitates occupant behaviour by utilizing unsolicited occupant thermostat overrides. The algorithm is tested with a number of synthetically generated occupant behaviour models implemented via the Python API of EnergyPlus. The simulation results indicate that the RL algorithm could rapidly learn preferences for all tested occupant behaviour scenarios with minimal exploration events. While substantial energy savings were observed for most occupant scenarios, the magnitude of the savings varied with occupants' preferences and the stochasticity of their thermostat use behaviour.
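The idea of learning setpoints from unsolicited overrides can be illustrated with a toy example. The sketch below is not the algorithm proposed in the paper and does not use EnergyPlus. It is a minimal single-state Q-learning loop under stated assumptions: a hypothetical deterministic occupant who prefers 22 °C and overrides any setpoint more than 1 °C away, a made-up linear energy proxy, and illustrative hyperparameters. All names (`occupant_override`, `reward`, `train`) are invented for illustration.

```python
import random

SETPOINTS = [20.0, 21.0, 22.0, 23.0, 24.0]  # hypothetical heating setpoints, deg C
PREFERRED, TOLERANCE = 22.0, 1.0             # hypothetical synthetic occupant
ALPHA, GAMMA, EPSILON = 0.1, 0.9, 0.1        # illustrative hyperparameters


def occupant_override(setpoint):
    """Return the setpoint the occupant switches to, or None if they accept it."""
    if abs(setpoint - PREFERRED) > TOLERANCE:
        return PREFERRED
    return None


def reward(setpoint, overridden):
    """Assumed reward: energy proxy (warmer costs more) plus an override penalty."""
    energy_cost = 0.1 * (setpoint - SETPOINTS[0])
    return -energy_cost - (1.0 if overridden else 0.0)


def q_update(q, action, r):
    """One Q-learning step for a single-state problem (bootstraps on max Q)."""
    q[action] += ALPHA * (r + GAMMA * max(q) - q[action])


def train(steps=5000, seed=0):
    rng = random.Random(seed)
    q = [0.0] * len(SETPOINTS)
    overrides = 0
    for _ in range(steps):
        # Epsilon-greedy choice of the next setpoint.
        if rng.random() < EPSILON:
            a = rng.randrange(len(SETPOINTS))
        else:
            a = max(range(len(SETPOINTS)), key=q.__getitem__)
        chosen = occupant_override(SETPOINTS[a])
        q_update(q, a, reward(SETPOINTS[a], chosen is not None))
        if chosen is not None:
            overrides += 1
            # Off-policy step: also learn from the action the occupant took,
            # which the agent itself did not select.
            b = SETPOINTS.index(chosen)
            q_update(q, b, reward(chosen, False))
    best = SETPOINTS[max(range(len(SETPOINTS)), key=q.__getitem__)]
    return best, overrides


if __name__ == "__main__":
    best, overrides = train()
    print("learned setpoint:", best, "overrides seen:", overrides)
```

Under these assumptions the agent settles on the lowest setpoint the occupant tolerates, since that setpoint minimizes the energy proxy without triggering an override. A stochastic occupant model would blur this boundary.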
Funding
Supported by Brainbox AI Inc.