The goal of this paper is to design a model-free optimal controller for the multirate system based on reinforcement learning.Sampled-data control systems are widely used in the industrial production process and multir...The goal of this paper is to design a model-free optimal controller for the multirate system based on reinforcement learning.Sampled-data control systems are widely used in the industrial production process and multirate sampling has attracted much attention in the study of the sampled-data control theory.In this paper,we assume the sampling periods for state variables are different from periods for system inputs.Under this condition,we can obtain an equivalent discrete-time system using the lifting technique.Then,we provide an algorithm to solve the linear quadratic regulator(LQR)control problem of multirate systems with the utilization of matrix substitutions.Based on a reinforcement learning method,we use online policy iteration and off-policy algorithms to optimize the controller for multirate systems.By using the least squares method,we convert the off-policy algorithm into a model-free reinforcement learning algorithm,which only requires the input and output data of the system.Finally,we use an example to illustrate the applicability and efficiency of the model-free algorithm above mentioned.展开更多
Based on the multi-model principle, the fuzzy identification for nonlinear systems with multirate sampled data is studied.Firstly, the nonlinear system with multirate sampled data can be shown as the nonlinear weighte...Based on the multi-model principle, the fuzzy identification for nonlinear systems with multirate sampled data is studied.Firstly, the nonlinear system with multirate sampled data can be shown as the nonlinear weighted combination of some linear models at multiple local working points. On this basis, the fuzzy model of the multirate sampled nonlinear system is built. The premise structure of the fuzzy model is confirmed by using fuzzy competitive learning, and the conclusion parameters of the fuzzy model are estimated by the random gradient descent algorithm. The convergence of the proposed identification algorithm is given by using the martingale theorem and lemmas. The fuzzy model of the PH neutralization process of acid-base titration for hair quality detection is constructed to demonstrate the effectiveness of the proposed method.展开更多
The present paper discusses a design method for the head position in a Hard Disk Drive (HDD) control system. In the HDD control system, the sampling interval of the head position is constrained because of the hardwa...The present paper discusses a design method for the head position in a Hard Disk Drive (HDD) control system. In the HDD control system, the sampling interval of the head position is constrained because of the hardware specifications, but the hold interval of the control input is not constrained. In the present study, a multirate control system is designed, in which the sampling and the hold intervals are not equal. A multirate control law, which stabilizes a closed-loop system, is extended using newly introduced parameters such that the sample response of the plant output is maintained. Furthermore, intersample ripples in the steady state are eliminated using the new design parameters, which can be selected independently of the sample response. As a result, the intersample response can be improved independently of the sample response. The proposed method is applied to a benchmark problem of an HDD system, and its effectiveness is demonstrated.展开更多
The use of a lower sampling rate for designing a discrete-time state feedback-based controller fails to capture information of fast states in a two-time-scale system, while the use of a higher sampling rate increases ...The use of a lower sampling rate for designing a discrete-time state feedback-based controller fails to capture information of fast states in a two-time-scale system, while the use of a higher sampling rate increases the amount of computation considerably. Thus,the use of single-rate sampling for systems with slow and fast states has evident limitations. In this paper, multirate state feedback(MRSF) control for a linear time-invariant two-time-scale system is proposed. Here, multirate sampling refers to the sampling of slow and fast states at different sampling rates. Firstly, a block-triangular form of the original continuous two-time-scale system is constructed. Then, it is discretized with a smaller sampling period and feedback control is designed for the fast subsystem. Later, the system is block-diagonalized and equivalently represented into a system with a higher sampling period. Subsequently, feedback control is designed for the slow subsystem and overall MRSF control is derived. It is proved that the derived MRSF control stabilizes the full-order system. Being the transformed states of the original system, slow and fast states need to be estimated for the MRSF control realization.Hence, a sequential two-stage observer is formulated to estimate these states. Finally, the applicability of the design method is demonstrated with a numerical example and simulation results are compared with the single-rate sampling method. It is found that the proposed MRSF control and observer designs reduce computations without compromising closed-loop performance.展开更多
基金This work was supported by National Key R&D Program of China(No.2018YFB1308404).
文摘The goal of this paper is to design a model-free optimal controller for the multirate system based on reinforcement learning.Sampled-data control systems are widely used in the industrial production process and multirate sampling has attracted much attention in the study of the sampled-data control theory.In this paper,we assume the sampling periods for state variables are different from periods for system inputs.Under this condition,we can obtain an equivalent discrete-time system using the lifting technique.Then,we provide an algorithm to solve the linear quadratic regulator(LQR)control problem of multirate systems with the utilization of matrix substitutions.Based on a reinforcement learning method,we use online policy iteration and off-policy algorithms to optimize the controller for multirate systems.By using the least squares method,we convert the off-policy algorithm into a model-free reinforcement learning algorithm,which only requires the input and output data of the system.Finally,we use an example to illustrate the applicability and efficiency of the model-free algorithm above mentioned.
基金supported by the National Natural Science Foundation of China(61863034)。
文摘Based on the multi-model principle, the fuzzy identification for nonlinear systems with multirate sampled data is studied.Firstly, the nonlinear system with multirate sampled data can be shown as the nonlinear weighted combination of some linear models at multiple local working points. On this basis, the fuzzy model of the multirate sampled nonlinear system is built. The premise structure of the fuzzy model is confirmed by using fuzzy competitive learning, and the conclusion parameters of the fuzzy model are estimated by the random gradient descent algorithm. The convergence of the proposed identification algorithm is given by using the martingale theorem and lemmas. The fuzzy model of the PH neutralization process of acid-base titration for hair quality detection is constructed to demonstrate the effectiveness of the proposed method.
文摘The present paper discusses a design method for the head position in a Hard Disk Drive (HDD) control system. In the HDD control system, the sampling interval of the head position is constrained because of the hardware specifications, but the hold interval of the control input is not constrained. In the present study, a multirate control system is designed, in which the sampling and the hold intervals are not equal. A multirate control law, which stabilizes a closed-loop system, is extended using newly introduced parameters such that the sample response of the plant output is maintained. Furthermore, intersample ripples in the steady state are eliminated using the new design parameters, which can be selected independently of the sample response. As a result, the intersample response can be improved independently of the sample response. The proposed method is applied to a benchmark problem of an HDD system, and its effectiveness is demonstrated.
基金supported by National Natural Science Foundation of China (No. 61750110524)National Key R&D Program of China (No. 2017YFE0128500)。
文摘The use of a lower sampling rate for designing a discrete-time state feedback-based controller fails to capture information of fast states in a two-time-scale system, while the use of a higher sampling rate increases the amount of computation considerably. Thus,the use of single-rate sampling for systems with slow and fast states has evident limitations. In this paper, multirate state feedback(MRSF) control for a linear time-invariant two-time-scale system is proposed. Here, multirate sampling refers to the sampling of slow and fast states at different sampling rates. Firstly, a block-triangular form of the original continuous two-time-scale system is constructed. Then, it is discretized with a smaller sampling period and feedback control is designed for the fast subsystem. Later, the system is block-diagonalized and equivalently represented into a system with a higher sampling period. Subsequently, feedback control is designed for the slow subsystem and overall MRSF control is derived. It is proved that the derived MRSF control stabilizes the full-order system. Being the transformed states of the original system, slow and fast states need to be estimated for the MRSF control realization.Hence, a sequential two-stage observer is formulated to estimate these states. Finally, the applicability of the design method is demonstrated with a numerical example and simulation results are compared with the single-rate sampling method. It is found that the proposed MRSF control and observer designs reduce computations without compromising closed-loop performance.