Adaptive estimator design for unstable output error systems: A test problem and traditional system identification based analysis

Tutsoy, Önder; Colak, Sule

Adaptive estimator design for unstable output error systems: A test problem and traditional system identification based analysis

Tarih

2015

Yazarlar

Tutsoy, Önder

Colak, Sule

Yayıncı

Sage Publications Ltd

Erişim Hakkı

info:eu-repo/semantics/closedAccess

Özet

A key open question in adaptive estimator design is how to assure that the parameters of the proposed algorithms are converging to their almost correct solutions; hence, the learning algorithm is unbiased. Moreover, determining the speed of parameter convergence is important as it provides insight about the performance of the learning algorithms. The main contributions of the article are fourfold: the first one is that the article, initially, introduces an adaptive estimator to learn the discounted Q-function and approximate optimal control policy without requiring linear, discrete time, unstable output error system dynamics, but using only the noisy system measurements. The simulation results show that the adaptive estimator minimizes the stochastic cost function and temporal difference error and also learns the approximate Q-function together with the control policy. The second one is consideration of a different approach by taking a simple test problem to investigate issues associated with the Q-function's representation and parametric convergence. In particular, the terminal convergence problem is analyzed with a known optimal control policy where the aim is to accurately learn only the Q-function. It is parameterized by terms which are functions of the unknown plant's parameters and the Q-function's discount factor, and their convergence properties are analyzed and compared with the adaptive estimator. The third one is to show that even though the adaptive estimator with a large Q-function discount factor yields larger control feedback gains, so that faster state converges upright, the learning problem is badly conditioned; hence, the parameter convergence is sluggish, as the Q-function discount factor approaches the inverse of the dominant pole of the unstable system. Finally, the fourth one is comparison of the state output learned by the adaptive estimator with the ones obtained from traditional system identification algorithms. Simulation result for a higher order unstable output error system shows that the adaptive estimator closely follows the real system output whereas the system identification algorithms do not.

Anahtar Kelimeler

Adaptive estimator, badly conditioned learning, closed-loop identification, discounted Q-function, parameter convergence analysis, unknown and unstable linear system with random output error-type noise

Kaynak

Proceedings of The Institution of Mechanical Engineers Part I-Journal of Systems and Control Engineering

WoS Q Değeri

Q3

Scopus Q Değeri

Q2

Cilt

229

Sayı

10

Bağlantı

https://doi.org/10.1177/0959651815603910
https://hdl.handle.net/20.500.14669/1750

Koleksiyon

WoS İndeksli Yayınlar Koleksiyonu
Scopus İndeksli Yayınlar Koleksiyonu

Detaylı Öğe Kaydı

Adaptive estimator design for unstable output error systems: A test problem and traditional system identification based analysis

Tarih

Yazarlar

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Yayıncı

Erişim Hakkı

Özet

Açıklama

Anahtar Kelimeler

Kaynak

WoS Q Değeri

Scopus Q Değeri

Cilt

Sayı

Künye

Bağlantı

Koleksiyon