Adaptive estimator design for unstable output error systems: A test problem and traditional system identification based analysis

Tutsoy, Önder; Colak, Sule

dc.authorid	Tutsoy, Onder/0000-0001-6385-3025
dc.contributor.author	Tutsoy, Önder
dc.contributor.author	Colak, Sule
dc.date.accessioned	2025-01-06T17:36:05Z
dc.date.available	2025-01-06T17:36:05Z
dc.date.issued	2015
dc.description.abstract	A key open question in adaptive estimator design is how to ensure that the parameters of the proposed algorithms converge to nearly correct solutions, that is, that the learning algorithm is unbiased. Moreover, determining the speed of parameter convergence is important because it provides insight into the performance of the learning algorithms. The main contributions of the article are fourfold. First, the article introduces an adaptive estimator that learns the discounted Q-function and an approximate optimal control policy without requiring knowledge of the linear, discrete-time, unstable output error system dynamics, using only noisy system measurements. The simulation results show that the adaptive estimator minimizes the stochastic cost function and the temporal difference error and also learns the approximate Q-function together with the control policy. Second, a different approach is considered: a simple test problem is used to investigate issues associated with the Q-function's representation and parametric convergence. In particular, the terminal convergence problem is analyzed with a known optimal control policy, where the aim is to accurately learn only the Q-function. The Q-function is parameterized by terms that are functions of the unknown plant's parameters and the Q-function's discount factor, and their convergence properties are analyzed and compared with those of the adaptive estimator. Third, it is shown that although the adaptive estimator with a large Q-function discount factor yields larger control feedback gains, and hence faster state convergence, the learning problem is badly conditioned, so parameter convergence is sluggish as the Q-function discount factor approaches the inverse of the dominant pole of the unstable system. Fourth, the state output learned by the adaptive estimator is compared with those obtained from traditional system identification algorithms. Simulation results for a higher order unstable output error system show that the adaptive estimator closely follows the real system output, whereas the system identification algorithms do not.
dc.description.sponsorship	Turkish Science and Technology Research Department
dc.description.sponsorship	The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This study was financially supported by the Turkish Science and Technology Research Department.
dc.identifier.doi	10.1177/0959651815603910
dc.identifier.endpage	916
dc.identifier.issn	0959-6518
dc.identifier.issn	2041-3041
dc.identifier.issue	10
dc.identifier.scopus	2-s2.0-84944053526
dc.identifier.scopusquality	Q2
dc.identifier.startpage	902
dc.identifier.uri	https://doi.org/10.1177/0959651815603910
dc.identifier.uri	https://hdl.handle.net/20.500.14669/1750
dc.identifier.volume	229
dc.identifier.wos	WOS:000362674600002
dc.identifier.wosquality	Q3
dc.indekslendigikaynak	Web of Science
dc.indekslendigikaynak	Scopus
dc.language.iso	en
dc.publisher	Sage Publications Ltd
dc.relation.ispartof	Proceedings of The Institution of Mechanical Engineers Part I-Journal of Systems and Control Engineering
dc.relation.publicationcategory	Article - International Refereed Journal - Institutional Faculty Member
dc.rights	info:eu-repo/semantics/closedAccess
dc.snmz	KA_20241211
dc.subject	Adaptive estimator
dc.subject	badly conditioned learning
dc.subject	closed-loop identification
dc.subject	discounted Q-function
dc.subject	parameter convergence analysis
dc.subject	unknown and unstable linear system with random output error-type noise
dc.title	Adaptive estimator design for unstable output error systems: A test problem and traditional system identification based analysis
dc.type	Article
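
The abstract's remark that a larger Q-function discount factor yields larger control feedback gains can be illustrated with a minimal sketch. It is not the article's adaptive estimator: it assumes a scalar, noise-free plant x[k+1] = a*x[k] + b*u[k] with known parameters and a quadratic stage cost q*x^2 + r*u^2, and every name and numeric value (`discounted_lqr_gain`, a = 1.2, b = q = r = 1) is hypothetical, chosen only for illustration.

```python
def discounted_lqr_gain(a, b, q, r, gamma, iters=2000):
    """Model-based value iteration for a scalar discounted LQR problem.

    Assumed setup (not taken from the article):
      plant      x[k+1] = a*x[k] + b*u[k]
      stage cost q*x^2 + r*u^2, discounted by gamma
      Q-function Q(x, u) = q*x^2 + r*u^2 + gamma*p*(a*x + b*u)^2
    Returns the greedy feedback gain k (u = -k*x) and the value weight p.
    """
    p = q  # initial guess for the value weight, V(x) = p*x^2
    for _ in range(iters):
        # Greedy gain: minimizes Q(x, u) over u for the current p.
        k = gamma * a * b * p / (r + gamma * b * b * p)
        # Discounted Bellman backup under that gain.
        p = q + r * k * k + gamma * (a - b * k) ** 2 * p
    k = gamma * a * b * p / (r + gamma * b * b * p)
    return k, p


if __name__ == "__main__":
    a, b = 1.2, 1.0  # hypothetical unstable plant (pole at 1.2)
    for gamma in (0.5, 0.7, 0.9, 0.99):
        k, p = discounted_lqr_gain(a, b, q=1.0, r=1.0, gamma=gamma)
        print(f"gamma={gamma:.2f}  gain={k:.4f}  closed-loop pole={a - b * k:.4f}")
```

In this sketch the gain grows with the discount factor, so the closed-loop pole moves closer to the origin. It says nothing about the conditioning of the learning problem, which the article analyzes separately.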

Collection

WoS Indexed Publications Collection
Scopus Indexed Publications Collection
