Adaptive estimator design for unstable output error systems: A test problem and traditional system identification based analysis
dc.authorid | Tutsoy, Onder/0000-0001-6385-3025 | |
dc.contributor.author | Tutsoy, Önder | |
dc.contributor.author | Colak, Sule | |
dc.date.accessioned | 2025-01-06T17:36:05Z | |
dc.date.available | 2025-01-06T17:36:05Z | |
dc.date.issued | 2015 | |
dc.description.abstract | A key open question in adaptive estimator design is how to assure that the parameters of the proposed algorithms are converging to their almost correct solutions; hence, the learning algorithm is unbiased. Moreover, determining the speed of parameter convergence is important as it provides insight about the performance of the learning algorithms. The main contributions of the article are fourfold: the first one is that the article, initially, introduces an adaptive estimator to learn the discounted Q-function and approximate optimal control policy without requiring linear, discrete time, unstable output error system dynamics, but using only the noisy system measurements. The simulation results show that the adaptive estimator minimizes the stochastic cost function and temporal difference error and also learns the approximate Q-function together with the control policy. The second one is consideration of a different approach by taking a simple test problem to investigate issues associated with the Q-function's representation and parametric convergence. In particular, the terminal convergence problem is analyzed with a known optimal control policy where the aim is to accurately learn only the Q-function. It is parameterized by terms which are functions of the unknown plant's parameters and the Q-function's discount factor, and their convergence properties are analyzed and compared with the adaptive estimator. The third one is to show that even though the adaptive estimator with a large Q-function discount factor yields larger control feedback gains, so that faster state converges upright, the learning problem is badly conditioned; hence, the parameter convergence is sluggish, as the Q-function discount factor approaches the inverse of the dominant pole of the unstable system. Finally, the fourth one is comparison of the state output learned by the adaptive estimator with the ones obtained from traditional system identification algorithms. Simulation result for a higher order unstable output error system shows that the adaptive estimator closely follows the real system output whereas the system identification algorithms do not. | |
dc.description.sponsorship | Turkish Science and Technology Research Department | |
dc.description.sponsorship | The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This study was financially supported by the Turkish Science and Technology Research Department. | |
dc.identifier.doi | 10.1177/0959651815603910 | |
dc.identifier.endpage | 916 | |
dc.identifier.issn | 0959-6518 | |
dc.identifier.issn | 2041-3041 | |
dc.identifier.issue | 10 | |
dc.identifier.scopus | 2-s2.0-84944053526 | |
dc.identifier.scopusquality | Q2 | |
dc.identifier.startpage | 902 | |
dc.identifier.uri | https://doi.org/10.1177/0959651815603910 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14669/1750 | |
dc.identifier.volume | 229 | |
dc.identifier.wos | WOS:000362674600002 | |
dc.identifier.wosquality | Q3 | |
dc.indekslendigikaynak | Web of Science | |
dc.indekslendigikaynak | Scopus | |
dc.language.iso | en | |
dc.publisher | Sage Publications Ltd | |
dc.relation.ispartof | Proceedings of The Institution of Mechanical Engineers Part I-Journal of Systems and Control Engineering | |
dc.relation.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | |
dc.rights | info:eu-repo/semantics/closedAccess | |
dc.snmz | KA_20241211 | |
dc.subject | Adaptive estimator | |
dc.subject | badly conditioned learning | |
dc.subject | closed-loop identification | |
dc.subject | discounted Q-function | |
dc.subject | parameter convergence analysis | |
dc.subject | unknown and unstable linear system with random output error-type noise | |
dc.title | Adaptive estimator design for unstable output error systems: A test problem and traditional system identification based analysis | |
dc.type | Article |