A Novel Exploration-Exploitation-Based Adaptive Law for Intelligent Model-Free Control Approaches
dc.authorid | Erol Barkana, Duygun/0000-0002-8929-0459 | |
dc.authorid | Tutsoy, Onder/0000-0001-6385-3025 | |
dc.authorid | BALIKCI, KEMAL/0000-0001-6234-5627 | |
dc.contributor.author | Tutsoy, Önder | |
dc.contributor.author | Barkana, Duygun Erol | |
dc.contributor.author | Balikci, Kemal | |
dc.date.accessioned | 2025-01-06T17:36:05Z | |
dc.date.available | 2025-01-06T17:36:05Z | |
dc.date.issued | 2023 | |
dc.description.abstract | Model-free control approaches require advanced exploration-exploitation policies to achieve practical tasks such as learning to bipedal robot walk in unstructured environments. In this article, we first construct a comprehensive exploration-exploitation policy that carries quality knowledge about the long-term predictor and the control policy, and the control signal of the model-free algorithms. Therefore, the developed model-free algorithm continues exploration by adjusting its unknown parameters until the desired learning and control are accomplished. Second, we provide an utterly model-free adaptive law enriched with the exploration-exploitation policy and derived step-by-step using the exact analogy of the model-based solution. The obtained adaptive control law considers the control signal saturation and the control signal (input) delay. Performed Lyapunov stability analysis ensures the convergence of the adaptive law that can also be used for intelligent control approaches. Third, we implement the adaptive algorithm in real time on a challenging benchmark system: a fourth-order, coupled dynamics, input saturated, and time-delayed underactuated manipulator. The results show that the proposed adaptive algorithm explores larger state-action spaces and treats the vanishing gradient problem in both learning and control. Also, we notice from the results that the learning and control properties of the adaptive algorithm are optimized as required. | |
dc.description.sponsorship | Research Council (TUB.ITAK) [215E047]; Turkish Academy of Sciences in Scheme of the Outstanding Young Scientist Award | |
dc.description.sponsorship | This work was supported in part by the Research Council (TUB.ITAK) under Project 215E047, and in part by the Turkish Academy of Sciences in Scheme of the Outstanding Young Scientist Award (TUBAGEB.IP). This article was recommended by Associate Editor H. Han. | |
dc.identifier.doi | 10.1109/TCYB.2021.3091680 | |
dc.identifier.endpage | 337 | |
dc.identifier.issn | 2168-2267 | |
dc.identifier.issn | 2168-2275 | |
dc.identifier.issue | 1 | |
dc.identifier.pmid | 34398780 | |
dc.identifier.scopus | 2-s2.0-85113266677 | |
dc.identifier.scopusquality | Q1 | |
dc.identifier.startpage | 329 | |
dc.identifier.uri | https://doi.org/10.1109/TCYB.2021.3091680 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14669/1747 | |
dc.identifier.volume | 53 | |
dc.identifier.wos | WOS:000732282100001 | |
dc.identifier.wosquality | Q1 | |
dc.indekslendigikaynak | Web of Science | |
dc.indekslendigikaynak | Scopus | |
dc.indekslendigikaynak | PubMed | |
dc.language.iso | en | |
dc.publisher | IEEE-Inst Electrical Electronics Engineers Inc | |
dc.relation.ispartof | Ieee Transactions on Cybernetics | |
dc.relation.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | |
dc.rights | info:eu-repo/semantics/closedAccess | |
dc.snmz | KA_20241211 | |
dc.subject | Adaptation models | |
dc.subject | Process control | |
dc.subject | Predictive models | |
dc.subject | Real-time systems | |
dc.subject | Noise measurement | |
dc.subject | Analytical models | |
dc.subject | Manipulator dynamics | |
dc.subject | Adaptive law | |
dc.subject | exploitation | |
dc.subject | exploration | |
dc.subject | intelligent control | |
dc.subject | Lyapunov stability | |
dc.subject | model free | |
dc.subject | uncertainty | |
dc.subject | vanishing gradient | |
dc.title | A Novel Exploration-Exploitation-Based Adaptive Law for Intelligent Model-Free Control Approaches | |
dc.type | Article |