Dimensionality Effects on the Markov Property in Shape Memory Alloy Hysteretic Environment
Shape Memory Alloy (SMA) actuators can be used for morphing, or shape change, by controlling their temperature, which is done effectively by applying a voltage difference across their length. Controlling these actuators requires determining the relationship between voltage and strain so that an input-output map can be developed. To determine this policy and map the hysteretic region, a Reinforcement Learning algorithm called Sarsa was used. Proper use of Reinforcement Learning requires that the learning environment have the Markov Property. However, hysteresis spaces are commonly regarded as non-Markovian because state history is needed to properly predict future states and rewards. This paper shows that the formerly non-Markovian learning environment of SMA hysteresis can be made Markovian by increasing the dimensionality of the measured states. The paper compares learning attempts in both versions of the environment and shows that Reinforcement Learning succeeds in the modified environment, learning a near-optimal policy for controlling the length of an SMA wire. This result is then validated by using the modified Reinforcement Learning agent to learn a near-optimal control policy in an experimental setting. ©2009 IEEE.
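To make the idea concrete, the sketch below shows a single tabular Sarsa update on an augmented state. The specific state components (strain paired with temperature), the action encoding, and all numeric values are illustrative assumptions, not details from the paper; the point is only that adding a second measured quantity to the state is what lets the update treat the environment as Markovian.

```python
from collections import defaultdict

def sarsa_update(Q, s, a, r, s_next, a_next, alpha=0.1, gamma=0.9):
    # Tabular Sarsa: Q(s,a) <- Q(s,a) + alpha * (r + gamma*Q(s',a') - Q(s,a))
    Q[(s, a)] += alpha * (r + gamma * Q[(s_next, a_next)] - Q[(s, a)])
    return Q[(s, a)]

Q = defaultdict(float)  # action-value table, initialized to zero

# Augmented state: strain alone is non-Markovian under hysteresis, so the
# state here (hypothetically) pairs strain with temperature.
s = (0.020, 60.0)       # (strain, temperature) -- assumed units
a = 1                   # discrete voltage-step action (assumed encoding)
r = -0.5                # reward, e.g. negative tracking error (assumed)
s_next = (0.025, 62.0)
a_next = 1

q = sarsa_update(Q, s, a, r, s_next, a_next)
print(q)  # first update from an all-zero table: alpha * r = -0.05
```

In a full learning loop this update would run at every step, with `a_next` chosen by an epsilon-greedy policy over the augmented states.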
Authors
Kirkpatrick, K., & Valasek, J.