Компьютерные исследования и моделирование (Dec 2016)

Model of formation of primary behavioral patterns with adaptive behavior based on the combination of random search and experience

  • Arthur Gevorkievich Tumanyan,
  • Sergei Igorevich Bartsev

DOI
https://doi.org/10.20537/2076-7633-2016-8-6-941-950
Journal volume & issue
Vol. 8, no. 6
pp. 941 – 950

Abstract

Read online

In this paper, we propose an adaptive algorithm that simulates the process of forming the initial behavioral skills on the example of the system eye-arm animat. The situation is the formation of the initial behavioral skills occurs, for example, when a child masters the management of their hands by understanding the relationship between baseline unidentified spots on the retina of his eye and the position of the real object. Since the body control skills are not hardcoded initially in the brain and the spinal cord at the level of instincts, the human child, like most young of other mammals, it is necessary to develop these skills in search behavior mode. Exploratory behavior begins with trial and error and then its contribution is gradually reduced as the development of the body and its environment. Since the correct behavior patterns at this stage of development of the organism does not exist for now, then the only way to select the right skills is a positive reinforcement to achieve the objective. A key feature of the proposed algorithm is to fix in the imprinting mode, only the final action that led to success, and that is very important, led to the familiar imprinted situation clearly leads to success. Over time, the continuous chain is lengthened right action - maximum use of previous positive experiences and negative forgotten and not used. Thus there is the gradual replacement of the random search purposeful actions that observed in the real young. Thus, the algorithm is able to establish a correspondence between the laws of the world and the inner feelings, the internal state of the animat. The proposed animat model was used 2 types of neural networks: 1) neural network NET1 to the input current which is fed to the position of the brush arms and the target point, and the output of motor commands, directing brush manipulator animat to the target point; 2) neural network NET2 is received at the input of target coordinates and the current coordinates of the brush and the output value is formed likelihood that the animat already know this situation, and he knows how to react to it. With this architecture at the animat has to rely on the experience of neural networks to recognize situations where the response from NET2 network of close to 1, and on the other hand, run a random search, when the experience of functioning in this area of the visual field in animat not (response NET2 close to 0).

Keywords