NetNews Usenet Archive 1992 #19

home *** CD-ROM | disk | FTP | other *** search

/ NetNews Usenet Archive 1992 #19 / NN_1992_19.iso / spool / comp / robotics / 1689 < prev next >

Wrap

Internet Message Format | 1992-09-02 | 2.4 KB

Path: sparky!uunet!mcsun!uknet!cam-eng!ckt From: ckt@eng.cam.ac.uk (C.K. Tham) Newsgroups: comp.robotics Subject: Reinforcement Learning for Robot Control Message-ID: <1992Sep2.171431.12671@eng.cam.ac.uk> Date: 2 Sep 92 17:14:31 GMT Sender: ckt@eng.cam.ac.uk (C.K. Tham) Distribution: comp.robotics Organization: Cambridge University Engineering Department, UK Lines: 50 Nntp-Posting-Host: dsl.eng.cam.ac.uk TECHNICAL REPORT ANNOUNCEMENT The following technical report is available via anonymous ftp: REINFORCEMENT LEARNING FOR MULTI-LINKED MANIPULATOR CONTROL by Chen K. Tham & Richard W. Prager. (Technical Report CUED/F-INFENG/TR104) Abstract: We present a trajectory planning and obstacle avoidance method which uses Reinforcement Learning to learn the appropriate real-valued torques to apply at each joint of a simulated two-linked manipulator in order to move the end-effector to a desired destination in the workspace. The inputs to the controller are the joint positions and velocities which are fed directly into a Cerebellar Model Arithmetic Computer (CMAC) (Albus,75). In each state, the expected reward and appropriate torques for each joint are learnt through self-experimentation using a combination of the Temporal Difference (TD) technique (Sutton,87) and stochastic hillclimbing (Williams,88). Actions which cause the manipulator to reach the desired destination are rewarded whereas actions which lead to collisions with either joint limits or obstacles are punished by an amount proportional to the velocity before collision. After training, the manipulator is able to move along smooth collision-free paths from different start positions in the workspace to the destination. The file is in compressed Postscript format. Procedure for obtaining the report: unix> ftp svr-ftp.eng.cam.ac.uk Name: anonymous Password: (your e-mail address) ftp> cd reports ftp> binary ftp> get tham_tr104.ps.Z ftp> quit unix> uncompress tham_tr104.ps.Z unix> lpr tham_tr104.ps .. etc. .. to print The authors welcome comments and suggestions from readers. ---------------------------------------------------------------------------- Chen K. THAM, E-mail: ckt@eng.cam.ac.uk Speech, Vision and Robotics Group, Tel. : +44 223 332754 Cambridge University Engineering Department, Fax : +44 223 332662 Trumpington Street, Cambridge CB2 1PZ, United Kingdom.