Reinforcement Learning for Racecar Control

Cleland, Benjamin George

Reinforcement Learning for Racecar Control

dc.contributor.author	Cleland, Benjamin George	en_NZ
dc.date.accessioned	2006-05-17T13:52:13Z
dc.date.available	2007-04-20T16:09:19Z
dc.date.issued	2006	en_NZ
dc.description.abstract	This thesis investigates the use of reinforcement learning to learn to drive a racecar in the simulated environment of the Robot Automobile Racing Simulator. Real-life race driving is known to be difficult for humans, and expert human drivers use complex sequences of actions. There are a large number of variables, some of which change stochastically and all of which may affect the outcome. This makes driving a promising domain for testing and developing Machine Learning techniques that have the potential to be robust enough to work in the real world. Therefore the principles of the algorithms from this work may be applicable to a range of problems. The investigation starts by finding a suitable data structure to represent the information learnt. This is tested using supervised learning. Reinforcement learning is added and roughly tuned, and the supervised learning is then removed. A simple tabular representation is found satisfactory, and this avoids difficulties with more complex methods and allows the investigation to concentrate on the essentials of learning. Various reward sources are tested and a combination of three are found to produce the best performance. Exploration of the problem space is investigated. Results show exploration is essential but controlling how much is done is also important. It turns out the learning episodes need to be very long and because of this the task needs to be treated as continuous by using discounting to limit the size of the variables stored. Eligibility traces are used with success to make the learning more efficient. The tabular representation is made more compact by hashing and more accurate by using smaller buckets. This slows the learning but produces better driving. The improvement given by a rough form of generalisation indicates the replacement of the tabular method by a function approximator is warranted. These results show reinforcement learning can work within the Robot Automobile Racing Simulator, and lay the foundations for building a more efficient and competitive agent.	en_NZ
dc.format.mimetype	application/pdf
dc.identifier.citation	Cleland, B. G. (2006). Reinforcement Learning for Racecar Control (Thesis, Master of Science (MSc)). The University of Waikato, Hamilton, New Zealand. Retrieved from https://hdl.handle.net/10289/2507	en
dc.identifier.uri	https://hdl.handle.net/10289/2507
dc.language.iso	en
dc.publisher	The University of Waikato	en_NZ
dc.rights	All items in Research Commons are provided for private study and research purposes and are protected by copyright with all rights reserved unless otherwise indicated.
dc.subject	Artificial Intelligence	en_NZ
dc.subject	Reinforcement Learning	en_NZ
dc.subject	Q-learning	en_NZ
dc.subject	Multiple Reward Sources	en_NZ
dc.subject	Control of Exploration	en_NZ
dc.subject	Inherent Exploration	en_NZ
dc.subject	Tabular Representation	en_NZ
dc.subject	Machine Learning.	en_NZ
dc.title	Reinforcement Learning for Racecar Control	en_NZ
dc.type	Thesis	en_NZ
pubs.place-of-publication	Hamilton, New Zealand	en_NZ
thesis.degree.discipline	School of Computing and Mathematical Sciences	en_NZ
thesis.degree.grantor	University of Waikato	en_NZ
thesis.degree.level	Masters
thesis.degree.name	Master of Science (MSc)	en_NZ
uow.date.accession	2006-05-17T13:52:13Z	en_NZ
uow.date.available	2007-04-20T16:09:19Z	en_NZ
uow.date.migrated	2009-06-09T23:34:46Z	en_NZ
uow.identifier.adt	http://adt.waikato.ac.nz/public/adt-uow20060517.135213	en_NZ

Files

Original bundle

Now showing 1 - 1 of 1

Name:: thesis.pdf
Size:: 7.19 MB
Format:: Adobe Portable Document Format

Name: thesis.pdf

Size: 7.19 MB

Kind: Adobe PDF

Collections

Masters Degree Theses