dc.contributor.advisor | Langseth, Helge | nb_NO |
dc.contributor.author | Holen, Vidar | nb_NO |
dc.contributor.author | Marøy, Audun | nb_NO |
dc.date.accessioned | 2014-12-19T13:33:46Z | |
dc.date.available | 2014-12-19T13:33:46Z | |
dc.date.created | 2010-09-04 | nb_NO |
dc.date.issued | 2008 | nb_NO |
dc.identifier | 348629 | nb_NO |
dc.identifier | ntnudaim:4142 | nb_NO |
dc.identifier.uri | http://hdl.handle.net/11250/251262 | |
dc.description.abstract | Upper Confidence bounds applied to Trees, or UCT, has shown promise for reinforcement learning problems in different kinds of games, but most of the work has been on turn based games and single agent scenarios. In this project we test the feasibility of using UCT in an action-filled multi-agent environment, namely the RoboCup simulated soccer league. Through a series of experiments we test both low level and high level approaches. We were forced to conclude that low level approaches are infeasible, and that while high level learning is possible, cooperative multi-agent planning did not emerge. | nb_NO |
dc.language | eng | nb_NO |
dc.publisher | Institutt for datateknikk og informasjonsvitenskap | nb_NO |
dc.subject | ntnudaim | no_NO |
dc.subject | SIF2 datateknikk | no_NO |
dc.subject | Intelligente systemer | no_NO |
dc.title | Learning robot soccer with UCT | nb_NO |
dc.type | Master thesis | nb_NO |
dc.source.pagenumber | 69 | nb_NO |
dc.contributor.department | Norges teknisk-naturvitenskapelige universitet, Fakultet for informasjonsteknologi, matematikk og elektroteknikk, Institutt for datateknikk og informasjonsvitenskap | nb_NO |