
Thesis (Bachelor's)

Reykjavík University > School of Technology > Department of Computer Science (BSc)

Please use this identifier to cite or link to this item: http://hdl.handle.net/1946/33574

  • Using SARSA with function approximation to create policies for MCTS
  • Bachelor's
  • This paper proposes CARL, a pair of agents that apply reinforcement learning and regression-based function approximation to learn policies for games where human heuristics cannot be applied. The policies are used for search control in Monte Carlo Tree Search (MCTS), a heuristic search algorithm, to test whether learned policies can outperform Upper Confidence bounds applied to Trees (UCT).

  • Jun 12, 2019
  • http://hdl.handle.net/1946/33574
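
The abstract names two ingredients: a SARSA learner with function approximation, and UCT as the baseline for search control. The thesis text is not reproduced on this page, so the following is only a generic textbook-style sketch of both, not the CARL implementation. It assumes a linear approximator (the abstract says only "regression"), and the names `uct_score`, `q_value`, `sarsa_update`, the feature vectors, and the constants `alpha`, `gamma`, and `c` are all hypothetical.

```python
import math


def uct_score(child_value_sum, child_visits, parent_visits, c=math.sqrt(2)):
    """UCB1-style score that UCT uses to choose a child during tree descent."""
    if child_visits == 0:
        return math.inf  # unvisited children are always tried first
    mean_value = child_value_sum / child_visits
    exploration = c * math.sqrt(math.log(parent_visits) / child_visits)
    return mean_value + exploration


def q_value(weights, features):
    """Linear approximation (an assumption): Q(s, a) = weights . features(s, a)."""
    return sum(w * x for w, x in zip(weights, features))


def sarsa_update(weights, features, reward, next_features,
                 alpha=0.1, gamma=0.9, terminal=False):
    """One semi-gradient SARSA step; returns a new weight list."""
    # On-policy target: uses the action actually chosen in the next state.
    target = reward
    if not terminal:
        target += gamma * q_value(weights, next_features)
    td_error = target - q_value(weights, features)
    # For a linear model the gradient of Q w.r.t. the weights is the feature vector.
    return [w + alpha * td_error * x for w, x in zip(weights, features)]


if __name__ == "__main__":
    w = sarsa_update([0.0, 0.0], [1.0, 0.0], reward=1.0,
                     next_features=[0.0, 1.0], alpha=0.5)
    print(w)
    print(uct_score(child_value_sum=3.0, child_visits=4, parent_visits=16, c=1.0))
```

In a learned-policy variant of MCTS, a score derived from `q_value` could stand in for or supplement `uct_score` when selecting moves. How CARL actually combines the two is described in the thesis itself, not here.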

Files in This Item:
Filename       Size       Visibility  Description    Format
CarlAgent.pdf  776.96 kB  Open        Complete Text  PDF