Show simple item record



dc.contributor.authorLisy , Viliam
dc.date.accessioned2017-02-09T09:33:43Z
dc.date.available2017-02-09T09:33:43Z
dc.date.issued2014
dc.identifier.citationActa Polytechnica. 2014, vol. 54, no. 5, p. 333-340.
dc.identifier.issn1210-2709 (print)
dc.identifier.issn1805-2363 (online)
dc.identifier.urihttp://hdl.handle.net/10467/67160
dc.description.abstractWe evaluate the performance of various selection methods for the Monte Carlo Tree Search algorithm in two-player zero-sum extensive-form games with imperfect information. We compare the standard Upper Confident Bounds applied to Trees (UCT) along with the less common Exponential Weights for Exploration and Exploitation (Exp3) and novel Regret matching (RM) selection in two distinct imperfect information games: Imperfect Information Goofspiel and Phantom Tic-Tac-Toe. We show that UCT after initial fast convergence towards a Nash equilibrium computes increasingly worse strategies after some point in time. This is not the case with Exp3 and RM, which also show superior performance in head-to-head matches.en
dc.format.mimetypeapplication/pdf
dc.language.isoeng
dc.publisherČeské vysoké učení technické v Prazecs
dc.publisherCzech Technical University in Pragueen
dc.relation.ispartofseriesActa Polytechnica
dc.relation.urihttps://ojs.cvut.cz/ojs/index.php/ap/article/view/2230
dc.rightsCreative Commons Attribution 4.0 International Licenseen
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.titleALTERNATIVE SELECTION FUNCTIONS FOR INFORMATION SET MONTE CARLO TREE SEARCH
dc.typearticleen
dc.date.updated2017-02-09T09:33:43Z
dc.identifier.doi10.14311/AP.2014.54.0333
dc.rights.accessopenAccess
dc.type.statusPeer-reviewed
dc.type.versionpublishedVersion


Files in this item



This item appears in the following Collection(s)

Show simple item record

Creative Commons Attribution 4.0 International License
Except where otherwise noted, this item's license is described as Creative Commons Attribution 4.0 International License