Výpočet Stackelbergových strategií s pamětí v sekvenčních hrách

Hlaváček Michael

Computing Stackelberg equilibrium with Memory in Sequential Games

Type of document

diplomová práce
master thesis

Author

Hlaváček Michael

Supervisor

Bošanský Branislav

Opponent

Velan Dominik

Field of study

Umělá inteligence

Study program

Otevřená informatika

Institutions assigning rank

katedra počítačů

Rights

A university thesis is a work protected by the Copyright Act. Extracts, copies and transcripts of the thesis are allowed for personal use only and at one?s own expense. The use of thesis should be in compliance with the Copyright Act http://www.mkcr.cz/assets/autorske-pravo/01-3982006.pdf and the citation ethics http://knihovny.cvut.cz/vychova/vskp.html
Vysokoškolská závěrečná práce je dílo chráněné autorským zákonem. Je možné pořizovat z něj na své náklady a pro svoji osobní potřebu výpisy, opisy a rozmnoženiny. Jeho využití musí být v souladu s autorským zákonem http://www.mkcr.cz/assets/autorske-pravo/01-3982006.pdf a citační etikou http://knihovny.cvut.cz/vychova/vskp.html

Metadata

Show full item record

Abstract

Tato práce se zabývá problémem representace Stackelbergových strategii v kompaktně representovaných sekvenčních hrách pomocí strategií s pamětí, a hledáním takových strategií. Dokazujeme že pro hry, které jsou reprezentované orientovaným acyklickým grafem bez náhodných stavů, stejně jako pro stochastické hry na grafech bez náhodných stavů, postačuje lineární množství pamětových stavů. Pro případ orientovaných acyklických grafů předkládáme polynomiální algoritmus pro hledání Stackelbergových strategií s pamětí. Pro hry na orientovaném acyklickém grafu s náhodnými stavy existuje kvadratická mez na potřebné paměťové stavy, avšak nalezení takové strategie je NP-těžké. Prezentujeme aditivní aproximační algoritmus pro tuto třídu her a experimentálně jej testujeme na náhodně generovaných hrách.

In this thesis we consider the problem of representing Stackelberg equilibria (sometimes called leader-follower equilibria) in compactly represented sequental games as strategies with memory, and the problem of finding such equilibria. We show that in games represented by a directed acyclic graph without chance nodes and in stochastic games without chance nodes, linear number of memory states suffice. We provide polynomial time algorithm for the DAG case. In games represented by a DAG with chance nodes, we give a quadratic bound on used memory states. We show that the problem of finding such strategies is NP-hard. We provide an additive aproximation algorithm for such games, and experimentally evaluate this algorithm on randomly generated games.