Hluboké posilované učení pro hru Super Mario Bros

Ondřej Schejbal

Deep Reinforcement Learning for Super Mario Bros

dc.contributor.advisor	Vašata Daniel
dc.contributor.author	Ondřej Schejbal
dc.date.accessioned	2022-06-02T22:54:19Z
dc.date.available	2022-06-02T22:54:19Z
dc.date.issued	2022-06-02
dc.identifier	KOS-1179143724105
dc.identifier.uri	http://hdl.handle.net/10467/101068
dc.description.abstract	V rámci této diplomové práce byl připraven odladěný model posilovaného učení, který je schopný natrénování inteligentního agenta způsobilého hrát hru Super Mario Bros.. Jeho architektura je založena na provedeném průzkumu aktuálních state-of-the-art technik posilovaného učení, kde mezi sebou byly porovnány modely, které jsou pro tento typ úlohy nejvíce relevantní. Pro možnost porovnání modelů byl proveden průzkum a popis nástrojů, které umožňují interakci modelů s hrou. Na základě výsledků porovnání modelů byla vybrána nejvhodnější metoda. Následně byly provedeny experimenty s aplikováním rozmanitých modifikací na vybraný model za účelem najít nejvhodnější úpravy pro hru Super Mario Bros.. Odladěný model byl následně použit k natrénování inteligentního agenta, jehož výkony byly otestovány na úrovni, na které byl natrénován a také na dalších dvou úrovních, které nikdy neviděl. Výkony agenta byly velmi dobré a ukázaly pěkné vzorce chování hlavně na úrovních, na kterých byl natrénován, ačkoliv jeho výkon na neznámých úrovních byl pochopitelně horší.	cze
dc.description.abstract	Within this master's thesis, a fine-tuned reinforcement learning model capable of preparing an intelligent agent able to play the Super Mario Bros. game has been created. Its architecture is based on conducted research on current state-of-the-art reinforcement learning techniques where the most relevant models for this type of task have been compared between each other. In order to compare the models, research and description of tools that allow the model to interact with the game had been done. Based on the comparison results, the most suitable approach was selected. Experiments with applying various modifications to the selected model have been done in order to find the most suitable modifications for the Super Mario Bros. game. The fine-tuned model has been used to train an intelligent agent, whose performances were tested on the level he was trained on and also on two levels that he had never seen before. The agent's performances were really good and showed nice behavioral patterns, mainly on the level he was trained on, as his performance on the unseen levels was understandably worse.	eng
dc.publisher	České vysoké učení technické v Praze. Vypočetní a informační centrum.	cze
dc.publisher	Czech Technical University in Prague. Computing and Information Centre.	eng
dc.rights	A university thesis is a work protected by the Copyright Act. Extracts, copies and transcripts of the thesis are allowed for personal use only and at one?s own expense. The use of thesis should be in compliance with the Copyright Act http://www.mkcr.cz/assets/autorske-pravo/01-3982006.pdf and the citation ethics http://knihovny.cvut.cz/vychova/vskp.html	eng
dc.rights	Vysokoškolská závěrečná práce je dílo chráněné autorským zákonem. Je možné pořizovat z něj na své náklady a pro svoji osobní potřebu výpisy, opisy a rozmnoženiny. Jeho využití musí být v souladu s autorským zákonem http://www.mkcr.cz/assets/autorske-pravo/01-3982006.pdf a citační etikou http://knihovny.cvut.cz/vychova/vskp.html	cze
dc.subject	Hluboké posilované učení	cze
dc.subject	Deep Q-learning	cze
dc.subject	Asynchronous Advantage Actor-Critic	cze
dc.subject	Twin Delayed Deep Deterministic policy gradient algorithm	cze
dc.subject	AI agent	cze
dc.subject	OpenAI Gym	cze
dc.subject	Super Mario Bros	cze
dc.subject	Deep Reinforcement Learning	eng
dc.subject	Deep Q-learning	eng
dc.subject	Asynchronous Advantage Actor-Critic	eng
dc.subject	Twin Delayed Deep Deterministic policy gradient algorithm	eng
dc.subject	AI agent	eng
dc.subject	OpenAI Gym	eng
dc.subject	Super Mario Bros	eng
dc.title	Hluboké posilované učení pro hru Super Mario Bros	cze
dc.title	Deep Reinforcement Learning for Super Mario Bros	eng
dc.type	diplomová práce	cze
dc.type	master thesis	eng
dc.contributor.referee	Novák Petr
theses.degree.discipline	Znalostní inženýrství	cze
theses.degree.grantor	katedra aplikované matematiky	cze
theses.degree.programme	Informatika	cze

Soubory tohoto záznamu

Název:: F8-DP-2022-Schejbal-Ondrej-the ...
Velikost:: 910.8Kb
Formát:: PDF
Popis:: PLNY_TEXT
: Zobrazit/otevřít

Název:: F8-DP-2022-posudek-Novak_Petr.pdf
Velikost:: 48.29Kb
Formát:: PDF
Popis:: POSUDEK
: Zobrazit/otevřít

Název:: F8-DP-2022-posudek-Vasata_Dani ...
Velikost:: 43.75Kb
Formát:: PDF
Popis:: POSUDEK
: Zobrazit/otevřít

Tento záznam se objevuje v následujících kolekcích

Diplomové práce - 18105 [235]

Zobrazit minimální záznam