Statická detekce malware využívající rekurentní neuronové sítě

Matouš Kozák

Static malware detection using recurrent neural networks

dc.contributor.advisor	Jureček Martin
dc.contributor.author	Matouš Kozák
dc.date.accessioned	2020-06-20T22:51:54Z
dc.date.available	2020-06-20T22:51:54Z
dc.date.issued	2020-06-20
dc.identifier	KOS-862365810305
dc.identifier.uri	http://hdl.handle.net/10467/88342
dc.description.abstract	Neustále rostoucí počty útoků škodlivých programů na naši IT infrastrukturu si žádají nové a lepší metody ochrany. V této bakalářské práci se věnujeme využití rekurentních neuronových sítí pro rychlou a přesnou detekci malwaru. Pro reprezentaci podezřelých programů jsme využili pouze data extrahovaná ze souborů v PE formátu. Tato data jsme dále použili pro trénink různých druhů rekurentních neuronových sítí. V práci představujeme speciální architekturu neuronové sítě, kombinující husté a LSTM vrstvy, ke klasifikaci PE souborů. Pracovali jsme s naším vlastním datasetem obsahujícím 30 154 souborů stažených z dostupných zdrojů. S tímto datasetem, který je rovnoměrně rozdělen mezi čisté a škodlivé soubory, jsme dosáhli přesnosti 98,41 % s pouze 0,5 % legitimních programů mylně klasifikovaných jako malware. K těmto výsledkům nám stačilo pouhých 250 iterací přes tréninkový soubor vzorků k naučení naší sítě. Výsledky dokazují, že algoritmy strojového učení, hlavně LSTM sítě, mohou být využity jako rychlý a spolehlivý nástroj pro detekci škodlivých souborů.	cze
dc.description.abstract	An ever-growing number of malicious attacks on our IT infrastructure calls for new and better methods of protection. In this thesis, we focus on the use of recurrent neural networks as a fast and accurate way of detecting malware. To represent the suspicious programs, we used only features extracted from files in the PE file format, and we used these features to train various types of recurrent neural networks. We present a unique neural network architecture that combines dense and stacked LSTM layers to classify PE files. We worked with our own dataset of 30,154 files collected from available resources. On this dataset, which is balanced between benign and malicious files, we achieved an accuracy of 98.41%, with only 0.5% of benign samples misclassified as malware. Training the network required only 250 epochs. These results prove that machine-learning algorithms, especially LSTM networks, can be used as a quick and reliable tool for malware detection.	eng
dc.publisher	České vysoké učení technické v Praze. Výpočetní a informační centrum.	cze
dc.publisher	Czech Technical University in Prague. Computing and Information Centre.	eng
dc.rights	A university thesis is a work protected by the Copyright Act. Extracts, copies and transcripts of the thesis are allowed for personal use only and at one's own expense. The use of the thesis should be in compliance with the Copyright Act http://www.mkcr.cz/assets/autorske-pravo/01-3982006.pdf and the citation ethics http://knihovny.cvut.cz/vychova/vskp.html	eng
dc.rights	Vysokoškolská závěrečná práce je dílo chráněné autorským zákonem. Je možné pořizovat z něj na své náklady a pro svoji osobní potřebu výpisy, opisy a rozmnoženiny. Jeho využití musí být v souladu s autorským zákonem http://www.mkcr.cz/assets/autorske-pravo/01-3982006.pdf a citační etikou http://knihovny.cvut.cz/vychova/vskp.html	cze
dc.subject	detekce malware	cze
dc.subject	rekurentní neuronové sítě	cze
dc.subject	LSTM	cze
dc.subject	statická analýza	cze
dc.subject	PE soubory	cze
dc.subject	strojové učení	cze
dc.subject	malware detection	eng
dc.subject	recurrent neural networks	eng
dc.subject	LSTM	eng
dc.subject	static analysis	eng
dc.subject	PE files	eng
dc.subject	machine learning	eng
dc.title	Statická detekce malware využívající rekurentní neuronové sítě	cze
dc.title	Static malware detection using recurrent neural networks	eng
dc.type	bakalářská práce	cze
dc.type	bachelor thesis	eng
dc.contributor.referee	Lórencz Róbert
theses.degree.discipline	Teoretická informatika	cze
theses.degree.grantor	katedra teoretické informatiky	cze
theses.degree.programme	Informatika	cze

Files in this item

Name: F8-BP-2020-Kozak-Matous-thesis.pdf
Size: 1.259Mb
Format: PDF
Description: PLNY_TEXT

Name: F8-BP-2020-posudek-Lorencz_Rob ...
Size: 135.7Kb
Format: PDF
Description: POSUDEK

Name: F8-BP-2020-posudek-Jurecek_Mar ...
Size: 136.7Kb
Format: PDF
Description: POSUDEK

This item appears in the following collections

Bachelor theses - 18101 [337]
