Self-supervised model pro efektivní rozpoznávání zvuku trénovaný na agregovaných datech

Houska, Vojtěch

Self-supervised model for efficient sound recognition trained on aggregated data

dc.contributor.advisor	Kovalenko, Alexander
dc.contributor.author	Houska, Vojtěch
dc.contributor.referee	Kordík, Pavel
dc.date.accessioned	2021-06-11T22:52:00Z
dc.date.available	2021-06-11T22:52:00Z
dc.date.issued	2021-06-11
dc.description.abstract	Tato práce shrnuje nejmodernější metody využívané v hlubokém učení. Probírá použití autoenkodérů a metody předzpracování v oblasti rozpoznávání zvuku. Jako zdroj slabě anotovaných dat pro učení těchto modelů byla použita platforma YouTube. Práce porovnala vlastnosti latentních prostorů navrhovaných autoenkoderů, které byly testovány pomocí shlukování K-means. Použitá metoda regularizovaného autoenkodéru nepřekonala náhodně inicializovaný autoenkodér. V závěru práce jsou rozebrány příčiny a byla navrhnuta další doporučení pro pozdější výzkum.	cs
dc.description.abstract	The thesis summarizes state-of-the-art approaches in deep learning. It discusses application of self-supervised autoencoders and pre-processing techniques used in sound recognition. YouTube platform served as a source of weakly-labeled data to train such models. Latent space properties of proposed autoencoders were compared and tested using K-means clustering. Implementation of Adversarially Constrained Autoencoder Interpolation failed to outperform randomly initialized autoencoder. The reasons are further discussed and several recommendations for future research are proposed.	en
dc.identifier	KOS-961987101905
dc.identifier.uri	http://hdl.handle.net/10467/95039
dc.publisher	České vysoké učení technické v Praze	cs
dc.publisher	Czech Technical University in Prague	en
dc.rights	A university thesis is a work protected by the Copyright Act of the Czech Republic. Extracts, copies and transcripts of the thesis are allowed for personal use only and at one`s own expense. The use of thesis should be in compliance with the Copyright Act.	en
dc.rights	Vysokoškolská závěrečná práce je dílo chráněné autorským zákonem. Je možné pořizovat z něj na své náklady a pro svoji osobní potřebu výpisy, opisy a rozmnoženiny. Jeho využití musí být v souladu s autorským zákonem v platném znění.	cs
dc.subject	slabě anotovaná data	cs
dc.subject	rozpoznávání zvuku	cs
dc.subject	hluboké učení	cs
dc.subject	autoenkodéry	cs
dc.subject	nevyvážená data	cs
dc.subject	učení bez učitele	cs
dc.subject	k-means	cs
dc.subject	Adversarially Constrained Autoencoder Interpolation	cs
dc.subject	Weakly-labeled data	en
dc.subject	Sound classification	en
dc.subject	Deep learning	en
dc.subject	Autoencoders	en
dc.subject	Imbalanced data	en
dc.subject	Self-supervised learning	en
dc.subject	K-means	en
dc.subject	Adversarially Constrained Autoencoder Interpolation	en
dc.title	Self-supervised model pro efektivní rozpoznávání zvuku trénovaný na agregovaných datech	cs
dc.title	Self-supervised model for efficient sound recognition trained on aggregated data	en
dc.type	bachelor thesis	en
dspace.entity.type	Publication
relation.isAdvisorOfPublication	1d81a757-e1a4-46a0-b2fb-9f2306a53e5b
relation.isAdvisorOfPublication.latestForDiscovery	1d81a757-e1a4-46a0-b2fb-9f2306a53e5b
relation.isAuthorOfPublication	13466209-9fc6-4fb0-b3d6-90a9efcaeba0
relation.isAuthorOfPublication.latestForDiscovery	13466209-9fc6-4fb0-b3d6-90a9efcaeba0
relation.isRefereeOfPublication	89151d71-aa84-441c-b049-096a1b4971cf
relation.isRefereeOfPublication.latestForDiscovery	89151d71-aa84-441c-b049-096a1b4971cf
theses.degree.discipline	Znalostní inženýrství	cs
theses.degree.grantor	katedra aplikované matematiky	cs
theses.degree.programme	Informatika 2009	cs

Files

Original bundle

Now showing 1 - 3 of 3

Name:: F8-BP-2021-Houska-Vojtech-thesis.pdf
Size:: 2.65 MB
Format:: Adobe Portable Document Format
Description:: PLNY_TEXT

Download

Name:: F8-BP-2021-posudek-Kordik_Pavel.pdf
Size:: 47.34 KB
Format:: Adobe Portable Document Format
Description:: POSUDEK

Download

Name:: F8-BP-2021-posudek-Kovalenko_Alexander.pdf
Size:: 50.77 KB
Format:: Adobe Portable Document Format
Description:: POSUDEK

Download

Collections

Bachelor Theses - 18105