Predikce biosyntézy terpenů pomocí strojového učení

Roman Bushuiev

Machine-learning prediction of terpene biosynthesis

dc.contributor.advisor	Pluskal Tomáš
dc.contributor.author	Roman Bushuiev
dc.date.accessioned	2021-06-11T22:52:42Z
dc.date.available	2021-06-11T22:52:42Z
dc.date.issued	2021-06-11
dc.identifier	KOS-961987855805
dc.identifier.uri	http://hdl.handle.net/10467/95071
dc.description.abstract	Biosyntéza v živých organismech se skládá z komplexních transformací molekul katalyzovaných enzymy. Ačkoli porozumění těmto biochemickým reakcím je zásadní pro moderní medicínu a strojové učení již prokázalo svou účinnost pro rozluštění velmi složitých problémů, predikce biosyntéz dosud nebyla studována. Dokonce i pro dobře definované reakce, jako je biosyntéza terpenů, velmi malé množství dosud charakterizovaných reakcí a komplikovanost jejich složek dělají problém zdánlivě neřešitelným. V této práci se zaměřuji na predikci biosyntézy seskviterpenů a navrhuji řešení nejprve snížením složitosti pomocí modelů strojového učení předtrénovaných na rozsáhlých databázích, a následně využitím naučených vlastností na řešení primárního úkolu. Výsledky ukazují, že tento přístup umožňuje poměrně dobrou predikci reakcí biosyntézy seskviterpenů s použitím jen 315 trénovacích vzorků, a představuje tedy slibný směr pro další výzkum.	cze
dc.description.abstract	Biosynthesis in living organisms consists of complex molecular transformations catalyzed by enzymes. Although a deep understanding of such biochemical reactions is essential for modern medicine, and machine learning has already proven effective at solving highly complex problems, the prediction of biosynthesis has not yet been studied. Even for highly conserved reactions, such as terpene biosynthesis, the relatively small number of reactions characterized to date and the complexity of their components make the problem seem infeasible. In the present work, I focus on the prediction of sesquiterpene biosynthesis and propose a solution that first reduces the problem complexity with machine learning models pre-trained on large databases and then transfers the learned features to the primary task. The results show that this approach allows for reasonable prediction of sesquiterpene biosynthetic reactions using only 315 training samples, which makes it a promising direction for further research.	eng
dc.publisher	České vysoké učení technické v Praze. Výpočetní a informační centrum.	cze
dc.publisher	Czech Technical University in Prague. Computing and Information Centre.	eng
dc.rights	A university thesis is a work protected by the Copyright Act. Extracts, copies and transcripts of the thesis are allowed for personal use only and at one's own expense. The use of the thesis should be in compliance with the Copyright Act http://www.mkcr.cz/assets/autorske-pravo/01-3982006.pdf and the citation ethics http://knihovny.cvut.cz/vychova/vskp.html	eng
dc.rights	Vysokoškolská závěrečná práce je dílo chráněné autorským zákonem. Je možné pořizovat z něj na své náklady a pro svoji osobní potřebu výpisy, opisy a rozmnoženiny. Jeho využití musí být v souladu s autorským zákonem http://www.mkcr.cz/assets/autorske-pravo/01-3982006.pdf a citační etikou http://knihovny.cvut.cz/vychova/vskp.html	cze
dc.subject	biochemie	cze
dc.subject	terpen	cze
dc.subject	biosyntéza	cze
dc.subject	strojové učení	cze
dc.subject	Transformer	cze
dc.subject	Variational Autoencoder	cze
dc.subject	biochemistry	eng
dc.subject	terpene	eng
dc.subject	biosynthesis	eng
dc.subject	machine learning	eng
dc.subject	Transformer	eng
dc.subject	Variational Autoencoder	eng
dc.title	Predikce biosyntézy terpenů pomocí strojového učení	cze
dc.title	Machine-learning prediction of terpene biosynthesis	eng
dc.type	bakalářská práce	cze
dc.type	bachelor thesis	eng
dc.contributor.referee	Hrabáková Jitka
theses.degree.discipline	Znalostní inženýrství (Knowledge Engineering)	cze
theses.degree.grantor	katedra aplikované matematiky (Department of Applied Mathematics)	cze
theses.degree.programme	Informatika 2009 (Informatics 2009)	cze

Files in this item

Name: F8-BP-2021-Bushuiev-Roman-thes ...
Size: 5.784Mb
Format: PDF
Description: PLNY_TEXT (full text)

Name: F8-BP-2021-Bushuiev-Roman-pril ...
Size: 37.91Mb
Format: Unknown
Description: PRILOHA (attachment)

Name: F8-BP-2021-posudek-Pluskal_Tom ...
Size: 45.18Kb
Format: PDF
Description: POSUDEK (review)

Name: F8-BP-2021-posudek-Hrabakova_J ...
Size: 48.43Kb
Format: PDF
Description: POSUDEK (review)

This item appears in the following collections

Bachelor theses - 18105 [244]