Zobrazit minimální záznam

Predictor Factory: Learning from Relational Data



dc.contributor.advisorKordík Pavel
dc.contributor.authorJan Motl
dc.date.accessioned2022-04-07T13:19:13Z
dc.date.available2022-04-07T13:19:13Z
dc.date.issued2022-04-07
dc.identifierKOS-592967335705
dc.identifier.urihttp://hdl.handle.net/10467/100475
dc.description.abstractPropositionalization algorithms transform relational data into a single table with features, which can be used for classification or regression with conventional machine learning tools. However, the contemporary propositionalization algorithms were not designed to work on changing data and suffered from the production of many irrelevant and redundant features. We altered the propositionalization to work on temporal data and introduced meta-learning, which predicts, which features will be relevant and unique. To test Predictor Factory, our implementation of propositionalization, we have created a repository of relational datasets and implemented a scalable algorithm for relationship discovery between tables in the dataset. The implementations were open-sourced and applied to real-world banking, government, marketing, medicine, and telecommunication problems.cze
dc.description.abstractPropositionalization algorithms transform relational data into a single table with features, which can be used for classification or regression with conventional machine learning tools. However, the contemporary propositionalization algorithms were not designed to work on changing data and suffered from the production of many irrelevant and redundant features. We altered the propositionalization to work on temporal data and introduced meta-learning, which predicts, which features will be relevant and unique. To test Predictor Factory, our implementation of propositionalization, we have created a repository of relational datasets and implemented a scalable algorithm for relationship discovery between tables in the dataset. The implementations were open-sourced and applied to real-world banking, government, marketing, medicine, and telecommunication problems.eng
dc.publisherČeské vysoké učení technické v Praze. Vypočetní a informační centrum.cze
dc.publisherCzech Technical University in Prague. Computing and Information Centre.eng
dc.rightsA university thesis is a work protected by the Copyright Act. Extracts, copies and transcripts of the thesis are allowed for personal use only and at one?s own expense. The use of thesis should be in compliance with the Copyright Act http://www.mkcr.cz/assets/autorske-pravo/01-3982006.pdf and the citation ethics http://knihovny.cvut.cz/vychova/vskp.htmleng
dc.rightsVysokoškolská závěrečná práce je dílo chráněné autorským zákonem. Je možné pořizovat z něj na své náklady a pro svoji osobní potřebu výpisy, opisy a rozmnoženiny. Jeho využití musí být v souladu s autorským zákonem http://www.mkcr.cz/assets/autorske-pravo/01-3982006.pdf a citační etikou http://knihovny.cvut.cz/vychova/vskp.htmlcze
dc.subjectrelational learningcze
dc.subjectpropositionalizationcze
dc.subjectfeature extractioncze
dc.subjectdata preprocessingcze
dc.subjectrelational learningeng
dc.subjectpropositionalizationeng
dc.subjectfeature extractioneng
dc.subjectdata preprocessingeng
dc.titlePredictor Factory: Učení z relačních datcze
dc.titlePredictor Factory: Learning from Relational Dataeng
dc.typedisertační prácecze
dc.typedoctoral thesiseng
dc.contributor.refereeKliegr Tomáš
theses.degree.disciplineInformatikacze
theses.degree.grantorkatedra aplikované matematikycze
theses.degree.programmeInformatikacze


Soubory tohoto záznamu


Tento záznam se objevuje v následujících kolekcích

Zobrazit minimální záznam