Indexování XML dokumentů pomocí automatů: výběr neznámých uzlů

Karzhenkova Maria

Automata Approach to XML Data Indexing: Selecting Unknown Nodes

dc.contributor.advisor	Šestáková Eliška
dc.contributor.author	Karzhenkova Maria
dc.date.accessioned	2018-06-19T21:57:26Z
dc.date.available	2018-06-19T21:57:26Z
dc.date.issued	2018-06-13
dc.identifier	KOS-695599707505
dc.identifier.uri	http://hdl.handle.net/10467/76816
dc.description.abstract	Tato práce je součástí projektu "Indexování XML dokumentů pomocí automatů". Popisuje existující metody pro indexování XML dokumentů, které jsou založeny na teorii automatů, a jejich rozšíření, za účelem umožnění efektivního zpracování XPath dotazů skládajících se z libovolné kombinace child (/), descendant-or-self (//) os a asterisk (*) a nodename node testů, sloužících k navigaci v XML dokumentu. Ke konstrukci indexu pro daný XML dokument D s n elementy je využít odpovídající XML stromový model T. Zpracování dotazu Q o velikosti m proběhne v čase O(m) nezávislém na n. Tato práce obsahuje též diskuzi ohledně časové a paměťové složitosti pro každou z navržených metod. Všechny nově popsané algoritmy jsou implementovány a otestovány na reálních datech.	cze
dc.description.abstract	Being a part of the "Automata Approach to XML Data Indexing" project, this thesis is concerned with studying the existing methods of indexes creation algorithms based on the automata theory and extending them to deal with more significant fragment of XPath queries. The presented methods allow us to construct XML data indexes that support evaluation of all XPath queries using any combinations of child (/), descendant-or-self (//) axes, asterisk (*) and nodename node tests. Given an XML document D and its corresponding XML tree model T with n nodes, the tree is preprocessed and the index for the document D is constructed. The searching phase time of each of the constructed indexes for a query Q is bounded by O(m), where m is size of the query Q, and does not depend on the indexed XML document size n. Moreover, the space and time complexities for each of the proposed indexes are discussed, all the introduced algorithms are implemented and tested over the real-life datasets.	eng
dc.language.iso	ENG
dc.publisher	České vysoké učení technické v Praze. Vypočetní a informační centrum.	cze
dc.publisher	Czech Technical University in Prague. Computing and Information Centre.	eng
dc.rights	A university thesis is a work protected by the Copyright Act. Extracts, copies and transcripts of the thesis are allowed for personal use only and at one?s own expense. The use of thesis should be in compliance with the Copyright Act http://www.mkcr.cz/assets/autorske-pravo/01-3982006.pdf and the citation ethics http://knihovny.cvut.cz/vychova/vskp.html	eng
dc.rights	Vysokoškolská závěrečná práce je dílo chráněné autorským zákonem. Je možné pořizovat z něj na své náklady a pro svoji osobní potřebu výpisy, opisy a rozmnoženiny. Jeho využití musí být v souladu s autorským zákonem http://www.mkcr.cz/assets/autorske-pravo/01-3982006.pdf a citační etikou http://knihovny.cvut.cz/vychova/vskp.html	cze
dc.subject	XML,XPath,strom,konečný automat,index,neznámé uzly	cze
dc.subject	XML,XPath,tree,finite automaton,index,unknown nodes	eng
dc.title	Indexování XML dokumentů pomocí automatů: výběr neznámých uzlů	cze
dc.title	Automata Approach to XML Data Indexing: Selecting Unknown Nodes	eng
dc.type	bakalářská práce	cze
dc.type	bachelor thesis	eng
dc.date.accepted	2018-06-18
dc.contributor.referee	Pecka Tomáš
theses.degree.discipline	Teoretická informatika	cze
theses.degree.grantor	katedra teoretické informatiky	cze
theses.degree.programme	Informatika	cze

Soubory tohoto záznamu

Název:: F8-BP-2018-Karzhenkova-Maria-t ...
Velikost:: 929.7Kb
Formát:: PDF
Popis:: PLNY_TEXT
: Zobrazit/otevřít

Název:: F8-BP-2018-posudek-Sestakova_E ...
Velikost:: 138.4Kb
Formát:: PDF
Popis:: POSUDEK
: Zobrazit/otevřít

Název:: F8-BP-2018-posudek-Pecka_Tomas.pdf
Velikost:: 142.1Kb
Formát:: PDF
Popis:: POSUDEK
: Zobrazit/otevřít

Tento záznam se objevuje v následujících kolekcích

Bakalářské práce - 18101 [337]

Zobrazit minimální záznam