Strojové učení pro řízení inferencí a v automatickém dokazování vět

Zarathustra Goertzel

Learning Inference Guidance in Automated Theorem Proving

Typ dokumentu

disertační práce
doctoral thesis

Autor

Zarathustra Goertzel

Vedoucí práce

Urban Josef

Oponent práce

Veroff Robert

Studijní obor

Umělá inteligence a biokybernetika

Studijní program

Elektrotechnika a informatika

Instituce přidělující hodnost

katedra počítačů

Práva

A university thesis is a work protected by the Copyright Act. Extracts, copies and transcripts of the thesis are allowed for personal use only and at one?s own expense. The use of thesis should be in compliance with the Copyright Act http://www.mkcr.cz/assets/autorske-pravo/01-3982006.pdf and the citation ethics http://knihovny.cvut.cz/vychova/vskp.html
Vysokoškolská závěrečná práce je dílo chráněné autorským zákonem. Je možné pořizovat z něj na své náklady a pro svoji osobní potřebu výpisy, opisy a rozmnoženiny. Jeho využití musí být v souladu s autorským zákonem http://www.mkcr.cz/assets/autorske-pravo/01-3982006.pdf a citační etikou http://knihovny.cvut.cz/vychova/vskp.html

Metadata

Zobrazit celý záznam

Abstrakt

Automatické dokazování vět je oborem umělé inteligence (AI), který sahá až k počátkům AI, například v “Logic Theory machine” od Simona a Newella. Symbolická umělá inteligence je často srovnávána se statistickým strojovým učením (ML). Tato práce se zabývá úkolem integrace statistického ML do symbolické AI v kontextu automatických dokazovačů vět, konkrétně dokazovače E. E provádí úplné vyhledávání důkazu dané domněnky. ENIGMA je systém, který používá ML (lineární regrese, rozhodovací stromy s gradientem a grafové neuronové sítě) k výběru, kterou klauzuli zpracovat jako další během hledání důkazu. Modely ML zvažují, upřednostňují a filtrují klauzule, které má E použít, a řídí tak průběh vyhledání důkazu. Byly vyvinuty a experimentálně vyhodnoceny tři metody: ProofWatch, ENIGMAWatch a Parental Guidance. Práce zahrnuje šest článků o těchto metodách. ProofWatch využívá informace z mnoha předchozích důkazů k řízení aktuálního vyhledání pomocí vektorů, které vyjadřují, jak moc se současné prohledávání shoduje s předchozími. ENIGMAWatch kombinuje systémy ENIGMA a ProofWatch pomocí dynamických důkazových vektorů popisujících stav důkazu, které jsou využity systémy ML. Parental Guidance filtruje, které klauzule jsou generovány pomocí vlastností rodičovských klauzulí, aby se zjednodušilo rozhodovací prostředí ENIGMA modelů pro výběr klauzulí. Metodu Parental Guidance lze kombinovat s dalšími modely ENIGMA a vytvořit tak třífázový systém ENIGMA, zatím nejsilnější v praxi. Krátký dokument “Make E Smart Again” vypíná dvě funkce nezbytné pro efektivní a úplné vyhledávání důkazů, konkrétně uspořádání termů a výběr literálů, aby bylo vidět, jak soběstačné jsou modely výběru klauzulí založené na ML. Výsledky v dokumentu „The Isabelle ENIGMA“ ukazují, že ENIGMA a Parental Guidance jsou účinné na problémy ze systému Isabelle Sledgehammer. Všechny metody v této práci vylepšují nejmodernější výkon na matematické knihovně Mizar v době publikace. Nejlepší jednotlivá strategie s ProofWatch má o 26,5% lepší výsledky než základní strategie a nejlepší soubor pěti strategií má o 7% lepší výsledky než nejlepší soubor základních strategií. ENIGMAWatch dokázal o 8,8% více než ENIGMA v první tréninkové smyčce a dosahuje rychlejších tréninkových časů. Experimenty v “Make E Smart Again” ukázaly, že ENIGMA dosahuje 256% nárůstu oproti holé verzi E a může překonat výkon E pomocí dvou silných strategií. Parental Guidance zlepšuje výkon samostatné ENIGMY o 11,7%. Třífázová ENIGMA zlepšuje nejlepší předchozí výsledek o 17,4% a automatický mód E o 60%. Metody výběru klauzulí ENIGMA a Parental Guidance jsou přenositelné na jeden z největších korpusů matematických problémů Isabelle Sledgehammer a fungují na stejné úrovni jako nejmodernější dokazovač CVC5. ENIGMA a CVC5 prokázaly značný počet odlišných problémů, přičemž ENIGMA překonala automatický mód E o 25,5%

Automated theorem proving is a subfield of Artificial Intelligence (AI) that goes back to its origins with Simon and Newell’s Logic Theory Machine.Symbolic AI has often been contrasted with statistical machine learning (ML). This thesis addresses the task of integrating statistical ML into symbolic AI in the form of automated theorem provers, specifically the E prover. E performs a refutation-complete search for a proof of a given conjecture. ENIGMA is a system that uses ML (linear egression, gradientboosted decision trees, and graph neural networks) to select which clause to process next in a proof search via weight functions. The ML models weigh, prioritize, and filter the clauses to be used by E, thus guiding the proof search. Three methods are developed and experimented with: ProofWatch, ENIGMAWatch, and Parental Guidance. The six papers on these methods are included. ProofWatch uses hints from many other proofs to guide the current proof search by using proof vectors that count how much of each proof on the watchlist is matched. ENIGMAWatch combines ENIGMA with ProofWatch by using the evolving proof vectors as a description of the proof state to be used by the ML systems. Parental Guidance filters which clauses get generated by their parental features to simplify the decision landscape for clause election ENIGMA models. The Parental Guidance method can be combined with other ENIGMA models to create the 3-phase ENIGMA system, the strongest so far. The “Make E Smart Again” short paper disables two features essential for an effective and complete refutational proof search, term ordering and literal selection, to see how self-sufficient the ML-based clause selection models are. The results in the “The Isabelle ENIGMA” paper demonstrate that ENIGMA and Parental Guidance are effective on the Isabelle Sledgehammer Problems.All of the methods in this thesis improve upon the state-of-the-art performance over the Mizar Mathematical Library at the time of publication. The best single strategy with ProofWatch performed 26.5% better than the baseline strategy, and the best ensemble of five strategies performed 7% better than the best ensemble of baseline strategies. ENIGMAWatch proved 8.8% more than ENIGMA in the first training loop with a faster training time. The experiments in “Make E Smart Again” indicated that ENIGMA attains a 256% increase over a bare-bones version of E and can surpass the performance of E with two strong strategies. Parental Guidance improves the performance of standalone ENIGMA by 11.7%. The 3-phase ENIGMA improves upon the best previous result by 17.4% and upon E’s auto-schedule by 60%. The ENIGMA clause selection and Parental Guidance methods transfer to one of the largest corpora of Isabelle Sledgehammer problems and perform on par with the state-of-the-art prover CVC5. ENIGMA and CVC5 prove a significant number of distinct problems, with ENIGMA outperforming E’s auto-schedule by 25.5%.