Generativní modely pro měření ve fyzice vysokých energií

V této práci jsme augmentovali Monte-Carlo simulovaná data pro výzkum nabitého Higgsova bosonu. Jednotlivé eventy jsou vybrány, pokud obsahují dva lehké leptony (elektron, mion) stejného náboje a přesně jeden hadronicky se rozpadající tauon. Pro augmentaci dat byl zvolen jako model variační autoenkodér s evidence lower bound a symmetric equilibrium algoritmem učení. Obě metody učení byly také otestováný s hierarchickou (Ladder) architekturou. K posouzení kvality dat byly voleny jak kvantitativní, tak kvalitatnivní metriky. Standardní evidence lower bound model byl zvolen jako nejlepší na základě nejlepších dosažených výsledků. Model byl dále zvolen pro augmentaci dat určených pro experiment zaměřený na separaci signálu a pozadí. Závislost výkonu klasifikátoru na velikosti trénovacího datasetu byla demonstrována za použití dvou obecně známých paradigmat strojového učení pro tabulková data: gradient-boosted decision trees a hlubokých neuronových sítí.

In this thesis, we augmented Monte-Carlo simulated data for charged Higgs boson searches. Events are selected if they have two light leptons (electron or muon) of the same sign and exactly one hadronically decaying tau-lepton. For the data generation, a variational autoencoder model was used with evidence lower bound and symmetric equilibrium learning. Both mentioned learning approaches were also tested with hierarchical (Ladder) architecture. For the data quality assessment, both qualitative and quantitative metrics were taken into account. The standard evidence lower bound (ELBO) learning model was selected as the best-performing option. The model was then used to generate data for the signal and background separation analysis experiments. The dependence of classifier performance on the training dataset size was demonstrated using two widely used machine learning paradigms for tabular data classification: gradient-boosted decision trees and deep neural networks.

Permanent link

http://hdl.handle.net/10467/114631

Rights/License

A university thesis is a work protected by the Copyright Act of the Czech Republic. Extracts, copies and transcripts of the thesis are allowed for personal use only and at one`s own expense. The use of thesis should be in compliance with the Copyright Act.

Vysokoškolská závěrečná práce je dílo chráněné autorským zákonem. Je možné pořizovat z něj na své náklady a pro svoji osobní potřebu výpisy, opisy a rozmnoženiny. Jeho využití musí být v souladu s autorským zákonem v platném znění.

Collections

Master Theses - 13133

Full item page

Generative Models for High Energy Physics Measurements