260 likes | 301 Views
This project by the Hfitter Team involves extracting the Higgs signal using Maximum Likelihood (ML) fitting techniques. The team, led by Pierre-Antoine Delsart and others from LAPP and CERN, focuses on analyzing the H → γγ decay channel. They utilize the HiggsToGamGam Athena framework to process event data and perform fitting using the Hfitter package. Categories are defined for finer data segregation, and production modes and conversion categories are considered for a comprehensive analysis. The fitting strategy involves fitting PDF shapes for signal and background on limited statistics from older samples, with plans for updates. Fit variables like mHJ and NJets are crucial in the analysis. The project aims to optimize data fitting for improved signal extraction accuracy.
E N D
Extracting the Hgg signal using ML-fitting techniques The Hfitter Team: Pierre-Antoine Delsart, Sandrine Laplace, Jean-François Marchand (LAPP), Andreas Höcker, Guillaume Unal (CERN), Nicolas Berger (LAPP) Nicolas Berger, H->gamma gamma CSC Meeting
Overview • Analysis Strategy: 2 steps • Produce small ntuples from AOD data in Athena. • Use Maximum-likelihood fitter to extract signal yield. • Samples (CSC note): • Signal: generated with Pythia, reweighted (mainly in Higgs pT) using ResBos – maybe MC@NLO • Irreducible (gg) background : Pythia, reweighted w/ResBos • Reducible (gj, jj) backgrounds : AlpGen • Most plots shown here will have only gj Bkg. Nicolas Berger, H->gamma gamma CSC Meeting
Athena Framework: HiggsToGamGam • Ntuples produced using the HiggsToGamGam package (PhysicsAnalysis/HiggsPhys/HiggsToGamGam) in 12.0.5/6 • Based on EventView, dumps Higgs information in a AAN format. • Photons cuts based on isEM tune2: • e = 74.6% for isEM selection • e = 72.7% for pTg > 15 GeV => eTot = 54.3% • Calculates refitted primary vertices (code from I. Koletsou) , photon momentum correction for each vertex. • Choice of “best vertex” : use MC truth for now • Also impose ETg1 > 40 GeV, ETg2 > 20 GeV before fitting. Nicolas Berger, H->gamma gamma CSC Meeting
Event Processing AOD collections PhotonCollection ElectronCollection Et>15 GeV Matching with Conversion VxContainer Et>15 GeV OptIsEM cut EVFinalState Label Photon ConvertedPhoton Sum Photon+ConvertedPhoton AllPh_N>=2 Keep 2 most energetic amongst Photon + ConvertedPhoton HiggsCombo: Only 1 combo thus 1 view (EV0) ! EVFinalState Label EnergeticPhoton Nicolas Berger, H->gamma gamma CSC Meeting
Conversions • Rates: • ~50% of photons convert • ~20% early convs. (45 < R < 350 mm) • ~35% of events have early conv • Reconstruction: • 50% of early convs reconstructed • For others, reconstruct "1-track" conversions • Require no hits in B-layer • high efficiency • Contribution • +14% events with 2-track convs • +7% with 1-track convs Nicolas Berger, H->gamma gamma CSC Meeting
Fitter • Hfitter: see /groups/catsusy/Hfitter in cvs • Performs unbinned extended ML fits, arbitrary num. of samples, categories & fit variables (based on RooFit). • Likelihood: With: And: Nicolas Berger, H->gamma gamma CSC Meeting
2.5 Large Eta Apart Large Eta Close |h2| Crack 1.65 1.35 Wide Center 1 0.75 Large Eta Apart Center 0.75 1 1.35 1.65 |h1| 2.5 Better resolution for small |h| Categories • “Categories” used to split data into subsets • Separate events into disjoint subsets • Different categories can have • different values of PDF parameters. • or different PDFs altogether • Gives finer-grained description of data • Increase significance • Reduces biases from correlations • Here, as an example, categories based on photon h’s: Nicolas Berger, H->gamma gamma CSC Meeting
Higgs Production at LHC ttbar threshold Associated Productions Nicolas Berger, H->gamma gamma CSC Meeting
Categories (2) • Categories for photon h (previous slide) • Merge "Large Eta Close" and "Wide Center" since resolutions are similar • Exclude crack region for the rest of the study • Categories for production modes • Njets=0,1,2 • 2-jet: 2 jets with pt>20 GeV, <p1,p2> > 0.2 rad • 1-jet: pt > 20 GeV • Separate category for 2-jet/VBF • 2 jets with pt > 20 GeV • <p1,p2> > p/2 • Dh > 3.6 • No central jet veto • Categories for associated modes • WH, ZH • ttH ? • Categories for conversions • Nconv=0,1,2 – may separate 1-track/2-track in Nconv=1 Nicolas Berger, H->gamma gamma CSC Meeting
Use the following categories: WBF associated W, Z, or ttbar the remainder L = 30 fb-1 agrees with Categories : a toy example gg only:4.3s gg+VBFtogether5.1s gg+VBF+W/Z/tt:5.8s Whole sample: 4.5s WBF only2.8s H+W/Z/tt only:2.9s Bertrand Brelier Nicolas Berger, H->gamma gamma CSC Meeting
g q* In Higgs rest frame Higgs g Fit variables • Use • mgg • pT(Higgs) • |cos q*|, q*=Higgs decay angle • MHJ, invariant mass of Higgs cand. and highest-pt jet. Nicolas Berger, H->gamma gamma CSC Meeting
Fit Variables: mHJ NJets=1 NJets=2 NJets=2, VBF Longer tails in signal Limited by MC statistics! Nicolas Berger, H->gamma gamma CSC Meeting
Variables used Not for CSC Note Nicolas Berger, H->gamma gamma CSC Meeting
Fitting Strategy • Fit PDF shapes for signal and Bkg on MC • See next pages for details • Samples used are "old" with very limited statistics for bkg – updated soon with new ntuples. • Fit data with Sig+Bkg or Bkg-only models • Fit for • Nsigtotal – fixing the fraction fsignal in each category • NBkg in each category • Also keep mH floating • What window ? => Need to be optimized as fcn of lumi • Also float some background shape parameters Nicolas Berger, H->gamma gamma CSC Meeting
Fractions of Events in Categories • Can gain significance by isolating VBF • Need to make sure this is not spoilt by cross-feed from other categories. Photon h Signal Bkg NJets Nicolas Berger, H->gamma gamma CSC Meeting
Components Fits : mgg • Use 12.0.4 without pileup, vertex-corrected mass. • Signal: use Crystal Ball Shape • Crystal Ball: • Background: use a simple exponential – widen the mgg range for stats Background e-xm Signal Nicolas Berger, H->gamma gamma CSC Meeting
mgg : Parameter values • Split x and sm in categories: • Different x for Njet categories • Different sm for photon h categories • x floats in fit to data Nicolas Berger, H->gamma gamma CSC Meeting
Comparison with diphox • Spectrum OK-fitted by an exponential in the relevant region • x values similar to what was found in gjet • Need to evaluate effect of changes in bkg shape on signal yield. x = (-2.42 ± 0.01) 10-2 All photons x = (-2.87 ± 0.02) 10-2 x = (-2.12 ± 0.02) 10-2 Nicolas Berger, H->gamma gamma CSC Meeting
Higgs pT Signal • Fit to functions of the form • Signal: use a sum of 2 such components. • Background: just one component (for now…) • x, l float in fit to data Distributions from PYTHIA: needs to be updated on reweighted MC Background Nicolas Berger, H->gamma gamma CSC Meeting
Signal: Comparison with ResBos • ResBos: hh->B+X generator, where B boson (or gg pair) • resums soft/virtual QCD corrections (LL) => better description than NLO at small pT : • only generates gg->H (?) so look at same process with Pythia. Nicolas Berger, H->gamma gamma CSC Meeting
Higgs |cos q*| • Use: Background Signal Background a2, a4 float in fit to data Nicolas Berger, H->gamma gamma CSC Meeting
Higgs-Jet mass (mHJ) • Fit to functions of the form Signal Background floats in fit to data Nicolas Berger, H->gamma gamma CSC Meeting
Performance Studies • Generate pseudo-data from PDFs, using expected signal/Bkg yields • Perform fit in the same conditions as data (floating parameters, etc.) • Fit the same sample with different models • Removes sensitivity to statistical effects in comparison • To calculate significance • Fit with S+B, B-only hypotheses • Calculate Dlog L = log LS+B – log LB • Get s = (2 Dlog L) [asymptotic formula, 1 dof] • Repeat on O(100) toys, average s = expected significance • Need to decide between 1-sided and 2-sided intervals (i.e. do we allow NSig < 0 ?) Nicolas Berger, H->gamma gamma CSC Meeting
Example: mgg-only vs. mgg Ä cos q* Ä pT NSignal mgg Signif. 1-var Peak = 4.37 ± 0.08 Peak = 207 ± 4 Peak = 119.97 ± 0.03 4-var Signif. NSignal mgg Peak = 200 ± 3 Peak = 119.95 ± 0.03 Peak = 6.10 ± 0.08 Nicolas Berger, H->gamma gamma CSC Meeting
Results • Significant gains from both categories and additional variables. Nicolas Berger, H->gamma gamma CSC Meeting
Outlook • Major update effort underway for CSC note • Realistic backgrounds • Use MC@NLO for signal ? • Add conversions categories • Add categories for assoc. prod. • Further improvements • Make better use of jet info. (MVA) • Optimize fit for early data-taking • Better MC for signal and irred. bkg. would allow more realistic modeling and improve performance. • Help is welcome! Nicolas Berger, H->gamma gamma CSC Meeting