Search for new phenomena in two-body invariant mass distributions using unsupervised machine learning for anomaly detection at $\sqrt{s} = 13$ TeV with the ATLAS detector

The ATLAS collaboration Aad, Georges ; Abbott, Braden Keim ; Abeling, Kira ; et al.
CERN-EP-2023-112, 2023.
Inspire Record 2674351 DOI 10.17182/hepdata.144864

Searches for new resonances are performed using an unsupervised anomaly-detection technique. Events with at least one electron or muon are selected from 140 fb$^{-1}$ of $pp$ collisions at $\sqrt{s} = 13$ TeV recorded by ATLAS at the Large Hadron Collider. The approach involves training an autoencoder on data, and subsequently defining anomalous regions based on the reconstruction loss of the decoder. Studies focus on nine invariant mass spectra that contain pairs of objects consisting of one light jet or $b$-jet and either one lepton ($e$, $\mu$), photon, or second light jet or $b$-jet in the anomalous regions. No significant deviations from the background hypotheses are observed.

15 data tables match query

Distributions of the anomaly score from the AE for data and five benchmark BSM models. Their legends, from top to bottom, are; (1) charged Higgs boson production in association with a top quark, $tbH^{+}$ with $H^{+} \rightarrow t\bar{b}$; (2) a Kaluza-Klein gauge boson, $W_{KK}$, with the SM $W$ boson and a radion $\phi$; (3) a $Z'$ boson decaying to a composite lepton $E$ and $\ell$, with $E \rightarrow Z\ell$ with a mass of 0.5 TeV; (4) the SSM $W$'$\rightarrow W Z' \rightarrow \ell\nu q\bar{q}$; (5) a simplified dark-matter model with an $Z$ axial-vector mediator $Z' \rightarrow q\bar{q}$, where one of the quarks radiates a $W$ boson decaying to $\ell\nu$. The BSM predictions represent the expected number of events from 140 $fb^{-1}$ of data for heavy particle ($H^{+}$ ,$W_{KK}$ , $Z'$ , $W'$ and $Z'$, respectively) masses around 2 TeV. The distributions for the BSM models are smoothed to remove fluctuations due to low MC event counts. The vertical lines indicate the start of the three anomaly regions (ARs). The labels of the three ARs indicate the visible cross section for hypothetical processes yielding the same number of events as observed in the 140 $fb^{-1}$ dataset. The AE is applied to preselected events without any requirements on invariant mass distributions.

Invariant mass distributions of jet+Y for $M_{jY}$ > 0.3 TeV in the 10 pb AR along with the fit of Eq. (1). The fits are represented by the lines, while the associated statistical uncertainties are indicated by the shaded bands. The lower panels show the bin-by-bin significances of deviations from the fit, calculated as $(d_{\textit{i}} - f_{i})/\delta_{\textit{i}}$, where $d_{i}$ is the data yield, $f_{\textit{i}}$ is the fit value, and $\delta_{i}$ is the data uncertainty in the $\textit{i}$-th bin.

Values of $\Delta Z$ for the discovery sensitivity, as defined in the text, as a function of the invariant mass $\textit{m}$. The j+j invariant mass distribution is calculated in the 10 pb AR. Positive percentages indicate improvements in sensitivity. Horizontal dashed lines are drawn at 100% and 200% to guide the eye. The five benchmark BSM models are (1) charged Higgs boson production in association with a top quark, $tbH^{+}$ with $H^{+} \rightarrow t\bar{b}$; (2) a Kaluza-Klein gauge boson, $W_{KK}$, with the SM $W$ boson and a radion $\phi$; (3) a $Z'$ boson decaying to a composite lepton $E$ and $\ell$, with $E \rightarrow Z\ell$; (4) the sequential standard model $W' \rightarrow W Z' \rightarrow \ell\nu q\bar{q}$; (5) a simplified dark-matter model with an axial-vector mediator $Z' \rightarrow q\bar{q}$, where one of the quarks radiates a $W$ boson decaying to $\ell\nu$. The multiple markers shown for the composite-lepton model at the same invariant mass values correspond to different composite lepton ($E$) masses between 0.25 and 3.5 TeV. The center positions of the markers are set to the masses of the corresponding heavy particles.

More…