Digital repository of Slovenian research organisations

Show document
A+ | A- | Help | SLO | ENG

Title:Neural fake factor estimation using data-based inference
Authors:ID Gavranovič, Jan, Institut "Jožef Stefan" (Author)
ID Čalić, Lara (Author)
ID Debevc, Jernej, Institut "Jožef Stefan" (Author)
ID Lytken, Else (Author)
ID Kerševan, Borut Paul, Institut "Jožef Stefan" (Author)
Files:URL URL - Source URL, visit https://link.springer.com/article/10.1007/JHEP04(2026)188
 
.pdf PDF - Presentation file, download (2,87 MB)
MD5: 788F56699E95853A4E0BE91B91D1DD25
 
Language:English
Typology:1.01 - Original Scientific Article
Organization:Logo IJS - Jožef Stefan Institute
Abstract:In a high-energy physics data analysis, the term “fake” backgrounds refers to events that would formally not satisfy the (signal) process selection criteria, but are accepted nonetheless due to mis-reconstructed particles. This can occur, e.g., when leptons from secondary decays are incorrectly identified as originating from the hard-scatter interaction point (known as non-prompt leptons), or when other physics objects, such as hadronic jets, are mistakenly reconstructed as leptons (resulting in mis-identified leptons). These fake leptons are usually estimated using data-driven techniques, one of the most common being the Fake Factor method. This method relies on predicting the fake lepton contribution by reweighting data events, using a scale factor (i.e. fake factor) function. Traditionally, fake factors have been estimated by histogramming and computing the ratio of two data distributions, typically as functions of a few relevant physics variables such as the transverse momentum pT and pseudorapidity η. In this work, we introduce a novel approach of fake factor calculation, based on density ratio estimation using neural networks trained directly on data in a higher-dimensional feature space. We show that our method enables the computation of a continuous, unbinned fake factor on a per-event basis, offering a more flexible, precise, and higher-dimensional alternative to the conventional method, making it applicable to a wide range of analyses. A simple LHC open data analysis we implemented confirms the feasibility of the method and demonstrates that the ML-based fake factor provides smoother, more stable estimates across the phase space than traditional methods, reducing binning artifacts and improving extrapolation to signal regions.
Keywords:high energy physics, fake factor, electroweak precision physics
Publication status:Published
Publication version:Version of Record
Submitted for review:11.11.2025
Article acceptance date:28.02.2026
Publication date:23.04.2026
Publisher:SISSA
Year of publishing:2026
Number of pages:str. 1-27
Numbering:Vol. 2026, article no. 188
Source:Italija
PID:20.500.12556/DiRROS-29233 New window
UDC:539.1
ISSN on article:1029-8479
DOI:10.1007/JHEP04(2026)188 New window
COBISS.SI-ID:276535043 New window
Copyright:© The Authors
Note:Nasl. z nasl. zaslona; Soavtorja iz Slovenije: Jernej Debevc, Borut Paul Kerševan; Opis vira z dne 24. 4. 2026;
Publication date in DiRROS:29.04.2026
Views:39
Downloads:21
Metadata:XML DC-XML DC-RDF
:
Copy citation
  
Share:Bookmark and Share


Hover the mouse pointer over a document title to show the abstract or click on the title to get all document metadata.

Record is a part of a journal

Title:The journal of high energy physics
Shortened title:J. high energy phys.
Publisher:SISSA
ISSN:1029-8479
COBISS.SI-ID:1314148 New window

Document is financed by a project

Funder:ARIS - Slovenian Research and Innovation Agency
Project number:J1-60028-2025
Name:Razvoj metod strojnega učenja za natančno določitev procesov ozadja pri iskanju nove fizike na Velikem hadronskem trkalniku (LHC)

Funder:ARIS - Slovenian Research and Innovation Agency
Project number:P1-0135-2022
Name:Eksperimentalna fizika osnovnih delcev

Licences

License:CC BY 4.0, Creative Commons Attribution 4.0 International
Link:http://creativecommons.org/licenses/by/4.0/
Description:This is the standard Creative Commons license that gives others maximum freedom to do what they want with the work as long as they credit the author.
Licensing start date:23.04.2026
Applies to:VoR

Secondary language

Language:Slovenian
Keywords:fizika visokih energij, hadronski trkalnik


Back