<?xml version="1.0"?>
<metadata xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:dc="http://purl.org/dc/elements/1.1/"><dc:title>Neural fake factor estimation using data-based inference</dc:title><dc:creator>Gavranovič,	Jan	(Avtor)
	</dc:creator><dc:creator>Čalić,	Lara	(Avtor)
	</dc:creator><dc:creator>Debevc,	Jernej	(Avtor)
	</dc:creator><dc:creator>Lytken,	Else	(Avtor)
	</dc:creator><dc:creator>Kerševan,	Borut Paul	(Avtor)
	</dc:creator><dc:subject>high energy physics</dc:subject><dc:subject>fake factor</dc:subject><dc:subject>electroweak precision physics</dc:subject><dc:description>In a high-energy physics data analysis, the term “fake” backgrounds refers to events that would formally not satisfy the (signal) process selection criteria, but are accepted nonetheless due to mis-reconstructed particles. This can occur, e.g., when leptons from secondary decays are incorrectly identified as originating from the hard-scatter interaction point (known as non-prompt leptons), or when other physics objects, such as hadronic jets, are mistakenly reconstructed as leptons (resulting in mis-identified leptons). These fake leptons are usually estimated using data-driven techniques, one of the most common being the Fake Factor method. This method relies on predicting the fake lepton contribution by reweighting data events, using a scale factor (i.e. fake factor) function. Traditionally, fake factors have been estimated by histogramming and computing the ratio of two data distributions, typically as functions of a few relevant physics variables such as the transverse momentum pT and pseudorapidity η. In this work, we introduce a novel approach of fake factor calculation, based on density ratio estimation using neural networks trained directly on data in a higher-dimensional feature space. We show that our method enables the computation of a continuous, unbinned fake factor on a per-event basis, offering a more flexible, precise, and higher-dimensional alternative to the conventional method, making it applicable to a wide range of analyses. A simple LHC open data analysis we implemented confirms the feasibility of the method and demonstrates that the ML-based fake factor provides smoother, more stable estimates across the phase space than traditional methods, reducing binning artifacts and improving extrapolation to signal regions.</dc:description><dc:publisher>SISSA</dc:publisher><dc:date>2026</dc:date><dc:date>2026-04-29 12:53:38</dc:date><dc:type>Neznano</dc:type><dc:identifier>29233</dc:identifier><dc:identifier>UDK: 539.1</dc:identifier><dc:identifier>ISSN pri članku: 1029-8479</dc:identifier><dc:identifier>DOI: 10.1007/JHEP04(2026)188</dc:identifier><dc:identifier>COBISS_ID: 276535043</dc:identifier><dc:source>Italija </dc:source><dc:language>sl</dc:language><dc:rights>© The Authors</dc:rights></metadata>
