<?xml version="1.0"?>
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:dc="http://purl.org/dc/elements/1.1/"><rdf:Description rdf:about="https://dirros.openscience.si/IzpisGradiva.php?id=29233"><dc:title>Neural fake factor estimation using data-based inference</dc:title><dc:creator>Gavranovič,	Jan	(Avtor)
	</dc:creator><dc:creator>Čalić,	Lara	(Avtor)
	</dc:creator><dc:creator>Debevc,	Jernej	(Avtor)
	</dc:creator><dc:creator>Lytken,	Else	(Avtor)
	</dc:creator><dc:creator>Kerševan,	Borut Paul	(Avtor)
	</dc:creator><dc:subject>high energy physics</dc:subject><dc:subject>fake factor</dc:subject><dc:subject>electroweak precision physics</dc:subject><dc:description>In a high-energy physics data analysis, the term “fake” backgrounds refers to events that would formally not satisfy the (signal) process selection criteria, but are accepted nonetheless due to mis-reconstructed particles. This can occur, e.g., when leptons from secondary decays are incorrectly identified as originating from the hard-scatter interaction point (known as non-prompt leptons), or when other physics objects, such as hadronic jets, are mistakenly reconstructed as leptons (resulting in mis-identified leptons). These fake leptons are usually estimated using data-driven techniques, one of the most common being the Fake Factor method. This method relies on predicting the fake lepton contribution by reweighting data events, using a scale factor (i.e. fake factor) function. Traditionally, fake factors have been estimated by histogramming and computing the ratio of two data distributions, typically as functions of a few relevant physics variables such as the transverse momentum pT and pseudorapidity η. In this work, we introduce a novel approach of fake factor calculation, based on density ratio estimation using neural networks trained directly on data in a higher-dimensional feature space. We show that our method enables the computation of a continuous, unbinned fake factor on a per-event basis, offering a more flexible, precise, and higher-dimensional alternative to the conventional method, making it applicable to a wide range of analyses. A simple LHC open data analysis we implemented confirms the feasibility of the method and demonstrates that the ML-based fake factor provides smoother, more stable estimates across the phase space than traditional methods, reducing binning artifacts and improving extrapolation to signal regions.</dc:description><dc:publisher>SISSA</dc:publisher><dc:date>2026</dc:date><dc:date>2026-04-29 12:53:38</dc:date><dc:type>Neznano</dc:type><dc:identifier>29233</dc:identifier><dc:source>Italija </dc:source><dc:language>sl</dc:language><dc:rights>© The Authors</dc:rights></rdf:Description></rdf:RDF>
