-
1 Particle - 1 Qubit: Particle Physics Data Encoding for Quantum Machine Learning
Aritra Bal, Markus Klute, Benedikt Maier, Melik Oughton, Eric Pezone, Michael Spannowsky
• Submitted to Physical Review Letters
• Accepted at EPS HEP 2025 (Marseille)
• Accepted at ML4Jets 2025 (CalTech)
We introduce 1P1Q, a novel quantum data encoding scheme for high-energy physics (HEP), where each particle is assigned to an individual qubit, enabling direct representation of collision events without classical compression. We demonstrate the effectiveness of 1P1Q in quantum machine learning (QML) through two applications: a Quantum Autoencoder (QAE) for unsupervised anomaly detection and a Variational Quantum Circuit (VQC) for supervised classification of top quark jets. Our results show that the QAE successfully distinguishes signal jets from background QCD jets, achieving superior performance compared to a classical autoencoder while utilizing significantly fewer trainable parameters. Similarly, the VQC achieves competitive classification performance, approaching state-of-the-art classical models despite its minimal computational complexity. Furthermore, we validate the QAE on real experimental data from the CMS detector, establishing the robustness of quantum algorithms in practical HEP applications. These results demonstrate that 1P1Q provides an effective and scalable quantum encoding strategy, offering new opportunities for applying quantum computing algorithms in collider data analysis.
-
Model-agnostic search for dijet resonances with anomalous jet substructure in proton-proton collisions at √s = 13 TeV
CMS Collaboration
• Published in Reports on Progress in Physics (Impact Factor: 19)
• Presented at ML4Jets 2024 (Paris)
This paper presents a model-agnostic search for narrow resonances in the dijet final state in the mass range 1.8-6 TeV. The signal is assumed to produce jets with substructure atypical of jets initiated by light quarks or gluons, with minimal additional assumptions. Search regions are obtained by utilizing multivariate machine-learning methods to select jets with anomalous substructure. A collection of complementary anomaly detection methods - based on unsupervised, weakly supervised, and semisupervised algorithms - are used in order to maximize the sensitivity to unknown new physics signatures. These algorithms are applied to data corresponding to an integrated luminosity of 138 fb⁻¹, recorded by the CMS experiment at the LHC, at a center-of-mass energy of 13 TeV. No significant excesses above background expectations are seen. Exclusion limits are derived on the production cross section of benchmark signal models varying in resonance mass, jet mass, and jet substructure. Many of these signatures have not been previously sought, making several of the limits reported on the corresponding benchmark models the first ever. When compared to benchmark inclusive and substructure-based search strategies, the anomaly detection methods are found to significantly enhance the sensitivity to a variety of models.
-
Distilling particle knowledge for fast reconstruction at high-energy physics experiments
Aritra Bal, Tristan Brandes, Fabio Iemmi, Markus Klute, Benedikt Maier, Vinicius Mikuni, Thea Klaeboe Aarrestad
Published in Machine Learning Science and Technology
(This work was the Bachelor Thesis of T. Brandes) Knowledge distillation is a form of model compression that allows artificial neural networks of different sizes to learn from one another. Its main application is the compactification of large deep neural networks to free up computational resources, in particular on edge devices. In this article, we consider proton-proton collisions at the High-Luminosity Large Hadron Collider (HL-LHC) and demonstrate a successful knowledge transfer from an event-level graph neural network (GNN) to a particle-level small deep neural network (DNN). Our algorithm, DistillNet, is a DNN that is trained to learn about the provenance of particles, as provided by the soft labels that are the GNN outputs, to predict whether or not a particle originates from the primary interaction vertex. The results indicate that for this problem, which is one of the main challenges at the HL-LHC, there is minimal loss during the transfer of knowledge to the small student network, while improving significantly the computational resource needs compared to the teacher. This is demonstrated for the distilled student network on a CPU, as well as for a quantized and pruned student network deployed on an field programmable gate array. Our study proves that knowledge transfer between networks of different complexity can be used for fast artificial intelligence (AI) in high-energy physics that improves the expressiveness of observables over non-AI-based reconstruction algorithms. Such an approach can become essential at the HL-LHC experiments, e.g. to comply with the resource budget of their trigger stages.
-
Machine learning techniques for model-independent searches in dijet final states
CMS Collaboration
In preparation for publication
We present the performance of Machine Learning-based anomaly detection techniques for extracting potential new physics phenomena in a model-agnostic way with the CMS Experiment at the Large Hadron Collider. We introduce five distinct outlier detection or density estimation techniques, namely CWoLa, Tag N'Train, CATHODE, QUAK, and QR-VAE, tailored for the identification of anomalous jets originating from the decay of unknown heavy particles. We demonstrate the utility of these diverse approaches in enhancing the sensitivity to a wide variety of potential signals and assess their comparative performance in simulation.