Machine learning technique for signal-background separation of nuclear interaction vertices in the CMS detector

Phil Baringer: baringer@ku.edu, Anna Kropivnitskaya (speaker): kropiv@cern.ch

University of Kansas
_for the CMS Collaboration_

Code with Toy data is available at Binder

Abstract:

The CMS inner tracking system is a fully silicon-based, high-precision detector. Accurate knowledge of the positions of active and inactive elements is important for simulating the detector, planning detector upgrades, and reconstructing charged particle tracks. Nuclear interactions of hadrons with the detector material create secondary vertices whose positions map the material with sub-millimeter precision in situ, while the detector is collecting data from LHC collisions.

A neural network (NN) with two hidden layers was used to separate secondary vertices due to combinatorial background from those arising from nuclear interactions with material. The NN was trained and tested on data from proton-proton collisions at a center-of-mass energy of 13 TeV, recorded in 2018 at the LHC.

NN training is performed using Keras and Matplotlib in a Jupyter notebook. Secondary vertices in the training data are classified as signal or background based on their geometrical position. Although the variables used in training show only small differences between background and signal, the NN separates the two classes well. Hadrographies of the CMS inner tracker before and after background cleaning are presented.

Table of contents

Introduction
CMS detectors
- Pixel detector
Nuclear interactions
Neural network (NN) motivation and strategy
Classification strategy for NN
Principal component analysis (PCA)
Keras model: NN with 2 hidden layers
Summary
Documentation
Acknowledgment

Material structure	Radius definition	Material center position
beam pipe (BP)	r, centered at BP	(1.71, -1.76) mm
BPIX inner/outer shields, layers 1-4	r, centered at BPIX	(0.86, -1.02) mm
BPIX rails and pixel support tube	r, centered at tube	(0.80, 3.18) mm
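
The table above implies that the radius of a vertex depends on which structure it is being compared with, because each structure has its own center in the transverse plane. A minimal sketch of that calculation follows. The center coordinates are taken from the table; the dictionary keys, the function name `radius_mm`, and the example vertex position are assumptions made for illustration and are not names from the analysis code.

```python
import math

# Material centers in the transverse (x, y) plane, in mm, copied from the table above.
# The dictionary keys are illustrative labels, not identifiers from the analysis.
CENTERS_MM = {
    "beam_pipe": (1.71, -1.76),
    "bpix": (0.86, -1.02),
    "support_tube": (0.80, 3.18),
}


def radius_mm(x, y, structure):
    """Transverse distance (mm) of the point (x, y) from the center of `structure`."""
    x0, y0 = CENTERS_MM[structure]
    return math.hypot(x - x0, y - y0)


# The same vertex gets a slightly different radius for each reference center.
vertex = (20.0, 10.0)  # hypothetical (x, y) position in mm
for name in CENTERS_MM:
    print(f"{name}: r = {radius_mm(*vertex, name):.2f} mm")
```

Using the structure-specific center keeps a thin cylindrical structure thin in r, which matters when signal and background regions are defined as narrow radial windows.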

B/S ratio		BP		IS		L1		L2		L3		L4		OS		Tube		Rails		Background
no classification		0.16		2.58		1.2		0.92		0.71		0.17		0.41		0.06		0.13		1000.0

Background to Signal ratio		BP		IS		L1		L2		L3		L4		OS		Tube		Rails
no classification		0.16		2.58		1.2		0.92		0.71		0.17		0.41		0.06		0.13

Introduction

CMS detectors

Pixel detector

Nuclear interactions

Data selection and reconstruction

Toy Data

Connect/Generate data

Display generated Toy data:

Neural network (NN) motivation and strategy

Classification strategy for NN

Set Signal and Background regions for beam pipe

Set Signal and Background regions for BPIX

Set Signal and Background regions for pixel support tube

Set Signal and Background regions for rails, by using x position of NI candidate

Classify NI candidates as Signal, as Background, and as Non-classified regions
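
As a rough illustration of labeling by geometrical position, the sketch below assigns a candidate to signal, background, or non-classified according to the radial window its radius falls in. The window boundaries, label values, and function names are hypothetical placeholders, not the regions used in the analysis.

```python
# Illustrative label values (assumption).
SIGNAL, BACKGROUND, UNCLASSIFIED = 1, 0, -1

# Hypothetical radial windows in mm, NOT the values used in the analysis.
SIGNAL_WINDOWS = [(21.0, 23.0)]
BACKGROUND_WINDOWS = [(18.0, 20.0), (24.0, 26.0)]


def in_any(r, windows):
    """True if r lies in at least one half-open window [lo, hi)."""
    return any(lo <= r < hi for lo, hi in windows)


def classify(r):
    """Label a candidate from its radius r (mm), measured from the relevant center."""
    if in_any(r, SIGNAL_WINDOWS):
        return SIGNAL
    if in_any(r, BACKGROUND_WINDOWS):
        return BACKGROUND
    return UNCLASSIFIED


labels = [classify(r) for r in (19.0, 22.0, 30.0)]
print(labels)
```

Candidates that fall in neither kind of window stay non-classified, so they do not enter the training labels.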

Check classification result

Estimate background-signal ratio (B/S) of each signal region

B/S ratio	BP	IS	L1	L2	L3	L4	OS	Tube	Rails	Background
no classification	0.16	2.58	1.2	0.92	0.71	0.17	0.41	0.06	0.13	1000.0

Shuffle Data

Sort track variables by $p_T$ and normalize track $p_T$
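
One possible reading of this step is sketched below: the tracks attached to a vertex are ordered by descending $p_T$, so that each NN input slot always refers to the leading, sub-leading, and later tracks, and each $p_T$ is divided by the summed $p_T$ of the vertex. Both the descending order and the sum-based normalization are assumptions, as are the dictionary layout and the function name.

```python
def sort_and_normalize(tracks):
    """Return the tracks of one vertex ordered by descending pT, with pT rescaled.

    `tracks` is a list of dicts that each carry a 'pt' entry (hypothetical layout).
    Normalizing to the summed pT of the vertex is an assumption, not the
    documented choice of the analysis.
    """
    ordered = sorted(tracks, key=lambda t: t["pt"], reverse=True)
    total = sum(t["pt"] for t in ordered)
    if total <= 0:
        raise ValueError("summed pT must be positive")
    # Copy each track so the input list is left unchanged.
    return [dict(t, pt=t["pt"] / total) for t in ordered]


vertex_tracks = [
    {"pt": 2.0, "eta": 0.1},
    {"pt": 5.0, "eta": -0.4},
    {"pt": 3.0, "eta": 0.7},
]
prepared = sort_and_normalize(vertex_tracks)
```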

Plot the variables fed to the NN

Divide data into Train and Test data sets
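
Shuffling before the split keeps both sets representative of the full sample. The sketch below shows one way to do this with the Python standard library; the 20% test fraction, the fixed seed, and the function name are assumptions chosen for illustration.

```python
import random


def shuffle_and_split(samples, test_fraction=0.2, seed=42):
    """Shuffle a copy of `samples` and return (train, test) lists.

    The test fraction and seed are illustrative defaults (assumptions).
    """
    rng = random.Random(seed)  # private generator, so results are reproducible
    shuffled = list(samples)
    rng.shuffle(shuffled)
    n_test = int(round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


train, test = shuffle_and_split(range(100))
print(len(train), len(test))
```

A fixed seed makes the split reproducible between notebook runs, which helps when comparing training configurations.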

Data preparation and classification for the NN

Principal Component Analysis (PCA)
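
For reference, PCA can be written in a few lines with NumPy: center the data, take the singular value decomposition, and project onto the leading right singular vectors. This is a generic sketch on random toy data and is not necessarily how the notebook computes it; the function name and the toy inputs are assumptions.

```python
import numpy as np


def pca(X, n_components):
    """Project the rows of X onto the first `n_components` principal axes.

    Returns (projected data, principal axes, explained variance per axis).
    """
    X = np.asarray(X, dtype=float)
    Xc = X - X.mean(axis=0)  # center each variable
    # Rows of Vt are the principal axes, ordered by decreasing singular value.
    _, S, Vt = np.linalg.svd(Xc, full_matrices=False)
    components = Vt[:n_components]
    explained_variance = S[:n_components] ** 2 / (len(X) - 1)
    return Xc @ components.T, components, explained_variance


# Toy data (assumption): 200 samples, 5 variables, two of them correlated.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
X[:, 1] = 0.8 * X[:, 0] + 0.2 * X[:, 1]
projected, components, explained_variance = pca(X, n_components=2)
```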

Keras model: NN with 2 hidden layers

Import libraries

Create function for NN model with 2 hidden layers

Create NN model structure and compile it
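
A minimal sketch of a Keras network with two hidden layers is given below. The layer widths, the ReLU activations, the single sigmoid output, the Adam optimizer, and the binary cross-entropy loss are all assumptions made for illustration; only the use of Keras and the two hidden layers come from the text.

```python
from tensorflow import keras


def build_model(n_inputs, n_hidden1=32, n_hidden2=16):
    """Build and compile a fully connected NN with two hidden layers.

    All hyperparameters here are illustrative assumptions, not the
    configuration used in the analysis.
    """
    model = keras.Sequential([
        keras.Input(shape=(n_inputs,)),
        keras.layers.Dense(n_hidden1, activation="relu"),  # hidden layer 1
        keras.layers.Dense(n_hidden2, activation="relu"),  # hidden layer 2
        keras.layers.Dense(1, activation="sigmoid"),       # signal probability
    ])
    model.compile(optimizer="adam",
                  loss="binary_crossentropy",
                  metrics=["accuracy"])
    return model


model = build_model(n_inputs=10)
model.summary()
```

With a single sigmoid output, the prediction can be read as the probability that a vertex is signal. A two-unit softmax output would be an equivalent alternative for two classes.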

NN model training

Save/Load NN model to/from file

Monitor performance during training

Model results

Predict the probability distribution of NN classes for Train and Test sets

Probability distribution for an input vertex to be signal

NN model optimization with Test set

Plot Train and Test prediction for Signal-Background separation as a function of BPIX radius

Background to Signal (B/S) ratios in Signal regions (S0-S6)

Background to Signal ratio

BP

IS

L1

L2

L3

L4

OS