SAL-T: Spatially Aware Linear Transformer for Jet Tagging

This repository contains code and scripts to preprocess jet data and train efficient transformer-based models for jet tagging on various datasets.

🛠️ Setup

1. Clone the helper Repository

git clone https://github.com/JavierZhao/l1-jet-id-hls4ml
cd l1-jet-id-hls4ml

2. Install Dependencies

pip install -e .

📦 Dataset Preparation

To download and process the hls4ml dataset :

python l1-jet-id-hls4ml/fast_jetclass/data/prepare_hls4ml_data.py \
  --root PATH-TO-DATA-DIR \
  --nconst [16|32|150] \
  --feats ptetaphi \
  --norm standard \
  --seed 42 \
  --kfolds 5

To download and process the Top Quark Tagging Reference Dataset:

python SAL-T4HEP/scripts/python process_top.py \
--input_dir PATH-TO-STORE-RAW-DATA \
--output_dir PATH-TO-STORE-PROCESSED-DATA \

To download and process the Quark Gluon Dataset:

python SAL-T4HEP/scripts/python process_qg.py \
--input_dir PATH-TO-STORE-RAW-DATA \
--output_dir PATH-TO-STORE-PROCESSED-DATA \

At this point, for the purpose of running our code, you no longer need the raw data, so you can safely delete that directory.

🚀 Running Training

To train our Linformer-based jet classifier:

chmod +x SAL-T4HEP/scripts/run_all.sh

./SAL-T4HEP/scripts/run_all.sh \
  --data_dir PATH-TO-DATA \
  --dataset [top|QG|hls4ml] \
  --save_dir PATH-TO-SAVE-RESULTS \
  --cluster_E \
  --cluster_F \
  --convolution \
  --batch_size 4096 \
  --d_model 16 \
  --d_ff 16 \
  --num_heads 4 \
  --proj_dim 4 \
  --num_particles [150|200] \
  --sort_by [kt|deltaR|pt]

Arguments:

--data_dir: Path to preprocessed data
--dataset: Dataset type (top, hls4ml, QG)
--save_dir: Output directory for logs and checkpoints
--cluster_E, --cluster_F: Enable spatial partitioning on keys/values
--convolution: Enable convolution layer in attention
--batch_size: Training batch size
--d_model, --d_ff, --num_heads, --proj_dim: Model hyperparameters
--num_particles: Number of input particles per jet
--sort_by: Sorting strategy (kt, pt, deltaR.)

Name		Name	Last commit message	Last commit date
Latest commit History 98 Commits
models		models
scripts		scripts
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SAL-T: Spatially Aware Linear Transformer for Jet Tagging

🛠️ Setup

1. Clone the helper Repository

2. Install Dependencies

📦 Dataset Preparation

🚀 Running Training

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

SAL-T: Spatially Aware Linear Transformer for Jet Tagging

🛠️ Setup

1. Clone the helper Repository

2. Install Dependencies

📦 Dataset Preparation

🚀 Running Training

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages