This is the PyTorch implementation of our paper: Drift-Aware Continual Tokenization for Generative Recommendation
We propose DACT, a Drift-Aware Continual Tokenization framework with two stages:
- Tokenizer Fine-tuning: Augmented with a jointly trained Collaborative Drift Identification Module (CDIM) that outputs item-level drift confidence and enables differentiated optimization for drifting and stationary items.
- Hierarchical Code Reassignment: Using a relaxed-to-strict strategy to update token sequences while limiting unnecessary changes.
cd rqvae
bash finetune_book_dact.shcd rqvae
bash generate_code_dact.shcd TIGER-backbone
bash finetune.shcd LC-Rec-backbone/scripts_qwen
bash train.sh