TritonParse Release Notes (last 24 commits)
- Date range: 2025-10-14 to 2025-11-03
- Scope: IR Analysis enhancements (beta), Reproducer template extensions, code viewer improvements, bug fixes.
Highlights
- IR Analysis (Beta): New analysis capabilities for visualizing Software Pipelining (SWP), BufferOps statistics, and loop schedules in Triton IR. Note: This is a beta feature.
- Variable Location Tracking: Complete location alias tracking system for mapping IR locations back to source code with frontend visualization.
- TritonBench Template: New reproducer template for easy TritonBench integration and kernel benchmarking.
- Code Viewer Enhancements: Full Python source extraction, function highlighting, and performance optimizations.
- Reproducer Refactoring: AST-based function extraction eliminates code duplication and simplifies template maintenance.
Changes by area
IR Analysis (Beta)
- Software Pipelining (SWP) visualization (PR #189):
  - Analyzes inner scf.for loops and identifies prologue, loop_body, and epilogue stages
  - Tracks tt.load and tt.dot operations through TTIR -> TTGIR -> Python source mappings
  - Frontend displays simplified source code with SWP stage information
  - Limitations: Does not support Warp Specialization or Blackwell operators yet
- BufferOps backend information (PR #181):
  - Statistical analysis of buffer operations (tt.load/store, amdgpu.buffer_load/store, global_load/store) at TTGIR and AMDGCN levels
  - Useful for AMD GPU backend optimization analysis
- Web frontend IR Analysis page (PR #184):
  - New dedicated page at /ir-analysis route with integrated display for loop schedules and BufferOps statistics
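
As a rough illustration of the kind of statistic the BufferOps analysis reports, the sketch below counts buffer-related operation names in a piece of IR or assembly text. It is a minimal sketch under stated assumptions, not TritonParse's implementation: the function name count_buffer_ops is hypothetical, the op-name list is copied from the bullet above, and the sample IR is simplified for readability.

```python
import re
from collections import Counter

# Op-name prefixes taken from the release notes; the exact set that
# TritonParse tracks is an assumption here.
BUFFER_OPS = (
    "tt.load",
    "tt.store",
    "amdgpu.buffer_load",
    "amdgpu.buffer_store",
    "global_load",
    "global_store",
)

# Match a tracked name only when it is not the tail of a longer identifier
# (so "tt.load" does not match inside "ttx.load"). There is no right-hand
# boundary, so "global_load" also matches "global_load_dwordx4".
_OP_RE = re.compile(
    r"(?<![\w.])(" + "|".join(re.escape(op) for op in BUFFER_OPS) + r")"
)


def count_buffer_ops(ir_text):
    """Count occurrences of each tracked op name in IR or assembly text."""
    return Counter(m.group(1) for m in _OP_RE.finditer(ir_text))


# Simplified, hypothetical TTGIR-like snippet.
sample = """
%0 = tt.load %ptr
%1 = amdgpu.buffer_load %base[%off]
tt.store %out, %0
amdgpu.buffer_store %1, %base[%off]
"""
for op, n in sorted(count_buffer_ops(sample).items()):
    print(f"{op}: {n}")
```

Comparing the counts for tt.load against amdgpu.buffer_load at the TTGIR level is one way such statistics could hint at how many loads were lowered to buffer operations.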
Variable Location Tracking
Complete three-part implementation (PR #186, #187, #188):
- Fixed #loc storage key conflict in IR parser
- Added location alias parsing support in ir_parser.py and trace_processor.py
- Frontend visualization with CSS styling and interactive location display in Code Viewer
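
To make the idea of location aliases concrete, here is a minimal sketch of resolving MLIR-style location definitions back to a file, line, and column. It assumes two textual shapes, a file location and a named alias that wraps another location; the function parse_locations and the returned dictionary layout are hypothetical and do not describe the actual code in ir_parser.py.

```python
import re

# Two definition shapes are handled (assumed MLIR-style textual forms):
#   #loc1 = loc("/path/kernel.py":12:4)    file, line, column
#   #loc2 = loc("x_ptr"(#loc1))            named alias of another location
_FILE_LOC = re.compile(r'^(#loc\d*) = loc\("([^"]+)":(\d+):(\d+)\)$')
_ALIAS_LOC = re.compile(r'^(#loc\d*) = loc\("([^"]+)"\((#loc\d*)\)\)$')


def parse_locations(ir_text):
    """Return {loc_id: {"file", "line", "col", "name"}} with aliases resolved."""
    files, aliases = {}, {}
    for raw in ir_text.splitlines():
        line = raw.strip()
        if m := _FILE_LOC.match(line):
            files[m.group(1)] = (m.group(2), int(m.group(3)), int(m.group(4)))
        elif m := _ALIAS_LOC.match(line):
            aliases[m.group(1)] = (m.group(2), m.group(3))

    resolved = {
        loc_id: {"file": f, "line": ln, "col": col, "name": None}
        for loc_id, (f, ln, col) in files.items()
    }
    for loc_id, (name, target) in aliases.items():
        # Follow alias chains, guarding against cycles in malformed input.
        seen = {loc_id}
        while target in aliases and target not in seen:
            seen.add(target)
            target = aliases[target][1]
        if target in files:
            f, ln, col = files[target]
            resolved[loc_id] = {"file": f, "line": ln, "col": col, "name": name}
    return resolved


ir = '''
#loc = loc("/tmp/kernel.py":10:0)
#loc1 = loc("/tmp/kernel.py":12:4)
#loc2 = loc("x_ptr"(#loc1))
'''
print(parse_locations(ir)["#loc2"])
```

Note that the bare #loc and the numbered #loc1 are kept as distinct dictionary keys in this sketch, which is the kind of distinction a storage key conflict would blur.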
Reproducer System
- TritonBench template support (commit 3493ac8):
  - New template: tritonparse/reproducer/templates/tritonbench.py
  - CLI option: --template tritonbench for TritonBench-compatible reproducers
  - Integrates with TritonBench's BenchmarkOperator and benchmark harness
- AST-based refactoring (PR #178):
  - New module: tritonparse/reproducer/function_extractor.py using Python AST
  - Simplified example.py template from ~370 lines to ~20 lines
- Bug fixes:
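
The AST-based refactoring above relies on a general technique: parse a utility module once and copy out the exact source of the functions a template needs, instead of duplicating them by hand. The sketch below shows that technique with the standard ast module. The function name extract_functions and the sample source are hypothetical; this is not the API of function_extractor.py.

```python
import ast


def extract_functions(source, names):
    """Return the exact source text of each top-level function in `names`."""
    tree = ast.parse(source)
    found = {}
    for node in tree.body:
        is_func = isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        if is_func and node.name in names:
            # get_source_segment slices the original text using the node's
            # position info, so comments and formatting inside the body are
            # preserved. Decorators are not part of the returned segment.
            found[node.name] = ast.get_source_segment(source, node)
    return found


utils_source = '''
import math

def helper(x):
    # doubled for illustration
    return x * 2

def unused():
    pass
'''
snippets = extract_functions(utils_source, {"helper"})
print(snippets["helper"])
```

A template can then splice the returned strings into the generated reproducer, so the shared helpers live in one place and the template itself stays short.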
Callsite Location Support
- TTIR/TTGIR callsite location (PR #190):
  - Extended IR parser to extract callsite location information
  - Better debugging with call graph information and test coverage
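
For readers unfamiliar with callsite locations, the following sketch shows how a call chain could be recovered from MLIR-style callsite definitions of the form callsite(callee at caller). It is an illustration only: the helper names parse_callsites and call_chain are hypothetical, and the textual form handled here is an assumption about what the IR dump contains, not a description of TritonParse's parser.

```python
import re

# Assumed textual form of a callsite location definition:
#   #loc3 = loc(callsite(#loc1 at #loc2))
# where #loc1 is the callee location and #loc2 is the caller location.
_CALLSITE = re.compile(
    r"^(#loc\d*) = loc\(callsite\((#loc\d*) at (#loc\d*)\)\)$"
)


def parse_callsites(ir_text):
    """Map each callsite loc id to its (callee, caller) loc ids."""
    callsites = {}
    for raw in ir_text.splitlines():
        m = _CALLSITE.match(raw.strip())
        if m:
            callsites[m.group(1)] = (m.group(2), m.group(3))
    return callsites


def call_chain(loc_id, callsites):
    """Walk caller links outward: innermost location first, outermost last."""
    chain = []
    seen = set()  # guards against cycles in malformed input
    while loc_id in callsites and loc_id not in seen:
        seen.add(loc_id)
        callee, caller = callsites[loc_id]
        chain.append(callee)
        loc_id = caller
    chain.append(loc_id)
    return chain


ir = """
#loc4 = loc(callsite(#loc2 at #loc3))
#loc5 = loc(callsite(#loc1 at #loc4))
"""
print(call_chain("#loc5", parse_callsites(ir)))
```

Each id in the resulting chain could then be resolved to a file and line, giving a small call stack for the operation.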
Code Viewer & Frontend
- Full Python source extraction (commit 2976887):
  - Enhanced structured_logging.py to extract complete Python source files
- Full file display with function highlighting (commit 220d5a4):
  - CodeViewer now supports displaying entire source files with function-level highlighting
- CodeComparisonView performance optimization (commit c17e584):
  - Significant rendering performance improvements for large files
  - Reduced re-renders and improved memory efficiency
Website & Maintenance
- Dependency updates (PR #179): Added automation script website/scripts/update_deps.sh
- Copyright updates (PR #183): Updated copyright headers across source files
Compatibility notes
- No breaking changes: All updates are backward compatible with v0.3.0.
- IR Analysis (Beta): New optional feature accessible through web UI.
- TritonBench template: Optional, does not impact existing reproducer generation.
Upgrade guidance
- Using IR Analysis (Beta):
  - Open web UI and navigate to IR Analysis page after parsing
  - View SWP stage information (prologue/loop_body/epilogue) and BufferOps statistics
  - Note: Beta feature with some limitations on advanced pipelining patterns
- Generating TritonBench reproducers:
    tritonparseoss reproduce trace.ndjson.gz --line <N> --template tritonbench --out-dir <output>
- Code viewer enhancements: Automatically enabled with full source display and function highlighting