ChaosAbyssF/tf_graph

Tensorflow Graph Optim

  • Optimize TensorFlow graphs, especially customer-released .pb / .spec files
# Convert a spec file to a pb and optimize it
python convert_spec_to_frozen_graph_def.py --graph /path/to/graph.spec

# Run inference on the pb model for latency testing (edit the musa_plugin path in the graph runner script)
python graph_runner.py --graph /path/to/graph.spec --batch_size 1024 --platform cpu/cuda/musa

# Profile the pb model
bash get_getprofile_${platform}.sh --graph /path/to/graph.spec

# Find bad nodes
# Fix the pb file
python scan_fix_frozen_pb.py --pb /path/to/graph.pb --fix
python detect_bad_node.py --graph /path/to/graph.spec --platform cpu/cuda/musa

# XLA fusion analysis (dump HLO/PTX + fusion statistics + latency comparison with XLA on vs. off)
python run_xla_fusion_analysis.py \
  --pb /path/to/graph_frozen.pb \
  --platform cuda \
  --batch-size 1024 \
  --warmup 10 \
  --iters 50 \
  --output-dir fusion_analysis

# Analyze an existing XLA dump directory on its own
python analyze_hlo_ptx.py \
  --dump-dir fusion_analysis/run_xxx/xla_dump \
  --json fusion_analysis/run_xxx/reports/xla_dump_report.json
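
The `--warmup` and `--iters` flags above suggest the usual latency-measurement pattern: run a few untimed iterations, then time a fixed number of runs and summarize them. The sketch below illustrates that pattern with the standard library only. It is not taken from `run_xla_fusion_analysis.py`; the names `measure_latency`, `speedup`, and `step` are hypothetical, and `step` stands in for one inference call. On an asynchronous backend such as a GPU, `step` would need to block until the result is ready for the timings to mean anything.

```python
import statistics
import time
from typing import Callable, Dict


def measure_latency(step: Callable[[], object],
                    warmup: int = 10,
                    iters: int = 50) -> Dict[str, float]:
    """Time `step` over `iters` calls after `warmup` untimed calls.

    Hypothetical helper: the repository's scripts may measure differently.
    """
    for _ in range(warmup):
        # Untimed: absorbs one-off costs such as compilation and allocation.
        step()

    samples_ms = []
    for _ in range(iters):
        start = time.perf_counter()
        step()
        samples_ms.append((time.perf_counter() - start) * 1e3)

    return {
        "mean_ms": statistics.mean(samples_ms),
        "median_ms": statistics.median(samples_ms),
        "min_ms": min(samples_ms),
        "max_ms": max(samples_ms),
    }


def speedup(baseline_ms: float, candidate_ms: float) -> float:
    """Return baseline / candidate; a value above 1 means the candidate is faster."""
    return baseline_ms / candidate_ms


if __name__ == "__main__":
    # Stand-in workload; in practice this would be one model invocation.
    stats = measure_latency(lambda: sum(range(10_000)), warmup=10, iters=50)
    print(stats)
```

Comparing two configurations (for example XLA off vs. on) then reduces to calling `measure_latency` once per configuration and passing the two medians to `speedup`. The median is usually a steadier summary than the mean when a few iterations are disturbed by other system activity.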
