Skip to content

docs: update examples to use hg38 as primary reference genome#1046

Merged
etal merged 2 commits intoetal:masterfrom
haoyu-haoyu:docs/hg38-primary-reference
Apr 15, 2026
Merged

docs: update examples to use hg38 as primary reference genome#1046
etal merged 2 commits intoetal:masterfrom
haoyu-haoyu:docs/hg38-primary-reference

Conversation

@haoyu-haoyu
Copy link
Copy Markdown
Contributor

Summary

Update documentation to use hg38/GRCh38 as the primary reference genome in all examples, while preserving hg19 references for legacy workflows.

Background

PR #837 added hg38 data files (data/refFlat_hg38.txt, data/access-10kb.hg38.bed) but the documentation still defaulted to hg19 examples. hg38 has been the standard reference for over 10 years.

Changes

File Changes
doc/pipeline.rst 13 hg19→hg38 updates across batch, target, access, antitarget, autobin, reference sections
doc/quickstart.rst 6 updates in download instructions and batch examples
doc/nonhybrid.rst 1 update in WGS batch example
doc/scripts.rst 1 update in guess_baits.py example

Key decisions

  • All command examples now use data/access-10kb.hg38.bed and data/refFlat_hg38.txt (both already exist in repo)
  • UCSC download links updated to hg38 equivalents
  • Legacy note preserved: "hg19 access and annotation files are also available in data/ for legacy workflows"
  • Mappability link corrected: hg38 does not have encodeDCC/wgEncodeMapability/ — points to bigZips/ instead, with hg19 fallback link

Ref #822

hg38/GRCh38 data files were added in PR etal#837 but documentation still
defaulted to hg19 examples. Update pipeline, quickstart, nonhybrid,
and scripts docs to show hg38 as the primary reference while noting
hg19 files remain available for legacy workflows.

Ref etal#822
hg38 does not have an encodeDCC/wgEncodeMapability directory on UCSC.
Point to the hg38 bigZips directory for current data, and keep the
hg19 ENCODE link for legacy workflows.
Copy link
Copy Markdown
Owner

@etal etal left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This looks good, thanks!

@codecov
Copy link
Copy Markdown

codecov bot commented Apr 14, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 66.41%. Comparing base (2f6f3e0) to head (5be29b3).
⚠️ Report is 1 commits behind head on master.

Additional details and impacted files
@@           Coverage Diff           @@
##           master    #1046   +/-   ##
=======================================
  Coverage   66.41%   66.41%           
=======================================
  Files          72       72           
  Lines        7409     7409           
  Branches     1303     1303           
=======================================
  Hits         4921     4921           
  Misses       2049     2049           
  Partials      439      439           
Flag Coverage Δ
unittests 66.41% <ø> (ø)

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

@etal etal merged commit 6d002c4 into etal:master Apr 15, 2026
15 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants