Skip to content

Commit f678667

Browse files
authored
Merge pull request #33 from databio/dev
release v0.6
2 parents 8bec5f0 + 57ae0b6 commit f678667

File tree

11 files changed

+439
-281
lines changed

11 files changed

+439
-281
lines changed

CHANGELOG.md

Lines changed: 24 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -1,22 +1,41 @@
11
# Change log
22
All notable changes to this project will be documented in this file.
33

4+
## [0.7.0] -- Unreleased
5+
6+
7+
8+
## [0.6.0] -- 2017-12-13
9+
10+
### Added
11+
- Single-end reads are now allowed
12+
- Pipeline can now accommodate `.bam` input files
13+
- Added a single-base bed file output in addition to the smoothed version
14+
15+
### Changed
16+
- Fixed a bug with peak counting for fseq
17+
- Fixed a bug with reporting estimated library sizes
18+
- Fixed issues with TSS enrichment calculation that could lead to stalled jobs or excess CPU use
19+
- Reduced verbosity of error messages for some tools
20+
- Reduced amount of resources requested by default
21+
- Introduced requirement on pypiper v0.7
22+
423
## [0.5.0] -- 2017-09-13
524

625
### Added
7-
- Adds rudimentary figure reporting
26+
- Added rudimentary figure reporting
827

928
### Changed
1029
- Changed default trimmer from trimmomatic to skewer
11-
- Make output from several tasks less verbose to make logs cleaner
12-
- Fixes an issue that left behind temporary samtools files if the job was killed
30+
- Made output from several tasks less verbose to make logs cleaner
31+
- Fixed an issue that left behind temporary samtools files if the job was killed
1332

1433
## [0.4.0] -- 2017-07-21
1534

1635
### Added
1736
- Added [fseq](https://github.com/aboyle/F-seq) as a peak caller option
18-
- Peak caller is specified by a command line argument (defaults to macs2)
19-
- Count of called peaks is now reported as a pipeline result.
37+
- Peak caller is now specified by a command line argument (defaults to macs2)
38+
- Count of called peaks is now reported as a pipeline result
2039
- Add R and ggplot2 as requirements
2140

2241
### Changed

README.md

Lines changed: 6 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -76,7 +76,7 @@ You have two options for running the pipeline.
7676

7777
### Run option 1: Running the pipeline script directly
7878

79-
To see the command-line options for usage, see [usage.txt](usage.txt), which you can get on the command line by running `pipelines/ATACseq.py --help`. You just need to pass a few command-line parameters to specify sample_name, reference genome, input files, etc. See example command in [cmd.sh](cmd.sh) using test data.
79+
To see the command-line options for usage, see [usage.txt](usage.txt), which you can get on the command line by running `pipelines/ATACseq.py --help`. You just need to pass a few command-line parameters to specify sample_name, reference genome, input files, etc. See [example commands](example_cmd.txt) that use test data.
8080

8181
To run on multiple samples, you can just write a loop to process each sample independently with the pipeline, or you can use *option 2*...
8282

@@ -139,6 +139,11 @@ URL="http://hgdownload.soe.ucsc.edu/goldenPath/hg38/database/refGene.txt.gz"
139139
wget -O ${GENOME}_TSS_full.txt.gz ${URL}
140140
zcat ${GENOME}_TSS_full.txt.gz | awk '{if($4=="+"){print $3"\t"$5"\t"$5"\t"$4"\t"$13}else{print $3"\t"$6"\t"$6"\t"$4"\t"$13}}' | LC_COLLATE=C sort -k1,1 -k2,2n -u > ${GENOME}_TSS.tsv
141141
echo ${GENOME}_TSS.tsv
142+
143+
Mouse:
144+
GENOME="mm10"
145+
URL="http://hgdownload.soe.ucsc.edu/goldenPath/mm10/database/refGene.txt.gz"
146+
142147
```
143148

144149
Another option from Gencode GTF:

examples/gold_atac/README.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -13,10 +13,10 @@ python get_geo.py -i ~/code/ATACseq/examples/gold_atac/metadata/gold_atac_gse.cs
1313

1414
I used resulting file [metadata/annocomb_gold_atac_gse.csv](metadata/annocomb_gold_atac_gse.csv) to create the looper metadata sheet, [metadata/gold_atac_annotation.csv](metadata/gold_atac_annotation.csv).
1515

16-
I create project config file and sampled test data. The SRA fastq files should be stored in a folder `${SRAFQ}`, and then this will run with looper with no additional changes.
16+
I create project config file and sampled test data. The SRA fastq files should be stored in a folder pointed to by environment variable `SRAFQ`, and then this will run with `looper` with no additional changes.
1717

1818
## Run pipeline
1919

2020
```
2121
looper run ${CODE}ATACseq/examples/gold_atac/metadata/project_config.yaml -d
22-
```
22+
```

examples/test_project/test_config.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -30,4 +30,4 @@ implied_columns:
3030
mouse:
3131
genome: mm10
3232
macs_genome_size: "mm"
33-
prealignments: null
33+
prealignments: "mouse_chrM2x"

pipeline_interface.yaml

Lines changed: 7 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -1,22 +1,22 @@
11
protocol_mapping:
22
ATAC: ATACseq.py
3-
ATAC-SEQ: ATACseq
3+
ATAC-SEQ: ATACseq.py
44

55
pipelines:
66
ATACseq.py:
77
name: ATACseq
88
path: pipelines/ATACseq.py
99
looper_args: True
10-
required_input_files: [read1, read2]
10+
required_input_files: [read1]
1111
all_input_files: [read1, read2]
1212
ngs_input_files: [read1, read2]
1313
arguments:
1414
"--sample-name": sample_name
1515
"--genome": genome
1616
"--input": read1
17-
"--input2": read2
1817
"--single-or-paired": read_type
1918
optional_arguments:
19+
"--input2": read2
2020
"--frip-ref-peaks": FRIP_ref
2121
"--prealignments": prealignments
2222
"--genome-size": macs_genome_size
@@ -40,14 +40,14 @@ pipelines:
4040
file_size: "0.5"
4141
cores: "4"
4242
mem: "16000"
43-
time: "02-00:00:00"
43+
time: "00-04:00:00"
4444
micro:
4545
file_size: "1"
4646
cores: "8"
4747
mem: "32000"
48-
time: "07-00:00:00"
48+
time: "02-00:00:00"
4949
milli:
50-
file_size: "4"
50+
file_size: "10"
5151
cores: "16"
5252
mem: "64000"
53-
time: "07-00:00:00"
53+
time: "03-00:00:00"

0 commit comments

Comments
 (0)