EPA Publication List: NSCEP #359

### EPA Publication List

* *Agency*: Environmental Protection Agency
* *Agency Division*: National Service Center for Environmental Publications (NSCEP)
* *Data Type*: Various
* *Data Format*: PDF

I have mined the metadata for the [EPA Publication List](https://nepis.epa.gov/EPA/html/pubs/pubtitle.html) and hosted the [direct download links](https://raw.githubusercontent.com/hkuchampudi/GovDataDump/master/EPA%20Publications%20List/directLinks.txt) to the PDFs in [my repository](https://github.com/hkuchampudi/GovDataDump/tree/master/EPA%20Publications%20List). I need help mining the documents themselves, as I do not have the storage space to download them.

## Downloading the Documents
After downloading the `directLinks.txt` file, run the following command to download the files in bulk:
```
awk 'FNR>=[Starting_Line_Number] && FNR<=[Ending_Line_Number]' [Links_Location] | tr -d '\r' | while read -r link; do wget -t 10 -T 10 -U "Mozilla" "$link"; done
```
Replace the placeholders as follows:
 - `[Starting_Line_Number]`: the line number of the first link to download
 - `[Ending_Line_Number]`: the line number of the last link to download
 - `[Links_Location]`: the path to the `directLinks.txt` file downloaded from the repository
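
For repeated runs, the one-liner above could be wrapped in a small script along the following lines. This is only a sketch: the function names `select_links` and `download_range` and the default output directory `epa_pdfs` are hypothetical and not part of the repository. It uses the same `wget` retry, timeout, and user-agent options as the command above, and adds `-c` (resume partial downloads) and `-P` (target directory), which are assumptions about what is convenient, not requirements.

```shell
#!/usr/bin/env bash

# Print lines START..END of FILE, stripping carriage returns
# (directLinks.txt may use Windows-style line endings).
select_links() {
  local start=$1 end=$2 file=$3
  awk -v s="$start" -v e="$end" 'FNR >= s && FNR <= e' "$file" | tr -d '\r'
}

# Download every link in the given line range into OUT_DIR.
# OUT_DIR defaults to ./epa_pdfs (a hypothetical name chosen for this sketch).
download_range() {
  local start=$1 end=$2 file=$3 out_dir=${4:-epa_pdfs}
  mkdir -p "$out_dir"
  select_links "$start" "$end" "$file" | while IFS= read -r link; do
    # Skip blank lines rather than passing an empty argument to wget.
    [ -n "$link" ] || continue
    wget -c -t 10 -T 10 -U "Mozilla" -P "$out_dir" "$link" \
      || echo "failed: $link" >&2
  done
}

# Example (not run automatically): download links 1 through 500.
# download_range 1 500 directLinks.txt
```

Failed links are reported on standard error instead of stopping the loop, so a single unreachable document does not interrupt the rest of the range.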

## Download Information

| Property                  | Value           |
|---------------------------|-----------------|
| Number of links/documents | 75973           |
| Estimated total file size | 16.386551273 GB |
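
Since the full set is too large for one person to hold comfortably, the figures in the table can be used to estimate the average document size and to split the link list into line ranges that different volunteers could download. The sketch below assumes the total is given in decimal gigabytes (1 GB = 10^6 KB); the function names `avg_kb` and `chunk_ranges` are hypothetical.

```shell
#!/usr/bin/env bash

# Average size per document in KB, given a total in GB and a document count.
# Assumes decimal units (1 GB = 1,000,000 KB).
avg_kb() {
  awk -v gb="$1" -v n="$2" 'BEGIN { printf "%.1f\n", gb * 1e6 / n }'
}

# Print "START END" line ranges that cover 1..TOTAL in chunks of SIZE links.
# Each range can be used as [Starting_Line_Number] and [Ending_Line_Number].
chunk_ranges() {
  local total=$1 size=$2 start=1 end
  while [ "$start" -le "$total" ]; do
    end=$(( start + size - 1 ))
    if [ "$end" -gt "$total" ]; then
      end=$total
    fi
    printf '%s %s\n' "$start" "$end"
    start=$(( end + 1 ))
  done
}

# Example: average size for the figures in the table, then ranges of 10000 links.
avg_kb 16.386551273 75973
chunk_ranges 75973 10000
```

With the table's figures, the average works out to roughly 216 KB per document, and chunks of 10000 links give eight ranges, the last one ending at line 75973.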



