Skip to content

Releases: SojaSurfer/Webscraper

1.1 Removed Interviews

14 Nov 17:56

Choose a tag to compare

  • removed 109 speeches with[Ii]nterview substring inside title
  • 964 speeches remained

visualization

Raw result of Web scraping

14 Nov 17:23

Choose a tag to compare

Result from scraping presidency.ucsb.edu on 10.11.2024. It contains 1073 speeches.

The zip file includes

  • a folder containing all speeches as plain txt files
  • a metadata table with one row per speech as csv & excel file
  • a txt file with all urls which were scraped
  • a quick png visualization of the metadata

Screenshot 2024-11-14 at 18 49 10