Simple python script for scraping tests off of HKN's (UC Berkeley) test archive. Used for https://github.com/kcparashar/exams
clayshieh cc4e39c4ba added readme and changed python file name 1 year ago
hkn_scrape.py added readme and changed python file name 1 year ago
readme.md added readme and changed python file name 1 year ago

readme.md

HKN Test Scraper Script

What is it?

A Python script that scrapes UC Berkeley Eta Kappa Nu’s (HKN) test archives and downloads the files in an organized file structure. Uses multithreading to improve download performance in terms of waiting for the HTTP requests.

Usage

  • Navigate to the directory that you want the downloaded files to appear in
  • run python hkn_scrape.py

Future Improvements

  • Add support for Python3
  • Maybe use a more efficient library than BeautifulSoup