Web scraping is a technique for obtaining and reorganizing data from websites. This workshop will cover the fundamentals of web scraping with the help of popular Python modules. Accessing websites, processing information, and storing data in a CSV file will all be practiced. This workshop is for data scientists who are new to web scraping but have some Python experience or have previously attended the Introduction to Python course.
Workshop goals
- Understand about Data Collection
- Learn the concepts of Web scraping
Pre-requisites
STOP: before starting this workshop, please attend the following Digital Scholarship Lab workshop(s) before completing this one:
OR make sure you’re comfortable with the following concepts (study suggestions in parentheses)
- Python (resource 1)
- Databases (resource 2)
Workshop Content
Time Estimate | Section | Keypoints |
---|---|---|
Pre-Workshop | Setup | Install required software and download files required for the lesson |
00:00 | 1. Web Scraping | Key question (FIXME) |
00:00 | Finish | Please fill out the workshop survey |
The actual schedule may vary slightly depending on the topics and exercises chosen by the instructor.
Workshop Recording
Survey
Thank you for attending this workshop or reading through the workshop material! If you could take 3-5 min to respond to our anonymous survey, we can continue to improve this workshop. We appreciate any and all feedback!
Next Up…
Check out these workshops after you’ve completed this one: