Lecture notes
- L0: Course Introduction (slides)
- L1: Introduction to Big Data (slides)
- Linux
- L2: Using Linux as a Data Scientist (slides)
- Git
- Python
- L4: Statistical Modeling with Python (slides)
- L5: Web Scraping with Python
- L5.1 Basic Web Scraping with Python (slides)
- L5.1 Web Scraping with selenium
- Distributed computing
Note: Interactive slides are based on Jupyter notebook and aliyun E-MapReduce service. Please find them on the aliyun server. They are only available within the Autumn semester.