Contributors

We thank Joe Hellerstein, Bin Yu, and Fernando Perez for their significant efforts towards building the first iterations of Data 100.

This textbook also contains substantial contributions from past Data 100 students. We list the contributors below and thank them for their effort in creating content for the textbook.

| Name | Contributions |
| --- | --- |
| Ananth Agarwal | 9.1 (Relational Databases), 9.2 (SQL Queries), 15.3 (Cross-Validation) |
| Ashley Chien | 9.3 (SQL Joins), Reference Table Appendix |
| Andrew Do | 8.1 (Python String Methods), 8.2 (Regular Expressions), 8.3 (Regex in Python and pandas) |
| Sona Jeswani | 18.1 (Introduction to Hypothesis Testing), 18.2 (Permutation Testing), 18.3 (Bootstrapping for Linear Regression) |
| Tiffany Jann | 7.1 (HTTP), 11.3 (Convexity) |
| Andrew Kim | 7.1 (HTTP), 8.1 (Python String Methods), 8.2 (Regular Expressions), 8.3 (Regex in Python and pandas) |
| Jun Seo Park | 9.1 (Relational Databases), 9.2 (SQL Queries), Reference Table Appendix |
| Allen Shen | 2.2 (Probability Overview), 11.3 (Convexity), 12 (Probability and Generalization), 15 (Bias-Variance Tradeoff) |
| Katherine Yen | 9.3 (SQL Joins), 15.3 (Cross-Validation) |
| Daniel Zhu | 2.2 (Probability Overview), 8 (Working with Text), 12 (Probability and Generalization), 15 (Bias-Variance Tradeoff) |