140 likes | 195 Views
Explore UC Berkeley's popular Data 8 course covering inferential thinking, computational thinking, and social issues in data analysis. Emphasizes hands-on work and critical thinking skills for all skill levels. No prerequisites required.
E N D
UC Berkeley’s Data 8 The Foundations of Data Science: Inferential Thinking, Computational Thinking, and Real-World Relevance Presented by: Ava Meredith, Seattle Central College
What is Data 8? • Data 8 is a popular introductory Data Science class at UC Berkeley • Designed to be accessible to a broad range of students without the typical prerequisites for a data science class • Data 8's unique model: inferential thinking, computational thinking, and real-world relevance • Focus on social issues in data analysis • All materials for the course are available for free online under a CC license.
The Data 8 Teaching Philosophy • Represents a shift from traditional teaching each of the individual concepts in a course. • Introductory courses in statistics, computer science, writing, and ethics (among others) combined into a single introductory course.
Data 8 Goals • Diversity • Equity • Pedagogical Clarity • Scalability • Depth • No computational barrier to entry
Core Concepts • Critical thinking • Don't take your data for granted • Use the combination of CS + Stats as a feature, not a bug • Focus on hands on work • Determine if your inference is sound • Experiment • Know the right statistical tools for the job
Learn about data limitations • Quantify and understand uncertainty in data • Turn your data analysis into a decision • Think of ways that you could be wrong • Consider edge-cases
Focus on main ideas (shield the students from non essential topics) • Use the data science module rather than many package APIs • Use JupyterHub (no need for students to setup environment)
Abstract cleaning data by providing pre-collected/cleaned data • Provide further resources • Aim the course for anybody, not just statistics or CS majors.
Intersections of Topics • Intersectionality is a feature, not a bug • Connect CS and statistics concepts • Use interactivity to let people explore
Topics covered • Programming fundamentals • Statistics, sampling, and hypothesis testing • Inference, prediction, and models • Comparing distributions
Connector courses Connector courses offer the ways in which data science is applied in a domain knowledge field
Tech Stack • Managing course content - Jupyter notebooks • Programming language - Python 3 • Primary data object and functions - Use of data analytics packages in Python (Data 8 wraps several) • Handling the Python environment - Python dev environment managed with miniconda
Next Steps View the course onlinehttp://data8.org/ Free online textbook: https://www.inferentialthinking.com/chapters/intro Data Science Academic Resource Kit:https://data.berkeley.edu/education/ark