https://studiegids.vu.nl/en/courses/2024-2025/X_400645After completing this course, a student can: 1. import, clean, transform, filter, and explore data in Python 2. store and retrieve semi-structured data in and from various kinds of database 3. create appropriate and well-formatted visualizations and tables 4. fit statistical models and train basic machine learning models 5. address a research question using a large dataset and report on their findingsThis course aims to integrate various aspects of modern data science and teach the fundamentals of working with big data. Topics include working with structured data; statistical data analysis; visualization of data; preparing data for processing; storing unstructured data; fitting statistical models; training basic machine learning models. Python is used throughout this hands-on project-based course.Lectures and tutorial sessionsHand-in individual assignments plus a group project, which entails an oral presentation and a final written report. The weights will be specified on Canvas; the weighted average needs to be 5.5 or higher. There is no resit for this course.2BAProgramming experience in any language is necessary