Data Wrangling in Python

Please note that due to the COVID-19 pandemic, all SSCC training will be offered online. Additional details and course links will be provided after registration.

"Data Wrangling" is the process of preparing data for analysis, which includes importing, cleaning, recoding, restructuring, combining, and anything else data needs before it can be analyzed. Data wrangling is a critical skill for research. This course teaches wrangling skills using mostly the data wrangling tools of the Pandas package in Python. Pandas is a collection of functions/methods for working with data similar to R's tidyverse.

This course will cover importing data, cleaning data, creating and transforming variables, merging data, and plotting. It is a hands-on class with time devoted to practicing using these tools to ready data for analysis. It is designed for people who have no experience with Python and pandas, Python users who would like to learn pandas will also benefit from the class. Graduate students who will work in Python and pandas may choose to take this course at the beginning of their graduate student career or wait until they're ready to start doing research.

Instructor: Dimond
Room: Online Training
Dates: 1/11, 1/12, 1/13, 1/14, 1/15, 1/19, 1/20, 1/21, 1/22
Time: 1:30 - 3:30

Each session of this class builds on the material taught in the previous sessions. If you cannot attend all of the class's sessions but still want to take the class, you must contact the Help Desk, find out what will be covered in the session(s) you will miss, and learn that material on your own before the next session. In most cases the material can be found in the SSCC Knowledge Base.