Data Wrangling In Stata

"Data Wrangling" is the process of preparing data for analysis, which includes importing, cleaning, recoding, restructuring, combining, and anything else data needs before it can be analyzed. Data wrangling is a critical skill for research.

In this class you'll learn how to wrangle data using Stata. We'll cover some of the key concepts and workflows of data science as well as the structure and logic of Stata. We'll emphasize real-world issues like handling missing data and checking for errors, as well as best practices for research computing and reproducibility. Our goal is give you a strong foundation you can build on to become an expert data wrangler.

Students should take SSCC's Introduction to Stata or have equivalent experience before taking this class. You should be comfortable writing do files and understand all the component parts of a command like sum x if y>5, detail. Graduate students may choose to take this class at the start of their graduate student career or wait until they are ready to start doing research. If you've taken Stata for Researchers before you probably don't need to take Data Wrangling in Stata, but you may want to look over the online materials (when they're available) to see what's new.

Instructor: Dimond
Room: 6232 Sewell Social Sciences Building
Dates: 9/11, 9/16, 9/18, 9/23, 9/25, 9/30, 10/2
Time: 11:00 - 1:00

Each session of this class builds on the material taught in the previous sessions. If you cannot attend all of the class's sessions but still want to take the class, you must contact the Help Desk, find out what will be covered in the session(s) you will miss, and learn that material on your own before the next session. In most cases the material can be found in the SSCC Knowledge Base.