Tidying Data with Python and OpenRefine

Date:

Thursday, April 16, 2020, 1:00pm to 3:30pm

Location:

Cambridge

In his paper "Tidy Data," Hadley Wickham riffs on Tolstoy: "Like families, tidy datasets are all alike but every messy dataset is messy in its own way." When we spend 75% of our "analysis" time cleaning and preprocessing data, it makes sense to focus on strategies to standardize our data. In this workshop, we will focus on correcting common errors in collected data and (re)structuring datasets to facilitate analysis. We will be using OpenRefine and Python for these tasks; while you don't need to be a Pythonista, you should have some familiarity with Python or other similar scripting languages, as we won't be spending much time on syntax.

Please see the following page for registration details: https://dssg.fas.harvard.edu/event/tidy-data-python-openrefine/

Export

iCal

S	M	T	W	T	F	S
	1	2	3	4	5	6
7	8	9	10	11	12	13
14	15	16	17	18	19	20
21	22	23	24	25	26	27
28	29	30

7ba1562f576848c6110977b6a9d02028

Tidying Data with Python and OpenRefine

Date:

Location:

April 2024

Filter by Type

Filter by Research Technology

About

For Assistance

Programs & Products

Learn More

Research Resources

Our Impact

b80e00feb9581d080cf21481e3ae267d

752591ecf802a24a30b0cf2dfb1d91c6

fd88e8821a7292374fa97567160ce79d

9213fe68871fba0e5a76964151c63a58