Research Technology

2020 Sep 23

Dataverse Open Office Hours

Repeats every week every Wednesday until Wed Dec 23 2020 except Wed Nov 11 2020.
11:00am to 1:00pm

Location: 

Online via Zoom

Weekly virtual office hours are open to Harvard researchers and staff to provide support for Dataverse 5.0. Demo of 5.0 will begin promptly at 11am.

Open Hours: Wednesdays, 11AM - 1PM

RSVP required to: support@dataverse.org

For any questions on how to share your data with Dataverse, contact: support@dataverse.org

2020 Sep 11

Structured Data, Databases, and SQL

1:00pm to 4:00pm

Location: 

Online: Zoom

Collecting, analyzing, and managing data is the bread-and-butter of any research project, and standard tools like Microsoft Excel are the go-to apps as they're omnipresent and easy to use. But these start to show their limitations when one needs to handle tens of thousands of rows or merge data from multiple sources. Using a relational database, such as SQLite, can meet this gap and is the logical next step for bigger data projects. 

This class will discuss the fundamentals of structured data, introduce you to using SQLite (a lightweight database available on all most computing platforms), and teach you the basics of querying and summarizing data with SQL. Meeting these objectives could open up new opportunities for research and help you with your research data management goals.

This workshop will be delivered by the HBS Research Computing Services group in partnership with the Data Science Services group at IQSS.

Visit the Harvard Training Portal for details and registration information (requires HarvardKey).

2020 Nov 20

Version Control with Git / GitKraken

1:00pm to 4:00pm

Location: 

Online: Zoom

Version control software allows you to save “versions” of files -- scripts, text files, web pages, data, etc. -- which show the changes that were made to the files over time, and allows you to backtrack if necessary and undo those changes. The ability alone – of being able to compare two versions or reverse changes, makes it fairly invaluable when working on larger projects. Even more so when collaborating in research groups.

This hands-on workshop will take you through the steps of using Git / GitKraken and GitHub, to track changes, revert to older versions, and share your files with other people. Ultimately, to keep you organized, to reduce the clutter, and maintain an intelligible history of files in your projects.

This workshop will be delivered by the HBS Research Computing Services group in partnership with the Data Science Services group at IQSS.

Visit the Harvard Training Portal for details and registration information (requires HarvardKey).

2020 Nov 13

Do Less Work by Using the Unix Shell

1:00pm to 4:00pm

Location: 

Online: Zoom

This hands-on workshop will introduce you to the Unix shell, a power tool that allows people to do complex things with just a few keystrokes, combine existing programs in new ways, and automate repetitive tasks.

The Unix shell (command line) has been around longer than most of its users have been alive. It has survived so long because it’s a power tool that allows people to do complex things with just a few keystrokes. More importantly, it helps them combine existing programs in new ways and automate repetitive tasks so they aren’t typing the same things over and over again. Use of the shell is fundamental to using a wide range of other powerful tools and computing resources (including “high-performance computing” supercomputers). These lessons will start you on a path towards using these resources effectively.

This workshop will be delivered by the HBS Research Computing Services group in partnership with the Data Science Services group at IQSS.

Visit the Harvard Training Portal for details and registration information (requires HarvardKey).

2020 Nov 06

Python Natural Language Processing

1:00pm to 4:00pm

Location: 

Online: Zoom

This hands-on workshop will introduce foundational concepts in natural language processing (NLP) as well as techniques for analyzing text (natural language) data using Python's Natural Language ToolKit (NLTK) library. We will work through an entire basic NLP workflow covering acquiring text corpora from the web, text pre-processing, summary statistics and visualization, and building generative models. 

This workshop will be delivered by the HBS Research Computing Services group in partnership with the Data Science Services group at IQSS.

Visit the Harvard Training Portal for details and registration information (requires HarvardKey).

2020 Sep 18

Stata Introduction

1:00pm to 4:00pm

Location: 

Online: Zoom

This hands-on workshop provides an introduction to Stata, including how to import and manipulate data, as well as calculate descriptive statistics. This workshop is appropriate for those with little or no prior experience with Stata.

This workshop will be delivered by the HBS Research Computing Services group in partnership with the Data Science Services group at IQSS.

Setup instructions and materials: http://bit.ly/dss_statainstall
Class website: http://bit.ly/dss_stataintro

Visit the Harvard Training Portal for details and registration information (requires HarvardKey).

2020 Oct 30

Python Web-Scraping

1:00pm to 4:00pm

Location: 

Online: Zoom

This hands-on workshop will introduce basic techniques for web-scraping using popular Python libraries. This is an intermediate-level, and somewhat challenging, workshop appropriate for those who have been using Python for at least a few months. You should be familar with all of the material in the Python Introduction workshop and have used these skills in your own projects to the point where you are comfortable with them.

This workshop will be delivered by the HBS Research Computing Services group in partnership with the Data Science Services group at IQSS.

Setup instructions and materials: http://bit.ly/dss_pythoninstall
Class website: http://bit.ly/dss_pythonwebscrape

Visit the Harvard Training Portal for details and registration information (requires HarvardKey).

2020 Oct 23

Python Introduction

1:00pm to 4:00pm

Location: 

Online: Zoom

This hands-on workshop introduces the basic elements of Python by working through an example of how to analyze text data. Python is a general purpose programming language commonly used for data cleaning, analysis, visualization, and other applications. Note that the focus of the workshop is on how to use Python rather than why you might want to use Python over other software. This workshop is appropriate for those with little or no prior experience with Python.

This workshop will be delivered by the HBS Research Computing Services group in partnership with the Data Science Services group at IQSS.

Setup instructions and materials: http://bit.ly/dss_pythoninstall
Class website: http://bit.ly/dss_pythonintro

Visit the Harvard Training Portal for details and registration information (requires HarvardKey).

2020 Oct 09

R Graphics (flipped classroom)

1:00pm to 2:30pm

Location: 

Online: Zoom

PLEASE NOTE: This workshop is being delivered in a FLIPPED CLASSROOM format. This means that participants will be responsible for working through the online materials at their own pace IN ADVANCE of the scheduled meeting time. During the scheduled meeting time, the instructor will demonstrate how to complete the example exercises and will be available to answer questions related to the workshop materials. The instructor WILL NOT walk through all the online materials during the scheduled meeting.

This hands-on workshop provides an introduction to the popular ggplot2 R graphics package. It will cover how to create a wide variety of graphical displays in R, using techniques such as layering, mapping variables to aesthetics, working with scales, faceting, and themes. This is an intermediate-level workshop appropriate for those who have been using R for at least a few weeks. You should be familar with all of the material in the R Introduction workshop and have used these skills in your own projects.

This workshop will be delivered by the Data Science Services group at IQSS in partnership with the HBS Research Computing Services group.

Setup instructions and materials: http://bit.ly/dss_rinstall
Class website: http://bit.ly/dss_rgraphics

Visit the Harvard Training Portal for details and registration information (requires HarvardKey).

2020 Oct 16

R Data Wrangling (flipped classroom)

1:00pm to 2:30pm

Location: 

Online: Zoom

PLEASE NOTE: This workshop is being delivered in a FLIPPED CLASSROOM format. This means that participants will be responsible for working through the online materials at their own pace IN ADVANCE of the scheduled meeting time. During the scheduled meeting time, the instructor will demonstrate how to complete the example exercises and will be available to answer questions related to the workshop materials. The instructor WILL NOT walk through all the online materials during the scheduled meeting.

This hands-on workshop will prepare you for dealing with messy data by walking you through a real-life example. This is an intermediate-level workshop appropriate for those who have been using R for at least a few weeks. You should be familar with all of the material in the R Introduction workshop and have used these skills in your own projects.

This workshop will be delivered by the Data Science Services group at IQSS in partnership with the HBS Research Computing Services group.

Setup instructions and materials: http://bit.ly/dss_rinstall
Class website: http://bit.ly/dss_rdatawrangling

Visit the Harvard Training Portal for details and registration information (requires HarvardKey).

... Read more about R Data Wrangling (flipped classroom)

2020 Oct 02

R Regression Models

1:00pm to 4:00pm

Location: 

Online: Zoom

This hands-on workshop will demonstrate how to deploy a variety of statistical procedures using R, including multiple regression, modeling with categorical variables, as well as model diagnostics and comparison. Note that the focus of the workshop is on how to use R to fit models - we do not teach the theory behind the models and assume that you already have a solid background in statistical modeling and want to apply this in R. This is an intermediate-level workshop appropriate for those who have been using R for at least a few weeks. You should be familar with all of the material in the R Introduction workshop and have used these skills in your own projects.

This workshop will be delivered by the HBS Research Computing Services group in partnership with the Data Science Services group at IQSS.

Setup instructions and materials: http://bit.ly/dss_rinstall
Class website: http://bit.ly/dss_rmodels

Visit the Harvard Training Portal for details and registration information (requires HarvardKey).

2020 Sep 25

R Introduction

1:00pm to 4:00pm

Location: 

Online: Zoom

This hands-on workshop will teach you how to use R to import and manipulate data, install and manage packages, conduct basic statistical analyses, and create common graphical displays. Note that the focus of the workshop is on how to use R rather than why you might want to use R over other software. This workshop is appropriate for those with little or no prior experience with R.

This workshop will be delivered by the HBS Research Computing Services group in partnership with the Data Science Services group at IQSS.

Setup instructions and materials: http://bit.ly/dss_rinstall
Class website: http://bit.ly/dss_rintro

Visit the Harvard Training Portal for details and registration information (requires HarvardKey).

2020 Jun 17

Dataverse Community Meeting

Wed - Fri, Jun 17 to Jun 19, 10:00am - 2:00pm

Location: 

Online

#dataverse2020 will be an online event. Read more...

The annual Dataverse Community Meeting is an opportunity to build, grow, and enrich the global community. Like the open-source Dataverse product itself, the activities of the Dataverse Community Meetings are community-driven. Over three days of presentations, workshops, and working group meetings we aim to promote and learn about...

Read more about Dataverse Community Meeting

Pages