A Data Carpentry Workshop

National Society of Black Engineers Professional Development Conference

August 23, 2019

8:00 am - 5:00 pm

Instructors: Donna Wrublewski, Saranya Canchi, Ye Li

General Information

Data Carpentry workshops are for any researcher or professional who has data they want to analyze, and who wants to learn more about tools and techniques for working with data.

This hands-on workshop will cover how to work more effectively with data and will introduce programming in R. By the end of the workshop, participants should be able to load data, conduct simple analyses, and produce basic visualizations. They will also be aware of best practices for data management to enable reproducibility.

No prior computational experience is required. Participants should bring their laptops and plan to participate actively.

Who: Participants in the National Society of Black Engineers Professional Development Conference who want to learn more about (and get hands-on experience with!) data management practices and tools. To attend, please register through the Eventbrite link above or contact the organizer.

Where: 8757 Rio San Diego Dr, San Diego, CA 92108. Get directions with OpenStreetMap or Google Maps.

Requirements: Participants must bring a laptop with a Mac, Linux, or Windows operating system (not a tablet, Chromebook, etc.) that they have administrative privileges on. They should have a few specific software packages installed (listed below). They are also required to abide by Data Carpentry's Code of Conduct.

Contact: Please email yel@mit.edu for more information.


Preparation

Surveys

Please be sure to complete this survey before the workshop.

Pre-workshop Survey

Setup

To participate in a Data Carpentry workshop, you will need working copies of the software described in the Setup Instructions. Please install everything (or at least download the installers) before the start of your workshop. Bring and use your own laptop to ensure that your tools are set up properly for an efficient workflow once you leave the workshop.

Please follow these Setup Instructions.

We maintain a list of common installation issues on the Configuration Problems and Solutions wiki page. It is written as a reference for instructors, but participants may find it useful as well.


Resources and Materials

Etherpad: https://pad.carpentries.org/dc-nsbe-20190823.
We will use this Etherpad for chatting, taking notes, and sharing URLs and bits of code.

Schedule

Time Subject
8:00a - 8:30a Software Installation Help
8:30a - 9:00a Introduction
9:00a - 10:00a Data Organization in Spreadsheets
10:00a - 10:15a Break
10:15a - 11:00a Data Cleaning in OpenRefine
11:00a - 11:15a Break
11:15a - 12:00p Introduction to R and RStudio [Script from workshop]
12:00p - 1:00p Lunch
1:00p - 2:45p Data Structures & Exploring Data Frames [Script from workshop]
2:45p - 3:00p Break
3:00p - 4:30p Creating Publication Quality Graphics [Script from workshop]
4:30p - 5:00p Workshop Wrap-Up (Post-workshop Survey)

Syllabus

We will be using material from these Data Carpentry lessons:

Lesson Website
Data Organization in Spreadsheets Spreadsheet Materials
Data Cleaning with OpenRefine OpenRefine Materials
Data Analysis and Visualization in R R Materials
R for Reproducible Scientific Analysis R Materials