Instructors:
Cody Hennesy (UCB), Jose Niño Muriel (UCSB), Jamie Jamison (UCLA), Tim Dennis (UCLA), Seth Erickson (UCSB), Ryan Horne (UCLA), Derek Devnich (UCM), Reid Otsuji (UCSD)
Helpers:
Dave George (UCLA), Kristi Liu (UCSB), Geno Sanchez (UCLA), Renee Hui Xin Ng (UCSD), Hannah Sutherland (UCLA), Geoffery Boushey (UCSF), Kat Koziar (Fresno State), David Palmquist (Fullerton)
General Information
Registration: Please register in Zoom before attending the workshop.
The Carpentries project comprises the Software Carpentry, Data Carpentry, and
Library Carpentry communities of Instructors, Trainers, Maintainers,
helpers, and supporters who share a mission to teach foundational computational and data science
skills to researchers.
Want to learn more and stay engaged with The Carpentries? Carpentries Clippings is The Carpentries' biweekly newsletter, where we share community news, community job postings, and more.
Sign up to receive future editions and read our full archive: https://carpentries.org/newsletter/
Library Carpentry
is made by people working in library- and information-related roles to help you:
automate repetitive, boring, error-prone tasks
create, maintain and analyze sustainable and reusable data
Who:
The course is for people working in library- and information-related roles.
You don't need to have any previous knowledge of the tools that
will be presented at the workshop.
Where: This training will take place online.
The instructors will provide you with the information you will need to connect to this meeting.
Requirements:
Participants must have access to a computer with a
Mac, Linux, or Windows operating system (not a tablet, Chromebook, etc.) that they have administrative privileges on.
They should have a few specific software packages installed (listed below).
Accessibility:
We are committed to making this workshop
accessible to everybody.
We are dedicated to providing a positive and accessible learning environment for all.
We do not require participants to provide documentation of disabilities or disclose any unnecessary personal information.
However, we do want to help create an inclusive, accessible experience for all participants.
We encourage you to share any information that would be helpful to make your Carpentries experience accessible.
To request accessibility support for this workshop, please fill out the
accessibility support request form.
If you have questions or need assistance with the accessibility support form please email us.
Glosario is a multilingual glossary
for computing and data science terms. The glossary helps
learners attend workshops and use our lessons to make sense of computational and programming jargon written in English by offering it
in their native language. Translating data science terms also provides a teaching tool for Carpentries Instructors to reduce barriers
for their learners.
Workshop Recordings:
Carpentries workshops are designed to be interactive rather than lecture-based, with lessons that build upon one another.
To foster a positive online learning environment, we strongly recommend that participants join in real time.
As a result, workshop recordings are not recommended and may not be available to learners.
Roles:
To learn more about the roles at the workshop (who will be doing what),
refer to our Workshop FAQ.
Code of Conduct
Everyone who participates in Carpentries activities is required to conform to the Code of Conduct. This document also outlines how to report an incident if needed.
Collaborative Notes
We will use this collaborative document for chatting, taking notes, and sharing URLs and bits of code.
Surveys
Please be sure to complete these surveys before and after the workshop.
Mid-Workshop Break: No sessions Thursday, May 14 – Sunday, May 17.
Day 4: Monday, May 18, OpenRefine
09:00
Introduction to OpenRefine
09:30
Importing & Exploring Data
10:15
Faceting and Filtering
10:30
Break
10:45
Clustering & Working with Columns
11:30
Transformations (GREL) & Exporting
12:00
Wrap-up
Day 5: Tuesday, May 19, Python (Part 1)
09:00
Python: Variables & Types
09:45
Lists & Built-in Functions
10:30
Break
10:45
Libraries & Introduction to Pandas
11:30
For Loops & Looping Over Datasets
12:00
Wrap-up
Day 6: Wednesday, May 20, Python (Part 2)
09:00
Advanced Pandas & Filtering Data
10:00
Conditionals & Writing Functions
10:30
Break
10:45
Data Visualization with Python
11:30
Post-workshop Survey & Wrap-up
12:00
End of Workshop
Setup
To participate in a
Library Carpentry
workshop,
you will need access to software as described below.
In addition, you will need an up-to-date web browser.
If you haven't used Zoom before, go to the
official website
to download and install the Zoom client for your computer.
Set up your workspace
Like other Carpentries workshops,
you will be learning by "coding along" with the Instructors.
To do this, you will need to have both the window for the tool
you will be learning about (a terminal, RStudio, your web browser, etc..)
and the window for the Zoom video conference client open.
In order to see both at once,
we recommend using one of the following set up options:
Two monitors: If you have two monitors,
plan to have the tool you are learning up on one monitor and
the video conferencing software on the other.
Two devices: If you don't have two monitors,
do you have another device (tablet, smartphone) with a medium to large
sized screen? If so, try using the smaller device as your video
conference connection and your larger device (laptop or desktop)
to follow along with the tool you will be learning about.
Divide your screen: If you only have one device
and one screen, practice having two windows
(the video conference program and one of the tools you will be using
at the workshop) open together.
How can you best fit both on your screen?
Will it work better for you to toggle between them
using a keyboard shortcut?
Try it out in advance to decide what will work best for you.
This blog post includes detailed information on how to set up your screen to follow along during the workshop.
Spreadsheet Software
To interact with spreadsheets in this lesson, we will use
Microsoft Excel. If you don't have access to Excel, we recommend using Google Sheets. Google Sheets is free with a (free) Google or Gmail account. You can sign in at Google Workspace.
There are other free spreadsheet tools that you can use, though we may not have time to answer questions during the workshop about differences between all of these:
LibreOffice,
Gnumeric,
OpenOffice.org.
Commands may differ a bit between programs, but general ideas for thinking about spreadsheets is the same.
The Bash Shell
Bash is a command-line tool that lets you work with files and folders by typing commands.
For this lesson, you will need a Unix-like shell environment and the lesson data.
Please complete the setup below before the workshop. If you get stuck, stop and we will help you during the workshop.
Unzip or extract the file. You should end up with a folder on your Desktop called shell-lesson.
Test your setup
Open your shell and run:
ls
pwd
If you can type commands and see output, your shell is ready. During the workshop, we will use the shell-lesson folder on your Desktop.
If something goes wrong
If you are on Windows, make sure you are opening Git Bash, not Command Prompt or PowerShell.
If you cannot find the data, check that shell-lesson is on your Desktop.
If you get stuck, stop and we will help you during the workshop.
OpenRefine
For this lesson you will use OpenRefine, a tool for cleaning and organizing data.
It runs on your computer and opens in your web browser (no internet connection needed after launch).
Please install OpenRefine and confirm it opens before the workshop.
Save it somewhere easy to find, such as your Desktop or Downloads folder.
During the workshop, we will open this file in OpenRefine.
Test your setup
OpenRefine should open in your browser and show a page with options like
"Create Project" and "Open Project".
If something goes wrong
If nothing opens, go to http://127.0.0.1:3333 in your browser.
If you see a security warning on Mac, use Control-click → Open.
If you get stuck, stop and we will help you during the workshop.
Git
Git is a version control system that helps you track changes to files and collaborate with others.
In this workshop, we will also use GitHub to share code.
Please make sure Git is installed and that you can sign in to GitHub before the workshop.
Create a GitHub account
You will need a free GitHub account for this lesson.
If you already have one, please make sure you can sign in before the workshop.