Jan. 30, 2023

PUL Love Data Week 2023 in a heart

Explore One of the World's Largest Microdata Collections
Monday, February 13; 10am-11am
Firestone Library, Room B-6-F                        
Since 1964, Data and Statistical Services (DSS) has been collecting data to build one of the world’s largest microdata collections. Highlights include data on economics, education, elections, environment, ethnicity and race, finance, health, labor, public opinion, real estate, and the governments of India and Palestine. DSS also provides support for social science methods and statistical software for R/R Studio, Stata, Matlab, Python (for text mining and social science applications), SAS, and SPSS. Presented by Bobray Bordelon.
                    
Creating an Orthorectified Mosaic Image from Historical Aerial Photographs
Tuesday, February 14; 4pm-5pm
Engineering Library, Fine Visualization Lab
Historical aerial photographs are very useful data for many researchers. They can provide an excellent visual picture of an area to understand the changing environment of a landscape. This presentation will describe the workflow of scanning and creating an orthorectified mosaic image from historical aerial photographs and describe what sort of tools we hope to develop to extract data from the aerial photographs. Presented by Tsering W. Shawa.
                                            
Douglass Day Transcribe-a-thon
Tuesday, February 14; 12pm-3pm 
Firestone Library, Room A-6-F 
Douglass Day is an annual, nationwide program that marks the birth of Frederick Douglass and strives to create new & freely available resources for learning about Black history during a "Transcribe-a-thon" - a crowdsourced transcription event. This year, we will use Zooniverse to access and transcribe the papers of Mary Ann Shadd Cary, one of the earliest Black women to edit a newspaper, serve as a Civil War recruiter, and attend law school. The resulting datasets will allow her papers to be more discoverable and accessible by communities all over the world.    

Data Visualization with Tableau                
Wednesday, February 15; 2pm-3:30pm
Stokes Library, Stokes Viz Hub (Room #070 Wallace Hall)
This 90-minute workshop offers a gentle introduction to data visualization with Tableau. Attendees will learn how to connect to and edit data sources; use the Tableau workspace to create data visualizations; build a range of chart types; create quick calculations; and build an interactive dashboard to present data insight. No prior programming knowledge is needed. Presented by Ofira Schwartz-Soicher.

Introduction to Data Analysis with R
Wednesday, February 15, 10am-12pm
Julis Romo Rabinowitz Building, Room 217            
This session is a hands-on introduction to data analysis using the R programming language, aimed at people who have never used R or RStudio before. This workshop is ideal for those who are at the initial stages of doing independent research requiring quantitative analysis. Presented by Oscar Torres-Reyna, Head of Economics Statistical Services.

Big Data Analytics and Machine Learning with Spark
Wednesday, February 15, 4:30pm-6pm
Lewis Library, Room 120
This workshop will show participants how to get started with analyzing big data and training machine learning models on big data using Apache Spark on the Research Computing clusters. Spark is a scalable, general-purpose big data processing engine that supports multiple frontend languages including Python and R. Presented by Jonathan Halverson, Research Software and Computing Training Lead.