Data Expeditions
This repository contains datasets developed through the Data Expeditions program, sponsored by the Information Initiative at Duke (iiD). Teams of graduate students assembled the datasets for Duke faculty to use in their courses. Each dataset comes with suggested questions suited to both exploratory data analysis and advanced mathematical or statistical modeling, and the topics span multiple disciplines. In collaboration with the Duke Social Science Research Institute and the Duke Library, iiD makes the Expeditions datasets available in this repository so that many Duke faculty and students can use these resources for learning quantitative methods.
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
Recent Submissions
- ENV 350S / PUBPOL 280S Seminar in Marine Conservation Leadership
  (2016) Duke PhD student Stephanie Stefanski recently taught a class focused on the process of designing, implementing, and analyzing the results from an economic valuation survey. The class was given as a module to inform the broader ...
- STA 112, Data Science, Statcast
  (2016-12-12) In this Data Exploration, students were introduced to the baseball dataset Statcast, downloaded from baseballsavant.mlb.com, which included every pitch thrown in the first week of the 2016 season, with 21 characteristics. The ...
- Math 412 - Topology with Applications
  (2016-06-24) Highlights of Data Expedition:
  • Students explored daily observations of local climate data spanning the past 35 years.
  • Topological Data Analysis, or TDA for short, provides cutting-edge tools for studying the geometry ...
- North Carolina Traffic Stops
  (2014)
- Major League Baseball and National Basketball Association regular season data by team
  (2014) With the rise of sports statistics, especially sabermetrics in baseball, statistics have proven crucial not only for managing teams and assessing player value, but also for forecasting team and individual performance. In ...
- Exploring lemur olfactory communication
  (2015-11-30) In Fall 2015, we (Kendra Smyth & Lydia Greene) led a Data Expeditions (DE) workshop in Advanced Research in Evolutionary Anthropology, a senior-level class on the research process. The goal of the workshop was to get students ...
- Math 412: Music + Topology
  (2014) In this mini assignment you will explore an application of "sliding windows and persistence" on time series data (see Jose Perea's paper for more theory). Specifically, you will look at how to transform musical audio data ...
- 2015 Call for Proposals
  (2015)
- Signal, noise, and bias in yeast MNase-seq data
  (2014) This is an optional challenge for students interested in applying what we have learned in class to a real computational genomics research problem; practicing the skills of using Python or R (or any other tool you wish) to ...
- 2014 Data Expeditions Call for Proposals
  (2015-11-30)