Course Modules

This course is broken up into eleven modules that cover the gamut of data-driven storytelling. Each module contains different kinds of learning materials, including brief videos to introduce concepts, external resources that elaborate on those concepts, and tutorials and activities that allow you to apply skills associated with those concepts. The course also includes different data challenges that allow you to demonstrate your mastery of the course content.

Each module contains roughly three to four hours of learning materials, with the activities and challenges adding to that total. While it is possible to consume several modules within a single week, I would discourage you from doing so. Take your time and let the material sink in.

While the modules are designed to be consumed in a linear fashion, this isn't necessary. You're welcome to remix the curriculum to suit your needs as the material also contains several refreshers for earlier content. You're also encouraged to go off-script and tackle on datasets and challenges not included in this course.


Module 1: Data in Journalism

This module introduces journalistic data-driven storytelling, clarifying what is unique about it and what the process of producing such stories typically looks like. This serves as a brief foundation for what is to come over the next ten modules.

Learning Materials


Module 2: Introduction to R and RStudio

This module covers different kinds of data you might encounter as well as two key tools used by data journalists to analyze data: the R programming language and the RStudio development environment.

Learning Materials


Module 3: Evaluating Sources and Exploratory Analysis

This module introduces best practices for how to find datasets and ascertain whether they come from trustworthy sources. You'll also begin to analyze data using descriptive statistical methods. This module also contains your first data challenge, where you can begin to put the skills you're developing into practice.

Learning Materials


Module 4: Exploratory Visualization and Calculations

This module introduces more statistical concepts to ensure that you are performing analyses that fit your data, and that those analyses are not over-generalized. You'll also begin to explore your dataset in a visual fashion, which can help you identify values of interest that you might otherwise miss.

Learning Materials


Module 5: Data Extraction and Cleaning

This module covers tools and methods used to extract structured data from non-structured files and documents. Unfortunately, this process is often imperfect (and even structured files might come with mistakes), so you'll begin to "clean" those data to remove mistakes that might impact your analysis. You'll also take on your second data challenge at this point, ensuring that you can translate a basic analysis of a dataset into a news brief.

Learning Materials


Module 6: Story Ideas and Interviewing Data

This module focuses on one approach to data-driven storytelling: Starting with data and generating interesting, comprehensive story ideas from them. You'll also begin to apply more advanced grouping and data transformation functions in your data analysis.

Learning Materials


Module 7: Humans in Data-Driven Storytelling

This module focuses on how to interview human sources and integrate them into a data-driven story. The module also highlights the advantages of human sources, and how they can be incorporated alongside quantitative information within a news story.

Learning Materials


Module 8: Telling Stories

This module covers the elements of a story story, including the development of a compelling lead paragraph, the translation of complex quantitative information into something more easily digested by a general audience, and different approaches to writing with numbers. You'll also have your third data challenge alongside this module, and begin to write your first full-length data-driven story.

Learning Materials


Module 9: The Visual in Data-Driven Storytelling

This module introduces basic design principles and how to select the chart that best conveys an important observation from your data analysis. The module also introduces ideas for how to assess the effectiveness of existing data visualizations and points to different visual style guides used in the field.

Learning Materials


Module 10: Creating Uni- and Bi-Variate Charts

This module covers how to produce simple, interactive charts that are useful for conveying observations from descriptive statistical analyses. It also shows how you can create a properly formatted CSV file that can be imported into popular data visualization tools.

Learning Materials


Module 11: Cartography and Journalism

This module introduces basic principles for designing maps and visualizing spatial phenomena. The module also covers the creation of two different types of maps: choropleth and symbol maps. Finally, you'll be completing your final data challenge, which requires you to produce two interactive data visualizations, in this module.

Learning Materials