For this Data Challenge, you will be analyzing data about journalists killed while performing their jobs, which was collected by the Committee to Protect Journalists. CPJ makes some methodological details about the data available here. These data only contain killings where the motive was confirmed and only covers journalists (not all media workers).

For each question below, try to describe (a) your logic for how you intend to answer the question, (b) the code itself, and (c) a final answer. By following that process, you’ll increase the likelihood that the material will “stick” because you’re expressing things both conceptually and technically.

You can download a copy of this notebook (to make getting started easier) by clicking here.

Load Data

library(tidyverse)
journalists_killed <- read_csv("http://projects.rodrigozamith.com/datastorytelling/data/cpj_journalists_killed.csv")

Question 1

Do you trust this data source? Why or why not? (Please offer your best evaluation of the data source.)

Answer

Put your answer here.

Question 2

Do you see any ethical issues with the way these data were collected? Are there any ethical considerations you believe a journalist should be mindful of when writing a story with these data and/or publishing these data?

Answer

Here.

Question 3

What was the deadliest year for journalists? How many journalists died that year?

Rationale

Describe your rationale here.

Code

# Insert your code here

Answer

Put your answer here.

Question 4

What was the deadliest country for journalists? How many journalists died in that country?

Rationale

Describe your rationale here.

Code

# Insert your code here

Answer

Put your answer here.

Question 5

Were the journalists who died in Iraq primarily freelancers, staff reporters, or something else?

Rationale

Describe your rationale here.

Code

# Insert your code here

Answer

Put your answer here.

Question 6

What proportion of reporters killed in Iraq in 2006 were male?

Rationale

Describe your rationale here.

Code

# Insert your code here

Answer

Put your answer here.

Question 7

How many American journalists have been killed since 2005?

Rationale

Describe your rationale here.

Code

# Insert your code here

Answer

Put your answer here.

Question 8

Please produce the output (data) of a data frame that contains information about (a) the year of death, (b) the journalist’s name, (c) the type of death, and (d) the organizations associated with the journalist for each journalist slayed between 2010 and 2016 in Syria.

Rationale

Describe your rationale here.

Code

# Insert your code here

Answer

See the above table.

Question 9

What was the mean number of journalists killed in the Philippines between 2008 and 2014? What was the median? Which of those numbers best represents the average number of journalists killed during that time period?

Rationale

Describe your rationale here.

Code

# Insert your code here

Answer

Put your answer here.

Question 10

Please perform some original data analysis using this dataset and produce a news brief (1-2 paragraphs) from that information. (Include all the necessary code below.)

Code

# Insert your code here

Answer

Put your brief here.