This course will become read-only in the near future. Tell us at community.p2pu.org if that is a problem.

Finding the Data [Aug. 21, 2012, 10:23 a.m.]



The advantage of reverse-engineering a news story in this way is that good newspapers *should* cite the source of their underlying data.

In this case, the Guardian has a link to 'Get the Data' embedded in their article: http://www.guardian.co.uk/news/datablog/2011/jan/31/world-carbon-dioxide-emissions-country-data-co2#zoomed-picture

Go and take a look!

Some points to remember:

1) It's good practice to go to the source rather than simply using the direct download of the spreadsheet. You might well spot another useful data set, some extra metadata, or something else that gives you a better sense of the data's context.

2) Check the license!

3) Look carefully for metadata. Make a note of anything that seems significant.

If you follow through the links on this particular article, you end up at http://www.eia.gov/cfapps/ipdbproject/IEDIndex3.cfm?tid=90&pid=44&aid=8 - the US Energy Information Administration.

Take a moment to explore the page.

- Note especially the license (definitely open! see 'Policies > Copyright & Reuse' at the bottom of the page, which also tells you *how* to accredit the data).

- Now that you're happy you're working with open data, have a play! The website allows you to explore different indicators, different years, different regions, different units. Find anything particularly interesting? Share it below.

Once you've explored, return to the data set we will be using for this challenge. This is the Total Carbon Dioxide Emissions from the Consumption of Energy (Million Metric Tons) for all regions from 2006-2010. 

Download the data!