The data sets available here are organized alphabetically and by subject.

Citing Data

When writing a paper or giving a presentation, always cite the data sets that you have referenced or produced. Doing so gives the rightsholder credit and allows your audience to access the data for their own purposes.

To cite raw data, include the following:

  • Rightsholder's name 
  • Year of release/publication
  • Title of the data set
  • Description of form
  • Where you retrieved the information

Rightsholder, A. A. (Year). Title of data set [Description of form]. Retrieved from http://xxxxxxxx


The General Social Survey. (2017). 1972-2016 Cross-sectional cumulative data [Data file and code book]. Retrieved from
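The five elements listed above slot into the template in a fixed order, which a short Python sketch can make concrete. This is only an illustration: the function name `format_data_citation` and its parameter names are hypothetical, and `http://xxxxxxxx` is the placeholder address from the template, not a real location.

```python
def format_data_citation(rightsholder, year, title, form, url):
    """Assemble a data set citation from the five required elements.

    Hypothetical helper for illustration; it only joins the pieces
    in the order the template gives them.
    """
    # Avoid a doubled period when the name already ends in one,
    # as in "Rightsholder, A. A."
    name = rightsholder.rstrip(".") + "."
    # The description of form goes in brackets right after the title,
    # with no period between the two.
    return f"{name} ({year}). {title} [{form}]. Retrieved from {url}"


# Example call using the General Social Survey entry and the
# template's placeholder address.
print(
    format_data_citation(
        "The General Social Survey",
        2017,
        "1972-2016 Cross-sectional cumulative data",
        "Data file and code book",
        "http://xxxxxxxx",
    )
)
```

Keeping each element as a separate argument makes it easy to spot a missing piece before the citation is written out.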


  • Do not italicize the names of software, programs, or languages. 
  • Do not italicize the title of a data set.
  • If an individual has proprietary rights to the software, name him or her as the author; otherwise, treat such references as unauthored works. 
  • In parentheses immediately after the title, identify the version number, if any. 
  • In brackets, immediately after the title or version number, identify the source as a computer program, language, software, and so forth. Do not use a period between the title and the bracketed material. 
  • Give the location and name of the organization that produced the work, if applicable, in the publisher position. If the program can be downloaded or ordered from the web, give this information in the publisher position.
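The software rules above can be sketched the same way. The example below is a rough illustration under stated assumptions: `format_software_reference` and the sample program, version, and organization are all hypothetical, the publisher is assumed to be a location and organization name, and the year is left out because the notes above do not say where it goes.

```python
def format_software_reference(title, description, publisher,
                              version=None, author=None):
    """Assemble a software reference from the rules listed above.

    Hypothetical helper for illustration only.
    """
    # Version number, if any, goes in parentheses right after the title.
    title_element = title
    if version:
        title_element += f" ({version})"
    # Bracketed description follows with no period before the bracket.
    title_element += f" [{description}]."

    parts = []
    # Name an author only when an individual holds proprietary rights;
    # otherwise the reference is treated as unauthored.
    if author:
        parts.append(author.rstrip(".") + ".")
    parts.append(title_element)
    # Location and organization go in the publisher position.
    parts.append(publisher.rstrip(".") + ".")
    return " ".join(parts)


# Hypothetical unauthored program, used only to show the layout.
print(
    format_software_reference(
        "Example Stats Tool",
        "Computer software",
        "Example City, ST: Example Organization",
        version="Version 2",
    )
)
```

Because nothing in the function adds markup, the program name and title stay in plain, non-italic text, as the first two notes require.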