Citing datasets is just as important as citing journal articles, books, and other sources that contributed to your research.
By citing your use of a dataset, you are supporting the reproducibility of your research and attributing credit to those who provided the data-including datasets that you have created yourself. Citations also allow for tracking reuse and measuring impact.
Citation styles do not consistently provide examples for dataset citations. Many data providers also recommend their preferred citation or supply an example.
from: ICPSR and APA Style Guide
Double check which citation style your professor requires!
Author: Name(s) of each individual or organizational entity responsible for the creation of the dataset.
Date of Publication: Year the dataset was published or disseminated.
Title: Complete title of the dataset, including the edition or version number, if applicable.
Publisher and/or Distributor: Organizational entity that makes the dataset available by archiving, producing, publishing, and/or distributing the dataset.
Electronic Location or Identifier: Web address or unique, persistent, global identifier used to locate the dataset (such as a DOI). Append the date retrieved if the title and locator are not specific to the exact instance of the data you used.
These are the minimum elements required for dataset identification and retrieval. Fewer or additional elements may be requested by author guidelines or style manuals. Be sure to include as many elements as needed to precisely identify the dataset you have used.
from: ICPSR
Author(s) (year). Title. [Data set]. Publisher. URL
Example:
O’Donohue, W. (2017). Content analysis of undergraduate psychology textbooks (ICPSR 21600; Version V1). [Data set]. ICPSR. https://doi.org/10.3886/ICPSR36966.v1
The Chicago Manual of Style does not directly address how to cite datasets. However, the basic Chicago formatting for a source is as follows:
Author. Title of Work. Format. Date of creation or completion. Medium. Name of Institution. URL (accessed month dd, yyyy).
Example
O’Donohue, William. Table. 2017. Content analysis of undergraduate psychology textbooks (ICPSR 21600; Version V1). csv. ICPSR. https://doi.org/10.3886/ICPSR36966.v1 (accessed August 21, 2023).
Data Cite is a global network of dataset researchers, whose goal, as stated on their website, is “to help make data more accessible and more useful; our purpose is to develop and support methods to locate, identify and cite data and other research objects.” They recommend the following when citing a dataset:
Creator (PublicationYear). Title. Version. Publisher [or Distributor]. (ResourceType.) Identifier
Example:
Organisation for Economic Co-operation and Development (OECD) (2018-04-06). Main Economic Indicators (MEI): Finance | Country: Argentina | Indicator ID: CCUS, 01/1959 - 12/2017. Data Planet™ Statistical Datasets: A SAGE Publishing Resource. (dataset). Dataset-ID: 062-003-004. https://doi.org/10.6068/DP163F9ED671E6
The Institute for Electrical and Electronics Engineers (IEEE) is a professional organization supporting many branches of engineering, computer science, and information technology. In addition to publishing journals, magazines, and conference proceedings, IEEE also makes many standards for a wide variety of industries.
IEEE citation style includes in-text citations, numbered in square brackets, which refer to the full citation listed in the reference list at the end of the paper. The reference list is organized numerically, not alphabetically. For examples, see the IEEE Editorial Style Manual and the IEEE Citation Guidelines.
[number of reference] Author, Title. Location of publisher: publisher, date. [format]. Available: URL [accessed month dd, yyyy]
Example:
[1] Hoen, B.D., Diffendorfer, J.E., Rand, J.T., Kramer, L.A., Garrity, C.P., Hunt, H.E., United States Wind Turbine Database. U.S. Geological Survey, 2018. [JSON]. Available: https://eerscmap.usgs.gov/arcgis/rest/services/uswtdb/uswtdbTiled/MapServer [accessed Aug. 22, 2023]
The MLA Handbook does not directly mention datasets. However, the basic MLA formatting is as follows:
Author. Title of Dataset (including date range of dataset). Publisher, Publication Date. Database Name, DOI.
Example:
Energy Information Administration. Retail Gasoline Prices: Retail Gasoline Prices - All Grades, 08/20/1990 - 05/30/2016. Data Planet Statistical Datasets: A SAGE Publishing Resource, 17 Sept. 2017. https://doi.org/10.6068/DP15E5374E97A17.