How to Cite Data: Key Components

Subject guide for citing Social Science data

Why Cite Data?

When you collect your own data, citing their location makes it possible for others to find them and extend your research, raising your profile as a researcher. ICPSR provides a good overview of the importance of data citation:

"Citing data files in publications based on those data is important for several reasons:

  • Other researchers may want to replicate research findings and need the bibliographic information provided in citations to identify and locate the referenced data.
  • Citations appearing in publication references are harvested by key electronic social sciences indexes, such as Web of Science, providing credit to the researchers.
  • Data producers, funding agencies, and others can track citations to specific collections to determine types and levels of usage, thus measuring impact."

If you're using data you didn't gather yourself, citing your source is just as important as citing your other research sources. For other scholars to be able to examine and extend your work, they must be able to find the original data.

Consequently, although most style guides do not include examples for citing data, consider the key components and other elements below and work them into the citation style you're using.

Key Components of a Data Citation




Author

The original researcher(s) who collected the data

Study name/Title

What did the original researcher call it?


Producer

The organization that sponsored the research, usually the author's institution. This takes the place of a publisher in an ordinary citation, so be prepared to list the place of publication as well. It may be useful to add a designation like "[producer]" if it is not actually a publisher.

Year Data Produced

When did the Producer first release the data? Treat this like the publication date.

Other Possible Elements



Unique Identifier, like a Digital Object Identifier (DOI)

If you got the data from a repository like ICPSR, note the repository's unique identifier as part of the title. If the data file has a DOI, include it as you would a URL for a web site.


Distributor

The organization that makes the data available. From what organization did you get it? If directly from the author, listing the author's institution/organization once (as the publisher) is sufficient. However, if the distributor is different from the producer, it's important to list it separately; it may be useful to add a designation like "[distributor]" to clarify its role.

Year Data Collected

When did the original researcher collect the data? You may choose how specific to be: it may be enough to list the years, or you may want to provide more specific date ranges if subsequent users would need to know the periodicity (months, weeks, days, etc.).
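
For readers who manage references with a script, the components above can be assembled programmatically. The following is a minimal Python sketch, not a prescribed citation style: the class name DataCitation, its field names, the ordering and punctuation of the output, and every value in the example (author, title, study number, DOI) are hypothetical placeholders chosen only for illustration. Adapt the layout to whatever style guide you are following.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class DataCitation:
    """Hypothetical container for the components described in this guide."""
    # Key components
    author: str
    title: str
    producer: str
    producer_place: str
    year_produced: int
    # Other possible elements (all optional)
    identifier: Optional[str] = None       # repository's unique identifier
    doi: Optional[str] = None              # DOI without the resolver prefix
    distributor: Optional[str] = None
    years_collected: Optional[str] = None  # e.g. "2012-2014"

    def format(self) -> str:
        """Join the components into one citation string (assumed layout)."""
        title = self.title
        if self.years_collected:
            title += f", {self.years_collected}"
        if self.identifier:
            # The repository identifier is noted as part of the title.
            title += f" ({self.identifier})"
        parts = [
            f"{self.author} ({self.year_produced}).",
            f"{title}.",
            f"{self.producer_place}: {self.producer} [producer].",
        ]
        # List the distributor separately only when it differs from the producer.
        if self.distributor and self.distributor != self.producer:
            parts.append(f"{self.distributor} [distributor].")
        if self.doi:
            # Include the DOI as you would a URL.
            parts.append(f"https://doi.org/{self.doi}")
        return " ".join(parts)


# All values below are made-up placeholders, not a real study.
citation = DataCitation(
    author="Doe, Jane",
    title="Example Household Survey",
    producer="Example University",
    producer_place="Example City",
    year_produced=2015,
    identifier="Study 00000",
    doi="10.0000/example",
    distributor="Example Data Archive",
    years_collected="2012-2014",
)
print(citation.format())
```

Leaving out the optional fields produces a shorter citation containing only the key components, which mirrors the guide's distinction between required and additional elements.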

Your Librarian

Sheila Devaney
Research & Instruction Department
Main Library
University of Georgia
912-268-0539 (For Spring 2021 please use my Google Voice number)