1 / 23

Tracking and Managing Citations: Data Centers and Best Practices

Tracking and Managing Citations: Data Centers and Best Practices. W. Christopher Lenhardt CIESIN – Columbia University 25 October 2006 – CODATA 2006. Outline. Summarize the challenges Why do data citations matter Summarize CIESIN experience Related efforts

julius
Download Presentation

Tracking and Managing Citations: Data Centers and Best Practices

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Tracking and Managing Citations: Data Centers and Best Practices W. Christopher Lenhardt CIESIN – Columbia University 25 October 2006 – CODATA 2006

  2. Outline • Summarize the challenges • Why do data citations matter • Summarize CIESIN experience • Related efforts • Summary of potential best practices • Additional thoughts

  3. Challenges • Citing digital data • Bits are still ephemeral • Standardization still in progress • Sociology of science • How to get credit for publishing data • Rapidly changing technology • Issue of How (theory) versus Doing (practice)

  4. Related Issues • Data quality • Facilitate usage • Attribution • Provenance/authenticity

  5. Address the problem from a different angle • Potential contribution of data centers • Contribute to standards development • Develop and promote best practices

  6. What do you need? • Some policies • Some procedures/operational practices • Some content

  7. Potentially Relevant Policies • Data quality policy (and procedure) • Information quality policy (and procedure) • Responsible use

  8. Quality Review and Documentation • What kinds of data and information • Quality review and documentation • Making quality information available to end-users

  9. Responsible Use • Data providers have certain legal and ethical responsibilities related to data stewardship and dissemination • Opportunity to remind users about issues such as attribution and confidentiality • Can be a link • Could pop up prior to a download • http://www.icpsr.umich.edu/org/policies/respuse.html

  10. Operational Practices • Quality review and documentation • Recommended citations • Technical publications about data • Citation style guides

  11. Provide recommended citations • Essential reminder/aid to facilitate citation • Can be non-trivial depending on things like collections versus subsets • Helpful to users to add a “download to a citation manager link”

  12. Collect Citation Information • Gives an indication of usage and quality • Provides a reminder to users to cite data in their research and publications • Ideally do this for all your data, but may be valuable for flagship data products • Potential for automation? • Pull and push

  13. Generate or Reference [Peer-reviewed] Publications or Technical Notes About the Data

  14. Provide Access to or Develop a Citation ‘Style Guide’ • http://sedac.ciesin.columbia.edu/citations

  15. Related Activities

  16. Work at Harvard/MIT • http://gking.harvard.edu/files/cite.pdf

  17. IASSIST http://iassistblog.org/?cat=17 • Review of styles • Subgroup working on the issue • Blog http://www.iassistdata.org/

  18. Stats Canada • Gaeton Drolet – Univ of Quebec Laval http://www.statcan.ca/english/freepub/12-591-XIE/12-591-XIE2006001.htm

  19. Summary of Potential Best Practices • Provide a recommended citation • Provide access to guides on citation • Encourage responsible use • Publish about data in peer reviewed literature • Collect citations to the data from other researchers and users

  20. Additional Challenges • Downloads of whole data sets versus subsets of data • Composite data sets • Collections • Aggregations • Resources may be limited; Can you do this for all of your holdings? • It may not make sense to develop your own style guide, may be more efficacious to utilize a pre-existing guide • Location and naming – for citations to be useful, the location must be stable • URNs/DOIs etc.

  21. To Address the Larger Challenge Need to Involve • Funders • Publishers • Professional associations • Creators of data • Other data centers

  22. Should we treat data more like a traditional publication? • Research data is messy • Persistence: Are data sets analogous to books? • Do we need unique identifiers and/or catalog numbers for data sets? • ISBN v. catalog number

  23. Thanks…

More Related