Office of Scientific and Technical Information
Scientific and Technical Information Program
April 20-21, 2005
United States Department of Energy
Data Consistency
Patty Simmons
Where Are We?
  241.1
  DTD
  HEDB
  Legacy
  Harvesting
US Department of Energy Office of Scientific and Technical Information
Business Rules
Data Elements
Data Consistency is the Key for all Product Types
System Redesign
Implementation Target Dates
  Discovery: May 6, 2005
  Design/Development: June 29, 2005
  Testing: July 20, 2005
  Data Cleanup: July 22, 2005
  Implementation: Aug 27, 2005
Comments Due: Friday, May 6, 2005
  Do you want to submit test data?
  How much lead time do you need?
241.1 Prototype
  New layout, user friendly
  Basic order not changed
  Required fields still displayed in red
  Selection-specific display of fields
    Selecting specific fields changes the view of the form to display only fields appropriate to that section
    Required fields specific to the selection are displayed in red
  Use of drop-down boxes for consistency
  Full text can now be attached during input
241.1 Prototype cont.
Field Changes
  Publication date now allows a ‘Text Date’
  Acceptable electronic formats have changed to ‘Searchable PDF’ or ‘HTML’ if posted at site, or ‘Searchable PDF’ or ‘Non-Searchable PDF’ if uploading to OSTI
  ‘Other’ moved from Part III to Part I and changed to ‘Related Document Information’
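The format rule above depends on how the full text is delivered, so it could be checked with a small lookup. This is only a sketch: the delivery keys `posted_at_site` and `uploaded_to_osti` and the function name are hypothetical, and only the format names come from the slide.

```python
# Hypothetical keys and function name; only the format names come from the slide.
ALLOWED_FORMATS = {
    "posted_at_site": {"Searchable PDF", "HTML"},
    "uploaded_to_osti": {"Searchable PDF", "Non-Searchable PDF"},
}


def is_acceptable_format(delivery, file_format):
    """Return True if file_format is acceptable for the given delivery method."""
    # An unknown delivery method has no acceptable formats.
    return file_format in ALLOWED_FORMATS.get(delivery, set())
```

For example, `is_acceptable_format("uploaded_to_osti", "HTML")` is false under this rule, because HTML is only listed for documents posted at the site.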
241.1 Prototype cont.
Additional Data Fields
  Work Proposal Number
  Proposal Revision Number
  Work Authorization Number
  Authorization Revision Number
  Digital Object Identifier
241.1 Prototype cont.
Data Fields No Longer in Form
  Software Manual (Use DOE F 241.4)
  SBIR, STTR (Use DOE F 241.3)
  U.S. Dissemination Only
  Contact Information
BATCH PROCESSING
Changing from SGML to XML
  XML very similar to SGML
  DOE going to XML
  XML is a more modern markup language
  More programs available to generate and parse data
  Looks different but does the same thing
  Changes are going to be required anyway
New DTD follows the order of the paper form
  Technically can be changed
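To illustrate the point that more programs are available to parse XML, here is a minimal sketch using Python's standard library. The element names (`records`, `record`, `title`, `report_number`, `site_code`) and the sample values are invented for illustration and are not taken from the actual 241.1 DTD.

```python
import xml.etree.ElementTree as ET

# Made-up sample batch; the real 241.1 DTD defines its own element names.
SAMPLE = """<records>
  <record>
    <title>Example Report</title>
    <report_number>XYZ-0001</report_number>
    <site_code>HEDB</site_code>
  </record>
</records>"""


def parse_records(xml_text):
    """Return one dict per <record>, mapping child tag names to their text."""
    root = ET.fromstring(xml_text)
    return [
        {child.tag: (child.text or "").strip() for child in record}
        for record in root.findall("record")
    ]
```

Calling `parse_records(SAMPLE)` yields a list with one dictionary whose keys are the child element names, in the order they appear in the record.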
HEDB MIGRATION
  FY 2000: HEDB moved to Dublin Core
  FY 2001: HEDB moved to output products
  FY 2005: HEDB moved to ELink
  Sites can make corrections and/or add full text
  Accessed by Site Code
  Dates back to 1975
  2.1 million records moved
  HEDB used as Site Code if Site Code could not be determined
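The fallback rule for historic records (use HEDB when no Site Code could be determined) might look like the sketch below. The function name and the treatment of blank strings as "not determined" are assumptions, not a description of how the migration was actually coded.

```python
FALLBACK_SITE_CODE = "HEDB"  # value stated on the slide


def assign_site_code(determined_code):
    """Return the determined Site Code, or HEDB if none could be determined.

    Assumption: None, empty, and whitespace-only values all count as undetermined.
    """
    code = (determined_code or "").strip()
    return code if code else FALLBACK_SITE_CODE
```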
Site Codes Assigned to Historic Records
Site Codes Assigned to Historic Records cont.
LEGACY DOCUMENTS
  OSTI would like to receive electronic full text documents
  OSTI can provide a listing of bibliographic records that do not have electronic full text for site comparison
  Data fields provided might include:
    Report Number
    Author
    Title
    OSTI ID
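A listing like the one described could be produced as CSV with the four fields named on the slide. This is a sketch only: the dictionary keys, the `full_text` key used to detect missing full text, and the CSV output format are all assumptions.

```python
import csv
import io

# Hypothetical key names for the four fields named on the slide.
LISTING_FIELDS = ["report_number", "author", "title", "osti_id"]


def listing_without_full_text(records):
    """Return CSV text listing the records that have no electronic full text."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer,
        fieldnames=LISTING_FIELDS,
        extrasaction="ignore",  # skip keys that are not part of the listing
        lineterminator="\n",
    )
    writer.writeheader()
    for record in records:
        # Assumption: a missing or empty "full_text" value means no full text.
        if not record.get("full_text"):
            writer.writerow(record)
    return buffer.getvalue()
```

A site could then compare the `report_number` column against its own holdings to see which documents it can supply.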
LEGACY DOCUMENTS cont.
  If OSTI already has electronic full text, we can provide it to the site
  Full text provided by the site will override any other full text
  OSTI ID required for matching of bibliographic record and full text
  System checks for accurate matching of bibliographic record and full text
  Who is scanning? What are you doing?
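The matching and override rules above can be sketched as a lookup keyed on OSTI ID. The data shapes and names here are hypothetical; the slide does not describe how the system's matching check is actually implemented.

```python
def attach_full_text(records, site_files):
    """Attach site-provided full text to bibliographic records by OSTI ID.

    records:    dict mapping OSTI ID -> record dict (may already have "full_text")
    site_files: dict mapping OSTI ID -> site-provided full text file name
    Returns (updated_records, unmatched_ids); the inputs are not modified.
    """
    updated = {osti_id: dict(record) for osti_id, record in records.items()}
    unmatched = []
    for osti_id, file_name in site_files.items():
        if osti_id in updated:
            # Site-provided full text overrides any full text already held.
            updated[osti_id]["full_text"] = file_name
        else:
            # No bibliographic record with this OSTI ID: report it, do not guess.
            unmatched.append(osti_id)
    return updated, unmatched
```

Returning the unmatched IDs separately keeps a mismatched file from being attached to the wrong record, which is the kind of check the slide asks for.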
HARVESTING
  Harvesting is also included in our efforts to achieve data consistency
A computer cannot turn bad data into good data.
John R. Pierce, American Communications Engineer and Scientist