1 / 26

The Need to Liberate The Data

Data Liberation Initiative. The Need to Liberate The Data. Michel Séguin DLI Chief December 2006. Data Liberation Initiative. The Need to Liberate The Data. Historically Stats Canada made published data available to public through the DSP

nhargrove
Download Presentation

The Need to Liberate The Data

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Data Liberation Initiative The Need to Liberate The Data MichelSéguin DLI Chief December 2006

  2. Data Liberation Initiative The Need to Liberate The Data • Historically Stats Canada made published data available to public through the DSP • These were regular paper publications and did not include electronic numeric files (ie. Public use micro data files) • Data files were available to researchers at marginal costs • Custom tables were another, more costly, method to access unpublished data

  3. Data Liberation Initiative The Need to Liberate The Data • In 1980’s federal budget cuts resulted in Stats Canada’s increased emphasis on cost recovery • In early 1990’s the cost of public use microdata files underwent a dramatic increase • This pushed most data files became out of reach for majority of academic researchers & students

  4. Data Liberation Initiative The Need to Liberate The Data • A consortium of universities had been created to gain access to 1986 Census data • This idea was well received by STC and led to a movement within academic community to Liberatethe rest of STC’s electronic datafiles • A paper in 1991: “Liberating the Data: Proposal for a Proposal” led to a working group to further investigate this idea • Group made up of reps. from: universities, SSFC, CARL, CAPDU, as well as STC and DSP

  5. Data Liberation Initiative The Need to Liberate The Data • Champions within both the academic community and Statistics Canada came forth to push this idea • Informal approval was received in 1995 • This was followed by the creation of: • An internal STC Steering Committee • A Project Team • An External Advisory Committee

  6. Data Liberation Initiative The Need to Liberate The Data • A Licence Agreement was drafted and approved • Author divisions were asked to provide their data to the Initiative • Institutions were invited to join the initiative • Other Gov’t agencies became involved and formal approval for 5-year pilot received from Treasury Board in early 1996

  7. Data Liberation Initiative The Need to Liberate The Data • Use of Internet as a dissemination tool seen as a key component of initiative • Established mechanisms for communications, storage, finding and ordering data • Created an FTP Site at STC • DLILIST - a forum for questions and sharing of information • WWW DLI ORDER DESK - for placing orders for products not on the FTP site • Began disseminating files in 1996

  8. Data Liberation Initiative The Need to Liberate The Data • Before DLI about 15 institutions offered a data service • Therefore co-operative training of members was seen as an extremely important aspect due to varying degrees of experience of members. • Established a training committee and began to develop a curriculum, identify trainers, establish budgets • Regional training workshops started in 1997

  9. Data Liberation Initiative The Need to Liberate The Data • Training workshops have been given in each region on an annual basis since then • One suggestion was to have another Orientation session for new members who missed the one in 1997 • This workshop and this special Orientation session part of continuing co-operative training

  10. Data Liberation Initiative The Need to Liberate The Data • In 1996 there were 50 post secondary members • In 1998 there were 61 • Today there are 70 members • There are over 19,000 files in the DLI collection including data files, documentation, CD’s etc. • Can now access the collection via DLI Web Site as well as FTP

  11. Data Liberation Initiative The Need to Liberate The Data • The DLI is now a permanent program at Stats Canada located within the Library and Information Centre • Today’s graduates have had the opportunity to use Canadian data throughout their studies • The DLI has been described as one of the most important developments in the social sciences in Canada for the past 50 years!

  12. Data Liberation Initiative What is The Data Liberation Initiative?

  13. Data Liberation Initiative • The Products • The Licence • The Service • The Community

  14. Data Liberation Initiative THEPRODUCTS • DLI provides access to Stats Canada data produced as standard electronic productsavailable to the public • These data are digitally encoded and stored in a file structure • These include: • Micro data Files • Geography Files • Databases • Aggregate data in table format

  15. Data Liberation Initiative THEPRODUCTS • Main focus of DLI Collection on Socio-Economic data: • Health • Education, Literacy • Labour Market, Income • Travel • Justice • Census, Demographic • Etc.

  16. Data Liberation Initiative THEPRODUCTS Not usually produced as a standard electronic product for public dissemination • DLI includes some business products such as: • Trade data • Financial Performance Indicators CD • Inter-Corporate Ownership • Fleet Report • Survey of Manufacturing

  17. Standard Electronic Product An “off the shelf ” electronic product available to the public Not included are standard publications available in electronic form as these are usually part of DSP Registered in STC Catalogue of Products and Services and has a Product Number Data Liberation Initiative THEPRODUCTS

  18. Data Liberation Initiative THEPRODUCTS • Metadata available in both Official Languages whenever available • New data products continually being added to Collection • Includes: • Updated data from regular on-going surveys • Data from ad-hoc special surveys -one time only • Data from new surveys in STC program

  19. Data Liberation Initiative THEPRODUCTS • Updates may be provided in different format than earlier version: • For example PUMF Beyond 20/20 • As new versions are received the DLI has to decide to either replace data or add to Collection • Over 19,000 files in Collection including: • Data files • Metadata & Readme files • Census & Geography • CD’s

  20. Data Liberation Initiative THEPRODUCTS • Not all products in DLI Collection are standard electronic products • Have some “special” products just for DLI which contain non-public data: • KLEMSdatabase • An experimental database of productivity data • Justice Statistics • Complete set of Beyond 20/20 tables normally only available to members of CCJS Initiative

  21. Data Liberation Initiative THELICENCE • DLI is open to all accredited Post Secondary Institutions in Canada • Data made available on a subscription basis • All member institutions must sign a Licence Agreement • Data made available to Educators, Students and Other Staff while they have such status at the Institution • E.g.. A student who goes to USA to do Masters no longer has access to data

  22. Data Liberation Initiative THELICENCE • Data is made available for: • Academic Research and Publishing • Teaching • Planning of academic/educational services • Use of data in textbooks falls under a different set of STC licences and permissions • Data not to be used in any commercial or private activities (even if no $$ involved) • DLI Contact responsible to ensure eligible use of data

  23. Data Liberation Initiative THELICENCE • Important elements of the Licence Agreement: • Data & products offered “as is “ • STC remains owner of intellectual property - only access to data is provided • Users must not link data or otherwise try to identify individual respondents • DLI Contact to implement data security measures • May have users sign before allowing access

  24. Data Liberation Initiative THESERVICES • DLI was conceived to be a Internet based means of dissemination - internet the main mode of data transfer and communications • DLI Team offers both an FTP and a Web based service for access to Collection • DLILIST - forum for making enquires, sharing of information and general communication between and among members • DLIORDER & WWW DLI ORDER DESK - processes to order hard copy versions of products not available electronically

  25. Data Liberation Initiative THECOMMUNITY • There are a number of advantages to belonging to DLI: • The DLI provides academic community with “one stop shopping” for STC products • Provide a forum for sharing information and obtaining advice • Value added to basic STC products (e.g. SPSS) • Participation in training workshops also a great “community builder”

More Related