430 likes | 550 Views
IQML overview and Metadata in Codacmos. Defeng Ma ( defeng.ma@ed.ac.uk ) Centre for Educational Sociology The University of Edinburgh, UK. Outline. IQML overview IQML QDT (Questionnaire Design Tool) IQML SAT (Survey Administration Tool) IQML Registry/Repository Codacmos project
E N D
IQML overviewand Metadata in Codacmos Defeng Ma (defeng.ma@ed.ac.uk) Centre for Educational Sociology The University of Edinburgh, UK
Outline • IQML overview • IQML QDT (Questionnaire Design Tool) • IQML SAT (Survey Administration Tool) • IQML Registry/Repository • Codacmos project • Metadata for data collection (WP6 in Codacmos)
IQML Partners • University of Edinburgh, UK – coordinator, QDT • Comfact, Sweden – QPT, DIT, XML specification • Desan, Netherlands – SAT • Dimension EDI, UK – standards, repository • CSO, Ireland – user • Statistics Norway – user • National Technical University of Athens, Greece – user Funding under EuroStat EPROS as part of the EU 5th Framework R&D
IQML modules • QDT(Questionnaire Design Tool): design the questionnaire • SAT (Survey Administration Tool): manage the survey • MetadataRepository: storage and retrieval of metadata • QPT (Questionnaire Presentation Tool): present questionnaire to the user and collection the data • DIT (Database Interrogation Tool): extraction of data from database to the questionnaire
QDT Metadata Repository SAT IQML IQML IQML QPT/ DIT IQML Metadata SAT QPT/DIT Repository Respondent Respondent qnaire qnaire Survey Survey info info Dataset Dataset The IQML Processing/Data Flow
Questionnaire Designer SurveyAdministration Questionnaire Presentation API QuestionnaireXML API MetadataRepository System Structure Common Model DatabaseIntegration Tool
Common model Web Servlet Repositpry Java Storage MetadataRepository Local Storage Application API API • Questionnaire designer (QDT) • Survey administration (SAT) • Externalisation (e.g.IQML XML form) C++, Java Script, Active X EEG6/WG4 adopted (as basis for XML4DR) Web Application Syntax is XML Interoperability, Technical Infrastructure and Standards • Shareable metadata • Published questionnaire • Questionnaire presentation (QPT) • Database interrogation (DIT)
Questionnaire UML Model in QDT
Models mapping forInteroperability • QDT model -> IQML common model (High level only) • Questionnaire, questionnaire body -> Questionnaire • Section, Question -> Set • Response Group -> Domain • Variable Group (Sub question) -> Tupel • Variable -> Data • Member of Response Group -> CodeList, Code • This mapping is used to publish questionnaire from QDT to Metadata repository
IQML QDT IQML Metadata Repository IQML SAT IQML QPT/DIT Respondent qnaire Survey info Dataset DESAN Web Server Survey DDI UEDIN Web Server Faster Survey Faster Nesstar Server (UKDA) Faster Nesstar publisher Publishing Qnaire DDI Catalogue.xml Searching URL of DDI object Catalogue.xml Cosmos Registry Mapping to external standards: DDI (in Cosmos project)
Group elements Notes Group elements Notes Group elements Notes Group elements Notes Group elements Notes section question section question section question section question section question Philosophy of QDT Create basic components Define Question type Question bank Assemble questionnaire Publish questionnaire
Defining Question Type • Question type defines • data type (text / integer / real / boolean) • response type (simple / multi / choice) • sub-questions (yes / no)
Main QDT components • Question type • Question bank • Element groups • Response Group • Missing value group • Sub-Question group • Note • Question • Section • Control Elements • Calculation • Validation • Navigation Rule (‘Dynamic Guidance’) • Questionnaire
QDT – complex question Question text Response group variable variable group Sub-question group
Functions • Sampling • import population database • select sample(s) • Survey definition • questionnaire (QDT -> Reg/Repository -> SAT) • sample(s) (SAT -> Reg/Repository -> SAT) • Distribute survey • Monitor response • Send reminders
Sample & Survey Process txt xsl Questionnaire (QDT) Pop (RR) Survey sample Sample
Distribution & Monitor Process Survey • Html generation • Prefill • Distribution Message Response (QPT) Reminder Monitoring
IQML Reg/Rep • ebXML standard • Web servlet Interface with Tomcat web server • Remote accessible • Administration GUI • Not directly visible for the user, the QDT and SAT can communicate through the Reg/Rep • IQML common model for storing objects in the repository
Contact Details for IQML Defeng Ma (Defeng.Ma@ed.ac.uk) http://www.epros.ed.ac.uk/iqml
IQML -> Codacmos • IQML is mainly focus on the primary data collection based on the web/email survey (Trails in SSB, CSO and NSSG) • Codacmos (Cluster of Data Collection Integration & Metadata Systems for Official Statistics ) is a cluster of relevant projects • TELER, • DATAMED, • IQML and • IPIS
Codacmos: Goal • Main Goal is to identify ways • improve the cost-effectiveness of the data collection process and the quality of official statistics, and • lowering the statistical burden of the respondents
Codacmos: Workpackages • WP1: Management and Coordination • WP2: Integration of available results • WP3: Primary data collection • WP4: Secondary data collection • WP5: Integration of primary and secondary data collection • WP6: Metadata system for data collection • WP7: Relevant issues for FP6 • WP8: Assessment, Evaluation and Peer review • WP9: Dissemination and Publication
Data collection NSI Respondent Respondent Market Research Institute Respondent Tax Office Respondent
Data sharing Data Provider NSI Data Provider EuroStat Data Provider World Bank
Optimum solution? Data Provider NSI Input Output Data Provider EuroStat Virtual Metadata Repository Data Provider World Bank
Data Definition Production Dissemination collection of statistics Scope of the WP6 model Statistical production process* Focus on Data collection, Thinking of the whole process But: where to stop? * Picture from H. Rouhurirta: “On the structing of statistical Information”, First Metanet conference, 2001
Processing metadata Primary Data Collection Content Metadata Integration of primary and secondary Statistical Data Secondary Data Collection Metadata for data collection • Metadata for definition statistics (survey, questionnaire, statistical unit, etc.) • Metadata for production (disclosure, quality control, constrains, etc.) Additional:
Looking back … • State-of-art review • Existing systems/models review and analyse • Study • Similarity/Common elements • Difference • Pro/Con • Archive the common understanding of the requirements/user needs
Top-down approach for Common metadata model • Develop the Abstract model at top level first to cover the common aspect of the metadata for data collection process (cover the large scope) • Then go down in the low levels with more details of the metadata models (few examples) as far as we can: • Statistical data (content) • Processing • Constraints • Survey/questionnaire • …
Working with the top down • Generalise the data collection process • Abstract metadata model for data collection • Define the level of the metadata • Define the category of the metadata • Define the exchange model of the metadata • Define common terminology
The abstract model • Keep the model • Generic • Simple • Clear
Bottom-up approach: Start with what we have… • Metadata in the view of different aspect • Primary/Secondary data collection • Organisation/International • Enterprise/individual • Economical/Social • Administrative/statistical data • Centralised/distributed • Register/Survey • Questionnaire/processing/transformation, etc.
Integration/Combination of top down and bottom up • Mapping existing models to common model • Identify the tasks for integration • Outline the working direction/guideline for improve the existing systems • Refine the common model • Specify/define the demo experiment field and strategy of the demo
WP6 work plan • Before 1st TM: • preparation, collection, review of state-of-art • 1st TM (Sep. 2003): • Get common understanding on the target and tasks • Detailed working plan • Draft structure of the common model • Initial discussion of the Integration analyse • After 1st TM: • common core model • integration analyse of the existing system • Demo definition and specification
Work plan (cont.) • 2nd TM (Jan. 2004) • Finalize common model and integration of existing system • Demo preparation • Workshop preparation • Deliverables preparation • After 2nd TM • Workshop • Demo • Deliverables
Codacmos website http://www.codacmos.eu.org Project coordinator: Alberto Sorce ISTAT source@istat.it Contact Author: defeng.ma@ed.ac.uk