90 likes | 208 Views
Data Quality in Cooperative Information Systems. Monica Scannapieco (monscan@dis.uniroma1.it) Dipartimento di Informatica e Sistemistica Universita’ di Roma “La Sapienza ”. Who am I ?. Ph.D. Student in Computer Engineering (next december Ph.D. defense)
E N D
Data Quality in Cooperative Information Systems Monica Scannapieco (monscan@dis.uniroma1.it) Dipartimento di Informatica e Sistemistica Universita’ di Roma “La Sapienza” Monica Scannapieco - Dagstuhl Seminar (September 2003)
Who am I ? • Ph.D. Student in Computer Engineering (next december Ph.D. defense) • Member of the Data & Knowledge Bases Group at the Dipartimento Informatica e Sistemistica, Università di Roma “La Sapienza” • Leaders of the group: Tiziana Catarci and Maurizio Lenzerini • Group Members: D. Berardi, E. Bertini, A. Cali, D. Calvanese, G. De Giacomo, D. Lembo, S. Kimani, M. Mecella, R. Rosati, G. Santucci Monica Scannapieco - Dagstuhl Seminar (September 2003)
Data Quality Research Issues • DaQuinCIS project - “Methodologies and Tools for Data Quality inside CooperativeInformation Systems'' • Università di Roma “La Sapienza” • Università di Milano “Bicocca” • Politecnico di Milano • Web Site: http://www.dis.uniroma1.it/~dq/ • Cooperative Information Systems include cooperation at a data level (e.g., data integration systems) and at a process level (e.g., inter-organizational workflow) • My research issues on DQ in CIS: • Conceptual Modeling • Process Modeling Languages • Data Integration strategies • Improving techniques Monica Scannapieco - Dagstuhl Seminar (September 2003)
An Architecture for DQ in CIS Monica Scannapieco - Dagstuhl Seminar (September 2003)
The DaQuinCIS Main Components • CIS organizations export data + quality • Semistructured data model D2Q (data and data quality model) • XQuery to query both data and quality • The Data Quality Broker processes query + quality • Returns the best quality result • Performs an improvement function • Quality Notification Service • Publish and Subscribe engine to maintain quality levels • Rating Service • Trust model rating the credibility of sources vs. exported data Monica Scannapieco - Dagstuhl Seminar (September 2003)
Data Quality inWeb Information Systems Dynamic Quality DQViewer to visualize quality metadata Monica Scannapieco - Dagstuhl Seminar (September 2003)
“Real World” Applied Research with Carlo Batini (AIPA and Università di Milano Bicocca) • Current: Quality of Addresses in the Italian Public Administration • Joint project of the Italian National Authority for Information Technology (AIPA) and the Italian National Institute of Statistics (ISTAT) • Past: DQ-driven process redesign in the Italian Public Administration-the RAE project Monica Scannapieco - Dagstuhl Seminar (September 2003)
Reference Bibliography (1) • (Main) DaQuinCIS platform publications: • M. Mecella, M. Scannapieco, A.Virgillito, R. Baldoni, T. Catarci and C. Batini, “Managing Data Quality in Cooperative Information Systems”. In Proc. of CoopIS 2002. • Extended version to appear on Journal of Data Semantics, 2003 • Scannapieco M, Virgillito A., Marchetti M., Mecella M., Baldoni R.: The DaQuinCIS architecture: a platform for exchanging and improving data quality in Cooperative Information Systems. To appear on Information Systems, 2003. • P. Bertolazzi, L. De Santis, M. Scannapieco, “Automatic Record Matching in Cooperative Information Systems”. In Proc. of the ICDT 03 Workshop on Data Quality in Cooperative Information Systems, Siena, Italy, 2003. • L. De Santis, M. Scannapieco, T.Catarci: “Trusting Data Quality in Cooperative Information Systems”. In Proc. of CoopIS 2003. Monica Scannapieco - Dagstuhl Seminar (September 2003)
Reference Bibliography (2) • Dq on the Web • B. Pernici, M. Scannapieco: “Data Quality in Web Information Systems”. In Proc. of ER 2002. • Extended version to appear on Journal of Data Semantics, 2003. • Experiences • Bertoletti M., Missier P., Scannapieco M., Aimetti P., Batini C.: ImprovingGovernment-to-Business relationships through data reconciliation and process re-engineering. In Proc. of ICIQ 2002. • Extended version to appear on AMIS-IQ Monograph, 2003. • Process Modeling • Scannapieco M., Pernici B., Pierce E. B., IP-UML: A Methodology for Quality Improvement based on IP-MAP and UML. In Proc. of ICIQ 2002. • Extended version to appear on AMIS-IQ Monograph, 2003. • Quality Dimension Definitions • T.Catarci, M. Scannapieco: “Data Quality under the Computer Science Perspective”. Italian Journal of “Archivi & Computer”, Anno XII, number 2/02, 2002. Monica Scannapieco - Dagstuhl Seminar (September 2003)