330 likes | 519 Views
Data Sharing at Scientific Database of CAS. Computer Network Information Center, CAS Xiao Yun June 21, 2004. Outline. 1. Background 2. Data Resources 3. Sharing Demand 4. Sharing Policy 5. Share Methods 6. Standard 7. System Platform 8. Development Planning. Background.
E N D
Data Sharing at Scientific Database of CAS Computer Network Information Center, CAS Xiao Yun June 21, 2004
Outline 1. Background 2. Data Resources 3. Sharing Demand 4. Sharing Policy 5. Share Methods 6. Standard 7. System Platform 8. Development Planning
Background • CAS is China’s supreme institution on natural science research. • Scientists accumulated a large sum of precious data resources during the long-term scientific research practice, but these data hadn’t been fully developed and applied because of the backward management methods before the project of Scientific Database was initiated. • 1960’s,The Development of database technology made the effective storage, management, application of large sums of scientific data possible. • 1970’s,Some chemical institutes of CAS began to develope their own databases. • 1983年,CAS put forward the great project of “Scientific Database and its Information System”.
Background(Cont.) • Major project of State Planning Commission (1986-1995) • Major project of CAS (1986-1990) • Special support project of CAS basic research (1991-2000) • Major project of NSFC network application (1995-1996) • Major project of CAS 10th five-year informationalization construction (2001-2005) Scientific database, SDB, since 1983
Background(Cont.) • Scientific database system is large-scale and multi-discipline clusters of scientific databases; • Scientific database is not only a research project, but also an engineering project; • Scientific database has played important roles in many areas: • Scientific research • Science and education • Enterprise and economy • Information and service • … • First-grade prize of CAS in 1997, and second-grade prize of the state in 1998
Organizational Management CAS Experts Committee of Scientific Database CNIC Scientific Database Office Scientific Database Center Organics institute Geography institutes Zoology institutes Microbiology institute ······
Organizational Management (Cont.) • Experts Committee of Scientific Database • Academic leading institution of scientific database • Composed of data experts from institutes of CAS • Responsible to policy-making, funds management, project evaluation,etc • Scientific Database Office • Responsible to the daily management of scientific database • Scientific Database Center • Organize construction of platform and standards • Provide basic environment of operation service • Provide data integration and support service • Database developer • Specialized database construction
Data resources • Database building units: 45 • Number of professional databases: 313 • Data volume: 8.2TB • Data service websites: 45 • On-line data volume: 4.3TB By November 2003 Data had increased greatly; more subjects had been covered; and Internet had promoted share work.
Sharing Demand • The huge challenge faced by modern science needs more share • The scientific research issue is ever complicated, scientific research is not simple and isolated any longer. • The real-time obtaining and processing of scientific research information and data, simulation and large-scale calculation have become the main methods of analysis, discovery and forecast, they need the accumulation and support of large sums of scientific data. • Closer cooperation and communication among scientists is the drive of share • Modern network information technology has provided technical conditions for large-scale and large-scope data share without time and space restriction
Sharing Demand (Cont.) • It’s not a problem of whether or not t, but a problem of how to share: • What kind of data share management policy is needed? • How should we promote scientists to share all the data needed by their scientific research? • How should we combine data share with market economy and knowledge economy? • Data sharing will speed the construction of e-Science environment, providing a better platform for data resource re-construction and integration service. • Data sharing will make it possible to do research and share resource among different academic domains throughout the world.
Sharing Policy • The data sharing is a systematic project. • The core is data sharing rules, and in addition, the corresponding system platform, standards, criteria, data resources construction and the auxiliary measures. • Data share policy provide important action reference for the planning, construction and service activities of scientific database.
Data share of scientific database cooperation Sharing Policy Training and communication System Platform Sharing rules Data resources Standard,Criteria
Sharing rules • There are 10 chapters and 54 articles in the frame of sharing methods, including: (1) General rules, (2) Share management system of scientific data, (3) Classification and grading, (4) Issue and share, (5) Gathering of scientific data, (6) Integration management, (7) User grading, (8) Application and protection of intellectual property rights, (9) Prizing and punishment, (10) Annexes: terms. • Define the rights, obligations, responsibilities and protocols in data sharing activities by management system, participating roles and data meaning.
Sharing rules (Cont.) • Sharing principle • Don’t damage of the state’s and database developors’ interests, effectively protect the intellectual property rights, make the scientific data used and shared widely and freely, and realize the standard management, effective use and value-added service of scientific service; • The data resources should be issued in society, let the governmental sectors, scientific research staffs and public share and use it in a wide scope
SDB Expert Committee Data center Branch center of subject Branch center of subject Database Developer Database Developer Database Developer Data service User Data gathering User User User Sharing relus(Cont.) Is an execution system of gathering scientific data, integrated management, share service and arbitration • Management system • Expert committee • Data center • Branch center • Developer • Data user
Branch center Database developer Right Obligation Data center Development Share Producer Quality control Manager and server Integration Share Issuing Property right User Property right Gathering Training Issuing Users of project Gathering Quality control Share Public good of project Property right Profits of project Sharing rules (Cont.) • Define data share management system and behavior criteria in share activities, according to the main roles of data sharing.
Sharing relus (Cont.) Articles of intellectual property rights • Article 37 The ownership of all databases and branch databases built by the CAS project of “Scientific Database” belongs to the database building units in principle. • Article 38 CAS has the access right of all databases and branch databases built by the CAS project of “Scientific Database” for free and with no restriction, and authorizes “Scientific Database Expert Committee of CAS” to exert the access right. • Other development behavior of scientific database should be permitted by the Expert Committee and corresponding database building units in written form, and be marked with “CAS Scientific Database”. • The returns and interests gained basing on scientific database should be rationally distributed between development participators and data producers or providers, the concrete distribution methods should be decided in the form of contract.
Standards, Criterion • Criterion of establishing project and evaluation • Criterion of phase work checking • Criterion of appraisal of project Project management • To make necessary standards for the construction, management, integration and share of scientific database is an inevitable step of promoting data sharing. • Standards of grading and classifying scientific data • Standards of sharing and issuing scientific data • Standards of sharing and issuing scientific data of all subjects • Data service standards of scientific database Sharing service • Database construction process • Database construction file • Database management and service Database • Metadata format and specifications of data set • Metadata format and specifications t in different domain • Mutual operation standards of metadata Metadata • Data format • Standards of data exchange • Data classification and code • Data quality control and evaluation Data
Standard(Cont.) • Main work • Project management • Project Management rules of Scientific Database • Sharing service • Data Share rules of Scientific Database 1.0 • Database • Construction Process and File Standards of Scientific Database 1.1 • Metadata • Core Metadata Standards of Scientific Database 1.2 • Atmospheric Data Metadata Standards of Scientific Database 1.0 • Ecological Data Metadata Standards of Scientific Database 1.1 • Multimedia Metadata Standards of Scientific Database 1.0 • Data • Has started the study of data quality evaluation standards of scientific database
System Platform • Providing basic environment and technological platform for the sharing, service and integrated application of scientific database. • Hardware environment • Super server • Massive storage system • Software platform • Scientific DataGrid Middleware platform • Network environment • www.CSTnet.net.cn
SDG Platform • Data Center • 20TB SAN Storage • TFLOPS-scale computing capacity Lenovo DeepComp 6800
SDG Software Application applications • SDG Middleware Architecture app-oriented, unified program interface SecuritySystem Info. Service Grid API coordinated access to multiple data resources Data Res. Broker uniform access interface to single data resource Uniform Access Int. local data management system, could be DBMS or file system Local Data System databases
SDG Software • SDG Software Modules
Development Planning • On the foundation of current “data sharing methods of scientific database”, go on perfecting the share methods and make it operable. • Now the research of detailed rules of data sharing on geography and chemistry has been started, that of more subject areas will be promoted to further advance and guide share. • Combine the future development planning and data sharing policy of scientific data, refer to the principle spirit of data sharing in the construction of data resources, standard, system platform, training, communication, and application system development, etc. • Standardize and promise the obligations of corresponding roles in data sharing in the contact of the subprojects. • Rapidly promote the metadata project application of scientific database, develop directory system of data resources, fasten the implementation of data sharing rules.
Serve for e-science program! Serve for science and technology innovation! Thanks!