Report on Data Activities in China
Dr. Yan Baoping (ybp@cnic.ac.cn)
Vice-President of CODATA-China; General Director of CNIC, CAS
CODATA-DSAO, Bangkok, Thailand, Jan. 12-12, 2006

Outline
• Requirements on Scientific Data
• Main Data Activities
• National Programs on Data Activities
• Summary
Requirements on Scientific Data
Scientific discovery & innovation
• Data form the foundation of scientific discovery
• In the past, scientific data explained the observable world
  • Extraction of essence
  • Explanation of the complex
  • Prediction from data
• Today, we have exciting new capability to observe nature
  • Requires and generates large amounts of quality data
The information revolution coming with computer and Internet technology
• Data are at the very heart of the revolution
Requirements on Scientific Data
Data sharing and exchange
• Scientific data are a valuable national strategic resource
• Full and open access for the public, in a timely and equitable manner
• Closer cooperation and communication among scientists
• Frontier science and big-science programs need large-scale, wide-scope data support
Main Data Activities
Scientific data activities:
• Data sharing and exchange
• Data and metadata specification
• Database construction and integration
• Data quality control and assessment
• Data sharing policies
• Information systems and platforms
• Support and consulting
• Network
• Supercomputer
• Storage
• Applications and services
National Programs on Data Activities
1. Scientific Database and Information System
   Founded by CAS in 1982
2. Platform Construction for National S&T Infrastructure
   Founded by MOST in 1999; supported by MOST, MOE, MOF and NDRC since 2004
3. Scientific Basic Resource Platform of MOE
   Founded by MOE in 2004
4. Land and Resource Data Center
   Founded by MLR in 2003
5. NSFC Scientific Data Collection and Sharing
Scientific Database and Information System (SDB)
• Founded by the Chinese Academy of Sciences in 1982
• 1986-2000
  • 725 GB
  • 180 databases
  • 19 member institutes involved
• 2001-2005
  • CAS initiated the Informatization Project
  • SDB is one of the infrastructures of CAS informatization
Scientific Database and Information System (SDB)
• 2002: Three subprojects of SDB initiated
  • Scientific Data Resource Construction
  • Scientific Data Standard and Specification Making
  • System Platform Construction (Scientific Data Grid, SDG)
• 2003: Prof. Jiang Mianheng, Vice President of CAS, signed to formally initiate the "Scientific Database and Information System Project"
• 2000-2005: CAS provided US$7.50 million in support
Objectives of SDB
• Expand and strengthen data accumulation, integration and sharing
• Improve the digital environment for S&T research
• Set up an information and data service system for S&T research and social development
• Promote the transformation of data into knowledge
Organization of SDB
• Bodies: Expert Committee (EC), Executive Office (EO), Scientific Database Center (SDC), member institutes
• Organization chart labels: CAS, EC, CNIC, EO, SDC
• Member institutes shown: Inst. of Geography, Inst. of Microbiology, Inst. of Engineer and Process, ······
Technical Training and Exchange
• Biennial SDB Technical Symposium
  • During the 10th Five-Year Plan ("Ten-Five"): three symposiums and three publications
• Annual technical training
  • 5 technical training sessions
SDB Achievements and Activities
1. Metadata specification
2. Databases
3. Quality control and assessment
4. Storage
5. Portal
6. Supercomputer
7. Applications
8. High-speed network
9. Support
10. e-Science
Growth of SDB Data Volume
By 2005:
• 45 institutes of CAS
• 503 databases
• A gross volume of 16.6 TB
• 10 TB available on the Internet
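As a rough check on the growth figures, the sketch below compares the totals reported for 2000 (725 GB, 180 databases) with those reported for 2005 (16.6 TB, 503 databases, 10 TB online). The variable names are illustrative, and the conversion 1 TB = 1000 GB is an assumption, since the slides do not say whether decimal or binary units are meant.

```python
# Figures quoted in the slides. Decimal units (1 TB = 1000 GB) are an
# assumption; the slides do not state which convention was used.
GB_PER_TB = 1000

volume_2000_gb = 725               # gross volume by 2000
volume_2005_gb = 16.6 * GB_PER_TB  # gross volume by 2005
online_2005_gb = 10 * GB_PER_TB    # volume available on the Internet by 2005
databases_2000 = 180
databases_2005 = 503


def growth_factor(old, new):
    """Return how many times larger `new` is than `old`."""
    return new / old


volume_growth = growth_factor(volume_2000_gb, volume_2005_gb)
database_growth = growth_factor(databases_2000, databases_2005)
online_share = online_2005_gb / volume_2005_gb

print(f"Data volume grew about {volume_growth:.1f}x between 2000 and 2005")
print(f"Database count grew about {database_growth:.1f}x")
print(f"About {online_share:.0%} of the 2005 volume was available online")
```

Under these assumptions, the data volume grew roughly 23-fold while the number of databases grew a little under 3-fold, and about 60% of the 2005 volume was reachable over the Internet.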
SDB Metadata Registry System
• Registry
• Search
• Mapping
• Explore
SDB System Platform
Software and hardware:
• Super server
• TB-scale storage
• Visualization system
• High-speed network
• System software
Super server: Lenovo 6800
• 59 nodes (4-way)
• SAN: 20 TB; tape: 50 TB
• 2 Gbps network bandwidth
SDB Portal
• 7×24 service
• Log in once, access all databases
http://www.csdb.cn
SDB On-line Data Serving
Statistics of the SDB portal, 2003.8-2005.10:
• Visits: 2.50 million
• Page views: 18.50 million
SDB Typical Applications
2000-2005: more than 100 application cases.
Scientific research
• Space environment data are used to help ensure the safety of manned spaceflight
• Chemical data are used in anti-SARS research
• Data in the Virus Database are used to identify avian flu
• Natural resource data are used to study the evolution of the ecology and environment, and the sustainable development of soil and water, in West China
• Genetic data are used in SARS gene studies
• …
Social and economic development
Applications Based on SDB
• International cosmic ray data processing system
  IHEP of CAS, CNIC of CAS
• Avian Flu Information Platform and Alarm System
  IMB of CAS, IV of CAS, IZ of CAS, CNIC
Platform Construction for National S&T Infrastructure (MOST)
Founded by MOST in 1999
• Experimental base and large-scale scientific apparatus sharing platform
• Science research network environment
• Natural scientific resource sharing platform
• Achievement-transfer and public service platform
• Scientific Data Sharing Platform (SDSP)
• Scientific literature sharing platform
Scientific Data Sharing Platform (SDSP)
• More than US$12 million invested
• The whole project can be divided into three subsystems:
  • 1 portal system
  • 20 scientific data centers or scientific data networks
  • Over 300 main databases
• By 2005, 12 trial scientific data sharing centers had been established
• Data sharing policies have been made
• Metadata specifications have been made
• The portal system has been developed
Portal of SDSP http://www.sciencedata.cn
12 Science Data Centers (SDC) of SDSP
• Forest SDC
• Basic SDC
• Earthquake SDC
• Meteorological SDC
• Rural S&T SDC
• Oceanic SDC
• Medicine SDC
• Sustainable Development SDC
• Hydrological SDC
• Survey & Mapping SDC
• Agriculture SDC
• Earth System SDC
Scientific Basic Resource Platform of MOE
• Founded by the Ministry of Education in 2004
• Integrates the scientific basic resources distributed across universities, using advanced information and communication technology
• 17 subsystems; about 100 universities are involved
Land and Resource Data Center (LRDC)
• Founded by the Ministry of Land and Resources in 2003
• Participating organizations:
  • Information Center of the Ministry of Land and Resources (leading organization)
  • Bureau of Geological Survey
  • Chinese Academy of Geological Sciences
  • Chinese Land Surveying and Planning Institute under the Ministry of Land and Resources
• 37 databases
Activities and Achievements of LRDC
Land and Resource Data Center (LRDC), founded by MLR in 2003
• Land and resource data standardization and integration
• Land and resource data sharing policies and specifications
• Information service system platform
• Public service system, such as data retrieval based on metadata
System Platform of LRDC http://www.mlr.gov.cn/pub/sjgx/index.htm
NSFC Scientific Data Collection and Sharing
• In recent years, all projects supported by NSFC have been required to provide their complete scientific data
• NSFC has also made a series of regulations to manage and share the scientific data
• Scientific data are playing a key role in national S&T innovation
Summary
• The main data activities in China include metadata specification, data integration, information systems, applications and services, networks and supercomputing.
• The Scientific Data Sharing Platform (SDSP) has spent over US$12 million to support data activities in the past 5 years.
• In 2005, the Ministry of Science and Technology, Ministry of Finance, Ministry of Education and National Development and Reform Commission issued the "Suggestion on Implementing the Platform Construction for National S&T Infrastructure during the 11th Five Year Plan", which will strengthen the construction of the National Science Research Foundation. In particular, the budget for scientific data sharing over the next 5 years reaches US$375 million.
Summary
• In the near future, the Chinese government will organize a leading committee under the Chinese State Department to guide and supervise scientific data activities. Meanwhile, all the databases such as SDB, LRDC and the Data Platform of MOE will be gradually integrated into SDSP, which will serve as the support platform for S&T data in the national S&T innovation system.
• SDB has achieved a lot over the past 20-plus years: we made the SDB metadata specifications, established the supercomputing and high-speed network environment, accumulated and integrated 16.6 TB of databases, and developed a unified information platform.
Summary
• SDB is now serving as the support system of S&T data for CAS S&T innovation. In the next five years, CAS will promote scientific research and S&T innovation by establishing the e-Science environment, and we believe the whole society will benefit from e-Science.
• We have gained some experience and learned some lessons in the process of scientific data activities. We hope to strengthen cooperation with other countries in scientific data activities and in scientific research.
Comments are welcome, and thanks a lot!