Scientific Collaboration Cyberspace TieJian Luo Ph.D. tjluo@gucas.ac.cn
Agenda • Motivation • Modeling Collaboration • The Challenges for e-scientists • Five Pilot Projects • Lessons learned
New Science Paradigms • A thousand years ago: Experimental Science - description of natural phenomena • Last few hundred years: Theoretical Science - Newton’s Laws, Maxwell’s Equations … • Last few decades: Computational Science - simulation of complex phenomena • Today: e-Science or Data-centric Science - unifies theory, experiment, and simulation - uses data exploration and data mining • Data captured by instruments • Data generated by simulations • Processed by software • Scientist analyzes databases/files (With thanks to Jim Gray)
Scientific Computing Research Paradigm • Verify model: (1) problem model (2) experiment, data collection (3) get computing resource (4) computing, verify model • Deduce natural law: (1) identify problem domain (2) experiment, data collection (3) get computing resource (4) analyze data, deduce law • [Diagram labels: data, extract information, simulation result, data mining, model, deduce idea, scientific computing]
The Problem for the e-Scientist [Diagram: experiments & instruments, other archives, literature, and simulations supply facts; the scientist poses questions and receives answers] • Data ingest • Managing a petabyte • Common schema • How to organize it? • How to reorganize it? • How to coexist & cooperate with others? • Data query and visualization tools • Support/training • Performance • Execute queries in a minute • Batch (big) query scheduling
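The two performance goals on this slide (answer queries within a minute, schedule big queries as batch jobs) can be illustrated with a small routing sketch. Everything here is an assumption made for illustration: the 60-second cutoff, the `Query` and `QueryScheduler` names, the availability of a cost estimate, and the shortest-job-first ordering of the batch queue. The slides do not describe how SCC actually schedules queries.

```python
import heapq
from dataclasses import dataclass, field
from typing import List, Optional

# Assumed cutoff, taken from the "execute queries in a minute" goal.
INTERACTIVE_LIMIT_S = 60.0


@dataclass(order=True)
class Query:
    est_seconds: float               # estimated run time (hypothetical input)
    name: str = field(compare=False)  # ignored when ordering queries


class QueryScheduler:
    """Run short queries immediately; queue long ones for batch execution."""

    def __init__(self, limit_s: float = INTERACTIVE_LIMIT_S) -> None:
        self.limit_s = limit_s
        self._batch: List[Query] = []  # min-heap keyed on est_seconds

    def submit(self, query: Query) -> str:
        """Return which path the query takes: 'interactive' or 'batch'."""
        if query.est_seconds <= self.limit_s:
            return "interactive"
        heapq.heappush(self._batch, query)
        return "batch"

    def next_batch(self) -> Optional[Query]:
        """Pop the shortest queued batch query, or None if the queue is empty."""
        return heapq.heappop(self._batch) if self._batch else None
```

Shortest-job-first is only one possible batch policy; a shared facility might prefer fair-share or deadline-based ordering instead.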
Scientific Data Life Cycle • Data Acquisition • Data Ingest • Metadata • Annotation • Provenance • Data Storage • Data Cleansing • Data Mining • Curation • Preservation
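The life cycle can be written down as an ordered set of stages. The sketch below assumes the stages run strictly in the order the slide lists them, which the slide itself does not state (the original figure may be a loop or allow branching); `Stage` and `next_stage` are illustrative names only.

```python
from enum import Enum
from typing import Optional


class Stage(Enum):
    """Life-cycle stages, in the order they appear on the slide (assumed)."""
    DATA_ACQUISITION = 1
    DATA_INGEST = 2
    METADATA = 3
    ANNOTATION = 4
    PROVENANCE = 5
    DATA_STORAGE = 6
    DATA_CLEANSING = 7
    DATA_MINING = 8
    CURATION = 9
    PRESERVATION = 10


STAGES = list(Stage)  # Enum iteration preserves definition order


def next_stage(stage: Stage) -> Optional[Stage]:
    """Return the stage that follows `stage`, or None after the last one."""
    idx = STAGES.index(stage)
    return STAGES[idx + 1] if idx + 1 < len(STAGES) else None
```

For example, `next_stage(Stage.DATA_ACQUISITION)` yields `Stage.DATA_INGEST`, and `next_stage(Stage.PRESERVATION)` yields `None`.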
What is a Scientific Collaboration? • Definition: two or more people working together to create or achieve the same thing. • Interdisciplinary participants • Teamwork • Agreement • Common interests • Divide a task into several pieces of work • Discuss problems, use instruments and share information • Goal: create new knowledge • The problem: how do we make this happen? • Solution: Human Cooperation + Resource Sharing
Basic Collaboration Model (G. Olson) • People to People: Communication, Groupware Services • Access to Facilities: Interaction with the Physical World, Online Access to Instruments • Access to Information: Digital Libraries, E-Pub, Search Services • The concept dates back to 1989 • National Collaboratories: Applying Information Technology for Scientific Research, NAP, 1993
Current collaboration technology • Electronic communication tools send messages, files, data, or documents between people and hence facilitate the sharing of information. • e-mail • faxing • voice mail • Web publishing • Electronic conferencing tools also facilitate the sharing of information, but in a more interactive way. • data conferencing: networked PCs share a common "whiteboard" that each user can modify • voice conferencing: telephones allow users to interact • video conferencing (and audio conferencing): networked PCs share video or audio signals • Internet forums (also known as message boards or discussion boards): a virtual discussion platform to facilitate and manage online text messages • chat rooms: a virtual discussion platform to facilitate and manage real-time text messages • electronic meeting systems (EMS): a conferencing system built into a room. The special-purpose room will usually contain a large screen projector interlinked with numerous PCs. • Collaborative management tools facilitate and manage group activities. • electronic calendars (also called time management software): schedule events and automatically notify and remind group members • project management systems: schedule, track, and chart the steps in a project as it is being completed • workflow systems: collaborative management of tasks and documents within a knowledge-based business process • knowledge management systems: collect, organize, manage, and share various forms of information • extranet systems (sometimes also known as 'project extranets'): collect, organize, manage, and share information associated with the delivery of a project (e.g., the construction of a building) • social software systems: organize social relations of groups
Case Study: China State Key Labs • Number of labs by area of study: Chemistry 22, Math and Physics 15, Geosciences 18, Biology 38, Information 26, Materials 18, Engineering 25 • More than 10k users • More than 6k large instruments
Physical Resource and Virtual Team [Diagram: four State Key Labs (SKL A, SKL B, SKL C, SKL D), each holding know-how, devices, people, info, and data; virtual teams A, B, and C are formed across the labs]
Scientific Collaboration Cyberspace [Layered architecture diagram, labels as extracted] • Domain: Bio, Che, Phy, Mat, Eng, Math, Geo • Joint resources: sites, services, …… • Management: collaboration ‘game rules’, pub/search, know-how, expert services, metrics • Team, project, task, instrument access, semantic data, network communication • Network: Auth, ACL, LDAP, SSL, …… • Infrastructure: network, data resources, instruments, computing • Physical: archive data, multimedia, raw data, instruments
Modeling Elements • Entities: participant, management institute, affiliate organization, e-community • Relationships: info flow, capital flow, data flow, communication, management • Objects: instrument, data storage • Behavior: {action description}, carried out according to a constraint • Container: collaboration cyberspace {name}
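One way to make these modeling elements concrete is a small typed graph: entities and objects become nodes, relationships become typed edges, and the collaboration cyberspace is the container that holds both. This is a minimal sketch under assumptions; the class names, the decision to treat instruments and data storage as node kinds alongside participants, and the membership check in `connect` are all illustrative and not taken from the SCC implementation.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import List


class NodeKind(Enum):
    PARTICIPANT = "participant"
    MANAGEMENT_INSTITUTE = "management institute"
    AFFILIATE_ORG = "affiliate org"
    E_COMMUNITY = "e-community"
    INSTRUMENT = "instrument"
    DATA_STORAGE = "data storage"


class FlowKind(Enum):
    INFO = "info flow"
    CAPITAL = "capital flow"
    DATA = "data flow"
    COMMUNICATE = "communicate"
    MANAGEMENT = "management"


@dataclass(frozen=True)
class Node:
    name: str
    kind: NodeKind


@dataclass(frozen=True)
class Relationship:
    source: Node
    target: Node
    kind: FlowKind


@dataclass
class Cyberspace:
    """Container for nodes and the typed relationships between them."""
    name: str
    nodes: List[Node] = field(default_factory=list)
    relationships: List[Relationship] = field(default_factory=list)

    def add(self, node: Node) -> Node:
        self.nodes.append(node)
        return node

    def connect(self, source: Node, target: Node, kind: FlowKind) -> Relationship:
        # Assumed rule: both endpoints must already belong to this cyberspace.
        if source not in self.nodes or target not in self.nodes:
            raise ValueError("both endpoints must be added before connecting")
        rel = Relationship(source, target, kind)
        self.relationships.append(rel)
        return rel
```

Behaviors and constraints are left out here; they could be added as callables attached to a relationship once their semantics are pinned down.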
Knowledge Creation and Distribution Model: ensure a fair game! • Legend: participant, management agency, affiliate organization, e-community, instrument, data storage, platform, behavior, constraints, info flow, capital flow, data flow, communication, management, {activity description}, {rule name} • [Diagram: labs (1,n) and members (1,m) take part in the collaboration cyberspace under a members' agreement; outside labs (1,q) and extra members (1,p) take part under a buy rule] • Numbered flows: 1. contribution 2. get knowledge 3. oversee
SCC Web Architecture • Benefits • Adaptability • Templates for more than 10 different science subjects • Scalability • A dynamically growing virtual organization (VO) does not affect platform performance • Extensibility • New services plug into the platform easily
Portal model for accessing autonomous resources [Diagram: a user reaches Org. A through Org. F via the SCS portal] • Benefits: • Trusted access path • Single sign-on • Delegated permissions and proxies • Interoperability
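The single sign-on and delegation benefits can be illustrated with a signed-token sketch: the portal authenticates the user once and issues a token scoped to one organization, and that organization verifies the token without contacting the user again. This is a simplification under stated assumptions: the shared secret, the `user|org|expiry|signature` token layout, and the names `Portal` and `verify_token` are hypothetical, and a production deployment would more likely rely on certificates or an established single sign-on protocol. User and organization names are assumed not to contain the `|` separator.

```python
import hashlib
import hmac
import time
from typing import Optional


def _sign(secret: bytes, payload: str) -> str:
    """HMAC-SHA256 signature of the payload, hex encoded."""
    return hmac.new(secret, payload.encode(), hashlib.sha256).hexdigest()


class Portal:
    """Issues short-lived tokens that delegate a user's access to one org."""

    def __init__(self, secret: bytes) -> None:
        self._secret = secret  # assumed to be shared with member orgs

    def issue_token(self, user: str, org: str, ttl_s: int = 3600,
                    now: Optional[float] = None) -> str:
        start = time.time() if now is None else now
        payload = f"{user}|{org}|{int(start + ttl_s)}"
        return f"{payload}|{_sign(self._secret, payload)}"


def verify_token(token: str, secret: bytes, org: str,
                 now: Optional[float] = None) -> Optional[str]:
    """Return the user name if the token is valid for `org`, else None."""
    parts = token.split("|")
    if len(parts) != 4:
        return None
    user, token_org, expires, sig = parts
    expected = _sign(secret, f"{user}|{token_org}|{expires}")
    if not hmac.compare_digest(sig, expected):
        return None  # forged or altered token
    if token_org != org or not expires.isdigit():
        return None  # token was delegated to a different org
    current = time.time() if now is None else now
    return user if int(expires) >= current else None
```

The `now` parameter exists only to make expiry testable; callers would normally omit it.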
Video and audio interactive component • Benefits • 1. Multi-node access • 2. Plugs into SCC • 3. Security and efficiency
Project 1: Soil Carbon Recycle Mechanism Database • Many participating units • Data distributed across mainland China • More than 1K scientists • [Figure: Soil Carbon Recycle Data Schema]
Project 2: FACE (Free Air CO2 Enrichment) • Find out how rice growth responds as atmospheric CO2 and climate change • 15 collaborating organizations, 9 domestic organizations (3 SKLs), 7 overseas countries, 100 research staff • Investment of 100 million RMB; the only one in China • 30 science topics
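FACE scientists • Federal Agricultural Research Centre (Germany) • National Institute of Agro-Environmental Sciences (Japan) • North Carolina State University • Tohoku National Agricultural Experiment Station (Japan) • U.S. Water Conservation Laboratory • University of Oklahoma • State Key Laboratory of Atmospheric Boundary Layer Physics and Atmospheric Chemistry (Institute of Atmospheric Physics) • State Key Laboratory of Crop Genetics and Germplasm Enhancement (Jilin Agricultural University, Nanjing Agricultural University) • State Key Laboratory of Soil and Sustainable Agriculture (Nanjing Institute of Soil Science) • Shanghai Institute of Plant Physiology and Ecology • Shenyang Agricultural University • Shenyang Institute of Applied Ecology • Yangzhou University • Beijing Education Publishing House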
Contribution to the FACE community • Monitor the farm site by video • Automatically upload the raw data • Real-time display and control panel in the browser
FACE website • Old • Only one function: info publishing • New • 10 services for collaboration • 2 years in operation
Project 3: Neutron diffraction online experiment and data sharing • Remote monitoring in the browser • 1. Protects the staff from radiation • 2. Monitors the experiment process • 3. Brings in interdisciplinary scientists • [Figure: neutron emission facility]
Neutron reactor remote access deployment • Benefits • 1. Staff are isolated from the experiment site • 2. The process is monitored remotely in real time • 3. Scientists discuss online
[Figures: neutron diffraction remote control interface in the browser; data analysis software]
Project 4: Collaboration for BSL3 Labs (http://clb.gucas.ac.cn)
BSL3 setting [Figure labels: pinhole camera, office area]
Project 5: Collaboration for Experiments Centers (http://cec.gucas.ac.cn)
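[Figure: instruments and their workstations] • The PHI 700 Field Emission Scanning Auger Nanoprobe and the PHI 700 workstation • Trio MRI brain imaging system [Beijing MRI Center for Brain Research] and the Trio MRI companion workstation • 800 MHz NMR spectrometer • Get data • Transfer data • Experiment status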
Lessons from SCC development • How can the user’s application be addressed more simply? • System development is less and less about coding and more about using things and gluing them together. • Create an architecture flexible enough to allow for changes. • Customers will not be able to elucidate at the start what they want; only by using the system will they be able to tell you what you should have done. • SCC 1.0 relied on MS SharePoint; SCC 2.0 is open source • Future work should address e-community trust metrics.
Acknowledgements • Dec. 2005 to Dec. 2007, China Biosafety Level 3 Labs Collaborative Cyberspace, grant from MOST of China • Dec. 2005 to Dec. 2007, China Large Scientific Instruments Collaborative Cyberspace, grant from MOST of China • Jul. 2005 to Jul. 2008, EU-Asia Link Programme HPC-Grid Computing Course Model, grant from the EU