Understand the concept of data nature: how natural phenomena and human behavior are stored in computer systems and how that data is used. Learn about the structures, rules, and applications that Dataology studies in order to explore real-world phenomena.
Introduction to Dataology Yangyong Zhu 05/07/2009
Outline 1. What is Data Nature 2. What is Dataology 3. What are the Differences 4. The Framework of Dataology 5. Applications of Dataology 6. Challenges
Nature (Real Nature) Nature, in the broadest sense, is equivalent to the natural world, physical world or material world. "Nature" refers to the phenomena of the physical world, and also to life in general.
Data Nature The phenomena of nature are stored in computer systems • GIS • Digital Earth Plan • Human Genome Project • …
Data Nature Human behaviors are stored in computer systems • Communication (telephone, mobile phone, email, MSN, …) • Behavior patterns (credit card usage) • Activities (government, enterprises) • …
Data Nature Using computers is a process of producing data. • Web data • Biological data • Public data • Private data • Data may (or may not) be managed by a DBMS [Diagram labels: Data Tribe, Data Country, Data Zone, Data Nature]
Data Nature The Second Life [Figure labels: Real nature, Data nature]
Data Nature The Second Life NASA
Data Nature The Second Life IBM
Data Nature Data in computer systems exhibit all the characteristics of real nature. • Diversity (on the Internet / off the Internet, public / private, audio / video) • Complexity (involves various languages; all trades and professions; spatial, oceanic, DNA data, etc.) • Out of control (impossible for humans to control) • Unknown (humans do not understand it)
Dataology Dataology comprises the theories, methods and technologies for studying data nature. • To study the structure of datasets in data nature • To acquire usable data from data nature • To prove the rules of data nature by theoretical methods • To discover the rules of data nature by experimental methods • To develop and utilize the data resources in data nature
Dataology • Existing research in Dataology: • Data Collection and Data Integration • Data Mining • Data Reasoning (AI) • Data Security • Data Visualization • Continuously developing …
Dataology • Prospective research in Dataology: • Data Experiment • Data Camouflage / Data Perception • Data Taxonomy • Data Aware • Dataology for Specific Domains
Dataology • Data Experiment: • Discover features and rules from a dataset. • Focus on the randomness of methods and the unpredictability of results. • Different from data mining
Dataology • Data Camouflage / Data Perception: • To camouflage private data that are exposed in public • To perceive camouflaged data • Different from data security (privacy protection, privacy mining)
Dataology • Data Taxonomy: • Classify data to form a pedigree of data and a history of its development. • Similar to the classification of species, histories and cultures • Covers data types, utilities and relationships
Dataology • Data Aware: • To make data visible, sniffable, audible and tangible. • People want to sense data nature the way they sense real nature. • To develop various technologies for Data Aware.
Dataology • Specific Domains: • Universal Dataology • Life Dataology • Behavior Dataology • and so on…
Why not computer science Computer science consists of hardware and software. [Diagram labels: Natural language, Translate, Computer Software, Translate, Machine language, To use, Computer Hardware]
Why not computer science Software research pulls software toward natural language: machine language → assembly language → high-level language → 4GL → Horn clauses → first-order logic → higher-order logic → ??? → natural language. This means modeling natural language, and thereby modeling nature. [Diagram labels: Computer Software, Computer Hardware, pull]
Why not computer science Hardware research improves computing and storage capability. • Computing: supercomputers, grid computing, cloud computing • Storage: GB, TB, PB, … [Diagram: the same language ladder from machine language to natural language, with Computer Software and Computer Hardware]
Why not information science Data: anything stored in computer systems • Data is one symbolic representation of information • Information is the interpretation of data [Diagram labels: Data, Information, Knowledge]
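The distinction between data and information on this slide can be illustrated with a minimal sketch. It is not from the original deck: the function name `interpret` and the four sample bytes are assumptions chosen for illustration. The same stored bytes (the data) yield different information depending on the interpretation applied.

```python
import struct


def interpret(raw: bytes) -> dict:
    """Return three interpretations of the same four stored bytes.

    Hypothetical helper for illustration; not part of the original slides.
    """
    return {
        # Interpretation 1: the bytes are ASCII characters.
        "text": raw.decode("ascii"),
        # Interpretation 2: the bytes are a big-endian unsigned integer.
        "integer": int.from_bytes(raw, byteorder="big"),
        # Interpretation 3: the bytes are a big-endian IEEE 754 single float.
        "float": struct.unpack(">f", raw)[0],
    }


# The data: four bytes as they might sit in a file or in memory.
RAW = b"BH\x00\x00"

views = interpret(RAW)
for name, value in views.items():
    print(f"{name:>8}: {value!r}")
```

The bytes never change; only the interpretation does, which is the sense in which the slide treats information as the interpretation of data.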
The Differences Dataology: • Studies data in computer systems • Theoretically independent of the computer [Diagram labels: Dataology, Computer Science, Real Nature, Data Nature, Data Set, Computer & Network, Modeling, Acquiring, Applications, To explore, develop and utilize, etc.]
The relationship • Real nature: Natural Science (Universal & Life), Social Science (Human Behavior; Society, Laws, Economics, …) • Data nature: Dataology (anything stored in computer systems) [Diagram label: Human Being]
The Framework of Dataology • Applications of Dataology: Life Dataology, Universal Dataology, Behavior Dataology, … (cyclopaedia) • Areas: Data Acquire, Data Aware, Data Analysis, Foundations of Dataology • Components: Data Explore, Data Visualization, Data Experiment, Data Camouflage, Data Sniffable, Data Mining, Data Perception, Data Audiblization, Data Integration, Data Taxonomy, Data Tangiblization, Data Management
Applications of Dataology • Behavior Informatics, Bioinformatics, Social Networks, … • Life Dataology, Universal Dataology, Behavior Dataology, … • They all work on Data Nature
Challenges • What are the foundational theories of Dataology? • Does a unique theory of Data Nature exist? • How can data resources be developed and utilized? • How do we know that what we get from Data Nature is true? • How do human beings survive in Data Nature?
Thanks yyzhu@fudan.edu.cn www.dataology.fudan.edu.cn