130 likes | 437 Views
Data Science. Data?. According to IDC 2.8 zettabytes of data created in 2012....to double by 2015 Three quarters of this is generated by individuals Average American office worker produces 1.8 million megabytes/year...5000 MB/day. Quantity of Data.
E N D
Data? • According to IDC 2.8 zettabytes of data created in 2012....to double by 2015 • Three quarters of this is generated by individuals • Average American office worker produces 1.8 million megabytes/year...5000 MB/day
Source: http://practicalanalytics.files.wordpress.com/2012/10/60seconds.jpg
What is Big Data? • Massive frequently created volume of structured and unstructured data. • Volume • Variety • Velocity
What is “Big”Data Science? • Integration of methods from statistics, computer science, and other fields for gaining insights from data. In practice, data science encompasses an iterative process of data harvesting, cleaning, analysis and visualization, and implementation. • Interdisciplinary and cross-functional field that leads to decisions that move an organization forward, whether the object of the decision is product design, a proposed investment, communal help or business strategy. • Source: Forbes
Data Scientist • February 2010 Kenneth Cukier writes in ”Data, data everywhere: A special report on managing information“: ”… a new kind of professional has emerged, the data scientist, who combines the skills of software programmer, statistician and storyteller/artist to extract the nuggets of gold hidden under mountains of data.” • June 2010Mike Loukides writes in “What is Data Science?”: “Data scientists combine entrepreneurship with patience, the willingness to build data products incrementally, the ability to explore, and the ability to iterate over a solution. They are inherently interdisciplinary. They can tackle all aspects of a problem, from initial data collection and data conditioning to drawing conclusions. They can think outside the box to come up with new ways to view the problem, or to work with very broadly defined problems: ‘here’s a lot of data, what can you make from it?’”
IDC also estimates that less than 1% of world data is analyzed and thus demand for data scientists will significantly increase. McKinsey estimates the United States alone needs about 190,000 data scientists in addition to another 1.5 million data-savvy managers to handle the data and make game-changing decisions. • .