Big Data & Data Science: Information Technology Skills for the 21st Century. Dan D’Urso, owner, Orange Coast Database Associates, Inc.; IT instructor at University of Phoenix.
Big Data & Data Science: Information Technology Skills for the 21st Century
Dan D’Urso
• Owner, Orange Coast Database Associates, Inc.
• IT instructor at University of Phoenix
• 25 years as software manager at Digital Equipment (now Hewlett-Packard); software engineer, compilers and operating systems, at Burroughs Corporation
• MBA, quantitative methods; MS, computer science
IT in Big Data and Data Science
Big data & data science
• Growth opportunities
• Big data
• Data science
• Applications
• Technologies
• Careers
• Skills required
Growth opportunities
• Big data and data science are related fields – data science has been called the marriage of big data and statistics
• "Sexiest job of the 21st century" (Davenport & Patil, Harvard Business Review, 2012)
• By 2018 we will need 1.5 million managers and analysts at least familiar with data operations, along with 140,000 to 190,000 data scientists (Manyika et al., 2011)
• Data scientist will be the 3rd highest paying occupation (Lynda.com, 2016)
What is big data?
• Huge volumes
• Extreme velocity
• Wide variety
Big data volume
Big data velocity
Big data variety
• New data sources are making the data analysis job more complex
Data science
• Uses big data from databases and social media
• Applies statistical and visualization tools
• Current uses include predictive analytics and machine learning
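To make "predictive analytics" concrete, here is a minimal sketch of one of the simplest statistical tools, a least-squares line fit, written in plain Python. The function name `fit_line` and the ad-spend and sales figures are made up for illustration and do not come from the slides; real projects would normally use a statistics library and far more data.

```python
def fit_line(xs, ys):
    """Fit y = intercept + slope * x by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance of x and y, and variance of x (both unnormalized).
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical history: advertising spend vs. resulting sales.
ad_spend = [1.0, 2.0, 3.0, 4.0]
sales = [3.0, 5.0, 7.0, 9.0]

slope, intercept = fit_line(ad_spend, sales)

# "Prediction": estimate sales for a spend level not yet observed.
forecast = intercept + slope * 5.0
print(f"slope={slope:.2f} intercept={intercept:.2f} forecast={forecast:.2f}")
```

The same pattern, learn from past data and then estimate an unseen case, underlies the more elaborate machine learning methods mentioned on the slide.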
Applications
• Electronic health records, patient monitoring
• Real-time product offers
• Predictive analytics using social media data (cross-selling, market prices)
• Fraud detection
• Online ad placement
• Manufacturing process improvements
• Analysis of scientific data: astronomy, physics, biology (genomics)
Base technologies
• Unix/Linux
• Hadoop/Spark – parallel processing using hundreds of computers
• Java/Scala – main programming languages for the above
• Python – Swiss Army knife programming language
• R – statistics language
• Analytical tools – Excel, Tableau, BI tools, SAS/SPSS, etc.
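The split-then-merge idea behind Hadoop-style parallel processing can be sketched on a single machine. The example below is an illustration only and uses no Hadoop or Spark API: the names `map_chunk` and `word_count` and the sample lines are hypothetical, and Python threads stand in for the separate computers a real cluster would use.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def map_chunk(lines):
    """Map step: count the words in one chunk of the data."""
    counts = Counter()
    for line in lines:
        counts.update(line.lower().split())
    return counts


def word_count(lines, workers=4):
    """Split the data into chunks, count each chunk concurrently, then merge."""
    size = max(1, len(lines) // workers)
    chunks = [lines[i:i + size] for i in range(0, len(lines), size)]
    # Threads stand in for cluster nodes; this shows the pattern, not real speedup.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partial_counts = list(pool.map(map_chunk, chunks))
    # Reduce step: merge the per-chunk counts into one result.
    total = Counter()
    for partial in partial_counts:
        total.update(partial)
    return total


sample = [
    "big data and data science",
    "data science uses big data",
    "big data velocity",
]
totals = word_count(sample, workers=2)
print(totals.most_common(3))
```

On a cluster the chunks would live on different machines and the merge would happen over the network, but the division of work follows the same shape.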
Careers
Back end
• Systems administrator (Unix/Linux)
• Database administrator
• Network administrator
Front end
• Data scientist
• Statistician
• Data analyst
• Business analyst
• Business intelligence analyst
Industries
• Potential varies by industry
• Most will make increasing use of big data and data scientists
• Finance and IT are key opportunities
IT technology skills
Data engineering
• Unix/Linux
• Hadoop/Spark
• Databases (esp. NoSQL)
• Java
• Python
Data analysis/data science
• Database querying/BI tools
• Python
• R (leading statistics language)
• Yes, Excel, Tableau, etc.
• Unix/Linux command line may be helpful
Required skill set
References
• Davenport, T., & Patil, D. (2012, October). Data Scientist: The Sexiest Job of the 21st Century. Harvard Business Review. Retrieved from https://hbr.org/2012/10/data-scientist-the-sexiest-job-of-the-21st-century
• Davis, P. (2013, July 23). McKinsey Report Highlights the Impending Data Scientist Shortage. Pivotal. Retrieved from https://blog.pivotal.io/data-science-pivotal/news/mckinsey-report-highlights-the-impending-data-scientist-shortage
• Hurwitz, J., Nugent, A., Halper, F., & Kaufman, M. (2013). Big Data for Dummies. Hoboken, NJ: John Wiley & Sons, Inc.
• King, J. (2014, December 4). 2014 Data Science Salary Survey. Radar. Retrieved from http://radar.oreilly.com/2014/12/2014-data-science-salary-survey.html
• Laney, D. (2001, February 6). 3D Data Management: Controlling Data Volume, Velocity and Variety. Meta Group. Retrieved from http://blogs.gartner.com/doug-laney/files/2012/01/ad949-3D-Data-Management-Controlling-Data-Volume-Velocity-and-Variety.pdf
• Lynda.com. (2016). Introduction to Data Science. Retrieved from https://www.lynda.com/Big-Data-tutorials/Introduction-Data-Science/420305-2.html
• Manyika, J., Chui, M., Brown, B., Bughin, J., Dobbs, R., Roxburgh, C., & Byers, A. (2011, May). Big Data: The Next Frontier for Innovation, Competition and Productivity. McKinsey Global Institute. Retrieved from http://www.mckinsey.com/business-functions/digital-mckinsey/our-insights/big-data-the-next-frontier-for-innovation
• Provost, F., & Fawcett, T. (2013). Data Science for Business. Sebastopol, CA: O’Reilly Media, Inc.