480 likes | 643 Views
Cloud Computing Overview: Big Data and Business Analytics Hsinchun Chen University of Arizona. Interesting Questions Cloud Computing Applications Big Data Analytics Business Models ( CIA ). Cloud Computing Applications: Overview and Examples. IQ: How Amazon makes its money?.
E N D
Cloud Computing Overview: Big Data and Business AnalyticsHsinchun ChenUniversity of Arizona
Interesting QuestionsCloud Computing ApplicationsBig Data AnalyticsBusiness Models (CIA)
Cloud computing: applications, system software, and hardware delivered as services over the Internet. • Service oriented architecture + virtualization + utility computing • Software as a Service (SaaS), Infrastructure as a Service (IaaS), Platform as a Service (PaaS) • From web services to cloud computing applications • Moving towards cloud applications and cloud business models, e.g., SaleForce.com, Apple iTune, Amazon Cloud Computing Overview
Major Could Computing Platforms • Amazon Elastic Compute Cloud (EC2):LAMP (Linux, Apache, mySQL, and PHP) stack • Google App Engine:Java and Python runtime, Java Persistence API (JPA), Google Bigtable, File systems; Hadoop, MapReduce • Windows Azure: .Net, MS SQL, SharePoint
E-Commerce: B2C, life style & entertainment, global supply-chain, banking, telecommunications, IT hosting, business intelligence and analytics • E-Government: government data sources, services • E-Education: online education content delivery • E-Security: cybersecurity, intelligence • E-Health: healthcare big data, healthcare 2.0; genomics + EHR Emerging Applications
National Electronic Health Record Data Bank, Singapore: MOH + Accenture, August 2010; healthcare management, quality and performance management, EHR information aggregation, patient self management, decision support • E-Health, E-Health Cloud, England: Chelsea Westminster Hospital + Flexiant, July 2011, patient EHR access • CareStream Cloud, US: Carestream Health (Onex + Kodak), 2009; health imaging sharing, 1B medical images, health cloud SaaS vendor • Taiwan Smart Health Cloud, NTU & NCKU (Sources: NTU Health Cloud proposal) Selected Health Cloud Initiatives
IQ: What’s the difference between 2005 and 2012 for web computing?
Web Computing and Mining • Emerging web applications business models • Web services, APIs, mashups cloud & mobile computing • Business analytics Data, text and web mining
50 Projects, 2005-2012(“Business Web Mining Using Amazon, Google, eBay, and Google”) • E-commerce and e-Services: iRelocateRealTomatoesSmallBHHobbyCentralNewPlaceSeek College AdvisorFriendly GifterClipperGottaCouchSkiStopvTrack Barter BayLink-USSmart Gift CardTimely BidTucson Gamer CaféTV and More DeliverablesCellphone Intelligent AuctioningTucson Book ExchangeSciBubbleWish SkyGiftChannelPriceSmartWetYourWhistle • Life Style and Entertainment: BetSmartXTREME F1MLB100YardsCricWeb iBollywoodSa Ri Ga MaWOWBollywoodFunzicHinduShrines IndiapaaruNachBaliyeMovie Location QuestRemakesSugarSuite MusicBoxArtist ConnectionConcertoStar Search • Government and Education: RepCheckSmallNGreenCarsChange of BaseiDogTasty ParkiSupport
By Kumar Vakeel, Kunal Jain, Neeraj Munshi; MS MIS, Spring 2010 • One-stop portal for green cars information and resources • Unique Concept • Global customers • Youtube vehicle videos • Flickr vehicle photos • Google Maps and Local Search • Google visualization • RSS feeds of global vehicle news • Facebook recommendation from friends • Yahoo Finance for currency exchange • Google Translate for web pages • Recommendation System • Fuel Efficiency Challenge SmallNGreenCars
Sarigama.com latest news and RSS Feeds • Artist information • Transliteration • Music play and video • Shopping • Lessons and Library • Concert locator • Forums • Interactive Features • Tag Clouds • Lyrics Recommender system Sa Ri Ga Ma • Mahalakshmi Sundararajan, Pavithra Ravi, Sahana Nagaraja; Spring 2010 • Carnatic Music: One of the two main genres of Indian classical music; Mostly performed vocally • Sarigama.com: one stop information portal for carnatic music
Web Services, Cloud Computing, and Mobile Web, 2012 (Web 3.0)
25 Projects, 2012Cloud and Mobile Computing • E-commerce and e-Services: GamerzLykMeMobileAppPortalGemstonesPersonalInvestment iScreamiRace SeeMeSocialAZRegionTrendHelpMeAZ • Health & Life Style: EatRightOrganiCookRoadTripXtravelWreckDiversVoiceOfNatureHealthMiners HelpAsthmaDiabeatUSHikeAdayYogaWorldBikersParadiseYogaWorldBikersParadise
OrganiCook • By Zilong Chang, Mengwen Cheng, Yajie Wang, and Haiqing Wu, Spring 2012 • One-stop portal for healthy foods • Organic food supplier location • Different health concerned recipe catalogs • Integrate healthy content with social media • Text mining for cookware recommendation • Mark allergens among ingredients • Provide health news • Advertisement • Unique recommendation system • Amazon EC2 Cloud server • Intetergrate Mahout with Hadoop
OrganiCook User Cloud Application Server Apache Tomcat J2EE REST API Browser Internet Connection Amazon EC2 Mahout Taste Data Mining JavaScript API MySQL 5.5 API Servers Database server
EatRight • By Jim Marquardson, Justin William, Dave Wilson, and Mark Grimes, Spring, 2012 • Health & nutrition mobile site • True SoLoMo (Web 3.0) • Nutrition based meal shopping • Capturing user preferences: “Eat This” button • Directed search advertising rates • Targeted ads based on nutrition preferences and location • EatRight API • Twitter Sentiment • PCI Compliant Credit Card Processing • Amazon EC2 Cloud • Android Mobile App (iOS too!)
The Data Deluge (Big Data) • The Economists, March 2010 • LOC total book collection 15 TBs • Google processes 10 PBs per day • Internet traffic 667 Exabytes by 2013, Cisco • Total amount of world information in 2010, 1.2 Zettabyte • KB-MB-GB-TB-PB-EB-ZB-Yottabyte • E-Commerce, Government, Health, Security applications: many with TB/PB of valuable content from customers, citizens, patients, etc.
BI & Analytics: The Market • $3B BI revenue in 2009 (Gartner, 2006); $9.4B BI software M&A spending in 2010 and $14.1B by 2014 (Forrester) • IBM spent $14B in BI in five years; $9B BI revenue in 2010 (USA Today, November 2010); 24 acquisitions, 10,000 BI software developers, 8,000 BI consultants, 200 BI mathematicians Acquired i2/COPLINK in 2011
BI & Analytics: Definition and Components • BI and Analytics refers to: (1) the technologies, systems, practices and applications that (2) analyze critical business data to (3) help an enterprise better understand its business and market.” • Core technologies: data warehousing, Extraction, Transformation, and Load (ETL); Business Performance Management (BPM), visual dashboards; data and text mining, social network analysis • BI 2.0 & 3.0 research: web analytics, web 2.0; in-memory and real-time BI; web 3.0, cloud computing, Hadoop, MapReduce; mobile computing, stream data mining
Big Data Analytics Research at UA/AI Lab • Applications/problems: digital libraries, search engines, biomedical informatics, healthcare data mining, security informatics, business intelligence • Approaches: web collection/spidering, databases, data warehousing, data mining, text mining, web mining, statistical NLP, ontologies, social media analytics, interface design, information visualization, economic modeling, assessment • Structure: federal funding, director, affiliated faculty, post-docs, Ph.D./MS/BS students commercialization • Major phases: DLI COPLINK Dark Web DiabeticLink
CIA in the Global IT Landscape • Central Intelligence Agency; Culinary Institute of America • Chinese: math/science, team player, IT/hardware/web, China market (China) • Indians: math/science, entrepreneurial spirit, English • Americans: English, entrepreneurial spirit, IT/software, business development, market (US), VC access ($)
My COPLINK Experience • Taiwan/US Training: NCTU (math) SUNY Buffalo (MBA) NYU (AI) U of Arizona (top 3) • AI Lab: Digital Library COLINK Dark Web DiabeticLink • COPLINK federal funding ($4M), NSF/NIJ, 1997-2002 • COPLINK commercialization ($4.6M), angels/VCs (Taiwan, CA, AZ), 2000 & 2003 • Customer sales ($30M), 4,500 agencies, 120 FTEs, 2000-2011 • M&A Exit, Silverlake/i2/IBM acquisition, 2009 (i2), 2011 (IBM); $500M valuation
COPLINK Identity Resolution and Criminal Network Analysis (DHS) • Funding: NSF, DOJ, DHS ($4M), VCs ($4.6M); Digital Government • Publications: ACM TOIS, CACM, IEEE TKDE, IEEE IS, JASIST, DSS • Impact: 3500 agencies, 25 NATO countries, 1M users public safety
The New York Times, November 2, 2002 COPLINK assisted in DC sniper investigation ABC News April 15, 2003 Google for Cops: Coplink software helps police search for cyber clues to bust criminals Newsweek Magazine, March 3, 2003 A computerized way for police to coordinate crime databases Washington Post, March 6, 2008, COPLINK in use in 3,500 police agencies in US! COPLINK acquired by i2 (Silver Lake) in 2009; i2/COPLINK acquired by IBM in 2011 for $500M
IT Business Models: Some Thoughts • Startup Phase: business ideas (product and market), team (founders & mentors), share structure (shares, directors, options; legal/CPA), business plan (short plan, good introduction), funding (government, angels, VCs, family) Year 0, 1-3 founders, $250K funding (IT/cloud) • Early Phase: first product, product positioning, team building, initial sales Years 1-3, $500K sales • Growth Phase: products plan, strong sales team, sustainable revenues, unique IPs (SW, content), loyal customers Years 3-8, $10M sales • Exit Phase: IPO or M&A (partners), when ($20M+), next venture Taking risks!
Pain, Sorrow, and Regret • Loss of family time/life (but never money) • Managing university obligations and COI • University bureaucracy, Office of Technology Transfer (OPTT) • Lawyers, accountants are expensive • Chasing angels/VCs (40 frogs 1 prince) • Office, employees, products • Selling products (becoming a vendor) • Burning cash • Bubble burst • Raising second round funding when you are down ($2M) • Board room yelling matches • University accusations • Losing control and shares • Anti-dilution clause (losing $60M for the $2M you never used)
hchen@eller.Arizona.edu http://ai.Arizona.edu