GIS and Big Data: Theory and Best Practice Case Studies. Dr. Dave Schrader Director – Strategy and Marketing, Teradata October 2012 – University of Redlands. Who is Teradata? What is Teradata’s Strategy? How do big data and geospatial fit? Teradata. Founded 1979, first shipment 1984
GIS and Big Data: Theory and Best Practice Case Studies Dr. Dave Schrader Director – Strategy and Marketing, Teradata October 2012 – University of Redlands
Who is Teradata? What is Teradata’s Strategy? How do big data and geospatial fit?
Teradata
• Founded 1979, first shipment 1984
• $2.4B a year in revenues, growing 22%
• Leading vendor of Enterprise-sized Data Warehouses (HW, SW, PS)
• Engineering HQ is in Rancho Bernardo
• We sell to the Global 3000, blue chip customer base
• Well-known to all database experts
• Moving from “back office” to “frontline” (Active), increasing # of data types
The Teradata Story – History of Big Data
• 1983: Teradata ships 1st system to Wells Fargo
• Jan 1992: Walmart passes 1TB
• Jan 2006: WMT loads 1B rows/day, 1 hr latency
• June 2012: eBay loads 1TB/minute
• More than 25 customers with >25,000 Terabytes at their fingertips
What Data is Driving Growth? … The W’s
More detailed data comes from:
• Detailed Customer Behavioral Data
  • “Where” in all industries: mobile and geospatial
  • “What and When” granularity – e.g., browsing on web, including non-clicks and non-transactions
  • Telco: all the detail behind each phone call (BSS, OSS): location
  • Social networking data – tweets, blogs
• Detailed Operations Data
  • “How” – Process data
  • Network congestion, goal planning
  • Transportation optimizations in real-time
  • Manufacturing: sensor and test data
Purpose-Built Teradata Platform Family: 560, 1650, 2690, 4600, 66XX
Top Rating by Gartner - DBMS Why the TOP Rating for Data Warehousing? Happy Customers! Superior Technology! Innovative Users!
The Next Generation of Analytics: Trends
Transaction vs. Interaction
• Transaction: Value to the business
• Interaction: EXPERIENCE with the business
Business Intelligence Consumer Intelligence
• Consumer is CEO of the household
• Consumers making intelligent decisions based upon analytics & perfect economic information
Big Data
• Format: Structured & MULTI-STRUCTURED Data
• Type: Web, social, location, device, channel
• VOLUME and VELOCITY
Teradata and its Acquisitions Business Applications • Aprimo Applications • Strategic Partnerships • Teradata Integrated Data Warehouse • Operational BI/Intelligence • Platform Family • Interoperability & Consulting Data Warehousing Big Data Analytics • Aster Data • Extreme Data Appliance • Partnerships
Temptation: Build Analytic Silos, Geospatial Silos OLAP Cubes Data Mining BIG DATA Geospatial Data Warehouse Application Development Agile Analytics
Analytics for Everyone OLAP Cubes Data Mining BIG DATA Geospatial Data Warehouse Application Development Agile Analytics 20-40%+ wasted moving data
Teradata Integrated Analytics
• Application Development: Tools and techniques to accelerate development of analytics
• Temporal: Native temporal support to manage and update time dimension
• Agile Analytics: In-database data labs to accelerate exploration of new data and ideas
• Advanced Analytics: Optimized in-database data mining technology from leading vendors, open source and Teradata
• Geospatial: Native database geospatial data types and analytics
• OLAP Optimization: Built-in multi-dimensional analytics optimization
• Big Data Integration: Analytic platforms and partner tools to analyze unstructured and structured data
• Data Exploration: Visual data exploration to quickly understand and analyze data within the database
[Figure labels: Teradata Integrated Analytics; Teradata Database; Teradata Open Parallel Framework (Custom Services, Embedded Services, Virtual Machines); Teradata Purpose-Built Platform Family]
Native Geospatial Data Types: Spatial Data Integrated with Non-Spatial Data
• Geospatial is a feature that allows us to store, process, consume geospatial data
• Teradata Geospatial based on the ST_Geometry data type
  • SQL/MM Standard
  • Like numeric or string types native to Teradata
• Location is type ST_Geometry
  • Point (x y)
  • Line or curve (xy, xy, xy)
  • Polygon (xy, xy, xy, xy..)
[Figure labels: polygon, line, point]
Geocoded Customer Table Example:
Teradata Geospatial Spatial Methods – sample
High Speed Big Data Analytics
• Measurements: ST_Area, ST_Distance, ST_SphericalDistance, ST_SpheroidalDistance, ST_Perimeter, ST_Length
• Attribute: ST_AsBinary, ST_AsText, ST_CoordDim, ST_Dimension, ST_GeometryType, ST_IsEmpty, ST_IsSimple, ST_IsClosed, ST_NumPoints, ST_SRID, …
• Spatial Operator: ST_Buffer, ST_Intersection, ST_Boundary, ST_Difference, ST_Envelope, ST_ExteriorRing, ST_GeometryN, ST_InteriorRingN, ST_Transform
• Spatial Relationships: ST_Intersects, ST_Overlaps, ST_Relate, ST_Touches, ST_Within, ST_Contains, ST_Disjoint, ST_Crosses, ST_Equals
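ST_AsText in the attribute group returns a geometry in well-known text (WKT) form. As a rough illustration of what that text looks like for points, lines, and polygons, here is a minimal Python sketch. The helper names (point_wkt, linestring_wkt, polygon_wkt) are hypothetical and are not part of any Teradata API; they only format coordinates.

```python
def point_wkt(x, y):
    """Format a single coordinate pair as a WKT point."""
    return f"POINT ({x} {y})"


def linestring_wkt(coords):
    """Format a sequence of (x, y) pairs as a WKT linestring."""
    return "LINESTRING (" + ", ".join(f"{x} {y}" for x, y in coords) + ")"


def polygon_wkt(ring):
    """Format one outer ring as a WKT polygon.

    WKT polygon rings are closed, so the first vertex is repeated
    at the end if the caller did not already do so.
    """
    ring = list(ring)
    if ring[0] != ring[-1]:
        ring.append(ring[0])
    return "POLYGON ((" + ", ".join(f"{x} {y}" for x, y in ring) + "))"
```

For example, polygon_wkt([(0, 0), (1, 0), (1, 1)]) closes the ring and yields a four-vertex polygon string.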
Geospatial Queries: Answering ‘Where’
• ST_Geometry functions…
  • Measurements
    • Distance, surface, perimeter…
  • Relationship between two objects
    • Intersect, contains, within, adjacent…
• Simplified Example - find top 100 customers by value within the store area boundaries and their distance from the store:
    SELECT TOP 100 C.name, C.address, C.value,
           C.location.ST_Distance(S.location) AS Distance
    FROM cities C, stores S, store_area SA
    WHERE S.id = 1
      AND S.id = SA.id
      AND C.location.ST_Within(SA.area)
    ORDER BY 3 DESC;
[Figure labels: Distance, Store Area, Retail Outlet, Customer, Mail Campaign Targets, Competitor outlet]
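To make the logic of the example query concrete outside the database, here is a minimal Python sketch of the same three steps: keep customers whose location falls within the store area, compute their distance to the store, and return the top rows by value. It is an illustration under assumptions, not Teradata's implementation: coordinates are treated as planar, the ray-casting test stands in for ST_Within, points exactly on the boundary are not handled specially, and the function and field names are hypothetical.

```python
import math


def point_in_polygon(point, polygon):
    """Ray-casting test: True if point lies inside polygon.

    polygon is a list of (x, y) vertices; the ring is closed implicitly.
    """
    x, y = point
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        # Does this edge straddle the horizontal line through the point?
        if (y1 > y) != (y2 > y):
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def top_customers_in_area(customers, store_location, store_area, limit=100):
    """Return (name, value, distance) rows for customers inside store_area,
    ordered by value descending and cut off at limit rows."""
    rows = [
        (c["name"], c["value"], math.dist(c["location"], store_location))
        for c in customers
        if point_in_polygon(c["location"], store_area)
    ]
    rows.sort(key=lambda row: row[1], reverse=True)
    return rows[:limit]
```

The sort key mirrors ORDER BY 3 DESC in the query, which orders by the customer value column rather than by distance.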
Telco – Retail: Accelerates Analytics with Teradata
Find the 3 closest stores within 50 miles of each customer location.
• Over 30 million customers
• Over 2,200 stores
• Target customers changing frequently
[Figure labels: Store, Store]
Manual Geospatial Analytics
• Calculate distance between each store and customer
• Calculations based on complex trigonometric functions
• Over 65 billion calculations
• Filter results <= 50 miles
• Retained 1 billion results
In-database Geospatial Analytics
• Teradata Geospatial functions
• Set a 50 mile buffer (filter) for stores
• Identify customers within the buffer
• Calculate spherical distance for those customers
• 25 times faster
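The calculation count follows from the sizes given: 30 million customers times 2,200 stores is 66 billion store-customer pairs, which matches "over 65 billion". The filter-first idea can be sketched in Python as follows. This is only an illustration under assumptions, not the Teradata implementation: a latitude/longitude bounding box stands in for the 50 mile buffer, the haversine formula stands in for the spherical distance calculation, the Earth radius is a mean value, and stores are assumed to lie away from the poles and the 180° meridian. All function names are hypothetical.

```python
import math

EARTH_RADIUS_MILES = 3958.8  # mean Earth radius, an approximation


def spherical_distance_miles(lat1, lon1, lat2, lon2):
    """Great-circle distance between two lat/lon points (haversine formula)."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2
    return 2 * EARTH_RADIUS_MILES * math.asin(math.sqrt(a))


def in_bounding_box(lat, lon, store_lat, store_lon, radius_miles):
    """Cheap prefilter: is the point inside a lat/lon box enclosing the
    circle of radius_miles around the store?"""
    r = radius_miles / EARTH_RADIUS_MILES  # angular radius in radians
    dlat = math.degrees(r)
    ratio = min(1.0, math.sin(r) / math.cos(math.radians(store_lat)))
    dlon = math.degrees(math.asin(ratio))
    return abs(lat - store_lat) <= dlat and abs(lon - store_lon) <= dlon


def closest_stores(customer, stores, radius_miles=50.0, k=3):
    """Return ids of the k closest stores within radius_miles of customer.

    customer is a (lat, lon) pair; stores maps store id to (lat, lon).
    """
    lat, lon = customer
    candidates = []
    for store_id, (slat, slon) in stores.items():
        # Skip the trigonometry for stores that cannot be in range.
        if not in_bounding_box(lat, lon, slat, slon, radius_miles):
            continue
        d = spherical_distance_miles(lat, lon, slat, slon)
        if d <= radius_miles:
            candidates.append((d, store_id))
    return [store_id for _, store_id in sorted(candidates)[:k]]
```

The design point is that the box comparison uses only subtractions and comparisons, so the more expensive distance formula runs only for pairs that survive the filter.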
Teradata Geospatial Analytics
[Figure labels: STRUCTURED DATA SOURCES, STRUCTURED BAR, INTEGRATED DATA WAREHOUSE, DATA LAB, Geospatial Server, BUSINESS USER]
• Integrated spatial and non-spatial data
• High speed processing of big data
• Innovation simplified via Data Labs
• Proven by industry leaders
Big Data - provides enormous insight… …keyword use… …personal profiles… Customer behavior, calling/browsing habits, their social network… …sensor data and metrics… …location, travel destinations… …Opportunity to move beyond traditional analytics!
A major Telco uses real-time analytics to find remedies for dropped mobile phone calls