Learn how to unlock the full potential of enterprise data by combining quantitative and qualitative data, and using geographic information to add structure to unstructured data. Discover the challenges of information discovery and the benefits of validated enterprise data. Experience the power of MetaCarta's Geographic Text Search to find relevant information in a geographically relevant framework.
Unlocking Enterprise Data: Looking Beyond the Numbers Claudine Bianchi, VP Marketing www.metacarta.com
Buying a car… • What if all you knew about a car was its tires?
Unlocking Enterprise Data • Quantitative data outlines the past • Qualitative data fills in the picture • Together they indicate the future
Quantitative data • Banking Data • Location, deposits, historical growth rates, and 3-year deposit statistics • Chain Store Data • Name, address, selling square feet, annual sales per square foot, parent company name, and number of parent stores • Consumer Data • Buying behaviors, product ownership, shopping activities • Crime Data • Indexes for assault, burglary, larceny, motor vehicle theft, murder, personal crime, property crime, rape, and robbery • Household Data • Traffic Data
Another data type: qualitative • Textual information (articles and reports) that provides context • The experience of competitors • Local events that may impact economic health • Consumer trends • Retail innovations
The Problem? Accessing the right information in a relevant framework
Challenges of Information Discovery • 35%-50% of information is NOT found by typical search engines • 70% of us are visual learners, but text-based displays dominate • Knowledge workers spend 30% of their time: • Searching for non-existent documents • Failing to find existing information • Recreating information that can’t be found
Validated Enterprise Data • 85% of all data stored is unstructured • validated by ITC at 2003 workshop • 80% of business is conducted on unstructured data • 60% annual growth rate for unstructured data • confirmed by ITC at 2003 DIM workshop • 80% of unstructured data has some geographic reference • validated by actual Fortune 500 customer MetaCarta deployment [Chart: number of documents, 2005 to 2008]
Percentage of Georelevant Documents, per Document Collection, in Internet Test (MetaCarta Internet Testing Results)
Summary: # of documents 1,914,443 • % georelevant 74.05
#docs    %geo
50672    95.69
12245    89.43
17709    90.01
105944   59.1
91766    65.18
97829    43.94
54693    59.17
26447    51.85
20964    69.19
29693    81.43
42480    64.32
34498    64.72
37020    50.47
30558    60.21
19611    58.35
61570    81.54
33129    86.18
47936    64.78
34476    62.10
73771    52.65
59755    64.03
47917    74.94
48532    73.08
40696    71.43
29011    68.09
65856    87.87
77641    83.53
85610    89.41
75934    87.28
65848    86.38
81774    76.99
10320    74.88
81938    93.74
77719    92.01
65276    88.15
67563    87.19
10042    74.66
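The summary row can be cross-checked against the per-collection figures. The sketch below assumes the summary percentage is a document-weighted average of the per-collection percentages (the slide does not say how it was computed); the function name `summarize` is ours, and the data are copied from the slide.

```python
# (documents, percent georelevant) per collection, copied from the slide.
COLLECTIONS = [
    (50672, 95.69), (12245, 89.43), (17709, 90.01), (105944, 59.1),
    (91766, 65.18), (97829, 43.94), (54693, 59.17), (26447, 51.85),
    (20964, 69.19), (29693, 81.43), (42480, 64.32), (34498, 64.72),
    (37020, 50.47), (30558, 60.21), (19611, 58.35), (61570, 81.54),
    (33129, 86.18), (47936, 64.78), (34476, 62.10), (73771, 52.65),
    (59755, 64.03), (47917, 74.94), (48532, 73.08), (40696, 71.43),
    (29011, 68.09), (65856, 87.87), (77641, 83.53), (85610, 89.41),
    (75934, 87.28), (65848, 86.38), (81774, 76.99), (10320, 74.88),
    (81938, 93.74), (77719, 92.01), (65276, 88.15), (67563, 87.19),
    (10042, 74.66),
]


def summarize(collections):
    """Return (total documents, document-weighted percent georelevant)."""
    total_docs = sum(docs for docs, _ in collections)
    # Assumption: weight each collection's percentage by its document count.
    weighted_pct = sum(docs * pct for docs, pct in collections) / total_docs
    return total_docs, weighted_pct


total_docs, pct_geo = summarize(COLLECTIONS)
print(f"{total_docs:,} documents, {pct_geo:.2f}% georelevant")
```

Under that weighting assumption, the recomputed totals can be compared directly with the summary row on the slide.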
Keyword Search Failing… Generic Text Search Technology Is Overwhelmed • Thousands of “best hits” are still too many to read • Filtering on words alone is insufficient • Time-consuming to review • Never get a negative answer [Chart: Value to the User vs. number of documents, 2000 to 2010; series: Geographic Text Search, Keywords Alone]
What to do? • Give structure to unstructured information using geography as the constant.
MetaCarta Geographic Text Search • Keyword search + map as a filter (map agnostic) • Search results = subset of the document collection (drawn from all documents)
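The idea of combining a keyword search with a map filter can be sketched as follows. This is a minimal illustration, not MetaCarta's implementation: the `Document` and `BoundingBox` types, the `geo_text_search` function, and the sample data are all hypothetical, and the sketch assumes each document already carries locations extracted from its text.

```python
from dataclasses import dataclass, field


@dataclass
class Document:
    title: str
    text: str
    # Hypothetical: (latitude, longitude) pairs in decimal degrees,
    # assumed to have been extracted from the text beforehand.
    locations: list = field(default_factory=list)


@dataclass
class BoundingBox:
    """The visible map area. Does not handle boxes crossing the antimeridian."""
    south: float
    west: float
    north: float
    east: float

    def contains(self, lat, lon):
        return self.south <= lat <= self.north and self.west <= lon <= self.east


def geo_text_search(docs, keyword, box):
    """Keep documents that mention the keyword AND have a location in the box."""
    keyword = keyword.lower()
    return [
        d for d in docs
        if keyword in d.text.lower()
        and any(box.contains(lat, lon) for lat, lon in d.locations)
    ]


docs = [
    Document("Store opening", "A new retail store opens downtown.", [(39.9, -75.2)]),
    Document("Retail report", "National retail trends.", []),
    Document("Traffic study", "Traffic counts near the bridge.", [(39.95, -75.17)]),
]
box = BoundingBox(south=39.5, west=-76.0, north=40.5, east=-74.5)
hits = geo_text_search(docs, "retail", box)
print([d.title for d in hits])
```

The result is a subset of the collection: the second document matches the keyword but has no location on the map, and the third is on the map but does not match the keyword.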
MetaCarta Geoparsing • Source text: Bob London was spotted 5 miles north of Tyson Corner. He stopped to make a phone call to (302) 555-7809, ate at a Chicago-style pizza place, and parked at the intersection of Jordan Terrace and Atlantic Ave. He proceeded on foot to 94 Manhattan Street in Media, PA. • Callouts (in slide order): 38° 18' N 77° 28' W, 5 miles north of Tyson Corner • 42° 22' N 71° 2' W, (302) 555-7809 • 39° 53' N 75° 14' W, the intersection of Jordan Terrace and Atlantic Ave. • 39° 53' N 75° 15' W, 94 Manhattan Street in Media, PA
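Coordinates written in the degrees-and-minutes form shown on this slide can be converted to decimal degrees for plotting on a map. The sketch below covers only that conversion step; it is not the geoparser itself, and the names `DMS_PAIR`, `to_decimal`, and `parse_dms_pairs` are ours.

```python
import re

# Matches pairs such as "38° 18' N 77° 28' W" (degrees and minutes only).
DMS_PAIR = re.compile(
    r"(\d{1,3})°\s*(\d{1,2})'\s*([NS])\s+(\d{1,3})°\s*(\d{1,2})'\s*([EW])"
)


def to_decimal(degrees, minutes, hemisphere):
    """Convert degrees and minutes to signed decimal degrees."""
    value = int(degrees) + int(minutes) / 60
    # South latitudes and west longitudes are negative by convention.
    return -value if hemisphere in "SW" else value


def parse_dms_pairs(text):
    """Return a list of (latitude, longitude) tuples found in the text."""
    return [
        (to_decimal(m[1], m[2], m[3]), to_decimal(m[4], m[5], m[6]))
        for m in DMS_PAIR.finditer(text)
    ]


coords = parse_dms_pairs("38° 18' N 77° 28' W and 42° 22' N 71° 2' W")
for lat, lon in coords:
    print(f"{lat:.4f}, {lon:.4f}")
```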
Disambiguation • “The macho energy of Harrison Ford” • ‘The’: ridge in New Caledonia • ‘Macho’: town in Honduras • ‘Energy’: town in Williamson County, Illinois • ‘Of’: town in Turkey • ‘Harrison Ford’: Crawford County, Missouri
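The need for disambiguation can be illustrated with a toy gazetteer lookup. The sketch below is an assumption-laden illustration, not MetaCarta's method: the gazetteer holds only the place names from this slide, and the cue-word rule in `cued_matches` is a deliberately crude, hypothetical heuristic standing in for real contextual analysis.

```python
import re

# Toy gazetteer built only from the place names on the slide.
GAZETTEER = {
    "the": "ridge in New Caledonia",
    "macho": "town in Honduras",
    "energy": "town in Williamson County, Illinois",
    "of": "town in Turkey",
    "harrison ford": "Crawford County, Missouri",
}

# Hypothetical cue words that suggest the next word is a place.
SPATIAL_CUES = ("in", "near", "at", "from")


def naive_matches(text):
    """Flag every gazetteer name that appears in the text as whole words."""
    lowered = text.lower()
    return [
        name for name in GAZETTEER
        if re.search(rf"\b{re.escape(name)}\b", lowered)
    ]


def cued_matches(text):
    """Flag a gazetteer name only when a spatial cue word precedes it."""
    lowered = text.lower()
    cues = "|".join(SPATIAL_CUES)
    return [
        name for name in GAZETTEER
        if re.search(rf"\b(?:{cues})\s+{re.escape(name)}\b", lowered)
    ]


phrase = "The macho energy of Harrison Ford"
print(naive_matches(phrase))  # every word is treated as a place
print(cued_matches(phrase))   # no spatial cue, so nothing is flagged
print(cued_matches("The plant in Energy reopened"))
```

A rule this simple still misfires on common phrases such as "in the", which is why real disambiguation needs more context than a single cue word.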
Aggregated Search • Group A Appliance • Group B Appliance • Hosted Premium Content
Summary • 85% of the information out there is unstructured • Information can be anywhere – Internet, shared drives, document repositories – at your desktop, across the country or around the world • Geography can “unify” information – add structure to unstructured data – 80% of unstructured content has geographic relevancy • Users have lots of sources for information but want a single view to find answers • Maps can be used as a visual filter in geographically relevant analysis
Thank You! Contact: cbianchi@metacarta.com