1 / 25

Effective Big Data Governance Strategy

Learn about the impact of data growth on your company and what you can do to manage it effectively. Discover the importance of data governance and its components, and understand the need for a proactive approach. Act now to address this real and urgent problem.

ganguiano
Download Presentation

Effective Big Data Governance Strategy

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Governing Big DataWhat Strategy Works Best Nick Reizen, Vice President of eDiscovery Scott Polus, Director of Forensic Services with Bill Millican, Director of Sales & Operations XACT Data Discovery 1.877.545.XACT www.xactdatadiscovery.com

  2. previous Why We’re Here • What is happening with data growth • How does this impact your company • How does this impact YOU • Will the growth stop • What can we do • What should we do • What must we do • Why 877.545.XACT www.xactdatadiscovery.com

  3. previous What We Hope to Learn • The problem will not fix itself • The problem will persist and increase in velocity as the mass increases • There is a solution • Time is of the essence • What is Governance (different from management) • The Strategy and Its Components 877.545.XACT www.xactdatadiscovery.com

  4. previous Statistics: How About Some • By the year 2020 data will be gaining on us at a rate 4,300 % greater than it is today • Stored Customer Data • 2012: 1.2 ZB (zettabytes) Zettabyte = 1 Trillion GB • 2015: 7.9 ZB • 2020: 35 ZB 877.545.XACT www.xactdatadiscovery.com

  5. previous Statistics: How About Some • From DataVersity: With numbers like these, when Big Data and Unstructured Data walk in the room, everyone else has less oxygen to breathe. • From the Association of Certified Financial Crime Specialists: “However, in June of 2013 it surfaced that copies of the documents had already been moved from China to Las Vegas, and had been reviewed by the same attorneys who feigned ignorance to the judge. 877.545.XACT www.xactdatadiscovery.com

  6. previous Statistics: How About Some • Twitter: 200 million tweets per day = 46MB per second • FB: 640 million users – 50% login daily • FB: collects an average of 15TB of data every day • LinkedIn: 100 million users • Yahoo Hadoop cluster is 82 PB = 40,000 servers • 107 trillion e-mails were sent in 2010 (2014 – ?) • Google has more than 50 billion pages in its index • YouTube has 3 billion visitors per day with 48 hours of video uploaded per minute (May 2010 // 2014 = ?) 877.545.XACT www.xactdatadiscovery.com

  7. previous Statistics: How About Some • Global Mobile Data Traffic Forecast by Region (Cisco) What we want to notice is not just the growth, but the compound nature of the growth 877.545.XACT www.xactdatadiscovery.com

  8. previous More Statistics • Amazon’s S3 Cloud Service had some 262 billion objects at the end of 2010. (2012 = 417 B) with about 315,000 requests per second. What about 2014? • 571 new websites are created every minute of the day • The amount of data in the world today equates to: • Every US person tweeting 3 tweets per minute for 26,976 years • Every person in the world having more than 215 million high-resolution MRI scans A DAY • More than 200 billion HD movies which would take a person 47 million years to watch 877.545.XACT www.xactdatadiscovery.com

  9. previous Statistics: How About Some Multiples of bytesSI decimal prefixesBinaryusageIEC binary prefixes(Symbol) Value Name (Symbol) Value Name (KiB)210 kibibyte (kB)103210 kilobyte (MiB)220mebibyte (MB)10622 megabyte (GiB)230 gibibyte (GB)109230 gigabyte (TiB)240 tebibyte (TB)1012240 terabyte (PiBH) 1012pebibyte (PB)1015250 petabyte (EiB)260 exbibyte(EB)1018260 exabyte (ZiB)270 zebibyte(ZB)1021270 zettabyte (YiB)280 yobibyte(YB)1024280yottabyte See also: Multiples of bits·Orders of magnitude of dataThe yottabyte 877.545.XACT www.xactdatadiscovery.com

  10. previous 877.545.XACT www.xactdatadiscovery.com

  11. previous Electronic Discovery Reference Modela Viable Big Data Strategy in Disguise • Structured vs. Unstructured Data (the IDC) Again, what we want to notice is the exponential growth of what is Unstructured compared to that which is Structured This should sound an Alarm A very loud ALARM This problem is very real: action is required NOW 877.545.XACT www.xactdatadiscovery.com

  12. previous Electronic Discovery Reference Modela Viable Big Data Strategy in Disguise Information Management – Refers to the pre-discovery ESI filing and retention ==== and is the new Information Governance Identification Preservation Collection Processing Review Analysis Production 877.545.XACT www.xactdatadiscovery.com

  13. previous Electronic Discovery Reference Model 877.545.XACT www.xactdatadiscovery.com

  14. previous Information Management/Records Management • Internal e-Discovery Preparedness & Response Team • Identify & retain those who know: collaborate • Perform inventory: find it, name it, categorize it • Prepare location map: knowing where stuff is • Advanced Inventory: making sure • Procedures for processes • Audit and Edit Policies and Supporting Manual (documentation) • Validate retrieval and accuracy of data retrieved • Making Sure ---- absolutely sure 877.545.XACT www.xactdatadiscovery.com

  15. previous Collection: an E-Discovery Term – Is it just E-Discovery? • Collect what is most likely relevant or responsive: ESI is narrowed for many reasons, all of which must be documented and considered for reasonability • Note: there may be ESI preserved but not further searched or collected because of inaccessibility (unreasonable or disproportionate cost and effort) • Keyword search preparation, date ranges, file types • In Rule 26(f) conference reach agreement on the what • Deploy and collect data from all known repositories • (tapes, drives, portable storage devices, networks, etc.) • Ensure use of chain of custody and authenticity protocols: • preserve metadata; MD5 or SHA-1 Hash authentication, labeling, and identification • Meticulous record keeping and documentation of procedures • Sources are required throughout the collection process. 877.545.XACT www.xactdatadiscovery.com

  16. previous Processing • Remove system files: stuff not needed (?) • De-Duplication: how do you know (?) • Date filter (dates ---- could be a moving target) • File type (do you know many there can be) • Upwards of 1700 to 2000: nah --- Yes • Keyword Searching: which “key” words • Conceptual Searching: huh – what’s that (?) • Process to Native or Tiff for Legal Team review • What does this mean (?) 877.545.XACT www.xactdatadiscovery.com

  17. previous Processing • Predictive Coding • How many of you know what this is • How many of you are in some fashion using this approach • How many of you know that some courts are requiring litigants to use predictive coding in the e-discovery process? 877.545.XACT www.xactdatadiscovery.com

  18. previous Review • Review for relevance: • confidentiality and privilege, and related activity such as redaction • 50% + E-Discovery expenses are in this phase • Review in house • Summation, Concordance) • Review hosted • (Relativity, iCONECTnxt, FYI) 877.545.XACT www.xactdatadiscovery.com

  19. previous Inertia Wins the Battle: Bad Stuff Happens – to Those Who Refuse to Prepare • Records Have Become Evidence • The intricate attributes of data, information, content, records are now placed fully under the most powerful of legally litigated microscopes and it’s all DISCOVERABLE • Privileged vs. Non-privileged: it is a slippery slope • In the Discovery Process it’s all available • Let’s read some more……. 877.545.XACT www.xactdatadiscovery.com

  20. previous Inertia Wins the Battle: Bad Stuff Happens – to Those Who Refuse to Prepare • Scentsy Inc. v. B.R. Chase LLC: • …..the Court ordered that a defendant would be allowed to depose “appropriate individuals” to determine whether relevant information was destroyed, with Plaintiff to bear the costs. • …..moreover, while there was no way to know for sure if documents were deleted, the Court noted that the uncertainty was “caused by Scentsy’s inadequate retention policy couple with its late and imprecise litigation hold. • …..in the face of warnings given in this decision, constitutes the kind of willfulness or recklessness which may result in serious repercussions. 877.545.XACT www.xactdatadiscovery.com

  21. previous Inertia – Defeating It • Ten Steps to Take: Beginning Today • Step 1: Realize this is not a joke and it can, and probably will happen to you • Step 2: Get good on what is really going on in the market and with the world of BIG DATA. • Step 3: Get up to speed on the world of Electronic Discovery and its relevance and dependency on Records Management and Information Governance 877.545.XACT www.xactdatadiscovery.com

  22. previous Inertia – Defeating It • Ten Steps to Take: Beginning Today • Step 4: Translate the Electronic Discovery conversation into the conversation about Data (Records / Information) Discovery. • Step 5: Start, today, transforming your paradigm into knowing what your organization has, where it is stored, what it is called, who uses it and has access to it. • Step 6: Begin to formulate questions about discovering the truth about your organization’s data – its records – its information – its content – its metadata: ALL OF IT 877.545.XACT www.xactdatadiscovery.com

  23. previous Inertia – Defeating It • Ten Steps to Take: Beginning Today • Step 7: Begin asking questions. I can almost guarantee you that no one in your organization (outside of the legal department working on a piece of litigation) is asking • Step 8: What questions: • a: where are the repositories • b: what are the file types • c: how much is there • d: who are the custodians • e: how does creation, receipt, destruction happen • f: and so on, and so on……… 877.545.XACT www.xactdatadiscovery.com

  24. previous Inertia – Defeating It • Ten Steps to Take: Beginning Today • Step 9: Record Keeping – don’t forget who you are and what you are capable of accomplishing • Step 10: Know what you’re after • Great thought: what are you after? What are we after? • How can we who implicitly understand the nuances, and all of the good, the bad and the ugly of Records Management and Information Governance……………… 877.545.XACT www.xactdatadiscovery.com

  25. Governing Big DataWhat Strategy Works Best Nick Reizen, Vice President of eDiscovery Scott Polus, Director of Forensic Services with Bill Millican, Director of Sales & Operations XACT Data Discovery 1.877.545.XACT www.xactdatadiscovery.com

More Related