
Arkansas Research Center

Arkansas Research Center. arc.arkansas.gov. Greg Holland, Ph.D. greg.holland@arkansas.gov Neal Gibson, Ph.D. neal.gibson@arkansas.gov. UAMS Study. Privacy. Center on Law and Information Policy Highly Critical of Federal Plan to Relax Restrictions on Education Records.


Presentation Transcript


  1. Arkansas Research Center arc.arkansas.gov Greg Holland, Ph.D. greg.holland@arkansas.gov Neal Gibson, Ph.D. neal.gibson@arkansas.gov

  2. UAMS Study

  3. Privacy: Center on Law and Information Policy Highly Critical of Federal Plan to Relax Restrictions on Education Records. This notice of proposed rulemaking (NPRM) represents a significant new privacy invasion. We do not oppose general collection of information about students and its use in a nonidentifiable form. In fact we believe that the collection of aggregate information on students is a critical tool for civil rights enforcement and in assuring that every student receives equal access to a high quality education. CLIP’s Academic Director, Professor Joel R. Reidenberg, stated that “the promiscuous sharing of children’s educational records that would be permitted by these proposals undermines privacy and is outside the Department’s authority.”

  4. TrustEd: Knowledgebase Identity Management (KIM) and TrustEd Identifier Management (TIM). [Diagram labels: Agency 1, Agency 2; KIM (Knowledgebase Identity Management), PII DB; TIM (Identifier Management), Non-PII DBs for agencies/researchers; De-identified Research Databases (no link between).]

  5. TrustEd: KIM & TIM. An agency data record holds PII and non-PII research data. [Diagram labels: Research Data, RecID, PII, SourceID; TIM (Identifier Management), Non-PII DB; KIM (Knowledgebase Identity Management); De-identified Research Databases.]

  6. TrustEd: KIM & TIM. [Diagram labels: Research Data, RecID, PII, KB_ID, KIM_ID; TIM (Identifier Management), Non-PII DB; KIM (Knowledgebase Identity Management), KIM_ID, RecID; De-identified Research Databases.]

  7. TrustEd: KIM & TIM. [Diagram labels: KIM_ID, SourceID, RecID, TIM_ID, Research Data, Agency_TIM_ID; TIM (Identifier Management), Non-PII DB; KIM (Knowledgebase Identity Management); De-identified Research Databases.]

  8. TrustEd: KIM & TIM. [Diagram labels: RecID, SourceID, various TIM_IDs; temporary ID crosswalks; Research Data, PII; Agency 1, Agency 2, Agency 3; TIM (Identifier Management), Non-PII DBs; KIM (Knowledgebase Identity Management); De-identified Research Databases.]
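
The diagrams on slides 5 to 8 survive only as labels, so the exact data flow is not recoverable from this transcript. The following is a minimal sketch of one plausible reading: KIM sees PII and returns an opaque KIM_ID, TIM sees only KIM_IDs and issues agency-specific TIM_IDs, and the RecID crosswalk is temporary. The class names `Kim` and `Tim`, the function `deidentify`, the random-token ID format, and the sample record are all assumptions made for illustration, not the Arkansas Research Center's implementation.

```python
import secrets


class Kim:
    """Hypothetical stand-in for KIM: holds PII, hands back an opaque KIM_ID."""

    def __init__(self):
        self._ids = {}  # PII tuple -> KIM_ID

    def resolve(self, pii):
        # Exact-match lookup only; real identity resolution is much richer.
        if pii not in self._ids:
            self._ids[pii] = "K" + secrets.token_hex(4)
        return self._ids[pii]


class Tim:
    """Hypothetical stand-in for TIM: never sees PII, only KIM_IDs."""

    def __init__(self):
        self._ids = {}  # (agency, KIM_ID) -> agency-specific TIM_ID

    def agency_id(self, agency, kim_id):
        key = (agency, kim_id)
        if key not in self._ids:
            self._ids[key] = "T" + secrets.token_hex(4)
        return self._ids[key]


def deidentify(records, source_id, kim, tim):
    """Swap the PII in each record for an agency-specific TIM_ID."""
    # Temporary crosswalk (RecID -> KIM_ID); it is discarded on return.
    crosswalk = {r["RecID"]: kim.resolve(r["PII"]) for r in records}
    return [
        {"Agency_TIM_ID": tim.agency_id(source_id, crosswalk[r["RecID"]]),
         **r["research"]}
        for r in records
    ]


# Made-up sample data for one person reported by two sources.
person = ("Jane", "Doe", "2000-01-01")
kim, tim = Kim(), Tim()
k12 = [{"RecID": 1, "PII": person, "research": {"grade": 11}}]
wf = [{"RecID": 7, "PII": person, "research": {"year": 2011}}]
out_k12 = deidentify(k12, "K12", kim, tim)
out_wf = deidentify(wf, "Workforce", kim, tim)
```

In this sketch the two output rows carry different identifiers for the same person, so the research datasets cannot be joined to each other without going back through the hypothetical `Tim` object.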

  9. You are the identity manager… Record 1: MARIA / WILSON HIGH SCHOOL / CASTILLO-DELGADO. Record 2: MARIA / JONES COMMUNITY COLLEGE / CASTILLO-DELG.

  10. You are the identity manager… Record 1: MARIA D / WILSON HS / CASTILLO-DELGADO. Record 2: MARIA C / JONES CC / CASTILLO-DELG.

  11. You are the identity manager… Record 1: MARIA D / WILSON HS / CASTILLO-DELGADO / DOB: 11/05/1995. Record 2: MARIA C / JONES CC / CASTILLO-DELG / DOB: 9/24/1994.

  12. Exact v. Fuzzy Matching (Deterministic v. Probabilistic). Exact matching drives the majority of identity resolution (Pareto Rule: 80% is easy). Probabilistic algorithms: Soundex, QTR, Edit Distance, Neural Networks (Pareto Rule: 20% require 80% of effort). You want a system that does what YOU, a human, would do.
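
To make the exact-versus-fuzzy distinction on slide 12 concrete, here is a small sketch of two of the named techniques, Soundex and edit distance (Levenshtein), applied to the truncated surname from the earlier slides. The `compare` helper and its `max_edits=3` threshold are assumptions chosen for illustration; the slides do not say how the real system combines or tunes these algorithms.

```python
def soundex(name):
    """American Soundex code: first letter plus three digits."""
    codes = {}
    for group, digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                         ("L", "4"), ("MN", "5"), ("R", "6")):
        for ch in group:
            codes[ch] = digit
    letters = [c for c in name.upper() if c.isalpha()]
    if not letters:
        return ""
    out = letters[0]
    prev = codes.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "HW":
            continue  # H and W do not separate letters with the same code
        code = codes.get(ch, "")
        if code and code != prev:
            out += code
        prev = code  # a vowel resets prev, so a repeated code counts again
    return (out + "000")[:4]


def edit_distance(a, b):
    """Levenshtein distance: minimum single-character edits from a to b."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                 # deletion
                           cur[j - 1] + 1,              # insertion
                           prev[j - 1] + (ca != cb)))   # substitution
        prev = cur
    return prev[-1]


def compare(a, b, max_edits=3):
    """Try the cheap exact test first, then fall back to fuzzy tests."""
    if a == b:
        return "exact"
    if soundex(a) == soundex(b) or edit_distance(a, b) <= max_edits:
        return "fuzzy"
    return "no match"


result = compare("CASTILLO-DELGADO", "CASTILLO-DELG")  # "fuzzy"
```

Both surnames share the Soundex code C234 and differ by three deletions, so the exact test fails while either fuzzy test succeeds. A fuzzy hit is only a candidate: the conflicting middle initials and dates of birth on slide 11 are the kind of evidence a human reviewer would weigh next.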

  13. Identity Resolution (K12) • There are ~55,000 unique first names among students in Arkansas and ~40,000 last names. • Approximately 20% of Arkansas students share both the same first and last name with another student. (1 out of 5!)

  14. Identity Resolution (K12) • There are 4,026 students in Arkansas who share an SSN with at least one other student in the state. • Between August and January, 874 student transfers to other schools resulted in an SSN change. • Between August and January, an additional 1,018 students changed their SSN; we have records for only 300 of these changes. • There are ~17,000 students in Arkansas with a “900” SSN.

  15. Identity Resolution (Workforce) • ~60,000,000 records for 11 years, 2,938,718 unique SSNs, few DOBs, inconsistent name formats. • 7,865 SSNs are used by two or more people, for a total of 18,278 different individuals. If SSN were the primary key, their incomes would be combined and they would be treated as the same person. • The same person has two or more SSNs (because of a typo/transposition) 13,373 times. If SSN were the primary key, there would be 13,373 additional (non-existent) people with separate incomes.
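
The two failure modes on slide 15 can be reproduced on a toy table. The records below are made up for illustration (the person labels, SSN placeholders, and wages are not from the workforce data); the sketch only shows what happens to income totals when SSN is used as the key.

```python
from collections import defaultdict

# Made-up wage records: (true person, reported SSN, wages).
records = [
    ("person_a", "SSN1", 20000),
    ("person_b", "SSN1", 30000),  # different person reporting the same SSN
    ("person_c", "SSN2", 25000),
    ("person_c", "SSN3", 5000),   # same person, SSN mistyped once
]


def totals(rows, key_index):
    """Sum wages grouped by the chosen column (0 = person, 1 = SSN)."""
    sums = defaultdict(int)
    for row in rows:
        sums[row[key_index]] += row[2]
    return dict(sums)


by_ssn = totals(records, 1)
# {'SSN1': 50000, 'SSN2': 25000, 'SSN3': 5000}
by_person = totals(records, 0)
# {'person_a': 20000, 'person_b': 30000, 'person_c': 30000}
```

Keyed on SSN, two people collapse into one earner with a combined income of 50,000, and one person splits into two earners with 25,000 and 5,000. None of the three SSN-keyed totals matches a real individual's income.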

  16. Record Linking: Merge/Purge. [Diagram: File A, File B.] Your knowledge is limited to what’s in these two files ONLY.

  17. Knowledgebase Approach. All known representations are stored to facilitate matching in the future and possibly resolve past matching errors. [Diagram: PII representations in the KIM Knowledgebase: Bob Smith, 1/31/1980, SSN1; Robert Smith, 1/31/1981, SSN1; Bobby Smith, 1/31/1980, SSN2.]
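
A minimal sketch of the knowledgebase idea on slide 17, using the three Smith representations shown there. The `Knowledgebase` class and its matching rule (an incoming record matches an identity if at least two of its three fields agree with any one stored representation) are assumptions for illustration only; the transcript does not include the actual rules from the "Knowledgebase Rules" slide.

```python
class Knowledgebase:
    """Hypothetical store of every known representation of each identity."""

    def __init__(self):
        self.representations = {}  # KB_ID -> set of (name, dob, ssn)
        self._next_id = 1

    def add_identity(self, *reps):
        kb_id = self._next_id
        self._next_id += 1
        self.representations[kb_id] = set(reps)
        return kb_id

    def resolve(self, rep):
        """Return the KB_ID for rep, remembering rep for future matching."""
        for kb_id, known in self.representations.items():
            # Assumed rule: two of three fields agree with one stored form.
            if any(sum(x == y for x, y in zip(rep, k)) >= 2 for k in known):
                known.add(rep)
                return kb_id
        return self.add_identity(rep)


kb = Knowledgebase()
smith = kb.add_identity(
    ("Bob Smith", "1/31/1980", "SSN1"),
    ("Robert Smith", "1/31/1981", "SSN1"),
    ("Bobby Smith", "1/31/1980", "SSN2"),
)

incoming = ("Bobby Smith", "1/31/1981", "SSN2")
matched = kb.resolve(incoming)  # same KB_ID as smith
```

The incoming record agrees with the stored "Bobby Smith" form on name and SSN, so it resolves to the existing identity. Compared against "Bob Smith, 1/31/1980, SSN1" alone, as in a two-file merge/purge, it would agree on no field at all under this rule.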

  18. Knowledgebase Rules

  19. KIM Consolidation Process

  20. KIM Stats

  21. KIM/TIM: Protecting Privacy. [Diagram: TIM (TrustEd Identifier Management) with three records under different identifiers: KT2342, Grade 11 Lit Score 245; WF8839, 2011 Salary $31,500; HE9051, College GPA 3.09.]

  22. Questions? Arkansas Research Center KIM/TIM Information Neal.Gibson@arkansas.gov Greg.Holland@arkansas.gov http://arc.arkansas.gov
