1 / 26

Name? Yes, allowing for spelling variations Age? Yes, allowing for some variance

What search criteria would I use to search for James in 1881?. Name? Yes, allowing for spelling variations Age? Yes, allowing for some variance Occupation? No. Often changed over the decade Place of Birth? Yes and No. Information in 1881 censuses varies

Download Presentation

Name? Yes, allowing for spelling variations Age? Yes, allowing for some variance

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. What search criteria would I use to search for James in 1881? Name? Yes, allowing for spelling variations Age?Yes, allowing for some variance Occupation? No. Often changed over the decade Place of Birth? Yes and No. Information in 1881 censuses varies Marital status? No. Often changed over the decade Gender? Sure. It could be mis-coded, but is likely correct.

  2. Forget diamonds or dogs… • MySQL: a true best friend http://www.mysql.com/

  3. 1871 Census extract:

  4. 1NF: What are our entities? Family units? Works for family of 4, but what about the Eunson family with 8 in the family? We would have to add new columns, so no, Family units is no good as an entity.

  5. 1NF: What are our entities? Individuals? Yes, individuals works. So, let’s give each individual a Primary Key

  6. 1NF • Primary Key (PK)

  7. 1NF: 3 Goals Goal 1: no duplicated rows. Each row should have a PK. Goal 2: Each cell contains only one value. No groups or comma-separated lists Goal 3: Any given column contains the same kind of data Goal 1: no duplicated rows. Each row should have a PK. Goal 2: Each cell contains only one value. No groups or comma-separated lists Goal 3: Any given column contains the same kind of data

  8. 1NF: New Entities, New Tables • Primary Key (PK)

  9. 1NF: 3 Goals for the new tables Goal 1: no duplicated rows. Each row should have a PK. Goal 2: Each cell contains only one value. No groups or comma-separated lists Goal 3: Any given column contains the same kind of data Goal 1: no duplicated rows. Each row should have a PK. Goal 2: Each cell contains only one value. No groups or comma-separated lists Goal 3: Any given column contains the same kind of data Goal 1: no duplicated rows. Each row should have a PK. Goal 2: Each cell contains only one value. No groups or comma-separated lists Goal 3: Any given column contains the same kind of data Birthplace fails at Goal 2 and Goal 3 holding parish, county and country data in a comma-separated list

  10. Primary Key (PK) • Junction Table • Foreign Key (FK)

  11. 2NF: “Sub-entities” • Repeated bits of information should now be moved into their own tables. Information repeated in Relate to HH, M. Stat and Gender columns = new tables.

  12. 3NF • If we delete a row, will we lose any data that other records might need? • If we deleted George Struther’s row, will we lose the information that the code for New Zealand is 5? • No, because it is entered into the Country table – the code in the Individual table is a Foreign Key

  13. 3NF • 2. If we add a row, could we accidentally make any data entry mistakes that would compromise our database’s integrity? • If we accidentally entered incorrect information into a row, would that impact any other part of the database? • No, because even if we code Dr Ross as a girl, the gender table still holds the accurate code for male.

  14. 3NF • 3. Have we stored any values that we should calculate instead? • In many circumstances we would calculate age because it is dynamic. • In the case of historic data like the census the recorded age is all we have, and it could have been mis-recorded, so we do not calculate the age in this case.

  15. One other thing • Separate names into separate columns • If, for example, you want to search for the density of a given surname across all of Scotland, you will have trouble doing so unless surnames are separated out from first names.

  16. Soldiers from CrikeyVillage, Wellington County, England who were wounded while in service in the Western Desert Campaign with the 7th Armoured Division

More Related