840 likes | 890 Views
This lecture explores BCNF and 3NF in database systems, discussing examples of bad relations and the necessity of these normalization forms in eliminating redundancies. Multivalued dependencies (MVDs) are also explained to capture redundancies that functional dependencies cannot address.
E N D
CPSC-310 Database Systems Professor Jianer Chen Room 315C HRBB Lecture #8
BCNF and 3NF 3NF BCNF BCNF 3NF
BCNF and 3NF 3NF BCNF BCNF 3NF
BCNF and 3NF 3NF BCNF BCNF 3NF
BCNF and 3NF 3NF BCNF BCNF 3NF
BCNF and 3NF 3NF BCNF BCNF 3NF
BCNF and 3NF 3NF BCNF BCNF 3NF Would BCNF be sufficient to eliminate other kind of redundancies?
Example of a bad relation in BCNF • Suppose that we have a relation that lists the teacher- student relationship for each person. That is, for a person p, the relation gives all teachers who ever taught p and all students who were ever taught by p. For example, suppose that David and Jeff taught Kevin, while Kevin taught Michael, Jason, and Tom. Then the relation should contain tuples:
Example of a bad relation in BCNF • Suppose that we have a relation that lists the teacher- student relationship for each person. That is, for a person p, the relation gives all teachers who ever taught p and all students who were ever taught by p. For example, suppose that David and Jeff taught Kevin, while Kevin taught Michael, Jason, and Tom. Then the relation should contain tuples:
Example of a bad relation in BCNF • Suppose that we have a relation that lists the teacher- student relationship for each person. That is, for a person p, the relation gives all teachers who ever taught p and all students who were ever taught by p. For example, suppose that David and Jeff taught Kevin, while Kevin taught Michael, Jason, and Tom. Then the relation should contain tuples: • There are some obvious redundancies: suppose Kevin has n teachers and m students, then there will be n*m tuples for Kevin! On the other hand, it indeed needs no more than n+m (smaller) tuples to record all information about Kevin’s teachers and students.
Example of a bad relation in BCNF • Suppose that we have a relation that lists the teacher- student relationship for each person. That is, for a person p, the relation gives all teachers who ever taught p and all students who were ever taught by p. For example, suppose that David and Jeff taught Kevin, while Kevin taught Michael, Jason, and Tom. Then the relation should contain tuples: • There are some obvious redundancies: suppose Kevin has n teachers and m students, then there will be n*m tuples for Kevin! On the other hand, it indeed needs no more than n+m (smaller) tuples to record all information about Kevin’s teachers and students. • On the other hand, the relation is in BCNF: the only key is • {person, teacher, student} • so there is no BCNF violators.
Definition of MVD • MVD captures redundancy that FD’s can’t
Definition of MVD • MVD captures redundancy that FD’s can’t • A multivalued dependency (MVD) X↠Y on a relation R says that if two tuples of R agree on all the attributes of X, then by swapping their components in Y, we still get tuples in R, i.e., for each value of X, the values of Y are independent of the values of R\(X∪Y).
Definition of MVD • MVD captures redundancy that FD’s can’t • A multivalued dependency (MVD) X↠Y on a relation R says that if two tuples of R agree on all the attributes of X, then by swapping their components in Y, we still get tuples in R, i.e., for each value of X, the values of Y are independent of the values of R\(X∪Y).
Definition of MVD • MVD captures redundancy that FD’s can’t • A multivalued dependency (MVD) X↠Y on a relation R says that if two tuples of R agree on all the attributes of X, then by swapping their components in Y, we still get tuples in R, i.e., for each value of X, the values of Y are independent of the values of R\(X∪Y).
Definition of MVD • MVD captures redundancy that FD’s can’t • A multivalued dependency (MVD) X↠Y on a relation R says that if two tuples of R agree on all the attributes of X, then by swapping their components in Y, we still get tuples in R, i.e., for each value of X, the values of Y are independent of the values of R\(X∪Y).
Definition of MVD • MVD captures redundancy that FD’s can’t • A multivalued dependency (MVD) X↠Y on a relation R says that if two tuples of R agree on all the attributes of X, then by swapping their components in Y, we still get tuples in R, i.e., for each value of X, the values of Y are independent of the values of R\(X∪Y).
Definition of MVD • MVD captures redundancy that FD’s can’t • A multivalued dependency (MVD) X↠Y on a relation R says that if two tuples of R agree on all the attributes of X, then by swapping their components in Y, we still get tuples in R, i.e., for each value of X, the values of Y are independent of the values of R\(X∪Y). Thus,person ↠ teacher
Another Definition of MVDs Suppose X↠Yis a MVD in a relation R. Let W be the set of attributes of R that are not in X∪Y. Then for each value x of X, there is a set VY of values of Y and a set VW of values of W such that R contains exactly the tuples of the form (x, yi, wk), where yi and wk run over all combinations of the values in VY and VW.
Another Definition of MVDs Suppose X↠Yis a MVD in a relation R. Let W be the set of attributes of R that are not in X∪Y. Then for each value x of X, there is a set VY of values of Y and a set VW of values of W such that R contains exactly the tuples of the form (x, yi, wk), where yi and wk run over all combinations of the values in VY and VW. x
Another Definition of MVDs Suppose X↠Yis a MVD in a relation R. Let W be the set of attributes of R that are not in X∪Y. Then for each value x of X, there is a set VY of values of Y and a set VW of values of W such that R contains exactly the tuples of the form (x, yi, wk), where yi and wk run over all combinations of the values in VY and VW. VY VW x w1 y1 ⁞ ⁞ ⁞ yp wq
Another Definition of MVDs Suppose X↠Yis a MVD in a relation R. Let W be the set of attributes of R that are not in X∪Y. Then for each value x of X, there is a set VY of values of Y and a set VW of values of W such that R contains exactly the tuples of the form (x, yi, wk), where yi and wk run over all combinations of the values in VY and VW. VY VW R x w1 y1 wk w1 . . wq . . y1 . yp . . yi . x x . . . . x ⁞ ⁞ ⁞ yp wq
Another Definition of MVDs Suppose X↠Yis a MVD in a relation R. Let W be the set of attributes of R that are not in X∪Y. Then for each value x of X, there is a set VY of values of Y and a set VW of values of W such that R contains exactly the tuples of the form (x, yi, wk), where yi and wk run over all combinations of the values in VY and VW. VY VW R x Look at the example we discussed.
Another Definition of MVDs Suppose X↠Yis a MVD in a relation R. Let W be the set of attributes of R that are not in X∪Y. Then for each value x of X, there is a set VY of values of Y and a set VW of values of W such that R contains exactly the tuples of the form (x, yi, wk), where yi and wk run over all combinations of the values in VY and VW. VY VW R Kevin Look at the example we discussed.
Another Definition of MVDs Suppose X↠Yis a MVD in a relation R. Let W be the set of attributes of R that are not in X∪Y. Then for each value x of X, there is a set VY of values of Y and a set VW of values of W such that R contains exactly the tuples of the form (x, yi, wk), where yi and wk run over all combinations of the values in VY and VW. VY VW R Kevin David Michael Jeff Jason Tom Look at the example we discussed.
Another Definition of MVDs Suppose X↠Yis a MVD in a relation R. Let W be the set of attributes of R that are not in X∪Y. Then for each value x of X, there is a set VY of values of Y and a set VW of values of W such that R contains exactly the tuples of the form (x, yi, wk), where yi and wk run over all combinations of the values in VY and VW. VY VW R Kevin David Michael Jeff Jason Tom Look at the example we discussed.
Another Example Consumers(name, addr, phones, beersLiked)
Another Example Consumers(name, addr, phones, beersLiked) • A consumer’s phones are independent of the beers they like.
Another Example Consumers(name, addr, phones, beersLiked) • A consumer’s phones are independent of the beers they like. • name ↠ phones and name ↠ beersLiked.
Another Example Consumers(name, addr, phones, beersLiked) • A consumer’s phones are independent of the beers they like. • name ↠ phones and name ↠ beersLiked. • Thus, each of a consumer’s phones appears with each of the beer they like in all combinations.
Another Example Consumers(name, addr, phones, beersLiked) • A consumer’s phones are independent of the beers they like. • name ↠ phones and name ↠ beersLiked. • Thus, each of a consumer’s phones appears with each of the beer they like in all combinations. • This repetition is unlike FD redundancy.
Another Example Consumers(name, addr, phones, beersLiked) • A consumer’s phones are independent of the beers they like. • name ↠ phones and name ↠ beersLiked. • Thus, each of a consumer’s phones appears with each of the beer they like in all combinations. • This repetition is unlike FD redundancy. • name → addr is the only FD.
Another Example Consumers(name, addr, phones, beersLiked) • A consumer’s phones are independent of the beers they like. • name ↠ phones and name ↠ beersLiked. • Thus, each of a consumer’s phones appears with each of the beer they like in all combinations. • This repetition is unlike FD redundancy. • name → addr is the only FD.
Another Example Consumers(name, addr, phones, beersLiked) • A consumer’s phones are independent of the beers they like. • name ↠ phones and name ↠ beersLiked. • Thus, each of a consumer’s phones appears with each of the beer they like in all combinations. • This repetition is unlike FD redundancy. • name → addr is the only FD. Then these tuples must also be in the relation.
Another Example Consumers(name, addr, phones, beersLiked) • A consumer’s phones are independent of the beers they like. • name ↠ phones and name ↠ beersLiked. • Thus, each of a consumer’s phones appears with each of the beer they like in all combinations. • This repetition is unlike FD redundancy. • name → addr is the only FD. If X ↠ Y X Y others Then these tuples must also be in the relation.
Another Example Consumers(name, addr, phones, beersLiked) • A consumer’s phones are independent of the beers they like. • name ↠ phones and name ↠ beersLiked. • Thus, each of a consumer’s phones appears with each of the beer they like in all combinations. • This repetition is unlike FD redundancy. • name → addr is the only FD. If X ↠ Y X Y others equal Then these tuples must also be in the relation.
Another Example Consumers(name, addr, phones, beersLiked) • A consumer’s phones are independent of the beers they like. • name ↠ phones and name ↠ beersLiked. • Thus, each of a consumer’s phones appears with each of the beer they like in all combinations. • This repetition is unlike FD redundancy. • name → addr is the only FD. If X ↠ Y X Y others equal exchange Then these tuples must also be in the relation.
MVD Rules • Every FD is an MVD (promotion).
MVD Rules • Every FD is an MVD (promotion). * If X→Y, then swapping Y’s between two tuples that agree on X doesn’t change the tuples.
MVD Rules • Every FD is an MVD (promotion). * If X→Y, then swapping Y’s between two tuples that agree on X doesn’t change the tuples. * Therefore, the “new” tuples are surely in the relation, and we know X ↠ Y.
MVD Rules • Every FD is an MVD (promotion). * If X→Y, then swapping Y’s between two tuples that agree on X doesn’t change the tuples. * Therefore, the “new” tuples are surely in the relation, and we know X ↠ Y. • Transitiveness: If X↠Y, and Y↠Z, then X↠Z.
MVD Rules • Every FD is an MVD (promotion). * If X→Y, then swapping Y’s between two tuples that agree on X doesn’t change the tuples. * Therefore, the “new” tuples are surely in the relation, and we know X ↠ Y. • Transitiveness: If X↠Y, and Y↠Z, then X↠Z. • Complementation: If X↠Y, and Z is all the other attributes, then X↠Z.
However, Splitting Doesn’t Hold • Like FD’s, we cannot generally split the left side of an MVD.
However, Splitting Doesn’t Hold • Like FD’s, we cannot generally split the left side of an MVD. • But unlike FD’s, we cannot split the right side either --- sometimes you have to leave several attributes on the right side.
However, Splitting Doesn’t Hold • Like FD’s, we cannot generally split the left side of an MVD. • But unlike FD’s, we cannot split the right side either --- sometimes you have to leave several attributes on the right side. • Example Consumers(name, areaCode, phone, beersLiked, manf)
However, Splitting Doesn’t Hold • Like FD’s, we cannot generally split the left side of an MVD. • But unlike FD’s, we cannot split the right side either --- sometimes you have to leave several attributes on the right side. • Example Consumers(name, areaCode, phone, beersLiked, manf) * A drinker can have several phones, with the number divided between areaCode and phone (last 7 digits).
However, Splitting Doesn’t Hold • Like FD’s, we cannot generally split the left side of an MVD. • But unlike FD’s, we cannot split the right side either --- sometimes you have to leave several attributes on the right side. • Example Consumers(name, areaCode, phone, beersLiked, manf) * A drinker can have several phones, with the number divided between areaCode and phone (last 7 digits). * A drinker can like several beers, each with its own manufacturer.