40 likes | 124 Views
Problem addressed. HTML 4.01 strict, XHTML 1.0 strict or superior are allowed by the Italian regulations Distinct dominions not easily comparable 43% of Italian Public institutions homepages are declared DTD strict
E N D
Problem addressed HTML 4.01 strict, XHTML 1.0 strict or superior are allowed by the Italian regulations Distinct dominions not easily comparable 43% of Italian Public institutions homepages are declared DTD strict Meaningful synthesis about code validity cannot simply exclude 57% of monitored Web contents 1
Strategy We issued an experimental approach, thereby defining a metrics to: quantify expected errors on a target DTD from a different one; evaluate 3 properties: validity, strictness, markup quality. Such a metrics can be generalized to: approach a more general instance about comparing documents with different DTDs. 2
We defined the expected number of errors for a given DTD as: We called the couple: (x'i, x''i) Errors Springing Up Rate(ESUR). They are computed, based on a wide sample of sites according to a couple of weighted averages on errors about DTDs. a sample of 1000 Web sites has been used to compute x'i and x''i as follows: Strategy (2) 3