1 / 3

Segmentation by natural partitioning

Segmentation by natural partitioning. 3-4-5 rule can be used to segment numeric data into relatively uniform, “natural” intervals. * If an interval covers 3, 6, 7 or 9 distinct values at the most significant digit, partition the range into 3 equi-width intervals

Download Presentation

Segmentation by natural partitioning

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Segmentation by natural partitioning 3-4-5 rule can be used to segment numeric data into relatively uniform, “natural” intervals. * If an interval covers 3, 6, 7 or 9 distinct values at the most significant digit, partition the range into 3 equi-width intervals * If it covers 2, 4, or 8 distinct values at the most significant digit, partition the range into 4 intervals * If it covers 1, 5, or 10 distinct values at the most significant digit, partition the range into 5 intervals

  2. count -$351 -$159 profit $1,838 $4,700 Step 1: Min Low (i.e, 5%-tile) High(i.e, 95%-0 tile) Max Step 2: msd=1,000 Low=-$1,000 High=$2,000 (-$1,000 - $2,000) Step 3: (-$1,000 - 0) ($1,000 - $2,000) (0 -$ 1,000) ($2,000 - $5, 000) ($1,000 - $2, 000) (-$4000 - 0) (0 - $1,000) (0 - $200) ($1,000 - $1,200) (-$4000 - -$3000) ($2,000 - $3,000) ($200 - $400) ($1,200 - $1,400) (-$3000 - -$2000) ($3,000 - $4,000) ($1,400 - $1,600) ($400 - $600) (-$2000 - -$1000) ($4,000 - $5,000) ($600 - $800) ($1,600 - $1,800) ($1,800 - $2,000) ($800 - $1,000) (-$1000 - 0) Example of 3-4-5 rule (-$4000 -$5,000) Step 4:

  3. Example of 3-4-5 rule (continued) • Step 1 – Min=-$351,976, Max=$4,700,896, low (5th percentile)=-$159,876, high (95th percentile)=$1,838,761 • Step 2 – For low and high, most significant digit is at $1,000,000, rounding low -$1,000,000, rounding high $2,000,000 • Step 3 – interval ranges over 3 distinct values at the most significant digit, so using 3-4-5 rule partition into 3 intervals, -$1,000,000-$0, $0-$1,000,000, and $1,000,000-$2,000,000 • Step 4 – Examine Min & Max values to see how they “fit” into first level partitions, first partition covers Min value, so adjust left boundary to make partition smaller, last partition doesn’t cover Max value, so create a new partition (round max up to next significant digit) $2,000,000-$5,000,000 • Step 5 – Recursively, each interval can be further partitioned using 3-4-5 rule to form next lower level of the hierarchy

More Related