1 / 13

Automatic extraction of BI-RADS breast tissue composition classes from mammography reports

Automatic extraction of BI-RADS breast tissue composition classes from mammography reports. Bethany Percha (Stanford) Houssam Nassif (U. Wisconsin) Jafi Lipson (Stanford) Elizabeth Burnside (U. Wisconsin) Daniel Rubin (Stanford).

deion
Download Presentation

Automatic extraction of BI-RADS breast tissue composition classes from mammography reports

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Automatic extraction of BI-RADS breast tissue composition classes from mammography reports • Bethany Percha (Stanford) • Houssam Nassif (U. Wisconsin) • Jafi Lipson (Stanford) • Elizabeth Burnside (U. Wisconsin) • Daniel Rubin (Stanford)

  2. Breast density is an important aspect of the radiological evaluation of the breast. • Dense fibroglandular tissue is a risk factor for breast cancer. • Dense tissue decreases mammographic sensitivity. • Breast tissue composition is partially genetic.

  3. The BI-RADS system divides breast composition into four categories. • Fatty • Scattered fibroglandular • Heterogeneously dense • Dense

  4. These standardized categories... • Help stratify patients at time of screening. • Enable radiologists to quality observations with discussion of how mammographic sensitivity may limit them. • Minimize ambiguity.

  5. Limitations of the BI-RADS system • Breast composition information typically reported as part of narrative text. • No one textual pattern can extract composition information with 100% accuracy. • Large research studies require information from thousands of reports.

  6. Our approach... • Borrow techniques from text-mining to extract breast tissue composition information quickly and easily. • Use pattern-matching and regular expression to identify and extract descriptions.

  7. Methods: Data • Three distinct mammography corpora from Stanford, UCSF, and the Marshfield Clinic. • Used 34,489 reports from Stanford’s radTF database and 146,972 reports from UCSF Medical Center to construct set of textual patterns indicative of each breast composition class. • Independent test set composed of 500 annotated reports from Stanford and 100 from the Marshfield Clinic.

  8. Methods: Evaluation • Reports independently annotated by radiologists. • Annotators blinded to automatically-extracted composition classes when assessing reports.

  9. Methods: Rule Construction • Using free-text mammography reports as its input, our algorithm classifies each into one of five classes: • predominantly fat (class 1) • scattered densities (class 2) • heterogeneously dense (class 3) • dense (class 4) • no descriptors present (class 5) • Annotators blinded to automatically-extracted composition classes when assessing reports.

  10. Results

More Related