An Introduction to the TempEx Module
George Wilson, MITRE Corporation
January 30, 2002
What is TempEx?
• Perl module that implements the TIDES Temporal Annotation Guidelines
• Handles many formats
  - Feb. 10, Feb. 10th, February Tenth
• Some parts of the standard are not fully implemented
  - Embedded expressions: Two weeks ago tomorrow
  - Unknown components: June 10 (VAL = XXXX0610)
• Some very small extensions
  - Easter gets an ALT_VAL
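To make the "many formats" and "unknown components" points concrete, here is a minimal sketch in Python (not the module's Perl, and not TempEx's actual algorithm) that maps the three spellings above to one VAL with the year left as XXXX. The function name `month_day_val` and the small ordinal table are assumptions made for illustration only.

```python
import re

MONTH_NAMES = ["january", "february", "march", "april", "may", "june", "july",
               "august", "september", "october", "november", "december"]

# Accept both full names and three-letter abbreviations ("february", "feb").
MONTHS = {}
for number, name in enumerate(MONTH_NAMES, start=1):
    MONTHS[name] = number
    MONTHS[name[:3]] = number

# Deliberately incomplete table of spelled-out ordinals (illustrative only).
ORDINAL_WORDS = {
    "first": 1, "second": 2, "third": 3, "fourth": 4, "fifth": 5,
    "sixth": 6, "seventh": 7, "eighth": 8, "ninth": 9, "tenth": 10,
}


def month_day_val(text):
    """Return a VAL such as 'XXXX0210' for a month-day phrase, or None."""
    parts = text.strip().split()
    if len(parts) != 2:
        return None
    month = MONTHS.get(parts[0].rstrip(".").lower())
    day_token = parts[1].lower()
    digits = re.fullmatch(r"(\d{1,2})(?:st|nd|rd|th)?", day_token)
    day = int(digits.group(1)) if digits else ORDINAL_WORDS.get(day_token)
    if month is None or day is None or not 1 <= day <= 31:
        return None
    # The year is unknown, so it stays as the placeholder XXXX.
    return f"XXXX{month:02d}{day:02d}"


for phrase in ("Feb. 10", "Feb. 10th", "February Tenth", "June 10"):
    print(phrase, "->", month_day_val(phrase))
```

All three February spellings come out as XXXX0210, and June 10 as XXXX0610, matching the example on the slide.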
Sample Output (POS tags removed)
I got up <TIMEX2 TYPE="DATE" VAL="20010216TMO" MOD="EARLY">early this morning</TIMEX2>.
I ate lunch <TIMEX2 TYPE="TIME" VAL="20010216T1207">an hour and a half ago</TIMEX2>.
In <TIMEX2 TYPE="DATE" VAL="FUTURE_REF">the future</TIMEX2>, I will know better.
I went to Hong Kong <TIMEX2 TYPE="DATE" VAL="2000W40">the week of October third</TIMEX2>.
I went to Hong Kong <TIMEX2 TYPE="DATE" VAL="2000W42">the third week of October</TIMEX2>.
Reference Date: 02/16/2001 13:37:00
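Downstream code usually wants the attributes rather than the inline markup. The following is a minimal Python sketch, assuming flat (non-nested) TIMEX2 elements like those in the sample output. The helper name `extract_timex2` is hypothetical and is not part of TempEx.

```python
import re

# Matches one flat TIMEX2 element; nested (embedded) expressions are not handled.
TIMEX2 = re.compile(r"<TIMEX2\s+([^>]*)>(.*?)</TIMEX2>", re.DOTALL)
ATTR = re.compile(r'(\w+)="([^"]*)"')


def extract_timex2(text):
    """Return one dict per TIMEX2 element: its attributes plus the tagged text."""
    results = []
    for attrs, body in TIMEX2.findall(text):
        entry = dict(ATTR.findall(attrs))
        entry["text"] = body
        results.append(entry)
    return results


sample = (
    'I got up <TIMEX2 TYPE="DATE" VAL="20010216TMO" MOD="EARLY">'
    'early this morning</TIMEX2>. '
    'I ate lunch <TIMEX2 TYPE="TIME" VAL="20010216T1207">'
    'an hour and a half ago</TIMEX2>.'
)

for entry in extract_timex2(sample):
    # MOD is optional, so fall back to "-" when it is absent.
    print(entry["VAL"], entry.get("MOD", "-"), entry["text"])
```

A regular expression is enough for flat output like this; input with nested tags would call for a real XML parser.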
TempEx Usage
• Preparation: POS tagging, article date
• Must build a driver program that handles document-specific formatting
• Driver calls subroutines
  - TE_TagTIMEX
  - TE_AddAttributes
• Distribution contains a sample driver
• Module provides some help with the driver
  - Date2ISO
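The driver has to supply the article date, because relative expressions are resolved against it. The sketch below is Python, not the module's Perl, and it does not reproduce the real Date2ISO interface, whose signature is not given here. The helper `to_iso_basic` is a hypothetical stand-in that shows the kind of conversion involved, using the reference date and two VALs from the sample output.

```python
from datetime import datetime, timedelta


def to_iso_basic(stamp, fmt="%m/%d/%Y %H:%M:%S"):
    """Parse a US-style timestamp; return (datetime, 'YYYYMMDDTHHMM')."""
    moment = datetime.strptime(stamp, fmt)
    return moment, moment.strftime("%Y%m%dT%H%M")


# Reference date as printed with the sample output.
reference, reference_val = to_iso_basic("02/16/2001 13:37:00")

# "an hour and a half ago" resolved against the reference date.
earlier = reference - timedelta(hours=1, minutes=30)
earlier_val = earlier.strftime("%Y%m%dT%H%M")

# "the week of October third": ISO week of 3 October. The year 2000 is taken
# from the sample's VAL rather than derived here.
iso_year, iso_week, _ = datetime(2000, 10, 3).isocalendar()
week_val = f"{iso_year}W{iso_week:02d}"

print(reference_val)  # 20010216T1337
print(earlier_val)    # 20010216T1207
print(week_val)       # 2000W40
```

The last two values agree with the VAL attributes in the sample output.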
TimeTag.pl Usage
Usage: TimeTag.pl [-h -FD -TH -Dn -HLn] files
(n is a number)
  h    = help message
  FD   = First Date found will be used as reference date
  FDNW = First Date found will be used as reference date, No Warnings given
  TH   = Tag Header, not tagged by default
  D    = Debug Level: 0, 1, 2 (default = 0)
  HL   = Heuristic Level: 0, 1, 2, 3 (default = 3)
Performance
Interannotator agreement
                  TIMEX   VAL     MOD
Human x Human     0.789   0.889   0.871
TempEx x Human    0.624   0.705   0.301
Speed: 0.5 megabytes/minute