1 / 7

The ACL Linked Anthology A proposal for the ACL Exec

The ACL Linked Anthology A proposal for the ACL Exec. Brett Powley (Macquarie Univ.) Min-Yen Kan (Nat’l. Univ. of S’pore). Our goal for the Linked Anthology. Provide full document metadata; Interdocument links for all documents in the ACL Anthology;

brigid
Download Presentation

The ACL Linked Anthology A proposal for the ACL Exec

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. The ACL Linked AnthologyA proposal for the ACL Exec Brett Powley (Macquarie Univ.)Min-Yen Kan (Nat’l. Univ. of S’pore) Powley and Kan

  2. Our goal for the Linked Anthology • Provide full document metadata; • Interdocument links for all documents in the ACL Anthology; • open-source tools for automatically producing this data. Powley and Kan

  3. In detail For each document, provide: • A canonical extracted text stream; • Metadata as BibTeX; • References extracted from the document and segmented into fields, as BibTeX; • Sentences with citations linked to the appropriate reference; • Links from each reference to the target document, iff in the Anthology. Powley and Kan

  4. Timeline • Current: Informal collaboration to extract text with digital anthology (dAnth) mailing list • Initial (3-6 months): Consolidation of resources; • Intermediate (1-2 yrs): Consolidation of tools; • Sustainable (2-3 yrs): Automatic and web based tools. Powley and Kan

  5. Value to the community Provide 3 clear benefits: • it will provide a corpus that can be used for ongoing research • it will provide a resource which will enhance everyday use of the Anthology • it will provide a focus point for consolidation of state-of-the-art research in bibliographic data processing. Powley and Kan

  6. Novelty Why should we be doing this? We have Google Scholar, MS Libra, CiteSeer, Rexa and the ACL Anthology? Our answers: • A collaborative effort - over many participants • A standardized effort - testbeds for research, proposed tasks • Distributed results - mirrored worldwide, little support from ACL Exec Powley and Kan

  7. Thank you for listening Current work partially supported by dAnth members Cambridge Univ. DFKI Macquarie Univ. Nat’l Univ. of Singapore Penn State Univ. Univ. of Hiroshima Univ. of Melbourne Univ. of Michigan Powley and Kan

More Related