1 / 10

Computing in 571

Computing in 571. Programming. For standalone code, you can use anything you like That runs on the department cluster For some exercises, we will use a Python-based toolikt. Department Cluster. Resources on CLMA wiki http://depts.washington.edu/uwcl/twiki/bin/view.cgi/ Main

lilike
Download Presentation

Computing in 571

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Computing in 571

  2. Programming • For standalone code, you can use anything you like • That runs on the department cluster • For some exercises, we will use a Python-based toolikt

  3. Department Cluster • Resources on CLMA wiki • http://depts.washington.edu/uwcl/twiki/bin/view.cgi/Main • Installed corpora, software, etc. • patas.ling.washington.edu • dryas.ling.washington.edu

  4. Condor • Distributes software processes to cluster nodes • All homework will be tested with condor_submit • See documentation on CLMA wiki • Construction of condor scripts

  5. NLTK • Natural Language Toolkit (NLTK) • Large, integrated, fairly comprehensive • Stemmers • Taggers • Parsers • Semantic analysis • Corpus samples, etc • Extensively documented • Pedagogically oriented • Implementations strive for clarity • Sometimes at the expense of speed/efficiency

  6. NLTK Information • http://www.nltk.org • Online book • Demos of software • HOWTOs for specific components • API information, etc

  7. Python & NLTK • NLTK is installed on cluster • Use python2.6 with NLTK • NLTK data is also installed • /corpora/nltk/nltk-data • NLTK is written in Python • http://www.python.org; http://docs.python.org • Many good online intros, fairly simple

  8. Python & NLTK • Interactive mode allows experimentation, introspection • patas$ python2.6 • >>> import nltk • >>> dir(nltk) • ….. AbstractLazySequence', 'AffixTagger', 'AnnotationTask', 'Assignment', 'BigramAssocMeasures', 'BigramCollocationFinder', 'BigramTagger', 'BinaryMaxentFeatureEncoding', • >>> help(AffixTagger) • …… • Prints properties, methods, comments,…

  9. Turning in Homework • Class CollectIt • Linked from course webpage • Homeworks due Tuesday night • CollectIt time = Wednesday (00:01) • Should submit as hw#.tar • Where # = homework number • Tar file contains top-level condor scripts to run

  10. Programming with NLTK

More Related