Arabic corpus annotation currently uses the Standard Arabic Morphological Analyzer (SAMA). SAMA generates various morphological and lemma choices for each token; manual annotators then pick the correct choice out of these. The problem is, there are dozens of choices for each token, typically 20 to 8