Information-Analytical System “Manuscript”: technologies and tools of creation of electronic collections of ancient and medieval documents. Victor BARANOV, Linguistics Department, Izhevsk State Technical University; Laboratory of Computer-Aided Philological Research, Udmurtia State University.
Title page of the portal of IAS “Manuscript” Digital Historical Corpora
Model of hierarchies and subnets of manuscript and text units
Net of linguistic relationships [Diagram. Example text: “<…> се быша дроузи мои . <…>”. Levels shown: text, predicate part, syntactic group, word-form, individual characters. Relationship types shown: word-combination, co-ordination, dependence. Legend: relationship, means of relationship, end of a “single” relationship, end of a “multiple” relationship.]
Model of the Manuscript system
Editor OldEd: main panels
Editor OldEd: Text input and editing
Editor OldEd: Fragmentation of the manuscript texts into units and relationships with the dictionary units (panels: Fragments, Properties of fragments, Dictionary of fragments)
Editor OldEd: Visualization of unit relationships. Symbol; geometric hierarchy: line, page; linguistic hierarchy: word-form, normalized forms; dictionary: lemma, properties and values of the lemma; dictionary: word-forms of texts
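The slide above names two hierarchies over the same symbols (a geometric one of lines and pages, a linguistic one of word-forms) plus dictionary links to lemmas. A minimal sketch of that idea follows, assuming each unit is a span over a shared symbol sequence. The `Unit` class, its fields, and the one-entry lemma table are hypothetical and are not taken from the Manuscript system.

```python
from dataclasses import dataclass, field


@dataclass
class Unit:
    """A unit of any hierarchy, stored as a span over the symbol sequence."""
    kind: str                      # e.g. "line", "page", "word-form"
    start: int                     # index of the first symbol (inclusive)
    end: int                       # index just past the last symbol (exclusive)
    props: dict = field(default_factory=dict)


def text_of(unit, symbols):
    """Return the characters covered by a unit."""
    return "".join(symbols[unit.start:unit.end])


def word_forms(symbols):
    """Split the symbol sequence into word-form units at spaces."""
    units, start = [], None
    for i, ch in enumerate(symbols + [" "]):   # sentinel space closes the last word
        if ch != " " and start is None:
            start = i
        elif ch == " " and start is not None:
            units.append(Unit("word-form", start, i))
            start = None
    return units


symbols = list("се быша дроузи мои")

# Geometric hierarchy: here the whole fragment sits on one line.
line = Unit("line", 0, len(symbols))

# Linguistic hierarchy: word-forms over the same symbols.
words = word_forms(symbols)

# Dictionary link: a toy word-form -> lemma table (illustrative only).
lemmas = {"быша": "быти"}
for w in words:
    form = text_of(w, symbols)
    if form in lemmas:
        w.props["lemma"] = lemmas[form]
```

Because both hierarchies point at the same symbol indices, a word-form can be located on its line or page without duplicating the text.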
Editor OldEd: Page layout
Result of creation of the layout on the site (with marginalia)
Automated lemmatization and establishing relationships between words and lemmas
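One simple way to picture linking words to lemmas is a dictionary lookup that leaves unknown forms for manual treatment. This is only a sketch under that assumption; the slides do not describe the actual algorithm, and the function name and the three toy dictionary entries below are illustrative.

```python
def lemmatize(tokens, form_to_lemma):
    """Link each token to a lemma when the dictionary knows the form.

    Returns (linked, unresolved): linked is a list of (token, lemma)
    pairs, unresolved is the list of tokens left for manual linking.
    """
    linked, unresolved = [], []
    for tok in tokens:
        lemma = form_to_lemma.get(tok.lower())
        if lemma is None:
            unresolved.append(tok)
        else:
            linked.append((tok, lemma))
    return linked, unresolved


# Toy dictionary, for illustration only.
FORM_TO_LEMMA = {"быша": "быти", "дроузи": "дроугъ", "мои": "мои"}

linked, unresolved = lemmatize("се быша дроузи мои".split(), FORM_TO_LEMMA)
```

Here `unresolved` collects the forms missing from the toy dictionary, which an editor would then link by hand.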
Electronic edition: search page (areas: Collections & Manuscripts, Search criteria, Search result)
Search result: word index and concordance
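A word index and a concordance can be sketched in a few lines: the index maps each word-form to its positions, and the concordance shows each hit with neighbouring words. The function names and the `width` parameter are assumptions made for this illustration, not part of the system shown on the slide.

```python
from collections import defaultdict


def build_index(tokens):
    """Map each word-form to the list of token positions where it occurs."""
    index = defaultdict(list)
    for pos, tok in enumerate(tokens):
        index[tok].append(pos)
    return dict(index)


def concordance(tokens, word, width=2):
    """Return (left context, hit, right context) for every occurrence of word."""
    lines = []
    for pos, tok in enumerate(tokens):
        if tok == word:
            left = " ".join(tokens[max(0, pos - width):pos])
            right = " ".join(tokens[pos + 1:pos + 1 + width])
            lines.append((left, tok, right))
    return lines


tokens = "се быша дроузи мои".split()
index = build_index(tokens)
kwic = concordance(tokens, "дроузи", width=1)
```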
Module of retrievals: selection of the text
Module of retrievals: selection of the unit
Module of retrievals: setting the unit properties and values
Module of retrievals: saving the query
Module of retrievals: specifying the composition of the query result
Comparative index of the word-forms
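A comparative index can be thought of as a table with one row per word-form and one column per text, holding occurrence counts. The sketch below assumes whitespace tokenization; the function name and both sample texts are illustrative, and the second text is invented purely to show a form missing from one column.

```python
from collections import Counter


def comparative_index(texts):
    """Build {word-form: {text name: count}} over all given texts."""
    counts = {name: Counter(body.split()) for name, body in texts.items()}
    forms = sorted(set().union(*counts.values()))
    # A Counter returns 0 for absent keys, so missing forms show up as 0.
    return {form: {name: counts[name][form] for name in texts} for form in forms}


# Toy input: "Text 2" is made up for the illustration.
texts = {
    "Text 1": "се быша дроузи мои",
    "Text 2": "дроузи мои быша",
}
table = comparative_index(texts)
```

Python's default sort orders strings by code point, which is only an approximation of a proper alphabetical order for Old Russian or Old Slavonic.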
Comparative index of the fragments
Grammar dictionaries: grammar dictionary of the modern Russian language; grammar dictionary of the Old Russian language; grammar dictionary of the Old Slavonic language; grammar dictionary pseudo-elements; Text 1, Text 2, Text 3, Text 4, Text 5, Text 6, …, Text N
Grammar dictionaries: retrieval form
Grammar dictionaries: bringing the Old Russian word-forms to the lemma
Grammar dictionaries: obtaining the paradigm of a lemma
Electronic editions
Electronic edition: reverse index of word-forms and context
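A reverse index lists word-forms ordered by their endings, which groups forms with the same inflection together. A minimal sketch, assuming plain code-point ordering of the reversed spelling; the function name is hypothetical.

```python
def reverse_index(forms):
    """Return the distinct word-forms sorted by their reversed spelling."""
    return sorted(set(forms), key=lambda form: form[::-1])


forms = "се быша дроузи мои".split()
ordered = reverse_index(forms)
```

With this input, the two forms ending in “и” end up next to each other. A real edition would likely need a collation suited to the historical alphabet rather than code-point order.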
Acknowledgment: The work on the creation of IRS Manuscript is being carried out with support from the Russian Foundation for Basic Research (Grant # 05-07-90217в). The work on the creation of the automated morphological analyzer is supported by the Russian Foundation for the Humanities (Grant # 05-04-12408в).
Contacts: Laboratory of Computer-Aided Philological Research, Udmurtia State University; Linguistics Department, Izhevsk State Technical University; Izhevsk, Russia. Victor Baranov, baranov@udm.ru, http://manuscripts.ru/index_en.html