180 likes | 260 Views
on my own. Towards standardisation: Special characters in European dialectological projects. ‚Dictionary of Bavarian dialects in Austria‘ ‚Database of Bavarian dialects in Austria‘. eveline.wandl-vogt@oeaw.ac.at dbo @ ema. Presentation. Background towards digital dictionary, database
E N D
onmyown Towards standardisation:Special characters in European dialectological projects ‚Dictionary of Bavarian dialects in Austria‘‚Database of Bavarian dialects in Austria‘ eveline.wandl-vogt@oeaw.ac.atdbo@ema
Presentation • Background • towards digital dictionary, database • Definition of the problem • dictionary (WBÖ) • database (DBÖ) • Towards a solution • Wien initiative „Sonderzeichen“ • dbo@ema: dinamlex • Other projects
background: collect, view, document, interpretMain steps into the digital world • 1911: establishment of the Kommission zur Schaffung des Österreichisch-Bayerischen Wörterbuches • 1913-1933: mailing of 109 questionnaires • 1963: beginning of the publication • 1993: rationalisation concept 1 – beginning of the project DBO • 1995: beginning of the project textcorpus to the WBO • 1998: new rationalisation concept • 2004: beginning of the project WBOe (eBook) • 2007: beginning of the project dbo@ema • 2011: prospective completion of DBO input • ~2020: prospective completion of WBO publication
background:What is the WBO? • Dialect dictionary Austria without the province of Vorarlberg (due to linguistic criteria)nearby regions in Italy, the Czech Republic, Slowakia, Hungary and Sloweniaand some linguistic enclaves in this nations • material: ~ 5 mio. paper slips (rough estimate) • 7 scientists; 8 typists broad lexicographic description of a word(phonetics, etymology, meanings, examples of spoken and written language)from the beginning (first written sources)to actual dialect
background:mapsBavarian dialect regions dealt with in the WBO
definition of the problem: WBOThe WBO-phonetics Why are there special characters? phonetics are an important qualitiy criteria for the WBO • System developed by Lenz (~ 1900 / ZDM) and nearly without any changes published by Teuchert 1924/25 in the periodical Teuthonista and since then called Teuthonista • Latin letters with diacriticsfew basic letters, many possibilities vowels: ~ 250 different vocals (nasalisation, open/close, long/short):
example: tupfecht ‚dotty, pockmarked, freckled‘The WBO-article and WBO-phonetics • Positionphonetics • Position phrases / idioms • Quotations (literature)
definition of the problem: WBOspecial characters in the WBO-phonetics Why hold on to the system Teuthonista and not change to IPA? due to scientific traditions accepted system in german dialectology • Internal I: WBO:(full) comparability between entry A (1961) and entry Z (2020) • Internal II: (better) comparability with other projects at the I DINAMLEX • External: (better) comparability with other projects
definition of the problem: DBOThe DBO phonetics Principle of diversity • lot of coworkers and collectors (> 2.000) → many different phonetic systems • digitisation → documentation of the whole information of the original document → as few interpretation as possible in the first step
example: Grenze ‚barrier, borderline‘The DBO-entry and DBO-phonetics *A* HK 340, g340#1481.1 = g3400126.pla#109.1 *HL* Gränze:1 *NL* Gränetz:1 *QU* Kröllendf. NÖ; Steinhauser === *LT1* gr~-an~itß [f] *LT2* gr~-anÃtß [f] *LT3* gr~-an~itßn [f,pl] *LT4* gr~-anÃtßn [f,pl] *BD/LT1/2* Grenze *VRWO* feiner #gr~;entß#; #gr~;entßn# *VRWB* +2D *ETO* mhd. #grenitze#, d.i. (im 14. Jhd.) bair. #gränitze# < poln. #granice, granica# ********************
definition of the problem: WBO+DBOThe dinamlex-phonetics Basic principles for digital publication and the database • digitisation means archiving and documentation • digitisation must not mean loss of quality • digitisation should increase the quality of the dictionary itself • standardisation • interpretation • (internal / external) crosslinking
towards solutions: dbo@emacreating a font: dinamlex Database input • Unavailable special character in the font • encoded DBO • new characters in the font dinamlex.otf, dinamlex.ttf • Problem: New UNICODE-characters
towards solutions: new unicode charactersbavarian f:fversus f Phonetic: lenis [f] in contrast to fortis [f] Italic [f]in quotations, running titles, links BWB 2 functions:phonetic [f]italic [f]
towards solutions: new unicode charactersviennese a latin small letter "a" with ring between diaeresis phonetic sign used by the WBOe for [a] (rounded, open)
towards solutions: similar projectsOther transcription systems • English dialect dictionary (Wrigth; 1889-1905)(University Innsbruck / Austria)contact • DiWA (1888 / 2001-2008), ISSG (University Marburg / Germany)use dinamlex • Some Linguistic Atlases – ZRCola(Slovene Academy of Sciences / Slovenia) cooperation and coordination with I DINAMLEX
towards solutions: Slovene Academy of SciencesZRCola Problems • Private area: 3000 characters need much more space for all charactes • Standardisation of specific dialetological and historical characters Contact: Peter.Weiss@guest.arnes.si
towards solutions: standardisationWien initiative „Sonderzeichen“
onmyown towards standardisation.. … but why? eveline.wandl-vogt@oeaw.ac.atdbo@ema