170 likes | 367 Views
Special Indices for LaaLaLaa Lyric Analysis & Generation Framework. Dr. V. Madhan Karky, Tamil Computing Lab ( TaCoLa ), College of Engineering Guindy , Anna University, Chennai. Overview. Objective Introduction Background Rhyme Schemes in Tamil Meter Pattern System Architecture
E N D
Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Dr. V. Madhan Karky, Tamil Computing Lab (TaCoLa), College of Engineering Guindy, Anna University, Chennai.
Overview • Objective • Introduction • Background • Rhyme Schemes in Tamil • Meter Pattern • System Architecture • Indexing Structure • Indexing Algorithm • Results and Analysis
Objective • To build special indices for the LaaLaLaa lyric analysis and generation framework to facilitate faster retrieval based on • Meter Pattern • Rhyme
Introduction • Tamil is a vibrant language with a rich grammar, vocabulary, an inherent poetic flavor. • About 1000 lyrics are being written every year as private albums, jingles and as original soundtracks of mainstream movies.
Background • WASP (Pablo Grevas (2000)) splits a given block of text, identifies patterns and fits words from the vocabulary to get verses of similar pattern. • COLIBRI (Agudo, Grevas, Calero (2002)) follows a case based approach to poem generation.
Background • Tra-la-Lyrics (Oliviera, Cardoso, Pereira (2005)) finds out the beat pattern of the midi file and places words with similar syllabic division and stress pattern. • Automatic generation of tamil lyrics for melodies (Ramakrishnan, Kuppan, Devi (2009)) converts the midi file to KNM reprsn and fits words to it from a corpus. Phrases are meaningful as parts only. No edhugai, monai or iyaybu.
Background • LaaLaLaa (Sowmiya, Karky (2010)) talks of splitting raw text from midi file to templates and filling them with words from a wordnet according to the pattern mined from an existing corpus of lyrics with due consideration to rhyme, meaning and flow.
Rhyme Schemes in Tamil • Two words are said to rhyme in • Monai (மோனை) - first letters are the same. • Edhugai (எதுகை) - second letters are the same. • Iyaibu (இயைபு) - last letters are the same.
Rhyme Schemes in Tamil • Examples: • பறவைand பச்சை rhyme in monai. • அருவி and விருப்பு rhyme in edhugai. • யாக்கை and வாழ்க்கை rhyme in iyaibu. • அருவி and குருவி rhyme in edhugai and iyaibu. • கவிதைகள் and கவிஞர்கள் rhyme in all the three schemes.
Meter Pattern • Maathirai (மாத்திரை) - time taken to wink an eyelid. • Maathirai based classification of Tamil alphabets. • Nedil (N) (நெடில்) - Those alphabets which are pronounced for the time interval of 2 maathirai. • Kuril (K) (குறில்) - Alphabets which take 1 maathirai to be pronounced. • Mei (M) (மெய்) - Alphabets which are pronounced for 0.5 maathirai. • Meter pattern of a word refers to its Kuril Nedil Mei pattern. • For example, the Meter pattern of the word பாடல் is NKM as பா is a Nedil(N), ட is a Kuril(K) and ல் is a Mei(M).
System Architecture Lyric DB Word Object Convertor Rhyme Extractor Rhyme Pattern Extractor Index Builder Rhyme Meter Index
Result and Analysis Retrieval Time(in milliseconds) Word
Results and Analysis • Retrieval complexity of both the approaches tested using a dataset of 500 tamil words. • The average retrieval time in • Word indexed approach - 875.47millisecond • Meter Rhyme Indexed approach – 1.90millisecond
Results and Analysis • The drastic decrease in retrieval time from O(α) to O(1) [α is the number of words in the database] is due to • The use of hash-tables which are efficient for retrieval. • Having separate hash-tables for the மோனை, எதுகைand இயைபு of each POS.