Self-Organization of Speech Sound Inventories in the framework of Complex Networks

Animesh Mukherjee (Roll No.: 05CS9405) Department of Computer Science & Engg., Indian Institute of Technology, Kharagpur

Overview of the Presentation • Basics of Speech Sound Inventories • Motivation & Objective • Occurrence Network of Consonants • Co-occurrence Network of Consonants • Co-occurrence Patterns in Consonant Inventories • Network Methods applied to Vowel Inventories • Conclusions and Future Directions

Sound Inventories • A repertoire of unique sounds (aka phonemes) that the speakers of a language use for communication • English Consonants: /p/ as in pit, /b/ as in bit, /s/ as in send, /z/ as in zip, /r/ as in rat, … • Bangla Consonants: /p/ as in pAn, /b/ as in bAn, /ɖ/ as in ɖAl, /ɖʱ/ as in ɖhol, /r/ as in rAtri, …

Representation of Phonemes • Articulatory feature based representation • Place of Articulation (labial, velar, alveolar, dental etc.) • Manner of Articulation (plosive, fricative, affricate, nasal etc.) • Phonation (voiced, voiceless) [Figure: Mermelstein’s Model]

Choice of Phonemes • Given a set of phonemes, how likely is it that the set corresponds to a real language inventory? • Does any random subset of phonemes qualify as a real inventory? • Certainly Not! • What are the forces governing the structure of an inventory?

Forces Governing the Structure • A Linguistic System – How does it look? • Speaker (utters /a/): desires “ease of articulation” • Listener / Learner (hears /a/): desires “perceptual contrast” / “ease of learnability” • Forces shaping the structure are opposing – There has to be a non-trivial solution

Motivation – Choice of the Problem • Vowel inventories • Linguistic arguments (Wang 1971) • Numerical simulations (Liljencrants & Lindblom 1972, Lindblom 1986, Schwartz et al. 1997) • Genetic algorithms (Ke et al. 2003) • Multi-agent simulations (Boer 2000) • Organized based on the principle of maximal perceptual contrast (mainly smaller inventories) • For instance if a language has three vowels then in more than 95% of the cases they are /a/,/i/, and /u/.

Motivation – Choice of the Problem • Consonant Inventories • Linguistic arguments (Clements 2003, Clements 2008, Boersma 1998, Hockett 1974, Lindblom & Maddieson 1988) • Studies limited to certain specific properties • They are much larger in size with many more articulatory/acoustic features • No single force is sufficient to explain their organization • A complex interplay of forces collectively shapes their structure.

Motivation – Modeling Methodology • We adopt a complex network approach to capture the self-organization of the consonant inventories • A versatile modeling methodology → view & solve the problem from an alternative perspective • Enormous success in explaining various dynamical properties of language (Adilson 2002, Ferrer-i-Cancho & Sole 2001, Gruenenfelder & Pisoni 2005, Kapatsinski 2006, Sigman & Cecchi 2002) • Easy applicability in modeling this particular problem pertaining to sound inventories

Objective • Representation of the Inventories • How can the structure of the consonant inventories be accurately represented within the framework of complex networks? • Analysis of the Inventory Structure • How should the network(s) constructed be analyzed in order to extract meaningful results? • Synthesis of the Inventory Structure • Explain the emergence of the different statistical properties (obtained from the analysis) by means of generative mechanisms, usually based on models of network growth

Occurrence Network of Consonants • Phoneme-Language Network (PlaNet) • Bipartite • VL (set of nodes in the language partition) • VC (set of nodes in the consonant partition) • There is an edge e ∈ E between vl ∈ VL and vc ∈ VC iff the consonant c occurs in the language l • PlaNet constructed from the UCLA Phonological Segment Inventory Database (UPSID) → 317 languages with 541 unique consonants appearing across them [Figure: PlaNet, with language nodes L1 to L4 linked to consonant nodes /θ/, /ŋ/, /m/, /d/, /s/, /p/]
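The bipartite construction described on this slide can be sketched in a few lines. This is only an illustration: the four toy inventories below are hypothetical (the real PlaNet is built from the 317 UPSID languages), and `build_planet` is a name chosen here, not one from the original work.

```python
from collections import defaultdict

# Hypothetical toy inventories, used only to illustrate the construction.
toy_inventories = {
    "L1": {"/m/", "/p/", "/s/"},
    "L2": {"/m/", "/p/", "/d/", "/ŋ/"},
    "L3": {"/p/", "/s/", "/θ/"},
    "L4": {"/m/", "/d/", "/s/"},
}

def build_planet(inventories):
    """Return the edge set and the degree of every node in both partitions."""
    # One edge per (language, consonant) pair where the consonant occurs.
    edges = {(lang, c) for lang, inv in inventories.items() for c in inv}
    # Degree of a language node = size of its consonant inventory.
    lang_degree = {lang: len(inv) for lang, inv in inventories.items()}
    # Degree of a consonant node = number of languages it occurs in.
    cons_degree = defaultdict(int)
    for _, c in edges:
        cons_degree[c] += 1
    return edges, lang_degree, dict(cons_degree)

edges, lang_degree, cons_degree = build_planet(toy_inventories)
```

The two degree dictionaries are the raw material for the degree distributions of the language and consonant partitions.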

Degree Distribution (DD) • $p_k$: fraction of nodes with degree = k; $P_k$: fraction of nodes with degree ≥ k • DD of the language nodes follows a β-distribution: $p_k = \mathrm{beta}(k)$ with α = 7.06 and β = 47.64, i.e. $p_k = \frac{\Gamma(54.7)}{\Gamma(7.06)\,\Gamma(47.64)}\, k^{6.06} (1-k)^{46.64}$; $k_{min} = 5$, $k_{max} = 173$, $k_{avg} = 21$ • DD of the consonant nodes follows a power-law with an exponential cut-off: $P_k = k^{-0.71}$ [Plots: $p_k$ against degree (k) for the language nodes; $P_k$ against degree (k) for the consonant nodes, with the exponential cut-off marked]

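Growth of PlaNet • Rules of the game: • A new language is born • Chooses μ distinct phonemes from the set of existing phonemes preferentially based on the degree: a phoneme of degree k is chosen with probability $\frac{\gamma k + 1}{\sum (\gamma k' + 1)}$, where the sum runs over all phonemes not already chosen [Figure: bipartite network of Languages and Phonemes] Coling-ACL, 2006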
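A minimal simulation sketch of these rules follows. It assumes that a phoneme's degree is updated only after the new language has picked all μ of its phonemes, which the slide does not state; the function name `grow_planet` and the parameter values in the example call are illustrative, not taken from the cited paper.

```python
import random

def grow_planet(n_languages, n_consonants, mu, gamma, seed=0):
    """Grow a bipartite network by degree-based preferential attachment."""
    rng = random.Random(seed)
    degree = [0] * n_consonants          # consonant partition is fixed in size
    inventories = []
    for _ in range(n_languages):         # a new language is born
        available = list(range(n_consonants))
        chosen = []
        for _ in range(mu):
            # Kernel (gamma * k + 1), normalised over phonemes not yet chosen.
            weights = [gamma * degree[c] + 1 for c in available]
            pick = rng.choices(available, weights=weights, k=1)[0]
            chosen.append(pick)
            available.remove(pick)       # enforce distinct phonemes
        for c in chosen:                 # assumption: degrees updated per language
            degree[c] += 1
        inventories.append(chosen)
    return degree, inventories

degree, inventories = grow_planet(n_languages=30, n_consonants=40, mu=5, gamma=2.0)
```

Larger γ strengthens the rich-get-richer effect, while γ = 0 reduces the choice to uniform sampling without replacement.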

Analytical Solution for the Growth Model • Notations: t – #nodes in VL; N – #nodes in VC (fixed and finite); $p_{k,t}$ – $p_k$ after adding t nodes • Markov Chain Formulation (μ=1) [equations not preserved in this transcript] Europhysics Letters, 2007
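The equations of this slide are missing from the transcript. One formulation consistent with the attachment rule of the growth model, offered as a reconstruction under that assumption rather than as a quotation of the published result, is the following. With μ = 1 the total degree of the consonant partition after t languages is t, so the kernel (γk + 1) normalises to γt + N:

```latex
p_{k,t+1} = \bigl(1 - \tilde{P}(k,t)\bigr)\, p_{k,t} + \tilde{P}(k-1,t)\, p_{k-1,t},
\qquad
\tilde{P}(k,t) = \frac{\gamma k + 1}{\gamma t + N}, \quad 0 \le k \le t .
```

The first term covers a consonant node of degree k that is not chosen by the new language; the second covers a node of degree k − 1 that is chosen. The symbol $\tilde{P}$ is notation introduced here.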

The Hard Part of the Analysis • Average degree of the VC partition, i.e., (μt)/N, diverges as t → ∞ • Methods based on steady-state and continuous time assumptions fail ($p_{k,t} \neq p_{k,t+1}$ as t → ∞) • Closed-form solution using linear algebra tricks [equation not preserved in this transcript], where η = N/γ Europhysics Letters, 2007
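As a numerical sanity check, the sketch below iterates a μ = 1 recurrence with attachment probability (γk + 1)/(γt + N), which is what the growth rule implies, and compares it with a beta-binomial expression in η = N/γ. That closed form is an assumption here: it follows from viewing the process as a Pólya urn, and it is not copied from the cited paper. Function names are illustrative.

```python
from math import exp, lgamma

def degree_distribution_by_recurrence(t_max, N, gamma):
    """Iterate p_{k,t+1} = (1 - P(k,t)) p_{k,t} + P(k-1,t) p_{k-1,t}."""
    p = [1.0]                                  # t = 0: every node has degree 0
    for t in range(t_max):
        denom = gamma * t + N                  # sum of (gamma*k_i + 1) over N nodes
        new = [0.0] * (len(p) + 1)
        for k, pk in enumerate(p):
            attach = (gamma * k + 1) / denom
            new[k] += (1 - attach) * pk        # node not chosen
            new[k + 1] += attach * pk          # node chosen, degree k -> k + 1
        p = new
    return p

def log_beta(a, b):
    return lgamma(a) + lgamma(b) - lgamma(a + b)

def degree_distribution_closed_form(t, N, gamma):
    """Beta-binomial form with a = 1/gamma and b = eta - 1/gamma (assumed)."""
    eta = N / gamma
    a, b = 1 / gamma, eta - 1 / gamma
    out = []
    for k in range(t + 1):
        log_binom = lgamma(t + 1) - lgamma(k + 1) - lgamma(t - k + 1)
        out.append(exp(log_binom + log_beta(k + a, t - k + b) - log_beta(a, b)))
    return out

p_rec = degree_distribution_by_recurrence(50, 20, 2.0)
p_cf = degree_distribution_closed_form(50, 20, 2.0)
max_gap = max(abs(x - y) for x, y in zip(p_rec, p_cf))
```

Because the distribution keeps changing with t, the comparison has to be made at a fixed finite t rather than in a steady state.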

Fitted Degree Distribution • t=317, N=541, μ=21 • Best fit for γ=14 [Plot: real data, simulation data and theory] Coling-ACL, 2006

Co-occurrence Network of Consonants • Phoneme-Phoneme Network (PhoNet) • One-mode projection of PlaNet onto the consonant nodes (VC) • Two nodes in this network are connected by an edge if they co-occur in the inventory of at least one language. The number of languages they co-occur in defines the weight of the edge. [Figure: PhoNet, with nodes /s/, /k/, /n/, /t/, /d/, /p/ joined by edges of weight 1 or 2]
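The one-mode projection can be sketched as follows. The toy inventories are hypothetical, and `project_to_phonet` is a name chosen for this example; edges are keyed by unordered pairs so that the weight of (u, v) and (v, u) is the same entry.

```python
from collections import Counter
from itertools import combinations

def project_to_phonet(inventories):
    """Weight of edge {u, v} = number of inventories containing both u and v."""
    weights = Counter()
    for inv in inventories.values():
        for u, v in combinations(inv, 2):
            weights[frozenset((u, v))] += 1
    return weights

# Hypothetical toy inventories, used only to illustrate the projection.
toy_inventories = {
    "L1": {"/m/", "/p/", "/s/"},
    "L2": {"/m/", "/p/", "/d/", "/ŋ/"},
    "L3": {"/p/", "/s/", "/θ/"},
    "L4": {"/m/", "/d/", "/s/"},
}
phonet = project_to_phonet(toy_inventories)
```

Here /m/ and /p/ co-occur in L1 and L2, so their edge has weight 2, while /p/ and /θ/ co-occur only in L3 and get weight 1.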

Degree of the nodes in One-mode • Easy to calculate if each node v in the growing partition enters with exactly μ (> 1) edges • Consider a node u in the non-growing partition having degree k • u is connected to k nodes in the growing partition, and each of these k nodes is in turn connected to μ − 1 other nodes in the non-growing partition • Hence degree q = k(μ − 1) Submitted to Europhysics Letters
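The relation q = k(μ − 1) can be checked on a small example. The sketch assumes that the one-mode degree q counts co-occurrences with multiplicity (that is, it is the sum of edge weights), which is the reading under which the counting argument above holds exactly; the inventories and the function name are illustrative.

```python
from collections import Counter

def bipartite_and_one_mode_degrees(inventories):
    """Return (k, q): bipartite degree and weighted one-mode degree per node."""
    k = Counter()
    q = Counter()
    for inv in inventories:
        for c in inv:
            k[c] += 1              # one more language contains c
            q[c] += len(inv) - 1   # c co-occurs with every other member
    return k, q

# Hypothetical inventories, all of the same size mu = 3.
mu = 3
fixed_size = [{"p", "b", "t"}, {"p", "t", "k"}, {"p", "m", "n"}]
k, q = bipartite_and_one_mode_degrees(fixed_size)
holds = all(q[c] == k[c] * (mu - 1) for c in k)
```

If the inventories had different sizes, `q[c]` would depend on which languages contain c and the simple proportionality would no longer hold.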

Degree Distribution • The degree distribution $p_u(q)$ of the nodes in the one-mode should be $p_u(q) = p_k$ with $k = q/(\mu - 1)$ when q is a multiple of μ − 1, and 0 otherwise • Not a good match at all!!

What if μ is not fixed?? • Relax the assumption that the size of the consonant inventories is a constant (μ) • Assume these sizes to be random variables sampled from a distribution fd • It is easy to show that, while the one-mode degree (q) for a node u is dependent on fd, its bipartite network degree (k) is not (the kernel of attachment roughly remains the same)

Analysis of Degree Distribution • If fd varies as a Normal Distribution N(μ, σ²) • If fd varies as a Delta function δ(d, μ) • If fd varies as an Exponential function E(λ=1/μ) • If fd varies as a Power-law function (power = –λ) Submitted to Europhysics Letters

Results of the Analysis • N = 1000, t = 1000, γ = 2, μ = 22 [Plots: degree distributions of the bipartite networks and of the one-mode networks]

Degree Distribution of PhoNet [Plot: real PhoNet compared with the model for fd = consonant inventory size distribution and for fd = constant] Submitted to Europhysics Letters

Clustering Coefficient of PhoNet • The Clustering Coefficient (CC) for a node i is the number of links between the nodes that are the neighbors of i divided by the number of links that could possibly exist between them. • CC of PhoNet is 0.89 • CC of the synthesized network obtained from our model is 0.35 • The model needs to be refined to increase the number of triangles in the emergent network to match CC
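The definition above translates directly into code. This sketch computes the unweighted clustering coefficient on an adjacency-set representation; whether the reported values for PhoNet use a weighted variant is not stated on the slide, so treat this as the plain textbook version. The small graph is hypothetical.

```python
def clustering_coefficient(adj, i):
    """Links among the neighbors of i divided by the maximum possible."""
    nbrs = adj[i]
    n = len(nbrs)
    if n < 2:
        return 0.0                      # no pair of neighbors to connect
    links = sum(1 for u in nbrs for v in nbrs if u < v and v in adj[u])
    return links / (n * (n - 1) / 2)

def average_clustering(adj):
    """Mean of the per-node clustering coefficients."""
    return sum(clustering_coefficient(adj, i) for i in adj) / len(adj)

# Hypothetical graph: a triangle a-b-c with a pendant node d attached to a.
toy_adj = {
    "a": {"b", "c", "d"},
    "b": {"a", "c"},
    "c": {"a", "b"},
    "d": {"a"},
}
cc = average_clustering(toy_adj)
```

Node a has three neighbors with one link among them (b to c), so its coefficient is 1/3; b and c each score 1 and d scores 0.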

Improving CC – Triad Model [Diagram: nodes L1 to L6 shown before (IF) and after (Then) a triad step, which is taken with probability pt]

Results • The triad model produces CC = 0.85 (within 3.5% of the real network) for 0.8 ≤ pt ≤ 0.9 • The degree distribution also remains unaffected Journal of Quantitative Linguistics, 2009

Patterns of Co-occurrence • Consonants tend to co-occur in groups or communities • These groups tend to be organized around a few distinctive features (based on: manner of articulation, place of articulation & phonation) – Principle of feature economy • Example (plosives): bilabial /b/ (voiced) and /p/ (voiceless); dental /d/ (voiced) and /t/ (voiceless) • If a language has [symbols not preserved] in its inventory then it will also tend to have [symbol not preserved]

Automatic Identification of Co-occurrence Patterns • Community structure analysis of PhoNet • Employ modified Radicchi et al. algorithm • Look for triangles where the weights on the edges are comparable. If they are comparable, the group of consonants co-occurs highly; otherwise it does not. • Calculate the strength S of each edge (u, v): $S = \frac{w_{uv}}{\sqrt{\sum_{i \in V_C - \{u,v\}} (w_{ui} - w_{vi})^2}}$ if $\sqrt{\sum_{i \in V_C - \{u,v\}} (w_{ui} - w_{vi})^2} > 0$, else S = ∞ • Remove edges with S less than a threshold η International Journal of Modern Physics C, 2007
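The edge-strength computation and the thresholding step can be sketched as below. A missing edge is assumed to have weight 0, which the slide does not say explicitly; the weights, node names and function names are hypothetical, and the sketch covers only the pruning step, not the full modified Radicchi et al. procedure.

```python
from math import inf, sqrt

def edge_strength(w, nodes, u, v):
    """S = w_uv / sqrt(sum over i not in {u, v} of (w_ui - w_vi)^2)."""
    def weight(a, b):
        return w.get(frozenset((a, b)), 0)   # assumption: absent edge = 0
    spread = sqrt(sum((weight(u, i) - weight(v, i)) ** 2
                      for i in nodes if i not in (u, v)))
    return weight(u, v) / spread if spread > 0 else inf

def prune(w, nodes, eta):
    """Keep only the edges whose strength is at least the threshold eta."""
    return {e: wt for e, wt in w.items()
            if edge_strength(w, nodes, *tuple(e)) >= eta}

# Hypothetical weighted network on four nodes.
toy_nodes = {"a", "b", "c", "d"}
toy_w = {
    frozenset(("a", "b")): 4,
    frozenset(("a", "c")): 3,
    frozenset(("b", "c")): 3,
    frozenset(("a", "d")): 1,
}
kept = prune(toy_w, toy_nodes, eta=1.0)
```

In this example the weak edge between a and d has S = 1/5 and is removed at η = 1, leaving the triangle a, b, c as one tightly knit group.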

Consonant Communities [Figures: communities obtained at η=0.35, η=0.60, η=0.72 and η=1.25]

Feature Economy: The Binding Force • $p_f$ – number of consonants in a community (C) in which feature f is present • $q_f$ – number of consonants in C in which feature f is absent • The probability that a consonant chosen at random from C has f is $\frac{p_f}{N}$ and that it does not have f is $\frac{q_f}{N}$ $(= 1 - \frac{p_f}{N})$, where N is the number of consonants in C • If F denotes the set of all features, $FE = -\sum_{f \in F} \left( \frac{p_f}{N} \log_2 \frac{p_f}{N} + \frac{q_f}{N} \log_2 \frac{q_f}{N} \right)$ • FE → Total discriminative capacity of the features in an inventory
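A short sketch of the FE computation, using the usual convention that a term with zero probability contributes 0. The community and its feature sets below are a hypothetical toy example, and `feature_entropy` is a name introduced here.

```python
from math import log2

def feature_entropy(community, features):
    """FE = -sum over f of (p_f/N) log2(p_f/N) + (q_f/N) log2(q_f/N)."""
    n = len(community)
    total = 0.0
    for f in features:
        p_f = sum(1 for feats in community.values() if f in feats)
        q_f = n - p_f
        for count in (p_f, q_f):
            if count:                      # treat 0 * log2(0) as 0
                total -= (count / n) * log2(count / n)
    return total

# Hypothetical community of four plosives with binary features.
toy_community = {
    "/p/": {"bilabial", "plosive"},
    "/b/": {"voiced", "bilabial", "plosive"},
    "/t/": {"dental", "plosive"},
    "/d/": {"voiced", "dental", "plosive"},
}
toy_features = ["voiced", "bilabial", "dental", "plosive"]
fe = feature_entropy(toy_community, toy_features)
```

In this example "plosive" is shared by every member and contributes nothing, while each of the other three features splits the community in half and contributes one bit.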

Comparison between PhoNet and PhoNetrand [Plot: PhoNetrand against PhoNet] International Journal of Modern Physics C, 2007

Network Methods for the Vowel Inventories • Construct two networks • VlaNet (Vowel-Language Network): Bipartite network with one partition of languages (VL) and the other of vowels (VV); an edge signifies that a particular vowel occurs in a particular language; 317 languages and 151 vowels • VoNet (Vowel-Vowel Network): One-mode projection of VlaNet where two vowel nodes are connected as many times as they co-occur across different languages

Degree Distribution (VlaNet) • β-distribution as in the case of consonants [Plot: real data, simulation and theory]

Degree Distribution (VoNet) [Plot: real data compared with the model for fd = vowel inventory size distribution and for fd = constant]

Clustering Coefficient (VoNet) • CC for VoNet is 0.86 • Using the triad model one can achieve a CC of 0.83 (within 3.5% of the real data) • The degree distribution is also not much affected

Community Analysis of VoNet Two forces acting together Feature Economy Perceptual Contrast

VoNethub, VoNetrest and VoNetrest' • VoNethub • All vowel nodes having frequency of occurrence < 120 removed from VoNet along with all their edges → A network of hub nodes. • VoNetrest • All vowel nodes in VoNet are retained. Only edges between hub & non-hub nodes removed. • VoNetrest' • All vowel nodes in VoNet are retained. Only edges that connect a hub with a non-hub where the non-hub occurs more than 95% of times with the hub are retained Advances in Complex Systems, 2008

Vowel Communities • VoNethub • VoNetrest • VoNetrest' Advances in Complex Systems, 2008

Comparison with Randomly Generated Inventories • Feature Economy: VoNetrest • Feature Economy: VoNetrest' • Perceptual Contrast: VoNethub Advances in Complex Systems, 2008

Consonant Vs. Vowel Inventories • Topological properties are qualitatively similar → preferential attachment plays the key role in the emergence of the structure • Community and redundancy ratio analysis however shows differences • Consonants → Feature economy is the key driving force • Vowels → Smaller inventories are driven by perceptual contrast while the larger ones are driven by feature economy

Conclusions and Future Directions • Complex Network based modeling allowed us to uncover various interesting universal properties of sound inventories • We do not claim that all the inferences that we draw are sacrosanct; rather they are indicative • Trends are more important than exact values • Results should help propel future research in self-organizing phonology.

Conclusions and Future Directions • Quite a few theoretical problems that might attract statistical physicists • Network methods are highly instrumental in doing computational linguistics • Unsupervised NLP (Distributional Similarity Networks for learning syntactic and semantic categories) • IR (Blog and Query-log analysis)

Publications from the Thesis
[1] M. Choudhury, A. Mukherjee, A. Basu, and N. Ganguly. Analysis and synthesis of the distribution of consonants over languages: A complex network approach. In Proceedings of COLING–ACL, 128–135, 2006.
[2] F. Peruani, M. Choudhury, A. Mukherjee, and N. Ganguly. Emergence of a nonscaling degree distribution in bipartite networks: A numerical and analytical study. Euro. Phys. Lett., 79(2):28001, 2007.
[3] A. Mukherjee, M. Choudhury, A. Basu, and N. Ganguly. Modeling the cooccurrence principles of the consonant inventories: A complex network approach. Int. Jour. of Mod. Phy. C, 18(2):281–295, 2007.
[4] A. Mukherjee, M. Choudhury, A. Basu, and N. Ganguly. Redundancy ratio: An invariant property of the consonant inventories of the world’s languages. In Proceedings of ACL, 104–111, 2007.
[5] A. Mukherjee, M. Choudhury, A. Basu, and N. Ganguly. Emergence of community structures in vowel inventories: An analysis based on complex networks. In Proceedings of ACL SIGMORPHON9, 101–108, 2007.
[6] A. Mukherjee, M. Choudhury, S. Roy Chowdhury, A. Basu, and N. Ganguly. Rediscovering the co-occurrence principles of the vowel inventories: A complex network approach. Advances in Complex Systems, 11(3):371–392, 2008.

Publications from the Thesis
[8] M. Choudhury, A. Mukherjee, A. Garg, V. Jalan, A. Basu, and N. Ganguly. Language diversity across the consonant inventories: A study in the framework of complex networks. In EACL workshop on Cogn. Aspects of Comp. Lang. Acquisition, 51–58, 2009.
[9] A. Mukherjee, M. Choudhury, A. Basu, and N. Ganguly. Self-organization of sound inventories: Analysis and synthesis of the occurrence and co-occurrence network of consonants. Journal of Quantitative Linguistics, 16(2):157–184, 2009.
[10] A. Mukherjee, M. Choudhury, and N. Ganguly. Analyzing the degree distribution of the one-mode projection of alphabetic bipartite networks (α-BiNs). preprint: arXiv.org:0902.0702.

Thank You
