350 likes | 510 Views
Dal non-determinismo al determinismo ( nei linguaggi 2dim ): alcune riflessioni. Marcella Anselmo, Dora Giammarresi, Maria Madonia, Antonio Restivo. Riunione Prin. Varese, luglio 2006. 2dim Languages. finite alphabet ** all 2dim rectangular words ( pictures ) over
E N D
Dal non-determinismo al determinismo ( nei linguaggi 2dim ): alcune riflessioni Marcella Anselmo, Dora Giammarresi, Maria Madonia, Antonio Restivo Riunione Prin. Varese, luglio 2006
2dim Languages finite alphabet ** all 2dim rectangular words (pictures) over L **2dim language p L has size (m,n) Column concatenation Row concatenation Column/Row star p q = p q =
L islocalif there exists a finite set of tiles that contains all allowed subpictures of size (2,2,), i.e. p L if and only if any 22sub-picture of is in p = p Local 2dim Languages bordered picture p: tile: a square picture of size (2,2)
0 0 0 1 0 0 0 1 0 1 0 1 0 0 1 0 0 0 0 0 = 0 0 0 0 1 0 0 0 0 1 1 0 1 0 1 0 p = # # # # # 1 0 0 # 1 0 0 # 0 1 0 # 0 1 0 # p = 0 0 1 # 0 0 1 # # # # # # (Usual) Example of local language Ld = the set of square pictures with symbol “1” in all main diagonal positions and symbol “0” in the other positions
Recognizable 2dim Languages L is recognizable by tiling systemif L=(L’) where L’ is a local language and is a mapping from the alphabet of L’ to the alphabet of L (, , , ) , where L’=L(), is called tiling system REC is the family of two-dimensional languages recognizable by tiling system REC is closed almost under all operations but it is not closed under complement
a a a a a a a a a (Usual) Example LSq = all squares over = {a}. LSq is recognizable by tiling system. Set L’=Ld and (1)= (0)= a 1 0 0 0 1 0 Ld 0 0 1
0 0 0 1 0 0 0 1 0 1 0 1 0 0 1 0 0 0 0 0 = 0 0 0 0 1 0 0 0 0 1 1 0 1 0 1 0 0 1 p = 0 1 0 “Computing” by a tiling system (from a tiling system to an automaton) (Usual) Example LSq = squares over {a}. Use L’=Ld(1)= (0)= a # # # # # First, decide a scanning strategy! 1 a a 0 a # # 0 a a a # # 0 a a a # # # # # # #
“Computing” by a tiling system (from a tiling system to an automaton) Remark :Tiling system = “undirectional” transitions Definition: A 2dim finite automaton is Tiling system + scanning procedure Local picture is the run of the automaton. Remark : All 2dim finite automata “correspond” to family REC (i.e. scanning procedure does not matter!) Ex: 2OTA (2dim on-line tesselation automata)
Scanning strategies (I) Diagonal (“2OTA”) By column 31 1 1 2 4 7 7 32 8 3 5 2 8 3 33 6 9 9 4 10 34 10 5 11 35 34 6 12 36 35 36
Scanning strategies (II) Snake-like Free 4 5 6 7 9 4 1 2 3 11 10 9 8 7 12 13 14 36 6 3 8 36 35 34 5 1 2
A remark about REC Atiling system (= local language + projection) “generalizes” to 2 dim a non-deterministic finite automaton. Family REC is not closed under complement Definition of REC is intrinsically non-deterministic and it is not possible to eliminate non-determinism without getting a smaller class!
From non-determinism to determinism.. .. • Non-determinism • Possible accepting computations: “several” • Possible backtracking steps at each step of computation: linear in the size of input [if pictures: =O(mn)] • Determinism • Possible accepting computations: 1 • Possible backtracking steps at each step of computation: 0
?? ?? Remark on 1DIM case : In string languages 2 definitions possible: • Determinism from left to right - Co-determinism from right to left Correspond to same class!....choose one definition… Remark: Languages recognized by automata that are both deterministic and co-deterministic are smaller class!
a b c d There is an unique way to fill this position with a symbol of ?? Deterministic Recognizable Languages (DREC)Classical definition (only a bit extended) A tiling systemis Top-Left-deterministic if a,b,c and s unique tile such that (s)=d. (Analogously TR-,BL-,BR-deterministictiling system) L is deterministicif it has a TL- or TR- or BL- or BR- deterministic tiling system
bb bb ab bb aa aa aa aa p’= bb ab bb bb aa ba ba aa New Example Lcol-1n = {p | first col = last col }{a,b}** Lcol-1n REC Lcol-1n DREC Local alphabet: = {xy} Projection “erase” subscripts: (xy) = x Lcol-1i = {p | 1<in , first col = col i} DREC
From non-determinism to determinism: what can we define in beetween? • Non-determinism • Possible accepting computations: “several” • Possible backtracking steps at each step of computation: linear in the size of p (mn) one Unambiguity • Determinism • Possible accepting computations: 1 • Possible backtracking steps at each step of computation: 0
Def [GR92] A tiling system (, , , ) is unambiguous for L** if the projection π is injective on L() (i.e. for any pL there is a unique p’ L’ such that (p’)=p). Unambiguous Recognizable Languages (UREC) L ** is unambiguous if it admits an unambiguous tiling system. UREC: all unambiguous recognizable 2dim languages. UREC REC Generalization in 2dims of unambiguous automata for strings
UREC REC j i i,j: col i = col j Lcol-ij= Lcol-ij = ** Lcol-1n ** and REC is closed with respect to Necess. Cond. for UREC UREC and REC Lcol-ijREC Lcol-ij UREC
Properties of UREC PropositionUREC is closed under intersection and rotation operations. Proposition UREC is not closed under row/column concatenation/closure.
one From non-determinism to determinism: what can we define in beetween? (2) • Non-determinism • Possible accepting computations: “several” • Possible backtracking steps at each step of computation: linear in the size of p (mn) one dimension of p (m or n) “line”-unambiguity • Determinism • Possible accepting computations: 1 • Possible backtracking steps at each step of computation: 0
Column-Unambiguos Languages (Col-UREC) A tiling system is Left-Right Column-Unambiguous if, after having computed the local symbols in an entire column, the local symbols on the next column are univocally determined. ?? ?? ?? ?? L is Col-URECif L has a tiling system that is LR- or RL- column unambiguous. Remark: Backtracking at each step of possibly O(m) steps.
Row-Unambiguos Languages (Row-UREC) A tiling system is Top-Down Row-Unambiguous if, after having computed the local symbols in an entire row of a picture, the local symbols on the next row are univocally determined. ?? ?? ?? ?? L is Row-URECif L has a tiling system that is TD- or DT- column unambiguous. Remark: Backtracking at each step of possibly O(n) steps.
a b a a b a b b a b b a a a a a b a b b b a b b a (A new) Example = {a, b} LSq-cent-a = odd-side squares with a in center position a0 b1 a0 a2 LSq-cent-a Col-UREC,Row-UREC LSq-cent-a DREC b0 a0 b0 b1 a0 b2 b0 a0 a0 a1 a0 a0 b0 a2 b0 b1 By an “old” proof by Inoue et al. b1 a2 b0 b0 a0
1 i j n Necess. Cond. for Col-UREC Col-UREC and UREC Col-UREC UREC i,j: col 1 = col j col i = col n Lcol-1ij n= Lcol-1ij n UREC Lcol-1ijn Col-UREC Lcol-1ijn = Lcol-1j Lcol-in and UREC is closed with respect to
A necessary condition for unambiguity L(m) L is the subset of all pictures with m rows. It can be viewed as a string language over the columns alphabet. S*, regular string language. MS is the boolean matrix MS=|a| *, * where a= 1 iff L. The number of different rows , Row(MS ), is finite. Theorem Let L **. There is a k such that, for all m, If L Col-UREC then Row(ML(m)) km If L UREC then RankQ(ML(m)) km Use Matz’s Theorem and Hromkovic et al. Theorem Idea of Proof
From 2dim to 1dim Theorem [Matz 97]Let L **. If L REC, then there is a k such that, for all m, there is a finite string automaton Am with km states for L(m). Fact If L UREC, then Am is an unambiguous automaton with km states for L(m). If L Col-UREC, then Am is a deterministic automaton with km states for L(m).
Theorem of Hromkovic et al. d(S) the size of the minimal deterministic automaton accepting S uns(S) the size of a minimal unambiguous non-deterministic automaton accepting S. Theorem (Hromkovic et al.)For every regular string language S*, d(S) = Row(MS) uns(S) RankQ(MS).
The following inclusions are all strict: DREC Col-UREC UREC REC 1 i j n j i a Collecting all classes…
Theorem Whatever we choose a definition of deterministic 2dim finite automaton, the family of corresponding languages is strictly included in UREC. A separation result Det-REC is strictly included in UREC , for any definition of Det-REC we choose. Proof: By previous strict inclusions results (Col-UREC UREC )
Theorem Given a tiling system (, , , ) for L**, it is undecidable whether it is unambiguous. An undecidability result for UREC Proof: By reduction from the undecidable 2dimensional Unique Decipherability Problem.
Theorem Given a tiling system T = ( , , , ) for L **, it is decidable whether it is col-unambiguous. No pair of pictures p s p t with • p, s, t n,1 • s t • (s) = (t) • Any 22 sub-picture of p s, p t in • n M A decidability result for Col-UREC (Row-UREC) Proof: Let M=Card {(,) : , }. T col-unambiguous
REMARK: Diag-Unambiguos Languages A tiling system is Top-Left Diagonal-Unambiguous if, after having computed the local symbols in an entire diagonal of a picture, the local symbols on the next diagonal are univocally determined. ?? ?? ?? ?? L is Diag-URECif L has a tiling system that is TL-, TD-, BL- or BR- diagonal unambiguous. Remark: NO backtracking at each step
Open Problems Is UREC (Col-UREC) closed under complement? Is UREC largest subset in REC closed under complement? Conjecture: If L REC\UREC then L REC Is L(4NFA) UREC?
Conclusioni alle riflessioni… Tiling systems are a “compact” way to represent classes of finite state automata on 2 dims. Unambiguos languages are a strict intermediate class between non-deterministic and deterministic families.