Grammars with Regulated Rewriting



Similar documents
On String Languages Generated by Spiking Neural P Systems

Automata and Formal Languages

Parsing Beyond Context-Free Grammars: Introduction

Pushdown Automata. International PhD School in Formal Languages and Applications Rovira i Virgili University Tarragona, Spain

Lights and Darks of the Star-Free Star

Scanner. tokens scanner parser IR. source code. errors

Honors Class (Foundations of) Informatics. Tom Verhoeff. Department of Mathematics & Computer Science Software Engineering & Technology

Automata Theory. Şubat 2006 Tuğrul Yılmaz Ankara Üniversitesi

Mathematics for Computer Science/Software Engineering. Notes for the course MSM1F3 Dr. R. A. Wilson

Automata on Infinite Words and Trees

Learning Automata and Grammars

6.045: Automata, Computability, and Complexity Or, Great Ideas in Theoretical Computer Science Spring, Class 4 Nancy Lynch

INTRODUCTORY SET THEORY

CS5236 Advanced Automata Theory

5.1 Bipartite Matching

Overview of E0222: Automata and Computability

Math 312 Homework 1 Solutions

Reducing Clocks in Timed Automata while Preserving Bisimulation

Chapter 7 Uncomputability

Chapter 7: Products and quotients

Formal Languages and Automata Theory - Regular Expressions and Finite Automata -

On an anti-ramsey type result

Introduction to Automata Theory. Reading: Chapter 1

SECTIONS NOTES ON GRAPH THEORY NOTATION AND ITS USE IN THE STUDY OF SPARSE SYMMETRIC MATRICES

Elements of Abstract Group Theory

MATH 304 Linear Algebra Lecture 9: Subspaces of vector spaces (continued). Span. Spanning set.

A first step towards modeling semistructured data in hybrid multimodal logic

Automata and Computability. Solutions to Exercises

CHAPTER 7 GENERAL PROOF SYSTEMS

Two-dimensional Languages

The Halting Problem is Undecidable

The Optimum One-Pass Strategy for Juliet

Stationary random graphs on Z with prescribed iid degrees and finite mean connections

1 Sets and Set Notation.

Aikaterini Marazopoulou

GROUPS ACTING ON A SET

December 4, 2013 MATH 171 BASIC LINEAR ALGEBRA B. KITCHENS

Semantics of UML class diagrams

3515ICT Theory of Computation Turing Machines

2.1 Complexity Classes

Scalable Automated Symbolic Analysis of Administrative Role-Based Access Control Policies by SMT solving

ω-automata Automata that accept (or reject) words of infinite length. Languages of infinite words appear:

Continued Fractions and the Euclidean Algorithm

MATRIX ALGEBRA AND SYSTEMS OF EQUATIONS. + + x 2. x n. a 11 a 12 a 1n b 1 a 21 a 22 a 2n b 2 a 31 a 32 a 3n b 3. a m1 a m2 a mn b m

Inner Product Spaces and Orthogonality

Determination of the normalization level of database schemas through equivalence classes of attributes

Computability Theory

System BV is NP-complete

Elements of probability theory

! " # The Logic of Descriptions. Logics for Data and Knowledge Representation. Terminology. Overview. Three Basic Features. Some History on DLs

Testing LTL Formula Translation into Büchi Automata

Copy in your notebook: Add an example of each term with the symbols used in algebra 2 if there are any.

Discuss the size of the instance for the minimum spanning tree problem.

Reading 13 : Finite State Automata and Regular Expressions

About the inverse football pool problem for 9 games 1

A Static Analysis Technique for Graph Transformation Systems

Complexity Theory. IE 661: Scheduling Theory Fall 2003 Satyaki Ghosh Dastidar

Omega Automata: Minimization and Learning 1

Formal Grammars and Languages

Regular Languages and Finite State Machines

A 2-factor in which each cycle has long length in claw-free graphs

GENERATING SETS KEITH CONRAD

6.3 Conditional Probability and Independence

Dimitris Chatzidimitriou. Corelab NTUA. June 28, 2013

NP-Hardness Results Related to PPAD

Large induced subgraphs with all degrees odd

Relational Databases

Notes on Determinant

Properties of Real Numbers

Basic Concepts of Set Theory, Functions and Relations

C H A P T E R Regular Expressions regular expression

Section IV.21. The Field of Quotients of an Integral Domain

A Rough Set View on Bayes Theorem

Abstract Algebra Cheat Sheet

ENRICHED CATEGORIES AND THE FLOYD-WARSHALL CONNECTION Vaughan Pratt Computer Science Dept. Stanford University April, 1989

Verification of hybrid dynamical systems

Classification of Cartan matrices

(IALC, Chapters 8 and 9) Introduction to Turing s life, Turing machines, universal machines, unsolvable problems.

Turing Machines: An Introduction

CMSC 858T: Randomized Algorithms Spring 2003 Handout 8: The Local Lemma

A REMARK ON ALMOST MOORE DIGRAPHS OF DEGREE THREE. 1. Introduction and Preliminaries

MA651 Topology. Lecture 6. Separation Axioms.

Lecture 17 : Equivalence and Order Relations DRAFT

INCIDENCE-BETWEENNESS GEOMETRY

Approximation Algorithms

(67902) Topics in Theory and Complexity Nov 2, Lecture 7

On Omega-Languages Defined by Mean-Payoff Conditions

Foundations of mathematics. 2. Set theory (continued)

Chapter 6: Episode discovery process

OPTIMAL DESIGN OF DISTRIBUTED SENSOR NETWORKS FOR FIELD RECONSTRUCTION

Cluster Editing And Search Tree Algorithms

Transcription:

Grammars with Regulated Rewriting Jürgen Dassow Otto-von-Guericke-Universität Magdeburg Fakultät für Informatik Lecture in the 5 th PhD Program Formal Languages and Applications PhD Program Formal Languages and Applications, Tarragona, June 2006 1

References [1] [4] are summarizing books/papers, the remaining references give the papers where the regulations were introduced. References [1] J. Dassow, Grammars with regulated Rewriting. In C. Martin-Vide, V. Mitrana and Gh. Păun (eds.), Formal Languages and Applications, Studies in Fuzziness and Soft Computing 148, Springer-Verlag, Berlin 2004. [2] J. Dassow and Gh. Păun, Regulated Rewriting in Formal Language Theory. Akademie- Verlag, Berlin and Springer-Verlag, Berlin, 1989. [3] J. Dassow, Gh. Păun and G. Rozenberg, Grammars with Controlled derivations. In Handbook of Formal Languages, Vol. II, Chapter 3, 101 154, Springer-Verlag, 1997 [4] A. Salomaa, Formal Languages. Academic Press, New York, 1973. PhD Program Formal Languages and Applications, Tarragona, June 2006 2

[5] S. Abraham, Some questions of phrase-structure grammars. Comput. Linguistics 4 (1965) 61-70. [6] A.V. Aho, Indexed grammars - an extension of context-free grammars. J. ACM 15 (1968) 647-671. [7] A.B. Cremers and O. Mayer, On matrix languages. Inform. Control 23 (1973) 86-96. [8] I. Fris, Grammars with a partial ordering of the rules. Inform. Control 12 (1968) 415-425. [9] S. Ginsburg and E.H. Spanier, Control sets on grammars. Math. Syst. Th. 2 (1968) 159-177. [10] J. Kelemen, Conditional grammars: motivations, definitions and some properties. In: Proc. Conf. Automata Languages and Math. Sciences, Salgotarjan, 1984, 110-123. [11] Gh. Paun, A new generative device: valence grammars. Rev. Roum. Math. Pures Appl. 25 (1980) 911-924. [12] D.J. Rosenkrantz, Programmed grammars and classes of formal languages. J. ACM 16 (1969) 107-131. [13] A.P.J. van der Walt, Random context grammars. In: Proc. Symp. on Formal Languages, 1970. PhD Program Formal Languages and Applications, Tarragona, June 2006 3

Preliminaries I N set of all natural numbers {0, 1, 2, 3,...}, Z set of all integers Q set of all rational numbers V = {a 1, a 2,..., a n } alphabet (with letters in a fixed order) V set of all words over V V + set of all non-empty words over V w length of the word w # U (w) number of occurrences of letters of U V in w V PhD Program Formal Languages and Applications, Tarragona, June 2006 4

Preliminaries II Ψ V (w) = (# a1 (w), # a2 (w),..., # an (w)) Parikh vector of w Ψ V (L) = {Ψ V (w) w L} Parikh language of L V L V is called semilinear if and only if its Parikh language Ψ V (L) is a finite union of linear subsets of N n (i.e., a finite union of sets which can be represented {v 0 + m γ i v i γ i N, 1 i m} i=1 for some v 0, v 1,..., v m N n PhD Program Formal Languages and Applications, Tarragona, June 2006 5

Preliminaries III G = (N, T, S, P ) phrase-structure grammar N set of nonterminals (usually denoted by capitals) T set of nonterminals (usually denoted by small letters) P set of productions (or rules) α β S axiom V G = N T (sometimes only denoted by V ) G length-increasing iff α β for all α β P G context-free iff all rules of P have the form A β with A N, β V G G regular iff all rules of P have the form A β with A N, β T N T {λ} PhD Program Formal Languages and Applications, Tarragona, June 2006 6

Preliminaries IV REG CF CS RE family of all regular languages family of all context-free languages family of all length-increasing/context-sensitive languages family of all recursively enumerable languages {ww w {a, b} } / CF, {a n c m b n d m n, m 1} / CF, {a 2n n 0} / CF, {a n b n c n n 1} / CF PhD Program Formal Languages and Applications, Tarragona, June 2006 7

Preliminaries V Any L CS can be generated by grammar G = (N, T, S, P ) where all rules are of the form AB CD, A BC, A B and A a with A, B, C, D N and a T. Any L RE can be generated by grammar G = (N, T, S, P ) where all rules are of the form AB CD, A BC, A B, A a and A λ with A, B, C, D N and a T. For any language L REG, there is a finite automaton A = (X, Z, z 0, F, δ) (with input set X, state Z, initial state z 0, set F of accepting states and transition function δ) such that the set T (A) of words accepted by A coincides with L. PhD Program Formal Languages and Applications, Tarragona, June 2006 8

Motivation English Mary and John are a woman and a man, respectively. Mary, John and William are a woman, a man and a man, respectively. Mary, John, William and Jenny are a woman, a man, a man and a woman, respectively. Sentences of this type form a sublanguage L with h(l) = {ww w {a, b}} for some morphism h PhD Program Formal Languages and Applications, Tarragona, June 2006 9

Motivation Swiss German Jan säit das mer em Hans hälfed. (Jan says that we helped Hans.) Jan säit das mer em Hans es Huus hälfed aastriche. (Jan said that we helped Hans to paint the house.) Jan säit das mer d chind em Hans es Huus lönd hälfed aastriche. (Jan said that we allowed the children to help Hans to paint the house.) Sentences of this type form a sublanguage L with h(l) = {ww w {a, b}} for some morphism h PhD Program Formal Languages and Applications, Tarragona, June 2006 10

Regularly Controlled Grammar Definition Definition ([9]) : i) A regularly controlled (context-free) grammar is a 5-tuple G = (N, T, S, P, R) where N, T, P and S are specified as in a context-free grammar, R is a regular set over P. ii) The language L(G) generated by G consists of all words w T such that there is a derivation S = p1 w 1 = p2 w 2 = p3... = pn w n = w with p 1 p 2 p 3... p n R. PhD Program Formal Languages and Applications, Tarragona, June 2006 11

Regularly Controlled Grammar Example I G = ({S, A, B}, {a, b}, S, {p 0, p 1, p 2, p 3, p 4, p 5, p 6, p 7, p 8 }, R) p 0 = S AB, p 1 = A aa, p 2 = B ab, p 3 = A ba, p 4 = B bb, p 5 = A a, p 6 = B a, p 7 = A b, p 8 = B b R = p 0 {p 1 p 2, p 3 p 4 } {p 5 p 6, p 7 p 8 } L(G) = {ww w {a, b} + } PhD Program Formal Languages and Applications, Tarragona, June 2006 12

Regularly Controlled Grammar Example II G = ({S, A, B}, {a, b, c, d}, S, {p 0, p 1, p 2, p 3, p 4, p 5, p 6 }, R) p 0 = S AB, p 1 = A aa, p 2 = B bb, p 3 = A ca, p 4 = B db, p 5 = A c, p 6 = B d R = p 0 (p 1 p 2 ) + (p 3 p 4 ) p 5 p 6 L(G) = {a n c m b n d m n 1, m 1} PhD Program Formal Languages and Applications, Tarragona, June 2006 13

Appearance Checking N and T set of nonterminals and terminals, respectively P (finite) set of (context-free) productions over V = N T F subset of P We say that x V + directly derives y V in appearance checking mode by application of p = A w P (written as x = ac p y) if one of the following conditions hold: or x = x 1 Ax 2 and y = x 1 wx 2 A does not appear in x, p F and x = y. PhD Program Formal Languages and Applications, Tarragona, June 2006 14

Regularly Controlled Grammar with Appearance Checking Definition Definition ([9]) : i) A regularly controlled (context-free) grammar with appearance checking is a 6- tuple G = (N, T, S, P, R, F ) where N, T, P, S and R are specified as in a regularly controlled grammar and F is a subset of P. ii) The language L(G) generated by G with appearance checking consists of all words w T such that there is a derivation S = ac p 1 w 1 = ac p 2 w 2 = ac p 3... = ac p n w n = w with p 1 p 2 p 3... p n R. PhD Program Formal Languages and Applications, Tarragona, June 2006 15

Regularly Controlled Grammar with Appearance Checking Example G = ({S, A, X}, {a}, S, {p 1, p 2, p 3, p 4, p 5 }, R, F ), p 1 = S AA, p 2 = S X, p 3 = A S, p 4 = A X, p 5 = S a R = (p 1p 2 p 3p 4 ) p 5, F = {p 2, p 4 } L(G) = {a 2m m 1} PhD Program Formal Languages and Applications, Tarragona, June 2006 16

Regularly Controlled Grammars versus Chomsky Grammars λrc family of all languages generated by regularly controlled grammars (without appearance checking) λrc ac family of all languages generated by regularly controlled grammars with appearance checking rc family of all languages generated by regularly controlled grammars without erasing rules (and without appearance checking) rc ac family of all languages generated by regularly controlled grammars with appearance checking and without erasing rules Theorem : i) All languages of λrc over a unary alphabet are regular. ii) CF rc rc ac CS iii) CF rc λrc λrc ac = RE PhD Program Formal Languages and Applications, Tarragona, June 2006 17

Matrix Grammar Definition I Definition ([5]) : i) A matrix grammar with appearance checking is a quintuple G = (N, T, S, M, F ) where N, T and S are specified as in a context-free grammar, M = {m 1, m 2,... m n }, n 1, is a finite set of sequences m i = (p i,1, p i,2,..., p i,k(i) ), k(i) 1, 1 i n, where any p i,j, 1 i n, 1 j k(i), is a context-free production, F is a subset of all productions occuring in the elements of M, i.e., F {p i,j 1 i n, 1 j k(i)}. ii) We say that M is a matrix grammar without appearance checking if and only if F =. PhD Program Formal Languages and Applications, Tarragona, June 2006 18

Matrix Grammar Definition II iii) For m i, 1 i n, and x, y V G, we define x = m i y by x = x 0 = ac p i,1 x 1 = ac p i,2 x 2 = ac p i,3... = ac p i,k(i) x k(i) = y iv) The language L(G) generated by G (with appearance checking) is defined as the set of all words w T such that there is a derivation S = mj1 y 1 = mj2 y 2 = mj3... = mjk y k = w for some k 1, 1 j i n, 1 i k. PhD Program Formal Languages and Applications, Tarragona, June 2006 19

Matrix Grammar Example I G = ({S, A, B}, {a, b}, S, {m 0, m 1, m 2, m 3, m 4 }, ) m 0 = (S AB), m 1 = (A aa, B ab), m 2 = (A ba, B bb), m 3 = (A a, B a), m 4 = (A b, B b) L(G) = {ww w {a, b} + } G = ({S, A, B}, {a, b, c, d}, {m 0, m 1,..., m 4 }, S, ) m 0 = (S ACBD, m 1 = (A aa, B bb), m 2 = (A a, B b), m 3 = (C cc, D dd), m 4 = (C c, D d) L(G ) = {a n c m b n d m n 1, m 1} PhD Program Formal Languages and Applications, Tarragona, June 2006 20

Matrix Grammar Example II G = ({S, A, A, B, C, D, X}, {a, b}, M, S, F ) M = {m 0, m 0, m 1, m 2, m 3, m 4, m 5, m 5} m 0 = (S AB), m 1 = (A A A, B B), m 3 = (A a, D D), m 4 = (A A, C C), m 5 = (A X, C D) m 0 = (S AD) m 2 = (A X, B C), m 4 = (A X, D b), m 5 = (A X, C B), F = {A X, A X} L(G) = {a 2m b m 0} PhD Program Formal Languages and Applications, Tarragona, June 2006 21

Matrix Grammars versus Regularly Controlled Grammars λm family of all languages generated by matrix grammars (without appearance checking) λm ac family of all languages generated by matrix grammars with appearance checking M family of all languages generated by matrix grammars without erasing rules (and without appearance checking) M ac family of all languages generated by matrix grammars with appearance checking and without erasing rules Theorem : i) M = rc, ii) λm = λrc, iii) M ac = rc ac, iv) λm ac = λrc ac PhD Program Formal Languages and Applications, Tarragona, June 2006 22

Unordered Vector Grammar Definition Definition ([7]) : i) An (unordered) vector grammar is a quadruple G = (V, T, S, M) where N, T, M and S are defined as for matrix grammars. ii) The language L(G) generated by G is defined as the set of all words w T such that there is a derivation S = p1 w 1 = p2 w 2 = p3... = pn w where p 1 p 2... p n is a permutation of some element of M. PhD Program Formal Languages and Applications, Tarragona, June 2006 23

Unordered Vector Grammar Example G = ({S, A, B}, {a, b}, {m 0, m 1, m 2, m 3, m 4 }, S, ) m 0 = (S AB), m 1 = (A aa, B ab), m 2 = (A ba, B bb), m 3 = (A a, B a), m 4 = (A b, B b) {wxw x x {a, b}, w {a, b}, w P erm({w})} PhD Program Formal Languages and Applications, Tarragona, June 2006 24

Unordered Vector Grammars versus Matrix Grammars λuv family of all languages generated by unordered vector grammars uv family of all languages generated by unordered vector grammars without erasing rules Theorem : CF uv = λuv M Theorem : Each language in uv is semilinear. PhD Program Formal Languages and Applications, Tarragona, June 2006 25

Programmed Grammar Definition I Definition ([12]) : i) A programmed grammar is a quadruple G = (N, T, S, Lab, P, P G ) where N, T and S are specified as in a context-free grammar, Lab is a finite set of labels, P is a finite set of context-free productions (set of core productions) P G is a finite set of quadruples r = (q, p, σ, ϕ) where q Lab, p P, and σ and ϕ are subsets of Lab. If r = (q, p, σ, ϕ), then σ and ϕ are called the success field and failure field of r, respectively. ii) If r = (q, p, σ, ) holds for any r P G, then we say that G is a programmed grammar without appearance checking. Otherwise G is a programmed grammar with appearance checking. PhD Program Formal Languages and Applications, Tarragona, June 2006 26

Programmed Grammar Definition II iii) The language L(G) generated by G is defined as the set of all words w T such that there is a derivation S = w 0 = r1 w 1 = r2 w 2 = r3... = rk w k = w, k 1 and, for r i = (q i, A i v i, σ i, ϕ i ), one of the following conditions hold: w i 1 = w i 1 A iw i 1, w i = w i 1 v iw i 1 for some w i 1, w i 1 V G and q i+1 σ i or A i does not occur in w i 1, w i 1 = w i and q i+1 ϕ i. PhD Program Formal Languages and Applications, Tarragona, June 2006 27

Programmed Grammar Examples I G = ({S, A, B}, {a, b}, S, {q 0, q 1,,q 8 }, P, {r 0, r 1, r 2,..., r 8 }) P = {S AB, A aa, B ab, A ba, B bb, A a, B a, A b, B b} r 0 = (q 0, S AB, {q 1, q 3, q 5, q 7 }, ), r 1 = (q 1, A aa, {q 2 }, ), r 3 = (q 3, A ba, {q 4 }, ), r 5 = (q 5, A a, {q 6 }, ), r 7 = (q 7, A b, {q 8 }, ), r 2 = (q 2, B ab, {q 1, q 3, q 5, q 7 }, ), r 4 = (q 4, B bb, {q 1, q 3, q 5, q 7 }, ), r 6 = (q 6, B a,, ), r 8 = (q 8, B b,, ), L(G) = {ww w {a, b} + } PhD Program Formal Languages and Applications, Tarragona, June 2006 28

Programmed Grammar Examples II G = ({S, A}, {a}, {q 1, q 2, q 3 }, P, {r 1, r 2, r 3 }, S) P = {S AA, A S, S a} r 1 = (q 1, S AA, {q 1 }, {q 2 }), r 2 = (q 2, A S, {q 2 }, {q 1, q 3 }), r 3 = (q 3, S a, {q 3 }, ) L(G ) = {a 2m m 0} PhD Program Formal Languages and Applications, Tarragona, June 2006 29

Programmed Grammars versus Matrix Grammars λp family of all languages generated by programmed grammars (without appearance checking) λp ac family of all languages generated by programmed grammars with appearance checking P family of all languages generated by programmed grammars without erasing rules (and without appearance checking) P ac family of all languages generated by programmed grammars with appearance checking and without erasing rules Theorem : i) P = M, ii) λp = λm, iii) P ac = M ac, iv) λp ac = λm ac PhD Program Formal Languages and Applications, Tarragona, June 2006 30

Regularly Controlled Grammar Another Interpretation I R X regular set R = T (A) for some finite (nondeterministic) automaton A = (X, Z, z 0, Q, δ) A description as a graph with labelled edges z = δ(z, x) corresponds to z x z x 1 x 2... x n T (A) = R if and only if x 1 x 2... x n is a sequence of edge labels given by a path from z 0 to some state in Q Control by a regular set corresponds to control by sequences of edge labels of paths from a source node to a target node (in a graph with edge labelling, one source node and a set of target nodes) PhD Program Formal Languages and Applications, Tarragona, June 2006 31

Regularly Controlled Grammar Another Interpretation II R = p 0 {p 1 p 2, p 3 p 4 } {p 5 p 6, p 7 p 8 } z 4 p 2 z 4 p 6 z 5 p 1 z 0 p 0 z 1 p 3 p 5 p 7 z 4 p 4 z 6 p8 z 7 source node z 0 target nodes z 5, z 7 PhD Program Formal Languages and Applications, Tarragona, June 2006 32

Programmed Grammars Another Interpretation I G = (N, T, S, Lab, P, P G ) programmed grammar without appearance checking r = (q(r), p(r), σ(r), ϕ(r)) H = (Lab, E) graph (q(r), q ) E if and only if q σ(r) S = r1 w 1 = r2 w 2... = rk w k = w derivation in G if and only if q 1 q 2... q k is a sequence of nodes along a path in H Control in programmed grammars corresponds to control by node sequences along paths in graphs PhD Program Formal Languages and Applications, Tarragona, June 2006 33

Programmed Grammars Another Interpretation II G = ({S, A, B}, {a, b}, S, {q 0, q 1,,q 8 }, P, {r 0, r 1, r 2,..., r 8 }) r 0 = (q 0, S AB, {q 1, q 3, q 5, q 7 }, ), r 1 = (q 1, A aa, {q 2 }, ), r 2 = (q 2, B ab, {q 1, q 3, q 5, q 7 }, ), r 3 = (q 3, A ba, {q 4 }, ), r 4 = (q 4, B bb, {q 1, q 3, q 5, q 7 }, ), r 5 = (q 5, A a, {q 6 }, ), r 6 = (q 6, B a,, ), r 7 = (q 7, A b, {q 8 }, ), r 8 = (q 8, B b,, ), r 0 r 1 r 2 r 3 r 4 r 4 r 6 r 7 r 8 PhD Program Formal Languages and Applications, Tarragona, June 2006 34

Valence Grammar Definition Definition ([11]) : i) A valence grammar over a monoid is a quintuple G = (N, T, S, P, (M, )) where N, T and S are specified as in a context-free grammar, (M, ) is a monoid with neutral element e, P is a finite set of pairs r = (p, m) with a context-free rule p and m M. ii) For x, y VG, k, l M, we say that (x, k) directly derives (y, l), written as (x, k) = (y, l), iff there is a pair r = (A w, m) P such that x = x Ax and y = x wx for some x, x VG and l = k m. iii) The language L(G) generated by G is defined as L(G) = {w w T, (S, e) = (w, e)}. PhD Program Formal Languages and Applications, Tarragona, June 2006 35

Valence Grammar Examples G = ({S, A, B}, {a, b}, S, {r 0, r 1, r 2, r 3, r 4, r 5, r 6, r 7, r 8 }, (Q, )) r 0 = (S AB, 1), r 1 = (A aa, 2), r 2 = (B ab, 1/2), r 3 = (A ba, 3), r 4 = (B bb, 1/3), r 5 = (A a, 2), r 6 = (B a, 1/2), r 7 = (A b, 3), r 8 = (B b, 1/3) L(G) = {w 1 w 2 w 1 {a, b}, w 2 P erm(w 1 )} G = ({S, A, B}, {a, b}, S, {r 0, r 1, r 2, r 3, r 4, r 5, r 6, r 7, r 8}, (Z, +)) r 0 = (S AB, 0), r 1 = (A aa, 2), r 2 = (B ab, 2), r 3 = (A ba, 3), r 4 = (B bb, 3), r 5 = (A a, 2), r 6 = (B a, 2), r 7 = (A b, 3), r 8 = (B b, 3) L(G ) = {w 1 w 2 w 1, w 2 {a, b} +, 2# a (w 1 ) + 3# b (w 1 ) = 2# a (w 2 ) + 3# b (w 2 )} PhD Program Formal Languages and Applications, Tarragona, June 2006 36

Valence Grammars versus other Grammars Additive valence grammar (M, ) = (Z, +) Multiplicative valence grammar (M, ) = (Q, ) λav family of all languages generated by additive valence grammars av family of all languages generated by additive valence grammars without erasing rules λmv family of all languages generated by multiplicative valence grammars mv family of all languages generated by multiplicative valence grammars without erasing rules Theorem : av = λav mv = λmv = uv PhD Program Formal Languages and Applications, Tarragona, June 2006 37

Conditional Grammars Definition Definition ([8]) : i) A conditional grammar is a quadruple G = (N, T, S, P ) where N, T and S are specified as in a context-free grammar, and P is a finite set of pairs r = (p, R) where p is a context-free production and R is a regular set over V G. ii) For x, y VG, we say that x directly derives y, written as x = y, iff there is a pair r = (A w, R) P such that x = x Ax and y = x wx for some x, x VG and x R. iii) The language L(G) generated by G is defined as L(G) = {w w T, S = w}. PhD Program Formal Languages and Applications, Tarragona, June 2006 38

Conditional Grammars Examples G = ({S, A, B, A, B }, {a, b}, S, {r 0, r 1,... r 8 }) r 0 = (S AB, S), r 1 = (A aa, V BV ), r 2 = (A ba, V BV ), r 3 = (B ab, V aa V, r 4 = (B bb, V ba V ), r 5 = (A A, V B V ), r 6 = (A λ, V B V ), r 7 = (B B, V AV ), r 8 = (B λ, T B T ), L(G) = {ww w {a, b} + } G = ({S, S, A, B}, {a}, S, {r 1, r 2, r 3, r 4, r 5, r 6 }) r 1 = (S S B, SA ), r 2 = (A BB, S B + A + ), r 3 = (S S, S B +, r 4 = (B A, SA B + ), r 5 = (S a, SA ), r 6 = (A a, a + A + ) L(G ) = {a 2n n 0} PhD Program Formal Languages and Applications, Tarragona, June 2006 39

Semi-Conditional Grammar Definition Definition ([10]) : i) A semi-conditional grammar is a quadruple G = (N, T, S, P ) where N, T and S are specified as in a context-free grammar, and P is a finite set of triples r = (p, R, Q) where p is a context-free production and R and Q are disjoint finite sets of words over V G. R (Q) are called the permitted (forbidden) context of r or p, respectively. ii) For x, y VG, we say that x directly derives y, written as x = y, iff there is a triple r = (A w, R, Q) P such that x = x Ax and y = x wx for some x, x VG, any word of R is a subword of x, and no word of Q is a subword of x. iii) The language L(G) generated by G is defined as L(G) = {w w T, S = w} PhD Program Formal Languages and Applications, Tarragona, June 2006 40

Semi-Conditional Grammar Examples G = ({S, S, S }, {a}, S, {r 1, r 2, r 3, r 4 }) r 1 = (S S S,, {SS, S, a}), r 2 = (S S,, {S S, S, a}), r 3 = (S S,, {S S, S, a}, r 4 = (S a,, {Sa, S, S }) L(G) = {a 2n n 0} G = ({S, S, S, A}, {a}, S, {r 1, r 2, r 3, r 4, r 5 }) r 1 = (S S S,, {S, A}), r 2 = (S S,, {S}), r 3 = (S S,, {S }), r 4 = (S A,, {S, S }), r 5 = (A a,, {S}) L(G ) = {a 2n n 0} PhD Program Formal Languages and Applications, Tarragona, June 2006 41

Random Context Grammar Definition ([13]) : A random context grammar is a semi-conditional grammar where the permitting and forbidden contexts of all productions are subsets of the set of nonterminals. A permitting (forbidden, respectively) random context grammar is a random context grammar where all forbidden (permitting, respectively) contexts are empty. Example : G = ({S, A, A, A a, A b, B, B }, {a, b}, S, {r 0, r 1,... r 10 }) r 0 = (S AB,, ), r 1 = (A aa a, {B}, ), r 2 = (A ba b, {B}, ), r 3 = (B ab, {A a }, ), r 4 = (B bb, {A b }, ), r 5 = (A a A, {B }, ), r 6 = (A b A, {B }, ), r 7 = (B B, {A}, ) r 8 = (A A, {B}, ), r 9 = (B λ, {A }, ), r 10 = (A λ,, ) L(G) = {ww w {a, b} } PhD Program Formal Languages and Applications, Tarragona, June 2006 42

Notations λc family of all languages generated by conditional grammars C family of all languages generated by conditional grammars without erasing rules λsc family of all languages generated by semi-conditional grammars sc family of all languages generated by semi-conditional grammars without erasing rules λrc family of all languages generated by random context grammars RC family of all languages generated by random context grammars without erasing rules λprc family of all languages generated by permitting random context grammars prc family of all languages generated by permitting random context grammars without erasing rules PhD Program Formal Languages and Applications, Tarragona, June 2006 43

Generative Capacity Theorem : i) λc = λsc = RE ii) C = sc = CS Theorem : i) CF prc RC = M ac λrc. ii) prc λprc λrc = λm ac. iii) iv) prc M λprc λm PhD Program Formal Languages and Applications, Tarragona, June 2006 44

Ordered Grammar Definition Definition ([8]) : i) An ordered grammar is a quadruple G = (N, T, S, P ) where N, T and S are specified as in a context-free grammar and P is a finite (partially) ordered set of context-free production. ii) For x, y V G, we say that x directly derives y, written as x = y, if and only if there is a production p = A w P such that x = x Ax, y = x wx and there is no production q = B v P such that p q and B occurs in x. iii) The language L(G) generated by G is defined as L(G) = {w w T, S = w} PhD Program Formal Languages and Applications, Tarragona, June 2006 45

Ordered Grammar Example G = ({S, S, S, A, Z}, {a}, S, P ) S Z A Z S Z S Z S S S S S S A S S A a L(G) = {a 2n n 0} PhD Program Formal Languages and Applications, Tarragona, June 2006 46

Ordered Grammars versus Other Grammars λf RC family of all languages generated by forbidden random context grammars f RC family of all languages generated by forbidden random context grammars without erasing rules λo family of all languages generated by ordered grammars O family of all languages generated by ordered grammars without erasing rules Theorem : i) O = frc λo = λfrc RE. ii) CF O rc ac PhD Program Formal Languages and Applications, Tarragona, June 2006 47

Indexed Grammar Definition I Definition ([6]) : i) An indexed grammar is a quintuple G = (N, T, I, S, P ) where N, T and S are specified as in a context-free grammar, I is a finite set of finite sets of productions of the form A w with A N and w V G, and P is a finite set of productions of the form A α with A N and α (NI T ). The elements of I are called indexes. PhD Program Formal Languages and Applications, Tarragona, June 2006 48

Indexed Grammar Definition II ii) For x, y (NI T ), we say that x directly derives y, written as x = y, if either or x = x 1 Aβx 2 for some x 1, x 2 (NI T ), A N, β I, A X 1 β 1 X 2 β 2... X k β k P, y = x 1 X 1 γ 1 X 2 γ 2... X k γ k x 2 with γ i = β i β for X i N and γ i = λ for X i T, 1 i k, x = x 1 Aiβx 2 for some x 1, x 2 (NI T ), A N, i I, β I, A X 1 X 2... X k i, y = x 1 X 1 γ 1 X 2 γ 2... X k γ k x 2 with γ i = β for X i N and γ i = λ for X i T, 1 i k. PhD Program Formal Languages and Applications, Tarragona, June 2006 49

Indexed Grammar Definition III and a Result = denotes the reflexive and transitive closure of =. iii) The language L(G) generated by G is defined as L(G) = {w w T, S = w} λi family of all languages generated by indexed grammars I family of all languages generated by indexed grammars without erasing rules Theorem : CF I = λi CS. PhD Program Formal Languages and Applications, Tarragona, June 2006 50

Indexed Grammar Examples G = ({S, A, B}, {a, b, c, d}, {f}, S, P ) f = {B bb, B b}, P = {S asf, S A, A cad, A B} L(G) = {a n c m b n d m n 1, m 1} G = ({S, A}, {a, b}, {f a, f b, h}, S, P ) f a = {B Ba}, f b = {B Bb}, h = {B λ}, P = {S Ah, A aaf a, A baf b, A B} L(G ) = {ww w {a, b} } PhD Program Formal Languages and Applications, Tarragona, June 2006 51

Hierarchy of Languages Obtained by Regulated Rewriting Theorem : The following equalities are valid: RE = λm ac = λrc ac = λp ac = λrc = λc = λsc, CS = C = sc, λm = λrc = λp, M ac = rc ac = P ac = RC, M = rc = P, uv = λuv = mv = λmv, av = λav, λo = λfrc, O = frc, I = λi PhD Program Formal Languages and Applications, Tarragona, June 2006 52

RE Theorem : The opposite diagram holds. If two families are connected by a line (an arrow), then the upper family includes (includes properly) the lower family; if two families are not connected then they are not necessarily incomparable. CS λo I M ac λm M λprc) mv O prc av CF PhD Program Formal Languages and Applications, Tarragona, June 2006 53

Closure Properties Theorem : The following table holds. operation M ac λm M uv av I λo O λprc prc union + + + + + + + + + + intersection? complement? intersection + + + + + + + + + + by reg. sets concatenation + + + + + + + + + Kleene-closure +?? + + + + + PhD Program Formal Languages and Applications, Tarragona, June 2006 54

operation M ac λm M uv av I λo O λprc prc λ-free + + + + + + + + + + morphisms (arbitrary) + + + + +? +? morphisms inverse + + + + + + + + + + morphisms λ-free + + + + + + + + + + gsm-mappings gsm-mappings + + + + +? +? derivative + + + + + + + + + + quotient + + + + + + +? by reg. sets PhD Program Formal Languages and Applications, Tarragona, June 2006 55

Decision Results I Theorem : Let X be a family of grammars generating one of the families {M ac, M, RC, O, λm, λrc, λo, I, uv, av }. Then the equivalence problem Instance: grammars G 1 X and G 2 X, Answer: Yes if and only if L(G 1 ) = L(G 2 ) and the problem Instance: grammar G X Answer: Yes if and only if G generates a context-free language. are undecidable. PhD Program Formal Languages and Applications, Tarragona, June 2006 56

Decision Results II Theorem : The following table holds. grammar membership emptiness finiteness family problem problem problem I NP-complete + + λo??? O +, NP-hard?? M ac +, NP-hard - - λm + +, NP-hard +, NP-hard M + +, NP-hard +, NP-hard uv LOGCFL +, NP-hard +, NP-hard RC + +, NP-hard +, NP-hard av DTIME(n 4 ) + + PhD Program Formal Languages and Applications, Tarragona, June 2006 57

Reachability Problem for Vector Addition System An n-dimensional vector addition system is a couple (x 0, K) where x 0 N n and K is a finite subset of Z n. A vector y N n is called reachable within (x 0, K) if and only if there are vectors v 1, v 2,..., v t K, t 1, such that x 0 + j v i N n for 1 j t and x 0 + i=1 t v i = y. i=1 The reachability problem Instance: n-dimensional vector addition system (x 0, K), vector y N n Answer: Yes if and only if y is reachable within (x 0, K) is decidable (in exponential space). PhD Program Formal Languages and Applications, Tarragona, June 2006 58

3 Partition Problem The 3-partition problem Instance: multiset {t 1, t 2,..., t 3m } of integers and integer t Answer: Yes, if there is partition {Q 1, Q 2,..., Q m } of {t 1, t 2,..., t 3m } such that #(Q i ) = 3 and s Q i s = t for 1 i m. is NP-complete. PhD Program Formal Languages and Applications, Tarragona, June 2006 59

Syntactic Complexity I Definition : i) For a grammar G, V ar(g) denotes the cardinality of its set of nonterminals. ii) Let X be a family of languages and G(X) the corresponding set of grammars. For a language L X, we set V ar X (L) = min{v ar(g) G G(X), L(G) = L}. Theorem : There is a sequence of context-free languages L n, n 1, such that V ar CF (L n ) = n, V ar M (L n ) 3, V ar P (L n ) = 1, V ar rc (L n ) = 1, V ar prc (L n ) 8. PhD Program Formal Languages and Applications, Tarragona, June 2006 60

Syntactic Complexity II Theorem : i) For any recursively enumerable language L, V ar λmac (L) 3 and V ar λpac (L) 3. ii) V ar λmac ({a n b n c m d m e p f p n, m, p 1}) = 3 iii) There is a sequence of recursively enumerable languages L n, n 1, such that f(n) V ar λrc (L n ) [log 2 n] + 3 for n 1 where f is an unbounded function from N into N. PhD Program Formal Languages and Applications, Tarragona, June 2006 61

Finite Index Definitions G grammar D = S = w 0 = w 1 = w 2 =... = w n = w derivation of w in G Ind(G, w, D) = max{# N (w i ) 0 1 n} Ind(G, w) = min{ind(g, w, D) D is a derivation of w in G} Ind(G) = sup{ind(g, w) w L(G)} Ind X (L) = min{ind(g) G G(X), L = L(G)} X fin = {L L X, Ind X (L) < } PhD Program Formal Languages and Applications, Tarragona, June 2006 62

Families of Languages of Finite Index Theorem : i) All the following language families are equal to M fin P fin, (P ac ) fin, λp fin, (λp ac ) fin, rc fin, (rc ac ) fin, λrc fin, (λrc ac ) fin, λm fin, (M ac ) fin, (λm ac ) fin, RC fin, λrc fin, ii) O fin M fin C fin iii) prc fin M fin M iv) av fin uv fin M fin Theorem : Each language in L fin (M) is semilinear. PhD Program Formal Languages and Applications, Tarragona, June 2006 63