PlanetMath (more info)
 Math for the people, by the people. Sponsor PlanetMath
Encyclopedia | Requests | Forums | Docs | Wiki | Random | RSS  
Login
create new user
name:
pass:
forget your password?
Main Menu
Owner confidence rating: Very high Entry average rating: No information on entry rating
[parent] generalized regular expression (Definition)

Recall that a regular expression over an alphabet $\Sigma$ is a finite strings of symbols that are elements of $\Sigma$ , together with symbols $\varnothing$ , $\cup$ , $\cdot$ , $^*$ , as well as $($ and $)$ , which is put together based on a set of expression formation rules. In a generalized regular expression, additional symbols $\cap$ and $\neg$ are tossed in also. Formally,

Definition. Given an alphabet $\Sigma$ , let $S$ be the set $\lbrace \varnothing, \cup, \cdot, ^*, \cap, \neg, (, ) \rbrace$ , considered to be disjoint from $\Sigma$ . Let $X$ be the smallest subset of $(\Sigma\cup S)^*$ containing the following:

  • any regular expression is in $X$ ,
  • if $u,v\in Y$ , then $(u\cap v), (\neg u)\in X$ .
An element of $X$ is called a generalized regular expression over $\Sigma$ .

Like regular expressions, every generalized regular expressions are designed to represent languages (it is clear that $\cap$ and $\neg$ are intended to mean set-theoretic intersection and complementation). If $u$ is a generalized regular expression:

  • if $u$ is regular expression, then the language represented by $u$ as a generalized regular expression is $L(u)$ , the language represented by $u$ as a regular expression;
  • if $A$ is represented by $u$ and $B$ is represented by $v$ , then $A\cap B$ is represented by $(u\cap v)$
  • if $A$ is represented by $u$ , then $\Sigma^* - A$ is represented by $(\neg u)$ .
By induction, it is easy to see that, given a generalized regular expression $u$ , there is exactly one language represented by $u$ . We denote $L(u)$ the language represented by $u$ , and $ \mathscr{GR}$ the family of languages represented by generalized regular expressions.

Since regular languages are closed under intersection and complementation, generalized regular expressions in this regard are no powerful than regular expressions. The symbols $\neg$ and $\cap$ are therefore extraneous. In other words,

$\displaystyle \mathscr{R}=\mathscr{GR},$
where $ \mathscr{R}$ is the family of regular languages.

Bibliography

1
A. Salomaa, Formal Languages, Academic Press, New York (1973).




"generalized regular expression" is owned by CWoo.
(view preamble | get metadata)

View style:


This object's parent.
Log in to rate this entry.
(view current ratings)

Cross-references: words, closed under, regular languages, induction, intersection, mean, clear, languages, represent, subset, disjoint, expression, elements, strings, finite, alphabet, regular expression
There are 2 references to this entry.

This is version 2 of generalized regular expression, born on 2009-09-03, modified 2009-09-07.
Object id is 11898, canonical name is GeneralizedRegularExpression.
Accessed 239 times total.

Classification:
AMS MSC68Q70 (Computer science :: Theory of computing :: Algebraic theory of languages and automata)
 20M35 (Group theory and generalizations :: Semigroups :: Semigroups in automata theory, linguistics, etc.)

Pending Errata and Addenda
None.
Discussion
Style: Expand: Order:
forum policy

No messages.

Interact
post | correct | update request | add derivation | add example | add (any)