EFTA01145242
EFTA01145243 DataSet-9
EFTA01145285

EFTA01145243.pdf

DataSet-9 42 pages 22,663 words document
P17 P22 V11 P24 P19
Open PDF directly ↗ View extracted text
👁 1 💬 0
📄 Extracted Text (22,663 words)
Introduction to Membrane Computing Gheorghe Pfiun Institute of Mathematics of the Romanian Academy PO Box 1-764, 014700 Bucuresti, Romania E-mail: george .pauneimar .ro Research Group on Natural Computing Department of Computer Science and Artificial Intelligence University of Sevilla Avda. Reins Mercedes s/n, 41012 Sevilla, Spain E-mail: gpaunflus . es Abstract. This is a comprehensive (and — supposed — friendly) introduction to mem- brane computing, meant to offer both to computer scientists and to non-computer scientists an up-dated overview of the domain. That is why the panoply of no- tions which are introduced here is rather large, but the presentation is informal, without any proof and with rigorous definitions given only for the basic types of P systems — symbol-object P systems with multiset rewriting rules, systems with sym- port/antiport rules, systems with string-objects, tissue-like P systems, and neural- like P systems. Besides a list of (biologically inspired or mathematically motivated) ingredients/features which can be used in systems of these types, we also mention a series of results — as well as a series of research trends and topics. Then, both some applications are briefly mentioned and a discussion is made about the attractiveness of this framework for (possible) applications, especially in biology. 1 (The Impossibility of) A Definition of Membrane Computing Membrane computing is an area of computer science aiming to abstract computing ideas and models from the structure and the functioning of living cells, as well as from the way the cells are organized in tissues or higher order structures. In short, it deals with distributed and parallel computing models, processing mu ltisets of symbol-objects in a localized manner (evolution rules and evolving objects are encapsulated into compartments delimited by membranes), with an essential role played by the communication among compartment s (with the environment as well). Of course, this is just a rough description of a membrane system — hereafter called P system — of the very basic type, as many different classes of such devices exist. The essential ingredient of a P system is its membrane structure, which ca n be a hierarchical arrangement of membranes, like in a cell (hence described by a tree), or a net of membranes (placed in the nodes of a graph), like in a tissue, or in a neural net. The intuition behind the 1 EFTA01145243 notion of a membr ane is that from biology, of a three—dimensional vesicle, but the concept itself is generalized/idealized to interpreting a membrane as a separator of two regions (of the Euclidean space), a finite "inside" an d an infinite "outside", also providing the possibility of a selective communication among the two regions. The variety of suggestions from biology and the range of possibilities to define the architecture and the functioning of a membrane-based-multiset-processing device are practically endless — and already the literature of membrane computing contains a very large number of models. Thus, membrane computing is not a theory related to a specific model, it is a framework for devising compart mentalized models. Both because the domain is rather young (the trigger paper is [77], circulated first on web, but related ideas were considered before, in various contexts), but also as a genuine feature, bas ed both on the biological background and the mathematical formalism used, not only there are already proposed many types of P systems, but the flexibility and the versatility of P systems seem to be, in p rinciple, unlimited. This last observation, as well as the rapid development and enlargement of the research in this area, make impossible a short and faithful presentation of membrane computing, with any good level of completeness. However, there are a series of notions, notation, models which are already "standard", which have stabilized and can be considered as basic elements of membrane computing. This paper is devoted to presenting mainly such notions and models, together with the associated notation. The presentation will be both historically and didactically organized, introducing first either notions which were investigated from the beginning in this area, or simpler notions, able to quickly offer an image of membrane computing to the reader who is not familiar with the domain. The reader has surely noticed that all the previous discussion refers mainly to computer science (goals), and much less to biology. Membrane computing was not initiated as an area aiming to provide models to biology, in particular, models of the cell. Still in this moment, after a considerable development at the theoretical level, the domain is not fully prepared to offer such models to biology — but this is a strong tendency of the recent research and considerable advances towards such achievements were reported (the topic will be discussed later in more details). 2 Membrane Computing as Part of Natural Computing Before entering into more specific elements of membrane computing, let us spend some time with the relationship of this area with, let us say, using the "local" terminology, the "outside". We have said above that membrane computing is part of computer science. However, the genus proximus is natural computing, the general attempt to learn computer science useful ideas, models, paradigms from the way nature - life especially — "computes", in various circumstances where substance and information processing can be interpreted as computations. Classic bio- inspired branches of natural computing are genetic algorithms (more generally, evolutionary computing, with well individualized sub-branches such as evolutionary programming) and neural computing. Both of them have a long history, which can be traced until unpublish ed works of Turing (see, e.g., [97]), many applications, and a huge bibliography. Both of them are a proof that "it is worth learning fr om biology", supporting the optimistic observation that many billions of year nature/life has adjusted certa in tools and processes which, correctly (luckily?) 2 EFTA01145244 abstracted and implemented in computer science terms, can p rove to be surprisingly useful in many applications. A more recent branch of natural computing, with an enthusiastic beginning and unconfirmed yet computational applicability (we do not discuss here the by-products, such as the nanotech- nology related developments), is DNA computing, whose birth certificate is related to Adleman experiment [1] of solving a (small) instance of the Hamiltonian path problem by handling DNA molecules in a laboratory. According to Hartmanis [50], (51], it was a demo that we can compute with bio-molecules, a big event for computability. However, after one decade of research, the domain is still preparing its tools for a possible future practical application and looking for a new breakthrough idea, similar to Adleman's one front 1994. However, at the theoretical level, DNA computing is beautifully developed (see, e.g., [85] and the proceedings of the yearly DNA Based Computers series of conferences). This is both due to the fact that DNA structure and processing suggest a series of new data structures (e.g., the double stranded sequence, with the pairs of corresponding symbols from the two strands being related through a complementarity relation) and operations (e.g., recombination, denaturation, hybridization), but also to the fact that the massive parallelism made possible by the efficiency of DNA as a support of information promises to be useful in solving computationally hard problems in a feasible time. Actually, at the theoretical level one can say that DNA computing started already in 1987, when T. Head [49] has proposed a language theoretic model of what he called the splicing operation (the re- combination of DNA molecules, cut by restriction enzymes in fragments pasted together when the sticky ends match). Both evolutionary computing and DNA computing are inspired from and related to handling DNA molecules. Neural computing considers the neurons are simple finite automata linked in networks of specific types. Thu s, these "neurons" are not interpreted as cells, with an internal structure and life, but as "dots on a grid", with a simple input-output function. (The same observation holds true for cellular automata, where again the "cells" are "dots on a grid", only interacting among them, in a rigid structure.) None of these domains considers the cell itself as its main object of research, in particular, none of these domains pays any attention to membranes and compartmentalization — and this is the poin t where membrane computing enters the stage. Thus, membrane computing can be seen as an extension of DNA On ore generally, molecular) computing, from the "one-processor" level to a distributed computing model. 3 Laudation to the Cell (and Its Membranes) Life (as we know it on Earth, in the traditional meaning of the term, that investigated by biology) is directly related to cells, everything alive consists of cells or has to do in a direct way with cells. The cell is the smallest "thing" unanimously considered as dive. It is very small and very intricate in its structure and functioning, has an elaborate internal activity and an exquisite interaction with the neighboring cells, and with the environment in general. It is fragile and robust at the same time, with a way to organize (control) the bio-chemical (and informational) processes which was polished during billions of years of evolution. Any cell means membranes. The cell itself is defined — separated from its environment — by a membrane, the external one. Inside the cell, several membranes enclose "protected reactors", compartments where specific biochemical processes take place. In particular, a membrane en- closes the nucleus (of eukaryotic cells), where the genetic material is placed. Through vesicles 3 EFTA01145245 enclosed by membranes one can transport packages of molecules from a part of the cell (e.g., from the Colgi apparatus) to other parts of the cell - in such a way that the transported molecules are not "aggressed" during their journey by neighboring chemicals. Then, the membranes allow a selective passage of substances among the compartmen is delimited by them. This can be a simple selection by size, in the case of small molecules, or a much mor e intricate selection, through protein channels, which not only select, but can also move molecules from a low concentration to a higher concentration, perhaps coupling molecules, through so-called symport and antiport processes. Much more: the membranes of a cell do not only delimit compartments where specif is reactions take place in solution, hence inside the compartments, but many reactions in a cell develop on the membranes, catalyzed by the many proteins bound on them. It is said that when a compartment is too large for the local biochemistry to be efficient, life creates membranes, both in order to create smaller "reactors " (small enough that, through the Brownian motion, any two of the enclosed molecules can collide frequently en ough), and in order to create further "reaction surfaces". Anyway, biology contains many fascinating facts fr om a computer science point of view, and the reader is encouraged to check the validity of this assertion browsing, e.g., through [2], [64], [8]. Life means surfaces inside surfaces, as can be learned already from the ti the of [52], while S. Marcus puts it in an equational form [69]: Life = DNA software + membrane hardware. Then, there are cells living alone (unicellular organisms, such as ciliates, bac feria, etc.), but in general the cells are organized in tissues, organs, organisms, communities of organisms. All these suppose a specific organization, starting with the direct communication/cooperation among neighboring cells, and ending with the interaction with the environment, at various levels. Together with the internal structure and org anization of the cell, all these suggest a lot of ideas, exciting from a mathematical point of view, and potentia Ily useful from a computability point of view. Part of them were already explored in membrane computing, much mo re still wait for research efforts. 4 Some General Features of Membrane Computing Models Still remaining at this general level, it is worth mentioning from the beginning some of the basic features of models investigated in this area, besides the essential use of mem- branes/compartmentalization. We have mentioned above the notion of a multiset. The compartments of a cell contains substances (ions, small molecules, macromolecules) swimming in an aqueous solution; there is no ordering there, everything is close to everything, the concentration matters, the population, the number of copies of each molecule (of course, we are abstracting/idealizing here, departing from the biological reality). Thus, the suggestion is immediate: to work with sets of objects whose multiplicities matter, hence with multisets. This is a data structure with peculiar characteristics, not new but not systematically investigated in computer science. A multiset can be represented in many ways, but the most compact one is in the f orm of a string. For instance, if the objects a, b, c are present in, respectively, 5, 2, 6 copies each, we can represent this multiset by the string a5b2c6; of course, all permutations of this string represent the same multiset. Both from the string representation of multisets and because of the bio-chemical background, 4 EFTA01145246 where standard chemical reactions are common, the suggestion comes to process the multisets fro in the compartments of our computing device by means of rewriting-like rules. This means rules of the form ti v, where w and v are multisets of objects (represented by strings). Continuing the previous example, we can consider a rule aab abcc. It indicates the fact that two copies of object a together with a copy of object b react, and, as a result of this reaction, we get back a copy of a as well as the copy of b (hence b behaves here as a catalyst), and we produce two new copies of c. If this rule is applied to the multiset a5b2c6, then, because aab are "consumed" and then abcc are "produced", we pass to the multiset a4b2cs. Similarly, by using the rule bb aac, we will get the multiset a7c7, which contains no occurrence of object b. Two important problems arise here. The first one is related to the non-determinism. Which rules should be app lied and to which objects? The copies of an object are considered identical, so we do not distinguish among them; whether to use the first rule or the second one is a significant issue, especially because they cannot be used both at the same time (for the mentioned multiset), as they compete for the "reactant" b. The standard solut ion to this problem in membrane computing is that the rules and the objects are chosen in a non-determinist ic manner (at random, with no preference; more rigorously, we can say that any possible evolution is allowed). This is also related to the idea of parallelism. Biochemistry is not only (in a certain degree) non-deterministic, but it is also (in a certain degree) parallel. If two chemicals can react, then the reaction does not take place for only two molecules of the two chemicals, but, in principle, for all molecules. This is the suggestion supporting the maximal parallelism used in many classes of P systems: in each step, all rules which can be applied have to be applied to all possible objects. We will come back to this important notion later, but now we only illustrate it with the previous multiset and pair of rules. Using these rules in the maximally parallel manner means to either use the first rule twice (thus involving four copies of a and both copies of b) or once the second rule (it consumes both copies of b, hence the first rule cannot be used at the same time). In the first case, one copy of a remains unused (and the same with all copies of c), and the resulting multiset is a3b2c10; in the second case, all copies of a and c remain unused, and the resulting multiset is a7c7. It deserves to be noted that in the latter case the maximally parallel application of rules corresponds to the sequential (one in a time) application of the second rule. There also are other types of rules used in membrane computing (e.g., symport an d antiport rules), but we will discuss them later. Here we close with the observation that membrane computing d eals with models which are intrinsically discrete (basically, working with multisets of objects, with the multiplicities being natural numbers) and evolve through rewriting-like (we can also say reacti on-like) rules. 5 Computer Science Related Areas Rewriting rules are standard rules for handling strings in formal language theory (although other types of rules are also used, both in formal lan guage theory and in P systems, such as insertion, deletion, context-adjoining, etc.). Also working with strings modulo the ordering of symbols is "old stuff': commutative languages (investigated, e.g., in (35]) are nothing else than the permutation closure of languages. In turn, the multiplicity of symbol occurrences in a string corresponds to the Parikh image of the string, which directly leads to vector addition systems, Petri nets, register machines, formal power series. 5 EFTA01145247 Then, also the parallelism is considered in many areas of formal languages, and it is the main feature of Lindenmayer systems. These systems deserve a special discussion here, as they are a well developed branch of formal language theory inspired from biology, specifically, from the development of multi-cellular organisms (which can be described by strings of symbols). However, for L systems the cells are considered as symbols, not their structure is investigated but their organization in (mainly linear) patterns. P systems can be seen as dual to L systems, as they zoom in the cell, distinguishing the internal structure and the objects evolving inside it - maybe also distinguishing (when "zooming enough") the str ucture of the objects themselves, which leads to the category of P systems with string-objects. However, a difference exists between the kind of parallelism in L systems and th at in P systems: there the parallelism is total — all symbols of a string should be processed at the same time, here we work with a maximal parallelism — we process as many objects as possible, but not necessari ly all of them. Still closer to membrane computing are the multiset processing languages, the most known of them being Gamma [9, 10]. The standard rules of Gamma are of the form u v(r), where u and v ar e multisets and x is a predicate which should be satisfied by the multiset to which the rule u v is applied. The generality of the form of rules ensures a great expressivity and, in a direct manner, computational universality . What alumna does not have (at least in the initial versions) is distributivity. Then, membrane computing restricts the form of rules, on the one hand, as imposed by the biological roots, on the other hand , in search of mathematically simple and elegant models. Also membranes appear, even in Gamma related models, and this is the case with CHAM, the Chemical Abstract Machine of Berry and Boudol, [16], the direct ancestor of membrane systems — wit h the mentioning that the membranes of CHAM ... are not membranes as in the cell biology, but they correspond to the contents of membranes, multisets and lower level membranes together, while the goals and the approach are completely different, directed to the algebraic treatment of the processes these membranes can undergo. From this point of view, of goals and tools, CHAM has a recent counterpart in the so-called brane calculus (of course, "brane" co mes front "membrane") from [22] (see also [89] for a related approach), where process alge bra is used for investigating the processes taking place on membranes and with membranes of a cell . The idea of devising a computing device based on compartmentalization through membranes was also suggested in [66]. Many related areas and many roots, with many common ideas and many differences. In some extent, membrane computing is a synthesis of part of these ideas, integrated in a framework directly inspired from the cell biology, paying the deserved attention to membranes (hence to distribution, hierarchization, communication, localization and to other related concepts), aiming — in the basic types of devices — to find computing models, as elegant (minimalistic) as possible, as powerful as possible (in comparison with Turing machines and their subclasses), and as efficient as possible (able to solve computationally hard problems in a feasible time). 6 The Cell-Like Membrane Structure We move now towards presenting in a more precise manner the computing models investigated in our area, and we start by introducing one the fundamental ingredients of a P system, namely, 6 EFTA01145248 the membrane structure. The meaning of this notion is illustrated in Figure 1, and this is wha t we can see when looking (through mathematical glasses, hence abstracting as much as necessary in order t o obtain a formal model) to a standard cell. skin elementary membrane membranes regions environment environment Figure 1: A membrane structure Thus, as suggested by Figure 1, a membrane structure is a hierarchical ly arranged set of membranes, contained in a distinguished external membrane (corresponding to the plasma membrane and usually called the skin membrane). Several membranes can be p laced inside the skin membrane (they correspond to the membranes present in a cell, around the nucleus, in Golg i apparatus, vesicles, mitochondria, etc.); a membrane without any other membrane inside it is said to be elementary. Each membrane determine s a compartment, also called region, the space delimited from above by it and from below by the membranes placed directly inside, if any exists. Clearly, the corre spondence membrane-region is one-to-one, that is why we sometimes use interchangeably the se terms. Usually, the membranes are identified by labels from a given set of labels. In Figure 1, we use numbers, starting with number 1 assigned to the skin membrane (this is the stand and labelling, but the labels can be more informative "names" associated with the membranes). Also, in the fi gure the labels are assigned in a one-to-one manner to membranes, but this is possible only in the case of membr ane structures which cannot grow (indefinitely), otherwise several membranes should have the same label (we will see later such cases). Due to the membrane— region correspondence, we identify by the same label a memb rare and its associated region. Clearly, a hierarchical structure of membranes can be represented by a rooted tree; Figure 2 gives the tree which describes the membrane structure from Figure 1. The root of the tree is associated with the skin membrane and the leaves with the elementary membranes. In this way, graph-theoretic notions are brought into the stage, such as the distance in the tree, the level of a membrane, the height/depth of the membrane structure, as well as terminology, such as parent/child membrane, ancestor, etc. 7 EFTA01145249 Figure 2: The tree describing the membrane structure from Figure 1 Directly suggested by the tree representation is the symbolic representation of a membrane structure, by strings of labelled matching parentheses. For instance, a string corresponding to the st ructure from Figure 1 is the following one: [1 [2 12 [3 13 [4 [5 15 [6 IS 13191916 [7 4 LI ]1' An important aspect should now be noted: the membranes from the same level can f bat around, that is, the tree representing the membrane structure is not oriented; in terms of paren- theses exp ressions, two sub-expressions placed at the same level represent the same membrane structure. For instance, in the previous case, the expression [313 [4 [6 615 19 19 16 [717 [5 15 14 [21211 is an equivalent representation of the same membrane structure. 7 Evolution Rules and the Way of Using Them In the basic variant of P systems, each region contains a multiset of symbol-obj ects, which correspond to the chemicals swimming in a solution in a cell compartment; these chemicals are considered here as unstructured, that is why we describe them by s ymbols from a given alphabet. The objects evolve by means of evolution rules, which are also localized, associated with the regions of the membrane structure. Actually, there are three main types of r (1) multiset- rewriting rules (one uses to call them, simply, evolution rules), (2) conununication rules, and (3) rules for handling membranes. In this section we present the first type of rules. They correspond to the chemi cal reactions possible in the compartments of a cell, hence they are of the form v, whe re u and v are multisets of objects. However, in order to make the compartments cooperate, we h ave to move objects across membranes, and to this aim we add target indications to the objects produced by a rule as above (to the objects from multiset v). These indications are: here, i n, out, with the meaning that an object having associated the indication here remains in the same r egion, one having associated the indication in goes immediately into a directly lower 8 EFTA01145250 membra ne, non-deterministically chosen, and out indicates that the object has to exit the membrane, thus b ecoming an element of the region surrounding it. An example of evolution rule is aab (a, here)(b, out)(c, here)(c, in) (this is the first of the rules considered in Section 4, with target indications associated with the objects produced by rule application ). After using this rule in a given region of a membrane structure, two copies of a and one b are con sumed (removed from the multiset of that region), and one copy of a, one of b, and two of c are pr oduced; the resulting copy of a remains in the same region, and the same happens with one copy of c (indications here), while the new copy of b exits the membrane, goin g to the surrounding region (indication out), and one of the new copies of c enters one of the child membranes, non- deterministically chosen. If no such child membrane exists, that is, the membrane with which the r ule is associated is elementary, then the indication in cannot be followed, and the rule cannot be applied. In t urn, if the rule is applied in the skin region, then b will exit into the environment of the system (and it is " lost" there, as it can never come back). In general, the indication here is not specified (an object wi thout an explicit target indication is supposed to remain in the same region where the rule is applied). It is important to note that in this initial type of systems we are going to describe we do not provide similar rules for the environment, as we do not care about the objects present there; later we will consider types of P systems where also the environment takes part in the system evolution. A rule as above, with at least two objects in its left hand side, is said to be cooperative; a particular case is that of catalytic rules, of the form ca cv, where c is an object (called catalyst) which assists the object a to evolve into the multiset v; rules of the form a —) v, where a is an object, are called non-cooperative. The rules can also have the form u vo, where (S denotes the acti on of membrane dissolving: if the rule is applied, then the corresponding membrane disappears and its contents , object and membranes alike, are left free in the surrounding membrane; the rules of the dissolved membrane d isappear at the same time with the membrane. The skin membrane is never dissolved. The communication of objects through membranes reminds the fact that the biological mem- branes contain various (protein) channels through which the molecules can pass (in a passive way, due to concentration difference, or in an active way, with a consumption of energy), in a rather selective manner. However, the fact that the communication of objects front a com- partment to a neighboring compartment is controlled by the "reaction rules" is mathematically attractive, but not quite realistic from a biological point of view, that is why there were also considered variants where the two processes are separated: the evolution is controlled by rules as above, without target indications, and the communication is controlled by specific rules (by symport/antiport rules). It is also worth noting that evolution rules are stated in terms of names o f objects, while their application/execution is done using copies of objects — remem ber the example from Section 4, where the multiset asb2c6 was processed by a rule of the form aab —) a(b, out)c(c, in), which, in the maximally parallel manner, is used twice, for the two possible sub-multisets aab. We have arrived in this way at the important feature of P systems, concerning the way of using the rules. The key phrase in this respect is: in the maximally parallel manner, non- deterministically choosing the rules and the objects. More specifically, this means that we assign objects to rules, non-deterministically choosing the objects and the rules, until no further assignment is possible. More mathematically stated, we look to the set of rules, and try to find a multiset of rules, by assigning multiplicities to rules, 9 EFTA01145251 with two properties: (i) the multiset of rules is applicable to the multiset of objects available in the respective region, that is, there are enough objects in order to apply the rules a number of times as indicated by their multiplicities, and (ii) the multiset is maximal, no further rule can be added to it (because of the lack of available objects). Thus, an evolution step in a given region consists in finding a maximal applicab le multiset of rules, removing from the region all objects specified in the left hand of the chosen rules (with the multiplicities as indicated by the rules and by the number of times each rule is used), producing the objects from the right hand sides of rules, and then distributing these objects as indicated by the targets associated with them. If at least one of the rules introduces the dissolving action 8, then the membrane is dissolved, and its contents become part of the immediately upper membrane — provided that this membrane was not dissolved at t he same time, a case where we stop in the first upper membrane which was not dissolved (at least the skin r emains intact). 8 A Formal Definition of a Transition P System Systems based on multiset-rewriting rules as above are usually called trans ition P systems, and we preserve here this terminology (although "transitions" are present in all types of systems). Of course, when presenting a P system we have to specify: the alphabet of objects (a usual finit e non-empty alphabet of abstract symbols identifying the objects), the membrane structure (it can be represented in many ways, but the most used one is by a string of labelled matching parentheses), the multisets of objects prese nt in each region of the system (represented in the most compact way by strings of symbol-o bjects), the sets of evolution rules associated with each region, as well as the indication about the way the o utput is defined — see below. Formally, a transition P system (of degree rn) is a construct of the form H= (O,C,pon -i,w2,• • • I Wrn, R15 R2, • • • fi rn,job where: 1. O is the (finite and non-empty) alphabet of objects, 2. C C O is the set of catalysts, 3. µ is a membrane structure, consisting of m membranes, labelled with 1,2, ..., m; one says that the membrane structure, and hence the system, is of degree in, 4. un, w2,..., tvo, are strings over O representing the multisets o f objects present in the regions 1,2, ... In of the membrane structure, 5. R25 . .. ,Ro, are finite sets of evolution rules associated wi th the regions 1, 2, ... , m of the membrane structure, 6. io is either one of the labels 1, 2, ... ,m, and then the respective r egion is the output region of the system, or it is 0, and then the result of a computation is collected in the environment of the system. The rules are of the form u v or u v8, with u E O+ and v E (O x Tar)', where" Tar = (here, in, out}. The rules can be cooperative (with it arbitrary), non-cooperative (with 'By V' we denote the set of a0 strings over an alphabet V, the empty string lambda included, and by V+ we denote the set V' — {A}, of all non-empty strings over V. 10 EFTA01145252 E 0 - C), or catalytic (of the form ea cv or ca coo, with a E 0 - C, c E C, v E ((0 - C) x Tar)*) ; note that the catalysts never evolve and never change the region, they only help the other objects to evolve. A possible restriction about the region it, in the case when it is an internal one is to consider only regions enclosed by elementary membranes for output (that is, it, should be th e label of an elementary membrane of µ). In general, the membrane structure and the multisets of objects from its compart ments identify a configuration of a P system. The initial configuration is given by specifying the membra ne structure and the multisets of objects available in its compartments at the beginning of a computation, hence (p, wi, During the evolution of the system, by means of applying the rules, both the multisets of o bjects and the membrane structure can change. We will see how this is done in the next section; here we conclude with an example of a P system, represented in a pictorial way in Figure 3. It is important to note that adding to the initial configuration the set of rules, placed in the corresponding regions, we have a complete and co ncise presentation of the system (the indication of the output region can also be added in a suitable manner, for instance, writing "output" inside it). 1 Figure 3: The initial configuration of a P system, rules included 9 Defining Computations and Results of Computations In their basic variant, membrane systems are synchronous devices, in the sense t hat a global clock is assumed, which marks the time for all regions of the system. In each time unit a transformation of a configuration of the system — we call it transition — takes place by applying the rules in ea ch region, in a non-deterministic and maximally parallel manner. As explained in th e previous sections, this means that the objects to evolve and the rules governing this evolution are chosen in a non-deterministic way, and this choice is "exhaustive" in the s ense 11 EFTA01145253 that, after the choice was made, no rule can be applied in the same evolution st ep to the remaining objects. A sequence of transitions constitutes a computation. A computation is successful if it halts, it reaches a configuration where no rule can be appli ed to the existing objects, and the output region it, still exists in the halting configuration ( in the case when it, is the label of a membrane, it can be dissolved during the computation). With a success ful computation we can associate a result in various ways. If we have an output region specified, and this is an internal region, then we have an internal output: we count the objects present in the output region in the halting configuration and this number is the result of the computation. When we have it, = 0, we count the objects which leave the system d wring the computation, and this is called external output. In both cases the result is a number. If we distinguish among different objects, then we can have as the result a vector of natural numb ers. The objects which leave the system can also be arranged in a sequence according to the moments when they exit the skin membrane, and in this case the result is a string (if several objects exit at the same tim e, then all their permutations are accepted as a substring of the result). Note that non-halting computations provi de no output (we cannot know when a number is "completely computed" before halting), and if the output membrane is dissolved during the computation, then the computation aborts, no result is obtained (of course, this makes sense only in the case of the internal output). A possible extension of the definition is to consider a terminal set of ob jects, T C 0, and to count only the copies of objects from T, discarding the objects from 0—T pre sent in the output region. This allows some additional leeway in constructing and "programming" a P system , because we can ignore some auxiliary objects (e.g., the catalysts). Because of the non-determinism of the application of rules, starting from an initial config- uration, we can get several successful computations, hence several results. Thus, a P sys tem computes (one also uses to say generates) a set of numbers (or a set of vectors of numbers, or a language, depending on the way the output is defined). The case when we get a la nguage is important in view of the qualitative difference between the "loose" data structure we use inside the system (vectors of numbers) and th e data structure of the result, strings, where we also have a "syntax", a positional information. For a given system H we denote by N(II) the set of numbers computed by Pi in the above way. When we consider the vector of multiplicities of objects from the output region, we w rite Ps(II). In turn, in the case when we take as (external) output the strings of objects leaving the system , then we denote the language of these strings by L(II). Let us illustrate the previous definitions by examining the computations of the system from Figure 3 — with the output region being the environment. We have objects only in the central membrane, that with label 3, hence only here we can apply rules. Specifically, we can repeatedly apply the rule a ab in parallel with f f f, and in this way the number of copies of b grows each step by o ne, while the number of copies of f is doubled in each step. If we do not apply the rule a a (again in parallel with f ff), which dissolves the membrane, then we can continue in this way forever. Thus, in order to ever h alt, we have to dissolve membrane 3. Assume that this happens after n > 0 steps of using the rules a raab and f f f. When membrane 3 is dissolved, its contents (n+ 1 copies of b, 29i}1 copi es of f, and one copy of the catalyst c) are left free in membrane 2, which now can start using its rules. In the next step, all objects b become d. Let us examine the rule s f f f and cf ca. The second rule dissolves membrane 2, hence passes the contents of this membrane to membrane 1. If among the objects which arrive in membrane 1 there is at least one copy of 12 EFTA01145254 f, then the r ule f -) f from region 1 can be used forever and the computation never stops; moreover, if the rule f f f is used at least once in parallel with the rule cf —) cdd, then at least one copy of f is present. Therefore, the rule cf cdd s hould be used only if region 2 contains only one copy of f (note that, because of the catalyst, the rule cf aid can be used only for one copy of f). This means that the rule ff f was used always fo r all pairs of f available, that is, in each step the number of copies of f is divided by 2. This is alrea dy done once in the step when all copies of b become d, and will be done from now on as long as at le ast two copies of f are present. Simultaneously, in each step each d produces one copy of e. This pr ocess can continue until we get a configuration with only one copy of f present; at that step we have t o use the rule cf cdd (the rule ff —) f is no longer applicable), hence also membrane 2 is dissolved . Because we have applied the rule d —) de, in parallel for all copies of d (there are n + 1 such copi es), during n + 1 steps, we have (n + 1)(n + 1) copies of e, n + 2 copies of d (one of them was produced by the rule cf —) aid), and one copy of c present in the skin membrane of the system (the unique membrane still present). The objects e are sent out, and the computation halts. Thereto re, we compute in this way the number (n + 1)2, for some n > 0, that is, N(II) = {n2 n > 11. 10 Using Symport and Antiport Rules The multiset rewriting rules correspond to reactions taking place in the cell, inside the compart- ments. However, an important part of the cell activity is related to the passage of sub stances through membranes, and one of the most interesting ways to handle this trans-membrane com- munication is by coup ling molecules. The process by which two molecules pass together across a membrane (through a specific prote in channel) is called symport when the two molecules pass simultaneously through a protein channel, but in opp osite directions, the process is called antiport. We can formalize these operations in an obvious way: (ab, in) or (ab, out) are symport rules, stating that a and b pass together through a membrane, entering in the former case and exiting in t he latter; similarly, (a, out; b, in) is an antiport rule, stating that a exits and at the same time b enters the membrane. Separately, neither a nor b can cross a membrane — unless we have a rule of the form (a, in) or (a, out), called, for uniformity, uniport rule. Of course, we can generalize these types of rules, by considering symport rules of the form (x, (x, out), and antiport rules of the form (z, out; w, in), where x, z, w are multisets of arbitr ary size; one uses to say that is the weight of a symport rule as above, and maxazi, Iwl) is the weight of the antiport rule2. Now, such rules can be used in a P system instead of the target indications here, in, out we consider multiset rewriting rules of the form u —. v (or 11 —) v6) without target indications associated with the objects from v, as well as symport/antiport rules for communicating the ob jects among compartments. Such systems, called evolution-communication P systems, were considered in [23] (for various restricted types of rules of the two forms). Here we do not go into this direction, but we stay closer both to the chronologi cal evolution of the domain, and to the mathematical minimalism, and we check whether we can compute using only communic ation, that is, only symport and antiport rules. This leads to considering one of the most interesting classes of P systems, which we formally introduce here. 2 By lul we denote the length of the string u E V', for any al phabet V. 13 EFTA01145255 A P system with symporl/antiport rules is a construct of the form H = (0, A, to', ...,w„„ E, Ri, R„„i„), where: 1. 0 is the alphabet of objects, 2. µ is the membrane structure (of degree m > 1, with the membranes la belled in a one-to- one manner with 1, 2, ...,m), 3. till, ..., w„, are strings over 0 representing the multisets of objects present in the m com- partments of A in the initial configuration of the system, 4. E C 0 is the set of objects supposed to appear in the environment in arbitrarily many copies, 5. R,„ are the (finite) sets of rules associated with the m mem braves of µ, 6. io E H is the label of a membrane of µ, which indicates the output region of the system. The rules from R can be of two types, symport rules and antiport rules, of the forms as specified above. The rules are used in the non-deterministic maximally parallel manner. In the usual way, we define transitions, computations, and halting computations. The number (or the vector of multiplicities) of objects present in region io in the halting configuration is said to be computed by the system along that computation; the set of all numbers (resp., vectors or num bers) computed in this way by H is denoted by N(II) (resp., by Ps(II)). We remark here a new component of the system, the set E of objects which are present in the environment in arbitrarily many copies; because we only move objects across membranes and because we start with finite multisets of objects present in the system, we cannot increase the number of objects necessary for the computation if we do not provide a supply of objects, and this can be done by considering the set E. Because the environment is supposed inexhaustible, the objects from E are supposed inexhaustible, irrespective how many of them are brought into the system, arbitrarily many remain outside. Another new feature is that this time the rules are associated with the membrane s, not with the regions, and this is related to the fact that each rule governs the communication through a specific membrane. The P systems with symport/antiport rules have a series of attractive characteri sties: they are fully based on biological types of multiset processing rules; the environment takes a direct pa rt into the evolution of the system; the computation is done only by communication, no object is changed, the objects only move across membranes; no object is created or destroyed, hence the conservation low is obse rved. (as given in the previous sections, this is not valid for multiset rewriting rules, because, for instance, rules of the forma as or f f -.fare allowed, but by using some dummy objects d available in arbitrarily many copies, we can take care of the conservation low by writing, e.g., da as and f f f d). 14 EFTA01145256 11 An Example (Like a Proof... ) Because P systems with symport/antiport rules constitute an important class of P systems, it is worth considering an example; however, instead of a simple example (actually, it is not very easy to find simple symport/antiport systems computing non-trivial sets of numbers), we give directly a general construction, for simulating a register machine. In this way, we also introduce one of the widely used proof techniques for the universality r esults in this area. (Of course, the biologist reader can safely skip this section.) Informally speaking, a register machine consists of a specified number of counters (also called registers) which can hold any natural number, and which ar e handled according to a program consisting of labelled instructions; the counters can be increased or decreased by 1 — the decreasing being possible only if a counter holds a number greater than or equal to 1 (we say that it is non-empty) —, and checked whether they are non-empty. Formally, a (non-deterministic) register machine is a device M
ℹ️ Document Details
SHA-256
24c957f00810e258a3202b77272e61442e6989a8fa1e69f744bd955d626f4460
Bates Number
EFTA01145243
Dataset
DataSet-9
Document Type
document
Pages
42

Comments 0

Loading comments…
Link copied!