Back to CASCI Research

The Agent-Based T-Cell Cross-regulation Model for Document Classification

The Agent-Based T-Cell Cross-regulation Model for Document Classification.

We have developed a bio-inspired solution for binary classification of textual documents inspired by T-cell cross-regulation in the vertebrate adaptive immune system, which is a complex adaptive system of millions of cells interacting to distinguish between self and nonself substances. In analogy, automatic document classification assumes that the interaction and co-occurrence of thousands of words in text can be used to identify conceptually-related classes of documents—at a minimum, two classes with relevant and irrelevant documents for a given concept (e.g. articles with protein-protein interaction information). Our agent-based method for document classification expands the analytical model of Carneiro et al, by allowing us to deal simultaneously with many distinct populations of antigen-specific T-Cells and their collective dynamics. We have extended this model to produce a spam-detection system. We have also developed our agent-based model further to apply it to biomedical article classification, testing it on a dataset of biomedical articles provided by the BioCreative 2.5 challenge. Our results are useful for biomedical text mining, but they also help us understand T-cell cross-regulation as a potential general principle of classification available to collectives of molecules without a central controller. While there is still much to know about the specifics of T-cell cross-regulation in adaptive immunity, Artificial Life allows us to explore alternative emergent classification principles while producing useful bio-inspired tools. Recently, we started expanding this algorithm to other forms of classification such as sensor data from human-robot interactions under an IUCRG project.


Project Members

Luis Rocha

Luis Rocha

Al Abi-Haidar

Al Abi-Haidar

Ian Wood

Ian Wood



Funding

Project partially funded by:


Selected Project Publications

Back to CASCI Research


Pathway modules in the Canalizing Dynamics of the Drosophila Segment Polarity Network

Pathway modules in the Canalizing Dynamics of the Drosophila Segment Polarity Network

The paradigmatic example of a complex system is the web of biochemical interactions that make up life. We still know very little about the organization of life as a dynamical, interacting network of genes, proteins and biochemical reactions. How do biochemical networks—containing many regulatory, signaling, and metabolic processes—achieve reliability and robustness? Cells function reliably despite noisy dynamic environments, which is all the more impressive given that control strategies implemented by intra and inter-cellular processes cannot rely on a centralized, global view of the relevant networks. Are the resulting complex dynamics made up of relatively autonomous modules? If so, what is their functional role and how can they be identified? How robust is the collective computation performed by intra-cellular networks to mutations, delays and stochastic noise? To address these questions, we are focused on developing both novel methodologies and informatics tools to study control and collective computation in automata networks used to model gene regulation and biochemical signaling.

Modeling of biochemical signaling, regulation, modularity, robutsness and emergent computation in the dynamics of complex networks. Our methodology identifies canalizing control patterns in discrete automata models of biochemical networks. Currently working with models of genetic regulation in yeast , flowering of Arabidopsis thaliana; body segmentation in Drosophila, intracellular signal transduction in fibroblasts, biochemical pathways in granular leukemic lymphocytes, an integrated genome-scale transcriptional and metabolic network for E-Coli, and others. Our approach allows us to model biochemical signaling, regulation, modularity [Kolchinsky and Rocha, 2011], robutsness and emergent computation in the dynamics of complex networks modeled as automata networks.

Two-symbol Schemata Redescription of the Transition Look-Up tables of Automata

Two-symbol Schemata Redescription of the Transition Look-Up tables of Automata

Schema Redescription in Cellular Automata

Schema redescription with two symbols is a method to eliminate redundancy in the transition tables of Boolean automata. One symbol is used to capture redundancy of individual input variables, and another to capture permutability in sets of input variables: fully characterizing the canalization present in Boolean functions. Two-symbol schemata explain aspects of the behavior of automata networks that the characterization of their emergent patterns does not capture. We have used our method to show that despite having very different collective behavior, CA rules can be very similar at the local interaction level [Marques-Pita and Rocha, 2011]—leading us to question the tendency in complexity research to pay much more attention to emergent patterns than to local interactions. We have also used schema redescription to obtain more amenable search spaces of CA rules for the Density Classification Task—obtaining some of the best known rules for this task. [Marques-Pita and Rocha, 2008, Marques-Pita, Mitchell, and Rocha, 2008].


Emergent Computation in the AND Rule

Emergent Computation in the AND Rule

Origin of Representations in Evolving Cellular Automata

We have been interested on the problem of how information, symbols, representations and the like can arise from a purely dynamical system of many components. This is a topic of particular interest in Cognitive Science, where the notions of representation and symbol often divide the field into opposing camps. Often, in the area of Embodied Cognition the idea of self-organization in dynamical systems leads many researchers to reject representational or semiotic elements in their models of cognition. This attitude seems not only excessive, but indeed absurd as it ignores the informational processes so important for biological organisms. Therefore, we have been working both on a re-formulation of the concept of representation for embodied cognition, as well as on simulations of dynamical systems (using Celular Automata) where one can study the origin of representations.

The Evolving Cellular Automata experiments of Crutchfield, Mitchell et al, in the late 1990′s were very exciting, as the ability of evolved cellular automata to solve non-trivial computation tasks seemed to provide clues about the origin of representations and information from dynamical systems [Mitchell, 1998] [Rocha ,1998b]. We conducted additional experiments which extended the density classification task with more difficult logical tasks [Rocha ,2000; Rocha, 2004]. Later, we proposed a re-formulation of the concept of representation in cognitive science and artificial life which is based on this work, but argues that the type of emergent computations observed in these experiments do not produce representations quite as rich as those as observed in biology and cognition [Rocha and Hordijk ,2005]. These experiments allow us to think about how to evolve symbols from artificial matter in computational environments. The figure above, depicts a space-time diagram and particle model of a CA rule evolved to solve the AND task . Some additional Figures and experiment details of CA rules for logical tasks in our experiments are also available.



Project Members and Collaborators

Luis Rocha

Luis Rocha

Wim

Wim Hordijk

 Melanie Mitchell

Melanie Mitchell

Santiago Schnell

Santiago Schnell

  Manuel Marques-Pita

Manuel Marques-Pita

Artemy Kolchinsky

Artemy Kolchinsky

Santosh Manicka

Santosh Manicka

Marcio Mourao

Marcio Mourao


Funding

Project partially funded by .

  • Fundacao para a Ciencia e Tecnologia, Portugal. PTDC/EIA-CCO/114108/2009. Project title: “Collective Computation and Control in Complex Biochemical Systems
  • Fundação Luso-Americana para o Desenvolvimento (Portugal) and National Science Foundation (USA), 2012-2014. Project title: “Network Mining For Gene Regulation And Biochemical Signaling.” (171/11)

Selected Project Publications

Back to CASCI Research

Selected Self-organization in Genotype-Phenotype Maps

Agent-based simulation of self-organizing, evolving agents with and without genotype-phenotype mappings

Agent-based simulation of self-organizing, evolving agents with and without genotype-phenotype mappings


We are interested in the linguistic/symbolic aspects of the living organization (the gene as a carrier of information, and DNA as memory) which play a large role in the seemingly open-ended evolution defined by natural selection. This symbolic vision of biology (bio-semiotics), at first glance, seems to be at odds with notions of self-organization so dear to complex systems scientists and a more developmental approach to biology. Therefore, we have been studying the interplay between self-organization and natural selection (in embodied agents), introducing the concept of selected self-organization[Rocha ,1996a; Rocha ,1998a].

We are particularly interested in the problem of how information, symbols, representations and the like can arise from a purely dynamical system of many components. In addition to our work on collective computation and origin of representations, we have worked on simulations of evolving agents with different kinds of reproduction strategies (self-inspection and via a symbolic genotype-phenotype mapping). For these simulations we developed a genetic algorithm with an indirect encoding implemented with Fuzzy Development Programs, which model self-organizing development processes. More information on these simulations is available in the Fuzzy Development Programs’ Resource page, which contains publications and software for understanding and using these. You can also check a paper where these simulations are detailed. The figure depicts a run of our agent-based model where agents which reproduce via a genotype-phenotype mapping completely overtake a population, in a few generations, also containing agents which reproduce by self-inspection without such mappings.


Project Members

Luis Rocha

Luis Rocha

Wim

Wim Hordijk

Artemy Kolchinsky

Artemy Kolchinsky


Selected Project Publications

Complex Networks Collaboratory

lanet-viCx-Nets  is a virtual collaboratory of three research groups that despite their far apart geographical locations pursue the same research agenda in close collaboration. Active research areas include:

  • Network theory, structure and models
  • Information Networks
  • Epidemic modeling
  • Social systems
  • Infrastructures
  • Biological networks

The Cx-Nets website is also intended as an information exchange point with links to conferences, tools and references useful for the network science community.

Alex Vespignani (PI)

Alex Vespignani (PI)

Sandro Flammini

Sandro Flammini

Fil Menczer

Fil Menczer

Research | People | Academics | News and Meetings | Publications-online | Media Mentions | Relevant Conferences

Complex Adaptive Systems and Computational Intelligence

We are a research group at Indiana University and the Instituto Gulbenkian de Ciencia working on complex systems. We are particularly interested in the informational properties of natural and artificial systems which enable them to adapt and evolve. This means both understanding how information is fundamental for the evolutionary capabilities of natural systems, as well as abstracting principles from natural systems to produce adaptive information technology.

Our research projects (see below) are on computational and systems biology, complex networks, text and literature mining, evolutionary systems, adaptive search and recommendation, cognitive science, artificial life, and biosemiotics. Additional information available on Luis Rocha’s Website and our group page at the Instituto Gulbenkian de Ciencia.

For information on joining our group see our Academics page. As a group, we are seriously interconnected with other research groups and networks: The Center for Complex Networks and Systems (CNets), Alife@IU, Biocomplexity Institute, Cognitive Science Program, Complex Systems & Networks, FLAD Computational Biology Collaboratorium, InfoVis Lab, Instituto Gulbenkian de Ciencia, Networks an Agents (NAN).

You are welcome to join our mailing list CASCI-L by either:

  • sending an e-mail to listserv@indiana.edu with subscribe CASCI-L in the body (with no subject), or
  • via the LISTSERV web interface: https://listserv.indiana.edu/cgi-bin/wa-iub.exe?HOME ; Click Subscriber’s Corner at the top of the page. Search for “CASCI-L” select it and click Submit.

CASCI projects

Literature Mining

Biomedical Literature Mining

Collective Dynamics in Complex Biochemical Networks

Collective Dynamics in Complex Biochemical Networks

Models of RNA Editing

Models of RNA Editing

Artificial Immune Systems

 Semi-metric Network Analysis

Network Analysis of Weighted and Fuzzy Graphs

 The Adaptive Web and Bio-inspired designs for Recommendation Systems

The Adaptive Web

Microarray Analysis

Genomic Multivariate Analysis

 Biosemiotics: interplay between self-organization and selection

Biosemiotics

Agent-based modeling

Agent-based modeling

Uncertainty and Generalized Information Theory

Uncertainty and Generalized Information Theory