Artificial Immune Systems: a new classifier • Andrew Watkins & Lois Boggess • Department of Computer Science, Mississippi State University
Metaphors from nature • Neural networks • Genetic algorithms
Biological immune systems • Pathogen • Leucocytes • T cells (thymus) • B cells (bone marrow)
Immune response • Pathogen • Antigen presenting cells • T cell (helper cell) and B cell • B cell begins cloning and mutating • Memory cells
T cells • B cells can’t “match” an antigen unless a T cell does also • Protection against self-destruction: • As T cells mature, those that match against self are destroyed. • Surviving T cells do not match self
Artificial Immune Systems • Some based on T cell analogy • (e.g., Dasgupta at the University of Memphis; Forrest at the University of New Mexico) • Information Security • Example: data security • Data chopped into small segments • Many small random sequences generated from the alphabet of the data • Those that match a data segment are eliminated • The rest recognize only patterns not originally in the data • The number of sequences needed can be calculated from the alphabet and the acceptable risk of overlooking changes to the data
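The data security example above follows the scheme usually called negative selection. A minimal sketch, assuming exact string matching on non-overlapping segments (real systems generally use partial matching); the function names `segments`, `generate_detectors`, and `changed` are hypothetical and not taken from the cited groups' work:

```python
import random

def segments(data, k):
    """Chop data into its set of non-overlapping length-k segments (the 'self' set)."""
    return {data[i:i + k] for i in range(0, len(data) - k + 1, k)}

def generate_detectors(self_set, alphabet, k, n_detectors, rng):
    """Draw random length-k strings; eliminate those that match self, keep the rest.

    Assumes n_detectors does not exceed the number of non-self strings,
    otherwise the loop cannot terminate.
    """
    detectors = set()
    while len(detectors) < n_detectors:
        candidate = "".join(rng.choice(alphabet) for _ in range(k))
        if candidate not in self_set:  # candidates that match the data are discarded
            detectors.add(candidate)
    return detectors

def changed(data, detectors, k):
    """Report a change if any segment of the data is recognized by a detector."""
    return any(seg in detectors for seg in segments(data, k))

rng = random.Random(0)
original = "0000111100001111"
# 16 possible 4-bit strings minus 2 self strings leaves 14 non-self strings,
# so 14 detectors give full coverage in this toy case.
detectors = generate_detectors(segments(original, 4), "01", 4, 14, rng)
print(changed(original, detectors, 4))            # False: no detector matches self
print(changed("0000111100101111", detectors, 4))  # True: "0010" is non-self
```

With fewer detectors than non-self strings, detection becomes probabilistic, which is where the trade-off between the number of sequences and the acceptable risk comes in.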
Models based on B cells • Measure affinity of B cells to a presenting antigen • The stronger the affinity, the more likely the B cell is to change state and produce clones, some of which are mutated • B cells that are not excited by antigens eventually die • B cells “network” with similar B cells • Clusters visible in the results (e.g. Timmis et al. and the Iris data)
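The affinity and cloning bullets above can be sketched as follows. This is an illustration under stated assumptions, not the model used by Timmis et al.: features are assumed to lie in [0, 1], affinity is taken as one minus the scaled Euclidean distance, the clone count is proportional to affinity, and the names `affinity`, `clone_and_mutate`, `max_clones`, and `mutation_rate` are hypothetical. Cell death and networking between B cells are not modeled.

```python
import math
import random

def affinity(cell, antigen):
    """Assumed affinity: 1 minus Euclidean distance, scaled to [0, 1] for features in [0, 1]."""
    return 1.0 - math.dist(cell, antigen) / math.sqrt(len(cell))

def clone_and_mutate(cell, antigen, max_clones, mutation_rate, rng):
    """Produce more clones for higher affinity; each feature of a clone may be mutated."""
    n_clones = round(affinity(cell, antigen) * max_clones)
    clones = []
    for _ in range(n_clones):
        # With probability mutation_rate a feature is replaced by a fresh random value.
        clone = [rng.random() if rng.random() < mutation_rate else x for x in cell]
        clones.append(clone)
    return clones

rng = random.Random(1)
antigen = [0.5, 0.5]
near = clone_and_mutate([0.5, 0.6], antigen, 10, 0.2, rng)
far = clone_and_mutate([0.0, 1.0], antigen, 10, 0.2, rng)
print(len(near), len(far))  # the closer cell produces more clones
```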
Early work at MSU (Andrew Watkins) • Used the B cell models • Tried to modify them to create a classifier • Problem: B cells proliferated until the system was swamped • Problem: a method for training the classifier proved elusive
Resource-limited AIS • Timmis’ group also had B cells swamping the available computing resources • Replace large numbers of identical B cells with a single representative (ARB, artificial recognition ball) having a resource number indicating “how many” • Limit the total, and introduce competition
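One way to picture the resource limit and the competition it introduces is sketched below. The allocation rule (resources proportional to stimulation via an assumed `rate`), the dictionary layout, and the name `allocate_resources` are assumptions for illustration, not the published formulation:

```python
def allocate_resources(arbs, total_limit, rate=10.0):
    """Give each ARB resources proportional to its stimulation, then enforce a global limit.

    Note: the 'resources' field of each ARB dict is updated in place.
    """
    for arb in arbs:
        arb["resources"] = arb["stimulation"] * rate
    excess = sum(a["resources"] for a in arbs) - total_limit
    # Competition: take the excess from the least stimulated ARBs first.
    for arb in sorted(arbs, key=lambda a: a["stimulation"]):
        if excess <= 0:
            break
        taken = min(arb["resources"], excess)
        arb["resources"] -= taken
        excess -= taken
    # ARBs left with no resources are removed from the pool.
    return [a for a in arbs if a["resources"] > 0]

pool = [
    {"name": "a", "stimulation": 0.9},
    {"name": "b", "stimulation": 0.5},
    {"name": "c", "stimulation": 0.2},
]
survivors = allocate_resources(pool, total_limit=12.0)
print([(a["name"], a["resources"]) for a in survivors])
```

In this toy pool the requested total is 16 against a limit of 12, so the weakest ARB loses everything and is removed, and the next weakest gives up part of its share.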
MSU contribution • Add class consciousness • Reward B cells which have strong affinity to a presenting antigen of the same class • At some point, also reward B cells which have low affinity to antigens of a different class • Take resources away from B cells which respond inappropriately; eventually they die
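The class-conscious reward can be sketched as a stimulation function. This is a minimal sketch assuming affinity is already scaled to [0, 1]; the function name `stimulation` and the sample class labels are illustrative only:

```python
def stimulation(arb_class, antigen_class, affinity):
    """Class-aware reward: high affinity pays off within a class, low affinity across classes."""
    if not 0.0 <= affinity <= 1.0:
        raise ValueError("affinity is assumed to lie in [0, 1]")
    if arb_class == antigen_class:
        return affinity
    return 1.0 - affinity

# Illustrative pool of (class, affinity to the antigen) pairs; the antigen is of class "setosa".
pool = [("setosa", 0.9), ("setosa", 0.3), ("virginica", 0.9), ("virginica", 0.1)]
scores = [stimulation(cls, "setosa", aff) for cls, aff in pool]
print(scores)
```

Here the "virginica" cell that responds strongly to a "setosa" antigen gets the lowest score, so it would be first in line to lose resources.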
Effects of number of seed cells • Top two are accuracy on training data • Bottom two are accuracy on test data • Slight trend toward better accuracy with more seed cells • Very different from radial basis classifiers
Training algorithm
1. A training antigen is presented to all of the memory cells of its own class to find the one that matches it best, MCmatch.
2. MCmatch is added to the ARB pool along with its mutated offspring; the number of offspring it is allowed to create depends on the strength of the match.
3. The training antigen is presented to the entire ARB pool. At this point the ARBs compete for resources, the weakest ARBs are culled, and mutated offspring are produced. This stage continues until the stimulation threshold is met.
4. The strongest ARB of the same class as the training antigen is chosen as the candidate memory cell, MCcandidate. The strength of MCcandidate’s reaction to the training antigen is compared with that of MCmatch.
Algorithm (cont.) • If MCcandidate’s reaction is stronger than MCmatch’s reaction then add MCcandidate to the Memory Cell Pool. If, in addition to having this stronger reaction, MCcandidate is closer to MCmatch than the product of the Affinity Threshold and the Affinity Threshold Scalar, then replace MCmatch with MCcandidate in the Memory Cell Pool.
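The final comparison between MCcandidate and MCmatch can be sketched as below. Only the decision rule comes from the slides; the scaled Euclidean `distance`, the `reaction` measure, the tuple representation of cells, and the function name `update_memory_pool` are assumptions made for illustration:

```python
import math

def distance(a, b):
    """Assumed distance: Euclidean, scaled to [0, 1] for features in [0, 1]."""
    return math.dist(a, b) / math.sqrt(len(a))

def reaction(cell, antigen):
    """Assumed reaction strength: closer cells react more strongly."""
    return 1.0 - distance(cell, antigen)

def update_memory_pool(memory_pool, mc_match, mc_candidate, antigen,
                       affinity_threshold, affinity_threshold_scalar):
    """Add the candidate if it reacts more strongly; replace mc_match if the two are close."""
    if reaction(mc_candidate, antigen) > reaction(mc_match, antigen):
        memory_pool.append(mc_candidate)
        if distance(mc_candidate, mc_match) < affinity_threshold * affinity_threshold_scalar:
            memory_pool.remove(mc_match)
    return memory_pool

antigen = (0.5, 0.5)
match, candidate = (0.2, 0.2), (0.45, 0.5)
# Cutoff 0.3 * 0.5 = 0.15: candidate is too far from match, so both are kept.
print(update_memory_pool([match], match, candidate, antigen, 0.3, 0.5))
# Cutoff 0.3 * 1.0 = 0.30: candidate is close enough, so it replaces match.
print(update_memory_pool([match], match, candidate, antigen, 0.3, 1.0))
```

A larger Affinity Threshold Scalar makes replacement more likely, which keeps the Memory Cell Pool smaller.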
We have only just begun • Lots of ideas for exploring the paradigm • Andrew Watkins’ original ideas for modifications • Don Goodman: exploring what happens when there are lots of classes • Gaurov Marwah: uniform probability is assumed for cloning and mutation; what if we introduce probability distributions? • L. Boggess: nature doesn’t use pure mutation during the cloning process; what if we use some ideas from sequence alignment to determine good mutation sites?