* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project
Download Mining the evolutionary search
Survey
Document related concepts
Transcript
國立雲林科技大學 National Yunlin University of Science and Technology Intelligent Exploration for Genetic Algorithms Using Self-Organizing Maps in Evolutionary Computation Advisor : Dr. Hsu Presenter : Zih-Hui Lin Author :Heni Ben Amor, Achim Rettinger 1 Intelligent Database Systems Lab N.Y.U.S.T. I. M. Outline Motivation Objective Introduction The GASOM approach Mining the evolutionary search Enhancing the evolutionary search Evaluation and results Conclusions Future work 2 Intelligent Database Systems Lab N.Y.U.S.T. I. M. Motivation Exploration vs. exploitation is a well known issue in Evolutionary Algorithms. Accordingly, an unbalanced search can lead to premature convergence. 3 Intelligent Database Systems Lab N.Y.U.S.T. I. M. Objective GASOM, a novel Genetic Algorithm, addresses this problem by intelligent exploration techniques. The approach uses Self-Organizing Maps to mine data from the evolution process. 4 Intelligent Database Systems Lab N.Y.U.S.T. I. M. Introduction Encoding schemas Fitness evaluation Testing the end of the algorithm YES Halt NO Parent selection Crossover operators Mutation operators 5 Intelligent Database Systems Lab N.Y.U.S.T. I. M. Introduction (cont.) Genetic drift ─ 因選擇樣本的誤差,而導致在演化過程後,無法正 確呈現完整的基因庫。 6 Intelligent Database Systems Lab N.Y.U.S.T. I. M. Introduction (cont.) Various enhanced strategies for diversity maintenance have been proposed which target the different stages of the evolution process. ─ Selection prevent selection from being too much biased towards high fitness individuals. This is done by pre-processing the fitness values prior to the selection procedure. 7 Intelligent Database Systems Lab N.Y.U.S.T. I. M. Introduction (cont.) ─ Mating ─ In incest is prevented by prohibiting crossover of genetically similar individuals. Hamming distance below a given threshold were not authorized for crossover. the mating decision is based upon an attraction value computed between the partners. Replacement elements will lose their place in the population when new chromosomes are inserted. A well designed replacement condition can keep promising genetic material from getting lost. 8 Intelligent Database Systems Lab N.Y.U.S.T. I. M. Introduction (cont.) Although the extensions described above have been shown to partially improve the efficiency of the search, they make simplifying assumptions by attacking rather the symptoms of premature convergence than the real cause 9 Intelligent Database Systems Lab N.Y.U.S.T. I. M. The GASOM approach An explicit representation of the search history A fitness evaluation promoting novelty A reseeding operator preserving exploratory power A control mechanism balancing exploration and exploitation 10 Intelligent Database Systems Lab Mining the evolutionary search Training the SOM 1. Initialization 2. Stimulus and Response ─ In this step it is crucial that the random selection of chromosomes (x) obeys a uniform distribution. ─ 3. Adaptation ─ ─ * Step 2 and 3 are repeated until the maximum number of iterations is reached. 11 Intelligent Database Systems Lab N.Y.U.S.T. I. M. Mining the evolutionary search Training the SOM (cont.) In this work a simple test procedure is employed to test the quality of a SOM. ─ The projection is done by finding the appropriate BMU for each of these chromosomes, as defined by equation 1. * ─ ─ We keep track how many times each neuron was activated as the BMU. Let E(i) be a function, that returns the number of times a neuron i was activated. Ideally, the activation frequency of all neurons is equal: ─ 12 Intelligent Database Systems Lab N.Y.U.S.T. I. M. Mining the evolutionary search Mapping the population To store the resulting information, we will use two frequency tables. ─ population distribution table ─ holds the activation frequencies with respect to only those individuals that are currently in the population. the activation frequency of a neuron i in the population distribution table, will be denoted by Ep(i). search history table 13 stores the activation frequencies with respect to all individuals evaluated during evolution. To access the information from this table we will use the function Eh(i). Intelligent Database Systems Lab N.Y.U.S.T. I. M. Mining the evolutionary search Representing the Search History 14 Intelligent Database Systems Lab N.Y.U.S.T. I. M. Mining the evolutionary search Representing the Search History 15 Intelligent Database Systems Lab N.Y.U.S.T. I. M. Enhancing the evolutionary search This new fitness evaluation will encourage the exploration of previously neglected solutions. ─ Computed by the use of two ranks The first rank results from ordering the population with respect to the objective value (fitness). The second rank orders individuals by their novelty. ─ The sum of both ranks is then used as the individual’s fitness score. 16 Intelligent Database Systems Lab N.Y.U.S.T. I. M. Enhancing the evolutionary search (cont.) To regain lost diversity some GA variants employ reseeding operations. GASOM performs reseeding by introducing individuals with a high novelty factor into the population. Such individuals correspond to the neurons with low activation count in the search history table. buckets 17 buckets filled by randomly created new chromosomes. Reseeding-pool Intelligent Database Systems Lab N.Y.U.S.T. I. M. Enhancing the evolutionary search (cont.) Balancing exploration and exploitation ─ ─ In the beginning of the search premature convergence should be avoided before having covered as much of the search space as possible. The degree of exploitation should be monotonically decreasing and the degree of exploration should be monotonically increasing during the search run, respectively. 18 Intelligent Database Systems Lab N.Y.U.S.T. I. M. Enhancing the evolutionary search (cont.) A balance function is used to determine the number of neurons which ideally should be activated in the current state of the search process. This optimal number of activated units r is calculated by ─ ─ ─ n is the number of fitness evaluations already performed nmax is the maximal number of fitness evaluations ||U|| is the total number of neurons of the SOM. 19 Intelligent Database Systems Lab N.Y.U.S.T. I. M. N.Y.U.S.T. I. M. Evaluation and results 20 Intelligent Database Systems Lab Evaluation and results (cont.) 21 Intelligent Database Systems Lab N.Y.U.S.T. I. M. Evaluation and results (cont.) 22 Intelligent Database Systems Lab N.Y.U.S.T. I. M. Evaluation and results (cont.) In order to measure the population diversity we used the following function ─ ─ ─ N equals the population size L is the chromosome length in bits. Pi(j) returns the allele at the jth gene of chromosome 23 Intelligent Database Systems Lab N.Y.U.S.T. I. M. Evaluation and results (cont.) 24 Intelligent Database Systems Lab N.Y.U.S.T. I. M. N.Y.U.S.T. I. M. Conclusions The approach uses SOMs to analyize data of the evolution process and utilize it to enhance the search strategy. This lead to the development of a GA with four new core components: ─ ─ ─ ─ an explicit representation of the search history a fitness evaluation promoting novelty a reseeding operator preserving exploratory power a control mechanism balancing exploration and exploitation. 25 Intelligent Database Systems Lab N.Y.U.S.T. I. M. Future work A more fundamental issue is the size of the SOM. The effect of varying the number of neurons on the performance has not been investigated so far. An optimal trade-off between accuracy of the search history table and computational demands is desirable. 26 Intelligent Database Systems Lab N.Y.U.S.T. I. M. My opinion Advantage: Genetic drift will not happen. Disadvantage: Apply:考試題目 27 Intelligent Database Systems Lab