Optimization is a central challenge across the sciences. As real-world problems grow more complex, exact methods and simple heuristics often fail to find the global optimum because they become trapped in local optima. Metaheuristic methods such as evolutionary algorithms can find satisfactory solutions to such problems. The estimation of distribution algorithm (EDA) is a type of evolutionary algorithm that explores the space of potential solutions by building and sampling explicit probabilistic models of promising candidate solutions. A flexible probability model that can be efficiently learned and sampled has a strong influence on the optimization process. Given the capabilities of generative neural networks in many areas of machine learning, particularly in learning the distribution of data and generating similar data, these networks can serve as the probabilistic model in an EDA.

We investigate the suitability of an invertible generative neural network as a model that captures the dependencies between variables and objectives in an estimation of distribution algorithm, enriching the model with objective information. Because an EDA must train the neural network in every generation, the total time needed to solve the problem increases. We propose an active-learning approach to selecting samples in each generation, so that a trained network can be reused to produce new samples over several generations. Another shortcoming of evolutionary algorithms is the large number of function evaluations required to reach the desired solution. Surrogate models address this by predicting the value of the objective function, but training them introduces a cost of its own. We therefore propose an active-learning strategy for selecting informative data to train the surrogate model, in order to reduce this training cost.
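To make the EDA loop described above concrete, the following is a minimal illustrative sketch (not the invertible-network method studied in this work): it fits a simple Gaussian model to the best candidates in each generation, samples the next population from that model, and minimizes the standard sphere benchmark. All function and parameter names are illustrative choices.

```python
# Minimal continuous EDA sketch: fit a Gaussian to elite candidates,
# then sample the next population from the fitted model.
import numpy as np

def sphere(x):
    """Sphere benchmark: global minimum 0 at the origin."""
    return float(np.sum(x ** 2))

def gaussian_eda(dim=5, pop_size=100, elite_frac=0.3, generations=50, seed=0):
    rng = np.random.default_rng(seed)
    mean = np.zeros(dim)               # initial search distribution
    std = np.full(dim, 5.0)
    n_elite = int(pop_size * elite_frac)
    for _ in range(generations):
        # Sample candidate solutions from the current probabilistic model.
        pop = rng.normal(mean, std, size=(pop_size, dim))
        fitness = np.array([sphere(ind) for ind in pop])
        # Select promising candidates and refit the model to them.
        elite = pop[np.argsort(fitness)[:n_elite]]
        mean = elite.mean(axis=0)
        std = elite.std(axis=0) + 1e-12  # avoid a degenerate distribution
    return mean, sphere(mean)

best, value = gaussian_eda()
```

In the approach summarized here, the hand-coded Gaussian model is replaced by an invertible generative neural network that also conditions on objective information, and active learning decides which samples are used for training.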
Experimental results on benchmark functions demonstrate that adding objective-function information to the probabilistic model of the estimation of distribution algorithm, together with the use of active learning, accelerates the optimization process compared with previous related methods.