بخش‌بندی تصاویر رنگی با استفاده از روش‌های مبتنی بر گراف

STUDENT

DEGREE

YEAR

Segmentation is the process of partitioning an image into disjoint and homogeneous regions. It is a fundamental step toward low-level vision, which is significant for object recognition and tracking, image retrieval, face detection, and other computer-vision-related applications. There has been significant interest in graph-based approaches to image segmentation in the past few years. These methods possesses some desirable properties. In particular, both spatial information and feature space characteristics influence the procedure appropriately. In these approaches the image is considered as a whole and all pixels play role in partitioning. Moreover, in some graph-based methods minimization of the inter-region similarity and maximization of the intra-regions similarity are simultaneously accomplished. The common theme underlying these approaches is the formation of a weighted graph where each vertex corresponds to an image pixel and each edge is weighted with some measure of the desire for the pixels connected by that edge to be in the same segment. The weights are usually related to the color and/or texture features, a well a the spatial characteristic of the corresponding ixels. This graph is partitioned into components in a way that minimizes some specified cost function of the vertices in the components and/or the boundary between those components. Till now, several cost functions are proposed for image segmentation. The most popular one is the Normalized cut (Ncut). However, Ncut-based techniques suffer high computation complexity and are not suitable for real-time processing. An efficient solution to this problem is to apply the graph representation strategy on the regions (instead of pixels) that are derived by some region segmentation method. In 2007, Tao, Jin and zhang, proposed to construct a graph in which each vertex is associated with a region derived by Mean-Shift algorithm. They applied Ncut method for partitioning this region-based graph. It is well known that Ncut algorithms try to find a “balanced partition” of a weighted graph, and the cut does not have a bias in favor of cutting small sets of isolated nodes in the graph. But this property is desirable for pixel-based graphs. In region-based graphs, some vertices may correspond to very large regions which must be separated in the first steps. In order to overcome this problem, Tao, Jin and zhang proposed to replace each vertex of the region-based graph by a complete graph and then apply Ncut. Here in this thesis we analytically prove the u Key words Image segmentation, Graph, Spectral clustering, Ncut method, Generalized eigenvalue problem, Rayleigh-Ritz theorem.

بخش‌بندی یکی از عملیات سطح پایین پردازش تصویر با هدف افراز تصویر به نواحی مجزا و همگن، یا بطور معادل یافتن مرزهای این نواحی است. تاکنون روش‌های بخش‌بندی بسیاری در مراجع پیشنهاد شده ‌است. از جمله‌ی این روش‌ها می‌توان به روش‌های مبتنی بر گراف اشاره کرد که در سالهای اخیر مورد توجه زیادی قرار گرفته‌اند. روش‌های مذکور، اطلاعات مربوط به فضای ویژگی‌ها و اطلاعات مکانی پیکسل‌ها را به خوبی با یکدیگر ترکیب می‌کنند. همچنین پاسخ بهینه‌ی آنها سراسری بوده و از اطلاعات کل تصویر در بخش‌بندی استفاده می‌شود. خاصیت مطلوب دیگر برخی از این روش‌ها این است که بر پایه‌ی دو مفهوم شهودی دسته‌بندی، یعنی شباهت داخلی زیاد دسته‌ها و شباهت متقابل کم در بین دسته‌ها، تعریف شده و همزمان هر دوی این خواسته‌ها را برآورده می‌کنند. چارچوب کلی این روش‌ها تخصیص یک گراف بدون جهت وزن‌دار به تصویر و بخش‌بندی این گراف براساس یک معیار مشخص است. هر رأس از این گراف متناظر با یک پیکسل از تصویر در نظر گرفته شده و وزن یال بین دو رأس، نشانگر شباهت دو پیکسل متناظر در تصویر است. در این پایان‌نامه تمرکز اصلی بر روی یکی از پرطرفدارترین معیارهای بخش‌بندی گراف یعنی Ncut بوده‌است. مهمترین علت تعریف این معیار بدست‌آوردن نواحی با شباهت داخلی زیاد است که باعث می‌شود پیکسل‌های تنها و نواحی بسیار کوچک به نواحی بزرگ تصویر ترجیح داده نشوند. با وجود مزایای این روش، حجم محاسبات آن بسیار زیاد است. برای رفع این مشکل می‌توان از یک بیش‌بخش‌بندی اولیه استفاده کرده و سپس گراف تصویر را مبتنی بر این نواحی (به جای پیکسل‌ها) بناکرد. با این کار گراف مورد نظر کوچک شده و حجم محاسبات کاهش چشمگیری می‌یابد. ولی در صورت استفاده از گراف مبتنی بر نواحی، یک رأس‌ ممکن است متناظر با یک ناحیه‌ی بزرگ از تصویر باشد و این معیار تمایلی به جدا کردن نواحی کوچک و رأس منفرد در گراف ندارد. برای رفع این مشکل در سال 2007 تآو، جین و ژانگ پیشنهاد کردند که هر رأس گراف مبتنی بر نواحی با یک گراف کامل جایگزین شود. در این پایان‌نامه به صورت تحلیلی ناکارآمدی این پیشنهاد را اثبات می‌کنیم. همچنین برای رفع مشکل Ncut اندازه‌ی نواحی حاصل از بیش‌بخش‌بندی اولیه را هم در محاسبه‌ی شباهت داخلی نواحی در نظر می‌گیریم. بدین منظور با تعمیم برهان موجود برای یافتن جواب بهینه‌ی Ncut ، دسته‌ای جدید از معیارهای بخش‌بندی گراف را معرفی می‌کنیم که معیار Ncut را به عنوان حالت خاص دربرمی‌گیرد. سپس با هدف وارد کردن اندازه‌ی نواحی به مسئله، معیاری از دسته‌ی جدید پیشنهاد می‌کنیم. در نهایت با ساخت یک مجموعه تصویر مرجع با استفاده از پایگاه داده‌ی برکلی و در نظر گرفتن یک سنجه، به مقایسه‌ی کمّی نتایج بخش‌بندی معیار Ncut با معیار پیشنهادی می‌پردازیم. نتایج حاصل در بیشتر موارد حاکی از برتری معیار پیشنهادی نسبت به معیار Ncut است. کلمات کلیدی: 1- بخش‌بندی تصویر 2- گراف 3- دسته‌بندی طیفی 4- معیار Ncut 5-مسئله‌ی مقدارویژه‌ی تعمیم یافته 6- قضیه‌ی رایلی- ریتز.