دسته بندی نیمه نظارتی (2)

دسته بندی نیمه نظارتی (2) زهره کریمی Introduction to semi-supervised Learning, Xiaojin Zhu and Andrew B. Goldberg, University of Wisconsin, Madison, 2009.

روش های یادگیری نیمه نظارتی • مدل های Mixture و روش EM • روش Co-Training • روش های مبتنی بر گراف • روش های مبتنی بر SVM • یادگیری نیمه نظارتی انسان • تئوری

Co-Training • Named entity Classification Location

Co-Training • Named entity Classification Location Location

Co-Training • دو دسته بندی کننده را یاد می گیرد: هر دسته بندی کننده روی یک دید • نمونه هایی را که در یک مدل دسته بندی با اطمینان بالا دسته بندی شده اند به داده های آموزش مدل دسته بندی دیگر اضافه می کند.

Co-Training • فرضیات • هر view به تنهایی برای دسته بندی کافی باشد • دو view به صورت شرطی با توجه به برچسب داده شده مستقل از هم باشند

کاربردها • Web-page classification • متن صفحه: کلمات رخ داده در صفحه • متن hyperlink: کلمات رخ داده در hyperlink ها به صفحه مورد نظر • Classify Speech phonemes • سیگنال Audio • سیگنال video نمایش دهنده حرکت لب ها

Multiview learning (1) • The squared loss • c(x, y, f (x)) = (y − f (x))2 • 0/1 loss • c(x, y, f (x)) = 0 if y = f (x), and 1 otherwise • c(x, y = healthy, f (x) = diseased) = 1 and • c(x, y = diseased, f (x) = healthy) = 100

Multiview learning (2)

Multiview Learning (3) • MULTIVIEW LEARNING هدف تولید k مدل بر اساس k دید است • The semi-supervised regularizer: میزان عدم توافق k مدل را روی داده های بدون برچسب اندازه گیری می کند Individual Regularized Risk Semi-Supervised regularizer

Multiview learning(4) فرض: مجموعه فرضیه ها با یکدیگر موافق باشند و علاوه بر آن emprical risk آن ها کوچک باشد

دسته بندی نیمه نظارتی مبتنی بر گراف (1) • نمونه های برچسب دار و بدون برچسب متناظر با راس های گراف • شباهت بین هر دو نمونه متناظر با وزن یال بین دو راس • گراف متصل کامل • گراف kNN • گراف NN

دسته بندی نیمه نظارتی مبتنی بر گراف (2)

چارچوب Regularization • تابع برچسب f روی گراف • پیشگویی برچسب f نزدیک به برچسب داده های برچسب دار باشد • loss function • f روی کل گراف هموار باشد (با توجه به regularization framework) • special graph-based regularization

Mincut (1) • نمونه های با برچسب مثبت معادل راس های source • نمونه های با برچسب مثبت معادل راس های sink • هدف، یافتن مجموعه کمینه ای از یال ها است که source را از sink جدا می کند

3 4 1 5 2 Mincut (2)

Mincut (3) • Cost Function • Regularizer • Mincut Regularized Risk problem

Harmonic Function (1)

Harmonic Function (2)

Harmonic Function (3) • راه حل تکراری • راه حل بسته • unnormalized graph Laplacian matrix L W is an (l + u) × (l + u) weight matrix, whose i, j -th element is the edge weightwij

Harmonic Function (4) • unnormalized graph Laplacian matrix

Manifold Regularization (1) • مسائل روش های موجود • Transductive بودن • فقط امکان برچسب گذاری داده های بدون برچسب موجود • حساس بودن به نویز • فرض f (x) = y برای داده های برچسب دار

Manifold Regularization (2) • Inductive بودن • پایدار بودن در محیط های نویزی

Manifold Regularization (3) • normalized graph Laplacian matrix L • توان هایی از ماتریس Laplacian نرمال و غیرنرمال

فرض روش های مبتنی بر گراف (1)

فرض روش های مبتنی بر گراف (2) Spectral graph theory

فرض روش های مبتنی بر گراف (3) • a smaller eigenvalue corresponds to a smoother eigenvector over the graph The graph has k connected components if and only if λ1 = . . . = λk = 0. Thecorresponding eigenvectors are constant on individual connected components, and zero elsewhere.

Graph Spectrum

فرض روش های مبتنی بر گراف (4) • Regularization term اگر مقدار ai یا λi نزدیک به صفر باشد Regularization term کمینه خواهد بود. به عبارت دیگر، f ترجیح می دهد که از پایه های هموار (با λi کوچک ) استفاده کند.

فرض روش های مبتنی بر گراف (5) • در گراف k-connected component، کمینه Regularization term

فرض روش های مبتنی بر گراف (6) • کارایی حساس به ساختار گراف و وزن ها

شهود فاصله از مرز تصمیم تا margin: geometric margin.

Support Vector Machines

Support Vector Machines • The signed geometric margin: The distance from the decision boundary to the closest labeled instance • decision boundary • Maximum margin hyperplane must be unique

Non-Separable Case (1)

Non-Separable Case (2) • lie inside the margin, • but on the correct side of the decision boundary • lie on the wrong side of the • decision boundary and are misclassified • are correctly classified

S3VM (1)

S3VM (2) • the majority (or even all) of the unlabeled instances are predicted in only one of the classes

S3VM (3) • Convex function • The S3VM objective function is non-convex • The research in S3VMs has focused on how to efficiently find a near-optimum solution

Logistic regression • SVM and S3VM are non-probabilistic models • probabilistic model • conditional log likelihood • Gaussian distribution as the prior on w:

Logistic regression • Logistic loss • regularizer

Logistic regression

Entropy Regularizer • Logistic Regression+EntropyRegulizer For SemiSupervised Learning • Intuition • if the two classes are well-separated, then the classification on any unlabeled instance should be confident: it either clearly belongs to the positive class, or to the negative class. Equivalently, the posterior probability p(y|x) should be either close to 1, or close to 0. • Entropy

Semi-supervised Logistic Regression • entropy regularizer for logistic regression

Entropy Regularizer

دسته بندی نیمه نظارتی (2)

دسته بندی نیمه نظارتی (2)

Presentation Transcript

2 2

2-2-2 OFFENSIVE Plays

2 2 2

Chapter 2 Lesson 2-2

2--1- 2- 2- -2-

2-2

2-2-2 OFFENSIVE Plays

2-2

2+2 = 4 2x2 = 2+2 1+2 1/3

Chapter 2, Sec. 2-2

2 .2

2 + 2 4 + 2 6 + 2 8

2+2

2- 2

2-2

Lesson 2-2 Example 2

2 2 2 2 2 10

2-2

2-2

МО 2 = 2 2 –

2 2

2-2