NN – cont.

NN – cont. Alexandra I. Cristea, USI intensive course “Adaptive Systems”, April-May 2003

We have seen how the neuron computes; now let’s see: • What can it compute? • How can it learn?

What does the neuron compute?

Perceptron, discrete neuron • First, the simple case: • no hidden layers • only one neuron • Get rid of the threshold: b becomes w0 • Y is a Boolean function: > 0 fires, ≤ 0 doesn’t fire

Threshold function f with t = 1 (w0 = -t = -1)

Y = X1 or X2: W1 = 1, W2 = 1, t = 1. (Figure: the four input points in the X1-X2 plane; Y = 0 only at (0,0).)

Y = X1 and X2: W1 = 0.5, W2 = 0.5, t = 1. (Figure: the four input points in the X1-X2 plane; Y = 1 only at (1,1).)

Y = or(x1,…,xn): w1 = w2 = … = wn = 1, t = 1

Y = and(x1,…,xn): w1 = w2 = … = wn = 1/n, t = 1
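The two weight choices above can be checked mechanically. The following is a minimal sketch, assuming the neuron fires when the weighted sum reaches the threshold (sum >= t), which is the convention the example weights need. The function names are illustrative and not from the slides; exact fractions are used so that n copies of 1/n add up to exactly 1.

```python
from fractions import Fraction
from itertools import product

def threshold_neuron(weights, inputs, t=1):
    """Return 1 when the weighted input sum reaches the threshold t (assumed: >=)."""
    s = sum(w * x for w, x in zip(weights, inputs))
    return 1 if s >= t else 0

def or_weights(n):
    # w1 = ... = wn = 1: a single active input already reaches t = 1.
    return [1] * n

def and_weights(n):
    # w1 = ... = wn = 1/n: only all n active inputs together reach t = 1.
    # Exact fractions avoid floating-point round-off in n * (1/n).
    return [Fraction(1, n)] * n

n = 3
for x in product([0, 1], repeat=n):
    assert threshold_neuron(or_weights(n), x) == int(any(x))
    assert threshold_neuron(and_weights(n), x) == int(all(x))
print("OR and AND reproduced for n =", n)
```

With plain floats, `n * (1/n)` can land slightly below 1 for some n, which is why the sketch avoids them for the AND case.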

What are we actually doing? We evaluate w0 + w1*X1 + w2*X2 at each input point (X1, X2). (Figure: three panels in the X1-X2 plane, with W0 = -1, W1 = 7, W2 = 9; W0 = 1, W1 = 7, W2 = 9; W0 = -1, W1 = 0.7, W2 = 0.9.)
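To see what each of the three weight sets computes, one can tabulate the sign of w0 + w1*X1 + w2*X2 over the four Boolean inputs. This is a small sketch; the helper names are illustrative, and the firing test is assumed to be "sum >= 0" (none of the sums here is exactly 0, so "> 0" gives the same tables).

```python
from itertools import product

def fires(w0, w1, w2, x1, x2):
    """Return 1 when w0 + w1*x1 + w2*x2 >= 0, else 0 (assumed convention)."""
    return 1 if w0 + w1 * x1 + w2 * x2 >= 0 else 0

def truth_table(w0, w1, w2):
    """Map each Boolean input pair (x1, x2) to the neuron output."""
    return {(x1, x2): fires(w0, w1, w2, x1, x2)
            for x1, x2 in product([0, 1], repeat=2)}

for weights in [(-1, 7, 9), (1, 7, 9), (-1, 0.7, 0.9)]:
    print(weights, truth_table(*weights))
```

The first set behaves like OR, the second fires on every input, and the third behaves like AND: scaling or shifting the weights moves the separating line and so changes the function.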

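Linearly Separable Set: w0 = -1, w1 = -0.67, w2 = 1. (Figure: the line w0 + w1*x1 + w2*x2 = 0 in the x1-x2 plane.)

Linearly Separable Set: w0 = -1, w1 = 0.25, w2 = -0.1. (Figure: the line w0 + w1*x1 + w2*x2 = 0 in the x1-x2 plane.)

Linearly Separable Set: w0 = -1, w1 = 0.25, w2 = 0.04. (Figure: the line w0 + w1*x1 + w2*x2 = 0 in the x1-x2 plane.)

Linearly Separable Set: w0 = -1, w1 = 0.167, w2 = 0.1. (Figure: the line w0 + w1*x1 + w2*x2 = 0 in the x1-x2 plane.)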

Non-linearly separable Set

Non-Linearly Separable Set: w0 = ?, w1 = ?, w2 = ? for w0 + w1*x1 + w2*x2. (Figure: points in the x1-x2 plane.)

Perceptron Classification Theorem: A finite set X can be classified correctly by a one-layer perceptron if and only if it is linearly separable.

Typical non-linearly separable set: Y = XOR(x1, x2). Y = 1 at (0,1) and (1,0); Y = 0 at (0,0) and (1,1). No line w0 + w1*x1 + w2*x2 = 0 separates the two classes.
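The XOR case can be illustrated with a brute-force search over candidate weights. The sketch below is only an illustration, not a proof (the theorem above is what guarantees the result): it scans an arbitrarily chosen grid of weights, uses the "> 0 fires" convention, and all names are hypothetical.

```python
from itertools import product

POINTS = list(product([0, 1], repeat=2))

def realizes(w0, w1, w2, target):
    """True if 'w0 + w1*x1 + w2*x2 > 0' reproduces target on all four points."""
    return all((w0 + w1 * x1 + w2 * x2 > 0) == bool(target(x1, x2))
               for x1, x2 in POINTS)

def search(target, grid):
    """Return the first (w0, w1, w2) on the grid that realizes target, or None."""
    for w in product(grid, repeat=3):
        if realizes(*w, target):
            return w
    return None

# Candidate weight values -4.0, -3.5, ..., 4.0 (an arbitrary choice).
GRID = [k / 2 for k in range(-8, 9)]

print("OR :", search(lambda a, b: a or b, GRID))
print("AND:", search(lambda a, b: a and b, GRID))
print("XOR:", search(lambda a, b: a ^ b, GRID))  # None: no weights on the grid work
```

For XOR the search fails on any grid: the two conditions for Y = 1 require 2*w0 + w1 + w2 > 0, while the two conditions for Y = 0 require 2*w0 + w1 + w2 <= 0.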

How does the neuron learn?

Learning: weight computation for W1*X1 + W2*X2 with t = 1
• W1*(X1 = 1) + W2*(X2 = 1) >= (t = 1)
• W1*(X1 = 0) + W2*(X2 = 1) < (t = 1)
• W1*(X1 = 1) + W2*(X2 = 0) < (t = 1)
• W1*(X1 = 0) + W2*(X2 = 0) < (t = 1)
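The four inequalities describe Y = X1 and X2: only the input (1,1) may reach the threshold. A minimal sketch of checking a candidate weight pair against them, with a hypothetical function name:

```python
def satisfies_and_constraints(w1, w2, t=1):
    """Check the four inequalities for Y = X1 and X2 with threshold t."""
    return (
        w1 * 1 + w2 * 1 >= t      # (1,1) must fire
        and w1 * 0 + w2 * 1 < t   # (0,1) must not fire
        and w1 * 1 + w2 * 0 < t   # (1,0) must not fire
        and w1 * 0 + w2 * 0 < t   # (0,0) must not fire
    )

print(satisfies_and_constraints(0.5, 0.5))  # True
print(satisfies_and_constraints(1, 1))      # False: (0,1) would fire
print(satisfies_and_constraints(0.4, 0.4))  # False: (1,1) would not fire
```

Any pair with W1 < 1, W2 < 1 and W1 + W2 >= 1 passes, so the solution is a region of weight space rather than a single point.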

Perceptron Learning Rule, incremental version (ROSENBLATT, 1962)
FOR i := 0 TO n DO wi := random initial value ENDFOR;
REPEAT
  select a pair (x,t) in X; (* each pair must have a positive probability of being selected *)
  IF wT * x' > 0 THEN y := 1 ELSE y := 0 ENDIF;
  IF y ≠ t THEN
    FOR i := 0 TO n DO wi := wi + α (t-y) xi' ENDFOR
  ENDIF;
UNTIL X is correctly classified
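The rule translates almost line by line into Python. The sketch below makes several assumptions that the slide does not state: x' is taken to be the input x with a leading 1 (so that w0 acts as the bias), the learning rate is called `alpha`, initial weights are drawn uniformly from [-1, 1], and a `max_steps` cap is added so that the loop also ends on sets that are not linearly separable.

```python
import random

def predict(w, x):
    """Output of the discrete neuron; x' = (1, x1, ..., xn) is built here."""
    xp = [1] + list(x)
    return 1 if sum(wi * xi for wi, xi in zip(w, xp)) > 0 else 0

def train_perceptron(samples, alpha=0.1, max_steps=10_000, seed=0):
    """Return weights that classify all samples, or None if the cap is reached."""
    rng = random.Random(seed)
    n = len(samples[0][0])
    w = [rng.uniform(-1, 1) for _ in range(n + 1)]   # random initial values
    for _ in range(max_steps):
        if all(predict(w, x) == t for x, t in samples):
            return w                                  # X is correctly classified
        x, t = rng.choice(samples)                    # every pair can be selected
        y = predict(w, x)
        if y != t:
            xp = [1] + list(x)
            w = [wi + alpha * (t - y) * xi for wi, xi in zip(w, xp)]
    return None

AND = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]
XOR = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 0)]
print("AND weights:", train_perceptron(AND))
print("XOR weights:", train_perceptron(XOR, max_steps=1000))  # None
```

For XOR the loop can never reach the "correctly classified" exit, so only the step cap stops it.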

wnew x’ x’ w wniew x’ + - x’ w changes in the direction of the input w Idea Perceptron Learning Rule wi:= wi +  (t-y) xi' t=1 y=0 (wTx’0) wnew=w + x’ t=0 y=1 (wTx’>0) wnew=w - x’

For multi-layered perceptrons with continuous neurons, a simple and successful learning algorithm exists.

BKP: Error. For each output neuron the error is e[i] = d[i] - y[i]: e1 = d1 - y1, e2 = d2 - y2, e3 = d3 - y3, e4 = d4 - y4. Hidden layer error? (Figure: network with input, hidden layer, and output.)

Forward propagation: neuron1 sends its value y1 through a synapse with weight W to neuron2, which receives y2 = w*y1. The weight serves as an amplifier! Values (y1, y2) = internal activation.

Backward propagation: the error value e2 at neuron2 travels back to neuron1 through the inverse synapse with weight W. The weight serves as an amplifier! Values (e1, e2) = error. e1 = ??

Backward propagation: e1 = w*e2. The weight again serves as an amplifier, now for the error values (e1, e2).

BKP: Error, with the layers labelled: I1 = input, O1 = output, O2 and I2 = hidden layer. Output errors: e1 = d1 - y1, e2 = d2 - y2, e3 = d3 - y3, e4 = d4 - y4. Hidden layer error?

Backpropagation to the hidden layer (O2, I2): ee[j] = Σ_i e[i] w[j,i]. (Figure: the output errors e1, e2, e3 flow back through the weights w1, w2, w3 to a hidden neuron, between input I1 and output O1.)
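The hidden-layer error is a weighted sum of the output errors, with each weight used "in reverse". A minimal sketch, assuming w[j][i] holds the weight from hidden neuron j to output neuron i; the numbers are invented for illustration.

```python
def hidden_errors(e, w):
    """ee[j] = sum over i of e[i] * w[j][i]."""
    return [sum(e_i * w_ji for e_i, w_ji in zip(e, row)) for row in w]

e = [0.5, -0.25, 1.0]        # hypothetical output errors e[i] = d[i] - y[i]
w = [[1.0, 2.0, 0.5],        # weights from hidden neuron 0 to outputs 0..2
     [0.0, -1.0, 2.0]]       # weights from hidden neuron 1 to outputs 0..2

print(hidden_errors(e, w))   # [0.5, 2.25]
```

A hidden neuron with large weights to badly wrong outputs receives a large share of the error, which matches the "weight serves as an amplifier" picture of the single-synapse case.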

Update rule for 2 weight types
• ① I2 (hidden layer), O1 (system output)
• ② I1 (system input), O2 (hidden layer)
① Δw = α (d[i] - y[i]) f’(S[i]) f(S[i]) = α e[i] f(S[i]) (simplification: f’ = 1, e.g. for a repeater), with S[i] = Σ_j w[j,i](t) h[j]
② Δw = α (Σ_i e[i] w[j,i]) f’(S[j]) f(S[j]) = α ee[j] f(S[j]), with S[j] = Σ_k w[k,j](t) x[k]

Backpropagation algorithm
FOR s := 1 TO r DO Ws := initial matrix (often random) END;
REPEAT
  select a pair (x,t) in X; y0 := x;
  # forward phase: compute the actual output ys of the network with input x
  FOR s := 1 TO r DO ys := F(Ws ys-1) END;
  # yr is the output vector of the network
  # backpropagation phase: propagate the errors back through the network
  # and adapt the weights of all layers
  dr := Fr’ (t - yr);
  FOR s := r DOWNTO 2 DO
    ds-1 := Fs-1’ WsT ds;
    Ws := Ws + α ds ys-1T
  END;
  W1 := W1 + α d1 y0T
UNTIL stop criterion
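The algorithm can be sketched in plain Python. Several choices below are assumptions that the slide leaves open: F is taken to be the logistic sigmoid (so F' = y * (1 - y)), the learning rate is called `alpha`, initial weights are drawn from [-0.5, 0.5], there are no bias terms, and the stop criterion is simply a fixed number of steps on one training pair.

```python
import math
import random

def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))

def matvec(W, y):
    return [sum(wij * yj for wij, yj in zip(row, y)) for row in W]

def forward(Ws, x):
    """Return [y0, y1, ..., yr] with y0 = x and ys = F(Ws ys-1)."""
    ys = [list(x)]
    for W in Ws:
        ys.append([sigmoid(v) for v in matvec(W, ys[-1])])
    return ys

def backprop_step(Ws, x, t, alpha):
    """One pass of the REPEAT body; updates the matrices in Ws in place."""
    ys = forward(Ws, x)
    # dr := Fr'(t - yr); for the sigmoid, F' = y * (1 - y)
    d = [y * (1 - y) * (ti - y) for y, ti in zip(ys[-1], t)]
    for s in range(len(Ws) - 1, -1, -1):
        W, y_prev = Ws[s], ys[s]
        if s > 0:
            # ds-1 := Fs-1' WsT ds, computed before Ws is changed
            d_prev = [
                y_prev[j] * (1 - y_prev[j])
                * sum(W[i][j] * d[i] for i in range(len(d)))
                for j in range(len(y_prev))
            ]
        # Ws := Ws + alpha * ds ys-1T (outer product)
        for i in range(len(W)):
            for j in range(len(y_prev)):
                W[i][j] += alpha * d[i] * y_prev[j]
        if s > 0:
            d = d_prev

def init_weights(sizes, seed=0):
    """One random matrix per layer; sizes = [inputs, hidden..., outputs]."""
    rng = random.Random(seed)
    return [[[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_out)]
            for n_in, n_out in zip(sizes, sizes[1:])]

def squared_error(Ws, x, t):
    y = forward(Ws, x)[-1]
    return sum((ti - yi) ** 2 for ti, yi in zip(t, y))

Ws = init_weights([2, 3, 1])     # 2 inputs, 3 hidden neurons, 1 output
x, t = [1.0, 0.0], [1.0]         # a single hypothetical training pair
before = squared_error(Ws, x, t)
for _ in range(200):             # assumed stop criterion: fixed step count
    backprop_step(Ws, x, t, alpha=0.5)
after = squared_error(Ws, x, t)
print(before, after)             # the error shrinks over the 200 steps
```

Because the list `Ws` is zero-based, `Ws[s]` plays the role of the slide's W with index s+1, and `ys[s]` is its input vector. Learning a full task such as XOR would additionally need bias inputs and repeated sampling from the whole set X, which this sketch leaves out.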

Conclusion • We have seen how a single-layer perceptron (SLP) represents binary functions • We have seen a learning algorithm for the SLP • We have seen a learning algorithm for the multi-layer perceptron, MLP (backpropagation, BP) • So, neurons can represent knowledge AND learn!
