Comp 3503 / 5013 Dynamic Neural Networks. Daniel L. Silver, March 2014
Outline • Hopfield Networks • Boltzmann Machines • Mean Field Theory • Restricted Boltzmann Machines (RBM)
Dynamic Neural Networks • See handout for image of spider, beer and dog • The search for a model or hypothesis can be considered the relaxation of a dynamic system into a state of equilibrium • This is the nature of most physical systems • Pool of water • Air in a room • The mathematics is that of thermodynamics • Quote from John von Neumann
Hopfield Networks • See handout
Hopfield Networks • Hopfield Network video intro • http://www.youtube.com/watch?v=gfPUWwBkXZY • http://faculty.etsu.edu/knisleyj/neural/ • Try these Applets: • http://lcn.epfl.ch/tutorial/english/hopfield/html/index.html • http://www.cbu.edu/~pong/ai/hopfield/hopfieldapplet.html
Hopfield Networks Basics with Geoff Hinton: • Introduction to Hopfield Nets • http://www.youtube.com/watch?v=YB3-Hn-inHI • Storage capacity of Hopfield Nets • http://www.youtube.com/watch?v=O1rPQlKQBLQ
Hopfield Networks Advanced concepts with Geoff Hinton: • Hopfield nets with hidden units • http://www.youtube.com/watch?v=bOpddsa4BPI • Necker Cube • http://www.cs.cf.ac.uk/Dave/JAVA/boltzman/Necker.html • Adding noise to improve search • http://www.youtube.com/watch?v=kVgT2Eaa6KA
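The relaxation idea from the Hinton videos above can be sketched in a few lines of NumPy. This is a minimal illustrative sketch, not code from the course: patterns are stored with the Hebbian rule, and recall repeatedly flips single units so the state settles into a local minimum of the energy E = -1/2 s^T W s. The pattern and network size are made-up examples.

```python
# A minimal Hopfield network sketch (illustrative; not from the slides).
import numpy as np

def train_hopfield(patterns):
    """Hebbian storage: W = (1/n) * sum over patterns of p p^T, zero diagonal."""
    n = patterns.shape[1]
    W = patterns.T @ patterns / n
    np.fill_diagonal(W, 0.0)
    return W

def recall(W, state, steps=100, rng=None):
    """Asynchronous updates: each flip never increases the energy,
    so the state relaxes toward a stored pattern (a local minimum)."""
    rng = rng or np.random.default_rng(0)
    state = state.copy()
    for _ in range(steps):
        i = rng.integers(len(state))
        state[i] = 1 if W[i] @ state >= 0 else -1
    return state

# Store one bipolar pattern and recall it from a corrupted copy.
p = np.array([1, -1, 1, -1, 1, -1, 1, -1])
W = train_hopfield(p[None, :])
noisy = p.copy()
noisy[0] *= -1                    # flip one bit
print(recall(W, noisy))           # relaxes back to p
```

With a single stored pattern every update pushes the state toward it, so the corrupted bit is repaired once unit 0 is selected.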
Boltzmann Machine • See handout • http://www.scholarpedia.org/article/Boltzmann_machine Basics with Geoff Hinton • Modeling binary data • http://www.youtube.com/watch?v=MKdvJst8a6k • BM Learning Algorithm • http://www.youtube.com/watch?v=QgrFsnHFeig
Limitations of BMs • BM learning does not scale well • This is due to several factors, the most important being: • The time the machine must be run to collect equilibrium statistics grows exponentially with the machine's size (number of nodes) • For each example – sample nodes, sample states • Connection strengths are more plastic when the units have activation probabilities intermediate between zero and one. Noise causes the weights to follow a random walk until the activities saturate (the variance trap).
Potential Solutions • Use a momentum term as in BP: wij(t+1) = wij(t) + ηΔwij(t) + αΔwij(t-1) • Add a penalty term to create sparse coding (encourage shorter encodings for different inputs) • Use implementation tricks to do more in memory – batches of examples • Restrict the number of iterations in the + and – phases • Restrict the connectivity of the network
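The momentum rule above can be sketched as follows. This is an illustrative sketch only: the learning rate η, momentum α, and the toy gradient are assumed values, not from the course.

```python
# Sketch of the momentum update w(t+1) = w(t) + eta*dw(t) + alpha*dw(t-1).
# eta, alpha and the gradient below are illustrative assumptions.
import numpy as np

def momentum_step(w, grad, prev_delta, eta=0.1, alpha=0.9):
    """The alpha term re-applies part of the previous update, keeping the
    weights moving in a consistent direction instead of a random walk."""
    delta = eta * grad + alpha * prev_delta
    return w + delta, delta

w = np.zeros(3)
prev = np.zeros(3)
grad = np.array([1.0, -1.0, 0.5])
for _ in range(3):        # repeated consistent gradients accelerate the update
    w, prev = momentum_step(w, grad, prev)
print(w)                  # [ 0.561  -0.561   0.2805]
```

Note how the effective step grows (0.1, 0.19, 0.271 for the first component) while the gradient stays constant.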
Restricted Boltzmann Machine • Recall = Relaxation • Visible units vi and hidden units hj are connected by weights wij (no visible-visible or hidden-hidden connections) • Σj = Σi wij vi , pj = 1/(1+e^-Σj) • Σi = Σj wij hj , pi = 1/(1+e^-Σi) • Example latent features: SF/Fantasy, Oscar Winner • Source: http://blog.echen.me/2011/07/18/introduction-to-restricted-boltzmann-machines/
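The recall-as-relaxation idea on the slide above can be sketched as alternating stochastic updates of the hidden and visible layers. A minimal sketch, assuming a tiny random weight matrix and a made-up 6-unit visible pattern (biases are omitted, as on the slides):

```python
# RBM recall = relaxation: alternate sampling h given v, then v given h.
# The weights and input pattern here are illustrative assumptions.
import numpy as np

def sigmoid(x):
    # p = 1 / (1 + e^(-Sigma)), the activation probability on the slides
    return 1.0 / (1.0 + np.exp(-x))

def sample_hidden(v, W, rng):
    p = sigmoid(v @ W)            # Sigma_j = sum_i w_ij v_i
    return (rng.random(p.shape) < p).astype(float), p

def sample_visible(h, W, rng):
    p = sigmoid(h @ W.T)          # Sigma_i = sum_j w_ij h_j
    return (rng.random(p.shape) < p).astype(float), p

rng = np.random.default_rng(0)
W = rng.normal(0, 0.1, size=(6, 2))   # 6 visible units, 2 hidden units
v = np.array([1., 1., 1., 0., 0., 0.])
for _ in range(5):                    # relaxation: bounce between layers
    h, _ = sample_hidden(v, W, rng)
    v, _ = sample_visible(h, W, rng)
print(v, h)
```

Each pass re-estimates one layer from the other, so the joint state drifts toward configurations the weights make probable.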
Restricted Boltzmann Machine • Learning ≈ Gradient Descent = Contrastive Divergence • Step 1: Update hidden units from the data: Σj = Σi wij vi , pj = 1/(1+e^-Σj) • Accumulate positive statistics: P = P + vihj • Source: http://blog.echen.me/2011/07/18/introduction-to-restricted-boltzmann-machines/
Restricted Boltzmann Machine • Learning ≈ Gradient Descent = Contrastive Divergence • Step 2: Reconstruct visible units from the hidden units: Σi = Σj wij hj , pi = 1/(1+e^-Σi) • Source: http://blog.echen.me/2011/07/18/introduction-to-restricted-boltzmann-machines/
Restricted Boltzmann Machine • Learning ≈ Gradient Descent = Contrastive Divergence • Step 3: Re-update hidden units from the reconstructed visible units • Accumulate negative statistics: N = N + vihj • Source: http://blog.echen.me/2011/07/18/introduction-to-restricted-boltzmann-machines/
Restricted Boltzmann Machine • Learning ≈ Gradient Descent = Contrastive Divergence • Step 4: Update weights: Δwij = <P> - <N> , wij = wij + ηΔwij • Source: http://blog.echen.me/2011/07/18/introduction-to-restricted-boltzmann-machines/
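The four Contrastive Divergence steps above combine into one CD-1 training step. A minimal sketch under stated assumptions: the toy data, learning rate, and network size are made up, biases are omitted as on the slides, and the positive/negative statistics use hidden probabilities rather than samples (a common implementation choice, not something the slides specify).

```python
# CD-1 sketch following the slides' four steps; data and hyperparameters
# are illustrative assumptions.
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def cd1_step(W, v0, eta, rng):
    # 1) Update hidden units from the data; P accumulates vi*hj
    p_h0 = sigmoid(v0 @ W)
    h0 = (rng.random(p_h0.shape) < p_h0).astype(float)
    P = np.outer(v0, p_h0)
    # 2) Reconstruct visible units from the sampled hidden units
    p_v1 = sigmoid(h0 @ W.T)
    v1 = (rng.random(p_v1.shape) < p_v1).astype(float)
    # 3) Re-update hidden units from the reconstruction; N accumulates vi*hj
    p_h1 = sigmoid(v1 @ W)
    N = np.outer(v1, p_h1)
    # 4) Update weights: delta_w = <P> - <N>
    return W + eta * (P - N)

rng = np.random.default_rng(0)
W = rng.normal(0, 0.1, size=(6, 2))
data = np.array([[1., 1., 1., 0., 0., 0.],
                 [1., 0., 1., 0., 0., 0.]])
for _ in range(100):
    for v in data:
        W = cd1_step(W, v, eta=0.1, rng=rng)
print(W.shape)                 # trained 6x2 weight matrix
```

The update nudges the weights so the model's reconstructions (the N statistics) look more like the data (the P statistics).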
Restricted Boltzmann Machine • RBM Overview: • http://blog.echen.me/2011/07/18/introduction-to-restricted-boltzmann-machines/ • Wikipedia on deep learning and RBMs: • http://en.wikipedia.org/wiki/Deep_learning • RBM Details and Code: • http://www.deeplearning.net/tutorial/rbm.html
Restricted Boltzmann Machine Geoff Hinton on RBMs: • RBMs and the Contrastive Divergence algorithm • http://www.youtube.com/watch?v=fJjkHAuW0Yk • An example of RBM learning • http://www.youtube.com/watch?v=Ivj7jymShN0 • RBMs applied to Collaborative Filtering • http://www.youtube.com/watch?v=laVC6WFIXjg
Additional References • Coursera course – Neural Networks for Machine Learning: • https://class.coursera.org/neuralnets-2012-001/lecture • ML: Hottest Tech Trend in the Next 3-5 Years • http://www.youtube.com/watch?v=b4zr9Zx5WiE