1 / 25

Decision making

Explore how the brain learns values to make decisions and maximize rewards in a complex world. Learn about dopamine, TD learning algorithms, and impacts of dopamine-related disorders. Discover the science behind reward processing and dopamine release.

zenaidaj
Download Presentation

Decision making

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Decision making

  2. How does the brain learn the values?

  3. The computational problem The goal is to maximize the sum of rewards

  4. The computational problem The value of the stateS1 depends on the policy If the animal chooses ‘right’ at S1,

  5. How to find the optimal policy in a complicated world?

  6. How to find the optimal policy in a complicated world? • If values of the different states are known then this task is easy

  7. How to find the optimal policy in a complicated world? • If values of the different states are known then this task is easy How can the values of the different states be learned?

  8. V(St) = the value of the state at time t rt = the (average) reward delivered at time t V(St+1) = the value of the state at time t+1

  9. The TD (temporal difference) learning algorithm where is the TD error.

  10. Dopamine

  11. Dopamine is good • Dopamine is released by rewarding experiences, e.g., sex, food • Cocaine, nicotine and amphetamine directly or indirectly lead to an increase of dopamine release • Neutral stimuli that are associated with rewarding experiences result in a release of dopamine • Drugs that reduce dopamine activity reduce motivation, cause anhedonia (inability to experience pleasure) • Long-term use may result in dyskinesia (diminished voluntary movements and the presence of involuntary movements)

  12. No dopamine is bad

  13. No dopamine is bad (Parkinson’s disease) • Bradykinesia – slowness in voluntary movement such as standing up, walking, and sitting down. This may lead to difficulty initiating walking, but when more severe can cause “freezing episodes” once walking has begun. • Tremors – often occur in the hands, fingers, forearms, foot, mouth, or chin. Typically, tremors take place when the limbs are at rest as opposed to when there is movement. • Rigidity – otherwise known as stiff muscles, often produce muscle pain that is increased during movement. • Poor balance – happens because of the loss of reflexes that help posture. This causes unsteady balance, which oftentimes leads to falls.

  14. Schultz, Dayan and Montague, Science, 1997

  15. 2 3 4 9 1 6 7 8 5 CS Reward Before trial 1: • In trial 1: • no reward in states 1-7 • reward of size 1 in states 8

  16. 2 3 4 9 1 6 7 8 5 CS Reward Before trial 2: In trial 2, for states 1-6 For state 7,

  17. 2 3 4 9 1 6 7 8 5 CS Reward Before trial 2: For state 8,

  18. 2 3 4 9 1 6 7 8 5 CS Reward Before trial 3: In trial 2, for states 1-5 For state 6,

  19. 2 3 4 9 1 6 7 8 5 CS Reward Before trial 3: For state 7, For state 8,

  20. 2 3 4 9 1 6 7 8 5 CS Reward After many trials Except for the CS whose time is unknown

  21. Schultz, 1998

  22. “We found that these neurons encoded the difference between the current reward and a weighted average of previous rewards, a reward prediction error, but only for outcomes that were better than expected”. Bayer and Glimcher, 1998

  23. Bayer and Glimcher, 1998

More Related