1 / 64

A temporally abstracted Viterbi algorithm (TAV)

A temporally abstracted Viterbi algorithm (TAV). Shaunak Chatterjee and Stuart Russell University of California, Berkeley July 17, 2011. Earth’s history – A timescale view. Widely varying timescales are pervasive in data Planning, simulation & state estimation

darrin
Download Presentation

A temporally abstracted Viterbi algorithm (TAV)

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. A temporally abstracted Viterbi algorithm (TAV) ShaunakChatterjeeand Stuart Russell University of California, Berkeley July 17, 2011

  2. Earth’s history – A timescale view • Widely varying timescales are pervasive in data • Planning, simulation & state estimation • More efficient if timescale information is cleverly exploited 4.5Ga 1Ma 10000 yrs 600 yrs 1 yr 2 days 1 min

  3. Images: berkeley.edu, wikipedia, food.com Where is Shaunak? Monday Tuesday Wednesday Thursday Friday Saturday Sunday Berkeley Berkeley Philadelphia Barcelona Barcelona Barcelona Barcelona Burger Cheese steak Paella Gazpacho Tapas Gazpacho Burger

  4. State time trellis Berkeley Philadelphia Montreal Toronto Barcelona Madrid Paris Marseille t=1 t=2 t=3 t=4 t=5 t=6

  5. The Viterbi algorithm – Viterbi, 1967 1 Berkeley 2 Philadelphia 3 Montreal 3 Toronto 4 Barcelona 4 Madrid 4 Paris 4 Marseille t=1 t=2 t=3 t=4 t=5 t=6

  6. The Viterbi algorithm – Viterbi, 1967 1 3 Berkeley 2 4 Philadelphia 3 4 Montreal 3 4 Toronto 4 6 Barcelona 4 6 Madrid 4 8 Paris 4 8 Marseille t=1 t=2 t=3 t=4 t=5 t=6

  7. The Viterbi algorithm – Viterbi, 1967 1 3 10 Berkeley 2 4 7 Philadelphia 3 4 15 Montreal 3 4 15 Toronto 4 6 17 Barcelona 4 6 18 Madrid 4 8 15 Paris 4 8 18 Marseille t=1 t=2 t=3 t=4 t=5 t=6

  8. The Viterbi algorithm – Viterbi, 1967 1 3 10 13 Berkeley 2 4 7 13 Philadelphia 3 4 15 15 Montreal 3 4 15 15 Toronto 4 6 17 10 Barcelona 4 6 18 11 Madrid 4 8 15 13 Paris 4 8 18 15 Marseille t=1 t=2 t=3 t=4 t=5 t=6

  9. The Viterbi algorithm – Viterbi, 1967 1 3 10 13 14 15 Berkeley 2 4 7 13 15 16 Philadelphia 3 4 15 15 16 17 Montreal 3 4 15 15 17 18 Toronto 4 6 17 10 11 12 Barcelona 4 6 18 11 12 13 Madrid 4 8 15 13 14 15 Paris 4 8 18 15 15 16 Marseille t=1 t=2 t=3 t=4 t=5 t=6

  10. The Viterbi algorithm – Viterbi, 1967 1 3 10 13 14 15 Berkeley 2 4 7 13 15 16 Philadelphia 3 4 15 15 16 17 Montreal 3 4 15 15 17 18 Toronto 4 6 17 10 10 11 12 Barcelona 4 6 18 11 12 13 Madrid 4 8 15 13 14 15 Paris 4 8 18 15 15 16 Marseille t=1 t=2 t=3 t=4 t=5 t=6

  11. The Viterbi algorithm – Viterbi, 1967 • O(N2T) by using dynamic programming • NT possible state sequences • Used in signal decoding, speech recognition, parsing and many other applications • For large N and T, this cost could be quite prohibitive • Every possible transition is considered • In some cases, many of these transitions are very unlikely to feature in the optimal path

  12. Abstraction Berkeley U.S.A. Philly North America Montreal Canada Toronto Barcelona Spain Madrid Europe Paris France Marseille t=1 t=2 t=3 t=4 t=5 t=6

  13. Coarse-to-fine dynamic programming (CFDP) – Raphael, 2001 Berkeley Philly Montreal Toronto Barcelona Madrid Paris Marseille t = 1 t = 2 t = 3 t = 4 t = 5 t = 6

  14. CFDP • Step 1: Find the most likely sequence in the current state-time trellis

  15. Coarse-to-fine dynamic programming (CFDP) – Raphael, 2001 Berkeley Philly Montreal Toronto Barcelona Madrid Paris Marseille t = 1 t = 2 t = 3 t = 4 t = 5 t = 6

  16. CFDP • Step 1: Find the most likely sequence in the current state-time trellis • Step 2: Refine along the most likely sequence

  17. CFDP Refinement • Node-based refinement Spain Europe France Node Refinement N.America N.America

  18. Coarse-to-fine dynamic programming (CFDP) – Raphael, 2001 Berkeley Philly Montreal Toronto Barcelona Madrid Paris Marseille t = 1 t = 2 t = 3 t = 4 t = 5 t = 6

  19. Coarse-to-fine dynamic programming (CFDP) – Raphael, 2001 Berkeley Philly Montreal Toronto Barcelona Madrid Paris Marseille t = 1 t = 2 t = 3 t = 4 t = 5 t = 6

  20. CFDP • Step 1: Find the most likely sequence in the current state-time trellis • Step 2: Refine along the most likely sequence • Step 3: Go to step 1 if step 2 performed any refinement; else terminate

  21. Coarse-to-fine dynamic programming (CFDP) – Raphael, 2001 Berkeley Philly Montreal Toronto Barcelona Madrid Paris Marseille t = 1 t = 2 t = 3 t = 4 t = 5 t = 6

  22. Coarse-to-fine dynamic programming (CFDP) – Raphael, 2001 Berkeley Philly Montreal Toronto Barcelona Madrid Paris Marseille t = 1 t = 2 t = 3 t = 4 t = 5 t = 6

  23. Coarse-to-fine dynamic programming (CFDP) – Raphael, 2001 Berkeley Philly Montreal Toronto Barcelona Madrid Paris Marseille t = 1 t = 2 t = 3 t = 4 t = 5 t = 6

  24. Coarse-to-fine dynamic programming (CFDP) – Raphael, 2001 Berkeley Philly Montreal Toronto Barcelona Madrid Paris Marseille t = 1 t = 2 t = 3 t = 4 t = 5 t = 6

  25. Coarse-to-fine dynamic programming (CFDP) – Raphael, 2001 Berkeley Philly Montreal Toronto Barcelona Madrid Paris Marseille t = 1 t = 2 t = 3 t = 4 t = 5 t = 6

  26. Coarse-to-fine dynamic programming (CFDP) – Raphael, 2001 Berkeley Philly Montreal Toronto Barcelona Madrid Paris Marseille t = 1 t = 2 t = 3 t = 4 t = 5 t = 6

  27. Cost bounds for abstract links • Cost of an abstract link should be a lower bound of the link refinements it encapsulates • Standard heuristic admissibility argument  Correctness

  28. Coarse-to-fine dynamic programming (CFDP) – Raphael, 2001 Berkeley Philly Montreal Toronto Barcelona Madrid Paris Marseille t = 1 t = 2 t = 3 t = 4 t = 5 t = 6

  29. Analyzing CFDP • Great when large portions of the state-time trellis are very unlikely • Leading to fewer refinements • An appropriate abstraction hierarchy is required

  30. An actual state trajectory Europe trip Sardinia Venice Milan Interlaken India trip Los Angeles road trip Yosemite road trip San Francisco Berkeley Stanford Jan May Sep Dec

  31. Persistence a.k.a. Timescales Europe trip Sardinia Venice Milan Interlaken India trip Los Angeles road trip Yosemite road trip San Francisco Berkeley Stanford Jan May Sep Dec

  32. A set of really good paths • Set 1: All paths within California for the entire month of April • Set 2: All paths that visit California and at least one other state in April • Cost(PathsApril-in-California) < Cost(PathsApril-in-1+-states) • |PathsApril-in-California | << | PathsApril-in-1+-states| • An abstraction scheme which can distinguish between these two sets of paths!

  33. Temporally abstract link • Each link encapsulates a set of paths at the specified abstraction level over a temporal interval [T1,T2] • Just specifying start and end points is pointless! Europe Europe N.America N.America T2 T1

  34. Links Europe Europe Direct links Paths that stay within N. America for the entire interval [T1,T2] N.America N.America T2 T1

  35. Links Europe Europe Cross links Paths that start in Europe at T1 and end in N. America at T2 N.America N.America T2 T1

  36. Links Europe Europe Re-entry links Paths that start and end in N. America at T1 and T2 respectively, but leave N.Americaat least once in that interval N.America N.America T2 T1

  37. Link Refinement • No longer refining nodes! • Two types of refinement • Direct links undergo spatial refinement • Cross and re-entry links undergo temporal refinement

  38. Europe Europe Europe Europe Spatial Refinement N.America N.America N.America N.America T1 T2 T1 T2 U.S.A. U.S.A. Canada Canada

  39. Europe Europe Europe Europe Europe Temporal Refinement N.America N.America N.America N.America N.America T2 T1 T2 T1 T’

  40. TAV algorithm • Identical to CFDP in structure • Step 1: Find the most likely sequence in the current state-time trellis • Step 2: Refine along the most likely sequence • Link refinement instead of node refinement • Step 3: Go to step 1 if step 2 performed any refinement; else terminate

  41. TAV example Europe Europe N.America N.America 0 T

  42. TAV algorithm - example

  43. TAV algorithm - example

  44. TAV algorithm - example

  45. TAV algorithm - example

  46. TAV algorithm - example

  47. TAV algorithm - example

  48. TAV algorithm - example

  49. TAV algorithm - example

  50. TAV algorithm - example

More Related