
Algorithmic information transfer and its practical use


Presentation Transcript


  1. Algorithmic information transfer and its practical use. B. Bauwens, L. Boullart, P. Santens

  2. Outline: Introduction (slides 3-8), Current approach (slides 9-17), Tests and results (slides 18-21)

  3. Introduction

  4. Problem setting. Given: two measured signals x, y. Asked: • Are the underlying sources cooperating? • Does x influence y? • Does y influence x?

  5. Example: how do different brain areas relate?

  6. Goals: the algorithm must infer causal relations • using little data • for complex interactions • with a notion of confidence • without parameters to be set by the end user • as objectively as possible

  7. Literature
  • Granger causality: predict x_t from x_{1…t-1} and a model M_x; then predict x_t from x_{1…t-1}, y_{1…t-1} and a model M_xy. If the second prediction is better, y influences x.
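The comparison can be sketched numerically. A minimal illustration (not the authors' implementation) that fits both autoregressive predictors by least squares; the lag order p and the toy signals are assumptions:

```python
import numpy as np

def granger_ratio(x, y, p=5):
    """Residual variance of predicting x_t from its own past plus y's past,
    divided by the residual variance using x's past alone.
    A ratio clearly below 1 suggests that y helps to predict x."""
    n = len(x)
    own = np.column_stack([x[p - k - 1:n - k - 1] for k in range(p)])
    both = np.column_stack([own] + [y[p - k - 1:n - k - 1] for k in range(p)])
    target = x[p:]

    def resid_var(design):
        coef, *_ = np.linalg.lstsq(design, target, rcond=None)
        return np.var(target - design @ coef)

    return resid_var(both) / resid_var(own)

# Toy check: y drives x with a one-step delay, so the first ratio should drop below 1.
rng = np.random.default_rng(0)
y = rng.standard_normal(2000)
x = 0.8 * np.roll(y, 1) + 0.2 * rng.standard_normal(2000)
print(granger_ratio(x, y), granger_ratio(y, x))
```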

  8. (Shannon) information transfer
  • Fit a probability distribution to (x_t, x_{t+d}, y_t).
  • Calculate IT(x ← y) = I(x_{t+d}; y_t | x_t).
  • Shuffle y to test significance.
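A minimal sketch of this estimate and its shuffle test, using a plug-in histogram estimator of the conditional mutual information; the bin count, delay d and number of shuffles are illustrative assumptions:

```python
import numpy as np

def cond_mutual_info(a, b, c, bins=8):
    """Plug-in estimate of I(a; b | c) from histogram counts on binned data."""
    def digitize(s):
        edges = np.linspace(s.min(), s.max(), bins + 1)[1:-1]
        return np.digitize(s, edges)
    a, b, c = map(digitize, (a, b, c))
    joint = np.zeros((bins, bins, bins))
    for i, j, k in zip(a, b, c):
        joint[i, j, k] += 1
    p = joint / joint.sum()
    p_ac = p.sum(axis=1)        # p(a, c)
    p_bc = p.sum(axis=0)        # p(b, c)
    p_c = p.sum(axis=(0, 1))    # p(c)
    cmi = 0.0
    for i in range(bins):
        for j in range(bins):
            for k in range(bins):
                if p[i, j, k] > 0:
                    cmi += p[i, j, k] * np.log2(p[i, j, k] * p_c[k] / (p_ac[i, k] * p_bc[j, k]))
    return cmi

def shannon_it(x, y, d=1, shuffles=200, seed=0):
    """IT(x <- y) = I(x_{t+d}; y_t | x_t), with a shuffle test: the p-value is the
    fraction of y-permutations whose conditional MI reaches the observed value."""
    rng = np.random.default_rng(seed)
    a, b, c = x[d:], y[:-d], x[:-d]
    observed = cond_mutual_info(a, b, c)
    surrogates = [cond_mutual_info(a, rng.permutation(b), c) for _ in range(shuffles)]
    return observed, float(np.mean([s >= observed for s in surrogates]))
```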

  9. Current approach

  10. Dependency test
  • Let d: B × B → N be such that (cf. the sum-m test) ∑_{x,y} m(x) m(y) 2^{d(x,y)} < 1, where m is the universal prior.
  • I(x;y) is a sum-m dependency test.
  • A dependency test d' dominates d iff for some constant c and all x, y: d'(x,y) + c > d(x,y), where c does not depend on x and y.
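The second bullet can be checked in one line; a sketch of the argument, assuming the coding theorem m(x) ≈ 2^{-K(x)} (equalities up to multiplicative constants):

```latex
\sum_{x,y} m(x)\, m(y)\, 2^{I(x;y)}
  \;\approx\; \sum_{x,y} 2^{-K(x)}\, 2^{-K(y)}\, 2^{K(x)+K(y)-K(x,y)}
  \;=\; \sum_{x,y} 2^{-K(x,y)} \;\le\; 1,
```

so I(x;y) satisfies the sum-m condition up to an additive constant.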

  11. Optimality of I(x;y)
  A test d in a set A of tests is universal for A if it dominates every test in A.
  • There exists no universal recursive dependency test.
  • Does an upper-enumerable dependency test exist? Does a lower-enumerable dependency test exist?
  • Consider the class of all functions that are enumerable using the halting sequence ξ. Its universal element is K(x) + K(y) − K(x,y | ξ) = I(x;y) + I(x,y; ξ).

  12. Consequence
  Assume M: B × B → B is a recursive function. If I(x;y | M(x,y)) is a sum-m dependency test, then I(x;y) − I(x;y | M(x,y)) > −I(x,y; ξ).  (1)

  13. Monotone-conditional algorithmic complexity
  K(x | y↑) = min { |p| : U(p, y_{1…k}) = x_{k+1} for k = 0…n−1, and U(p, y_{1…n}) = ε }
  K(x | y) <_c K(x | y↑) <_c' K(x)   (inequalities up to additive constants c, c')
  Algorithmic information transfer: IT(x ← y) = K(x) − K(x | y↑)
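Since K is uncomputable, any practical use replaces it with the length of a compressed description (this is what the compressors on slide 20 provide). A minimal sketch, not the authors' estimator, using zlib as a stand-in Lempel-Ziv compressor and approximating the conditional term by compressing the concatenation:

```python
import os
import zlib

def clen(data: bytes) -> int:
    """Approximate K(data) by the length of its zlib (Lempel-Ziv) compression."""
    return len(zlib.compress(data, 9))

def it_estimate(x: bytes, y: bytes) -> int:
    """Rough stand-in for IT(x <- y) = K(x) - K(x | y),
    with K(x | y) approximated by C(y + x) - C(y)."""
    conditional = clen(y + x) - clen(y)
    return clen(x) - conditional

# Toy check: an exact copy of a string is almost free given that string;
# an unrelated random string does not help to describe it.
a = os.urandom(4096)
b = os.urandom(4096)
print(it_estimate(a, a))   # large: a explains the copy almost completely
print(it_estimate(a, b))   # near zero: b carries no information about a
```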

  14. Decomposition of I(x;y)
  One has I(x;y) >_c IT(x ← y) + IT(y ← x), and
  I(x;y) = IT(x ← y) + IT(y ← x) + IT(x = y), with
  IT(x = y) = K(x | y↑) + K(y | x↑) − K(x,y) = I(p; q) − dI(x,y),
  0 < dI(x,y) < K(p | x,y) + K(q | x,y) < log |x|,
  where p and q are the minimal programs witnessing K(x | y↑) and K(y | x↑).
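With the convention IT(x ← y) = K(x) − K(x | y↑) from the previous slide, the equality is a direct cancellation:

```latex
\mathrm{IT}(x \leftarrow y) + \mathrm{IT}(y \leftarrow x) + \mathrm{IT}(x = y)
  = \bigl(K(x) - K(x \,|\, y{\uparrow})\bigr)
  + \bigl(K(y) - K(y \,|\, x{\uparrow})\bigr)
  + \bigl(K(x \,|\, y{\uparrow}) + K(y \,|\, x{\uparrow}) - K(x,y)\bigr)
  = K(x) + K(y) - K(x,y) = I(x;y).
```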

  15. Is IT(x ← y) as hard to calculate?
  • If x ~ m and y ~ m independently, then Prob{(x,y) : dI(x,y) > k} < c·k^3·2^{−k}.
  • If x ~ m and y ~ m independently, then IT(x ← y) and IT(y ← x) are computable from ξ_{1…k} with probability c·k^3·2^{−k}.
  ξ is the halting sequence (the domain of U).

  16. Comparison with the literature
  Granger causality can be stated as IT(x ← y | M(x,y)) = K(x | M(x,y)) − K(x | y↑, M(x,y));
  here IT(x ← y) = K(x) − K(x | y↑).
  In the autoregressive case, M(x,y) are the regression coefficients.
  • Not a sum-m dependency test (condition: K(M | x) + K(M | y) − K(M) > c).

  17. Differences
  • Predictability improvement = better data compression.
  • The complexity of the model is incorporated.
  Advantages:
  • Arbitrarily complex models can be evaluated in the same framework.
  • If the estimate of K is sufficiently good, we have a sum-m test.
  • The IT is more objective (one can use different models at the same time and compare them naturally).

  18. Results and Testing

  19. Testing
  Artificial data generated from:
  • coupled oscillators: dx/dt = ω_1 + e_1 sin(y − x) + n_1, dy/dt = ω_2 + e_2 sin(x − y) + n_2
  • the Rössler system
  Number of samples: 1 000, 10 000, 100 000.
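A minimal sketch of how such coupled-oscillator test data can be generated (Euler integration; the step size, frequencies, coupling strengths, noise level and the choice of sin(phase) as the measured signal are illustrative assumptions):

```python
import numpy as np

def coupled_oscillators(n=10_000, dt=0.01, w1=1.0, w2=1.3,
                        e1=0.0, e2=0.5, noise=0.1, seed=0):
    """Euler integration of  dx/dt = w1 + e1*sin(y - x) + n1,
                             dy/dt = w2 + e2*sin(x - y) + n2.
    With e1 = 0 and e2 > 0, x drives y but not the other way round."""
    rng = np.random.default_rng(seed)
    x = np.empty(n)
    y = np.empty(n)
    x[0] = y[0] = 0.0
    for t in range(n - 1):
        x[t + 1] = x[t] + dt * (w1 + e1 * np.sin(y[t] - x[t])) + noise * np.sqrt(dt) * rng.standard_normal()
        y[t + 1] = y[t] + dt * (w2 + e2 * np.sin(x[t] - y[t])) + noise * np.sqrt(dt) * rng.standard_normal()
    # Return a bounded observable of the phases as the measured signals.
    return np.sin(x), np.sin(y)

x_sig, y_sig = coupled_oscillators(n=10_000)   # 10 000 samples, one of the sizes tested
```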

  20. Methods
  Compressors were built using:
  • naive Bayesian prediction
  • linear regression
  • recurrent neural networks
  • support vector machines
  • the Lempel-Ziv algorithm
  • combinations of the above
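One simple way to read "combinations of the above" is to charge each string the shortest code any available compressor produces, plus a small cost for naming the winner. A hedged sketch of that idea with two stock compressors standing in for the learned predictors (the actual combination scheme behind the slide is not specified here):

```python
import bz2
import zlib

# Stand-ins for the predictive models listed on this slide; any compressor can be appended.
COMPRESSORS = [lambda b: zlib.compress(b, 9), lambda b: bz2.compress(b, 9)]

def combined_code_length(data: bytes) -> int:
    """Length of the best code among all compressors, plus one byte that
    records which compressor produced it."""
    return min(len(compress(data)) for compress in COMPRESSORS) + 1
```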

  21. Results
  Compared with Shannon information transfer:
  • For 10 000 and 100 000 samples there was perfect agreement.
  • For 1 000 samples most compressors showed a bias towards inferring causality from the simpler signal; the reason for this is not fully understood.
