Nonlinear visual coding from an intrinsic-geometry perspective

Nonlinear visual coding from an intrinsic-geometry perspective E. Barth* & A. B. Watson NASA Ames Research Center http://vision.arc.nasa.gov Supported by DFG grant Ba 1176/4-1 to EB and NASA grant 199-06-12-39 to ABW

Intrinsic dimensionality in 2D • i0D: constant in all directions • i1D: constant in one direction • i2D: no constant direction

i0D FT

i1D FT • e.g. straight lines and edges, gratings

i2D FT • e.g. corners, line ends, curved edges and lines

i0D i1D i2D

i0D i2D i1D

Intrinsic dimensionality in 3D • i0D: constant in all (space-time) directions • i1D: constant in 2 directions • i2D: constant in one direction • i3D: no constant direction

i0D FT

i1D FT • e.g. drifting spatial grating

i2D FT e.g. drifting corner, flashed grating

i3D FT e.g. flow discontinuities, flashed corners

Intrinsic dimensionality and motion • FT of (rigid) motion signal is in a plane

The visual input as a hypersurface luminance hypersurface Visualization of surfaces is easier:

Geometric view on intrinsic dimensionality

Curvature and motion (“plane” = “more than line but no volume”)

The Riemann tensor R • most important property of (hyper)surfaces • measures the curvature of the (hyper)surface • has 6 independent components in 3D • vanishes in 1D.

The Riemann tensor components are nonlinear combinations of derivatives, i.e., of linear filters with various spatio-temporal orientations.

Multiple representation of speed. R components and speed v

R and direction of motion q Multiple, distributed representation of direction.

Sectional curvatures

Directiontuningsof R components vertical motion horizontal motion

Barber pole Wallach, 1935

“abolished illusion” Kooi, 1993

Orthogonal orientation and direction tunings Analytical predictions based on R components Typical Type II MT neuron, macaque monkey Direction tuning Rodman & Albright, 1989 Orientation tuning

Multiple motions Analytical predictions based on R components Typical MT neuron, macaque monkey Recanzone, Wurtz, & Schwarz, 1997

Conclusion Hypothesis that a basic (geometric) signal property (the intrinsic dimensionality) is encoded in early- and mid-level vision explains • orientation selectivity (derivatives, and R2, R3) • endstopping (all R components are endstopped for translations) • velocity selectivity • direction selectivity • some global-motion percepts (by integration) • some properties reported for MT neurons. (Reference to 3D world of moving objects is not needed.)

Nonlinear visual coding from an intrinsic-geometry perspective

Nonlinear visual coding from an intrinsic-geometry perspective

Presentation Transcript

Cortical visual impairment: An educational perspective

Derivatives From an Accounting Perspective

From FPU to Intrinsic Localized Modes: An Odyssey in Nonlinear Science

Scrum from an Organization Perspective

Networking (from an OS perspective)

Customization From an ISD Perspective

Tax from an Auxiliary’s Perspective

SPM from an Avian Perspective

ParFUM from an Application Perspective

TIC from an Industry Perspective

Spam from an ISP perspective

Excel from an Audit Perspective

Intrinsic Localized Modes in Nonlinear Photonic Crystals

Biofuels: From An Information Perspective

An analysis from global perspective

Formulation from an Economic Perspective

CWE from an SME perspective

Visual Basic - Intrinsic Functions

Embodied Coding through Perspective

HTA from an Industry Perspective

Telö09 from an Internet Perspective

Scrum from an Organization Perspective