Control

Environment Actuator Motions Raw Sensor Input Robot states Actuators Sensing State Info Conditioned Sensor Input Observation Learning Trigger Control Learning Commands Behavior Controller Sensor/action pairs

A-D on sound card microphone Note: This process occurs for each microphone at a rate of 4 times a second. In order to smooth out the amplitudes associated with intermittent chirps, the average power is kept as a running average over 1 second. ¼ second of data Zero pad /* Sum up power in chirp. */ Sum = 0 For j = start freq, end freq Sum = FFTpower[j] + Sum End j FFT Average Power = Sum/chirp freq range

Left Sensor Right Sensor Sensor Values Sensor/Action FIFO Are changes in Sensor values extreme? Sensor/Action pairs Yes No Reactive Sub-system Non-Reactive Sub-system Single Commands Actuators

Left Sensor Right Sensor Sensor Values Sensor/Action FIFO Do sensor values indicate reactive action needed? Sensor/Action pairs Yes No Reactive Sub-system Inhibit Non-Reactive Sub-system Single Commands Actuators

Left Sensor Right Sensor Sensor Values Sensor/Action FIFO Are changes in Sensor values extreme? Sensor/Action pairs Yes No Action Sequence Library Reactive Sub-system Inhibit Non-Reactive Sub-system Single Commands Actuators

Left analog Maxnet Left indicator Left microphone value T(0) Center analog Center indicator Action T(0) Right microphone value T(0) Right indicator Right analog

Left micropone trend from times T(0) – T(-n) Left analog Maxnet Left indicator Center analog Center indicator Action T(0) Right indicator Right analog Right microphone trend from times T(0) – T(-n)

Left microphone trend from times T(0) – T(-n) Heading Adjustment spread over times T(0) – T(n) Right microphone trend from times T(0) – T(-n)

Left microphone value T(0) Heading Adjustment Right microphone value T(0)

Left microphone values T(0) – T(-n) Heading Adjustment spread over times T(0) – T(n) Right microphone values T(0) – T(-n)

Left microphone values T(0) – T(-n) Heading Adjustment T(0) Heading Adjustment T(1) Heading Adjustment T(2) Right microphone values T(0) – T(-n)

Left microphone values T(0) – T(-n) Heading Adjustment for times T(0) – T(n) Actions T(-1) – T(-n) Right microphone values T(0) – T(-n)

Left microphone values T(0) – T(-n) Left analog Maxnet Left indicator Actions T(-1) – T(-n) Center analog Center indicator Action T(0) Right indicator Right analog Right microphone values T(0) – T(-n)

L L L C C C R R R Left microphone values T(0) – T(-n) Left indicator Maxnet Center indicator Action T(0) Right indicator Maxnet Actions T(-1) – T(-n) Action T(1) Maxnet Action T(2) Maxnet Action T(3) Right microphone values T(0) – T(-n)

Robot states Is the core routine running and is there enough memory to learn from? Are goal states being met? Is reactive system being called often? yes no yes Trigger Reactive and non-reactive learning systems Trigger Reactive and non-reactive learning systems

Sensor/Action FIFO Reactive Memory Preparation Non-Reactive Memory Preparation Reactive Memory Non - reactive Memory Reactive Behavior Generation Non-reactive Behavior Generation Output of process is a trained reactive NN Output of process is a trained non-reactive NN

Reactive Memory Intensity Filter Negative and Positive Example Set Creation Correct Action Marking GA trains feed-forward NN to select the correct action given the sensor input. Output of process is a trained NN

Non - reactive Memory Kohonen SOFM Codebook Codebook mirror process GA trains feed-forward NN using codebook and fitness function that maximizes sensor energy and minimizes changes in direction. Output of process is a trained NN

Mirror Process Sensor/Action Memory Reactive Memory

Sensor/Action Memory pairs Correlation Cluster Set Creation Reactive Memory Cluster Set Recent past Memory from T0 to T-n Is recent memory a member of the Cluster Set? Sensor FIFO Yes No Update hit count Add recent memory to Non-reactive Memory, mirror, and update miss count Recent memory Non - reactive Memory

0)LIRA 0)LIRA 0)LIRA 0)LIRA 0)LIRA 1)LIRA 1)LIRA 1)LIRA 1)LIRA 1)LIRA 2)LIRA 2)LIRA 2)LIRA 2)LIRA 2)LIRA 1)LIRA 1)LIR 2)LIRA 2)LIRA Recent Memory Kohonen SOFM Codebook SOFM Codebook Entry Close match found in SOFM codebook Intensity value from LIRA(0) used in fitness function of GA Feed forward NN Action New action replaces old action. New situation made in scratch area.

0)LIRA 0)LIRA 0) A 1)LIRA 1)LIRA 1) A 2)LIRA 2) A 2)LIRA Recent Memory RBF Network RBF Codebook Target for RBF entry (j) is RBF entry (m) RBF entry (j) Input to NN is past values from RBF codebook entries. GA uses actions from target entry to train NN. Feed forward NN

Control

Control

Presentation Transcript

CONTROL MEASUR2ES Vector control

SELF-CONTROL -- IMPULSE CONTROL

Control

Control Structures: Iterative Control

Control:

Control St Control St Control St

control

Control

Control

Control

Control

Control

Topology Control – power control

Control

control

CONTROL

Control

Control

Version Control, Revision Control