1 / 29

Marine Institute of Ireland: Streams architecture and deployment deep dive

Marine Institute of Ireland: Streams architecture and deployment deep dive. Krishna Mamidipaka, krishnag@us.ibm.com Roger Rea, rrea@us.ibm.com. Housekeeping.

xerxes
Download Presentation

Marine Institute of Ireland: Streams architecture and deployment deep dive

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Marine Institute of Ireland: Streams architecture and deployment deep dive Krishna Mamidipaka, krishnag@us.ibm.com Roger Rea, rrea@us.ibm.com

  2. Housekeeping We value your feedback - don't forget to complete your evaluation for each session you attend and hand it tothe room monitors at the end of each session Overall Conference Evaluation will be providedat the General Session on Friday Visit the Expo Solutions Centre Please remember this is a 'non-smoking' venue! Please switch off your mobile phones Please remember to wear your badge at all times

  3. Disclaimer The Information regarding potential future products is intended to outline our general product direction and it should not be relied on in making a purchasing decision. The information mentioned regarding potential future products is not a commitment, promise, or legal obligation to deliver any material, code or functionality. Information about potential future products may not be incorporated into any contract. The development, release, and timing of any future features or functionality described for our products remains at our sole discretion.

  4. Agenda Solution overview Why InfoSphere Streams? Solution details

  5. + + = Real Time Marine Mammal Position • Filter wind & wave noise • Model Marine Mammal environment • Correlate to Galway Bay ecosystem Analytics & Sensors InfoSphere Streams Advanced Acoustical Analytics

  6. IBM InfoSphere Streams v1.2 Front Office 3.0 Development Environment Runtime Environment Toolkits & Adapters Eclipse IDE StreamSight Stream Debugger RHEL v5.3 or v5.4 x86 multicore hardware InfiniBand support Up to 125 servers Connectors to data sources Operator Library Financial Toolkit Mining Toolkit

  7. Streams Programming Model Process Output Input Streams Processing Language Platform optimized compilation

  8. Streams Runtime Illustrated Streams Data Fabric Transport X86 Processor X86 Processor X86 Processor X86 Processor X86 Processor X86 Box X86 Blade FPGA Blade X86 Blade Cell Blade Optimizing scheduler assigns operators to processing nodes, and continually manages resource allocation Runs on commodity hardware – from single node to blade centers to high performance multi-rack clusters Processing Element Container Processing Element Container Processing Element Container Processing Element Container Processing Element Container

  9. Streams Runtime Illustrated Streams Data Fabric Transport X86 Processor X86 Processor X86 Processor X86 Processor X86 Processor Can adapt to changes in resources, workload, data rates Processing Element Container Processing Element Container Processing Element Container Processing Element Container Processing Element Container

  10. SmartBay Overview Multi year joint development between the Marine Institute, IBM and others Project 1:Next generation integrated cyberphysical environment for sensors in environmental monitoring and management Project 2 : Integrated data and information environment with innovative human interface and advanced visualization capabilities supporting multidisciplinary users in environmental monitoring management and sustainable energy Project 3 : Advanced device monitoring and management for remote sensors and data collection/aggregation platforms Project 4: Real-time distributed stream analytical fabric for environmental monitoring and management

  11. Project 4 – Cetacean Streaming Analytics Goals Real-time acoustic analysis of hydrophone data Initial goals to use echo location clicks to identify Cetacean Species (and sub-species) Count Distance Extended Goal Individual animal detection & recognition

  12. Hydrophone Audio High frequency Requirement to sample up to 300 KHz Bottlenose dolphins produce directional, broadband clicks in sequence. Each click lasts about 50 to 128 microseconds. Peak frequencies of echolocation clicks are about 40 to 130 kHz. Medium resolution (16bit mono) but can be higher Contains environmental (natural and artificial) noise sound of weather on ocean surface, sea bed activity artificial noise from marine traffic (propellers) Seismic surveys Not normalised (significant drift)

  13. Just over .5 second of data Click periodicity varies based on activity e.g. navigation, hunting

  14. Species Identification In simpler terms “Click Detection” and “Click Profiling” Three primary stages in operation Pre-click detection establishing a click “hint” Clean Up / Dynamic Filtering isolating key frequencies Frequency and Time Based Click Profiling / Detection Arriving at a decision over species based on matching to known characteristics

  15. Dynamic Pre-Click Detection – a click “hint” An algorithm to detect “potential” clicks in the acoustic data By calculating a rolling average of the incoming sound pressure / intensity level we can then dynamically change the max / min thresholds used to for checking for potential clicks, holding above a given low threshold for a certain duration Max Min

  16. Pre-Click Detection Red – unfiltered, Green – Filtered, Blue – Click “hints”

  17. Clean Up / Dynamic Filtering Number of factors effecting the intensity / sound pressure of the received acoustic signal Water temperature can change significantly above / below any thermocline monitored via the deployment buoy Salinity maps are available can also be monitored but does not shift significantly Species Most significantly the Distance of source from the hydrophone Necessary to have understanding of the sound pressure level and “species hint” (based on mean frequency) to apply appropriate filter Specific filter applied based on industry recognised sound pressure lookup adjusted for distance

  18. Click Detection & Profiling Moving from a species “hint” to a more robust approach requires a series of checks to confirm a click and try to identify the species* Comparison of the Relative energy in two different frequency bands Peak spectral frequency in the click The width of the main frequency peak (based on 80% of the energy) The duration of the click * Credit Marjolaine Caillat, SMRU, 2005

  19. FFT Most of these techniques require that we provide both time domain and frequency domain for the input signal FFT normally computed across a “window” of samples returning the frequency distribution of the time based signal Window size is governed by the number of samples in a click rounded up to the nearest factor of 2 There is a computational overhead in running FFT on each click so minimising the window size is of benefit

  20. Peak spectral frequency in the click

  21. Dynamic Filtering Click Profiling and Detection – 4 Criteria WAV Decoder SPL in dB is calculated using the inverse square Law High = 1 meter Medium= 5 meter Low = 20 meters FFT (512 Window) Mean Frequency 175 dB Low order BP Filter Band Energy Pre-click Detector High Pass Filter (remove < 10 kHz) Low Pass Filter (remove > 10 kHz) Peak Position and Width 161 dB Medium order BP Filter Dynamically Change -Click Detector. Parameters Depending on average SPL then Assign Click # *To Do Low Frequency Species Click Detection Click Length 151 dB High order BP Filter Porpoise Fo 131 kHz Calc Sound Pressure Level (inv. sq.) Click Detection 230 dB Low order BP Filter Click Counter FFT (4096 Window) Common Dolphin Fo = 80 kHz Calc Sound Pressure Level (inv. sq.) Calc. mean frequency “Species Hint” and split 216 dB Low order BP Filter 210 dB Low order BP Filter

  22. Determining Counts Animal counts are based on building click trains through correlation Correlate from click to click based on relative energy and peak frequency Additional rules need applying to account for “re-visits” Credit - Josefin Starkhammara, Johan Nilssona, Mats Amundinb, Tomas Janssona, Monica Almqvista, Hans W. Perssona

  23. Visualisation - Spectrogram There are a number of metrics planned for the acoustic work through the SmartBay portal Species breakdown Species count Distance from hydrophone A “basic” spectrogram (sound intensity by frequency and time) is shown below

  24. Performance Our initial approach was to treat each audio sample (int) as a single tuple (300k per second) Allowed us to implement all of the Streams operators quickly to validate the end to end processing Recently moved to integer list of 128 samples per tuple up to the pre-click detector Followed by integer list of the total click per tuple And Integer list of the frequency spectrum for a complete click per tuple Moving to multiple samples per tuple has resulted in approx. 20x performance improvement (faster than real time) Currently testing on cluster of 4 x xSeries 3950 (8GB Ram, 2xQuad Core)

  25. Next Steps Implement development GUI to allow tuning of the detection values (by species) Reference dB, salinity, temperature in addition to intensity to calculate distance Additional challenges down to frequency of the echo location click as produced by each species – high frequencies don't tend to travel very far Variations in the echo location click spread Porpoise click is very narrow and directional from the melon and out in front of the animal Add in the ability to check for low frequency species such as Fin Whales, Hump Back Whales etc.

  26. Futures – Large Scale Ocean Energy Impact Analysis

  27. IBM InfoSphere Streams directions Front Office WebSphereBusinessEvents Tools Streams Studio enhancements Video/audio analytics Text/unstructured analytics Streams Processing Language v2 Native XML support Runtime High Availability Security enhancements Unicode support Installation enhancements Adapters Cognos Now WebSphere MQ RSS feeds Mashup Hub WebSphere Business Events Oracle SQL Server Cognos InfoSphere Warehouse 8BI IBM MashupHub Millions of events per second Millisecond Latency Existing business information Data in motion All statements regarding IBM's plans, directions, and intent are subject to change or withdrawal without notice. Any reliance on these statements are at the relying party's sole risk and will not create any liability or obligation for IBM.

  28. InfoSphere Streams sessions Questions?

More Related