Audio Taken Seriously; The present and future of audio at Microsoft
Ken Greenebaum, kgreene@microsoft.com
Internet Platforms and Tools Division, Microsoft Corporation
ICAD Industry Panel

Slides and other materials online: http://www.research.microsoft.com/research/graphics/kgreene/icad

Overview
• Today
  • Solid media foundations (DirectX, ActiveMovie)
• Soon
  • Advanced media (ActiveAnimation, Whisper/Whistler)
• Tomorrow
  • Conversational interfaces

Today: DirectSound
http://www.microsoft.com/mediadev/audio/iaud.htm
• Streaming audio
• Reasonable latency
• Input (soon)
• Device independence
• Mixing of audio from multiple applications
• DSound3D

Today: ActiveMovie
• Graph-based media architecture
• Movie playback
• Movie record (soon!)
• Open filter API
• Audio plugin technology

Today: NetShow
http://www.microsoft.com/netshow/
• Streaming network audio/video
• Multicast audio using RTP (Real-time Transport Protocol)
• ASF file format, conversion, editing tools
• NT Server

Today: Interactive Music
(Formerly BlueRibbon’s AudioActive)
• Intelligent interactive music
• Composes and delivers music
• Based on an expert system
• Human composer ‘authors’ templates
• Music always sounds fresh and original
• Look for it: PowerPoint ’97, MSN Riff

Soon: DirectMusic
Contact: craighs@microsoft.com
• Consistent playback of MIDI music
• Internet support for music
• DLS (Downloadable Sounds) sample sets
• Optional software MIDI synth
• Internet MIDI jamming?

Soon: “Appelles”
• Expect an announcement soon!
• Animation Description Language
• Functional paradigm
• Media integration
• Implicit time
• Language integration (Java)
• Enable sophisticated Web animation

Appelles Audio Capabilities:
• All audio types orthogonal
• Parametric synthesis
• MIDI
• AudioActive music synthesis
• Streaming audio
• PCM audio
• 3D spatialized sound embedded in geometry

Soon: “Talisman” Audio
http://www.microsoft.com/hwdev/devdes/talisman.htm/
• Hardware acceleration of:
  • DSound/DSound3D
  • Echo cancellation
  • ActiveMovie filter accelerator
  • 32-bit mixer
  • DLS-compatible synthesizer
  • Modem/telephony

Soon: “Whisper”
http://www.research.microsoft.com/research/srg/
• Windows Highly Intelligent Speech Recognizer
• Based on Sphinx-II
• Continuous speech recognition
• Speaker independent
• Context-free grammar decoding

Soon: “Whistler”
http://www.research.microsoft.com/research/srg/
• Trainable text-to-speech synthesizer
• Training from human speech; maintains:
  • Natural prosody
  • Characteristics of the original human speaker
• Emotional control
• Uses NLP technology to parse text

Tomorrow: Conversational Interfaces
• Motivation:
  • Given the choice, people communicate by speech
  • People prefer natural language over ‘command languages’
  • Anthropomorphism is unavoidable with spoken interaction

Persona Project
http://www.research.microsoft.com/ui/persona/home.htm/
• Conversational assistant as UI
• Spoken conversation (speech recognition and synthesis)
• Natural language (in limited domains)
• Assistant with rich visual presence
• Simulates verbal and non-verbal cues

Here’s Peedy and Gene:

Conclusion:
• Microsoft is:
  • Taking media very seriously
  • Offering a solid foundation today
  • Designing the future