270 likes | 436 Views
Creating Speaking Web Pages: The Text-to-Speech Integrated Development Environment (TTS-IDE). David C. Gibbs Department of Mathematics and Computing University of Wisconsin-Stevens Point Stevens Point, WI 54481 David.Gibbs@uwsp.edu. Introduction.
E N D
Creating Speaking Web Pages: The Text-to-Speech Integrated Development Environment (TTS-IDE) David C. Gibbs Department of Mathematics and Computing University of Wisconsin-Stevens Point Stevens Point, WI 54481 David.Gibbs@uwsp.edu
Introduction • University of Wisconsin - Stevens Point (USA) • Department of Mathematics and Computing • Web Speech Research Group • Undergraduate Computing students WBE 2007 March 16, 2007 Chamonix, France
This Presentation • What motivated development of the TTS-IDE? • What? Browsers speak? • Which browsers? • How do browsers speak? • How do I create a speaking presentation? WBE 2007 March 16, 2007 Chamonix, France
The Motivation • Online Course in JavaScript programming : Spring 2004 • Multiple learning styles, as applied to online instruction • Text presentation • Online discussion • Graphics • Audio WBE 2007 March 16, 2007 Chamonix, France
The Motivation, cont’d. • First three are “easy” (text, discussion, graphics) • Audio delivery (then) was problematic • PowerPoint, saved as HTML • large files – inaccessible to dial-up users • Clumsy to edit, maintain WBE 2007 March 16, 2007 Chamonix, France
The Motivation, cont’d. • Difficulties with audio • Recording file size 0.5 MB/minute • Cannot easily edit the text • Updating presentations essential in technology disciplines WBE 2007 March 16, 2007 Chamonix, France
A timely coincidence? • Opera introduced “speaking browser” – March 2004 • Investigated Text-To-Speech (TTS) WBE 2007 March 16, 2007 Chamonix, France
A Possible Solution • Speech Synthesis • “The process of automatic generation of speech output from data input which may include plain text, marked up text or binary objects.” (w3.org, 2004) • For this paper, speech synthesis within a web browser WBE 2007 March 16, 2007 Chamonix, France
Competing Technologies • Two HTML-like markup languages • Microsoft SALT • Speech Application Language Tags • VoiceXML from W3C WBE 2007 March 16, 2007 Chamonix, France
Microsoft SALT:Speech Application Language Tags • Uses O.S. “default voice” • XP ships with Microsoft Mary, Mike, Sam, LH Michael and Michelle • Purchase voices: NeoSpeech Kate and Paul ($30 US street price) • DEMONSTRATION – speech properties dialog (control panel) • Browser use requires download • voice add-in for I.E. (65MB) WBE 2007 March 16, 2007 Chamonix, France
VoiceXML • Opera Browser (v8 and beyond) • Partnered with IBM ViaVoice • Download Opera – follow Voice instructions • DEMONSTRATION • Speak the text on any page – select text and right-click | “Speak” WBE 2007 March 16, 2007 Chamonix, France
Code SamplesHello World: html doc • <html> • <head> • <title>Hello World</title> • </head> • <body> • Hello World! <!-- displayed on the web page --> • </body> • </html> WBE 2007 March 16, 2007 Chamonix, France
Code SamplesHello World: SALT doc (p. 1 of 2) • <html xmlns:salt="http://www.saltforum.org/2002/SALT"> • <head> • <title>HelloWorld</title> • <!-- SALT Add-in to Internet Explorer object --> • <object id="SpeechTags" CLASSID="clsid:33cbfc53-a7de-491a-90f3-0e782a7e347a" VIEWASTEXT> • </object> • <!-- salt: Importing the namespace for implementation --> • <?import namespace="salt“ implementation="#SpeechTags" /> • </head> WBE 2007 March 16, 2007 Chamonix, France
Code SamplesHello World: SALT doc (p. 2 of 2) • <body onload="hello.start()"> • <salt:prompt id="hello"> • Hello World <!-- this text is spoken --> • </salt:prompt> • Hello World! <!-- text displayed on the web page --> • </body> • </html> SALT-HelloWorld.htm WBE 2007 March 16, 2007 Chamonix, France
Code SamplesHello World: VoiceXML doc (p. 1 of 2) • <!DOCTYPE html PUBLIC "-//VoiceXML Forum//DTD XHTML+Voice 1.2//EN“ "http://www.voicexml.org/specs/multimodal/x+v/12/dtd/xhtml+voice12.dtd"> • <html xmlns=http://www.w3.org/1999/xhtml xmlns:ev = "http://www.w3.org/2001/xml-events"> • <head> • <title>Hello World</title> • <form xmlns="http://www.w3.org/2001/vxml" id="sayHello"> • <block> • Hello World! <!-- this text is spoken --> • </block> • </form> • </head> WBE 2007 March 16, 2007 Chamonix, France
Code SamplesHello World: VoiceXML doc (p. 2 of 2) • <body ev:event="load" ev:handler="#sayHello"> • Hello World! <!-- text displayed on the web page --> • </body> • </html> OPERA-HelloWorld.xml (open the Opera browser) WBE 2007 March 16, 2007 Chamonix, France
Text-to-Speech IDE WBE 2007 March 16, 2007 Chamonix, France
What does the IDE do? (SALT) • Modifies existing page by adding the necessary tags • Creates the salt namespace <html xmlns:salt="http://www.saltforum.org/2002/SALT"> • Adds SALT Add-in <object id="SpeechTags" CLASSID="clsid:33cbfc53-a7de-491a-90f3-0e782a7e347a" VIEWASTEXT> </object> • Adds namespace <?import namespace="salt“ implementation="#SpeechTags" /> • Adds SALT prompt tags around “notes” <salt:prompt id=“lecture"> WBE 2007 March 16, 2007 Chamonix, France
Preparatory tasks (I.E.) • Download and install the Internet Explorer Speech Add-in • Download the TTS-IDE WBE 2007 March 16, 2007 Chamonix, France
Creating an “Online Lecture” • Run the TTS-IDE • Add the text to be spoken • Convert the page to use TTS • Example: my home page WBE 2007 March 16, 2007 Chamonix, France
Creating an “Online Lecture”:from PowerPoint slides • Open the PPT presentation in PowerPoint • Then save the PowerPoint file as a Web page (File | Save as Web Page). • From the TTS-IDE, open the <filename>.htm file. WBE 2007 March 16, 2007 Chamonix, France
Creating an “Online Lecture” • Add the text to be spoken to each page. • Convert the page. • Test the pages. • Copy the pages to the course website or CD. WBE 2007 March 16, 2007 Chamonix, France
Demonstration • Example file: E:\WBE2007-SpeakingBrowserPPT.ppt WBE 2007 March 16, 2007 Chamonix, France
Further Work on TTS-IDE • Speech Recognition • For basic navigation • “Next Page” “Previous Page” “Links” • Custom Voices • FestVox, Carnegie Mellon • Additional Language Support • Client browser, speech engine WBE 2007 March 16, 2007 Chamonix, France
Further Work • Speech Recognition • Speaking and Listening Browser • Hands-free browsing (via speech recognition) • RSS news feeds, by topic WBE 2007 March 16, 2007 Chamonix, France
Conclusion – TTS-IDE • Add text to create speaking online “lectures.” • Can use HTML or PPT – saved as web • Can create for Internet Explorer (SALT) or Opera (VoiceXML) WBE 2007 March 16, 2007 Chamonix, France
Contact Information This presentation and download of IDE (available 2007/03/26) http://www.uwsp.edu/cis/dgibbs/WBE2007/ e-Mail dgibbs@uwsp.edu WBE 2007 March 16, 2007 Chamonix, France