660 likes | 828 Views
Verifying Interactive Web Programs. Daniel R. Licata Shriram Krishnamurthi Brown University. Popular Press. Quote:
E N D
Verifying Interactive Web Programs Daniel R. Licata Shriram Krishnamurthi Brown University
Popular Press Quote: But when I clicked on the National [car rental] price […], the site responded with this message: “You have back-buttoned too far”. This was my first experience with “back-button” as a verb. […] Since that was patently untrue, I decoded its true meaning: “We ran out [of cars]”. –M. Slatalla, New York Times, 2003-07-17
A Headache for Companies • Minor problem: Users might get booked into the wrong hotels, onto the wrong flights, etc. • Major problem: People might embarass you in newspapers and in public talks
The Orbitz Property • Orbitz Property: the user should receive a reservation at the hotel that was displayed on the page he submitted • In other words, the result does not depend on the page on which you did not click “Reserve” Should all sites have this property?
Question What would Amazon want?
The Amazon Property Amazon property: at the end, every book the user added to his shopping cart is actually in his shopping cart These properties are • not fixed in number • temporal in nature model checking
Model Checking • From the source code of a program, generate a model that captures the behaviors of interest • Consume properties written by the developer • Automatically check whether or not the model satisfies the properties
Model Checking • From the source code of a program, generate a model that captures the behaviors of interest • Consume properties written by the developer • Automatically check whether or not the model satisfies the properties
Modelling Web Programs Model = control-flow graph (CFG) What would a model of Orbitz look like?
Modelling Orbitz display hotel list set chosen use chosen to compute displayed use chosen to compute reserved display reservation display details for displayed
User Operations add Control Flow • The browser's back-button introduced control flow not present in the original CFG • Other browser operations do the same How many operations do today's browsers provide?
One Browser How can we model all of these operations? Alt+Tab
User Operation Calculus • Express all browser operations in terms of primitive user operations: • submit form to server • switch to previously-visited page [Graunke et al., 2003] • Only need to account for these two operations' control flow
Our Model: the WebCFG • submit corresponds to program's control flow Already in the CFG • switch permits returning to any previously-visited Web-interaction point Add edges from each Web-interaction node to the successors of all the others (WebCFG)
The Orbitz CFG display hotel list set chosen use chosen to compute displayed use chosen to compute reserved display reservation display details for displayed
The Orbitz WebCFG display hotel list set chosen use chosen to compute displayed use chosen to compute reserved display reservation display details for displayed
Model Checking • From the source code of a program, generate a model that captures the behaviors of interest • Consume properties written by the developer • Automatically check whether or not the model satisfies the properties
Properties We want to state properties about Web pages
<html> <body bgcolor=yellow> <table> <td> <p> Residence Inn by Marriot Charleston Downtown ... </html> Properties Web pages are written as HTML source
Properties We want to reason about Web page texts <html> <body bgcolor=yellow> <table> <td> <p> Residence Inn by Marriot Charleston Downtown ... </html>
Properties How can we associate these texts with the corresponding HTML source? <html> <body bgcolor=yellow> <table> <td> <p> Residence Inn by Marriot Charleston Downtown ... </html>
Relating Web Page Content to Source • Parse the text? Too hard • Static-distance coordinates? Too brittle What else can we do?
<html> <body bgcolor=yellow> <table> <td> <p id=”reserved”> Residence Inn by Marriot Charleston Downtown ... </html> Relating Web Page Content to Source Capitalize on Cascading Style Sheet (CSS) ID tags!
Relating Web Page Content to Source • If the tag is in the HTML, it must be present in the source of the program that generates the page • This relates Web page text to the Web program source expression that generates it
Annotating the WebCFG Annotate each WebCFG state with the propositions true in that state <html> <body bgcolor=yellow> <table> <td> <p id=”reserved”> Residence Inn by Marriot Charleston Downtown ... </html> generate reservation page generate reservation text tag=reserved
Defining our Property Language • The annotated WebCFG describes the set of traces that potentially occur • The developer writes an automaton accepting the set of traces that should occur • Verification is containment of the former in the latter [Vardi and Wolper, 1986]
Example Property Password-Page Property: Before reaching an access-controlled page, the user must go through a password page tag=password-entry 2 1 violation tag=access-controlled Note: In properties, tags label transitions