Persisting Information Across Application Executions

Persisting Information Across Application Executions Derek BrueningDetermina, Inc.

Dynamic vs. Static Program Analysis • Dynamic analysis has advantages: • Access to entire app: libraries, generated code • Visibility on exact use cases (data used and control flow followed) • Applicable to any binary • But…costly to do • Analysis cost is part of the runtime of the app • Analysis is applied every time the app is run

Analysis Challenges • Monitoring values of application variables is performance-intensive • Transparent instrumentation expensive on x86 • May need to monitor more than just function parameters • Difficult to obtain enough data from short-lived applications

Solution: Persistent Data • Store results from prior application executions on disk • Instrument different part of application in each run to keep costs down • Combine results of each new run with stored data • Once have enough data for analysis, send to central server

Common Persistence Infrastructure • Underlying execution engine can also benefit from persistence • Can share the same validation and consistency models to ensure persisted state corresponds to current application code and behavior

Managed Program Execution Engine • Manipulate, at runtime, every instruction an arbitrary application executes, with: • Minimal performance penalty • Full transparency • Exports interface for building custom tools • No modifications to the hardware, operating system, or application • Foundation of Determina Memory Firewall

Native Execution APPLICATION executable library A library B hardware platform

Execute From Code Cache APPLICATION executable library A library B EXECUTION ENGINE code cache hardware platform

Managed Program Execution Engine trace selector basic block builder START dispatch context switch BASIC BLOCK CACHE TRACE CACHE indirect branch lookup non-control-flow instructions non-control-flow instructions indirect branch stays on trace?

Non-Amortized Overhead • In steady-state, overhead is amortized • Overhead does show up in non-steady-state: • Application start-up • Short-lived applications • Extensive new code execution • Not an issue on servers (long-running apps) • A major issue on desktops • Startup time is directly visible and perceived as a slowdown

Traces • Combine frequently executed sequences of basic blocks into traces • Superior code layout • Eliminate indirect branches • Traces provide ~10% performance boost on longer-running applications • Will they help startup performance, with its once-executed code?

Solution: Persistent Code Cache • Goals: • Improve start-up performance • Support process-shared code caches • Challenges: • Re-design data structures to support persistence • Efficiently validate persisted files • Safely allow persistent file updates

Prototype Implementation • Supports both no-traces mode and trace building mode • Persists at end of run to application-specific files, loaded in for the next run • Does not support relocation • Validates only the module base

Per-Module Caches APPLICATION executable library A library B EXECUTION ENGINE cache exe cache libA cache libB hardware platform

Persistent Caches APPLICATION executable library A library B PERSISTED CACHES cache exe EXECUTION ENGINE cache exe cache libA cache libB cache libA cache libB hardware platform

Summary • Persisting information across application executions has many advantages • Reduce overhead • Enable inter-process sharing • Enable accumulation of data from multiple runs for broader view of application with less instrumentation or in presence of varying workloads • Many challenges remain

Persisting Information Across Application Executions

Persisting Information Across Application Executions

Presentation Transcript

Integrating Information Across Operations

Information Literacy across the Curriculum

UCAS Application Information Session

Many people enjoyed watching executions...

Information Flow across the Hospital

Sharing information across the Supply Chain

Dietetics Application Information Meeting

Information Flow Across the Internet

Student Field Application Information

Persisting Rich Domain Models Using NHibernate

Persisting

Information Systems (Android Application)

Persisting data in isolated storage

Vendor Application and Information

Persisting Rich Domain Models Using NHibernate

Caching Across Enterprise Application Tiers

Persisting Information Across Application Executions

Compactly Representing Parallel Program Executions

High Impact Creative Executions

Recording application executions enriched with domain semantics of computations and data

UCAS Application Information Session

Verifying Serializability Considering Only Sequential Executions