xCalls: Safe I/O in Memory Transactions

xCalls: Safe I/O in Memory Transactions Haris Volos, Andres Jaan Tack, NeelamGoyal+, Michael Swift, Adam Welc§ University of Wisconsin - Madison § +

Transactional Memory (TM) • CMP leads to more concurrency within programs • Synchronizing access to shared data via locks is hard • atomicconstruct simplifies synchronizing access to shared data Thread 1 Thread 2 Isolation: OBSERVE both red transaction’s updates to A and B or none atomic { B = B – 10; A = A + 10; } atomic { A = A – 20; B = B + 20; } store(A) store(B) store(A) atomic { x = x - c; y = y + c; } LOCK(L) x = x - c; y = y + c; UNLOCK(L) Atomicity: PERFORM both updates to A and B or none Conflict Abort blue ABORT conflicting transactions

A challenging world… • Real world programs frequently take actions outside of their own memory • Firefox: ~1% critical sections do system call [Baugh TRANSACT ’07] • Most TM systems apply only to user-level memory Thread 1 memory file Thread 2 Memory updates dropped Interleaved writes 1a atomic { item = procItem(queue); write (file, principal); write (file, item->header); } atomic { item = procItem(queue); write (file, principal); write (file, item->header); } 2b 2b 1a 2a 1a File writes not dropped 1b 2b Abort

State of the art • Defer • Undo • Global Lock Ignore failures NAS Stop the world atomic { item = procItem(queue); write (file, principal); write (file, item->header); send (socket, item->body); } LAN Internet Defer COMMIT Perform send

Contribution • xCall programming interface • Exposes transactional semantics to programmer • Enables I/O within transactions w/o stopping the world • Exposes all failures to the program Transactional Program xCalls Legacy calls xCall Library Runs in user mode System call interface Transaction-unaware kernel

Outline • Motivation • xCall Design & Implementation • Evaluation • Conclusion

Design overview • Principles • As early as possible but not earlier • Expose all failures • Components • Atomic execution • Isolation • Error handling

Atomic execution • Provide abort semantics for kernel data and I/O • Expose to programmer when action is performed file buffers atomic { item = procItem(queue); x_write (file, principal); x_write (file, item->header); } Abort

Isolation • Prevent conflicting changes to kernel data made within a transaction • Sentinels • Revocable user-level locks • Lock logical kernel state visible through system calls Thread 1 memory file Thread 2 atomic { item = procItem(queue); x_write (file, principal); x_write (file, item->header); } atomic { item = procItem(queue); x_write (file, principal); x_write (file, item->header); } Conflict

Error handling • Some errors might not happen until transaction commits or aborts • Inform programmer when failures happen • Handle errors after transaction completes Deferred send: FAILED err3 = error atomic { item = procItem(queue); x_write (file, principal, &err1); x_write (file, item->header, &err2); x_send (socket, item->body, &err3); } if (err1 || err2 || err3) { /* CLEANUP */ } LAN Internet Defer Handle error here COMMIT Perform send

Example: file write Error handling ssize_tx_write (intfd, void *buf, ssize_tnbytes ) { void *localbuf; ssize_t bytes; get_sentinel(fd); localbuf = alloc_local_buf(nbytes); read(fd, localbuf, nbytes); bytes = write(fd, buf, nbytes); if (bytes != -1) compensate(x_undo_write, fd, localbuf, bytes, result); } intx_undo_write (intfd, void *buf, ssize_tnbytes, int *result){ off_t ret1, ret2; lseek(fd, -nbytes, SEEK_CUR); if (ret1) pwrite(fd, buf, nbytes, ret1); if (ret1 == -1 || ret2 == -1) *result = errno; return (ret1 == -1 || ret2 == -1); } , int *result Isolation Atomic execution Atomic execution Atomic execution ret1 = ret2 = Error handling

Summary • xCall API exposes transactional semantics • Atomicity • Isolation • Error handling • Prototype implementation • Executes as user-mode library • Relies on Intel STM for transactional memory • Provides 27 xCalls including file handling, communication, threading

Outline • Motivation • xCall Design & Implementation • Evaluation • Benefit of xCalls over global lock • Benefit of TM over locks • Conclusion

Evaluation platform • Transactified three large multithreaded apps • Berkeley DB • BIND • XMMS • Configurations • Native : locks + system calls • STM : transactions + system calls + global lock • xCalls : transactions + xCalls • Run on 4 quad-core 2 GHz AMD Barcelona

Performance: Berkeley DB (1/2) • xCalls scales better than STM with global lock • TM worse than locks due to STM overhead • Workload: TPC-C

Performance: Berkeley DB (2/2) • xCalls improve concurrency over coarse grain lock • Global lock kills optimistic concurrency • Workload: Lockscale

Performance: BIND • Transactions scale better than coarse grain locks • xCalls enable additional concurrency • Workload: QueryPerf

Performance summary • xCalls benefit programs with I/O concurrency • TM benefits programs with contended but not conflicting critical sections

Conclusion • xCall programming interface • Brings common OS services to TM programs • Requires no kernel modifications • Improves scalability over state of the art Questions?

xCalls: Safe I/O in Memory Transactions

xCalls: Safe I/O in Memory Transactions

Presentation Transcript

The Fail-Safe C to Java translator

Memory

MEMORY

Safe and Secure www.gemc.org

Memory

Agenda

Board Memory

Manchester University Transactions for Scala

Memory Coherence in Shared Virtual Memory System

O ur memory

Hazard Pointers: Safe Memory Reclamation of Lock-Free Objects

Memory

Country

Transactional Memory

ITCS 4/5010 CUDA Programming, UNC-Charlotte, B. Wilkinson, Jan 22, 2013 GPUMemories

Hazard Pointers: Safe Memory Reclamation for Lock-Free Objects

Effectiveness of an Automatic Insertion of Safe Memory Reuses into ML-like Programs

Phase Change Memory (PCM) ‏

MODELS OF MEMORY

Memory

Transactions