From A to E: Analyzing TPC’s OLTP Benchmarks

From A to E: Analyzing TPC’s OLTP Benchmarks. The obsolete, the ubiquitous, the unknown. Pınar Tözün, Ippokratis Pandis*, Cansu Kaynak, Djordje Jevdjic, Anastasia Ailamaki. École Polytechnique Fédérale de Lausanne; *IBM Almaden Research Center.


Presentation Transcript


  1. From A to E: Analyzing TPC’s OLTP Benchmarks. The obsolete, the ubiquitous, the unknown. Pınar Tözün, Ippokratis Pandis*, Cansu Kaynak, Djordje Jevdjic, Anastasia Ailamaki. École Polytechnique Fédérale de Lausanne; *IBM Almaden Research Center.

  2. OLTP Benchmarks of TPC. [Timeline, 1985 to 2015: TPC-A (1989) and TPC-B (1990), banking; TPC-C (1992), wholesale supplier; TPC-E (2007), brokerage house.] • TPC-A, TPC-B: obsolete • TPC-C: ubiquitous, the most common; allows fair product comparisons and drives innovations for better performance • TPC-E: unknown, with results from only one DBMS vendor

  3. How is TPC-E different? • Hardware (micro-architectural behavior): under-utilization due to instruction stalls; fewer cache misses and higher IPC • Storage manager (where does time go?): harder to partition requests; logical lock contention • Workload (characteristics/statistics): more page re-use; complex schema & transactions; longer-held locks

  4. Outline • Preview • Setup & Methodology • Micro-architectural behavior • Within the storage manager • Conclusions

  5. Experimental Setup

  6. Methodology • Shore-MT*: scalable open-source storage manager • Shore-Kits*: application layer for Shore-MT; workloads: TPC-B, TPC-C, TPC-E, ++ • Micro-architectural profiling: VTune on Xeon X5660, cputrack on Niagara T2; measured at peak throughput • Storage manager profiling: dtrace on Niagara T2 • *https://sites.google.com/site/shoremt

  7. Outline • Preview • Setup & Methodology • Micro-architectural behavior • Within the storage manager • Conclusions

  8. IPC on Fat & Lean Cores. [Figure: measured IPC against the maximum IPC, on Intel Xeon X5660 and on Sun Niagara T2.] • OLTP utilizes lean cores better • TPC-E has higher IPC

  9. Execution Cycles and Stalls (Intel Xeon X5660) • More than half of execution time goes to stalls • Instruction stalls are the main problem

  10. Cache Misses. Intel Xeon X5660: 32KB L1-I & 32KB L1-D. Sun Niagara T2: 16KB L1-I & 8KB L1-D. • L1-I misses dominate • TPC-E has a lower data miss ratio (MPKI)

  11. Why does TPC-E have a lower miss ratio? [Table: averages per transaction.] More scans in TPC-E → increased page reuse

  12. Outline • Preview • Setup & Methodology • Micro-architectural behavior • Within the storage manager • Conclusions

  13. From A to E: Schema. [Figure: schema diagrams for TPC-B, TPC-C, and TPC-E, with tables marked as fixed, scaling, or growing; labeled tables: branch, warehouse, customer.] • Increasing schema complexity

  14. From A to E: Transactions • More complexity & variety in transaction mix • Harder to partition

  15. Within the Storage Manager (Sun Niagara T2, 64 HW contexts). Configurations: SF 64 – 0.6GB, Spread; SF 64 – 8.2GB, Spread; SF 1 – 20GB, No-Spread

  16. Within the Storage Manager (Sun Niagara T2, 64 HW contexts). Configurations: SF 64 – 0.6GB, Spread; SF 64 – 8.2GB, Spread; SF 1 – 20GB, No-Spread • Lock manager is the main bottleneck for TPC-E

  17. Inside the Lock Manager. Configurations: SF 64 – 0.6GB, Spread; SF 64 – 8.2GB, Spread; SF 1 – 20GB, No-Spread • Logical contention even for a large DB

  18. Conclusions • Modern hardware is still highly under-utilized: TPC-E has fewer misses, less stall time, and higher IPC; OLTP utilizes less aggressive cores better • Instruction footprint is too large to fit in L1-I: spread instructions, (software-guided) prefetching; code/compiler optimizations • Logical lock contention due to hotspots: increased complexity in schema and transactions; TPC-E is harder to physically partition; logical partitioning, OCC

  19. TPC-B: the obsolete. TPC-C: the ubiquitous. TPC-E: the unexplored. Also starring: Shore-MT, Xeon X5660, Niagara T2.
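
The metrics quoted on slides 8 to 10 (IPC, share of cycles spent in stalls, MPKI) are plain ratios of hardware counter readings. The sketch below shows how they could be computed; the function names and the counter values are hypothetical, chosen only for illustration, and are not measurements from the talk.

```python
# Minimal sketch: derive IPC, stall share, and MPKI from raw counter values.
# All names and numbers here are illustrative assumptions, not data from the slides.

def ipc(instructions: int, cycles: int) -> float:
    """Instructions retired per cycle."""
    return instructions / cycles

def stall_fraction(stall_cycles: int, cycles: int) -> float:
    """Share of execution cycles spent stalled."""
    return stall_cycles / cycles

def mpki(misses: int, instructions: int) -> float:
    """Cache misses per 1000 (kilo) instructions."""
    return 1000.0 * misses / instructions

if __name__ == "__main__":
    # Hypothetical counter readings for one run.
    instructions = 2_000_000
    cycles = 4_000_000
    stall_cycles = 2_400_000
    l1i_misses = 60_000
    l1d_misses = 20_000

    print("IPC:", ipc(instructions, cycles))
    print("Stall fraction:", stall_fraction(stall_cycles, cycles))
    print("L1-I MPKI:", mpki(l1i_misses, instructions))
    print("L1-D MPKI:", mpki(l1d_misses, instructions))
```

Normalizing misses by instruction count (MPKI) rather than by cache accesses makes the L1-I and L1-D figures directly comparable across workloads that execute different numbers of instructions per transaction.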
