LEAP: lightweight deterministic multi-processor replay of concurrent java programs Conference Paper uri icon

abstract

  • The technique of deterministic record and replay aims at faithfully reenacting an earlier program execution. For concurrent programs, it is one of the most important techniques for program understanding and debugging. The state of the art deterministic replay techniques face challenging efficiency problems in supporting multi-processor executions due to the unoptimized treatment of shared memory accesses. We propose LEAP: a deterministic record and replay technique that uses a new type of local order w.r.t. the shared memory locations and concurrent threads. Compared to the related work, our technique records much less information without losing the replay determinism. The correctness of our technique is underpinned by formal models and a replay theorem that we have developed in this work. Through our evaluation using both benchmarks and real world applications, we show that LEAP is more than 10x faster than conventional global-order based approaches and, in most cases, 2x to 10x faster than other local-order based approaches. Our recording overhead on the two large open source multi-threaded applications Tomcat and Derby is less than 10%. Moreover, as the evidence of the deterministic replay, LEAP is able to deterministically reproduce 7 out of 8 real bugs in Tomcat and Derby, 13 out of 16 benchmark bugs in IBM ConTest benchmark suite, and 100% of the randomly injected concurrency bugs. 2010 ACM.

name of conference

  • Proceedings of the eighteenth ACM SIGSOFT international symposium on Foundations of software engineering

published proceedings

  • Proceedings of the eighteenth ACM SIGSOFT international symposium on Foundations of software engineering

author list (cited authors)

  • Huang, J., Liu, P., & Zhang, C.

citation count

  • 68

complete list of authors

  • Huang, Jeff||Liu, Peng||Zhang, Charles

publication date

  • January 2010