The Need for Precise and Efficient Memory Capacity Budgeting

Shaleen Garg, Sudarsun Kannan, Manish Parashar

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Scopus citations

Abstract

Modern high performance computing (HPC) systems pack hundreds of CPU cores to enable extreme parallelism. However, with increasing core counts, the effective per-core memory capacity is reducing. Reducing performance bottlenecks require precise monitoring and budgeting of application memory capacity requirements for attaining high performance, maximum resource efficiency, and low performance variability. Unfortunately, current operating systems (OS) and their toolsets are inaccurate, lack the capability to precisely measure the memory requirements of applications, forcing system administrators to either underestimate or over-provision memory, consequently compromising performance or resource efficiency, respectively. In this paper, we decipher the memory budgeting limitations in current OSes and their impact on both homogeneous and heterogeneous memory systems (e.g., nonvolatile memory). The limitations mainly stem from the mismatch between application-level and global memory accounting in the OS memory manager, fixing which can be prohibitively expensive at runtime. Our analysis of popular HPC workloads using widely-used memory budgeting strategies and deep instrumentation of the memory management layer reveals that imprecise budgeting can reduce performance by more than 1.65x and 2.05x in homogeneous and heterogeneous memory systems respectively. The program's memory requirement increases by up to 25x without significant performance gains. We also briefly describe our ongoing research approach to redesign the budgeting mechanisms in the OS.

Original languageEnglish (US)
Title of host publicationMEMSYS 2020 - Proceedings of the International Symposium on Memory Systems
PublisherAssociation for Computing Machinery
Pages169-177
Number of pages9
ISBN (Electronic)9781450388993
DOIs
StatePublished - Sep 28 2020
Event2020 International Symposium on Memory Systems, MEMSYS 2020 - Washington, United States
Duration: Sep 28 2020Oct 1 2020

Publication series

NameACM International Conference Proceeding Series

Conference

Conference2020 International Symposium on Memory Systems, MEMSYS 2020
Country/TerritoryUnited States
CityWashington
Period9/28/2010/1/20

All Science Journal Classification (ASJC) codes

  • Software
  • Human-Computer Interaction
  • Computer Vision and Pattern Recognition
  • Computer Networks and Communications

Keywords

  • HPC
  • Linux
  • Memory
  • Performance
  • Resource Efficiency

Fingerprint

Dive into the research topics of 'The Need for Precise and Efficient Memory Capacity Budgeting'. Together they form a unique fingerprint.

Cite this