Using active NVRAM for I/O staging

Sudarsun Kannan, Ada Gavrilovska, Karsten Schwan, Dejan Milojicic, Vanish Talwar

Research output: Chapter in Book/Report/Conference proceedingConference contribution

25 Scopus citations

Abstract

With HPC machines moving to the exascale, scaling the I/O performance of applications is a well known problem. Also, a closely related problem is, how to efficiently analyze and extract useful information from I/O data, viz. data post processing. With advent of nonvolatile memory technologies (NVMs) like SSD, PCM and Memristor, research has been focusing on how to improve the file systems performance and optimizations to overcome disk latencies. In the other end, there has been extensive focus on 'DataStaging' or 'in situ' I/O processing where I/O data are moved from computational cores to memory buffers of dedicated data processing or staging nodes using high performance I/O channels. The I/O data gets processed in these nodes before writing them to persistent storage like disks. However, issues with such approaches include (1) the limitation that they cannot easily analyze temporal data relationships or characteristics embedded in multiple simulation output steps, due to the limited aggregate memory capacity of staging nodes, and (2) the need to 'right size' such staging memory, sometimes even for single output/checkpoint steps when data volumes are large. Failing to properly allocate staging memory buffers (2) can cause applications to block and severely degrade the performance improvements sought by the extensive parallelization efforts undertaken by application developers. The limitation posed by (1) can degrade the utility of the Staging approach seen by end users. This paper explores an alternative solution for 'right memory sizing' issue for staging I/O. In this solution, memory scaling avoids the cost and power constraints imposed on machine designers by the use of DRAM (memory), by instead, using active NVRAM (nonvolatile memory) to enhance the memory capacities of compute and staging nodes. Active NVRAMs are node-local NVRAMs that are embedded with a low power system-on-chip compute element. We propose a mechanism, in which each physical node has an ad-ditional active NVRAM component to stage I/O and apply simple data analytics operations over the I/O data. While such node local data storage provides an obvious I/O acceleration, our experimental results show the effectiveness of our approach in addressing 'right memory sizing issue' by efficient I/O data processing. We also discuss the overheads in using Active NVRAM based approach for I/O staging.

Original languageEnglish (US)
Title of host publicationPDAC'11 - Proceedings of the 2011 International Workshop on Petascal Data Analytics
Subtitle of host publicationChallenges and Opportunities, Co-located with SC'11
Pages15-21
Number of pages7
DOIs
StatePublished - 2011
Externally publishedYes
Event2011 2nd International Workshop on Petascal Data Analytics: Challenges and Opportunities, Held in Cooperation with the IEEE/ACM International Conference for High Performance Computing, Networking, Storage, and Analysis, SC'11 - Seattle, WA, United States
Duration: Nov 14 2011Nov 14 2011

Publication series

NamePDAC'11 - Proceedings of the 2011 International Workshop on Petascal Data Analytics: Challenges and Opportunities, Co-located with SC'11

Other

Other2011 2nd International Workshop on Petascal Data Analytics: Challenges and Opportunities, Held in Cooperation with the IEEE/ACM International Conference for High Performance Computing, Networking, Storage, and Analysis, SC'11
Country/TerritoryUnited States
CitySeattle, WA
Period11/14/1111/14/11

All Science Journal Classification (ASJC) codes

  • Computer Science Applications
  • Information Systems

Keywords

  • DataStaging
  • Non-volatile memory
  • Post processing

Fingerprint

Dive into the research topics of 'Using active NVRAM for I/O staging'. Together they form a unique fingerprint.

Cite this