TY - GEN
T1 - Using active NVRAM for I/O staging
AU - Kannan, Sudarsun
AU - Gavrilovska, Ada
AU - Schwan, Karsten
AU - Milojicic, Dejan
AU - Talwar, Vanish
PY - 2011
Y1 - 2011
N2 - With HPC machines moving to the exascale, scaling the I/O performance of applications is a well known problem. Also, a closely related problem is, how to efficiently analyze and extract useful information from I/O data, viz. data post processing. With advent of nonvolatile memory technologies (NVMs) like SSD, PCM and Memristor, research has been focusing on how to improve the file systems performance and optimizations to overcome disk latencies. In the other end, there has been extensive focus on 'DataStaging' or 'in situ' I/O processing where I/O data are moved from computational cores to memory buffers of dedicated data processing or staging nodes using high performance I/O channels. The I/O data gets processed in these nodes before writing them to persistent storage like disks. However, issues with such approaches include (1) the limitation that they cannot easily analyze temporal data relationships or characteristics embedded in multiple simulation output steps, due to the limited aggregate memory capacity of staging nodes, and (2) the need to 'right size' such staging memory, sometimes even for single output/checkpoint steps when data volumes are large. Failing to properly allocate staging memory buffers (2) can cause applications to block and severely degrade the performance improvements sought by the extensive parallelization efforts undertaken by application developers. The limitation posed by (1) can degrade the utility of the Staging approach seen by end users. This paper explores an alternative solution for 'right memory sizing' issue for staging I/O. In this solution, memory scaling avoids the cost and power constraints imposed on machine designers by the use of DRAM (memory), by instead, using active NVRAM (nonvolatile memory) to enhance the memory capacities of compute and staging nodes. Active NVRAMs are node-local NVRAMs that are embedded with a low power system-on-chip compute element. We propose a mechanism, in which each physical node has an ad-ditional active NVRAM component to stage I/O and apply simple data analytics operations over the I/O data. While such node local data storage provides an obvious I/O acceleration, our experimental results show the effectiveness of our approach in addressing 'right memory sizing issue' by efficient I/O data processing. We also discuss the overheads in using Active NVRAM based approach for I/O staging.
AB - With HPC machines moving to the exascale, scaling the I/O performance of applications is a well known problem. Also, a closely related problem is, how to efficiently analyze and extract useful information from I/O data, viz. data post processing. With advent of nonvolatile memory technologies (NVMs) like SSD, PCM and Memristor, research has been focusing on how to improve the file systems performance and optimizations to overcome disk latencies. In the other end, there has been extensive focus on 'DataStaging' or 'in situ' I/O processing where I/O data are moved from computational cores to memory buffers of dedicated data processing or staging nodes using high performance I/O channels. The I/O data gets processed in these nodes before writing them to persistent storage like disks. However, issues with such approaches include (1) the limitation that they cannot easily analyze temporal data relationships or characteristics embedded in multiple simulation output steps, due to the limited aggregate memory capacity of staging nodes, and (2) the need to 'right size' such staging memory, sometimes even for single output/checkpoint steps when data volumes are large. Failing to properly allocate staging memory buffers (2) can cause applications to block and severely degrade the performance improvements sought by the extensive parallelization efforts undertaken by application developers. The limitation posed by (1) can degrade the utility of the Staging approach seen by end users. This paper explores an alternative solution for 'right memory sizing' issue for staging I/O. In this solution, memory scaling avoids the cost and power constraints imposed on machine designers by the use of DRAM (memory), by instead, using active NVRAM (nonvolatile memory) to enhance the memory capacities of compute and staging nodes. Active NVRAMs are node-local NVRAMs that are embedded with a low power system-on-chip compute element. We propose a mechanism, in which each physical node has an ad-ditional active NVRAM component to stage I/O and apply simple data analytics operations over the I/O data. While such node local data storage provides an obvious I/O acceleration, our experimental results show the effectiveness of our approach in addressing 'right memory sizing issue' by efficient I/O data processing. We also discuss the overheads in using Active NVRAM based approach for I/O staging.
KW - DataStaging
KW - Non-volatile memory
KW - Post processing
UR - http://www.scopus.com/inward/record.url?scp=84857948109&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84857948109&partnerID=8YFLogxK
U2 - 10.1145/2110205.2110209
DO - 10.1145/2110205.2110209
M3 - Conference contribution
AN - SCOPUS:84857948109
SN - 9781450311304
T3 - PDAC'11 - Proceedings of the 2011 International Workshop on Petascal Data Analytics: Challenges and Opportunities, Co-located with SC'11
SP - 15
EP - 21
BT - PDAC'11 - Proceedings of the 2011 International Workshop on Petascal Data Analytics
T2 - 2011 2nd International Workshop on Petascal Data Analytics: Challenges and Opportunities, Held in Cooperation with the IEEE/ACM International Conference for High Performance Computing, Networking, Storage, and Analysis, SC'11
Y2 - 14 November 2011 through 14 November 2011
ER -