SCTuner: An Autotuner Addressing Dynamic I/O Needs on Supercomputer I/O Subsystems

Houjun Tang, Bing Xie, Suren Byna, Philip Carns, Quincey Koziol, Sudarsun Kannan, Jay Lofstead, Sarp Oral

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Scopus citations

Abstract

In high-performance computing (HPC), scientific applications often manage a massive amount of data using I/O libraries. These libraries provide convenient data model abstractions, help ensure data portability, and, most important, empower end users to improve I/O performance by tuning configurations across multiple layers of the HPC I/O stack. We propose SCTuner, an autotuner integrated within the I/O library itself to dynamically tune both the I/O library and the underlying I/O stack at application runtime. To this end, we introduce a statistical benchmarking method to profile the behaviors of individual supercomputer I/O subsystems with varied configurations across I/O layers. We use the benchmarking results as the built-in knowledge in SCTuner, implement an I/O pattern extractor, and plan to implement an online performance tuner as the SCTuner runtime. We conducted a benchmarking analysis on the Summit supercomputer and its GPFS file system Alpine. The preliminary results show that our method can effectively extract the consistent I/O behaviors of the target system under production load, building the base for I/O autotuning at application runtime.

Original languageEnglish (US)
Title of host publicationProceedings of PDSW 2021
Subtitle of host publicationIEEE/ACM 6th International Parallel Data Systems Workshop, Held in conjunction with SC 2021: The International Conference for High Performance Computing, Networking, Storage and Analysis
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages29-34
Number of pages6
ISBN (Electronic)9781665418379
DOIs
StatePublished - 2021
Externally publishedYes
Event6th IEEE/ACM International Parallel Data Systems Workshop, PDSW 2021 - St. Louis, United States
Duration: Nov 15 2021 → …

Publication series

NameProceedings of PDSW 2021: IEEE/ACM 6th International Parallel Data Systems Workshop, Held in conjunction with SC 2021: The International Conference for High Performance Computing, Networking, Storage and Analysis

Conference

Conference6th IEEE/ACM International Parallel Data Systems Workshop, PDSW 2021
Country/TerritoryUnited States
CitySt. Louis
Period11/15/21 → …

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Hardware and Architecture
  • Information Systems
  • Information Systems and Management
  • Safety, Risk, Reliability and Quality

Fingerprint

Dive into the research topics of 'SCTuner: An Autotuner Addressing Dynamic I/O Needs on Supercomputer I/O Subsystems'. Together they form a unique fingerprint.

Cite this