Projects per year
Fingerprint
- 1 Similar Profiles
Collaborations and top research areas from the last five years
-
Collaborative Research: Frameworks: hpcGPT: Enhancing Computing Center User Support with HPC-enriched Generative AI
Yuan, B. (CoPI), Zhang, Z. (PI) & Liu, H. (CoPI)
8/1/24 → 7/31/27
Project: Research project
-
CAREER: Efficient and Scalable Large Foundational Model Training on Supercomputers for Science
Zhang, Z. (PI)
7/1/24 → 6/30/29
Project: Research project
-
Collaborative Research: CSR: Medium: Fortuna: Characterizing and Harnessing Performance Variability in Accelerator-rich Clusters
Zhang, Z. (PI)
10/1/23 → 9/30/26
Project: Research project
-
Collaborative Research: Frameworks: Diamond: Democratizing Large Neural Network Model Training for Science
Hauton, C. C. (CoI), Campbell, A. A. (CoI) & Zhang, Z. (PI)
10/1/20 → 9/30/26
Project: Research project
-
Collaborative Research: OAC Core: ScaDL: New Approaches to Scaling Deep Learning for Science Applications on Supercomputers
Cumming, J. J. (CoI), Kolitsida, M. M. (CoI) & Zhang, Z. (PI)
9/28/20 → 10/31/25
Project: Research project
-
Fine-grained Policy-driven I/O Sharing for Burst Buffers
Karrels, E., Huang, L., Kan, Y., Arora, I., Wang, Y., Katz, D. S., Gropp, W. & Zhang, Z., Nov 12 2023, Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2023. Association for Computing Machinery, Inc, 95. (Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2023).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Open Access1 Scopus citations -
Mirage: Towards Low-Interruption Services on Batch GPU Clusters with Reinforcement Learning
Ding, Q., Zheng, P., Kudari, S., Venkataraman, S. & Zhang, Z., 2023, SC 2023 - International Conference for High Performance Computing, Networking, Storage and Analysis. IEEE Computer Society, (International Conference for High Performance Computing, Networking, Storage and Analysis, SC).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
Open Access1 Scopus citations -
MKOR: Momentum-Enabled Kronecker-Factor-Based Optimizer Using Rank-1 Updates
Mozaffari, M., Li, S., Zhang, Z. & Dehnavi, M. M., 2023, In: Advances in Neural Information Processing Systems. 36Research output: Contribution to journal › Conference article › peer-review
-
Optimizing Data Movement for GPU-Based In-Situ Workflow Using GPUDirect RDMA
Zhang, B., Davis, P. E., Morales, N., Zhang, Z., Teranishi, K. & Parashar, M., 2023, Euro-Par 2023: Parallel Processing - 29th International Conference on Parallel and Distributed Computing, Proceedings. Cano, J., Dikaiakos, M. D., Papadopoulos, G. A., Pericàs, M. & Sakellariou, R. (eds.). Springer Science and Business Media Deutschland GmbH, p. 323-338 16 p. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); vol. 14100 LNCS).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
-
Deep Neural Network Training With Distributed K-FAC
Gregory Pauloski, J., Huang, L., Xu, W., Chard, K., Foster, I. T. & Zhang, Z., Dec 1 2022, In: IEEE Transactions on Parallel and Distributed Systems. 33, 12, p. 3616-3627 12 p.Research output: Contribution to journal › Article › peer-review
3 Scopus citations