Collaborative Research: Frameworks: hpcGPT: Enhancing Computing Center User Support with HPC-enriched Generative AI

Project Details

Description

hpcGPT is a question answering service for academic computing centers such as the National Center for Supercomputing Applications, Ohio Supercomputer Center, San Diego Supercomputer Center, and Texas Advanced Computing Center. These Centers provide high-performance computing (HPC) platforms to tens of thousands of users for science and engineering research. In collaboration with Princeton University and Rutgers University, hpcGPT uses generative artificial intelligence (AI) and integrates heterogeneous data sources with different update frequencies to enhance the user support service quality and efficiency, decrease the response time, and improve precision of the support. With hpcGPT, user support teams can leverage the historical knowledge, real-time system status, and external technical expertise to better support the HPC users. With the high-quality and timely answers from hpcGPT, HPC users can resolve many technical issues, thus reducing the workload of the user support teams. This will allow the support teams to focus more on new and novel support issues. hpcGPT will significantly enhance the user support service quality, capacity, and efficiency without increasing the human effort.hpcGPT combines the fine-tuning and Retrieval Augmented Generation (RAG) techniques to incorporate recent knowledge, past experience, domain expertise, documentations, and real-time system status of versatile computing. By building upon existing and recognized capabilities in large language model fine-tuning and hosting, retrieval augmentation generation, and external data source integration, hpcGPT reduces the complexity and effort required to align information and identify dependencies between questions, answers, and the supporting information. This is particularly beneficial for research groups and computing centers with diverse application requirements and limited staff. hpcGPT extends and translates a suite of Cyberinfrastructure building blocks and technologies such as large language model training and inference service hosting.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.
StatusActive
Effective start/end date8/1/247/31/27

Funding

  • National Science Foundation: $1,199,126.00

Fingerprint

Explore the research topics touched on by this project. These labels are generated based on the underlying awards/grants. Together they form a unique fingerprint.