TY - JOUR
T1 - Comparing workflow application designs for high resolution satellite image analysis
AU - Al-Saadi, Aymen
AU - Paraskevakos, Ioannis
AU - Gonçalves, Bento Collares
AU - Lynch, Heather J.
AU - Jha, Shantenu
AU - Turilli, Matteo
N1 - Publisher Copyright:
© 2021
PY - 2021/11
Y1 - 2021/11
N2 - Very High Resolution satellite and aerial imagery are used to monitor and conduct large scale surveys of ecological systems. Convolutional Neural Networks have successfully been employed to analyze such imagery to detect large animals and salient features. As the datasets increase in volume and number of images, utilizing High Performance Computing resources becomes necessary. In this paper, we investigate three task-parallel, data-driven workflow designs to support imagery analysis pipelines with heterogeneous tasks on high performance computing platforms. We analyze the capabilities of each design when processing 3097 and 1575 images for two distinct use cases, for a total of 4,672 satellite and aerial images and 8.35 TB of data. We experimentally model the execution time of the tasks of the image processing pipelines. We perform experiments to characterize resource utilization, total time to completion and overheads of each design. Our analysis shows which design is best suited to scientific pipelines with similar characteristics.
AB - Very High Resolution satellite and aerial imagery are used to monitor and conduct large scale surveys of ecological systems. Convolutional Neural Networks have successfully been employed to analyze such imagery to detect large animals and salient features. As the datasets increase in volume and number of images, utilizing High Performance Computing resources becomes necessary. In this paper, we investigate three task-parallel, data-driven workflow designs to support imagery analysis pipelines with heterogeneous tasks on high performance computing platforms. We analyze the capabilities of each design when processing 3097 and 1575 images for two distinct use cases, for a total of 4,672 satellite and aerial images and 8.35 TB of data. We experimentally model the execution time of the tasks of the image processing pipelines. We perform experiments to characterize resource utilization, total time to completion and overheads of each design. Our analysis shows which design is best suited to scientific pipelines with similar characteristics.
KW - Computational modeling
KW - Image analysis
KW - Runtime
KW - Scientific workflows
KW - Task-parallel
UR - http://www.scopus.com/inward/record.url?scp=85108114271&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85108114271&partnerID=8YFLogxK
U2 - 10.1016/j.future.2021.04.023
DO - 10.1016/j.future.2021.04.023
M3 - Article
AN - SCOPUS:85108114271
SN - 0167-739X
VL - 124
SP - 315
EP - 329
JO - Future Generation Computer Systems
JF - Future Generation Computer Systems
ER -