TY - GEN
T1 - Developing a convolutional neural network to classify phytoplankton images collected with an Imaging FlowCytobot along the West Antarctic Peninsula
AU - Nardelli, Schuyler C.
AU - Gray, Patrick C.
AU - Schofield, Oscar
N1 - Funding Information:
ACKNOWLEDGMENTS This work was supported by the National Science Foundation Antarctic Organisms and Ecosystems Program (PLR-1440435) as part of the PAL-LTER program. Thank you to Alison Chase and Sasha Kramer for help with taxonomic identifications, and to Emmett Culhane for helpful discussions regarding the challenges of building a CNN for IFCB data. This work would not be possible without the PAL-LTER field teams who aided in data collection and the Palmer Station and Laurence M. Gould personnel who provided logistical support.
Publisher Copyright:
© 2021 MTS.
PY - 2021
Y1 - 2021
N2 - High-resolution optical imaging systems are quickly becoming universal tools to characterize and quantify microbial diversity in marine ecosystems. Automated detection systems such as convolutional neural networks (CNN) are often developed to identify the immense number of images collected. The goal of our study was to develop a CNN to classify phytoplankton images collected with an Imaging FlowCytobot for the Palmer Antarctica Long-Term Ecological Research project. A medium complexity CNN was developed using a subset of manually-identified images, resulting in an overall accuracy, recall, and f1-score of 93.8%, 93.7%, and 93.7%, respectively. The f1-score dropped to 46.5% when tested on a new random subset of 10, 269 images, likely due to highly imbalanced class distributions, high intraclass variance, and interclass morphological similarities of cells in naturally occurring phytoplankton assemblages. Our model was then used to predict taxonomic classifications of phytoplankton at Palmer Station, Antarctica over 2017-2018 and 2018-2019 summer field seasons. The CNN was generally able to capture important seasonal dynamics such as the shift from large centric diatoms to small pennate diatoms in both seasons, which is thought to be driven by increases in glacial meltwater from January to March. Moving forward, we hope to further increase the accuracy of our model to better characterize coastal phytoplankton communities threatened by rapidly changing environmental conditions.
AB - High-resolution optical imaging systems are quickly becoming universal tools to characterize and quantify microbial diversity in marine ecosystems. Automated detection systems such as convolutional neural networks (CNN) are often developed to identify the immense number of images collected. The goal of our study was to develop a CNN to classify phytoplankton images collected with an Imaging FlowCytobot for the Palmer Antarctica Long-Term Ecological Research project. A medium complexity CNN was developed using a subset of manually-identified images, resulting in an overall accuracy, recall, and f1-score of 93.8%, 93.7%, and 93.7%, respectively. The f1-score dropped to 46.5% when tested on a new random subset of 10, 269 images, likely due to highly imbalanced class distributions, high intraclass variance, and interclass morphological similarities of cells in naturally occurring phytoplankton assemblages. Our model was then used to predict taxonomic classifications of phytoplankton at Palmer Station, Antarctica over 2017-2018 and 2018-2019 summer field seasons. The CNN was generally able to capture important seasonal dynamics such as the shift from large centric diatoms to small pennate diatoms in both seasons, which is thought to be driven by increases in glacial meltwater from January to March. Moving forward, we hope to further increase the accuracy of our model to better characterize coastal phytoplankton communities threatened by rapidly changing environmental conditions.
KW - Machine learning
KW - Neural network
KW - Phytoplankton
KW - Polar science
UR - http://www.scopus.com/inward/record.url?scp=85125957988&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85125957988&partnerID=8YFLogxK
U2 - 10.23919/OCEANS44145.2021.9706072
DO - 10.23919/OCEANS44145.2021.9706072
M3 - Conference contribution
AN - SCOPUS:85125957988
T3 - Oceans Conference Record (IEEE)
BT - OCEANS 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - OCEANS 2021: San Diego - Porto
Y2 - 20 September 2021 through 23 September 2021
ER -