TY - GEN
T1 - The Temple University Hospital Digital Pathology Corpus
AU - Houser, D.
AU - Shadhin, G.
AU - Anstotz, R.
AU - Campbell, C.
AU - Obeid, I.
AU - Picone, J.
AU - Farkas, T.
AU - Persidsky, Y.
AU - Jhala, N.
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/7/2
Y1 - 2018/7/2
N2 - Digital pathology is a relatively new field that stands to gain from modern big data and machine learning techniques. In the United States alone, millions of pathology slides are created and interpreted by a human expert each year, suggesting that there is ample data available to support machine learning research. However, the relevant corpora that currently exist contain only hundreds of images, not enough to develop sophisticated deep learning models. This lack of publicly accessible data also hinders the advancement of clinical science. Our digital pathology corpus is an effort to place a large amount of clinical pathology images collected at Temple University Hospital into the public domain to support the development of automatic interpretation technology. The goal of this ambitious project is to create a corpus of 1M images. We have already released 10,000 images from 600 clinical cases. In this paper, we describe the corpus under development and discuss some of the underlying technology that was developed to support this project.
AB - Digital pathology is a relatively new field that stands to gain from modern big data and machine learning techniques. In the United States alone, millions of pathology slides are created and interpreted by a human expert each year, suggesting that there is ample data available to support machine learning research. However, the relevant corpora that currently exist contain only hundreds of images, not enough to develop sophisticated deep learning models. This lack of publicly accessible data also hinders the advancement of clinical science. Our digital pathology corpus is an effort to place a large amount of clinical pathology images collected at Temple University Hospital into the public domain to support the development of automatic interpretation technology. The goal of this ambitious project is to create a corpus of 1M images. We have already released 10,000 images from 600 clinical cases. In this paper, we describe the corpus under development and discuss some of the underlying technology that was developed to support this project.
UR - http://www.scopus.com/inward/record.url?scp=85062096498&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85062096498&partnerID=8YFLogxK
U2 - 10.1109/SPMB.2018.8615619
DO - 10.1109/SPMB.2018.8615619
M3 - Conference contribution
AN - SCOPUS:85062096498
T3 - 2018 IEEE Signal Processing in Medicine and Biology Symposium, SPMB 2018 - Proceedings
BT - 2018 IEEE Signal Processing in Medicine and Biology Symposium, SPMB 2018 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2018 IEEE Signal Processing in Medicine and Biology Symposium, SPMB 2018
Y2 - 1 December 2018
ER -