TY - GEN
T1 - Towards the design of an end-to-end automated system for image and video-based recognition
AU - Chellappa, Rama
AU - Chen, Jun Cheng
AU - Ranjan, Rajeev
AU - Sankaranarayanan, Swami
AU - Kumar, Amit
AU - Patel, Vishal M.
AU - Castillo, Carlos D.
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2017/3/27
Y1 - 2017/3/27
N2 - Over many decades, researchers working in object recognition have longed for an end-to-end automated system that will simply accept 2D or 3D image or videos as inputs and output the labels of objects in the input data. Computer vision methods that use representations derived based on geometric, radiometric and neural considerations and statistical and structural matchers and artificial neural network-based methods where a multi-layer network learns the mapping from inputs to class labels have provided competing approaches for image recognition problems. Over the last four years, methods based on Deep Convolutional Neural Networks (DCNNs) have shown impressive performance improvements on object detection/recognition challenge problems. This has been made possible due to the availability of large annotated data, a better understanding of the non-linear mapping between image and class labels as well as the affordability of GPUs. In this paper, we present a brief history of developments in computer vision and artificial neural networks over the last forty years for the problem of image-based recognition. We then present the design details of a deep learning system for end-to-end unconstrained face verification/recognition. Some open issues regarding DCNNs for object recognition problems are then discussed.
AB - Over many decades, researchers working in object recognition have longed for an end-to-end automated system that will simply accept 2D or 3D image or videos as inputs and output the labels of objects in the input data. Computer vision methods that use representations derived based on geometric, radiometric and neural considerations and statistical and structural matchers and artificial neural network-based methods where a multi-layer network learns the mapping from inputs to class labels have provided competing approaches for image recognition problems. Over the last four years, methods based on Deep Convolutional Neural Networks (DCNNs) have shown impressive performance improvements on object detection/recognition challenge problems. This has been made possible due to the availability of large annotated data, a better understanding of the non-linear mapping between image and class labels as well as the affordability of GPUs. In this paper, we present a brief history of developments in computer vision and artificial neural networks over the last forty years for the problem of image-based recognition. We then present the design details of a deep learning system for end-to-end unconstrained face verification/recognition. Some open issues regarding DCNNs for object recognition problems are then discussed.
UR - http://www.scopus.com/inward/record.url?scp=85018243621&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85018243621&partnerID=8YFLogxK
U2 - 10.1109/ITA.2016.7888183
DO - 10.1109/ITA.2016.7888183
M3 - Conference contribution
AN - SCOPUS:85018243621
T3 - 2016 Information Theory and Applications Workshop, ITA 2016
BT - 2016 Information Theory and Applications Workshop, ITA 2016
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2016 Information Theory and Applications Workshop, ITA 2016
Y2 - 31 January 2016 through 5 February 2016
ER -