PixISegNet:Pixel Level Iris Segmentation Network using Convolutional Encoder-Decoder with Stacked Hourglass Bottleneck

Abstract

Biometric segmentation to obtain the region of interest under non-cooperative conditions is a fundamental and essential problem in biometric research. In the past few decades, researchers have widely studied this problem. For instance, non-ideal iris images cause poor segmentation in case of non-regular reflections, blurred boundaries, eyelids occlusion, and off-angle rotations. In this paper, we present a new iris ROI segmentation algorithm using a deep convolutional neural network to achieve the state-of-the-art segmentation performance on well-known iris image datasets. Our model surpasses the performance of state- of-the-art Iris-DenseNet framework by applying several strategies, including multi-scale/ multi-orientation training, model training from scratch, and proper hyper-parameterization of crucial parameters. The proposed PixISegNet consists of an autoencoder which primarily uses long and short skip connections and a stacked hourglass network between encoder and decoder. There is a continuous scale up-down in stacked hourglass networks, which helps in extracting features at multiple scales and robustly segments the iris even in an occluded environment. Furthermore, cross entropy loss and content loss optimizes the proposed model. The content loss considers the high-level features, thus operating at a different scale of abstraction, which compliments the cross-entropy loss, which considers pixel-to-pixel classification loss. Additionally, we have checked the robustness of the proposed network by rotating images to certain degrees with a change in the aspect ratio along with blurring and change in contrast. Experimental results on the various iris characteristics demonstrate the superiority of the proposed method over state-of-the-art iris segmentation methods considered in this paper. In order to demonstrate the network generalization, we deploy a very stringent T OT A (i.e train once test all) strategy. Our proposed method achieves E1 scores of 0.00672, 0.00916 and 0.00117 on UBIRIS-V2, IIT-D and CASIA V3.0 Interval datasets respectively. Moreover, such a deep convolutional neural network for segmentation when included in an end-to-end iris recognition system with a siamese based matching network will augment the performance of the siamese network. It facilitates the siamese matching network by selectively removing all of the unwanted (non-iris) pixels, to learn the salient iris features and provide a better recognition performance.

Publication
The Institution of Engineering and Technology - Biometrics