SILVA, A. G. A.; http://lattes.cnpq.br/9570138520912194; SILVA, Arnaldo Gualberto de Andrade e.
Abstract:
The face is considered the primary biometric trait for machine-readable travel documents such as passports. In this context, the ISO/IEC 19794-5 standard denes a set of photographic requirements to ensure the image quality and simplify the face recognition process. However, because the number of requirements dened by such a standard is high (almost 30), the compliance verication of a single face image is still a challenge. Usually, problems with multiple tasks, such as the ISO/IEC 19794-5 requirements, are broken into independent subproblems that are solved separately and then recombined. Nevertheless, it ignores the common information between related tasks and increases the risk of overtting. Multitasking Learning (MTL) has proven to be an important technique for solving multiple related tasks simultaneously. It explores the common and distinct aspects of tasks from the same domain to improve the generalization among all tasks. In addition, MTL focuses on learning a useful representation that can yield benets, particularly in scenarios where a labeled dataset for a task is limited. Finally, in the case of Deep Neural Networks, MTL can help reduce the number of parameters and inference speed. This research proposes the rst deep Multitasking Learning method designed for automatic evaluation of both photographic and geometric requirements of the ISO/IEC 19794-5 standard, called ICAONet. Undercomplete Autoencoders are extended to employ a multi-and-collaborative learning approach, in which both supervised and unsupervised learning are performed concurrently and in a collaborative manner. The method is trained using an ad hoc image dataset and evaluated using an ocial benchmark system that is also used by other approaches presented in the literature. The experiments show the method proposed achieves the best results in terms of Equal Error Rate for 9 out of the 23 photographic requirements of ISO/IEC 19794-5, which was not achieved by any other individual method according to the consulted bibliography. Therefore, the proposed method can be considered the best overall solution among evaluated academic works published in the literature and private SDKs. Overall, the median Equal Error Rate (3.3%) is also competitive. In terms of running time, the proposed method stands out among the fastest methods to evaluate all 23 requirements according to the ocial benchmark. On the other hand, there are space for improvements on results of eye’s landmark location and some specic requirements that may require additional investigation. Finally, through Neural Network visualization techniques, relevant patterns related to the requirements of the ISO/IEC 19794-5 standard could be observed.