استفاده از معماری شبکه عصبی پیچشی عمیق برای تشخیص چهره
ترجمه نشده

استفاده از معماری شبکه عصبی پیچشی عمیق برای تشخیص چهره

عنوان فارسی مقاله: تشخیص چهره با وضوح پایین با استفاده از یک معماری شبکه عصبی پیچشی عمیق دو شاخه
عنوان انگلیسی مقاله: Low resolution face recognition using a two-branch deep convolutional neural network architecture
مجله/کنفرانس: سیستم های خبره با برنامه های کاربردی - Expert Systems With Applications
رشته های تحصیلی مرتبط: کامپیوتر
گرایش های تحصیلی مرتبط: هوش مصنوعی، معماری سیستم های کامپیوتری، مهندسی نرم افزار
کلمات کلیدی فارسی: تشخیص چهره با وضوح پایین، روشهای فراتفکیک پذیری، روشهای نگاشت تزویج شده، شبکه های عصبی پیچشی عمیق
کلمات کلیدی انگلیسی: Low resolution face recognition، Super-resolution methods، Coupled mappings methods، Deep convolutional neural networks
نوع نگارش مقاله: مقاله پژوهشی (Research Article)
نمایه: Scopus - Master Journals List - JCR
شناسه دیجیتال (DOI): https://doi.org/10.1016/j.eswa.2019.112854
دانشگاه: Department of Computer Engineering and Information Technology at Amirkabir University of Technology, Tehran, Iran
صفحات مقاله انگلیسی: 11
ناشر: الزویر - Elsevier
نوع ارائه مقاله: ژورنال
نوع مقاله: ISI
سال انتشار مقاله: 2020
ایمپکت فاکتور: 5/891 در سال 2019
شاخص H_index: 162 در سال 2020
شاخص SJR: 1/190 در سال 2019
شناسه ISSN: 0957-4174
شاخص Quartile (چارک): Q1 در سال 2019
فرمت مقاله انگلیسی: PDF
وضعیت ترجمه: ترجمه نشده است
قیمت مقاله انگلیسی: رایگان
آیا این مقاله بیس است: خیر
آیا این مقاله مدل مفهومی دارد: ندارد
آیا این مقاله پرسشنامه دارد: ندارد
آیا این مقاله متغیر دارد: ندارد
کد محصول: E14826
رفرنس: دارای رفرنس در داخل متن و انتهای مقاله
فهرست مطالب (انگلیسی)

Abstract

1- Introduction

2- Previous works

3-  Proposed method

4- Experimental evaluation

5- Discussion and conclusion

References

بخشی از مقاله (انگلیسی)

Abstract

We propose a novel coupled mappings method for low resolution face recognition using deep convolutional neural networks (DCNNs). The proposed architecture consists of two branches of DCNNs to map the high and low resolution face images into a common space with nonlinear transformations. The branch corresponding to transformation of high resolution images consists of 14 layers and the other branch which maps the low resolution face images to the common space includes a 5-layer super-resolution network connected to a 14-layer network. The distance between the features of corresponding high and low resolution images are backpropagated to train the networks. Our proposed method is evaluated on FERET, LFW, and MBGC datasets and compared with state-of-the-art competing methods. Our extensive experimental evaluations show that the proposed method significantly improves the recognition performance especially for very low resolution probe face images (5% improvement in recognition accuracy). Furthermore, it can reconstruct a high resolution image from its corresponding low resolution probe image which is comparable with the state-of-the-art super-resolution methods in terms of visual quality.

Introduction

In the past few decades, face recognition has shown promising performance in numerous applications and under challenging conditions such as occlusion (Jia & Martinez, 2009), variation in pose, illumination, and expression (Martínez, 2002). While many face recognition systems have been developed for recognizing high quality face images in controlled conditions (Zhao, Chellappa, Phillips, & Rosenfeld, 2003), there are a few studies focused on face recognition in real world applications such as surveillance systems with low resolution faces (Pnevmatikakis & Polymenakos, 2007). One important challenge in these applications is that high resolution (HR) probe images may not be available due to the large distance of the camera from the subject. Here, we focus on addressing the problem of recognizing low resolution probe face images when a gallery of high quality images is available. There are three standard approaches to address this problem. (1) down sampling the gallery images to the resolution of the probe images and then performing the recognition. However, this approach is suboptimal because the additional discriminating information available in the high resolution gallery images is lost.