Spotting Deepfakes and Face Manipulations by Fusing Features from Multi-Stream CNNs Models

Deepfake is one of the applications that is deemed harmful. Deepfakes are a sort of image or video manipulation in which a person’s image is changed or swapped with that of another person’s face using artificial neural networks. Deepfake manipulations may be done with a variety of techniques and app...

Full description

Bibliographic Details
Main Authors:	Semih Yavuzkilic, Abdulkadir Sengur, Zahid Akhtar, Kamran Siddique
Format:	Article
Language:	English
Published:	MDPI AG 2021-07-01
Series:	Symmetry
Subjects:	deepfake fake face detection face manipulations multi-stream CNNs
Online Access:	https://www.mdpi.com/2073-8994/13/8/1352

id	doaj-c21bcd278e3f4ceebafd99e02513ab5a
record_format	Article
spelling	doaj-c21bcd278e3f4ceebafd99e02513ab5a2021-08-26T14:23:45ZengMDPI AGSymmetry2073-89942021-07-01131352135210.3390/sym13081352Spotting Deepfakes and Face Manipulations by Fusing Features from Multi-Stream CNNs ModelsSemih Yavuzkilic0Abdulkadir Sengur1Zahid Akhtar2Kamran Siddique3Department of Electrical and Electronics Engineering, Fırat University, Elazig 23000, TurkeyDepartment of Electrical and Electronics Engineering, Fırat University, Elazig 23000, TurkeyDepartment of Network and Computer Security, State University of New York Polytechnic Institute, Utica, NY 13502, USADepartment of Information and Communication Technology, Xiamen University Malaysia, Sepang 43900, MalaysiaDeepfake is one of the applications that is deemed harmful. Deepfakes are a sort of image or video manipulation in which a person’s image is changed or swapped with that of another person’s face using artificial neural networks. Deepfake manipulations may be done with a variety of techniques and applications. A quintessential countermeasure against deepfake or face manipulation is deepfake detection method. Most of the existing detection methods perform well under symmetric data distributions, but are still not robust to asymmetric datasets variations and novel deepfake/manipulation types. In this paper, for the identification of fake faces in videos, a new multistream deep learning algorithm is developed, where three streams are merged at the feature level using the fusion layer. After the fusion layer, the fully connected, Softmax, and classification layers are used to classify the data. The pre-trained VGG16 model is adopted for transferred CNN1stream. In transfer learning, the weights of the pre-trained CNN model are further used for training the new classification problem. In the second stream (transferred CNN2), the pre-trained VGG19 model is used. Whereas, in the third stream, the pre-trained ResNet18 model is considered. In this paper, a new large-scale dataset (i.e., World Politicians Deepfake Dataset (WPDD)) is introduced to improve deepfake detection systems. The dataset was created by downloading videos of 20 different politicians from YouTube. Over 320,000 frames were retrieved after dividing the downloaded movie into little sections and extracting the frames. Finally, various manipulations were performed to these frames, resulting in seven separate manipulation classes for men and women. In the experiments, three fake face detection scenarios are investigated. First, fake and real face discrimination is studied. Second, seven face manipulations are performed, including age, beard, face swap, glasses, hair color, hairstyle, smiling, and genuine face discrimination. Third, performance of deepfake detection system under novel type of face manipulation is analyzed. The proposed strategy outperforms the prior existing methods. The calculated performance metrics are over 99%.https://www.mdpi.com/2073-8994/13/8/1352deepfakefake face detectionface manipulationsmulti-stream CNNs
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Semih Yavuzkilic Abdulkadir Sengur Zahid Akhtar Kamran Siddique
spellingShingle	Semih Yavuzkilic Abdulkadir Sengur Zahid Akhtar Kamran Siddique Spotting Deepfakes and Face Manipulations by Fusing Features from Multi-Stream CNNs Models Symmetry deepfake fake face detection face manipulations multi-stream CNNs
author_facet	Semih Yavuzkilic Abdulkadir Sengur Zahid Akhtar Kamran Siddique
author_sort	Semih Yavuzkilic
title	Spotting Deepfakes and Face Manipulations by Fusing Features from Multi-Stream CNNs Models
title_short	Spotting Deepfakes and Face Manipulations by Fusing Features from Multi-Stream CNNs Models
title_full	Spotting Deepfakes and Face Manipulations by Fusing Features from Multi-Stream CNNs Models
title_fullStr	Spotting Deepfakes and Face Manipulations by Fusing Features from Multi-Stream CNNs Models
title_full_unstemmed	Spotting Deepfakes and Face Manipulations by Fusing Features from Multi-Stream CNNs Models
title_sort	spotting deepfakes and face manipulations by fusing features from multi-stream cnns models
publisher	MDPI AG
series	Symmetry
issn	2073-8994
publishDate	2021-07-01
description	Deepfake is one of the applications that is deemed harmful. Deepfakes are a sort of image or video manipulation in which a person’s image is changed or swapped with that of another person’s face using artificial neural networks. Deepfake manipulations may be done with a variety of techniques and applications. A quintessential countermeasure against deepfake or face manipulation is deepfake detection method. Most of the existing detection methods perform well under symmetric data distributions, but are still not robust to asymmetric datasets variations and novel deepfake/manipulation types. In this paper, for the identification of fake faces in videos, a new multistream deep learning algorithm is developed, where three streams are merged at the feature level using the fusion layer. After the fusion layer, the fully connected, Softmax, and classification layers are used to classify the data. The pre-trained VGG16 model is adopted for transferred CNN1stream. In transfer learning, the weights of the pre-trained CNN model are further used for training the new classification problem. In the second stream (transferred CNN2), the pre-trained VGG19 model is used. Whereas, in the third stream, the pre-trained ResNet18 model is considered. In this paper, a new large-scale dataset (i.e., World Politicians Deepfake Dataset (WPDD)) is introduced to improve deepfake detection systems. The dataset was created by downloading videos of 20 different politicians from YouTube. Over 320,000 frames were retrieved after dividing the downloaded movie into little sections and extracting the frames. Finally, various manipulations were performed to these frames, resulting in seven separate manipulation classes for men and women. In the experiments, three fake face detection scenarios are investigated. First, fake and real face discrimination is studied. Second, seven face manipulations are performed, including age, beard, face swap, glasses, hair color, hairstyle, smiling, and genuine face discrimination. Third, performance of deepfake detection system under novel type of face manipulation is analyzed. The proposed strategy outperforms the prior existing methods. The calculated performance metrics are over 99%.
topic	deepfake fake face detection face manipulations multi-stream CNNs
url	https://www.mdpi.com/2073-8994/13/8/1352
work_keys_str_mv	AT semihyavuzkilic spottingdeepfakesandfacemanipulationsbyfusingfeaturesfrommultistreamcnnsmodels AT abdulkadirsengur spottingdeepfakesandfacemanipulationsbyfusingfeaturesfrommultistreamcnnsmodels AT zahidakhtar spottingdeepfakesandfacemanipulationsbyfusingfeaturesfrommultistreamcnnsmodels AT kamransiddique spottingdeepfakesandfacemanipulationsbyfusingfeaturesfrommultistreamcnnsmodels
_version_	1721189749222277120

Spotting Deepfakes and Face Manipulations by Fusing Features from Multi-Stream CNNs Models

Similar Items