Can You Find Out Which Movie Is a Deepfake?

In the previous chapters, we explored various data formats (tabular, geospatial, text, image, and acoustic) while working with Kaggle datasets, learning about shapefile visualization, building models for image and text classification, and analyzing acoustic signals.

In this chapter, we will introduce video data analysis. We will start by describing a Kaggle competition, the Deepfake Detection Challenge, which challenged participants to identify which videos had been artificially generated to create convincing, realistic fake content. We will continue by quickly reviewing the most commonly used video formats, followed by introducing two utility scripts used for our data analysis: first, a utility script with functions for manipulating video content, i.e., reading and visualizing images from videos and playing video files; second, a utility script with functions for body, face, and facial feature detection. We will continue with metadata exploration...

Introducing the competition

In this chapter, we examine data from the well-known Kaggle competition, the Deepfake Detection Challenge (DFDC). The competition, detailed in Reference 1, commenced on December 11, 2019, and concluded on March 31, 2020. It attracted 2,265 teams comprising 2,904 participants, who collectively made 8,951 submissions. Competitors vied for a total prize pool of $1,000,000, with the first prize being $500,000.

The event was a collaborative effort involving AWS, Facebook, Microsoft, the Partnership on AI’s Media Integrity Steering Committee, and various academic entities. At the time, there was widespread agreement among tech industry leaders and academics on the technical complexity and rapidly changing nature of media content manipulation. The competition’s aim was to encourage researchers worldwide to devise innovative and effective technologies to detect deepfakes and media manipulation. Unlike later competitions that focused on code, this...

Introducing competition utility scripts

Let’s begin by grouping the Python modules with reusable functions for video manipulation into two Kaggle utility scripts. The first utility script groups functions to load and display images from videos or play video files. The second one is geared toward object detection in videos – more specifically, detecting human faces and bodies – using a few alternative methods.

Video data utils

We developed a utility script to assist us with manipulating video data. We will use it in the notebook associated with the current chapter to read video data, as well as to visualize frames from a video file.

The video_utils utility script includes functions to load, transform, and display images from videos. It also contains a function to play video content. For video manipulation, we will use the OpenCV library. OpenCV is an open-source computer vision library widely...
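
The full implementations live in the video_utils script on Kaggle, and the excerpt here does not show their bodies. As a rough illustration only, a minimal version of a function like display_image_from_video, built on OpenCV and Matplotlib, might look like the following sketch (the body is an assumption, not the script’s actual code):

import cv2
import matplotlib.pyplot as plt

def display_image_from_video(video_path):
    # Illustrative sketch: grab the first frame of a video and display it
    capture = cv2.VideoCapture(video_path)   # open the video file
    ret, frame = capture.read()              # read the first frame
    capture.release()
    if not ret:
        raise ValueError(f'Could not read a frame from {video_path}')
    # OpenCV returns frames in BGR order; convert to RGB before plotting
    frame = cv2.cvtColor(frame, cv2.COLOR_BGR2RGB)
    plt.figure(figsize=(8, 6))
    plt.imshow(frame)
    plt.axis('off')
    plt.show()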

Metadata exploration

We start by importing the utility functions and classes from the utility scripts for data quality, plot utils, video utils, and face object detection. The following code block shows what we import from the utility scripts:

from data_quality_stats import missing_data, unique_values, most_frequent_values
from plot_style_utils import set_color_map, plot_count
from video_utils import display_image_from_video, display_images_from_video_list, play_video
from face_object_detection import CascadeObjectDetector, FaceObjectDetector
from face_detection_mtcnn import MTCNNFaceDetector

After we load the data files (the train and test samples), we are ready to start our analysis. The following code block checks the types of files in TRAIN_SAMPLE_FOLDER:

train_list = list(os.listdir(os.path.join(DATA_FOLDER, TRAIN_SAMPLE_FOLDER)))
ext_dict = []
for file in train_list:
    file_ext = file.split('.')[1]
    if (file_ext not in ext_dict):
        ext_dict.append(file_ext)
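
The training metadata for the sample videos is provided as a metadata.json file in the same folder, keyed by video file name. A minimal, hedged sketch for loading it into the meta_train_df DataFrame used later in the chapter (assuming this file name and layout) could be:

import os
import pandas as pd

# metadata.json maps each video file name to its label ('REAL' or 'FAKE'),
# its split, and, for fake videos, the original video it was derived from
meta_train_df = pd.read_json(
    os.path.join(DATA_FOLDER, TRAIN_SAMPLE_FOLDER, 'metadata.json')
).T  # transpose so that each row corresponds to one video file

print(meta_train_df.shape)
meta_train_df.head()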

Video data exploration

In this section, we will visualize a few sample files, and then we will begin performing object detection to try to capture the image features that might show anomalies when manipulated to create deepfakes; these are mostly the eyes, mouths, and human figures.

We will start by visualizing sample files, both genuine videos and deepfakes. We will then apply the first algorithm introduced previously for face, eye, and mouth detection, the one based on Haar cascades (a minimal sketch of the underlying calls follows below). We will then follow with the alternative algorithm, based on MTCNN.
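
The Haar cascade logic is wrapped in the CascadeObjectDetector class of the face_object_detection utility script. As a stand-alone illustration of the underlying OpenCV calls (a minimal sketch, not the script’s actual implementation, and using the frontal face model bundled with OpenCV rather than the cascades dataset from Reference 2), face detection on a single frame looks roughly like this:

import cv2

# load a pre-trained frontal face Haar cascade shipped with OpenCV
face_cascade = cv2.CascadeClassifier(
    cv2.data.haarcascades + 'haarcascade_frontalface_default.xml'
)

def detect_faces_haar(frame):
    # return bounding boxes (x, y, w, h) of faces found in a BGR frame
    gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)  # cascades work on grayscale
    return face_cascade.detectMultiScale(gray, scaleFactor=1.3, minNeighbors=5)

Eye and mouth detection follow the same pattern, with the corresponding cascade XML files loaded instead of the frontal face model.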

Visualizing sample files

The following code block selects a few video files from the set of fake videos and then visualizes an image captured from each of them, using the display_image_from_video function from the video_utils utility script:

fake_train_sample_video = list(meta_train_df.loc[meta_train_df.label=='FAKE'].sample(3).index)
for video_file in fake_train_sample_video:
    display_image_from_video(os.path.join(DATA_FOLDER, TRAIN_SAMPLE_FOLDER, video_file))
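
For comparison, the face_detection_mtcnn utility script relies on the MTCNN detector. A minimal sketch of face and key-point detection on a single frame with the mtcnn package (again an illustration, not the script’s exact code) is shown below:

import cv2
from mtcnn import MTCNN

detector = MTCNN()

def detect_faces_mtcnn(frame):
    # return MTCNN detections: bounding box, confidence, and facial key points
    rgb = cv2.cvtColor(frame, cv2.COLOR_BGR2RGB)  # MTCNN expects RGB input
    detections = detector.detect_faces(rgb)
    # each detection is a dict with 'box' ([x, y, w, h]), 'confidence', and
    # 'keypoints' (left_eye, right_eye, nose, mouth_left, mouth_right)
    return detections

The key points returned for the eyes, nose, and mouth are exactly the facial features we want to inspect for deepfake artifacts.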

Summary

In this chapter, we began by introducing a series of utility scripts, which are reusable Python modules on Kaggle designed for video data manipulation. One such script, video_utils, is used to visualize images from videos and play them. Another script, face_object_detection, utilizes Haar cascade models for face detection.

The third script, face_detection_mtcnn, employs MTCNN models to identify faces and key points such as the eyes, nose, and mouth. We then examined the metadata and video data from the DFDC competition dataset. In this dataset, we applied the aforementioned face detection methods to images from training and test videos, finding the MTCNN model approach to be more robust and accurate, with fewer false positives.

As we near the conclusion of our exploration of data, we will reflect on our journey through various data formats, including tabular, text, image, sound, and now video. We’ve delved into numerous Kaggle datasets and competition datasets...

References

  1. Deepfake Detection Challenge, Kaggle competition, Identify videos with facial or voice manipulations: https://www.kaggle.com/competitions/deepfake-detection-challenge
  2. Haar Cascades for Face Detection, Kaggle dataset: https://www.kaggle.com/datasets/gpreda/haar-cascades-for-face-detection
  3. Serkan Peldek – Face Detection with OpenCV, Kaggle notebook: https://www.kaggle.com/code/serkanpeldek/face-detection-with-opencv/
  4. Kaipeng Zhang, Zhanpeng Zhang, Zhifeng Li, and Yu Qiao – Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks: https://arxiv.org/abs/1604.02878
  5. Justin Güse – Face Detection using MTCNN — a guide for face extraction with a focus on speed: https://towardsdatascience.com/face-detection-using-mtcnn-a-guide-for-face-extraction-with-a-focus-on-speed-c6d59f82d49
