You're reading from Codeless Deep Learning with KNIME

Product type: Book
Published in: Nov 2020
Reading level: Intermediate
Publisher: Packt
ISBN-13: 9781800566613
Edition: 1st
Authors:

Kathrin Melcher

Kathrin Melcher is a data scientist at KNIME. She holds a master's degree in mathematics from the University of Konstanz, Germany. She joined the evangelism team at KNIME in 2017 and has a strong interest in data science and machine learning algorithms. She enjoys teaching and sharing her data science knowledge with the community, for example, in the book From Excel to KNIME, as well as on various blog posts and at training courses, workshops, and conference presentations.

Rosaria Silipo

Rosaria Silipo, Ph.D., now head of data science evangelism at KNIME, has spent 25+ years in applied AI, predictive analytics, and machine learning at Siemens, Viseca, Nuance Communications, and private consulting. Sharing her practical experience in a broad range of industries and deployments, including IoT, customer intelligence, financial services, social media, and cybersecurity, Rosaria has authored 50+ technical publications, including her recent books Guide to Intelligent Data Science (Springer) and Codeless Deep Learning with KNIME (Packt).

Chapter 9: Convolutional Neural Networks for Image Classification

In the previous chapters, we talked about Recurrent Neural Networks (RNNs) and how they can be applied to different types of sequential data and use cases. In this chapter, we want to talk about another family of neural networks, called Convolutional Neural Networks (CNNs). CNNs are especially powerful when used on data with grid-like topology and spatial dependencies, such as images or videos.

We will start with a general introduction to CNNs, explaining the basic idea behind a convolution layer and introducing some related terminology such as padding, pooling, filters, and stride.
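The book itself builds everything codelessly in KNIME, but the terminology above can be made concrete in a few lines of code. The NumPy sketch below is our own illustration (not from the book) of how a filter slides across an image, how padding and stride change the output size, and how max pooling downsamples the result:

```python
import numpy as np

def conv2d(image, kernel, stride=1, padding=0):
    """Slide a square filter over an image: one dot product per
    position. Zero padding adds a border; stride is the step size."""
    if padding > 0:
        image = np.pad(image, padding)          # zero border
    h, w = image.shape
    k = kernel.shape[0]
    out_h = (h - k) // stride + 1               # output-size formula
    out_w = (w - k) // stride + 1
    out = np.zeros((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            patch = image[i*stride:i*stride + k, j*stride:j*stride + k]
            out[i, j] = np.sum(patch * kernel)
    return out

def max_pool(image, size=2):
    """Max pooling: keep only the largest value in each size x size
    block, downsampling the image."""
    h, w = image.shape
    return image[:h - h % size, :w - w % size] \
        .reshape(h // size, size, w // size, size).max(axis=(1, 3))

image = np.arange(36, dtype=float).reshape(6, 6)    # toy 6x6 "image"
kernel = np.ones((3, 3)) / 9.0                      # 3x3 averaging filter

print(conv2d(image, kernel).shape)                  # (4, 4): (6-3)/1 + 1
print(conv2d(image, kernel, padding=1).shape)       # (6, 6): size preserved
print(conv2d(image, kernel, stride=2, padding=1).shape)  # (3, 3)
print(max_pool(image).shape)                        # (3, 3): halved
```

The general output-size formula illustrated here is (W − F + 2P) / S + 1, where W is the input width, F the filter size, P the padding, and S the stride.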

Afterward, we will build and train a CNN for image classification from scratch. We will cover all the required steps: from reading and preprocessing the images to defining, training, and applying the CNN.

To train a neural network from scratch, a huge amount of labeled data is usually required. For some specific domains, such as...

Introduction to CNNs

CNNs are commonly used in image processing and have been the winning models in several image-processing competitions. They are often used, for example, for image classification, object detection, and semantic segmentation.

Sometimes, CNNs are also used for non-image tasks, such as recommendation systems, video analysis, or time series analysis. Indeed, CNNs are not limited to two-dimensional data with a grid structure; they can also be applied to one- or three-dimensional data. In this chapter, however, we focus on the most common CNN application area: image processing.

A CNN is a neural network with at least one convolution layer. As the name suggests, a convolution layer applies a mathematical transformation called a convolution to its input data. Through this transformation, convolution layers can detect and extract features from an image, such as edges, corners, and shapes. Combinations of such extracted features...
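As a toy illustration of such feature extraction (our own sketch, not from the book), a hand-crafted vertical-edge filter responds strongly only where pixel intensity changes from left to right and stays at zero over flat regions:

```python
import numpy as np

# A toy image: dark left half (0), bright right half (1),
# so there is a vertical edge down the middle.
image = np.zeros((5, 6))
image[:, 3:] = 1.0

# A classic vertical-edge filter (Sobel-like): it responds
# where intensity changes from left to right.
kernel = np.array([[-1., 0., 1.],
                   [-2., 0., 2.],
                   [-1., 0., 1.]])

out = np.zeros((3, 4))                  # "valid" convolution output
for i in range(3):
    for j in range(4):
        out[i, j] = np.sum(image[i:i+3, j:j+3] * kernel)

print(out[0])  # [0. 4. 4. 0.]: strong response only across the edge
```

In a trained CNN, such filter weights are not hand-crafted; they are learned from data during training.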

Classifying Images with CNNs

In this section, we will see how to build and train a CNN for image classification from scratch.

The goal is to classify handwritten digits between 0 and 9 with the data from the MNIST database, a large database of handwritten digits commonly used for training various image-processing applications. The MNIST database contains 60,000 training images and 10,000 testing images of handwritten digits and can be downloaded from this website: http://yann.lecun.com/exdb/mnist/.
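In KNIME, reading and preprocessing the images is done with dedicated nodes; as a rough code equivalent (our own sketch, using a random stand-in batch rather than the real MNIST files), the typical preprocessing steps look like this:

```python
import numpy as np

# Stand-in for a batch of MNIST images: 4 grayscale 28x28 images
# with pixel values 0-255, plus their digit labels.
rng = np.random.default_rng(0)
images = rng.integers(0, 256, size=(4, 28, 28)).astype(np.float32)
labels = np.array([3, 0, 9, 3])

# 1. Normalize pixel values from [0, 255] to [0, 1].
images = images / 255.0

# 2. Add a channel dimension: (batch, height, width, channels).
images = images[..., np.newaxis]

# 3. One-hot encode the labels for the 10 digit classes.
one_hot = np.eye(10)[labels]

print(images.shape)   # (4, 28, 28, 1)
print(one_hot[0])     # [0. 0. 0. 1. 0. 0. 0. 0. 0. 0.]
```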

To read and preprocess images, KNIME Analytics Platform offers a set of dedicated nodes and components, available after installing the KNIME Image Processing Extension.

Tip

The KNIME Image Processing Extension (https://www.knime.com/community/image-processing) allows you to read images in more than 140 different file formats (thanks to the Bio-Formats Application Programming Interface (API)). In addition, it can be used to apply well-known image-processing techniques such as...

Introduction to transfer learning

The general idea of transfer learning is to reuse the knowledge gained by a network trained for task A on another related task B. For example, if we train a network to recognize sailing boats (task A), we can use this network as a starting point to train a new model to recognize motorboats (task B). In this case, task A is called the source task and task B the target task.
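The mechanics of "reuse knowledge from task A for task B" can be sketched in miniature (our own illustration, not from the book): a frozen feature extractor stands in for the layers pretrained on the source task, and only a small new output layer is trained on the target task's data.

```python
import numpy as np

rng = np.random.default_rng(42)

# "Source" network: pretend these weights were learned on task A
# (e.g. sailing boats). In transfer learning they are kept frozen.
W_frozen = rng.normal(size=(8, 16)) * 0.3

def features(x):
    return np.maximum(0, x @ W_frozen)   # frozen feature extractor (ReLU)

# Target task B: a small labeled dataset, too small to train
# a whole network from scratch.
X = rng.normal(size=(32, 8))
y = (X[:, 0] > 0).astype(float)          # toy binary labels

# Only the new output layer (the "head") is trained.
w_head = np.zeros(16)

def predict(x):
    return 1 / (1 + np.exp(-(features(x) @ w_head)))   # sigmoid

for _ in range(200):                     # plain gradient descent
    p = predict(X)
    grad = features(X).T @ (p - y) / len(y)
    w_head -= 0.5 * grad                 # W_frozen is never updated

p = np.clip(predict(X), 1e-9, 1 - 1e-9)
final_loss = -np.mean(y * np.log(p) + (1 - y) * np.log(1 - p))
# final_loss drops below the chance level of ln 2 (about 0.693):
# the head learned task B on top of the frozen task-A features.
```

In a real deep learning framework, freezing corresponds to marking the pretrained layers as non-trainable before fitting the new head.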

Reusing a trained network as the starting point to train a new network is different from the traditional way of training networks, whereby neural networks are trained on their own for specific tasks on specific datasets. Figure 9.19 here visualizes the traditional way of network training, whereby different systems are trained for different tasks and domains:

Figure 9.19 – Traditional way of training machine learning models and neural networks

But why should we use transfer learning instead of training models in the traditional, isolated way...

Applying Transfer Learning for Cancer Type Prediction

We will introduce here a new (and final) case study. We will start from the state-of-the-art VGG16 network as a source network to train a new target network on a dataset of images describing three different subtypes of lymphoma, which are chronic lymphocytic leukemia (CLL), follicular lymphoma (FL), and mantle cell lymphoma (MCL).

A typical task for a pathologist in a hospital is to look at histopathology slide images and make a decision about the type of lymphoma. Even for experienced pathologists this is a difficult task and, in many cases, follow-up tests are required to confirm the diagnosis. An assistive technology that can guide pathologists and speed up their job would be of great value.

VGG16 is one of the winning models of the 2014 ImageNet Challenge. It is a stacked CNN, using kernels of size 3×3 with an increasing depth—that is, with an increasing number of filters. The original network was trained...
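The design choice of stacking small 3×3 kernels can be checked with a little arithmetic (our own sketch, not from the book): each additional stride-1 3×3 layer widens the region of the input that one output pixel "sees" by 2 pixels in each direction.

```python
def receptive_field(num_layers, kernel=3, stride=1):
    """Receptive field of a stack of identical stride-1 conv layers."""
    rf = 1
    for _ in range(num_layers):
        rf += (kernel - 1) * stride
    return rf

# Two stacked 3x3 convs see a 5x5 region; three see 7x7 - the same
# coverage as a single large kernel, but with fewer parameters
# (e.g. 2 layers * 3*3 = 18 weights vs 5*5 = 25 per channel pair)
# and an extra nonlinearity between the layers.
print(receptive_field(1))  # 3
print(receptive_field(2))  # 5
print(receptive_field(3))  # 7
```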

Summary

In this chapter, we explored CNNs, focusing on image data.

We started with an introduction to convolution layers, which motivate the name of this new family of neural networks. In this introduction, we explained why CNNs are so commonly used for image data, how convolutional networks work, and the impact of the many setting options, such as padding, stride, and kernel size. Next, we discussed pooling layers, commonly used in CNNs to efficiently downsample the data.

Finally, we put all this knowledge to work by building and training a CNN from scratch to classify images of digits between 0 and 9 from the MNIST dataset. Afterward, we discussed the concept of transfer learning, introduced four scenarios in which transfer learning can be applied, and showed how we can use transfer learning in the field of neural networks.

In the last section, we applied transfer learning to train a CNN to classify histopathology slide images. Instead of training it from scratch, this time we reused the convolutional layers of...

Questions and Exercises

  1. What is the kernel size in a convolutional layer?

    a) The area summarized by a statistical value

    b) The size of the matrix moving across an image

    c) The number of pixels to shift the matrix

    d) The size of the area used by a layer

  2. What is a pooling layer?

    a) A pooling layer is a commonly used layer in RNNs

    b) A pooling layer summarizes an area with a statistical value

    c) A pooling layer is a commonly used layer in feedforward networks

    d) A pooling layer can be used to upsample images

  3. When is transfer learning helpful?

    a) To transfer data to another system

    b) If no model is available

    c) If not enough labeled data is available

    d) To compare different models
