Tuning and Optimizing Models

In the last two chapters, we trained deep learning models for classification, regression, and image recognition tasks. In this chapter, we will discuss some important issues in managing deep learning projects. While this chapter may seem somewhat theoretical, any of the issues discussed here can derail your deep learning project if it is not correctly managed. We will look at how to choose evaluation metrics and how to estimate how well a deep learning model will perform before you begin modeling. Next, we will move on to data distribution and the mistakes often made when splitting data into partitions for training. Many machine learning projects fail in production because the data distribution is different from what the model was trained on. We will look at data augmentation, a valuable method to enhance your model's...

Evaluation metrics and evaluating performance

This section will discuss how to set up a deep learning project: how to select evaluation criteria and how to decide when a model is approaching optimal performance. We will also discuss how all deep learning models tend to overfit and how to manage the bias/variance trade-off. This will give guidelines on what to do when models have low accuracy.

Types of evaluation metric

Different evaluation metrics are used for classification and regression tasks. For classification, accuracy is the most commonly used evaluation metric. However, accuracy is only valid if the cost of errors is the same for all classes, which is not always...
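
To see why this matters, here is a minimal sketch in R (illustrative code, not from the book's code files) in which a classifier that always predicts the majority class still scores 95% accuracy, even though it never detects the minority class:

set.seed(42)
actual <- factor(c(rep("negative", 950), rep("positive", 50)))
predicted <- factor(rep("negative", 1000), levels = levels(actual))

# Confusion matrix: all 50 positive cases are missed
table(predicted, actual)

# Accuracy looks excellent despite the model being useless for positives
mean(predicted == actual)  # 0.95

# Recall for the positive class exposes the problem
sum(predicted == "positive" & actual == "positive") / sum(actual == "positive")  # 0

If false negatives are costly, a metric such as recall, precision, or a cost-weighted measure is a better choice than raw accuracy.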

Data preparation

Machine learning is about training a model to generalize from the cases it sees so that it can make predictions on unseen data. Therefore, the data used to train a deep learning model should be similar to the data that the model will see in production. However, at an early product stage, you may have little or no data to train a model, so what can you do? For example, a mobile app could include a machine learning model that predicts the subject of an image taken by the phone's camera. When the app is being written, there may not be enough data to train the model using a deep learning network. One approach would be to augment the dataset with images from other sources to train the deep learning network. However, you need to know how to manage this and how to deal with the uncertainty it introduces. Another approach is transfer learning, which we will cover in Chapter 11...
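
To make the idea of correct partitions concrete, here is a hedged sketch in R of splitting a dataset into train, validation, and test sets; the data frame df and the 70/15/15 ratios are illustrative, not from the book:

set.seed(42)
df <- data.frame(x = rnorm(1000), y = sample(0:1, 1000, replace = TRUE))

n <- nrow(df)
idx <- sample(n)  # shuffle the row indices
train <- df[idx[1:floor(0.70 * n)], ]
valid <- df[idx[(floor(0.70 * n) + 1):floor(0.85 * n)], ]
test  <- df[idx[(floor(0.85 * n) + 1):n], ]

# Sanity check: the class balance should be similar across partitions,
# mirroring the distribution the model will see in production
sapply(list(train = train, valid = valid, test = test), function(d) mean(d$y))

The sanity check at the end reflects the point above: if the test set's distribution does not resemble production data, the evaluation will be misleading.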

Data augmentation

One approach to increasing the accuracy of a model, regardless of the amount of data you have, is to create artificial examples based on existing data. This is called data augmentation. Data augmentation can also be applied at test time to improve prediction accuracy.

Using data augmentation to increase the training data

We are going to apply data augmentation to the MNIST dataset that we used in previous chapters. The code for this section is in Chapter6/explore.Rmd if you want to follow along. In Chapter 5, Image Classification Using Convolutional Neural Networks, we plotted some examples from the MNIST data, so we won't repeat that code here. It is included in the code file, and you can also refer back...
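
As a flavor of what such augmentation can look like, the following is a minimal sketch, not the Chapter6/explore.Rmd code; it assumes images are stored as 28 x 28 grayscale matrices, as MNIST digits are, and creates new training examples by shifting an image a few pixels:

# Shift an image by (dx, dy) pixels, padding the vacated edge with zeros
shift_image <- function(img, dx = 0, dy = 0) {
  out <- matrix(0, nrow = nrow(img), ncol = ncol(img))
  src_rows <- max(1, 1 - dy):min(nrow(img), nrow(img) - dy)
  src_cols <- max(1, 1 - dx):min(ncol(img), ncol(img) - dx)
  out[src_rows + dy, src_cols + dx] <- img[src_rows, src_cols]
  out
}

# Toy "digit": a bright square in the centre of a 28 x 28 image
img <- matrix(0, 28, 28)
img[10:18, 10:18] <- 1

# Four augmented copies, each shifted 2 pixels in a different direction
augmented <- list(shift_image(img, dx = 2), shift_image(img, dx = -2),
                  shift_image(img, dy = 2), shift_image(img, dy = -2))

A shifted digit is still the same digit, so each shift yields a new labeled training example for free.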

Tuning hyperparameters

All machine learning algorithms have hyperparameters, that is, settings that change how they operate. These hyperparameters can improve the accuracy of a model or reduce its training time. We have seen some of these hyperparameters in previous chapters, particularly in Chapter 3, Deep Learning Fundamentals, where we looked at the hyperparameters that can be set in the mx.model.FeedForward.create function. The techniques in this section can help us find better values for the hyperparameters.

Tuning hyperparameters is not a magic bullet; if the raw data quality is poor or there is not enough data to support training, then tuning will only get you so far. In these cases, you may need to acquire additional variables/features that can be used as predictors, additional cases, or both.
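
As an illustration of one such technique, here is a hedged sketch of random search over two hyperparameters. The train_and_evaluate function is a hypothetical placeholder standing in for training a network (for example, with mx.model.FeedForward.create) and scoring it on a validation set:

set.seed(42)
n_trials <- 10
results <- data.frame(learning_rate = numeric(n_trials),
                      batch_size = numeric(n_trials),
                      val_accuracy = numeric(n_trials))

train_and_evaluate <- function(lr, batch_size) {
  # Placeholder: train with these settings and return validation accuracy.
  # A random score is used here purely for illustration.
  runif(1)
}

for (i in seq_len(n_trials)) {
  lr <- 10 ^ runif(1, min = -4, max = -1)  # sample on a log scale
  bs <- sample(c(32, 64, 128, 256), 1)
  results[i, ] <- c(lr, bs, train_and_evaluate(lr, bs))
}

results[which.max(results$val_accuracy), ]  # best settings found

Sampling the learning rate on a log scale is a common choice because its useful values span several orders of magnitude.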

...

Use case—using LIME for interpretability

Deep learning models are known to be difficult to interpret. Some approaches to model interpretability, including LIME, allow us to gain insight into how a model came to its conclusions. Before we demonstrate LIME, I will show how different data distributions and/or data leakage can cause problems when building deep learning models. We will reuse the deep learning churn model from Chapter 4, Training Deep Prediction Models, but we are going to make one change to the data: we will introduce a bad variable that is highly correlated with the y value. We will only include this variable in the data used to train and evaluate the model. A separate test set from the original data will be kept to represent the data the model will see in production; this set will not contain the bad variable. The creation of this bad variable...
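
As a preview of the LIME workflow in R, here is a hedged sketch using the lime package with a stand-in random forest classifier on the iris data; the book's use case applies LIME to the churn model instead, and lime needs custom model_type() and predict_model() methods for frameworks it does not support out of the box:

library(lime)
library(caret)  # lime supports caret models natively

set.seed(42)
model <- train(Species ~ ., data = iris, method = "rf")  # needs randomForest

# Build an explainer from the training data and the fitted model
explainer <- lime(iris[, 1:4], model)

# Explain a few predictions: which features pushed each prediction
# towards its class, and by how much
explanation <- explain(iris[c(1, 51, 101), 1:4], explainer,
                       n_labels = 1, n_features = 4)
plot_features(explanation)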

Summary

This chapter covered topics that are critical to the success of deep learning projects. These included the different types of evaluation metric that can be used to assess a model. We looked at some issues that can come up in data preparation, including what to do when you have only a small amount of data to train on and how to split the data correctly, that is, how to create proper train, test, and validation datasets. We looked at two important issues that can cause a model to perform poorly in production: different data distributions and data leakage. We saw how data augmentation can be used to improve an existing model by creating artificial data, and we looked at tuning hyperparameters to improve the performance of a deep learning model. We closed the chapter by examining a use case where we simulated a problem with different data distributions/data leakage and...

You have been reading a chapter from R Deep Learning Essentials - Second Edition (Packt, August 2018, ISBN-13: 9781788992893).

Authors (2)

Mark Hodnett

Mark Hodnett is a data scientist with over 20 years of industry experience in software development, business intelligence systems, and data science. He has worked in a variety of industries, including CRM systems, retail loyalty, IoT systems, and accountancy. He holds a master's in data science and an MBA. He works in Cork, Ireland, as a senior data scientist with AltViz.

Joshua F. Wiley

Joshua F. Wiley is a lecturer at Monash University, conducting quantitative research on sleep, stress, and health. He earned his Ph.D. from the University of California, Los Angeles and completed postdoctoral training in primary care and prevention. In statistics and data science, Joshua focuses on biostatistics and is interested in reproducible research and graphical displays of data and statistical models. He develops or co-develops a number of R packages including Varian, a package to conduct Bayesian scale-location structural equation models, and MplusAutomation, a popular package that links R to the commercial Mplus software.