You're reading from Hands-On Data Visualization with Bokeh

Product typeBook

Published inJun 2018

Reading LevelIntermediate

PublisherPackt

ISBN-139781789135404

Edition1st Edition

Languages

Python

Tools

Bokeh

Concepts

Data Visualization

Author (1)

Kevin Jolly

The Bokeh Workflow – A Case Study

When it comes to building your very own Bokeh visualization from scratch, a good practice to develop is to never start with Bokeh. Instead, the ideal approach is to perform a little exploratory analysis on your data first, in order to visualize the application you can create using Bokeh that can deliver the most value to your users.

Such an approach, of first exploring your dataset, helps you formulate the ideal visualization that you might want to present to your audience.

In this chapter, you will learn the exact workflow that you need to follow, from when you get the data to the final visualization that you want to present.

Bokeh, like most data visualization tools, is best used in a workflow that follows a logical sequence of steps, which will allow you to deliver impactful insights to your audience. This workflow can be summarized...

Technical requirements

You will be required to have Python installed on a system. Finally, to use the Git repository of this book, the user needs to install Git.

The code files of this chapter can be found on GitHub:
https://github.com/PacktPublishing/Hands-on-Data-Visualization-with-Bokeh.

Check out the following video to see the code in action:

http://bit.ly/2sLSoX1.

Asking the right question

Asking the right question is by far the most important step when it comes to data visualization. What is the answer that you are seeking?

Some of the most common questions that you need to ask yourself before deciding to visualize data are:

Do I want to observe how well two features are correlated?
Do I suspect potential outliers in my data that I cannot see unless I visualize my data?
Do I want to see whether my data shows a particular trend over a period of time?
Do I want to observe the distribution of individual features/columns in my data?
Do I want to see whether there are clusters/groups within my data that I can potentially extract value from?
Do I believe that a visualization can tell my audience a story about the data?

If the answer to any one of these questions is a yes, then you know that you need to visualize your data. The second question...

The exploratory data analysis

Since we have worked extensively with the S&P 500 stock data from Kaggle, we are going to be using that dataset in order to create our application. The dataset can be found here: https://www.kaggle.com/camnugent/sandp500/data.

The first step is to read the data into Jupyter Notebook and understand what the data looks like. This can be done using the code shown here:

#Import packages

import pandas as pd

#Read the data into the notebook

df = pd.read_csv('all_stocks_5yr.csv')

#Extract information about the data

df.info()

This renders the output shown in this screenshot:

This sheds information on the number of rows the dataset has, the data types of each column, the number of variables, and any missing values.

The next step is to understand the kind of information contained in all the columns of your dataset. We can do this by using the...

Creating an insightful visualization

Now that we have a fundamental idea of what our data contains, we can proceed to making the visualization. The first step is to ensure we have the foundation of the visualization ready.

Creating the base plot

The foundation consists of the base plot that you want to visualize. In our case, we want to see how the volume of stocks traded over a period of time correlates with the high prices. In order to build this application, we use the code shown here:

#Import the required packages

from bokeh.io import curdoc
from bokeh.models import ColumnDataSource
from bokeh.plotting import figure
import pandas as pd

#Read the data into the notebook

df = pd.read_csv('all_stocks_5yr.csv')

#Convert...

Presenting your results

The right visualization is not just limited to picking the right type of plot, such as scatter plots or bar charts. It extends to picking the right colors, shapes, markers, and features.

Some of the questions that you will want to ask yourself when choosing the right visualization are as follows:

Do I want to transmit a positive message to my readers? If yes, the colors green and blue are a great choice
Do I want to transmit an alarming/negative message, indicating some form of danger/decline to my readers? If yes, the color red works best
Do I want to show how two different segments/categories differ from each other? If yes, using contrasting colors such as red and blue works well

The tone of the insight and message that you want to convey is critical when it comes to creating the ideal visualization.

Summary

In this chapter, you learned how to build a real-time Bokeh visualization that can be used to analyze the performance of stocks from scratch. You learned how to perform initial exploratory data analysis in order to determine the kind of visualization that you wanted to create. You then created the visualization and improved its performance using WebGL.

Finally, you learned the four steps that form an integral part of the Bokeh workflow. You learned how asking the right kinds of questions is pivotal in any data visualization project, followed by the exploratory data analysis. You also learned how presenting your results is not limited to the type of plot you use, but also the tone of the message that you want to convey to the audience.

This concludes the book! I hope the book has given you an informative, hands-on introduction to the world of Bokeh! I hope you will continue...

The rest of the chapter is locked

You have been reading a chapter from

Hands-On Data Visualization with Bokeh

Published in: Jun 2018Publisher: PacktISBN-13: 9781789135404

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

undefined

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $15.99/month. Cancel anytime

Author (1)

Kevin Jolly

Kevin Jolly is a formally educated data scientist with a master's degree in data science from the prestigious King's College London. Kevin works as a statistical analyst with a digital healthcare start-up, Connido Limited, in London, where he is primarily involved in leading the data science projects that the company undertakes. He has built machine learning pipelines for small and big data, with a focus on scaling such pipelines into production for the products that the company has built. Kevin is also the author of a book titled Hands-On Data Visualization with Bokeh, published by Packt. He is the editor-in-chief of Linear, a weekly online publication on data science software and products.
Read more about Kevin Jolly

Other recommended products

Related to this chapter

Applied Data Visualization with R and ggplot2

When data is presented to you in a graphical or pictorial format, you can analyze it more effectively. This book begins by introducing you to basic concepts, such as grammar of graphics and geometric objects. It then goes on to explain these concepts in detail with examples. Once you are comfortable with basics, you can learn all about the advanced plotting techniques, such as box plots and density plots. With this book, you can transform data into useful material and make data analysis interesting and fun.

BookSep 2018140 pages

Data Visualization with Python for Beginners

Utilizing tools and operations from several major libraries, this book will teach you to visualize data with Python comfortably and confidently in no time at all.

BookMar 2021280 pages

Interactive Data Visualization with Python

Interactive Data Visualization with Python sharpens your data exploration skills, tells you everything there is to know about interactive data visualization in Python, and most importantly, helps you make your storytelling more intuitive and persuasive.

BookApr 2020362 pages

Interactive Data Visualization with Python

Interactive Data Visualization with Python sharpens your data exploration skills, tells you everything there is to know about interactive data visualization in Python, and most importantly, helps you make your storytelling more intuitive and persuasive.

BookOct 2019362 pages

Python Data Analysis

This book takes a practical approach to Python data analysis, showing you how to use Python libraries such as pandas, NumPy, SciPy, and scikit-learn to analyze a variety of data. You’ll also get up to speed with everything from data manipulation to visualization systematically.

BookFeb 2021478 pages5

Data Visualization with Python

With so much data being continuously generated, developers with a knowledge of data analytics and data visualization are always in demand. With Data Visualization with Python, you'll learn how to use Python with NumPy, Pandas, Matplotlib, and Seaborn to create impactful data visualizations with a real world, public data.

BookFeb 2019368 pages

The Data Visualization Workshop

Cut through the noise and get real results with a step-by-step approach to learning data visualization with Python

BookFeb 2020480 pages

The Data Visualization Workshop

The Data Visualization Workshop will help you get started with data visualization, giving you the confidence to choose the best visualization technique to suit your needs. Fun activities and exercises featured throughout the book will keep you engaged as you build interactive visualizations with real data.

BookJul 2020536 pages

Personalised recommendations for you

Based on your interests and search pattern

Et al.

Ever wonder why speech recognition systems don't understand the Scottish accent, or what would happen if an astronaut only ate mac 'n' cheese, or other spurious reflections you'd have at a bar? We did, then collated those deliberations into absurd research articles with fake figures and methodologies inspired by even more fictionally absurd studies.

BookAug 2023230 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages4

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages1

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Mastering Tableau 2023

This book is a comprehensive resource to mastering your Tableau skills and becoming a BI expert. As you progress, you will learn how to build advanced dashboards and improve your storytelling to derive key business insight, as well as make you well-versed with advanced functionalities of Tableau in the business intelligence domain.

BookAug 2023684 pages

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages5

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages2

Data Engineering with AWS

Embark on a journey to master data engineering pipelines on AWS! Our book offers a hands-on experience of AWS services for ingesting, transforming, and consuming data. Whether you're an absolute beginner or someone with basic data engineering experience, this guide is an indispensable resource.

BookOct 2023636 pages5

Modern Data Architecture on AWS

Every organization wants an agile, performant, and cost-effective data platform that meets all their current and future business needs. Purpose-built AWS analytics services and their features play a big part in building such a modern data platform. This book brings to you all the design and architectural patterns that’ll help you achieve this goal.

BookAug 2023420 pages5

Practical Guide to Applied Conformal Prediction in Python

Discover the power of Conformal Prediction with the "Practical Guide to Applied Conformal Prediction in Python." Master the latest techniques to quantify uncertainty in machine learning and computer vision models, and seamlessly apply them to your industry applications.

BookDec 2023240 pages

TinyML Cookbook

With over 70 project-based recipes, the TinyML Cookbook is a practical guide that will help you to get the most out of your microcontrollers. It provides a comprehensive understanding of the theoretical foundations while giving you hands-on experience training ML models for deployment on Arduino Nano 33 BLE Sense, Raspberry Pi Pico, and SparkFun RedBoard Artemis Nano microcontrollers.

BookNov 2023664 pages