Packt+ | Advance your knowledge in tech

You're reading from IBM SPSS Modeler Essentials

Product typeBook

Published inDec 2017

PublisherPackt

ISBN-139781788291118

Edition1st Edition

Tools

IBM SPSS

Concepts

Predictive Analytics

Authors (2):

Jesus Salcedo

Keith McCormick

View More author details

Chapter 2. The Basics of Using IBM SPSS Modeler

The previous chapter introduced the notion of data mining and the CRISP-DM process model. You learned what data mining is, why you would want to use it, and some of the types of questions you could answer with data mining. The rest of this book is going to focus on how you actually do some of the aspects of data mining—reading data, exploring variables, deriving new fields, developing models, and so on. However, before we can get started with these different data mining projects, we first need to become familiar with the software that we will use to work on the data. In this chapter, you will learn the following:

Get an overview of the Modeler interface
Learn how to build streams
Get an introduction to various help options

Introducing the Modeler graphic user interface

IBM SPSS Modeler can be thought of as a data mining workbench that combines multiple tools and technologies to support the data mining process. Modeler allows users to mine data visually on the stream canvas.

The following figure shows the different areas of the Modeler interface:

As you can see, the Modeler interface is comprised of several components, and these are described in the next few pages.

Stream canvas

The stream canvas is the main work area in Modeler. It is located in the center of the Modeler user interface. The stream canvas can be thought of as a surface on which to place icons or nodes. These nodes represent operations to be carried out on the data. Once nodes have been placed on the stream canvas, they can be linked together to form a stream.

Palettes

Nodes (operations on the data) are contained in palettes. The palettes are located at the bottom of the Modeler user interface. Each palette contains a group of related nodes that are...

Building streams

As was mentioned previously, Modeler allows users to mine data visually on the stream canvas. This means that you will not be writing code for your data mining projects; instead you will be placing nodes on the stream canvas. Remember that nodes represent operations to be carried out on the data. So once nodes have been placed on the stream canvas, they need to be linked together to form a stream. A stream represents the flow of data going through a number of operations (nodes). The following diagram is an example of nodes on the canvas, as well as a stream:

Given that you will spend a lot of time building streams, in this section you will learn the most efficient ways of manipulating nodes to create a stream.

Mouse buttons

When building streams, mouse buttons are used extensively so that nodes can be brought onto the canvas, connected, edited, and so on. When building streams within Modeler, mouse buttons are used in the following ways:

The left button is used for selecting...

Modeler stream rules

You may have noticed that in the previous example, we connected the Var. File node to the Table node and this worked fine. However, what if instead we tried to connect the Table node to the Var. File node? Let's try it:

Right-click the Table node.
Select Connect from the Context menu (notice that the Connect option does not exist).

Let's try something different:

Bring a Statistics File node onto the canvas.
Right-click on the Var. File node.
Select Connect from the Context menu.
Click the Statistics File node (notice that you get an error message when you try to connect these two nodes).

The reason we are experiencing these issues is that there are rules for creating Modeler streams.

Modeler streams are typically comprised of three types of nodes: Source, Process, and Terminal nodes. Connecting nodes in certain ways makes sense in the context of Modeler, and other connections are not allowed.

In terms of general rules, streams always start with a Source node (a node from the Sources...

Help options

When using Modeler, at some point we are going to need help. Modeler provides various help options.

Help menu

The most intuitive way to get help is to use the Help menu. As seen in the following figure, the Help menu provides several options:

Help Topics takes you to the Help System, where you can search for various topics
CRISP-DM Help provides an introduction to the CRISP-DM methodology
Application Examples offers a variety of real-life examples of using common data mining techniques for data preparation and modeling
Accessibility Help informs users about keyboard alternatives to using the mouse
What's This changes the cursor into a question mark and provides information about any Modeler item you select

Dialog help

Perhaps the most useful help option is to use context sensitive help, which is available in whatever dialog box you are currently working on. For example, let's say that you are using the Var. File node and you either did not know how to use this node or you were unfamiliar...

Summary

In this chapter, you learned about the different components of the Modeler graphic user interface. You also learned how to build streams. Finally, you were introduced to various help options.

In the next chapter, we will take a detailed look at how to bring data into Modeler. We will also discuss how to properly set up the metadata for your fields.

The rest of the chapter is locked

You have been reading a chapter from

IBM SPSS Modeler Essentials

Published in: Dec 2017Publisher: PacktISBN-13: 9781788291118

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

undefined

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $15.99/month. Cancel anytime

Authors (2)

Jesus Salcedo

Jesus Salcedo has a PhD in psychometrics from Fordham University. He is an independent statistical consultant and has been using SPSS products for over 20 years. He is a former SPSS Curriculum Team Lead and Senior Education Specialist who has written numerous SPSS training courses and trained thousands of users.
Read more about Jesus Salcedo

Keith McCormick

Keith McCormick is a career long practitioner of predictive analytics and data science. He has engaged in statistical modeling, data mining, and mentoring others in the area for more than 20 years. He has a particular expertise in helping organizations perform their first predictive analytics project or build their first predictive analytics practice, and has done so in a variety of industries including healthcare, banking, telecommunications, non-profit, direct mail, pharmaceuticals, and retail. Keith is also an established author and speaker with four books in print, or under contract. Although his consulting work is not restricted to any one tool, his writing and speaking has made him particularly well known in the IBM SPSS Statistics and IBM SPSS Modeler communities.
Read more about Keith McCormick

Other recommended products

Related to this chapter

Machine Learning for Data Mining

Most data mining opportunities involve machine learning and often come with greater financial rewards. This book will help you bring the power of machine learning techniques into your data mining work. By the end of the book, you will be able to create accurate predictive models for data mining.

BookApr 2019252 pages

Learning Alteryx

Alteryx, as a leading data blending and advanced data analytics platform, has taken self-service data analytics to the next level. This book will set you on a self-service data analytics journey that will help you create efficient workflows using Alteryx, without any coding involved. It will empower you and your organization to take well-informed decisions with the help of deeper business insights from the data. You will see how to use the unique features of Alteryx to perform common tasks such as data preparation and blending, and also delve into the more advanced concepts such as performing predictive analytics, before sharing the insights gained with the relevant decision makers. Whether you are a novice with Alteryx or an experienced data analyst keen to explore Alteryx’s self-service analytics features, this guide will be the perfect companion for you.

BookDec 2017228 pages

Data Analysis with IBM SPSS Statistics

SPSS Statistics is a software package used for logical batched and non-batched statistical analysis. Analytical tools such as SPSS can readily provide even a novice user with an overwhelming amount of information and a broad range of options for analyzing patterns in the data. This book will have a comprehensive coverage of IBM’s premier statistics and data analysis tool – IBM SPSS Statistics. It is designed for business professionals who wish to analyze their data. By the end of this book, you will have a firm understanding of the various statistical analysis techniques offered by SPSS Statistics, and be able to master its use for data analysis with ease.

BookSep 2017446 pages

Advanced Analytics with R and Tableau

R is the go-to tool for statistics and data mining while Tableau offers an interface to filter data, plug and play with rich visualizations to describe insights from your data. When combined these two tools makes it easier to harness interesting patterns and communicate stories. This book covers various analytical techniques like prediction, classification, clustering and best practices to visualize it using interactive dashboard with drop-downs, sliders, and other visual cues of Tableau. Get to know how R can be used in conjunction with Tableau and implement powerful machine learning techniques making big data analytics accessible and presentable through Tableau workbooks.

BookAug 2017178 pages

Hands-On Machine Learning with IBM Watson

A practical guide on Machine learning with IBM cloud to act as a solid yet concise reference for the readers. You will learn about the role of data representation and feature extraction in machine learning. This book will help you learn how to use the IBM Cloud and Watson Machine learning service to develop real-world machine learning solutions.

BookMar 2019288 pages

R Data Mining

This book will empower you to produce and present impressive analyses from data, by selecting and implementing the appropriate data mining techniques in R. Explore a data mining crime case, where you will be requested to help resolving a real fraud case affecting a commercial company, by the mean of both basic and advanced data mining techniques.

BookNov 2017442 pages

Big Data Visualization

Uncover new approaches to big data visualization to make your analysis more effective and efficient with Big Data Visualization. Featuring in-depth coverage of big data analysis concepts together with industry-proven techniques, you?ll learn how to approach the challenge of big data visualization with confidence, ease and precision.

BookFeb 2017304 pages

Practical Predictive Analytics

This book teaches six specific steps needed to implement predictive analytics using R. It also teaches how team collaboration is critical and how it increases the chances of implementing a successful model. The book uses cases from healthcare, marketing, and government to build practical skills. Big Data is also covered, in this book, which will extend your skill sets by learning Databricks and RSpark.

BookJun 2017576 pages

Personalised recommendations for you

Based on your interests and search pattern

Et al.

Ever wonder why speech recognition systems don't understand the Scottish accent, or what would happen if an astronaut only ate mac 'n' cheese, or other spurious reflections you'd have at a bar? We did, then collated those deliberations into absurd research articles with fake figures and methodologies inspired by even more fictionally absurd studies.

BookAug 2023230 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages4

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages1

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Mastering Tableau 2023

This book is a comprehensive resource to mastering your Tableau skills and becoming a BI expert. As you progress, you will learn how to build advanced dashboards and improve your storytelling to derive key business insight, as well as make you well-versed with advanced functionalities of Tableau in the business intelligence domain.

BookAug 2023684 pages

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages5

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages2

Data Engineering with AWS

Embark on a journey to master data engineering pipelines on AWS! Our book offers a hands-on experience of AWS services for ingesting, transforming, and consuming data. Whether you're an absolute beginner or someone with basic data engineering experience, this guide is an indispensable resource.

BookOct 2023636 pages5

Modern Data Architecture on AWS

Every organization wants an agile, performant, and cost-effective data platform that meets all their current and future business needs. Purpose-built AWS analytics services and their features play a big part in building such a modern data platform. This book brings to you all the design and architectural patterns that’ll help you achieve this goal.

BookAug 2023420 pages5

Practical Guide to Applied Conformal Prediction in Python

Discover the power of Conformal Prediction with the "Practical Guide to Applied Conformal Prediction in Python." Master the latest techniques to quantify uncertainty in machine learning and computer vision models, and seamlessly apply them to your industry applications.

BookDec 2023240 pages

TinyML Cookbook

With over 70 project-based recipes, the TinyML Cookbook is a practical guide that will help you to get the most out of your microcontrollers. It provides a comprehensive understanding of the theoretical foundations while giving you hands-on experience training ML models for deployment on Arduino Nano 33 BLE Sense, Raspberry Pi Pico, and SparkFun RedBoard Artemis Nano microcontrollers.

BookNov 2023664 pages