Search icon
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
R Data Mining

You're reading from  R Data Mining

Product type Book
Published in Nov 2017
Publisher Packt
ISBN-13 9781787124462
Pages 442 pages
Edition 1st Edition
Languages
Concepts

Table of Contents (22) Chapters

Title Page
Credits
About the Author
About the Reviewers
www.PacktPub.com
Customer Feedback
Preface
Why to Choose R for Your Data Mining and Where to Start A First Primer on Data Mining Analysing Your Bank Account Data The Data Mining Process - CRISP-DM Methodology Keeping the House Clean – The Data Mining Architecture How to Address a Data Mining Problem – Data Cleaning and Validation Looking into Your Data Eyes – Exploratory Data Analysis Our First Guess – a Linear Regression A Gentle Introduction to Model Performance Evaluation Don't Give up – Power up Your Regression Including Multiple Variables A Different Outlook to Problems with Classification Models The Final Clash – Random Forests and Ensemble Learning Looking for the Culprit – Text Data Mining with R Sharing Your Stories with Your Stakeholders through R Markdown Epilogue
Dealing with Dates, Relative Paths and Functions

Deployment


We are now reaching the final phase, and we are going to implement our models into the production system. But, what if one of the previous phases doesn't go well? This is when we understand that the CRISP-DM model is an iterative one. If the previous evaluation phase terminates showing that an unsatisfactory level of performance was reached, it would be pointless to develop a deployment plan, since the deployed solution would not meet the business expectations, and this would later produce undesired costs required to fix the problem.

In these circumstances, it would definitely be more appropriate to invest some more time to understand what went wrong, to define which phase of the CRISP-DM process needs to be resorted to. A model performance analysis could, for instance, reveals a poor level of accuracy of the model due to bad data quality of the training dataset employed to estimate model parameters. This would then involve a step back to the data preparation phase, or even to...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime}