Unsupervised Learning with R

More Information
Learn
  • Load, manipulate, and explore your data in R using techniques for exploratory data analysis such as summarization, manipulation, correlation, and data visualization
  • Transform your data by using approaches such as scaling, re-centering, scale [0-1], median/MAD, natural log, and imputation data
  • Build and interpret clustering models using K-Means algorithms in R
  • Build and interpret clustering models by Hierarchical Clustering Algorithm’s in R
  • Understand and apply dimensionality reduction techniques
  • Create and use learning association rules models, such as recommendation algorithms
  • Use and learn about the techniques of feature selection
  • Install and use end-user tools as an alternative to programming directly in the R console
About

The R Project for Statistical Computing provides an excellent platform to tackle data processing, data manipulation, modeling, and presentation. The capabilities of this language, its freedom of use, and a very active community of users makes R one of the best tools to learn and implement unsupervised learning.

If you are new to R or want to learn about unsupervised learning, this book is for you. Packed with critical information, this book will guide you through a conceptual explanation and practical examples programmed directly into the R console.

Starting from the beginning, this book introduces you to unsupervised learning and provides a high-level introduction to the topic. We quickly move on to discuss the application of key concepts and techniques for exploratory data analysis. The book then teaches you to identify groups with the help of clustering methods or building association rules. Finally, it provides alternatives for the treatment of high-dimensional datasets, as well as using dimensionality reduction techniques and feature selection techniques.

By the end of this book, you will be able to implement unsupervised learning and various approaches associated with it in real-world projects.

Features
  • Unlock and discover how to tackle clusters of raw data through practical examples in R
  • Explore your data and create your own models from scratch
  • Analyze the main aspects of unsupervised learning with this comprehensive, practical step-by-step guide
Page Count 192
Course Length 5 hours 45 minutes
ISBN 9781785887093
Date Of Publication 3 Dec 2015

Authors

Erik Rodríguez Pacheco

Erik Rodríguez Pacheco works as a manager in the business intelligence unit at Banco Improsa in San José, Costa Rica, where he holds 11 years of experience in the financial industry. He is currently a professor of the business intelligence specialization program at the Instituto Tecnológico de Costa Rica's continuing education programs. Erik is an enthusiast of new technologies, particularly those related to business intelligence, data mining, and data science. He holds a bachelor's degree in business administration from Universidad de Costa Rica, a specialization in business intelligence from the Instituto Tecnológico de Costa Rica, a specialization in data mining from Promidat (Programa Iberoamericano de Formación en Minería de Datos), and a specialization in business intelligence and data mining from Universidad del Bosque, Colombia. He is currently enrolled in an online specialization program in data science from Johns Hopkins University.

_x000D_

He has served as the technical reviewer of R Data Visualization Cookbook and Data Manipulation with R - Second Edition, both from Packt Publishing.

_x000D_

He can be reached at https://www.linkedin.com/in/erikrodriguezp.