You're reading from Hands-On Machine Learning with C++

Product type: Book

Published in: May 2020

Reading Level: Intermediate

Publisher: Packt

ISBN-13: 9781789955330

Edition: 1st Edition

Languages

C++

Tools

Caffe

Concepts

Machine Learning

Author (1)

Kirill Kolodiazhnyi

Recommender Systems

Recommender systems are algorithms, programs, and services whose main task is to use data to predict which objects (goods or services) are of interest to a user. There are two main types of recommender systems: content-based and collaborative filtering. Content-based recommender systems rely on data collected about specific products. They recommend objects that are similar to ones the user has previously acquired or shown interest in. Collaborative filtering recommender systems select objects that a user might like based on the reaction history of other, similar users of the system. They usually consider the user's own previous reactions, too.

In this chapter, we'll look at the implementation of recommender system algorithms based on both content and collaborative filtering. We are going to discuss different approaches for implementing...

Technical requirements

The required technologies and installations for the chapter are as follows:

Eigen library
Armadillo library
mlpack library
Modern C++ compiler with C++17 support
CMake build system version >= 3.8

The code files for this chapter can be found at the following GitHub repo: https://github.com/PacktPublishing/Hands-On-Machine-Learning-with-CPP/tree/master/Chapter08

An overview of recommender system algorithms

A recommender system's task is to inform a user about an object that could be the most interesting to them at a given time. Most often, such an object is a product or service, but it may be information—for example, in the form of a recommended news article.

Despite the many existing algorithms, we can divide recommender systems into several basic approaches. The most common are as follows:

Summary-based: Non-personal models based on the average product rating
Content-based: Models based on the intersection of product descriptions and user interests
Collaborative filtering: Models based on interests of similar user groups
Matrix factorization: Methods based on the preferences matrix decomposition

The basis of any recommender system is the preferences matrix. The preferences matrix has all users of the service laid on one...

Understanding collaborative filtering method details

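In this section, let's formalize the recommender system problem. We have a set of users, $u \in U$, a set of items, $i \in I$ (movies, tracks, products, and so on), and a set of estimates, $(r_{ui}, u, i, \ldots) \in D$. Each estimate is given by user $u$, object $i$, its result $r_{ui}$, and, possibly, some other characteristics.

We are required to predict preference as follows:

$$\hat{r}_{ui} = \mathrm{Predict}(u, i, \ldots) \approx r_{ui}$$

We are required to predict personal recommendations as follows:

$$u \mapsto (i_1, \ldots, i_K) = \mathrm{Recommend}_K(u, \ldots)$$

We are required to predict similar objects as follows:

$$i \mapsto (i_1, \ldots, i_M) = \mathrm{Similar}_M(i)$$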
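
For the similar-objects task, one common choice of similarity measure is the cosine of the angle between two items' rating vectors. The text here does not fix a particular measure, so treat the following as a sketch under that assumption. The function name `CosineSimilarity` and the convention that `0.0` means "no rating" are also assumptions made for illustration.

```cpp
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <vector>

// Cosine similarity between two items, each represented by the vector of
// ratings it received from the same ordered list of users.
// Assumption: 0.0 marks "no rating", so it contributes nothing to the sums.
double CosineSimilarity(const std::vector<double>& a,
                        const std::vector<double>& b) {
  // Only positions present in both vectors are compared.
  const std::size_t n = std::min(a.size(), b.size());
  double dot = 0.0;
  double norm_a = 0.0;
  double norm_b = 0.0;
  for (std::size_t k = 0; k < n; ++k) {
    dot += a[k] * b[k];
    norm_a += a[k] * a[k];
    norm_b += b[k] * b[k];
  }
  // An item without any ratings is treated as similar to nothing.
  if (norm_a == 0.0 || norm_b == 0.0) {
    return 0.0;
  }
  return dot / (std::sqrt(norm_a) * std::sqrt(norm_b));
}
```

Ranking all other items by this value against a given item and keeping the top few would yield a list of similar objects for that item.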

Remember—the main idea behind collaborative filtering is that similar users usually like similar objects. Let's start with the simplest method, as follows:

Select some conditional measures of similarity of users according to their history of ratings.
Unite users into groups (clusters) so that similar users will end up in the same cluster: $u \mapsto F(u)$, where $F(u)$ is the cluster assigned to user $u$.
Predict the item's user rating...
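
The steps above could be sketched as follows. Because the last step is cut off here, the sketch assumes that the predicted rating is the mean rating the item received from the other users in the same cluster. It also assumes the clustering has already been computed and is passed in as `cluster_of`. All names (`Rating`, `PredictFromCluster`, `cluster_of`) are hypothetical.

```cpp
#include <cstddef>
#include <optional>
#include <vector>

// One observed rating (hypothetical type, introduced for this sketch only).
struct Rating {
  std::size_t user_id;
  std::size_t item_id;
  double value;
};

// Predicts the rating of `item_id` for `user_id` as the mean rating that the
// item received from the other users in the same cluster.
// Assumption: cluster_of[u] holds the precomputed cluster index of user u.
// Returns std::nullopt when no other user in the cluster rated the item.
std::optional<double> PredictFromCluster(
    const std::vector<Rating>& ratings,
    const std::vector<std::size_t>& cluster_of,
    std::size_t user_id,
    std::size_t item_id) {
  const std::size_t target_cluster = cluster_of.at(user_id);
  double sum = 0.0;
  std::size_t count = 0;
  for (const auto& r : ratings) {
    // Skip the user's own rating so the result is a genuine prediction.
    if (r.user_id == user_id || r.item_id != item_id) {
      continue;
    }
    if (cluster_of.at(r.user_id) == target_cluster) {
      sum += r.value;
      ++count;
    }
  }
  if (count == 0) {
    return std::nullopt;
  }
  return sum / static_cast<double>(count);
}
```

Returning `std::optional` makes the cold-start case explicit: when nobody else in the cluster has rated the item, the caller has to fall back to some other estimate.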

Examples of item-based collaborative filtering with C++

Let's look at how we can implement a collaborative filtering recommender system. As a sample dataset for this example, we use the MovieLens dataset provided by GroupLens, a research lab in the Department of Computer Science and Engineering at the University of Minnesota: https://grouplens.org/datasets/movielens/. They provide a full dataset with 20 million movie ratings and a smaller one for education, with 100,000 ratings. We recommend starting with the smaller one because it allows us to see results earlier and detect implementation errors faster.

This dataset consists of several files, but we are only interested in two of them: ratings.csv and movies.csv. The rating file contains lines with the following format: the user ID, the movie ID, the rating, and the timestamp. In this dataset, users made ratings on...
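
As an illustration of the rating file format described above, here is a minimal sketch that parses one comma-separated line into its four fields using only the standard library. The `RatingRecord` struct and the `ParseRatingLine` function are hypothetical names, not the chapter's own loading code, and the sketch assumes plain numeric fields without quoting.

```cpp
#include <cstdint>
#include <optional>
#include <sstream>
#include <string>

// One row of the rating file (hypothetical type for this sketch).
struct RatingRecord {
  std::uint64_t user_id;
  std::uint64_t movie_id;
  double rating;
  std::int64_t timestamp;
};

// Parses one "user ID,movie ID,rating,timestamp" line.
// Returns std::nullopt for a non-numeric row (such as a header) or any
// line that does not contain four comma-separated numeric fields.
std::optional<RatingRecord> ParseRatingLine(const std::string& line) {
  std::istringstream in(line);
  RatingRecord rec{};
  char c1 = 0;
  char c2 = 0;
  char c3 = 0;
  if (in >> rec.user_id >> c1 >> rec.movie_id >> c2 >> rec.rating >> c3 >>
          rec.timestamp &&
      c1 == ',' && c2 == ',' && c3 == ',') {
    return rec;
  }
  return std::nullopt;
}
```

Calling this function on each line read with `std::getline` and skipping the empty results would produce the list of ratings that the later steps operate on.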

Summary

In this chapter, we discussed what recommender systems are and which types exist today. We studied two main approaches to building recommender systems: content-based recommendations and collaborative filtering. We identified two types of collaborative filtering: user-based and item-based. We looked at the implementation of these approaches and their pros and cons. We found out that an important issue in implementing recommender systems is the amount of data and the resulting high computational complexity of the algorithms. We considered approaches to overcoming this complexity, including partial data updates and approximate iterative algorithms such as ALS. We found out how matrix factorization can help to handle incomplete data, improve the generalizability of the model, and speed up the calculations. Also, we implemented...

Collaborative Filtering for Implicit Feedback Datasets: http://yifanhu.net/PUB/cf.pdf
Collaborative Filtering using Alternating Least Squares: http://danielnee.com/2016/09/collaborative-filtering-using-alternating-least-squares/
ALS Implicit Collaborative Filtering: https://medium.com/radon-dev/als-implicit-collaborative-filtering-5ed653ba39fe
Collaborative Filtering: https://datasciencemadesimpler.wordpress.com/tag/alternating-least-squares/
The mlpack library official site: https://www.mlpack.org/
The Armadillo library official site: http://arma.sourceforge.net/
Variational Autoencoders for Collaborative Filtering, by Dawen Liang, Rahul G. Krishnan, Matthew D. Hoffman, and Tony Jebara: https://arxiv.org/abs/1802.05814
Deep Learning-Based Recommender System: A Survey and New Perspectives, by Shuai Zhang, Lina Yao, Aixin Sun, and Yi Tay: https://arxiv.org/abs...


Author (1)

Kirill Kolodiazhnyi

Kirill Kolodiazhnyi is a seasoned software engineer with expertise in custom software development. He has several years of experience building machine learning models and data products using C++. He holds a bachelor's degree in Computer Science from the Kharkiv National University of Radio-Electronics. He currently works in Kharkiv, Ukraine, where he lives with his wife and daughter.