Search icon
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
Big Data Analytics with Java

You're reading from  Big Data Analytics with Java

Product type Book
Published in Jul 2017
Publisher Packt
ISBN-13 9781787288980
Pages 418 pages
Edition 1st Edition
Languages
Concepts
Author (1):
RAJAT MEHTA RAJAT MEHTA
Profile icon RAJAT MEHTA

Table of Contents (21) Chapters

Big Data Analytics with Java
Credits
About the Author
About the Reviewers
www.PacktPub.com
Customer Feedback
Preface
Big Data Analytics with Java First Steps in Data Analysis Data Visualization Basics of Machine Learning Regression on Big Data Naive Bayes and Sentiment Analysis Decision Trees Ensembling on Big Data Recommendation Systems Clustering and Customer Segmentation on Big Data Massive Graphs on Big Data Real-Time Analytics on Big Data Deep Learning Using Big Data Index

Chapter 4. Basics of Machine Learning

Any form of any analytical activity depends heavily on the presence of some clues or data. In today's world data is bountifully available. Due to the broad availability of various devices (such as mobile devices), IoT devices, or social network, the amount of data generated day by day is exploding. This data is not all waste; it can be used to make lots of deductions. For example, we can use this data to figure out what particular ad the user might click on next or what item the user might like to purchase along with the item they are already purchasing currently. This data can help us figure out a knowledge base that can directly impact the core business in many useful ways, hence it is very important.

This chapter is action-packed and we will try to cover a lot of ground while learning the basics. In this chapter, we will cover:

  • Basic concepts of machine learning such as what machine learning is, how it is used, and different forms of machine learning...

What is machine learning?


Machine learning is a form of artificial intelligence where a computer program learns from the data it is fed or trained with. After learning from this data it internally builds a knowledge base of rules and based on this knowledge base it can later make predictions when it is fed new data. Machine learning is part AI, part data mining, and part statistics, but overall the criterion is to teach a machine to make new decisions based on past data it is trained with. So, for example, if we teach a machine some data regarding the inventory statistics of a store throughout the year then you might be able to tell things such as in which months the items sell more or which items sell more often. Also, it can tell the shop owner if they are selling one particular item more than other items; it can also show this to the customer so as to increase sales.

The concept of making new predictions is very important as we can now make predictions such as in which zone or area a marketing...

Summary


This chapter was action packed on machine learning and its various concepts. We covered a lot of theoretical ground in this chapter by learning what machine learning is, some important real-life use cases, types of machine learning, and the important concepts of machine learning such as how we extract and select features, training our models, selecting our models, and tuning them for performance by using techniques such as training/test set and cross validation. We also learnt how we can run our machine learning models specifically on big data and what Spark has to offer on the machine learning side in terms of an API.

In the next chapter, we will dive into actual machine learning algorithms and we will learn a simple yet powerful and popular linear regression algorithm. We will understand it by using an example case study. After studying linear regression we will study another algorithm logistic regression and we will also try to learn it by using a sample case study.

lock icon The rest of the chapter is locked
You have been reading a chapter from
Big Data Analytics with Java
Published in: Jul 2017 Publisher: Packt ISBN-13: 9781787288980
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime}