Julia for Data Science

Explore the world of data science from scratch with Julia by your side

Anshul Joshi


Book Details

ISBN 13: 9781785289699
Paperback: 346 pages

Book Description

Julia is a fast, high-performance language that is well suited to data science, with a mature package ecosystem, and it is now feature complete. It is a good tool for a data science practitioner. A famous Harvard Business Review article called data scientist the sexiest job of the 21st century (https://hbr.org/2012/10/data-scientist-the-sexiest-job-of-the-21st-century).

This book will help you get familiarised with Julia's rich ecosystem, which is continuously evolving, allowing you to stay on top of your game.

This book contains the essentials of data science and gives a high-level overview of advanced statistics and techniques. You will generate insights by performing inferential statistics and reveal hidden patterns and trends using data mining. The book offers practical coverage of statistics and machine learning. You will develop the knowledge to build statistical models and machine learning systems in Julia, with attractive visualizations.

You will then delve into the world of deep learning in Julia and get to know Mocha.jl, a framework with which you can create artificial neural networks and implement deep learning.

This book addresses the challenges of real-world data science problems, including data cleaning, data preparation, inferential statistics, statistical modeling, building high-performance machine learning systems and creating effective visualizations using Julia.

Table of Contents

Chapter 1: The Groundwork – Julia's Environment
Julia is different
Setting up the environment
Using REPL
Using Jupyter Notebook
Package management
Parallel computation using Julia
Julia's key feature – multiple dispatch
Facilitating language interoperability
Summary
References
Chapter 2: Data Munging
What is data munging?
What is a DataFrame?
Summary
References
Chapter 3: Data Exploration
Sampling
Inferring column types
Basic statistical summaries
Scalar statistics
Measures of variation
Scatter matrix and covariance
Computing deviations
Rankings
Counting functions
Histograms
Correlation analysis
Summary
References
Chapter 4: Deep Dive into Inferential Statistics
Installation
Understanding the sampling distribution
Understanding the normal distribution
Type hierarchy in Distributions.jl
Univariate distributions
Truncated distributions
Understanding multivariate distributions
Understanding matrixvariate distributions
Distribution fitting
Confidence interval
Understanding z-score
Understanding the significance of the P-value
Summary
References
Chapter 5: Making Sense of Data Using Visualization
Difference between using and importall
Pyplot for Julia
Unicode plots
Visualizing using Vega
Data visualization using Gadfly
Summary
References
Chapter 6: Supervised Machine Learning
What is machine learning?
Machine learning – the process
Understanding decision trees
Supervised learning using Naïve Bayes
Summary
References
Chapter 7: Unsupervised Machine Learning
Understanding clustering
K-means clustering
Summary
References
Chapter 8: Creating Ensemble Models
What is ensemble learning?
Random forests
Implementation in Julia
Why is ensemble learning superior?
Summary
References
Chapter 9: Time Series
What is forecasting?
What is TimeSeries?
Implementation in Julia
Summary
References
Chapter 10: Collaborative Filtering and Recommendation System
What is a recommendation system?
Association rule mining
Content-based filtering
Collaborative filtering
Building a movie recommender system
Summary
Chapter 11: Introduction to Deep Learning
Revisiting linear algebra
Probability and information theory
Differences between machine learning and deep learning
Implementation in Julia
Summary
References

What You Will Learn

  • Apply statistical models in Julia for data-driven decisions
  • Understand the process of data munging and data preparation using Julia
  • Explore techniques to visualize data using Julia and D3-based packages
  • Use Julia to create self-learning systems with cutting-edge machine learning algorithms
  • Create supervised and unsupervised machine learning systems using Julia, and explore ensemble models
  • Build a recommendation engine in Julia
  • Dive into Julia’s deep learning framework and build a system using Mocha.jl
