Data Science 101: Methodology, Python, and Essential Math [Video]
 FREE Subscription Read for free
 $54.99 Video Buy
 $12.99 Video + Subscription Buy

What do you get with a Packt Subscription?
 Instant access to this title and 7,500+ eBooks & Videos
 Constantly updated with 100+ new titles each month
 Breadth and depth in over 1,000+ technologies

Free ChapterIntroduction to Data Science 101

Best Language for Data Science

Data Science Methodology

Data Science Through Chatbot

Libraries, APIs, Datasets

GitHub

Installation / Jupyter / Comments (Windows and MacOS/Jupyter Notebook)

Introduction to Data Science in Python  Python Fundamentals
 How to Use Markdown Cells (Adding Headers, Links, and Images)
 Comments  Inline and Block Comments
 Python Indentation
 Writing Single and Multiple Lines of Code
 Understanding Variables
 Main Data Types and Creating Them (Integer, Float, String, List, Dictionary)
 Lists  How to Use
 Dictionaries  How to Use
 Creating a Tuple
 Tuple  How to Use
 Creating a Set
 Set  How to Use
 Operators

Introduction to Data Science in Python  Decision and Looping Structures

Introduction to Data Science in Python  Python Functions
 Introducing Functions
 Functions  General Syntax
 +1 Function
 Fav Band Function
 Celsius to Fahrenheit Function
 Optional Return Statement (and Comparing It to Print Statement)
 Defining a Function Versus Calling a Function
 Practical/Real World Example: Function to Get Reddit Data
 Lambda Introduction (Anonymous Functions)
 Formal Function Versus Lambda for Splitting Strings

Introduction to Data Science  Nested Data, Iteration, and List Comprehension
 Introducing you to Nested Data and Iteration
 Simple Nested Example
 Double Indexing
 Assigning Values
 List of Dicts and Dicts of Dicts Example
 Nested Iteration  Iterating Through List of Lists
 Defining List Comprehension and Syntax
 List Comprehension  Simple Examples
 List Comp as an Alternative to Loops
 Practical/Real World Example  Using Common Mathematical Notation
 Practical/Real World Example  Creating a Constrained ID
 Activity: Building Intuition (Loops, Nested Data, Iteration, and List Comp)

Introduction to Data Science in Python  Learn NumPy
 Introducing NumPy
 Creating Our First NumPy Array
 Shaping an Array (When You Know the Shape You Want)
 Creating a Sequence of Integers and Floats
 ElementWise Operations
 A Range with a Shape (Arrange Function with Reshape Function)
 NumPy Indexing
 NumPy Slicing
 Indexing and Slicing with Breast Cancer Wisconsin Dataset
 Delete Elements
 Append
 Insert Elements
 Reshape 1 Feature
 Flatten
 Transpose
 Concatenate
 Splitting
 Aggregate/Statistical Functions

Introduction to Data Science in Python  Pandas
 Introducing Pandas
 For SAS Programmers: Analogous Terms in Pandas (Python)
 Using Series as Input into DataFrame
 Comparing Series and DataFrame
 Importing TSLA Dataset
 IndexBased Selection (iloc)
 LabelBased Selection (loc)
 Conditional Selection
 Summary Functions
 Grouping (groupby)
 Sorting
 Checking Data Types and Converting
 Dealing with Missing Values
 Dropping Columns/Variables and Records/Rows
 Renaming Columns/Variables and Records/Rows
 Concat Function + Pop Quiz
 RealWorld Activity: Add New Columns and Predict Stock Movement

Introduction to Data Science in Python  Python Activity Solutions

Essential Math for Data Science  Linear Algebra Made Easy
 Linear Equation Definition
 Forms of a Linear Equation
 Systems of Linear Equations
 Line and Plane
 Aij Notation
 System of Equations as a Matrix
 System in Corresponding Forms
 Row Echelon Form (Gaussian Elimination)
 Reduced Row Echelon Form
 Row Operations Rules
 Row Operations Example (REF)
 Visualizing Ax=b
 General Formula  Matrix Vector Multiplication
 Tips for Row Operations

Essential Math for Data Science  Mathematical Structures
 Mathematical Structures
 Abelian Groups and Fields
 Vector Spaces 1
 Vector Spaces  Concrete Example
 Subspaces
 Linear Combinations and Span
 Is It in the Span?
 Linear Independence
 A Basis for a Vector Space
 Dim of C(A) and N(A)
 The Dimension of a Vector Space
 Linear Maps
 The Four Fundamental Subspaces
 Adding Geometry to Vector Spaces
 Orthogonal Projection  How to Derive Projection and Check for Orthogonality
 Least Squares
 Least Squares Through Pseudoinverse  with Python and SAS code

Essential Math for Data Science  Introduction to Probability

Essential Math for Data Science  Random Variables and Multiple Variables
 Random Variables
 Probability Mass Function and Discrete R.V.s
 Expectation and Variance for Discrete Random Variables
 Joint PMFs (Multiple Discrete Variables)
 Continuous Random Variables
 Continuous Random Variables and Probability Density Function
 Continuous R.V. Example
 Joint PDF Example  Banking
 Cumulative Distribution Function (CDF)
 Covariance, Correlation, and More on Variance
 Law of Large Numbers (LLN)
 Central Limit Theorem (CLT)

Essential Math for Data Science  Statistical Inference
The opening part of Data Science 101 examines some frequently asked questions.
Following that, we will explore data science methodology with a case study. You will see the typical data science steps and techniques utilized by data professionals. Next, you will build a simple chatbot so you can get a clear sense of what is involved.
The next part is an introduction to data science in Python. You will have an opportunity to master Python for data science as each section is followed by an assignment to practice your skills. By the end of the section, you will understand Python fundamentals, decision and looping structures, Python functions, how to work with nested data, and list comprehension. Finally, we will wrap up the two most popular libraries for data science—NumPy and Pandas.
The last part delves into essential math for data science. You will get the hang of linear algebra along with probability and statistics. Our goal for the linear algebra part is to introduce all necessary concepts and intuition for an indepth understanding of an oftenutilized technique for data fitting called least squares. We will spend a lot of time on probability, both classical and Bayesian, as reasoning about problems is a much more difficult aspect than simply running statistics.
By the end of this course, you will understand data science methodology and how to use essential math in your real projects.
All resources are available at https://github.com/PacktPublishing/DataScience101MethodologyPythonandEssentialMath
 Publication date:
 April 2022
 Publisher
 Packt
 Duration
 14 hours 49 minutes
 ISBN
 9781803242125