Packt+ | Advance your knowledge in tech

You're reading from Numerical Computing with Python Harness the power of Python to analyze and find hidden patterns in the data

Product type Course

Published in Dec 2018

Last Updated in Feb 2025

Publisher Packt

ISBN-13 9781789953633

Length 682 pages

Edition 1st Edition

Languages

Python

Tools

Scikit-learn

Concepts

Data Mining

Authors (5):

Pratap Dangeti

Allen Yu

Claire Chung

Aldrin Yim

Theodore Petrou

+1 more

View More author details

Table of Contents (21) Chapters

Title Page

Contributors

About Packt

Preface

1. Journey from Statistics to Machine Learning FREE CHAPTER

2. Tree-Based Machine Learning Models

3. K-Nearest Neighbors and Naive Bayes

4. Unsupervised Learning

5. Reinforcement Learning

6. Hello Plotting World!

7. Visualizing Online Data

8. Visualizing Multivariate Data

9. Adding Interactivity and Animating Plots

10. Selecting Subsets of Data

11. Boolean Indexing

12. Index Alignment

13. Grouping for Aggregation, Filtration, and Transformation

14. Restructuring Data into a Tidy Form

15. Combining Pandas Objects

1. Other Books You May Enjoy

Leave a review - let other readers know what you think

Index

Selecting with unique and sorted indexes

Index selection performance drastically improves when the index is unique or sorted. The prior recipe used an unsorted index that contained duplicates, which makes for relatively slow selections.

Getting ready

In this recipe, we use the college dataset to form unique or sorted indexes to increase the performance of index selection. We will continue to compare the performance to boolean indexing as well.

How to do it...

Read in the college dataset, create a separate DataFrame with STABBR as the index, and check whether the index is sorted:

>>> college = pd.read_csv('data/college.csv')
>>> college2 = college.set_index('STABBR')
>>> college2.index.is_monotonic
False

Sort the index from college2 and store it as another object:

>>> college3 = college2.sort_index()
>>> college3.index.is_monotonic
True

Time the selection of the state of Texas (TX) from all three DataFrames:

>>> %timeit college[college['STABBR...

The rest of the chapter is locked

Tech Concepts

Programming languages

Tech Tools

Unlimited access to the largest independent learning library in tech of over 8,000 expert-authored tech books and videos.

Innovative learning tools, including AI book assistants, code context explainers, and text-to-speech.

50+ new titles added per month and exclusive early access to books as they are being written.

You're reading from Numerical Computing with Python Harness the power of Python to analyze and find hidden patterns in the data

Table of Contents (21) Chapters

Selecting with unique and sorted indexes

Getting ready

How to do it...

Authors (5)

Personalised recommendations for you

Create a Free Account To Continue Reading

Sign in to activate your 7-day free access