
You're reading from MongoDB Fundamentals

Product type: Book
Published in: Dec 2020
Publisher: Packt
ISBN-13: 9781839210648
Edition: 1st Edition

Authors (4):
Amit Phaltankar

Amit Phaltankar is a software developer and a blogger experienced in building lightweight and efficient software components. He specializes in writing web-based applications and handling large-scale data sets using traditional SQL, NoSQL, and big data technologies. He is experienced in many technology stacks and loves learning and adapting to new technology trends. Amit is passionate about improving his skill set and loves guiding and mentoring his peers and contributing to blogs. He is also an author of MongoDB Fundamentals.

Juned Ahsan

Juned Ahsan is a software professional with more than 14 years of experience. He has built software products and services for companies and clients such as Cisco, Nuamedia, IBM, Nokia, Telstra, Optus, Pizzahut, AT&T, Hughes, Altran, and others. Juned has vast experience in building software products and architecting platforms of different sizes from scratch. He loves to help and mentor others and is a top 1% contributor on StackOverflow. He is passionate about cognitive CX, cloud computing, artificial intelligence, and NoSQL databases.

Michael Harrison

Michael Harrison started his career at the Australian telecommunications leader Telstra. He worked across their networks, big data, and automation teams. He is now a lead software developer and the founding member of Southbank Software, a Melbourne-based startup that builds tools for the next generation of database technologies.

Liviu Nedov

Liviu Nedov is a senior consultant with more than 20 years of experience in database technologies. He has provided professional and consulting services to customers in Australia and Europe. Throughout his career, he has designed and implemented large enterprise projects for customers such as Wotif Group, Xstrata Copper/Glencore, the University of Newcastle, and Energy Queensland. He is currently working at Data Intensity, the largest multi-cloud service provider for applications, databases, and business intelligence. In recent years, he has been actively involved in MongoDB NoSQL database projects, database migrations, and cloud DBaaS (Database as a Service) projects.


9. Performance

Overview

This chapter introduces you to the concepts of query optimization and performance improvement in MongoDB. You will first explore the internal workings of query execution and identify the factors that can affect query performance, before moving on to database indexes and how indexes can reduce query execution time. You will also learn how to create, list, and delete indexes, and study the various types of indexes and their benefits. In the final sections, you will be introduced to various query optimization techniques that can help you use indexes effectively. By the end of this chapter, you will be able to analyze queries and use indexes and optimization techniques to improve query performance.

Introduction

In the previous chapters, we learned about the MongoDB query language and various query operators. We learned how to write queries to retrieve data, as well as the various commands used to add, delete, and update data. We made sure that the queries returned the desired output; however, we did not pay much attention to their execution time or efficiency. In this chapter, we will focus on how to analyze a query's performance and, if needed, optimize it further.

Real-world applications are made up of multiple components, such as a user interface, processing components, and databases. The responsiveness of an application depends on the efficiency of each of these components. The database component performs different operations, such as saving, reading, and updating data. The amount of data a database table or collection stores, or the amount of data being pushed into or retrieved from a database...

Query Analysis

In order to write efficient queries, it is important to analyze them, find any possible performance issues, and fix them. This technique is called performance optimization. There are many factors that can negatively affect the performance of a query, such as incorrect scaling, incorrectly structured collections, and inadequate resources such as RAM and CPU. However, the biggest and most common factor is the difference between the number of records scanned and the number of records returned during the query execution. The greater the difference is, the slower the query will be. Thankfully, in MongoDB, this factor is the easiest to address and is done using indexes.
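
One way to observe the gap between the number of documents scanned and the number returned is through the explain output's execution statistics. The following is a minimal sketch (the exact output fields can vary by MongoDB version); in the executionStats section, compare totalDocsExamined with nReturned, where a large gap indicates many documents scanned per document returned:

db.movies.explain("executionStats").find(
    {"year" : 2015}
)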

Creating and using indexes on a collection narrows down the number of records being scanned and improves the query performance noticeably. Before we delve further into indexes, though, we first need to cover the details of query execution.

Say you want to find a list of the movies released in the...

Introduction to Indexes

Databases can maintain and use indexes to make searches more efficient. In MongoDB, indexes are created on a field or a combination of fields. The database maintains a special registry of the indexed fields and some of their data. The registry is easily searchable, as it maintains a logical link between the values of an indexed field and the respective documents in the collection. During a search operation, the database first locates the value in the registry and then identifies the matching documents in the collection accordingly. The values in the registry are always kept sorted in ascending or descending order, which helps during range searches and when sorting results.

To better understand how the index registry helps during searches, imagine you are searching for a theater by its ID, as follows:

db.theaters.find(
    {"theaterId" : 1009}
)

When the query is executed on the sample_mflix database, it returns a...

Creating and Listing Indexes

Indexes can be created by executing a createIndex() command on a collection, as follows:

db.collection.createIndex(
    keys,
    options
)

The first argument to the command is a document of key-value pairs, where each pair consists of a field name and a sort order, and the optional second argument is a document of options that controls how the index is created.
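
For instance, here is a minimal sketch of how the keys and options documents are passed (the field and the custom index name are chosen purely for illustration); getIndexes() can then be used to list the indexes that exist on a collection:

db.movies.createIndex(
    { "title" : 1 },          // keys: ascending index on the title field
    { "name" : "idx_title" }  // options: a custom index name (assumed for illustration)
)

// List all indexes defined on the collection
db.movies.getIndexes()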

In a previous section, you wrote the following query to find all the movies released in 2015, sort them in descending order of the number of awards won, and print the title and number of wins:

db.movies.find(
    { 
        "year" : 2015
    },
    {
        "title" : 1, 
        "awards.wins" : 1
    }
).sort(
    {"awards.wins" : -1}
)

As the query uses a...

Query Analysis after Indexes

In the Query Analysis section, you analyzed the performance of a query that did not have suitable indexes to support its query condition. Because of this, the query scanned all 23539 documents in the collection to return 484 matching documents. Now that you have added an index on the year field, let's see how the query execution stats have changed.

The following query prints the execution statistics for the same query:

db.movies.explain("executionStats").find(
    { 
        "year" : 2015
    },
    {
        "title" : 1, 
        "awards.wins" : 1
    }
).sort(
    {"awards.wins" : -1}
)

The output for this is slightly different than the previous one, as shown in the following...

Hiding and Dropping Indexes

Dropping an index means removing the indexed field's values from the index registry. As a result, any searches on that field will be performed in a linear fashion, provided there are no other indexes present on the field.
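
Before dropping an index permanently, it can sometimes be useful to hide it from the query planner first and observe the impact. The following is a hedged sketch, assuming MongoDB 4.4 or later, where the hideIndex() and unhideIndex() shell helpers are available; the index on the title field is used purely for illustration:

// The planner stops using the index, but it is still maintained on writes
db.movies.hideIndex({ "title" : 1 })

// Make the index visible to the planner again
db.movies.unhideIndex({ "title" : 1 })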

It is important to note that MongoDB does not allow updating an existing index. Thus, to fix an incorrectly created index, we need to drop it and recreate it correctly.

An index is deleted using the dropIndex function. It takes a single parameter, which can either be the index name or the index specification document, as follows:

db.collection.dropIndex(indexNameOrSpecification)
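
For example, to drop an index by name (a sketch that assumes the index was created without a custom name, in which case MongoDB derives the default name title_1 from the key pattern):

db.movies.dropIndex("title_1")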

The index specification document is the definition that was used to create the index (as in the following snippet, for example):

db.movies.createIndex(
    {title: 1}
)

Consider the following snippet:

db.movies.dropIndex(
     {title: 1}
)

This command drops the index on the title field of the movies...

Types of Indexes

We have seen how indexes help with query performance and how we can create, drop, and list indexes in the collection. MongoDB supports different types of indexes, such as single key, multikey, and compound indexes. Each of these indexes has different advantages that you will need to know before deciding which type is suitable for your collection. Let's start with a brief overview of default indexes.

Default Indexes

As seen in the previous chapters, each document in a collection has a primary key (namely, the _id field), which is indexed by default. MongoDB uses this index to maintain the uniqueness of the _id field, and it is available on all collections.
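
As a quick check (an illustrative sketch; the exact fields in the output can vary by server version), listing the indexes on any collection shows this default index, which is named _id_:

db.movies.getIndexes()
// Returns, among any other indexes, an entry similar to:
// { "v" : 2, "key" : { "_id" : 1 }, "name" : "_id_" }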

Single-Key Indexes

An index created using a single field from a collection is called a single-key index. You used a single-key index earlier in this chapter. The syntax is as follows:

db.collection.createIndex({ field1: type}, {options})
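
For example, the index on the year field that you created earlier in this chapter is a single-key index (shown here as an illustrative sketch of its simplest form, with the options document omitted):

db.movies.createIndex({ "year" : 1 })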

Compound Indexes

Single-key indexes are preferable...
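
As a hedged sketch of the general idea, a compound index combines more than one field in a single index; for the earlier query that filters on year and sorts by awards.wins in descending order, such an index might look like this:

db.movies.createIndex(
    { "year" : 1, "awards.wins" : -1 }  // equality on year first, then the sort field
)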

Properties of Indexes

In this section, we will cover the different properties of indexes in MongoDB. An index property can influence how an index is used and can also enforce some behavior on the collection. Index properties are passed as options to the createIndex function. We will be looking at unique indexes, TTL (time to live) indexes, sparse indexes, and finally, partial indexes.

Unique Indexes

A unique index property restricts the duplication of the index key. This is useful if you want to maintain the uniqueness of a field in a collection. Unique fields help identify documents precisely and without ambiguity. For example, in a license collection, a unique field such as license_number can help identify each document individually. This property enforces behavior on the collection such that duplicate entries are rejected. Unique indexes can be created on a single field or on a combination of fields. The following is the syntax to create a unique index on a single...
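
A minimal sketch, using the license collection and license_number field mentioned above, looks like this; the unique property is passed in the options document:

db.license.createIndex(
    { "license_number" : 1 },
    { "unique" : true }       // reject documents that duplicate an existing license_number
)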

Other Query Optimization Techniques

So far, we have seen the internal workings of queries and how indexes help limit the number of documents to be scanned. We have also explored various types of indexes and their properties and learned how we can use the correct index and correct index properties in specific use cases. Creating the right index can improve query performance, but there are a few more techniques that are required to fine-tune the query performance. We will cover those techniques in this section.

Fetch Only What You Need

The performance of a query is also affected by the amount of data it returns. The database server and client communicate over a network, so if a query produces a large amount of data, it will take longer to transfer it. Moreover, to be transferred over the network, the data needs to be transformed and serialized by the server and deserialized by the receiving client. This means that the database client will have to wait longer to get the...
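
A practical way to reduce the amount of data returned is to use a projection so that only the required fields are sent back. The following is an illustrative sketch based on the earlier movies query; the projection keeps only title and awards.wins and explicitly excludes _id:

db.movies.find(
    { "year" : 2015 },
    { "title" : 1, "awards.wins" : 1, "_id" : 0 }
)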

Summary

In this chapter, you practiced improving query performance. You first explored the internal workings of query execution and the query execution stages. You then learned how to analyze a query's performance and identify any existing problems based on the execution statistics. Next, you reviewed the concept of indexes; how they solve performance issues for a query; various ways to create, list, and delete indexes; different types of indexes; and their properties. In the final sections of this chapter, you studied query optimization techniques and got a brief look at the overheads associated with indexes. In the next chapter, you will learn about the concept of replication and how it is implemented in MongoDB.

