MongoDB 4 Quick Start Guide

Product type: Book
Published in: Sep 2018
Reading level: Intermediate
Publisher: Packt
ISBN-13: 9781789343533
Edition: 1st Edition
Languages: JavaScript
Tools: MongoDB
Concepts: Databases
Author: Doug Bierer

Building Complex Queries Using Aggregation

The focus of this chapter will be on a feature that's unique to MongoDB called the Aggregation Framework (https://www.mongodb.com/presentations/aggregation-framework-0?jmp=docs&_ga=2.166048830.1278448947.1531711178-137143613.1528093145), which is vital when forming complex queries. This feature allows database developers, or DBAs, to return subsets of data that are grouped, sorted, and filtered. We will start our discussion by forming a simple aggregation using single-purpose methods (https://docs.mongodb.com/manual/aggregation/#single-purpose-aggregation-operations). After that, we will get into the more complex topics of forming an aggregation pipeline (https://docs.mongodb.com/manual/aggregation/#aggregation-pipeline) and making use of the map-reduce (https://docs.mongodb.com/manual/aggregation/#map-reduce) function. We will...

An overview of aggregation

Before diving into the specifics, it's important to lay some groundwork. The first question that comes to mind is: what is aggregation? The logical follow-up is: why use it?

What is aggregation?

The main purpose of aggregation operations is to refine query results by grouping together field values from multiple documents, and then performing one or more transformations. Aggregation in MongoDB can be as simple as presenting the results of a query as a set of one or more fields, or as complex as performing a multistage query, breaking the output into buckets, and performing operations on each result set. A more advanced usage would be to manipulate complex fields within...

Using single-purpose aggregation

Single-purpose aggregation operations are available for working on a collection or a cursor. The following table summarizes the operations that work on a collection:

`db.collection.count()`	Wraps the `count` command to return the number of documents in a collection.
`db.collection.distinct()`	Wraps the `distinct` command (https://docs.mongodb.com/manual/reference/command/distinct/#distinct). Returns the distinct values of a field across a collection.
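As a rough illustration of what these two methods compute, the sketch below shows the shell calls in comments and mimics them in plain JavaScript on an in-memory array. The collection name `purchases`, the field names, and the sample values are all assumptions made for this example, not data from the book.

```javascript
// Hypothetical documents standing in for a collection named "purchases".
const purchases = [
  { customer: { name: "Ann", country: "AU" }, amount: 30 },
  { customer: { name: "Bob", country: "US" }, amount: 20 },
  { customer: { name: "Ann", country: "AU" }, amount: 15 },
];

// mongo shell: db.purchases.count()
// Plain JavaScript stand-in: the number of documents.
const documentCount = purchases.length;

// mongo shell: db.purchases.distinct("customer.country")
// Plain JavaScript stand-in: the unique values of one (embedded) field.
const countries = [...new Set(purchases.map((p) => p.customer.country))];

console.log(documentCount, countries);
```

Note that `distinct()` accepts dot notation, so a field inside an embedded object can be addressed directly.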

The following table summarizes single-purpose aggregation operations that can be performed on a cursor (such as the cursor returned by db.collection.find()):

`cursor.count()`	Equivalent to `db.collection.count()` (see prior table)
`cursor.limit()`	Limits the number of documents in the final result
`cursor.sort()`	Returns the results in...
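Cursor methods can be chained. The sketch below shows a chained shell call in a comment and reproduces its effect in plain JavaScript on an in-memory array; the collection name, field names, and values are hypothetical.

```javascript
// Hypothetical documents standing in for a collection named "purchases".
const purchases = [
  { customer: { name: "Ann", country: "AU" }, amount: 30 },
  { customer: { name: "Bob", country: "US" }, amount: 20 },
  { customer: { name: "Cy", country: "AU" }, amount: 45 },
  { customer: { name: "Ann", country: "AU" }, amount: 15 },
];

// mongo shell:
//   db.purchases.find({ "customer.country": "AU" }).sort({ amount: -1 }).limit(2)
// Plain JavaScript stand-in: filter, sort descending by amount, keep two.
const topTwo = purchases
  .filter((p) => p.customer.country === "AU") // filter() returns a new array
  .sort((a, b) => b.amount - a.amount)
  .slice(0, 2);

// mongo shell: db.purchases.find({ "customer.country": "AU" }).count()
// By default, cursor.count() ignores the effect of limit().
const matched = purchases.filter((p) => p.customer.country === "AU").length;

console.log(topTwo.map((p) => p.amount), matched);
```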

Using the aggregation pipeline

The MongoDB aggregation pipeline framework consists of the aggregate() collection method, and a sequence of operations referred to as stages (https://docs.mongodb.com/manual/reference/operator/aggregation-pipeline/#aggregation-pipeline-stages). This sequence is referred to as a pipeline.

For illustration, let's assume that there's a collection called purchases, where each purchase has an amount of information as well as embedded customer and product objects:

We wish to generate a report on the total sales for each customer from Australia. A simple db.collection.find() command will not suffice as it is incapable of grouping the customers. The problem is further compounded by the fact that country information is embedded in the customer object within each purchase. In order to generate this report, we will first need to address stages.
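A minimal sketch of such a report follows, assuming each purchase has an `amount` field and an embedded `customer` object with `name` and `country` fields, and that Australia is stored as `"AU"`. All of these names and values are assumptions for illustration. The pipeline itself is ordinary stage syntax; the loop underneath is only a plain JavaScript stand-in showing what the stages compute.

```javascript
// Hypothetical documents standing in for the purchases collection.
const purchases = [
  { customer: { name: "Ann", country: "AU" }, amount: 30 },
  { customer: { name: "Bob", country: "US" }, amount: 20 },
  { customer: { name: "Cy", country: "AU" }, amount: 50 },
  { customer: { name: "Ann", country: "AU" }, amount: 15 },
];

// Stages run in order: filter, then group and sum, then sort.
const pipeline = [
  { $match: { "customer.country": "AU" } },
  { $group: { _id: "$customer.name", total: { $sum: "$amount" } } },
  { $sort: { total: -1 } },
];
// mongo shell: db.purchases.aggregate(pipeline)

// Plain JavaScript stand-in for the three stages above.
const totals = {};
for (const p of purchases) {
  if (p.customer.country !== "AU") continue; // $match
  totals[p.customer.name] = (totals[p.customer.name] || 0) + p.amount; // $group + $sum
}
const report = Object.entries(totals)
  .map(([_id, total]) => ({ _id, total }))
  .sort((a, b) => b.total - a.total); // $sort, descending

console.log(report);
```

Placing `$match` first keeps the later stages from processing documents that would be discarded anyway.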

Be...

Using map-reduce

The db.collection.mapReduce() method delivers results similar to those of the aggregation pipeline. The main difference is that rather than performing operations in stages, map-reduce uses JavaScript functions to produce results. This gives you access to the full programming power that's available in JavaScript. Because it operates outside of the aggregation framework, however, performance is generally worse. If your query involves a high degree of complexity, this feature might be worth considering. Otherwise, the MongoDB documentation recommends using the aggregation pipeline framework.

To demonstrate map-reduce functionality, we will use the same purchases collection that we described previously. Here is the general structure of a mapReduce() command:
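As a minimal sketch of that structure (not the book's own listing), the call takes a map function, a reduce function, and an options object. The field names, the `"AU"` country value, and the sample documents are assumptions. Because `emit()` is supplied by the server, the example defines a small stand-in for it so that the same functions can be run on an in-memory array.

```javascript
// Hypothetical documents standing in for the purchases collection.
const purchases = [
  { customer: { name: "Ann", country: "AU" }, amount: 30 },
  { customer: { name: "Bob", country: "US" }, amount: 20 },
  { customer: { name: "Cy", country: "AU" }, amount: 50 },
  { customer: { name: "Ann", country: "AU" }, amount: 15 },
];

// Stand-in for the emit() function that MongoDB provides to map functions.
const emitted = new Map();
function emit(key, value) {
  if (!emitted.has(key)) emitted.set(key, []);
  emitted.get(key).push(value);
}

// Map: called once per document, with the document bound to `this`.
function mapFn() {
  emit(this.customer.name, this.amount);
}

// Reduce: combines all values emitted for one key.
function reduceFn(key, values) {
  return values.reduce((sum, v) => sum + v, 0);
}

// Options: filter the input and return the results inline.
const options = { query: { "customer.country": "AU" }, out: { inline: 1 } };
// mongo shell: db.purchases.mapReduce(mapFn, reduceFn, options)

// In-memory stand-in for the server's work. Unlike MongoDB, this calls
// reduceFn even for keys with a single value; for a sum the result is the same.
purchases
  .filter((p) => p.customer.country === options.query["customer.country"])
  .forEach((p) => mapFn.call(p));
const results = [...emitted].map(([key, values]) => ({
  _id: key,
  value: reduceFn(key, values),
}));

console.log(results);
```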

When we run this method, we get operational metadata. Unlike the aggregate() method, the output...

Using the MongoDB Compass aggregation pipeline builder

The MongoDB Compass tool, introduced in Chapter 1, Introducing MongoDB, has an extremely useful feature which assists you in developing complex aggregation pipeline queries. To use Compass to build an aggregation pipeline query, you first need to start Compass and connect to MongoDB. You will then need to select the database and collection upon which you wish to perform an aggregation.

In the following example, we select the sweetscomplete database and the purchases collection. From the horizontal menu, we then select Aggregations. Here is how the screen appears so far:

We then turn our attention to the dialog box in the bottom left. Clicking on Select, we add our first stage, $match. You can then begin typing the desired expression. The following table summarizes possible initial actions:

If You Type ....

Summary

In this chapter, you learned how to conduct complex queries using the aggregation pipeline framework. You learned about stages, expression operators, and how to accumulate information such as sum, average, and so on. One of the most important aspects of the aggregation pipeline framework that you learned about in this chapter is the ability to access embedded objects or arrays.

You also learned about single-purpose aggregation (for example, sort and limit), as well as how to use map-reduce. You learned that, although map-reduce gives you flexibility in that JavaScript functions can be used, the aggregation framework is preferred as it uses native MongoDB methods and offers better performance.

In the next chapter, you will learn how to maintain MongoDB performance.


Author (1)

Doug Bierer

Doug Bierer has been hooked on computers since his first program, written on a DEC PDP-8, in 1971. In his wide-ranging career, he has been a professional contract programmer since 1978, having written applications in BASIC, PL/I, assembler, FORTH, C, C++, dBase/FoxBase/Clipper, Pascal, Perl, Java, and PHP. He deployed his first website in 1993 while living in San Francisco. He speaks four languages, has traveled extensively, and now resides in Thailand. He also spent some years doing system administration and TCP/IP networking. Some of his technical works include PHP 7 Programming Cookbook and Learning MongoDB 4.x (Packt), as well as Learning PHP and MySQL, Learning PHP Security and Learning Doctrine (O'Reilly Media).