Search icon
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
Modern Data Architecture on AWS

You're reading from  Modern Data Architecture on AWS

Product type Book
Published in Aug 2023
Publisher Packt
ISBN-13 9781801813396
Pages 420 pages
Edition 1st Edition
Languages
Author (1):
Behram Irani Behram Irani
Profile icon Behram Irani

Table of Contents (24) Chapters

Preface 1. Part 1: Foundational Data Lake
2. Prologue: The Data and Analytics Journey So Far 3. Chapter 1: Modern Data Architecture on AWS 4. Chapter 2: Scalable Data Lakes 5. Part 2: Purpose-Built Services And Unified Data Access
6. Chapter 3: Batch Data Ingestion 7. Chapter 4: Streaming Data Ingestion 8. Chapter 5: Data Processing 9. Chapter 6: Interactive Analytics 10. Chapter 7: Data Warehousing 11. Chapter 8: Data Sharing 12. Chapter 9: Data Federation 13. Chapter 10: Predictive Analytics 14. Chapter 11: Generative AI 15. Chapter 12: Operational Analytics 16. Chapter 13: Business Intelligence 17. Part 3: Govern, Scale, Optimize And Operationalize
18. Chapter 14: Data Governance 19. Chapter 15: Data Mesh 20. Chapter 16: Performant and Cost-Effective Data Platform 21. Chapter 17: Automate, Operationalize, and Monetize 22. Index 23. Other Books You May Enjoy

Operational Analytics

Every business performs certain operations to generate revenue. All these operations eventually generate lots of data, across multiple systems. Many organizations perform operations via the digital space by providing a variety of software applications. All these applications perform a ton of operational logs on the systems they are hosted on. There is a wealth of information in these log files – system errors, performance statistics, security aspects, network traffic patterns, customer information, and so forth. Across all these applications on multiple systems, the amount of daily log data that’s generated can be overwhelming to store, manage, and analyze to get insights from it. Finding relevant pieces of information across all these logs is like finding a needle in a haystack.

To solve this problem of analyzing log data, there needs to be a suitable technology and supporting toolset that can help index all this data and make it easy to search...

Amazon OpenSearch Service

OpenSearch (https://opensearch.org/) is an open source search engine forked from Elasticsearch 7.10.2. It also contains a visualization dashboard component that is forked from Kibana 7.10.2. OpenSearch has a distributed architecture where documents are collected in an index; these indices are divided into shards. Multiple nodes are present in an OpenSearch domain, where shards are replicated across nodes for high availability. In that sense, a lot of configuration, setup, and operational overhead is needed to set up a self-managed OpenSearch platform.

To alleviate the infrastructure setup pain, AWS provides a managed service that makes it easy to deploy, operate, and scale OpenSearch clusters. Amazon OpenSearch Service can either be created in provisioned mode, where users get to select the type and number of nodes to leverage in a cluster, or it can be in serverless mode, where there is no sizing to be done. Provisioned mode is suitable for steady and...

Amazon OpenSearch Service use cases

Let’s dive straight into how Amazon OpenSearch Service can help solve many of the operational analytics use cases that organizations may have.

Application and security monitoring

Use case for application and security monitoring using Amazon OpenSearch Service

GreatFin, being a large financial conglomerate, has many applications, services, systems, and infrastructure setups across all LOBs. The security team wants to leverage a cost-effective solution to provide them with search capabilities in a centralized logging account. This will ensure that security can proactively monitor all threats and look for certain patterns in the log data. At the same time, each LOB also wants to monitor the logs for application issues and latency. The teams are looking for options to ingest data into this operational analytics platform and use dashboards with monitoring and alerting capabilities to look out for specific patterns in the data.

What...

Summary

In this chapter, we looked at what operational analytics is and how organizations want a mechanism to gather insights from all their operational activities. All the system and application log data forms a major component of operational analytics.

We identified OpenSearch as the correct purpose-built tool for storing operational data. OpenSearch has a dashboard component that makes it easy to search for relevant information flexibly and cost-effectively. Along with specific plugins, OpenSearch Dashboards makes it easy to gain insights into log data.

We then looked into how Amazon OpenSearch Service makes the task of managing the OpenSearch cluster easy. Customers can focus on the operational insights part without having to manage the underlying infrastructure alongside the software configurations.

After, we walked through typical use cases that OS helps solve. We looked at a few options to ingest data into the OpenSearch domain. Once the data had been ingested, we looked...

lock icon The rest of the chapter is locked
You have been reading a chapter from
Modern Data Architecture on AWS
Published in: Aug 2023 Publisher: Packt ISBN-13: 9781801813396
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime}