You're reading from Salesforce Data Architect Certification Guide

Product type: Book
Published in: Nov 2022
Publisher: Packt
ISBN-13: 9781801813556
Edition: 1st

Author: Aaron Allport

Aaron Allport is a Chief Technical Officer, and has worked with CRM systems and integrations for his entire professional career. Aaron specializes in Salesforce technical architecture and integration, helping his clients ensure they get the most from their technology investment. Aaron has spoken at Dreamforce, written about everything from DevOps to Data Architecture online, and can regularly be found at the Salesforce London Developer Meetup.

Understanding Large Data Volumes

While the Salesforce platform can cope with large amounts of data, specific considerations apply when data volumes become very large (referred to as Large Data Volumes (LDV)), because platform performance is affected. This chapter covers LDV considerations and mitigations, as well as scalable data model design and data archiving strategies.

In this chapter, we will cover the following topics:

  • Designing a scalable data model
  • LDV performance mitigation strategies
  • Data archiving strategies

Unlike traditional applications that use a dedicated database, Salesforce stores all data in a few large database tables. Therefore, traditional database performance-tuning techniques don't necessarily apply to the Salesforce platform. Instead, we, as data architects, must design our Salesforce implementations to handle large amounts of data. This is best achieved by understanding LDVs, their impact on Salesforce performance...

Designing a scalable data model

As Salesforce implementations grow in size and complexity, so does the volume of data. Salesforce, as a multi-tenant platform, handles scaling automatically, but as the volume of data grows, the processing time for certain operations increases too.

Two areas are typically affected by data architecture and configuration choices on the Salesforce platform:

  • Loading or updating large numbers of records, whether directly through the UI or via one or more integrations.
  • Extracting data, whether through reports, list views, or queries.

Optimizing the data model generally involves doing the following:

  • Only hosting data that truly needs to reside on the Salesforce platform based on business purpose and intent
  • Deferring or temporarily disabling sharing change processing and other business rule logic when performing certain data operations
  • Choosing the best (most efficient) operation...
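As a concrete illustration of the loading side of this, large record sets are typically split into batches before being sent to the platform, since the Salesforce Bulk API processes records in batches of up to 10,000. The following is a minimal Python sketch of client-side batching; the record shape and function name are illustrative assumptions, not Salesforce API code:

```python
def chunk_records(records, batch_size=10_000):
    """Split a large record set into batches suitable for a bulk load.

    The Salesforce Bulk API processes records in batches (10,000 records
    is its per-batch maximum), so splitting client-side keeps each
    request within platform limits.
    """
    for start in range(0, len(records), batch_size):
        yield records[start:start + batch_size]

# Illustrative usage: 25,000 records become three batches.
records = [{"Name": f"Account {i}"} for i in range(25_000)]
batches = list(chunk_records(records))
print([len(b) for b in batches])  # [10000, 10000, 5000]
```

Each batch would then be submitted as a separate Bulk API job batch, with sharing recalculation deferred until the load completes where appropriate.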

LDV performance mitigation strategies

There are several tools available to us as data architects for keeping things running as optimally as possible when working with data. First, we should question what data actually needs to reside in Salesforce, and consider moving data off-platform to reduce the performance impact of holding large volumes on it. Typically, data over a certain age (determined by business requirements or, in some instances, regulatory requirements) should be archived regularly to ensure that users only interact with data that makes sense to host on the Salesforce platform.
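Identifying records that have aged out of the retention window is usually the first step of a regular archiving job. The following Python sketch shows the idea; the `LastActivityDate` field name and the seven-year policy are illustrative assumptions that would come from the business or regulatory requirements mentioned above:

```python
from datetime import date, timedelta

def records_to_archive(records, retention_days, today=None):
    """Return records whose last activity predates the retention window.

    'LastActivityDate' is an illustrative field name; the retention
    period is driven by business or regulatory requirements.
    """
    today = today or date.today()
    cutoff = today - timedelta(days=retention_days)
    return [r for r in records if r["LastActivityDate"] < cutoff]

cases = [
    {"Id": "001A", "LastActivityDate": date(2015, 3, 1)},
    {"Id": "001B", "LastActivityDate": date(2022, 6, 1)},
]
# With a roughly seven-year retention policy evaluated on 2022-11-01,
# only the 2015 record falls outside the window.
old = records_to_archive(cases, retention_days=7 * 365, today=date(2022, 11, 1))
print([r["Id"] for r in old])  # ['001A']
```

In a real implementation, the same age filter would be expressed as a SOQL `WHERE` clause or a scheduled batch job rather than client-side Python.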

With data that may need to reside on Salesforce, some other techniques are available that can improve performance:

  • Custom indexes
  • Skinny tables
  • Selective filter conditions
  • Divisions

Let’s take a look at them.

Custom indexes

To speed up query performance, Salesforce supports the creation of custom indexes...
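Whether the query optimizer actually uses an index depends on the selectivity thresholds Salesforce documents for it: a custom index is considered when the filter matches fewer than 10% of the first million records plus 5% of records beyond that, capped at 333,333 records. A small Python sketch of that published formula:

```python
def custom_index_threshold(total_records):
    """Selectivity threshold for a custom index, per Salesforce's
    query optimizer documentation: 10% of the first million records,
    plus 5% of any records beyond that, capped at 333,333 records.
    A filter matching fewer records than this can use the index."""
    first_million = min(total_records, 1_000_000)
    remainder = max(total_records - 1_000_000, 0)
    return min(int(first_million * 0.10 + remainder * 0.05), 333_333)

# An object with 400,000 records: filters matching 40,000 or fewer
# records are selective enough to use a custom index.
print(custom_index_threshold(400_000))    # 40000
print(custom_index_threshold(5_000_000))  # 300000
```

Standard indexes use a more generous threshold (30% of the first million records plus 15% beyond, capped at 1 million), which is why understanding which fields are indexed matters when writing filter conditions.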

Data archiving strategies

Just as we host data on Salesforce for our users, there are situations where data needs to be archived off the platform. Reasons vary: regulatory compliance (where certain data must be retained), or keeping only the optimum amount of data on the platform (for example, retaining only the data that is actively used, and automatically archiving data over a certain age). Luckily, various options are available for archiving Salesforce data: on-platform solutions such as big objects, or storing data off-platform in an external system or data warehouse. We'll take a look at these options in the following subsections.

Big objects

As covered in Chapter 2, Data Modeling and Database Design, big objects are used to store and manage huge amounts of data (up to 1 million records by default, though this can be scaled up at an additional cost to tens of millions or even billions of records).
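Archiving into a big object typically follows a copy-then-delete pattern: records are upserted into the big object (which is keyed by its composite index fields), then removed from the source object. The sketch below models that flow with an in-memory stand-in written in Python; in practice this would be Apex or a Bulk API job, and the field names are illustrative:

```python
class BigObjectStore:
    """In-memory stand-in for a Salesforce big object: records are
    upserted by a composite index, mirroring how big objects use
    their index fields as the record's identity."""
    def __init__(self, index_fields):
        self.index_fields = index_fields
        self.rows = {}

    def upsert(self, record):
        key = tuple(record[f] for f in self.index_fields)
        self.rows[key] = record  # re-upserting the same key overwrites

def archive(source, store):
    """Copy records into the archive store, then remove them from the
    source object: the usual copy-then-delete archiving pattern."""
    for rec in source:
        store.upsert(rec)
    source.clear()

tasks = [{"AccountId": "001A", "ActivityDate": "2015-03-01", "Subject": "Call"}]
store = BigObjectStore(index_fields=["AccountId", "ActivityDate"])
archive(tasks, store)
print(len(store.rows), len(tasks))  # 1 0
```

Because the upsert is keyed on the index, re-running the archive job over the same records is safe: duplicates overwrite rather than accumulate.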

Big objects provide...

Summary

In this chapter, we dug deep into data architecture on the Salesforce platform, its potential pitfalls, and the available mitigation strategies. Understanding why data skew happens helps us design better parent/child record ownership strategies. Likewise, understanding the way Salesforce uses indexes ensures that we can create reports, list views, and queries that work with the constraints of the platform's multi-tenant architecture, not against them. We looked at LDV issues and mitigation strategies, and saw how concepts such as selective filter conditions and skinny tables help us work with large amounts of data effectively.

Next, we turned our attention to data archiving strategies and the various options we have at our disposal, ensuring that we are keeping to regulatory requirements for data retention and archival (if appropriate) and ensuring that we only use the data that is relevant to our users.

In Chapter 7, Data Migration, we’re...

Practice questions

Test your knowledge of the topics covered in this chapter by answering the following questions:

  1. What does LDV stand for?
  2. Which type of Salesforce object is used to store 1 million or more records?
  3. One account that contains over 10,000 child records is known as what?
  4. More than 10,000 records looking up to a single parent record is known as what?
  5. Utilizing data hosted on an external system is a technique known as what?
  6. What can be used to partition data and reduce the number of records returned by SOQL queries and reports?
  7. What can be used to avoid joins within queries to data held within a single object and speed up read-only operations?
  8. More than 10,000 child records belonging to the same parent record is known as what?
  9. Specify the type of index that’s used in the following scenarios:
    • When a SOQL query is executed against object data where that object contains 400,000 records, and the filter matches 40,000 or fewer records...