Search icon
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
AWS for Solutions Architects - Second Edition

You're reading from  AWS for Solutions Architects - Second Edition

Product type Book
Published in Apr 2023
Publisher Packt
ISBN-13 9781803238951
Pages 692 pages
Edition 2nd Edition
Languages
Authors (4):
Saurabh Shrivastava Saurabh Shrivastava
Profile icon Saurabh Shrivastava
Neelanjali Srivastav Neelanjali Srivastav
Profile icon Neelanjali Srivastav
Alberto Artasanchez Alberto Artasanchez
Profile icon Alberto Artasanchez
Imtiaz Sayed Imtiaz Sayed
Profile icon Imtiaz Sayed
View More author details

Table of Contents (19) Chapters

Preface 1. Understanding AWS Principles and Key Characteristics 2. Understanding the AWS Well-Architected Framework and Getting Certified 3. Leveraging the Cloud for Digital Transformation 4. Networking in AWS 5. Storage in AWS – Choosing the Right Tool for the Job 6. Harnessing the Power of Cloud Computing 7. Selecting the Right Database Service 8. Best Practices for Application Security, Identity, and Compliance 9. Driving Efficiency with CloudOps 10. Big Data and Streaming Data Processing in AWS 11. Data Warehouses, Data Queries, and Visualization in AWS 12. Machine Learning, IoT, and Blockchain in AWS 13. Containers in AWS 14. Microservice Architectures in AWS 15. Data Lake Patterns – Integrating Your Data across the Enterprise 16. Hands-On Guide to Building an App in AWS 17. Other Books You May Enjoy
18. Index

Amazon Elastic Map Reduce (EMR)

Back in 2009, AWS introduced EMR, a tool that can handle extremely large amounts of data (terabytes and petabytes) using the latest open-source big data tools like Spark, Hive, Presto, HBase, Flink, and Hudi in the cloud. Amazon EMR is a managed cluster platform that makes it easier to run big data tools, such as Apache Hadoop and Apache Spark, on the AWS cloud for processing and analyzing massive datasets. It is a wrapper around distributed open-source computing frameworks. This wrapper abstracts the effort required to set up infrastructure, security, network communication, disaster recovery, and scalability. Additionally, EMR offers 100% compliance with open-source APIs. So, there is no need to change your application code when you move to EMR from the on-premises Hadoop system.

EMR runs directly against the data stored in your S3 data lake, so you don’t need to move that data or transform your data. You can store data in the data lake...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime}