Home Cloud & Networking Learning Ceph - Second Edition

Learning Ceph - Second Edition

By Karan Singh , Vaibhav Bhembre , Anthony D'Atri
books-svg-icon Book
Subscription FREE
eBook + Subscription €14.99
eBook €32.99
Print + eBook €41.99
READ FOR FREE Free Trial for 7 days. €14.99 p/m after trial. Cancel Anytime! BUY NOW BUY NOW BUY NOW
What do you get with a Packt Subscription?
This book & 7000+ ebooks & video courses on 1000+ technologies
60+ curated reading lists for various learning paths
50+ new titles added every month on new and emerging tech
Early Access to eBooks as they are being written
Personalised content suggestions
Customised display settings for better reading experience
50+ new titles added every month on new and emerging tech
Playlists, Notes and Bookmarks to easily manage your learning
Mobile App with offline access
What do you get with a Packt Subscription?
This book & 6500+ ebooks & video courses on 1000+ technologies
60+ curated reading lists for various learning paths
50+ new titles added every month on new and emerging tech
Early Access to eBooks as they are being written
Personalised content suggestions
Customised display settings for better reading experience
50+ new titles added every month on new and emerging tech
Playlists, Notes and Bookmarks to easily manage your learning
Mobile App with offline access
What do you get with eBook + Subscription?
Download this book in EPUB and PDF formats
This book & 6500+ ebooks & video courses on 1000+ technologies
60+ curated reading lists for various learning paths
50+ new titles added every month on new and emerging tech
Early Access to eBooks as they are being written
Personalised content suggestions
Customised display settings for better reading experience
50+ new titles added every month on new and emerging tech
Playlists, Notes and Bookmarks to easily manage your learning
Mobile App with offline access
What do you get with a Packt Subscription?
This book & 6500+ ebooks & video courses on 1000+ technologies
60+ curated reading lists for various learning paths
50+ new titles added every month on new and emerging tech
Early Access to eBooks as they are being written
Personalised content suggestions
Customised display settings for better reading experience
50+ new titles added every month on new and emerging tech
Playlists, Notes and Bookmarks to easily manage your learning
Mobile App with offline access
What do you get with eBook?
Download this book in EPUB and PDF formats
Access this title in our online reader
DRM FREE - Read whenever, wherever and however you want
Online reader with customised display settings for better reading experience
What do you get with video?
Download this video in MP4 format
Access this title in our online reader
DRM FREE - Watch whenever, wherever and however you want
Online reader with customised display settings for better learning experience
What do you get with Audiobook?
Download a zip folder consisting of audio files (in MP3 Format) along with supplementary PDF
READ FOR FREE Free Trial for 7 days. €14.99 p/m after trial. Cancel Anytime! BUY NOW BUY NOW BUY NOW
Subscription FREE
eBook + Subscription €14.99
eBook €32.99
Print + eBook €41.99
What do you get with a Packt Subscription?
This book & 7000+ ebooks & video courses on 1000+ technologies
60+ curated reading lists for various learning paths
50+ new titles added every month on new and emerging tech
Early Access to eBooks as they are being written
Personalised content suggestions
Customised display settings for better reading experience
50+ new titles added every month on new and emerging tech
Playlists, Notes and Bookmarks to easily manage your learning
Mobile App with offline access
What do you get with a Packt Subscription?
This book & 6500+ ebooks & video courses on 1000+ technologies
60+ curated reading lists for various learning paths
50+ new titles added every month on new and emerging tech
Early Access to eBooks as they are being written
Personalised content suggestions
Customised display settings for better reading experience
50+ new titles added every month on new and emerging tech
Playlists, Notes and Bookmarks to easily manage your learning
Mobile App with offline access
What do you get with eBook + Subscription?
Download this book in EPUB and PDF formats
This book & 6500+ ebooks & video courses on 1000+ technologies
60+ curated reading lists for various learning paths
50+ new titles added every month on new and emerging tech
Early Access to eBooks as they are being written
Personalised content suggestions
Customised display settings for better reading experience
50+ new titles added every month on new and emerging tech
Playlists, Notes and Bookmarks to easily manage your learning
Mobile App with offline access
What do you get with a Packt Subscription?
This book & 6500+ ebooks & video courses on 1000+ technologies
60+ curated reading lists for various learning paths
50+ new titles added every month on new and emerging tech
Early Access to eBooks as they are being written
Personalised content suggestions
Customised display settings for better reading experience
50+ new titles added every month on new and emerging tech
Playlists, Notes and Bookmarks to easily manage your learning
Mobile App with offline access
What do you get with eBook?
Download this book in EPUB and PDF formats
Access this title in our online reader
DRM FREE - Read whenever, wherever and however you want
Online reader with customised display settings for better reading experience
What do you get with video?
Download this video in MP4 format
Access this title in our online reader
DRM FREE - Watch whenever, wherever and however you want
Online reader with customised display settings for better learning experience
What do you get with Audiobook?
Download a zip folder consisting of audio files (in MP3 Format) along with supplementary PDF
  1. Free Chapter
    Introducing Ceph Storage
About this book
Learning Ceph, Second Edition will give you all the skills you need to plan, deploy, and effectively manage your Ceph cluster. You will begin with the first module, where you will be introduced to Ceph use cases, its architecture, and core projects. In the next module, you will learn to set up a test cluster, using Ceph clusters and hardware selection. After you have learned to use Ceph clusters, the next module will teach you how to monitor cluster health, improve performance, and troubleshoot any issues that arise. In the last module, you will learn to integrate Ceph with other tools such as OpenStack, Glance, Manila, Swift, and Cinder. By the end of the book you will have learned to use Ceph effectively for your data storage requirements.
Publication date:
October 2017
Publisher
Packt
Pages
340
ISBN
9781787127913

 

Chapter 1. Introducing Ceph Storage

Ceph is an open source project that provides a solution for software-defined, network-available storage with high performance and no single point of failure. It is designed to be highly scalable to the exabyte level and beyond while running on general-purpose commodity hardware.

In this chapter, we will cover the following topics:

  • The history and evolution of Ceph
  • What's new since the first edition of Learning Ceph
  • The future of storage
  • Ceph compared with other storage solutions

Ceph garners much of the buzz in the storage industry due to its open, scalable, and distributed nature. Today public, private, and hybrid cloud models are dominant strategies for scalable and scale-out infrastructure. Ceph's design and features including multi-tenancy are a natural fit for cloud Infrastructure as a Service (IaaS) and Platform as a Service (PaaS) deployments: at least 60% of OpenStack deployments leverage Ceph.

Note

For more information regarding the use of Ceph within OpenStack deployments, visit https://keithtenzer.com/2017/03/30/openstack-swift-integration-with-ceph.

Ceph is architected deliberately to deliver enterprise-quality services on a variety of commodity hardware. Ceph's architectural philosophy includes the following:

  • Every component must be scalable
  • No individual process, server, or other component can be a single point of failure
  • The solution must be software-based, open source, and adaptable
  • Ceph software should run on readily available commodity hardware without vendor lock-in
  • Everything must be self-manageable wherever possible

Ceph provides great performance, limitless scalability, power, and flexibility to enterprises, helping them move on from expensive proprietary storage silos. The Ceph universal storage system provides block, file, and object storage from a single, unified back-end, enabling customers to access storage as their needs evolve and grow.

The foundation of Ceph is objects, building blocks from which complex services are assembled. Any flavor of data, be it a block, object, or file, is represented by objects within the Ceph backend. Object storage is the flexible solution for unstructured data storage needs today and in the future. An object-based storage system offers advantages over traditional file-based storage solutions that include platform and hardware independence. Ceph manages data carefully, replicating across storage devices, servers, data center racks, and even data centers to ensure reliability, availability, and durability. Within Ceph objects are not tied to a physical path, making objects flexible and location-independent. This enables Ceph to scale linearly from the petabyte level to an exabyte level.

 

The history and evolution of Ceph


Ceph was developed at University of California, Santa Cruz, by Sage Weil in 2003 as a part of his PhD project. The initial implementation provided the Ceph Filesystem (CephFS) in approximately 40,000 lines of C++ code. This was open sourced in 2006 under a Lesser GNU Public License (LGPL) to serve as a reference implementation and research platform. Lawrence Livermore National Laboratory supported Sage's early followup work from 2003 to 2007.

DreamHost, a Los-Angeles-based web hosting and domain registrar company also co-founded by Sage Weil, supported Ceph development from 2007 to 2011. During this period Ceph as we know it took shape: the core components gained stability and reliability, new features were implemented, and the road map for the future was drawn. During this time a number of key developers began contributing, including Yehuda Sadeh-Weinraub, Gregory Farnum, Josh Durgin, Samuel Just, Wido den Hollander, and Loïc Dachary.

In 2012 Sage Weil founded Inktank to enable the widespread adoption of Ceph. Their expertise, processes, tools, and support enabled enterprise-subscription customers to effectively implement and manage Ceph storage systems. In 2014 Red Hat, Inc.,the world's leading provider of open source solutions, agreed to acquire Inktank.

The term Ceph is a common nickname given to pet octopuses; Ceph and is an abbreviation of cephalopod, marine animals belonging to the Cephalopoda class of molluscs. Ceph's mascot is an octopus,referencing the highly parallel behavior of an octopus and was chosen to connect the file system with UCSC's mascot, a banana slug named Sammy. Banana slugs are gastropods,which are also a class of molluscs. As Ceph is not an acronym, it should not be uppercased as CEPH.

Note

For additional information about Ceph in general, please visit https://en.wikipedia.org/wiki/Ceph_(software)

Ceph releases

Each release of Ceph has a numeric version. Major releases also receive cephalopod code-names in alphabetical order. Through the Luminous release the Ceph community tagged a new major version about twice a year, alternating between Long Term Support (LTS) and stable releases. The latest two LTS releases were officially supported, but only the single latest stable release.

Note

For more information on Ceph releases please visit https://ceph.com/category/releases.

The release numbering scheme has changed since the first edition of Learning Ceph was published. Earlier major releases were tagged initially with a version number (0.87) and were followed by multiple point releases (0.87.1, 0.87.2, ...). Releases beginning with Infernalis however are numbered as shown:

The major release number matches the letter of the alphabet of its code name (for example I is the ninth letter of the English alphabet, so 9.2.1 was named Infernalis). As we write, there have been four releases following this numbering convention: Infernalis, Jewel, Kraken, and Luminous.

The early versions of each major release have a type of 0 in the second field, which indicates active pre-release development status for early testers and the brave of heart. Later release candidates have a type of 1 and are targeted at test clusters and brave users. A type of 2 represents a general-availability, production-ready release. Point releases mostly contain security and bug fixes, but sometimes offer functionality improvements as well.

Ceph release name

Ceph package version

Release date

Argonaut

0.48 (LTS)

July 2012

Bobtail

0.56 (LTS)

January 2013

Cuttlefish

0.61

May 2013

Dumpling

0.67 (LTS)

August 2013

Emperor

0.72

November 2013

Firefly

0.80 (LTS)

May 2014

Giant

0.87

October 2014

Hammer

0.94 (LTS)

April 2015

Infernalis

9.2.1

November 2015

Jewel

10.2.3 (LTS)

April 2016

Kraken

11.2.0

January 2017

Luminous

12.2.0 (LTS)

August 2017

Mimic

13.2.0

2018

Nautilus

14.2.0

2019

Note

Note that as this book was being readied for publication in October 2017 Sage announced that the release cycle has changed. Starting with Mimic there will no longer be alternating LTS and stable releases. Each release henceforth will be LTS at a roughly 9 month cadence. For the details visithttps://github.com/ceph/ceph/pull/18117/files

 

New since the first edition


The Jewel LTS release brought a number of significant changes:

  • Unified queue of client I/O, recovery, scrubs, and snapshot trimming
  • Daemons now run as the ceph user, which must be addressed when upgrading
  • Cache tier improvements
  • SHEC erasure coding is no longer experimental
  • The SWIFT API now supports object expiration
  • RBD improvements (now supports suffixes)
  • rbd du shows actual and provisioned usage quickly via object-map and fast-diff features
  • New rbd status command
  • deep-flatten now handles snapshots
  • CephFS snapshots can now be renamed
  • And CephFS is considered stable!
  • Scrubbing improvements
  • TCMalloc improvements
  • Multisite functionality in RGW significantly improved
  • OpenStack Keystone v3 support
  • Swift per-tenant namespace
  • Async RBD mirroring
  • A new look for ceph status

Note

More details on the Jewel release can be found at http://ceph.com/releases/v10-2-0-jewel-released.

As we write, the major Luminous LTS release has just reached general availability. Early experiences are positive and it is the best choice for new deployments. Much-anticipated features in Luminous include:

  • The BlueStore back end is supported
  • In-line compression and read checksums
  • Erasure coding for RBD volumes
  • Better tools for uniform OSD utilization
  • Improved tools for the OSD lifecycle
  • Enhanced CLI
  • Multiple active CephFS MDS servers are supported

Note

The release notes for Luminous 12.2.0 can be found at https://ceph.com/releases/v12-2-0-luminous-released/.

     
About the Authors
  • Karan Singh

    Karan is an IT expert and passionate tech enthusiast living with his beautiful wife Monika in Finland. He holds Honours degree in Bachelor of Computer Science and a Master degree in System Engineering from BITS, Pilani. Apart from this, he is a certified professional for technologies like OpenStack, NetApp and Oracle Solaris. Karan is currently working as a System Specialist of Storage and Cloud Platform for CSC - IT Center for Science Ltd. focusing all his energies on providing IaaS cloud solutions based on OpenStack and Ceph and building economic multi-petabyte storage system using Ceph. Karan possesses a rich skill set and working experience of a variety of cloud technologies. He devotes a part of his time to R&D and learning technologies. He is also the author of the very first book on Ceph titled Learning Ceph, published in 2014. Karan possesses extensive system administration skills and has excellent working experience on a variety of Unix environments, backup, enterprise storage systems, and cloud platforms. When not working on Ceph and OpenStack, Karan can be found working with Configuration management, containers and devops related tools. He loves writing about technologies and is an avid blogger. You can reach him on Twitter.

    Browse publications by this author
  • Vaibhav Bhembre

    Vaibhav Bhembre is a systems programmer working currently as a technical lead for cloud storage products at DigitalOcean. He holds a bachelor's degree in computer engineering from the University of Mumbai and a master's degree in computer science from the State University of New York in Buffalo. Before joining DigitalOcean, Vaibhav wore multiple hats and lead the backend engineering and reliability engineering teams at Sailthru Inc. From helping to scale dynamically generated campaign sends to over million users at a time, to architecting a cloud-scale compute and storage platform, Vaibhav has years of experience writing software across all layers of the stack. During his time as a student, Vaibhav co-published a novel graph algorithm that optimally computed closeness and betweenness in an incrementally updating social network. He also made changes to a highly available distributed file-system built on top of iRODs data management framework as his master's project. This system, that was actively used across more than 10+ educational institutions live, was his foray into large-scale distributed storage and his transition into using Ceph professionally was only natural.

    Browse publications by this author
  • Anthony D'Atri

    Anthony D'Atri's career in system administration spans from laptops to vector supercomputers. He has brought his passion for fleet management and server components to bear on a holistic yet, detailed approach to deployment and operations. His experience with architecture, operation, and troubleshooting of NetApp, ZFS, SVM, and other storage systems dovetailed neatly into Ceph. Anthony worked for three years at Cisco using Ceph as a petabyte-scale object and block backend to multiple OpenStack clouds. Now helping deliver awesome storage to droplet customers of DigitalOcean, Anthony aims to help the growing community build success with Ceph.

    Browse publications by this author
Latest Reviews (5 reviews total)
Thank you for your service , it's amazing
Excellent book, 2nd edition brings up to date many things
The perf tuning tips are pretty valuable.
Learning Ceph - Second Edition
Unlock this book and the full library FREE for 7 days
Start now