Reader small image

You're reading from  Building Big Data Pipelines with Apache Beam

Product typeBook
Published inJan 2022
Reading LevelBeginner
PublisherPackt
ISBN-139781800564930
Edition1st Edition
Languages
Right arrow
Author (1)
Jan Lukavský
Jan Lukavský
author image
Jan Lukavský

Jan Lukavský is a freelance big data architect and engineer who is also a committer of Apache Beam. He is a certified Apache Hadoop professional. He is working on open source big data systems combining batch and streaming data pipelines in a unified model, enabling the rise of real-time, data-driven applications.
Read more about Jan Lukavský

Right arrow

Conventions used

There are a number of text conventions used throughout this book.

Code in text: Indicates code words in the text, database table names, folder names, filenames, file extensions, pathnames, dummy URLs, user input, and Twitter handles. Here is an example: "For this reason, Beam defined a RestrictionTracker object."

A block of code is set as follows:

ClassLoader loader = FirstPipeline.class.getClassLoader();String file = loader.getResource("lorem.txt").getFile();List<String> lines = Files.readAllLines(Paths.get(file), StandardCharsets.UTF_8); 

When we wish to draw your attention to a particular part of a code block, the relevant lines or items are set in bold:

@Override
public PCollection<String> expand(PBegin input) {
  return input
    .apply(Impulse.create())
    .apply(MapElements.into(TypeDescriptors.strings())
        .via(e -> directoryPath))
    .apply(new DirectoryWatch())
    .apply(Reshuffle.viaRandomKey())
    .apply(new FileRead());
}

Any command-line input or output is written as follows:

$ ./mvnw clean install

Bold: Indicates a new term, an important word, or words that you see on screen. For instance, words in menus or dialog boxes appear in bold. Here is an example: "A typical example of a runner would be Apache Flink, Apache Spark, or Google Cloud Dataflow."

Tips or Important notes

Appear like this.

lock icon
The rest of the page is locked
Previous PageNext Page
You have been reading a chapter from
Building Big Data Pipelines with Apache Beam
Published in: Jan 2022Publisher: PacktISBN-13: 9781800564930

Author (1)

author image
Jan Lukavský

Jan Lukavský is a freelance big data architect and engineer who is also a committer of Apache Beam. He is a certified Apache Hadoop professional. He is working on open source big data systems combining batch and streaming data pipelines in a unified model, enabling the rise of real-time, data-driven applications.
Read more about Jan Lukavský