Packt+ | Advance your knowledge in tech

You're reading from Raspberry Pi Super Cluster

Product typeBook

Published inNov 2013

Reading LevelBeginner

PublisherPackt

ISBN-139781783286195

Edition1st Edition

Languages

Python

Tools

Raspberry Pi

Concepts

Single Board Computers

Author (1)

Andrew K. Dennis

Chapter 6. Calculate Pi with Hadoop and MPI

Now since we have set up Hadoop, written, and run our first application in it, we can look at the concept of Monte Carlo simulators and how we can calculate Pi (П)using Hadoop and MPI. This brings together and compares the two technologies we have explored in Chapters 2 through 6.

Monte Carlo simulators

A Monte Carlo simulator, also known as Monte Carlo methods, is a type of computational method found in a variety of fields ranging from physics to finance.

Monte Carlo simulators use randomized sampling repeatedly in order to obtain a result for a particular mathematical question.

The name is derived from the city of Monte Carlo in Monaco. The origin of the name comes from Manhattan project participants Stanislaw Ulam and John Von Neumann in reference to a relative of Ulam who had a taste for gambling.

Calculating П is an example of a problem especially suited to this type of algorithm and an early example of this is Buffon's needle. You can read more about the history of this experiment at Wolfram MathWorld:

http://mathworld.wolfram.com/BuffonsNeedleProblem.html

In order to calculate П we can also use another method that involves a diagram displaying a circle located in a square divided into four quarters.

In this diagram we are interested in the top-right quarter of...

A Hadoop application to calculate Pi

Hadoop comes packaged with a number of example applications. We are of course interested in calculating П program in particular.

The source code for this application can be downloaded from Apache's website at the following URL:

https://svn.apache.org/repos/asf/hadoop/common/trunk/hadoop-mapreduce-project/hadoop-mapreduce-examples/src/main/java/org/apache/hadoop/examples/QuasiMonteCarlo.java

The JAR file containing the compiled class can be found on your machine at:

/home/pi/hadoop/hadoop-1.2.1/hadoop-examples-1.2.1.jar

Let's navigate to this directory:

cd ~/hadoop/hadoop-1.2.1

We are now going to run the example. The program takes two inputs: the number of maps and the number of samples. Try running the following demonstration:

hadoop jar hadoop-examples-1.2.1.jar pi 2 4

You should now see something similar to:

Number of Maps  = 2
Samples per Map = 4
Wrote input for Map #0
Wrote input for Map #1
Starting Job
…
Job Finished in 273.167 seconds
Estimated value...

Pi with C language and MPI

We have seen that we can calculate П with Hadoop. We can now try a similar application in C. The program we will now write will generate results similar to what we saw with the example program included with MPICH and will also use a MapReduce-style approach.

Create a new file at the following location to store your code in:

~/mpich3/code/monte_carlo_pi.c

Open this file and add the following code:

#include "mpi.h"
#include <stdio.h>
#include <stdlib.h>
#include <time.h>

double insidecircle(int throws);

#define GAMES 20
#define THROWS 100

The previous block of code includes the necessary header files and defines a function and two constants. The function insidercircle() will be responsible for calculating П.

The first constant is the number of GAMES, that is, attempts at calculating П. The second defines the number of THROWS in each game. Now add the following code to the end of the file:

int main (int argc, char *argv[]) {

double jobaverage, calcpi...

Summary

In this chapter, we brought together the technologies we have studied so far and compared them by looking at how they both solve the problem through parallel computing.

This problem was calculating П using a Monte Carlo style simulator.

In case of the MPI, we wrote a small application in C, which gave us some more exposure to programming parallel applications.

Now that you have a taste for how these two technologies can be used, you have the context to explore both Hadoop and MPI in more detail including editing the C program and writing your own Java application.

In the next chapter, we shall be looking at further tasks which we can perform with our Raspberry Pi cluster.

The rest of the chapter is locked

You have been reading a chapter from

Raspberry Pi Super Cluster

Published in: Nov 2013Publisher: PacktISBN-13: 9781783286195

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

undefined

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $15.99/month. Cancel anytime

Author (1)

Andrew K. Dennis

Andrew K. Dennis is a full stack and cybersecurity architect with over 17 years' experience who currently works for Modus Create in Reston, VA. He holds two undergraduate degrees in software engineering and creative computing and a master's degree in information security. Andy has worked in the US, Canada, and the UK in software engineering, e-learning, data science, and cybersecurity across his career, and has written four books on IoT, the Raspberry Pi, and supercomputing. His interests range from the application of pataphysics in computing to security threat modeling. Andy lives in New England and is an organizer of Security BSides CT.
Read more about Andrew K. Dennis

Personalised recommendations for you

Based on your interests and search pattern

Architectural Patterns and Techniques for Developing IoT Solutions

This book covers all the patterns and considerations that give you both the power and flexibility to build scalable, secure, and performant IoT solutions by combining various patterns in interesting ways. It also lists the benefits of combining IoT with technologies like blockchain, 3D-printing, 5G, Generative AI, quantum computing, and LLMs.

BookSep 2023304 pages

Arduino Data Communications

Arduino Data Communication focuses on IoT’s Internet aspect, guiding you in setting up your own infrastructure for storing and managing the data collected from sensors. This book goes beyond microcontroller basics, equipping you with the knowledge essential for building real-world projects.

BookNov 2023286 pages5

Arduino IoT Cloud for Developers

From fundamental principles to advanced techniques, this comprehensive book equips you with the knowledge and skills needed to design and deploy IoT applications seamlessly. Explore cloud integration, best practices, and real-world projects to harness the full potential of IoT application development with the Arduino IoT Cloud.

BookNov 2023402 pages

The Azure IoT Handbook

Building IoT Systems with Azure IoT is a comprehensive introduction for those who are new to the Internet of Things and looking to get up to speed in no time. This book will teach you how to create and develop IoT solutions with intelligent edge-to-cloud technologies in the Azure cloud.

BookDec 2023248 pages