Reader small image

You're reading from  Analytics for the Internet of Things (IoT)

Product typeBook
Published inJul 2017
Reading LevelIntermediate
PublisherPackt
ISBN-139781787120730
Edition1st Edition
Languages
Right arrow
Author (1)
Andrew Minteer
Andrew Minteer
author image
Andrew Minteer

Andrew Minteer is currently the senior director, data science and research at a leading global retail company. Prior to that, he served as the director, IoT Analytics and Machine Learning at a Fortune 500 manufacturing company. He has an MBA from Indiana University with a background in statistics, software development, database design, cloud architecture, and has led analytics teams for over 10 years. He first taught himself to program on an Atari 800 computer at the age of 11 and fondly remembers the frustration of waiting through 20 minutes of beeps and static to load a 100-line program. He now thoroughly enjoys launching a 1 TB GPU-backed cloud instance in a few minutes and getting right to work. Andrew is a private pilot who looks forward to spending some time in the air sometime soon. He enjoys kayaking, camping, traveling the world, and playing around with his six-year-old son and three-year-old daughter.
Read more about Andrew Minteer

Right arrow

Applying big data technology to storage


With IoT data flooding into your cloud environment and after processing and transforming it, the next problem to solve is how to store it. The solution should support holding large datasets and be easy to interact with for analytics.

Hadoop

Hadoop is an open source effort that falls under the umbrella of the Apache Software Foundation. As defined by the official project documentation, The Apache Hadoop project develops open source software for reliable, scalable, distributed computing. Hadoop is available for free in its pure open source form.

Unless you have some Hadoop experts on your team, you should opt for one of the managed Hadoop distributions. This will give you a level of troubleshooting support and implementation advice. Cloudera and Hortonworks are two main providers of managed distributions and support. Amazon AWS and Microsoft Azure both have their own Hadoop managed services, EMR and HDInsights respectively.

Hadoop is a little difficult to...

lock icon
The rest of the page is locked
Previous PageNext Page
You have been reading a chapter from
Analytics for the Internet of Things (IoT)
Published in: Jul 2017Publisher: PacktISBN-13: 9781787120730

Author (1)

author image
Andrew Minteer

Andrew Minteer is currently the senior director, data science and research at a leading global retail company. Prior to that, he served as the director, IoT Analytics and Machine Learning at a Fortune 500 manufacturing company. He has an MBA from Indiana University with a background in statistics, software development, database design, cloud architecture, and has led analytics teams for over 10 years. He first taught himself to program on an Atari 800 computer at the age of 11 and fondly remembers the frustration of waiting through 20 minutes of beeps and static to load a 100-line program. He now thoroughly enjoys launching a 1 TB GPU-backed cloud instance in a few minutes and getting right to work. Andrew is a private pilot who looks forward to spending some time in the air sometime soon. He enjoys kayaking, camping, traveling the world, and playing around with his six-year-old son and three-year-old daughter.
Read more about Andrew Minteer