This article, created by Jonathan R. Owens, Jon Lentz, and Brian Femiano, authors of Hadoop Real-World Solutions Cookbook, contains recipes designed to show how you can put Hadoop to use to answer different questions about your data. Several of the Hive examples demonstrate how to properly implement and use a custom function (UDF) for reuse in different analytics. There are two Pig recipes that show different analytics with the Audioscrobbler dataset and one MapReduce Java API recipe that shows Combiners.
In this article, we will cover:
- Counting distinct IPs in weblog data using MapReduce and Combiners
- Using Hive date UDFs to transform and sort event dates from geographic event data
- Using Hive to build a per-month report of fatalities over geographic event data
- Implementing a custom UDF in Hive to help validate source reliability over geographic event data
- Marking the longest period of non-violence using Hive MAP/REDUCE operators and Python
- Calculating the cosine similarity of Artists in the Audioscrobbler dataset using Pig
- Trimming outliers from the Audioscrobbler dataset using Pig and datafu
Learning to apply Apache Hive, Pig, and MapReduce to solve the specific problems you are faced with can be difficult. The recipes in this article present a few big data problems and provide solutions that show how to tackle them. You will notice that the questions we ask of the data are not incredibly complicated, but you will require a different approach when dealing with a large volume of data. Even though the sample datasets in the recipes are small, you will find that the code is still very applicable to bigger problem spaces distributed over large Hadoop clusters.
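To make the combiner idea behind the first recipe concrete, here is a minimal sketch in plain Python. It is not the Hadoop Java API that the recipe itself uses, and it only imitates the map, combine, and reduce stages in memory. It assumes that each weblog line begins with the client IP address; the function names and sample lines are hypothetical and chosen purely for illustration.

```python
def map_phase(lines):
    """Emit (ip, 1) for every weblog line.

    Assumption: the IP address is the first whitespace-separated field.
    """
    for line in lines:
        fields = line.split()
        if fields:
            yield fields[0], 1


def combine(pairs):
    """Collapse repeated IPs within one input split.

    This plays the role of a combiner: it runs on the mapper side so
    that each distinct IP is sent to the reducer only once per split.
    """
    seen = set()
    for ip, _ in pairs:
        if ip not in seen:
            seen.add(ip)
            yield ip, 1


def reduce_phase(combined_outputs):
    """Merge the combiner output of every split and count distinct IPs."""
    distinct = set()
    for pairs in combined_outputs:
        for ip, _ in pairs:
            distinct.add(ip)
    return len(distinct)


def count_distinct_ips(splits):
    """Run map -> combine per split, then a single reduce over all splits."""
    return reduce_phase(combine(map_phase(split)) for split in splits)


# Two hypothetical input splits, as if read from two blocks of a weblog.
splits = [
    ["10.0.0.1 GET /a", "10.0.0.1 GET /b", "10.0.0.2 GET /a"],
    ["10.0.0.2 GET /c", "10.0.0.3 GET /a"],
]
distinct_count = count_distinct_ips(splits)
```

The point of the combiner step is that the first split forwards two records instead of three; on a real cluster this kind of local pre-aggregation reduces the data shuffled across the network.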
The analytic questions in this article are designed to highlight many of the more powerful features of the various tools. You will find many of these features and operators useful as you begin solving your own problems.
Read Big Data Analysis in full
This article, written by Mythravarun Vepakomma, the author of the book Blender Compositing and Post Processing, deals with advanced compositing beyond grading. These techniques emphasize alternate methods in Blender Compositing for some specific 3D render requirements that can save a lot of render time, and therefore budget, when making a CG film. The following topics are presented in this article:
- Fresnel effect in compositing
As a DBA and advanced user of SQL Server 2008 R2, you have to develop best practices that combine a streamlined set of services for deployment, migration, and management of your data platform. SQL Server 2008 R2 has rich and intuitive GUI management tools, which lower the cost of data management by incorporating automation and delegation of administrative tasks. This article covers the SSMS features that are new and enhanced in SQL Server 2008 R2 to enable operational efficiency in developing joint best practices and integrated solutions.
In this article by Satya Shyam K Jayanty, author of Microsoft SQL Server 2008 R2 Administration Cookbook, we will cover:
- Implementing Utility and Non-utility collection sets
- Designing and refreshing Scalable Shared Database features and enhancements
- Managing SQL Server Replication processes
- Implementing security for SQL Server Agent jobs management
Packt is dedicated to the Open Source community, and over the course of the past month, there have been articles published on www.PacktPub.com that feature various Open Source communities, with insight provided by a respected figure from each community. Now, let's turn the tables and look at the Open Source community within Packt.
The Packt Open Source team provides the Open Source community with the valuable resource of books and eBooks that share the experiences of your fellow IT professionals and bring you more focused, solutions-based content.
We start our 'Meet the team' with none other than Doug Paterson, the Open Source Publisher...
Read Believe in Open Source: Meet the team in full
haXe allows us to target several platforms; so, you may want to take advantage of this feature to be able to use your applications or libraries on several platforms. Unfortunately, there are some drawbacks, but don't worry, we will go through them and see how to work around them.
In this article by Benjamin Dasnois, author of haXe 2 Beginner's Guide: RAW, we will:
- See what is cross-platform in the standard library
- Talk about platform-specific packages
- Learn about their specificities
- Learn about conditional compilation
So, not only are we going to talk about being cross-platform, but also about platform-specific things. So, if you're ready, let's get started!
Read Being Cross-platform with haXe in full
Ever wondered what goes on behind the scenes when a book is published?
Follow the story, as it takes place, behind the Oracle Database 11g R2 Performance Tuning Cookbook
Behavior-driven Development (BDD) is an agile software development practice that enhances the paradigm of Test Driven Development (TDD) and acceptance tests, and encourages the collaboration between developers, quality assurance, domain experts, and stakeholders. Behavior-driven Development was introduced by Dan North in the year 2003 in his seminal article available at http://dannorth.net/introducing-bdd/.
In this article by Unmesh Gundecha, author of Selenium Testing Tools Cookbook, we will cover:
- Using Cucumber-JVM and Selenium WebDriver in Java for BDD
- Using SpecFlow.NET and Selenium WebDriver in .NET for BDD
- Using JBehave and Selenium WebDriver in Java
- Using Capybara, Cucumber, and Selenium WebDriver in Ruby
The BatteryMonitor application allows you to monitor the state and battery levels of your iOS device using the APIs that come with the iOS SDK. Each iOS device represents a unique set of properties that include the device's current physical orientation, its model name, and its battery state. It also provides access to the onboard hardware.
In this article by Steven F Daniel, author of iPad Enterprise Application Development BluePrints, we will take a closer look at how we can use the Core Graphics framework to create and draw a gauge that will be used to present and visualize the total amount of battery life remaining on the iOS device, and then start to design the user interface for our app.
Read BatteryMonitor Application in full
More specifically, we'll cover the following:
- Issues with combining scripts
In this article by Alex Libby, the author of the book Instant HTML5 Local Storage How-to, we will discuss HTML5 Local Storage.
When using HTML5 Local Storage, there are two types that you can use: local storage and session storage. We'll begin by looking at the former, using a simple form as the basis for our article.
Read Basic use of Local Storage in full
In India, being a Manager is highly valued. A majority of people see themselves taking a managerial position some day. However, can anyone become a manager? A really good manager? Are managers born or made? Do all managers, at least all good managers, share something in common?
When we look around and see the journeys being taken by different managers, their working styles and behaviors, we can hypothesize that:
- Managers are born and made. Some folks have a natural flair to be a manager and some acquire essential skills to be a manager in a given situation.
- Not everyone may enjoy being a manager. While you may be 'promoted' to become a manager, you may find that you don't really enjoy the time spent talking to people, driving them to results, and compiling status reports for your management.
- It appears that good managers do have many things in common, even though they may have their own style of execution.
In this article by Rahul Goyal, author of Management in India: Grow from an Accidental to a Successful Manager in the IT & Knowledge Industry, we will explore the skills, traits, talents, and competencies that are usually required and expected for playing a manager role, and also bust some myths surrounding managers.
Read Basic Skills, Traits, and Competencies of a Manager in full
In this article by Benjamin Bahrenburg, the author of the book Appcelerator Titanium Business Application Development Cookbook, we will cover:
- Implementing iOS data protection in Titanium
- Basic authentication using Ti.Network.HTTPClient
- Implementing a cross-platform passcode screen
- Working with protected ZIP files on iOS
In this article by Joachim Ziebs, author of AfterShot Pro: Non-destructive photo editing and management, we will perform some basic editing:
- Conversion to black and white
In this article by Peter Svensson, we will have a look at the Dijit, or Dojo widget, which is an instance of a Dojo class that lives in the page and usually has a DOM node associated with it. Examples are FloatingPane, Tree, NumberSpinner, and ValidationTextBox.
The basic premise of a Dijit is that you should be able to create any number of widgets of the same type on the same page, regardless of where on the page you choose to put them. Certain Dijits have special requirements, naturally, such as the Layout Containers, which assume that they are surrounding ContentPane Dijits.
In general, Dijits are very versatile, and much time has been spent to ensure the portability and modularization of the Dijit system. Best of all, this is accessible to you as a developer as well.
Read Basic Dijit Knowledge in Dojo in full
This article has been created by Vijay Parthasarathy, the author of Learning Cassandra for Administrators. The Apache Cassandra database is a linearly scalable and highly available distributed data store which doesn't compromise on performance and runs on commodity hardware. Cassandra's support for replicating across multiple datacenters / multiple discrete environments is the best in the industry. Cassandra provides high throughput with low latency without any single point of failure on commodity hardware.
Cassandra was inspired by the two papers published by Google (BigTable) in 2006 and Amazon (Dynamo) in 2007, after which Cassandra added more features. Cassandra is different from most of the NoSQL solutions in a lot of ways: the core assumption of most of the distributed NoSQL solutions is that Mean Time Between Failures (MTBF) of the whole system becomes negligible when the failures of individual nodes are independent, thus resulting in a highly reliable system.
Read Basic Concepts and Architecture of Cassandra in full
This article by Krishna Kumar, the author of the book Learning Physics Modeling with PhysX, provides an overview of the concepts that we use in PhysX. It will familiarize you with terms such as scene, actor, material, shape, and so on.
The topics covered in this article are as follows:
- Initializing PhysX and creating the scene and actors
- Creating shapes and materials and then assigning them to actors
- Simulating and then shutting down PhysX
Exchanging information in the form of short messages is becoming more and more important, so frameworks for doing this will be a key factor in software development. Messages and information can be exchanged at exponential speed with JBoss HornetQ asynchronous messaging middleware.
In this article by Piero Giacomelli, author of HornetQ Messaging Developer’s Guide, we will cover the following topics:
- Installing Eclipse and NetBeans for developing with HornetQ on both Windows and Ubuntu
- Setting up a development environment for working with HornetQ core API in Eclipse and NetBeans
- Creating an example for producing and consuming messages in HornetQ in both a synchronous and an asynchronous way
- Implementing some classes using the performance practice for managing core API connections, sessions, and clients