Search icon
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
Apache Flume: Distributed Log Collection for Hadoop

You're reading from  Apache Flume: Distributed Log Collection for Hadoop

Product type Book
Published in Jul 2013
Publisher Packt
ISBN-13 9781782167914
Pages 108 pages
Edition 1st Edition
Languages

Time zones are evil


In case you missed my bias against using local time in Chapter 4, Sinks and Sink Processors, I'll repeat it here a little stronger—time zones are evil. Evil like Dr. Evil (http://en.wikipedia.org/wiki/Dr._Evil)—and let's not forget about its "Mini Me" (http://en.wikipedia.org/wiki/Mini-Me) counterpart—daylight savings time.

We live in a global world now. You are pulling data from all over the place into your Hadoop cluster. You may even have multiple data centers in different parts of the country (or the world). The last thing you want to be doing while trying to analyze your data is to deal with askew data. Daylight savings time changes at least somewhere on Earth a dozen times in a year. Just look at the history (ftp://ftp.iana.org/tz/releases/). Save yourself a headache and just normalize it to UTC. If you want to convert it to "local time" on its way to human eyeballs, feel free. But while it lives in your cluster, keep it normalized to UTC. Consider adopting UTC everywhere...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime}