Skip to content

HDInsight Whitepapers

2013 April 13
by Brian Mitchell

It seems like most of the information you will find around Hadoop is focused on how to use the tools that are part of its ecosystem.  There is less documentation and focus on the architectural best practices.  I believe this is by design because the nature of Hadoop is supposed to be that it just works.  But if you’ve used it for any length of time, you know that this really isn’t true and some guidance on improving performance is always helpful.  Microsoft has released three new white papers on this subject for their version of Hadoop, HDInsight.  They are well written and provide good guidance for three subject areas that are going to come up time and time again for anyone implementing HDInsight.

Compression in Hadoop

http://msdn.microsoft.com/en-us/dn168917.aspx

Hadoop Performance in Hyper-V

http://msdn.microsoft.com/en-us/dn168918.aspx

Job Optimization in Hadoop

http://msdn.microsoft.com/en-us/dn197899.aspx

No comments yet

Leave a Reply

Note: You can use basic XHTML in your comments. Your email address will never be published.

Subscribe to this comment feed via RSS