Sunday, April 20, 2014

Hadoop Distributions? Which version? Hype wheel Spins again!

It looks like the major Hadoop vendors are battling it out for mind share in the Big Data community.

First came the major announcement from Hortonworks of its next round of funding.

That was followed by the major announcement that Intel was dropping its own distro and backing Cloudera with $740M for an 18% stake.

The only player not jumping into the media blitz is MapR.

This latest round of buffoonery is driving a lot of back-and-forth marketing as the market consolidates and each vendor tries to assert its value and differentiation.  It is also driving some misleading press releases from both Cloudera and Hortonworks.  Hortonworks is fighting for its life, and honestly I have no idea how they plan to stay relevant.

The tired go-to-market position of, "We've got the most Hadoop committers and ours is 100% Open Source" is wearing thin.

This latest announcement of Hadoop 2.4 from Arun Murthy is a thinly veiled Hortonworks product announcement that in many ways violates the spirit of the Apache Software Foundation's recommendations on what constitutes Hadoop and on procedures for press announcements. I guess they are getting ready for Hadoop Summit in June and want to steal the spotlight for a moment.

And for the educated reader (someone who takes the time to decode this mess), the last Hadoop release labeled "GA" is 2.2, not 2.4.

Furthermore, I leave it to the reader to attempt to build Apache Hadoop 2.4 from source and verify that it builds and runs like the derivative 2.4 from Hortonworks.  I think this will be an enlightening exercise.
To Hortonworks' credit, they are still (as of 4/20/2014) listing 2.1 as their current version, which in some ways is even more confusing than just sticking with the 2.2 Apache GA version.

The announcement of Query Grid by Hortonworks+Teradata is nothing more than a continued refinement of Teradata's UDA strategy. Eventually Teradata is going to wake up and realize that Hortonworks is a dead end that is only riding Teradata's coattails into major accounts. Bolting SQL-H onto Stinger only makes the stack more prone to failure.

Cloudera is not blameless in this war of announcement and counter-announcement. The latest series of videos and infomercials about Impala versus Presto | Stinger | Hive is just plain junk, especially when it comes to Facebook's Presto capabilities at scale, which at the moment Impala doesn't have a "snowball's chance in h311" of being able to handle.  And how does the TPC ignore their constant, very loose use of "TPC-DS" when discussing benchmark results plainly meant as a sales pitch?

So what to do??

Well, MapR makes no bones about its value proposition and is quietly building a reputation for quality and reliability. They are embracing emerging technology from Berkeley's AMPLab in the form of a partnership with Databricks. Shark/Spark is rapidly becoming the hot tech around real-world analytic projects that deliver business value.

Going with MapR does have some risks, since MapR has decided to replace some critical pieces of Apache code with its own.

Then there are the newly minted independents that are building off the Apache main source code trunk.
Bunnyworks.net is a group quietly putting out an Apache "derivative" based solely on the approved 2.2 code line without additions. They are calling it pHd 2.2.  An analogy: pHd 2.2 is to Apache 2.2 as CentOS is to Red Hat.

More than ever, users of Hadoop-based technology really need to investigate and understand what they're getting when buying into different Hadoop-based product versions.

Caveat emptor!
