Strata Conference + Hadoop World – Reflections on Big Data Trends

Bill Schmarzo shares his perspective on Strata Conference and Hadoop World, as well as what companies should do to take advantage of Big Data opportunities.

As many people know, I love attending Strata events.  They’re always full of interesting and provocative topics for discussion and the exhibit hall has lots of new or updated products and product roadmaps.

I recently attended the Strata + Hadoop World event in New York City and picked up a plethora of highly interesting and valuable information.

  1. It was a much older, more mature, and more experienced audience.  The first Strata conference I attended in Santa Clara was geared towards the startup and engineer/data scientist audiences.  There were lots of presentations and hallway discussions about over-throwing traditional data and analytics markets and leveraging a bevy of new, exotic technologies like Hadoop and MapReduce.  Very fun and very exciting!  The ambiance of this year’s NYC show was more reserved and the conversations more practical, which I think reflect that big data and Hadoop are moving into the mainstream conversation.
  2. The presence of traditional data management, data warehouse, and business intelligence vendors has grown.  The event is definitely maturing, and the traditional larger data warehouse and business intelligence vendors are demanding a larger say in the topics and agenda – which is not always to the benefit of attendees.  Too many booth presentations glossed over the real big data challenges like proof points and use cases. By the way, sticking the words “Big Data” in front of your product does not make your product big data ready.
  3. There was a constant and confusing stream of new product announcements.  Want Hadoop?  Which version?  From which vendor?  Instead of starting to settle out, the technology foundation of big data just seems to keep morphing.  How are organizations supposed to know which technologies they are going to need to invest in when there is a seemingly endless number of choices and combinations?  To be honest, I think these ever-shifting sands of big data technologies are slowing adoption as more organizations are struggling to stay on top of the latest and greatest product promises.  I’d hate to be a CIO today.
  4. Many of the same show topics were discussed in previous years.  Plus, many of those topics still focused on educating folks on the basics of big data technologies (or how yesterday’s technologies are still relevant).  I sat in a marvelous and packed presentation that discussed the differences between a data warehouse and Hadoop. The audience loved the presentation, but I saw that same presentation from different presenter two years ago.  Has big data become one of those topics everyone talks about, but no one really does much to further develop?  Or is it that they are still trying to find the best place to use big data technologies?

The conference convinced me more than ever that in order to be successful in determining which technology capabilities to invest in, organizations need to take a business-led approach. If organizations take a technology-led approach with big data, the constantly changing options will just confuse IT organizations and slow the adoption of some of these game-changing technologies.

Instead, we in EMC Global Services have learned that organizations need to define a targeted business initiative first.  Then let the business requirements (the decisions the business stakeholders need to make, the questions that they need to answer to support those decisions, the key performance indicators against which the business defines success) drive the technology and data requirements.

Organizations need guidelines that they can use in order to make the necessary data and technology determinations. Targeting a key business initiative is a great way to start.  Organizations must ask themselves the following questions:

  • What data, data management, and advanced analytics do I need to support that business initiative?
  • What supporting technologies and infrastructure do I need to support the data management and advanced analytics requirements?
  • How am I going to surface the results of the analytics in such a way that is actionable to my business stakeholders (user experience requirements)?

Perhaps the most important thing about selecting a specific business initiative is that it will also tell organizations what data, data management, and advanced analytic capabilities they won’t have to worry about.

The technology capabilities of the big data movement continue to amaze me. However, until we embrace a business-led approach that focuses on enabling the organization’s key business initiatives with specific data management, advanced analytics, and actionable user experience capabilities, big data will continue to look like a science experiment with confusing and questionable technology claims – lots of talk and lots of interest, but not a lot of action.

About the Author: Bill Schmarzo

Bill Schmarzo, author of “Big Data: Understanding How Data Powers Big Business” and “Big Data MBA: Driving Business Strategies with Data Science”, is responsible for setting strategy and defining the Big Data service offerings for Dell EMC’s Big Data Practice. As a CTO within Dell EMC’s 2,000+ person consulting organization, he works with organizations to identify where and how to start their big data journeys. He’s written white papers, is an avid blogger and is a frequent speaker on the use of Big Data and data science to power an organization’s key business initiatives. He is a University of San Francisco School of Management (SOM) Executive Fellow where he teaches the “Big Data MBA” course. Bill also just completed a research paper on “Determining The Economic Value of Data”. Onalytica recently ranked Bill as #4 Big Data Influencer worldwide. Bill has over three decades of experience in data warehousing, BI and analytics. Bill authored the Vision Workshop methodology that links an organization’s strategic business initiatives with their supporting data and analytic requirements. Bill serves on the City of San Jose’s Technology Innovation Board, and on the faculties of The Data Warehouse Institute and Strata. Previously, Bill was vice president of Analytics at Yahoo where he was responsible for the development of Yahoo’s Advertiser and Website analytics products, including the delivery of “actionable insights” through a holistic user experience. Before that, Bill oversaw the Analytic Applications business unit at Business Objects, including the development, marketing and sales of their industry-defining analytic applications. Bill holds a Masters Business Administration from University of Iowa and a Bachelor of Science degree in Mathematics, Computer Science and Business Administration from Coe College.