There is a disconnect in the Enterprise between desired state and actual outcomes when it comes to big data projects — by some measures 2 out of every 3 projects are not meeting expectations. What is driving the mismatch between interest and success?
This gap in expectations is due to a variety of factors, ranging from process alignment issues, to security concerns, to customers not defining a clear question to be answered from the onset of the project. Most commonly though, not identifying a clear and measurable Return on Investment (ROI) for new projects is the fatal flaw for many well-intentioned projects.
Without question, defining ROI objectives can be challenging. This is why it is often best for new projects for big data to start by addressing an operational efficiency use case like ETL offload to optimize existing Data Warehouses (DW). DW optimization is one of the more popular use cases due to the high ROI that can be attained by offloading ETL from your data warehouse. Gartner research has indicated that Data Transformation consumes 80% of database capacity and more worryingly, that 70% of Data Warehouses are performance and capacity constrained.
With such a clear cut opportunity for improvement, it is easy to see why many customers start their journey here. As noted though, even clear line of sight to a strong use case and ROI does not necessarily mean success. So how can customers improve their odds? Enter the Dell | Cloudera | Syncsort Data Warehouse Optimization – ETL Offload Reference Architecture, a tested and validated blueprint for implementation.
The work that Dell, Cloudera, and Syncsort have invested in optimizing, validating and supporting a focused reference architecture for ETL will help ensure that customers see results faster than going it alone. Using Dell R730XD servers ensures customers start with proven and highly performant building blocks powered by Intel architecture. Additionally, when combined with Cloudera’s distribution of Hadoop customers will ensure that they also have industry leading Hadoop software with the capabilities that enterprises will need for security, performance, and ecosystem support perspective as they grow their Hadoop deployments over time. Finally, with Syncsort DMX-h, customers are getting a truly optimized approach for managing their ETL processes, backed by over 40 years of experience. Additionally with Syncsort SILQ software, customers can accelerate their time to value with their easy to use interface which enables them to visually build new ETL flows as well as the ability to consume existing legacy ETL SQL scripts seamlessly.
This new solution brings together best of breed ingredients in a targeted and easily consumable way, which will serve as a faster on-ramp for our customers to begin on their big data journey. For new big data projects, using focused use case solutions like the Dell | Cloudera | Syncsort Data Warehouse Optimization – ETL Offload solution will allow customers to demonstrate a quick win with tangible results. This will allow them to create a track record of success and generate momentum for additional projects within their business.