Hortonworks and Hewlett Packard Enterprise accelerate Apache Spark

Hortonworks and Hewlett Packard Labs are working together to enhance Apache Spark, one of the most active Apache big data projects. The collaboration will center around an entirely new class of analytic workloads that benefit from large pools of shared memory.

Early results of the collaboration include the following:
  • Enhanced shuffle engine technologies: Faster sorting and in-memory computations, which has the potential to dramatically improve Spark performance.   
  • Better memory utilization: Improved performance and usage for broader scalability, which will help enable new large-scale use cases.

 

“This collaboration indicates our mutual support of and commitment to the growing Spark community and its solutions,” said Scott Gnau, chief technology officer, Hortonworks.  “We will continue to focus on the integration of Spark into broad data architectures supported by Apache YARN as well as enhancements for performance and functionality and better access points for applications like Apache Zeppelin.” 

 

“We’re hoping to enable the Spark community to derive insight more rapidly from much larger data sets without having to change a single line of code,” said Martin Fink, EVP and CTO, Hewlett Packard Enterprise and Hortonworks Board Member. “We’re very pleased to be able to work with Hortonworks to broaden the range of challenges that Spark can address.”

 

Hortonworks and Hewlett Packard Enterprise plan to contribute the new technologies to the Apache Spark community.

An examination of how Atlassian’s Rovo and Teamwork Graph introduce AI-driven automation into...
Discover how Gamma Communications fosters relationships and supports charity at its annual Padel...
Belden expands its portfolio with new products and enhancements to strengthen IT/OT networks,...
Exploring the challenges faced by IT leaders in deploying AI, with emphasis on the essential role...
Bull and Hon Hai Technology Group (Foxconn) have announced a collaboration focused on the...
The new Vector Core Compute (VC2) platform combines technologies from SambaNova, Intel and NVIDIA...
VAST Data and Megaport collaborate to streamline AI workloads across hybrid and multicloud...
The DCA and the Carbon Trust are partnering to drive sustainable growth and transition to Net Zero...