Spring Certifies with Pivotal and Cloudera, Simplifying the Creation of Real-World Big Data Applications
Spring for Apache Hadoop and Spring XD enable Java developers to build portable big data applications quickly, unifying many disparate approaches in a single solution
SAN FRANCISCO, Calif. and PALO ALTO, Calif., October 29, 2013 - At the Strata Conference + Hadoop World 2013, Pivotal announced that Spring for Apache Hadoop, which is bundled with Spring XD, has been certified with Pivotal™ HD and Cloudera CDH products. This certification enables Java developers to use modern and familiar tools to build big data applications that work without modification across major Hadoop distributions, including Apache Hadoop 1.2.1 and 2.0.6 alpha, Pivotal HD 1.0, and Cloudera CDH 4.3.
Developing comprehensive big data applications can be difficult, because the open source ecosystem today typically relies on separate, independent projects for everything from stream analysis and batch workflow orchestration to data import and export. This leaves Java developers to wrestle with different programming, configuration and extensibility models.
In addition to helping support portability across Hadoop runtime environments, Spring for Apache Hadoop (SHDP) aims to help simplify the development of Hadoop-based applications. It provides a consistent configuration and API across a wide range of Hadoop ecosystem projects such as Pig, Hive, and Cascading, and adds extensions to Spring Batch for orchestrating Hadoop-based workflows.
SHDP, Spring Integration, Spring Batch and Spring Data are foundational libraries of the Spring IO Platform. Building on and extending this foundation, the Spring IO Platform provides a big data runtime named Spring XD (XD = eXtreme Data). Spring XD, available in an early milestone release, provides a single platform that comprehensively addresses common use cases in big data solutions, without the need to write code, just by using a domain-specific language (DSL). These use cases include data ingestion from external sources, data transformation and real-time analytics, data import/export to/from HDFS, and workflow orchestration.
These foundational parts of the Spring IO Platform make Hadoop development more accessible to a wider range of Java developers, including the massive Spring developer community, and make the process even faster for Hadoop experts.
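To give a rough sense of what a DSL-driven stream might involve, the following is a minimal, hypothetical Java sketch. It is not Spring XD code. It assumes a pipe-delimited definition string such as "http | filter | hdfs" (a convention not stated in this release) and shows how such a string could be broken into a source, optional processors and a sink. The class name, method names and module names are illustrative only.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch: not part of Spring XD or Spring for Apache Hadoop.
public class StreamDefinitionSketch {

    /** Splits an assumed pipe-delimited definition into trimmed module names. */
    static List<String> parseModules(String definition) {
        List<String> modules = new ArrayList<>();
        for (String part : definition.split("\\|")) {
            String name = part.trim();
            if (name.isEmpty()) {
                throw new IllegalArgumentException("Empty module in definition: " + definition);
            }
            modules.add(name);
        }
        if (modules.size() < 2) {
            throw new IllegalArgumentException("A stream needs at least a source and a sink");
        }
        return modules;
    }

    /** Labels the first module as the source, the last as the sink, the rest as processors. */
    static String describe(String definition) {
        List<String> modules = parseModules(definition);
        String source = modules.get(0);
        String sink = modules.get(modules.size() - 1);
        List<String> processors = modules.subList(1, modules.size() - 1);
        return "source=" + source + ", processors=" + processors + ", sink=" + sink;
    }

    public static void main(String[] args) {
        // Illustrative definition: ingest over HTTP, filter, write to HDFS.
        System.out.println(describe("http | filter | hdfs"));
    }
}
```

The sketch only handles bare module names. A real DSL would also need to parse module options and connect each stage to an actual runtime, which is the work the platform is described as taking off the developer's hands.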
"The combination of Spring for Apache Hadoop and Spring XD represents a major advancement for the Big Data ecosystem," said Peter Cooper-Ellis, vice president, Engineering for Cloudera. "We're pleased to see the debut of a higher-level, unified API for developers - which can only result in greater market adoption of Hadoop."
"Applications on Big Data are a critical emerging trend for enterprises to improve utility and increase value from their Big Data," said Adrian Colyer, chief technology officer, Applications at Pivotal. "This Spring XD advancement works to unlock Hadoop Big Data for the leading distributions for the masses of enterprise Java developers using Spring."
Spring XD is open source and available at no charge under an Apache 2.0 license, and can be obtained from the Spring.io website.
- Spring IO Platform
- Spring XD Project
- Spring for Apache Hadoop Project
- How Spring XD works
- Source code:
- Install Spring XD via Homebrew
brew tap pivotal/tap && brew install springxd
- Getting Started Guide: Creating a stream of live Twitter data using Spring XD on Mac OS
Pivotal, committed to open source and open standards, is a leading provider of application and data infrastructure software, agile development services, and data science consulting. Pivotal's revolutionary Enterprise PaaS product, powered by Cloud Foundry, will be available in Q4 2013. Learn more at www.gopivotal.com. Follow Pivotal on Twitter @gopivotalAPJ, LinkedIn, and G+.
Founded in 2008, Cloudera pioneered the business case for Hadoop with CDH, the world's most comprehensive, thoroughly tested and widely deployed 100% open source distribution of Apache Hadoop in both commercial and non-commercial environments. Now, the company is redefining data management with its Platform for Big Data, Cloudera Enterprise, empowering enterprises to Ask Bigger Questions™ and gain rich, actionable insights from all their data, to quickly and easily derive real business value that translates into competitive advantage. As the top contributor to the Apache open source community and leading educator of data professionals with the broadest array of Hadoop training and certification programs, Cloudera also offers comprehensive consulting services. Over 700 partners across hardware, software and services have teamed with Cloudera to help meet organizations' big data goals. With tens of thousands of nodes under management and hundreds of customers across diverse markets, Cloudera is the category leader that has set the standard for Hadoop in the enterprise. www.cloudera.com
This release contains "forward-looking statements" as defined under the Federal Securities Laws. Actual results could differ materially from those projected in the forward-looking statements as a result of certain risk factors, including but not limited to: (i) adverse changes in general economic or market conditions; (ii) delays or reductions in information technology spending; (iii) the relative and varying rates of product price and component cost declines and the volume and mixture of product and services revenues; (iv) competitive factors, including but not limited to pricing pressures and new product introductions; (v) component and product quality and availability; (vi) fluctuations in VMware, Inc.'s operating results and risks associated with trading of VMware stock; (vii) the transition to new products, the uncertainty of customer acceptance of new product offerings and rapid technological and market change; (viii) risks associated with managing the growth of our business, including risks associated with acquisitions and investments and the challenges and costs of integration, restructuring and achieving anticipated synergies; (ix) the ability to attract and retain highly qualified employees; (x) insufficient, excess or obsolete inventory; (xi) fluctuating currency exchange rates; (xii) threats and other disruptions to our secure data centers or networks; (xiii) our ability to protect our proprietary technology; (xiv) war or acts of terrorism; and (xv) other one-time events and other important factors disclosed previously and from time to time in the filings of EMC Corporation, the parent company of Pivotal, with the U.S. Securities and Exchange Commission. EMC and Pivotal disclaim any obligation to update any such forward-looking statements after the date of this release.