Entries tagged [opensource]

Tuesday Jun 02, 2015

The Apache® Software Foundation Announces Annual Report for 2014-2015 Fiscal Year

Open Source Leadership Grows with Accelerated Innovation and Adoption of Apache Projects

Forest Hill, MD —2 June 2015— The Apache® Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 350 Open Source projects and initiatives, announced today the availability of its annual report for its 2014-2015 fiscal year, which ended 30 April 2015.

Momentum continues to accelerate under the Apache umbrella, including:

Open Source Leadership --

  • 20th anniversary of the Apache HTTP Server Project
  • 15th anniversary of The Apache Software Foundation
  • More than 16,000 code commits per month
  • 831 Individual Contributor License Agreements (CLA), 58 Corporate CLAs, and 41 Software Grants signed 

Innovation --

  • More than 350 Projects in development
  • 17 new Apache Top-level Projects (TLPs)
  • 41 podlings in the Apache Incubator
  • Facilitated contribution with 317 Git and 296 Subversion repositories 

Community --

  • ASF Membership exceeded 600 individuals
  • 163 TLPs overseen by more than 2,000 Project Management Committee (PMC) members
  • Held hundreds of events globally, including ApacheCon Europe and North America, as well as countless conferences, workshops, and regional MeetUps
  • Mentored hundreds of students in "The Apache Way" of software development for the 11th consecutive year through the Google Summer of Code

Operations --

  • Bolstered Infrastructure, with 99.57% total uptime across all Apache services
  • Launched new "Powered by Apache" graphics and revised apache.org look-and-feel
  • Provided trademarks, brand management, and legal support for dozens of existing and new projects
  • Rise in corporate support, with 39 Sponsors
  • The ASF exited FY 2014 with revenue at $996K, ahead of projected budget


The full report is available online at http://s.apache.org/ZVp

# # #

© The Apache Software Foundation. "Apache" and "ApacheCon", are registered trademarks or trademarks of The Apache Software Foundation. All other brands and trademarks are the property of their respective owners.

Tuesday May 19, 2015

The Apache Software Foundation Announces Apache™ Drill™ 1.0

Thousands of users adopt Open Source, enterprise-grade, schema-free SQL query engine for Apache Hadoop®, NoSQL, and Cloud storage.

Forest Hill, MD --19 May 2015-- The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 350 Open Source projects and initiatives, announced today the availability of Apache™ Drill™ 1.0, the schema-free SQL query engine for Apache Hadoop®, NoSQL, and Cloud storage.

"The production-ready 1.0 release represents a significant milestone for the Drill project," said Tomer Shiran, member of the Apache Drill Project Management Committee. "It is the outcome of almost three years of development involving dozens of engineers from numerous companies. Apache Drill's flexibility and ease-of-use have attracted thousands of users, and the enterprise-grade reliability, security and performance in the 1.0 release will further accelerate adoption."

With the exponential growth of data in recent years, and the shift towards rapid application development, new data is increasingly being stored in non-relational, schema-free datastores including Hadoop, NoSQL and Cloud storage. Apache Drill revolutionizes data exploration and analytics by enabling analysts, business users, data scientists and developers to explore and analyze this data without sacrificing the flexibility and agility offered by these datastores. Drill processes the data in-situ without requiring users to define schemas or transform data.

"Drill introduces the JSON document model to the world of SQL-based analytics and BI" said Jacques Nadeau, Vice President of Apache Drill. "This enables users to query fixed-schema, evolving-schema and schema-free data stored in a variety of formats and datastores. The architecture of relational query engines and databases is built on the assumption that all data has a simple and static structure that’s known in advance, and this 40-year-old assumption is simply no longer valid. We designed Drill from the ground up to address the new reality.”

Apache Drill's architecture is unique in many ways. It is the only columnar execution engine that supports complex and schema-free data, and the only execution engine that performs data-driven query compilation (and re-compilation, also known as schema discovery) during query execution. These unique capabilities enable Drill to achieve record-breaking performance with the flexibility offered by the JSON document model.

The business intelligence (BI) partner ecosystem is embracing the power of Apache Drill. Organizations such as Information Builders, JReport (Jinfonet Software), MicroStrategy, Qlik®, Simba, Tableau, and TIBCO, are working closely with the Drill community to interoperate BI tools with Drill through standard ODBC and JDBC connectivity. This collaboration enables end users to explore data by leveraging sophisticated visualization tools and advanced analytics.

"We've been using Apache Drill for the past six months," said Andrew Hamilton, CTO of Cardlytics. "Its ease of deployment and use along with its ability to quickly process trillions of records has made it an invaluable tool inside Cardlytics. Queries that were previously insurmountable are now common occurrence. Congratulations to the Drill community on this momentous occasion." 

"Drill's columnar execution engine and optimizer take full advantage of Apache Parquet's columnar storage to achieve maximum performance," said Julien Le Dem, Technical Lead of Analytics Data Pipeline at Twitter and Vice President of Apache Parquet. "The Drill team has been a key contributor to the Parquet project, including recent enhancements to Parquet types and vectorization. The Drill team’s involvement in the Parquet community is instrumental in driving the standard."

"Apache Drill 1.0 raises the bar for secure, reliable and scalable SQL-on-Hadoop," said Piyush Bhargava, distinguished engineer, IT, Cisco Systems. "Because Drill integrates with existing data virtualization and visualization tools, we expect it will improve adoption of self-service data exploration and large-scale BI queries on our advanced Hadoop platform at Cisco."

"MicroStrategy recognized early on the value of Apache Drill and is one of the first analytic platforms to certify Drill," said Tim Lang, senior executive vice president and chief technology officer at MicroStrategy Incorporated.  "Because Drill is designed to be used with a minimal learning curve, it opens up more complex data sets to the end user who can immediately visualize and analyze new information using MicroStrategy’s advanced capabilities."

"Apache Drill closes a gap around self-service SQL queries in Hadoop, especially on complex, dynamic NoSQL data types," said Mike Foster, Strategic Alliances Technology Officer at Qlik.  "Drill's performance advantages for Hadoop data access, combined with the Qlik associative experience, enables our customers to continue discovering business value from a wide range of data. Congratulations to the Apache Drill community."

"Apache Drill empowers people to access data that is traditionally difficult to work with," said Jeff Feng, product manager, Tableau.  "Direct access within a centralized data repository and without pre-generating metadata definitions encourages data democracy which is essential for data-driven organizations. Additionally, Drill's instant and secure access to complex data formats, such as JSON, opens up extended analytical opportunities."

"Congratulations to the Apache Drill community on the availability of 1.0," said Karl Van den Bergh, Vice President, Products and Cloud at TIBCO. "Drill promises to bring low-latency access to data stored in Hadoop and HBase via standard SQL semantics. This innovation is in line with the value of Fast Data analysis, which TIBCO customers welcome and appreciate."

"The community's accomplishment is a testament to The Apache Software Foundation's ability to bring together diverse companies to work towards a common goal. None of this would have been possible without the contribution of engineers with advanced degrees and experience in relational databases, data warehousing, MPP, query optimization, Hadoop and NoSQL," added Nadeau. "Our community's strength is what will solidify Apache Drill as a key data technology for the next decade. We welcome interested individuals to learn more about Drill by joining the community's mailing lists, attending upcoming talks by Drill code committers at various conferences including Hadoop Summit, NoSQL Now, Hadoop World, or at a local Apache Drill MeetUp."

Availability and Oversight
Apache Drill 1.0 is available immediately as a free download from http://drill.apache.org/download/. Documentation is available at http://drill.apache.org/docs/. As with all Apache products, Apache Drill software is released under the Apache License v2.0, and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the project's day-to-day operations, including community development and product releases. For ways to become involved with Apache Drill, visit http://drill.apache.org/ and @ApacheDrill on Twitter.

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than 350 leading Open Source projects, including Apache HTTP Server --the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 500 individual Members and 4,500 Committers successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Bloomberg, Budget Direct, Cerner, Citrix, Cloudera, Comcast, Facebook, Google, Hortonworks, HP, IBM, InMotion Hosting, iSigma, Matt Mullenweg, Microsoft, Pivotal, Produban, WANdisco, and Yahoo. For more information, visit http://www.apache.org/ or follow @TheASF on Twitter.

© The Apache Software Foundation. "Apache", "Apache Drill", "Drill", "Apache Hadoop", "Hadoop", "Apache Parquet", "Parquet", and "ApacheCon", are registered trademarks or trademarks of The Apache Software Foundation. All other brands and trademarks are the property of their respective owners.

# # #

Tuesday Feb 03, 2015

Apache™ PDFBox™ named an Open Source Partner Organization of the PDF Association

Liaison helps enterprise users benefit from enhanced PDF technology and serves as a foundation to other software applications.

Forrest Hill, MD —03 February 2015— Apache PDFBox™, a Top-Level Project of The Apache Software Foundation, today announced the project has been named as a Partner Organization of the PDF Association. 

The Apache PDFBox™ library is an Open Source Java tool for working with Portable Document Format (PDF) documents. It allows for the creation of new PDF documents, manipulation, rendering, signing of existing documents and the ability to extract content from documents. Apache PDFBox also includes several command line utilities.

"We are proud to be recognized by the PDF Association as a driver of ISO-­standardized PDF technology for electronic documents," said Andreas Lehmkühler, Vice President of Apache PDFBox. "Our liaison will help further strengthen the development of Apache PDFBox by providing access to knowledge and resources around PDF technology."

PDF was first released by Adobe Systems in 1993, became an ISO International Standard - ISO 32000-1 in 2008.

Founded in 2006, the PDF Association (http://www.pdfa.org/) is an international organization promoting awareness and adoption of open standards in digital document applications using PDF technology. The association facilitates education, networking, communication, and sharing of expertise and experience with interested parties worldwide. It offers its membership of over 150 enterprises and individual subject-matter experts from more than 20 countries direct contact with PDF technology experts and access to documents from ISO working groups, including release candidates for PDF upcoming standards.

The PDF Association's Partner Organizations are international associations concerned with document management, enterprise content management, long-term archiving and accessibility. Apache PDFBox the PDF Association's first Open Source Partner Organization. The PDF Association delivers vital information about implementing PDF technology to software developers and IT decision makers, and helps document management and ECM implementers understand and leverage ISO-standardized PDF technology. In turn, enterprise systems implementers and end-users benefit from enhanced PDF technology.

"With Apache PDFBox, the first Open Source organization is joining our Association," said Thomas Zellmann, Managing Director of the PDF Association. "This is our contribution to support the growth of freely available PDF solutions and their functionality to further expand the market penetration of the PDF standard."

"Being part of the PDF Association recognizes our commitment to making ISO standardized electronic document technology easily available by leveraging Apache PDFBox as a foundation for other software applications," added Lehmkühler.

About Apache PDFBox
The Apache PDFBox™ library is an Open Source Java tool that allows users to create new PDF documents, manipulate existing documents, extract content, digitally sign, print, and validate files against the PDF/A-1b standard. It also includes several command line utilities, including encrypt, decrypt, overlay, debugger, merger, PDFToImage, and TextToPDF. Apache PDFBox software is released under the Apache License v2.0, and is overseen by a self-selected volunteer team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. For downloads, documentation, and ways to become involved with Apache PDFBox, visit http://pdfbox.apache.org/

# # #


Tuesday Jan 27, 2015

The Apache Software Foundation Announces Apache™ Samza™ as a Top-Level Project

Open Source Big Data distributed stream processing framework used in business intelligence, financial services, healthcare, mobile applications, security, and software development, among other industries.

Forest Hill, MD –27 January 2015– The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 350 Open Source projects and initiatives, announced today that Apache™ Samza™ has graduated from the Apache Incubator to become a Top-Level Project (TLP), signifying that the project's community and products have been well-governed under the ASF's meritocratic process and principles.

"The incubation process at Apache has been great. It has helped us cultivate a strong community, and provided us with the support and infrastructure to make Samza grow," said Chris Riccomini, Vice President of Apache Samza.

Apache Samza is a distributed stream processing framework, designed to handle fault tolerance, stateful processing, message durability, and scalability. Samza helps users to write light-weight processors that consume streams of data from messaging systems such as Apache Kafka. These processors empower organizations to understand and react to their data in real-time. In addition, Samza uses Apache Hadoop YARN to provide fault tolerance, processor isolation, security, and resource management.

Samza represents a different approach to stream processing. It has been purpose-built first and foremost as a production-grade system with operability and scalability in mind. Samza integrates tightly with Apache Kafka, which makes it a natural fit to those already running Kafka in their data pipeline. The framework also introduces the concept of stateful processing and aggregation as a first-class feature. Stateful processing gives Samza developers a completely new paradigm for aggregating stream data. These features help organizations do high performance stream processing at scale.

Created to process tracking data, service log data, and for data ingestion pipelines for realtime services, Samza originated at LinkedIn, and was submitted to the Apache Incubator in July 2013. 

"LinkedIn is thrilled to see Apache Samza experience such strong adoption and now graduate to a Top-Level Project. Samza was developed to help solve some of LinkedIn's  toughest stream processing challenges and has become a central piece of our infrastructure," said Kevin Scott, Senior Vice President of Engineering and Operations at LinkedIn.

Apache Samza is used in an array of industries, applications, and organizations, including:
  • DoubleDutch, developers of mobile apps for events and conferences, uses Samza to power their analytics platform and stream data live into an event dashboard for real-time insights;
  • Forstcales' Big Data security analytics solutions use Samza to processes security events log as part of the data ingestion pipelines and on-line machine learning models creation process;
  • Happy Pancake, Northern Europe's largest internet dating service, uses Samza for all event handlers and data replication;
  • Advertising technology provider Improve Digital uses Samza as the foundation of a realtime processing capability performing data analytics and as the basis for an alerting system;
  • Jack Henry & Associates uses Samza to process user activity data across its Banno suite of products for financial institutions;
  • MobileAware uses Samza as a foundation for two mobile network products: real time analytics and multi channel notification (push, text message and HTML5);
  • Technology startup Project Florida uses Samza for real-time monitoring of data streams from wearable sensors, for preventative healthcare purposes;
  • Quantiply, providers of Cloud-based micro-applications, uses Samza to bring together user event, system performance, and business operational data for real-time visibility and decision support; and
  • Social media business intelligence solution VinTank uses Samza to power their analysis and natural language processing (NLP) pipeline.


"We've had great experiences with Samza at Improve Digital where it has enabled us to  build out our streaming data platform," said Garry Turkington, CTO of Improve Digital. "It's fantastic to see it graduate to a top-level project."

Jay Kreps, CEO of Confluent, said "Samza is a fantastic piece of infrastructure, and a great complement to Apache Kafka. We at Confluent are really excited to see it added as a top-level Apache project."

"Fortscale has been using Apache Samza successfully to build online machine learning algorithms and detect insider threats," said Dotan Patrich, Software Architect at Fortscale. "It's been a great experience building large scale streaming solution and using Samza's and enjoying it's unique state management architecture. It's fantastic to see it graduate to a Top-Level Project."

"I've been involved in Apache Samza's community since its inception. It's been thrilling to watch the community grow, and I'm very proud and excited to see that the project is graduating. Samza has a bright future, and I'm looking forward to what's to come," added Riccomini.

Availability and Oversight
As with all Apache products, Apache Samza software is released under the Apache License v2.0, and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. For documentation and ways to become involved with Apache Samza, visit http://samza.apache.org/ and @SamzaStream on Twitter

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than 350 leading Open Source projects, including Apache HTTP Server --the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 500 individual Members and 4,500 Committers successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Budget Direct, Cerner, Citrix, Cloudera, Comcast, Facebook, Google, Hortonworks, HP, Huawei, IBM, InMotion Hosting, iSigma, Matt Mullenweg, Microsoft, Pivotal, Produban, WANdisco, and Yahoo. For more information, visit http://www.apache.org/ or follow https://twitter.com/TheASF.

© The Apache Software Foundation. "Apache", "Apache Samza", "Samza", "Apache Hadoop", "Hadoop", "Hadoop YARN", "Apache Kafka", "Kafka", "ApacheCon", and the Apache Samza logo are trademarks of The Apache Software Foundation. All other brands and trademarks are the property of their respective owners.

# # #

The Apache Software Foundation Announces Apache™ BookKeeper™ as a Top-Level Project

Open Source distributed Big Data logging service and publish/subscribe system used to reliably log streams of records

Forest Hill, MD –27 January 2015– The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 350 Open Source projects and initiatives, announced today that Apache™ BookKeeper™ has graduated to become a Top-Level Project (TLP), signifying that the project's community and products have been well-governed under the ASF's meritocratic process and principles.

Apache BookKeeper was established in 2011 as a sub-project of Apache ZooKeeper™ (Open Source API for highly reliable distributed coordination) to reliably log streams of records. It serves as a building block for reliable system consistency and recovery, and can be used to turn any standalone service into a highly available replicated service.

With disk/server failure rates up to 10% annually, replication is a must in today's always-on Cloud and Big Data services. One way to build a replicated service is to ensure that all write operations to the service are copied to all replicas; Apache BookKeeper's replicated logging service is well suited for this purpose. A database may have two replicas to ensure availability: if one crashes, the other can continue to serve traffic. However, ensuring that the data in these two replicas is consistent is not an easy problem to solve. Unlike naive solutions that run into problems like deadlock and inconsistency when one or both of the replicas fail, BookKeeper uses a combination of quorum writes, fencing, and, when necessary, outsourcing of consensus to ZooKeeper to ensure no state will be lost in the case of a replica failure. BookKeeper can similarly be applied to different classes of systems, such as messaging systems, filesystems and transaction processing systems.

Apache BookKeeper is highly available (no single point of failure), and scales horizontally as more storage nodes are added. BookKeeper is used in production in many web scale companies. At Yahoo, it is used as the persistence layer for its Cloud messaging infrastructure, which delivers tens of billions of messages in a day. BookKeeper is used at Twitter as the replicated persistence backend for different messaging use cases, and is also used by Huawei as a shared storage in their solution for HDFS Namenode High Availability. 

"We're very proud to have BookKeeper become a Top-Level Project. It is a testament to the hard work that my fellow committers have put in over the years that the ASF would give us their stamp of approval," said Ivan Kelly, Vice President of Apache BookKeeper. "We hope that the increased exposure will bring even more contributions and use cases to the community."

Availability and Oversight
As with all Apache products, Apache BookKeeper software is released under the Apache License v2.0, and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. For documentation and ways to become involved with Apache BookKeeper, visit http://bookkeeper.apache.org and https://twitter.com/asfbookkeeper

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than 350 leading Open Source projects, including Apache HTTP Server --the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 500 individual Members and 4,500 Committers successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Budget Direct, Citrix, Cloudera, Comcast, Facebook, Google, Hortonworks, HP, Huawei, IBM, InMotion Hosting, Matt Mullenweg, Microsoft, Pivotal, Produban, WANdisco, and Yahoo. For more information, visit http://www.apache.org/ or follow @TheASF on Twitter.

© The Apache Software Foundation. "Apache", "Apache BookKeeper", "BookKeeper", ApacheCon", and the Apache BookKeeper logo are trademarks of The Apache Software Foundation. All other brands and trademarks are the property of their respective owners.

# # #

Monday Jan 19, 2015

The Apache Software Foundation Announces Apache™ Falcon™ as a Top-Level Project

Open Source Big Data processing and management solution for Apache Hadoop™ in use at Hortonworks, InMobi, and Talend, among others.

Forest Hill, MD –19 January 2015– The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 350 Open Source projects and initiatives, announced today that Apache™ Falcon™ has graduated from the Apache Incubator to become a Top-Level Project (TLP), signifying that the project's community and products have been well-governed under the ASF's meritocratic process and principles.

Apache Falcon is a data processing and management solution for Apache Hadoop™, designed for data motion, coordination of data pipelines, lifecycle management, and data discovery. Falcon provides enterprises higher quality and predictable outcomes for their data by enabling end consumers to quickly onboard their data and its associated processing and management tasks on Hadoop clusters. The platform is successfully deployed across various industries, including advertising, healthcare, mobile applications, software solutions, and technology.

"Apache Falcon solves a very important and critical problem in the big data space. Graduation to TLP marks an important step in progression of the project," said Srikanth Sundarrajan, Vice President of Apache Falcon. "Falcon has a robust road map to ease the pain of application developers and administrators alike in authoring and managing complex data management and processing applications."

"Graduation of Apache Falcon's is a proud moment for the community who came together to solve a very relevant problem of data processing and management in Hadoop ecosystem," said Mohit Saxena, CTO and co-founder InMobi, one of the largest users of Apache Falcon. "I also want to applaud the efforts of contributors, committers and user community who actively pitched in the development of Falcon and it is only because of their conviction and efforts project has graduated. I am hoping promotion of Falcon to TLP will increase the contribution and adoption across the community and help Falcon achieve newer heights." 

Falcon represents a significant step forward in the Hadoop platform by enabling easy data management. Users of Falcon platform simply define infrastructure endpoints, data sets and processing rules declaratively. These declarative configurations are expressed in such a way that the dependencies between these configured entities are explicitly described. This information about inter-dependencies between various entities allows Falcon to orchestrate and manage various data management functions.

"Falcon has evolved over the last couple of years into a mature data management solution for Apache Hadoop with many production deployments proving it to be very valuable for users to manage their data and associated processing on Hadoop clusters," said Venkatesh Seetharam, Apache Falcon Project Management Committee member. 

"As Hadoop usage patterns have matured, the highest value implementations are based on the data lake concept. Data lakes require prescriptive and reliable pipelines," explained Greg Pavlik, Vice President of Engineering at Hortonworks. "Apache Falcon represents the best and most mature --and therefore essential-- building block for modeling, managing and operating data lakes."

"Falcon has enabled our team to incrementally build up a complex pipeline comprised of over 90 processes and 200 feeds that would have been very challenging with Apache Oozie alone," said programmer Michael Miklavcic.

"I began to work on Falcon in my spare time for fun, but it quickly became interesting in relation to my job at Talend", said Jean-Baptise Onofré, Vice President of Apache Karaf and Software Architect at Talend. "As Talend DataIntegration provides features like CDC (Change Data Capture), and data notification, we are in the process of integrating Apache Falcon in Talend products." 

"Apache Falcon's graduation is a milestone for the project and a credit to its contributors. Its open, collaborative development has effected a robust community around software essential to the Hadoop ecosystem," said Chris Douglas, Falcon incubation mentor at the ASF. "By becoming a Top-Level Project, the ASF recognizes its demonstrated ability to self-govern. Congratulations to Falcon's users, to its contributors, and particularly to its new Project Management Committee on this achievement."

Availability and Oversight
As with all Apache products, Apache Falcon software is released under the Apache License v2.0, and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. For documentation and ways to become involved with Apache Falcon, visit http://falcon.apache.org/ and @ApacheFalcon on Twitter

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than 350 leading Open Source projects, including Apache HTTP Server --the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 500 individual Members and 4,500 Committers successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Budget Direct, Cerner, Citrix, Cloudera, Comcast, Facebook, Google, Hortonworks, HP, Huawei, IBM, InMotion Hosting, iSigma, Matt Mullenweg, Microsoft, Pivotal, Produban, WANdisco, and Yahoo. For more information, visit http://www.apache.org/ or follow https://twitter.com/TheASF.

© The Apache Software Foundation. "Apache", "Apache Falcon", "Falcon", "Apache Hadoop", "Hadoop", "Apache Oozie", "Oozie", "ApacheCon", and the Apache Falcon logo are trademarks of The Apache Software Foundation. All other brands and trademarks are the property of their respective owners.

# # #

Monday Jan 12, 2015

The Apache Software Foundation Announces Apache™ Flink™ as a Top-Level Project

Open Source distributed Big Data system for expressive, declarative, and efficient batch and streaming data processing and analysis

Forest Hill, MD –12 January 2015– The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 350 Open Source projects and initiatives, announced today that Apache™ Flink™ has graduated from the Apache Incubator to become a Top-Level Project (TLP), signifying that the project's community and products have been well-governed under the ASF's meritocratic process and principles.

Apache Flink is an Open Source distributed data analysis engine for batch and streaming data. It offers programming APIs in Java and Scala, as well as specialized APIs for graph processing, with more libraries in the making.

"I am very happy that the ASF has become the home for Flink," said Stephan Ewen, Vice President of Apache Flink. "For a community-driven effort, I can think of no better umbrella. It is great to see the project is maturing and many new people are joining the community."

Flink uses a unique combination of streaming/pipelining and batch processing techniques to create a platform that covers and unifies a broad set of batch and streaming data analytics use cases. The project has put significant efforts into making a system that runs reliably and fast in a wide variety of scenarios. For that reason, Flink contained its own type serialization, memory management, and cost-based query optimization components from the early days of the project.

Apache Flink has its roots in the Stratosphere research project that started in 2009 at TU Berlin together with the Berlin and later the European data management communities, including HU Berlin, Hasso Plattner Institute, KTH (Stockholm), ELTE (Budapest), and others. Several Flink committers recently started data Artisans, a Berlin-based startup committed to growing Flink both in code and community as 100% Open Source. More than 70 people have by now contributed to Flink.

"Becoming a Top-Level Project in such short time is a great milestone for Flink and reflects the speed with which the community has been growing," said Kostas Tzoumas, co-founder and CEO of data Artisans. "The community is currently working on some exciting new features that make Flink even more powerful and accessible to a wider audience, and several companies around the world are including Flink in their data infrastructure."

"We use Apache Flink as part of our production data infrastructure," said Ijad Madisch, co-founder and CEO of ResearchGate. "We are happy all around and excited that Flink provides us with the opportunity for even better developer productivity and testability, especially for complex data flows. It’s with good reason that Flink is now a top-level Apache project."

"I have been experimenting with Flink, and we are very excited to hear that Flink is becoming a top-level Apache project," said Anders Arpteg, Analytics Machine Learning Manager at Spotify.

Denis Arnaud, Head of Data Science Development of Travel Intelligence at Amadeus said, "At Amadeus, we continually seek for better improvement in our analytic platform and our experiments with Apache Flink for analytics on our travel data show a lot of potential in the system for our production use."

"Flink was a pleasure to mentor as a new Apache project," said Alan Gates, Apache Flink Incubator champion at the ASF, and architect/co-founder at Hortonworks. "The Flink team learned The Apache Way very quickly. They worked hard at being open in their decision making and including new contributors. Those of us mentoring them just needed to point them in the right direction and then let them get to work."

Availability and Oversight
As with all Apache products, Apache Flink software is released under the Apache License v2.0, and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. For documentation and ways to become involved with Apache Flink, visit http://flink.apache.org/ and @ApacheFlink on Twitter.

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than 350 leading Open Source projects, including Apache HTTP Server --the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 500 individual Members and 4,500 Committers successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Budget Direct, Cerner, Citrix, Cloudera, Comcast, Facebook, Google, Hortonworks, HP, Huawei, IBM, InMotion Hosting, iSigma, Matt Mullenweg, Microsoft, Pivotal, Produban, WANdisco, and Yahoo. For more information, visit http://www.apache.org/ or follow @TheASF on Twitter.

© The Apache Software Foundation. "Apache", "Apache Flink", "Flink", ApacheCon", and the Apache Flink logo are trademarks of The Apache Software Foundation. All other brands and trademarks are the property of their respective owners.

# # #

Friday Dec 19, 2014

ASF publishes long-overdue Code Of Conduct

tl;dr: The ASF has published a Code of Conduct

We pride ourselves at The Apache Software Foundation on our principles of "community over code" and "don't be a jerk". But, alas, we've been slow to codify some of these things in public. Part of this, I'm sure, is that it’s easy to think we all just know how we're supposed to treat people, and so you shouldn't have to say, right?


But, of course, you do have to say. In part because some people don't know. And in part because it’s important that we communicate our values to the people in our community, and to people who might be considering joining our community. There has been a recent push in tech circles to include a Code of Conduct at events, conferences, etc. (Ashe Dryden maintains an introductory resource for learning more about how Codes of Conduct can help.) Increasingly, open source projects are adopting a Code of Conduct too, and we think this is a good idea that could help improve open source as a whole.

At ApacheCon, I was approached by Joan Touzet, an active member of the Apache CouchDB community, who had noted that we referenced a Code of Conduct on the main ASF website, but that no such document actually existed anywhere on our site. CouchDB has devoted a lot of time over the last few months crafting their Code of Conduct. It addresses everything from what's acceptable on the mailing lists, to how to report it if someone isn’t upholding community standards. This seemed like a great starting point, and so the ASF has adopted this as our initial Code of Conduct, with minor edits that remove the CouchDB-specific language. (It is my understanding that the CouchDB community now intends to use the Foundation level Code of Conduct, and will work with us to bring additional improvements to it.) 

No doubt, we'll get criticism for being so slow to do this, and we accept that. But it's never too late to take steps in the right direction, and we feel that this is an important one. Not just for the ASF, but for all open source projects and organisations.

You are encouraged to join the conversation on the Community Development mailing list. Whether you have changes you'd like to see in that document, or whether you'd like to discuss any other aspect of the Apache community. Any sort of community discussion topic is welcome. For example, Noah Slater, also from the CouchDB community, brought up the subject of punitive measures for infractions, which is an important but difficult issue. We'd love to hear your perspective on this, and help us continue to move in the right direction.

--Rich Bowen, Executive Vice President

Tuesday Dec 09, 2014

The Apache Software Foundation Announces Apache™ MetaModel™ as a Top-Level Project

Dynamic, metadata-driven Open Source framework provides uniform data access and code consolidation across various data stores. 

Forest Hill, MD –09 December 2014– The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 200 Open Source projects and initiatives, announced today that Apache™ MetaModel™ has graduated from the Apache Incubator to become a Top-Level Project (TLP), signifying that the project's community and products have been well-governed under the ASF's meritocratic process and principles. 

"It's a great privilege for us to have MetaModel graduated to a Top Level Project at Apache. It makes us proud and excited about welcoming more people into our community of coders and users," said Kasper Sørensen, Vice President of Apache MetaModel. "We've learned a lot about the Apache Way since entering the Apache Incubator in July 2013." 

Apache MetaModel is a data access framework that provides a common interface for the discovery, exploration, and querying of different types of data sources. Unlike traditional mapping frameworks, MetaModel emphasizes metadata of the data source itself and the ability to add more data sources at runtime. MetaModel's schema model and SQL-like query API is applicable to databases, CSV files, Excel spreadsheets, NoSQL databases, Cloud-based business applications, and even regular Java objects. This level of abstraction makes MetaModel great for dynamic data processing applications, less so for applications modelled strictly around a particular domain. 

MetaModel is so called as it's a model for interacting with data based on metadata, enabling developers to go above the physical data layer and apply their application to just about any data. 

"MetaModel enables you to consolidate code and consolidate data a lot quicker than any other library out there," Sørensen explained. "In these 'Big Data days' there's a lot of focus on performance and scalability, and surely these topics also surround Apache MetaModel. The Big Data challenge is not always about massive loads of data, but instead massive variation and feeding a lot of different sources into a single application. Now to make such an application you both need a lot of connectivity capabilities and a lot of modelling flexibility. Those are the two aspects where Apache MetaModel shines. We make it possible for you to build applications that retain the complexity of your data – even if that complexity may change over time. The trick to achieve this is to model on the metadata and not on your assumptions." 

"The performance and flexibility of Apache MetaModel is a key building block for us to improve the usability and power for the thousands of users of DataCleaner – the leading Open Source data quality solution, supported by Neopost," said Enno Ebels, Executive Vice President of Customer Information Management at Neopost. 

"It's been a joy to follow the growth in the community and in functionality," added Sørensen. "Over the last year we've introduced connectivity for Apache HBase, JSON files, ElasticSearch, Apache Cassandra and a whole lot more. It's always a great pleasure to see the excitement in people's eyes when they realize that you can develop for these data sources using the same API." 

"Apache MetaModel is the core technology used underneath our MDM offering at Human Inference, providing us an abstraction layer above the different database schemes we currently support, including Postgres, DB2, Oracle, SQL Server, and ElasticSearch," said Ankit Kumar, Technical Lead at Human Inference and Member of the Apache MetaModel Project Management Committee.

"The MetaModel query language helps us write code agnostic of the underlying database. Within our MDM offering we have even implemented some virtual data stores using MetaModel," said Winfried van Holland, CTO of Neopost Customer Information Management. "These expose our data model in a custom view for our consultants - stripping away the technical complexities and exposing the business value in a data model that is natural for the business people to consume."

"Apache MetaModel is a key technology in Stratio Datavis, allowing us to manage metadata and create SQL-based connectors for a bunch of data stores," said David Morales, Big Data Architect at Stratio. "Thanks to Apache MetaModel, Datavis users can create beautiful dashboards using their SQL skills, instead of knowing several query languages. That's why we are proud to be contributors of MetaModel and we will continue to collaborate with this great project." 

Availability and Oversight 
As with all Apache products, Apache MetaModel software is released under the Apache License v2.0, and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. For documentation and ways to become involved with Apache MetaModel, visit http://metamodel.apache.org and https://twitter.com/ApacheMetaModel

About The Apache Software Foundation (ASF) 
Established in 1999, the all-volunteer Foundation oversees more than two hundred leading Open Source projects, including Apache HTTP Server --the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 500 individual Members and 4,500 Committers successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Budget Direct, Citrix, Cloudera, Comcast, Facebook, Google, Hortonworks, HP, Huawei, IBM, InMotion Hosting, Matt Mullenweg, Microsoft, Pivotal, Produban, WANdisco, and Yahoo. For more information, visit http://www.apache.org/ or follow @TheASF on Twitter

© The Apache Software Foundation. "Apache," "Apache MetaModel," "MetaModel," ApacheCon," and the Apache MetaModel logo are trademarks of The Apache Software Foundation. All other brands and trademarks are the property of their respective owners. 

# # # 

Wednesday Nov 19, 2014

The Apache Software Foundation Celebrates 15 Years of Open Source Innovation and Community Leadership

Apache has been at the forefront of dozens of today's industry-defining technologies and tools; nearly every end-user computing device has been touched by at least one Apache product.

Budapest, Hungary –19 November– At ApacheCon Europe, members of the Apache community commemorated The Apache Software Foundation (ASF)'s fifteenth anniversary and congratulated the people, projects, initiatives, and organizations that played a role in its success.

Recognized as the leader in community-led Open Source software development, the ASF was established to shepherd, develop, and incubate Open Source innovations "The Apache Way". Reflections on achievements over the past 15 years include:


Apache products power half the Internet, manage exabytes of data, execute teraflops of operations, store billions of objects in virtually every industry, and enhance the lives of countless users and developers worldwide. Apache projects power mission-critical applications in financial services, aerospace, publishing, big data, Cloud computing, mobile, government, healthcare, research, infrastructure, development frameworks, foundational libraries, and many other categories. Beginning with the Apache HTTP Server —the world's most popular Web server— Apache software has been at the forefront of dozens of today's industry-defining technologies and tools, playing an integral role in nearly every end-user computing device, from laptops to tablets to mobile phones.

Apache software is so ubiquitous that 50% of the top 10 downloaded Open Source products are Apache projects. The commercially-friendly and permissive Apache License v2 has become an industry standard within the Open Source world. The Apache License and open development model are widely recognized as among the best ways to ensure open standards gain traction and adoption. The ASF offers a vendor-neutral space in which to collaborate whilst enabling third parties to pursue almost any for-profit or not-for-profit business model. To date, hundreds of thousands of software solutions have been distributed under the Apache License.

Amazingly, this is achieved by an all-volunteer community comprising 505 individual Members and 4,081 Apache Committers collaborating across six continents. The ASF's day-to-day operating expenses are offset by the generous sponsorship of individual donors and corporate sponsors including Citrix, Cloudera, Comcast, Facebook, Google, Hortonworks, HP, IBM, Matt Mullenweg, Microsoft, and Yahoo.

"ASF @ 15" Timeline and Highlights follow. Visit http://apache.org/ and @TheASF on Twitter for more information.

Highlights: pre-1999
Brian Behlendorf started collecting patches to be applied to the last version of the NCSA http server. The Apache Group, consisting of 8 individuals, traded patches on a mailing list set up for the purpose. In April of 1995 the first public release of Apache (version 0.6.2) came out. Apache 1.0 released on December 1, 1995, and within a year surpassed NCSA as the most-used Web server.

Highlights: 1999
The ASF formally incorporates as a Delaware-based 501(c)(3) non-profit corporation from The Apache Group on 1 June. Original directors are: Brian Behlendorf (President), Ken Coar (VP Conferences), Roy T. Fielding (Chairman), Ben Hyde (VP Apache HTTP Server Project), Jim Jagielski (Secretary and EVP), Ben Laurie, Sameer Parekh, Randy Terbush (Treasurer), and Dirk-Willem van Gulik. New Apache Jakarta and XML Projects join the Apache HTTP Server Project. Board Committees on ASF Conferences, Licenses, and Security are formed. Discussions about ASF's role as an Open Source incubator address fostering new technologies such as Cocoon. The ASF receives numerous industry awards, including the ACM Software System Award, the Datamation Product of the Year, and LinuxWorld Editor's Award. ASF is listed in the Industry Standard's "100 Companies That Matter" and included in the ServerWatch Hall of Fame.

Highlights: 2000
Perl-Apache Project, as well as Apache PHP, Apache/TCL Project, and Apache Portable Runtime Project are established. Apache Struts, Batik, FOP, and Ant undergo "incubation". The ASF draws record attendance at the second ApacheCon in Orlando (the first-ever conference was held in San Francisco in 1998), and launches its first European event in London later that year.

Highlights: 2001
Apache Avalon, Commons, and Jetspeed/Portals undergo "incubation". Work begins on next version of the Apache License. The fourth ApacheCon is held in Santa Clara, where the ASF maxim of "Community Over Code" is widespread and collaborators meet in person for the very first time. The ASF receives the Internet Service Providers Association's Internet Industry Awards for "Best Software Supplier" Apache XML's Xalan-Java 1.2.2 is a finalist in the Best Java-XML Application category in the JavaWorld Editors' Choice Awards.

Highlights: 2002
Participation in The ASF booms; its process for community and collaborative development becomes known as "the Apache Way". New Board is formed: Greg Stein elected Chairman, Dirk-Willem van Gulik as President, Randy Terbush as Treasurer (later replaced that year by Chuck Murko), and Jim Jagielski as Executive Vice President/Secretary. Apache Jakarta launches sub-project BSF; the Apache Incubator Project is born: new projects include Apache Ant, Avalon, DB, Forrest, HC, POI, and TCL. Apache HTTP Server and Portable Runtime Project Management Committees are reestablished. New Board Committees on Infrastructure as well as Fundraising are formed. The ASF participates in the Java Community Process. The fifth ApacheCon takes place in Las Vegas. The first community-driven Apache Cocoon GetTogether is held.

Highlights: 2003
"Web 2.0" comes to the ASF; the Apache Web Services Project is formed. New projects in the Apache Incubator include Directory, Geronimo, Gump, James, Logging Services, Maven, Pluto, SpamAssassin, Tapestry, and XML Beans. Perl-Apache Project is renamed to the Apache Perl Project, and Cocoon becomes a Top Level Project. The sixth ApacheCon is held in Las Vegas, featuring an expo exchange with COMDEX. The Apache HTTP Server wins Best Server Software by Linux Format; Apache Ant wins Software Development Magazine Jolt Product Excellence and Productivity Award, the Java Pro Readers' Choice Award for Most Valuable Java Deployment Technology, as well as the JavaWorld Editors' Choice Award for "Most Useful Java Community-Developed Technology". JavaWorld also awards Apache Xerces-J Editors' Choice for "Best Java XML Tool". SpamAssassin wins the OSDir Editor's Choice Award. The Apache License v.1.2 is released; all products of the Foundation are required to be released under the new license.

Highlights: 2004
ASF Board members are re-elected: Greg Stein as Chairman, Dirk-Willem van Gulik as President, Chuck Murko as Treasurer, and Jim Jagielski as Executive Vice President/Secretary. The stable Apache License v.2.0 is released, and the ASF Contributor License Agreement (CLA) is expanded to accommodate corporate donations. New Apache projects in the Incubator include Beehive, Excalibur, Forrest, Gump, Hivemind, iBatis, Lenya, myFaces, Portals, SpamAssassin, Struts, wsrp4J (Portals sub-project), Xalan, XMLBeans, and XML Graphics. The Apache Commons project is terminated, as well as the Project Management Committee for Avalon. A New Public Relations Committee is established, and The ASF issues a formal response regarding alleged JBoss IP infringement in Geronimo. The PHP project amicably separates from The ASF, granting all rights and responsibilities pertaining to its codebases to the PHP Group. ApacheCon returns to Las Vegas for its seventh conference. Apache Ant wins the Java Developer's Journal "Editors' Choice Award".

Highlights: 2005
The ASF continues to be the community of choice to spearhead new innovations through its Incubator. Numerous projects in development include activeMQ, Apollo, Bridges, Continuum, Derby, Directory, Felix, Harmony, Roller, stdcxx, Synapse, and Xerces; Apache Lucene graduates as a Top Level Project. ApacheCon returns to Europe with the eighth conference held in Stuttgart, Germany, followed by ApacheCon US in San Diego. Tomcat receives the SD Software Development Readers' Choice Awards for "Best Open Source Tool"; Software Development Magazine's JOLT! Awards recognize Apache Jakarta and Tomcat.

Highlights: 2006
A new Board of Directors is elected: Greg Stein and Jim Jagielski are re-elected as Chairman and Executive Vice President/Secretary respectively; Sander Striker joins the Board as President, and Justin Erenkrantz is elected Treasurer. The Incubator matures, with new projects created to meet growing industry interest in Open Source solutions for enterprise resource planning and manage related business processes. Projects undergoing incubation are Abdera, Archiva, Cayenne, CXF, Hadoop, Harmony, HiveMind, Jackrabbit, MINA, ODE, OfBIZ, Open JPA, Open EJB, Qpid, Santuario, Shale, Tapestry, Tiles, and Velocity; Apache Cayenne, OFBiz, and Tiles graduate to become Top Level Projects later that year. The Apache Security Team is re-established, a new Testing project is established to oversee the creation of software related to the domain of software testing; in addition, and the ASF launches new Innovation Laboratories for the experimentation of new ideas without Project bylaws or community building requirements. The ASF hosts its tenth ApacheCon in Dublin, Ireland, followed by ApacheCon US in Austin, and launches ApacheCon Asia in Colombo, Sri Lanka. The Foundation establishes the Sponsorship program to help offset day-to-day operating expenses; donations are accepted by both individual and corporate contributors. SpamAssassin wins the Linux New Media Award, and Tapestry was awarded Sun's annual Duke's Choice Award for outstanding Java product innovation.

Highlights: 2007
The breadth and capability of The ASF is reflected in the largest changeover its Board members since its incorporation: Jim Jagielski is elected Chairman, Justin Erenkrantz as President, J. Aaron Farr as Treasurer, and Sam Ruby as Executive Vice President/Secretary. New projects continue to germinate, including Buildr, Camel, C++ Standard Library, Pig, Quetzalcoatl, ServiceMix, Synapse, and Tiles entering the Incubator; Apache ActiveMQ, Commons (Jakarta), Felix, HttpComponents, ODE, OpenEJB, OpenJPA, POI, Quetzalcoatl, Roller, ServiceMix, Turbine, and Wicket graduate as Top Level Projects. The ASF establishes a Legal Affairs Committee to manage legal policies, as well as a Travel Assistance Committee to provide financial support to select individuals otherwise unable to attend ApacheCon. The twelfth ApacheCon is successfully held in Amsterdam, The Netherlands, followed by ApacheCon US in Atlanta.

Highlights: 2008
The ASF re-elects Jim Jagielski, Justin Erenkrantz, and Sam Ruby to the Board as Chairman, President, and Secretary respectively; Sander Striker is elected Executive Vice President, and J. Aaron Farr is Treasurer. HBase, Hive, and Zookeeper enter the Incubator; Apache Abdera, Archiva, Buildr, Continuum, CouchDB, CXF, Hadoop, Qpid, and Tuscany become Top Level Projects. The Apache Attic is established to retire ASF projects that have reached their end of life through a scalable process. Apache user gatherings continue to gain popularity, with events hosted by projects that include Cocoon, Derby, Forrest, Hadoop, Jakarta, OfBIZ, Pig, Wicket, among others. The fourteenth ApacheCon is held in Amsterdam, The Netherlands, followed by ApacheCon US in New Orleans, where sixty members of the community participate in voluntourism efforts to help rebuild the City still suffering from the effects of Hurricane Katrina. ApacheCon US also marks the expansion of ASF-wide developer and user community events to include "unconferences" such as BarCamps, GetTogethers, Symposia, and the first ASF Meet Up in Beijing. Apache tops the Software Development Times 100 list of Industry Influencers for the third year running in the category of Application Servers, The ASF wins its third Member of the Year prize awarded by the Java Community Process Program Management Office, Apache SpamAssassin won the InfoWorld "Best Of Open Source Software" BOSSIE Award, Apache Directory Studio finishes as runner-up for the Eclipse Community Award's Best Open Source RCP Application, and barely six months under incubation, Sling wins the JAX Innovation Award.

Highlights: 2009
The ASF announces Ten Years of Apache; celebrates a decade of innovation in Open Source software and community development. Nearly 300 ASF Members collaborate successfully with more than 2,000 Committers; 68 Top Level Projects, 35 initiatives in the Incubator, and 23 Labs concepts are currently active at the Foundation. ApacheCon Europe 2009 was held 23-27 March in Amsterdam, with the Hackathon (face-to-face Apache project-related collaboration/development with ASF Members and Committers) open to the public and including another BarCamp. 10th Anniversary celebrations continued at ApacheCon US 2009, in Oakland 2-6 November, where both the Governor of California and the Mayor of Oakland congratulated Apache on its success and named 4 November "Apache Software Foundation Day".

Highlights: 2010
The ASF hits its millionth code commit with a revision milestone today with a commit by ASF Member Yonik Seeley on behalf of the Apache Lucene Project. Apache Aries, Avro, Cassandra, Click, ESME, HBase, Hive, jUDDI, Karaf, Mahout, Nutch, OODT, Pig, Pivot, Shindig, Shiro, Subversion, Thrift, Tika, Traffic Server, UIMA, ZooKeeper become Top-level Projects. Alois, Amber, Bean Validation, Celix, Chukwa, Deltacloud, Gora, Isis, Jena, Kitty, Lucy, ManifoldCF, Mesos, NPanday, Nuvem, OODT, OpenNLP, SIS, Stanbol, Wave, Whirr, and Zeta Components entered the Apache Incubator. Milestone project releases include Cassandra 0.6, Cayenne 3.0, FOP 1.0, Maven 3.0, SpamAssassin 3.3.0, and Tomcat 7.0. Apache Excalibur, iBatis, Quetzalcoatl, and WSIF Projects were retired to the Attic. The ASF launches "Apache Extras" (hosted by Google) to provide a "home-away-from-home" for code associated with Apache projects. The ASF issued Public Statements about Apache Harmony as well as Oracle's decision on the Java SE Technology Compatibility Kit's Field Of Use, and resigns from the Java Community Process Executive Committee. Shane Curcuru, Doug Cutting, Bertrand Delacretaz, Roy T. Fielding, Jim Jagielski, Sam Ruby, Noirin Shirley, Greg Stein, and Henri Yandell have been elected to serve on the ASF Board of Directors; Geir Magnusson, Jr., is named as replacement for Henri Yandell. ASF Director Greg Stein awarded O'Reilly Open Source Award at OSCON. New role of Executive Assistant has been created and staffed. 30 new ASF Members were elected this year. ASF Platinum Sponsors are Google, Microsoft, and Yahoo!; IBM joins Gold Sponsor Hewlett-Packard; Silver Sponsors are Cloudera, Progress Software and Springsource/VMWare, and Bronze Sponsors are BlueNog, Intuit, Joost, and Matt Mullenweg. ApacheCon North America took place in Atlanta, Georgia. BarCampApache in Sydney, Australia, was the first ASF-backed event to take place in the Southern Hemisphere.

Highlights: 2011
Apache ACE, Chemistry, Deltacloud, JMeter, Libcloud, River, Whirr became Top-level Projects. More projects than ever submitted to become part of the Apache community: Accumulo, Airavata, Ambari, Any23, AWF, Bigtop, Bloodhound, Cordova, DeltaSpike, DirectMemory, EasyAnt, Flex, Flume, Giraph, HCatalog, Kafka, Kalumet, Lucene.Net, MRUnit, ODF Toolkit, OGNL, Oozie,   OpenMeetings, OpenOffice, Rave, S4, and Sqoop entered the Incubator. Apache Alois retired from the Incubator. Apache Harmony, Jakarta, and Xindice moved to the Attic. Milestone project releases include Cassandra 0.7 and 1.0, Geronimo v3.0-beta-1, Pivot 2.0, Subversion 1.7.0, Tika 1.0, and Turbine 4.0-M1. Apache TomEE is certified as Java EE 6 Web Profile Compatible. Apache UIMA and Hadoop advance data intelligence and semantic capabilities of Watson, IBM's "Smartest Machine on Earth" demonstrated in first-ever man vs. machine competition on Jeopardy! quiz show. Apache Hadoop wins MediaGuardian’s "Innovator of the Year" award. The ASF accepted to become an Affiliate at the Open Source Initiative. New Executive Committee is appointed: Doug Cutting as Chair, Greg Stein as Vice Chair, Jim Jagielski as President, Noirin Plunkett as Executive Vice President, Sam Ruby as Vice President - Infrastructure, Craig L Russell as Secretary, Sam Ruby as Assistant Secretary, and Geir Magnusson, Jr., as Treasurer. The ASF is subpoenaed by the United Stated District Court to produce documents in Oracle America vs. Google related to the use of Apache Harmony code in the Android software platform, and the unsuccessful attempt by Apache to secure an acceptable license to the Java SE Technology Compatibility Kit. The ASF issues statement on Apache OpenOffice.org (the first mature, end-user-facing Apache project) and Open Letter to the Open Document Format Ecosystem clarifying that its code base was not pursued by the ASF prior to its acceptance into the Apache Incubator, and articulating the project’s vision within the wider Open Document Format ecosystem. 42 new ASF Members were elected, bringing the active membership to 370 individuals and 2,663 Apache Commiters world-wide. ASF Platinum Sponsors are Google, Microsoft, and Yahoo!; AMD, Facebook, and Hortonworks join Gold Sponsors Hewlett-Packard and IBM; PSW Group joins Silver Sponsors Cloudera, Progress Software and Springsource/VMWare; and Liip AG, Lucid Imagination, Talend, and WANdisco join Bronze Sponsors BlueNog, Intuit, Joost, and Matt Mullenweg. ApacheCon North America took place in Vancouver, Canada, marking the 25th event in the conference series.

Highlights: 2012
The ASF celebrated the 17th Anniversary of the Apache HTTP Server with the release of v2.4; the project maintains its standing as the world's most popular Web server, powering nearly 400 million sites. The Apache Incubator continues to gain momentum, with 85 podlings graduating over the past decade. Apache Accumulo, Airavata, Any23, Bigtop, BVal, Cordova, Creadur, DirectMemory, Empire-db, Flex, Flume, Giraph, Gora, Hama, ISIS, Jena, Kafka, Lucene.Net, Lucy, ManifoldCF, MRUnit, Oozie, OpenNLP, OpenOffice, Rave, SIS, Sqoop, Stanbol, Steve, Syncope, VCL, Wink, Wookie become Top-level Projects. Allura, Blur, CloudStack, Crunch, cTAKES, DeviceMap, Drill, Hadoop Development Tools, Helix, Marmotta, Ripple, Streams, and Syncope entered the Incbuator. Apache AWF, HISE, Kato, Kitty, and PhotArk retired from the Incubator. Milestone project releases include Deltacloud 1.0, Hadoop 1.0, Nutch 2.0, TomEE 1.0, Traffic Server 3.2, and Wicket 6.0. New Executive Committee is appointed: Doug Cutting as Chair, Greg Stein as Vice Chair, Jim Jagielski as President, Ross Gardler as Executive Vice President, Craig L Russell as Secretary, Chris Mattmann as Treasurer, and Sam Ruby as Assistant Secretary. ASF Officers that now serve at the direction of the President are: Vice President, Brand Management; Vice President, Fundraising; Vice President, Marketing and Publicity; and Vice President, Conference Planning. The office of Vice President, Java Community Process is dissolved. 46 new ASF Members were elected this year. Citrix became an ASF Sponsor, joining Facebook, Google, Microsoft, and Yahoo! at the Platinum level; AMD, Hortonworks, HP, IBM, and Matt Mullenweg at the Gold level; GoDaddy, Huawei, and In Motion Hosting joined Basis Technology, Cloudera, PSW GROUP, SpringSource, and WANdisco at the Silver level; and Intuit and Twitter joined BlueNog, Digital Primates, Intuit, Joost, Liip AG SA Ltd, Lucid Imagination, Talend, and Two Sigma Investments at the Bronze level. The ASF returned to Europe with the ApacheCon Europe Community Edition in Sinsheim, Germany, underwritten and hosted by SAP.

Highlights: 2013
Apache Ambari, Bloodhound, Chukwa, Clerezza, CloudStack, Crunch, cTAKES, Curator, DeltaSpike, Etch, Helix, jclouds, JSPWiki, Marmotta, Mesos, Oltu, Onami, OpenMeetings, graduate as Top-level Projects. Aurora, BatchEE, Curator, Falcon, jclouds, Knox, log4cxx2, MetaModel, MRQL, Olingo, Open Climate Workbench, Phoenix, Provisionr, Samza, Sentry, Sirona, Spark, Storm, Stratos, Tajo, Tez, Twill, Usergrid entered the Apache Incubator. Apache Provisionr retired from the Incubator. Milestone project releases include Cassandra 1.2 and 2.0, OpenOffice 4.0, and Subversion 1.8.0. Apache Struts 1 announces End-Of-Life, and recommends Struts 2 as successor. Apache C++ Standard Library (STDCXX), ESME, and XMLBeans moved to the Attic. The ASF issues a statement on Oracle's Technology Compatibility Kit License. Shane Curcuru, Doug Cutting, Bertrand Delacretaz, Roy Fielding, Jim Jagielski, Chris Mattmann, Brett Porter, Sam Ruby, and Greg Stein were elected to the ASF Board of Directors. The office of Vice President, Conference Planning, was dissolved; committee was renamed to Events Planning. 36 new ASF Members were elected, bringing the active membership to 468 individuals. ASF Sponsors are Citrix, Facebook, Google, Microsoft, and Yahoo! at the Platinum level; AMD, Hortonworks, HP, IBM, and Matt Mullenweg at the Gold level; Basis Technology, Cloudera, GoDaddy, Huawei, InMotion Hosting, PSW GROUP, SpringSource, and WANdisco at the Silver level; and BlueNog, Digital Primates, Intuit, Joost, Liip AG SA Ltd, Lucid Imagination, Talend, Twitter, and Two Sigma Investments at the Bronze level. Freie Universität Berlin (FUB) committed to become the provider of Apache server hosting and bandwidth in Europe. The ASF was accepted into the Google Summer of Code (GSoC) as a mentoring organization for the ninth consecutive year; hundreds of students have been mentored in "The Apache Way" under the guidance of the ASF Community Development Project, with many continuing to be long-term code committers on a variety of Apache projects, as well as some active program participants elected as ASF Members. ApacheCon North America took place in Portland, Oregon.

Highlights: 2014
The ASF exceeded 2 Million code commits: the two millionth revision was by ASF Member Daniel Kulp on behalf of the Apache CXF Project. The Apache HTTP Server remains the world's leading Web server: the Netcraft September Web Server Survey exceeded a billion Websites, stating "Apache truly dominates this market, with more than half of all active sites choosing to use Apache software". Interest in Apache's projects continued to boom, accelerating development and participation by 100% in four years: Apache Allura, Celix, Knox, Olingo, Open Climate Workbench, Phoenix, Spark, Storm, Stratos, Tajo, Tez, VXQuery became Top-level Projects. Argus, Brooklyn, Calcite, DataFu, Flink, HTrace, Ignite, Johnzon, Lens, Parquet, REEF, Slider, Tamaya, and Taverna entered the Apache Incubator. Milestone project releases included Cayenne 3.1, CloudStack 4.3, Log4j 2, SpamAssassin 3.4.0, and Spark 1.0. Apache Click was retired to the Attic. Apache OpenOffice reached a major adoption milestone with 100 million downloads. Apache TomEE won a Duke's Choice and Geek Choice Award; DeltaSpike, dubbed "the Swiss Army Knife of modern Java EE" won a Duke's Choice Award. The ASF Celebrated Document Freedom Day, with numerous Apache Projects supporting standards-based document accessibility and interoperability. Rich Bowen, Doug Cutting, Bertrand Delacretaz, Ross Gardler, Jim Jagielski, Chris Mattmann, Brett Porter, Sam Ruby, and Greg Stein were elected to the ASF Board of Directors. The ASF boasts 505 active Members and 4,081 Apache Committers. The ASF Infrastructure team continues to keep the ASF's multi-datacenter, multi-cloud deployment running 24x7x365 on multiple continents, distributing terabytes of artifacts per week and archiving more than 11 million Apache email messages. Apache's repositories changed greatly with the introduction of Git to the source code management system four years ago; since then the original Subversion repository had been decentralized and augmented with 268 Git repositories, and a robust GitHub presence with 564 different repositories. In addition, the Infrastructure team launched a new status service that provides extensive information about the health of the Apache infrastructure and activity within its projects, as well as a new code signing service for Java, Windows and Android applications for any Apache project to use to sign their releases. The ASF provided new "Powered by Apache" graphical assets for Apache projects, developers, and users to identify their affiliation with products and initiatives under the Apache umbrella. The ASF continues to flourish thanks to support from Platinum Sponsors Citrix, Facebook, Google, Matt Mullenweg, Microsoft, and Yahoo!; Gold Sponsors Cloudera, Comcast, HP, Hortonworks, and IBM; Silver Sponsors Budget Direct, Cerner, Huawei, InMotion Hosting, Pivotal, Produban, and WANdisco; and Bronze Sponsors Accor, Basis Technology, Bluehost, Cloudsoft Corporation, Samsung, Talend, and Twitter. The ASF decided to accept donations using Bitcoin, and received more than 90 transactions within 48 hours of opening its Bitcoin wallet. ApacheCon North America took place in Denver, Colorado, and ApacheCon Europe was held in Budapest, Hungary.

# # #

Wednesday Nov 05, 2014

The ASF @ 15 -- Sponsorship and Stewardship

Part 2 of a 3-part series celebrating 15 years of community-led development at The Apache Software Foundation.

The mission of The Apache Software Foundation is to provide software for the public good. We do this by providing services and support for many like-minded software project communities of individuals. As the Foundation grows (more than 150 top-level projects, and over 4,000 committers) so do the demands for services and support.

The Foundation does not pay for software development within its projects, nor does it influence the technical direction projects wish to take. However, the Foundation does provide technical services such as version control, mailing lists, web sites, issue trackers (and much more), as well as legal services such as intellectual property and brand management. We also have core marketing services to assist projects. All this costs money; and the amount it costs increases with each project we take on and each new service projects require.

In order to meet these costs, the Foundation accepts sponsorship from companies and individuals. However, this sponsorship does not buy influence over either the Foundation or its projects. The only way to influence our projects is to get involved with the project community and deliver valuable contributions that earn you individual merit and thus influence in that project.

So why do companies sponsor the ASF?

We've asked a number of our sponsors why the Foundation is important to them. As you might expect, there are a wide range of answers, but one common theme occurs across all sponsors. It can be boiled down to being assured that downstream reuse of our software is both a legally and strategically sound decision. Without the Foundation these very valuable software projects would not exist, at least not in the same form.

The Foundation provides a neutral space for companies, which might compete in the marketplace, to collaborate freely on Open Source software. This neutrality is protected by the fierce independence of the Foundation as it drives towards its mission of producing software for the public good (as opposed to the good for some subset of the public).

Balancing the Foundation's need to raise funds to support its projects whilst ensuring our projects remain independent of  those sponsors is a difficult task. However, we are lucky enough to have a large roster of sponsors who are very happy to donate with "no strings attached". Without those sponsors the Foundation could not exist and we thank them for their generosity.

Of course, most of our sponsors also contribute directly to one or more of our projects through code, documentation, and community management. Without these non-cash contributions our Foundation would be nothing more than an empty shell.

What do we use Sponsorship Money for?

The cost of running the Foundation is kept low by our extensive use of volunteers, even at the foundational level. As with the software development within our projects, all of our strategic decision-making roles are filled by volunteers who do not receive any payment from the Foundation itself.  All of our Vice Presidents, Directors, and other titled roles are members of our project communities. The success of the Foundation is personally important to them and therefore they contribute to that success. Our meritocratic system recognizes such individuals and ensures that the Foundation is run both for and by our project communities.

We do, however, spend money in supporting our projects. Our largest expense category is infrastructure which accounts for 63% of our budget in 2014-15. We have a number of infrastructure contractors who work tirelessly around the globe (and thus the clock) to ensure our distributed project teams can get on with their work without having to worry about the services they depend upon.

Our second largest budget line, at 10%, is marketing where we have a contractor who ensures prompt and appropriate responses to all press enquiries. A further 10% is spent on general administration (legal and bank fees, insurances, executive assistant and similar). The only other category over 5% is brand management which ensures our project brands remain independent of any individual commercial interests through trademark registration and related activities.

Looking to the Future

As the number of projects in the Foundation continues to grow we are looking to the future of our core services. As stewards of some of the world’s most popular Open Source software, we must ensure that our projects will continue to receive the same level of support as they have done during the last 15 years. However, it is not just the number of projects that puts a strain on the Foundations resources. A growing range of tools and services are needed for an Open Source project to be successful.

With our current raft of sponsors we are in very good shape. With our ever growing contributors to our projects we know we have an excellent source of volunteers to keep both our projects and our Foundation moving. That said, with more money and more volunteers there is always more we can do. We invite you to take some time to review our sponsorship programs and ask yourself if your employer might be interested. If you are looking to volunteer your time to one of our projects then take a look at our community development website.

--Ross Gardler, President

Tuesday Sep 30, 2014

The Apache Software Foundation Announces Apache™ Cayenne™ v3.1

Enterprise-grade Open Source Java framework for object relational mapping (ORM), persistence, and caching now easier to configure, with improved modularity and performance.

Forest Hill, MD –30 September 2014– The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 200 Open Source projects and initiatives, announced today the availability of Apache™ Cayenne™ v3.1, the Open Source Java framework for object relational mapping (ORM), persistence, and caching.

"With the launch of version 3.1, Apache Cayenne has continued to evolve its mature 12 year-old library by introducing 125 new features," said Andrus Adamchik, Vice President of Apache Cayenne.

Cayenne is an enterprise Java ORM with integrated support for caching, three-tier persistence, object lifecycles and workflow, inheritance, paging, on demand faulting, auditing and much more. As an object relational mapping library, Cayenne integrates applications to any SQL database available today, freeing solutions from being locked into one database engine. At the same time it improves performance through paging and caching, enforces data integrity and makes it dramatically faster for developers to build a reliable application.

Cayenne has a track record of solid performance in high-volume environments. Apache Cayenne is an exceptional choice for persistence services, and is in use at ish onCourse, National Hockey League, Nike, Unilever and the Law Library of Congress (the world's largest publicly-available legal index) as well as dozens of high-demand applications and Websites accessed by millions of users each day.

Apache Cayenne v3.1 is the result of 4 years of development. Notable new features and improvements include:
  • easier configuration and embedding in any type of application;
  • highly configurable runtime, enabled by one of the industry's smallest built-in Dependency Injection (DI) containers written specifically for Cayenne (and that co-exists with other DI/IoC, such as Apache Tapestry). It is also very easy to create more than one runtime, which opens interesting possibilities like multi-tenancy;
  • nearly all components now pluggable, making it very easy to create more than one runtime and easily change or extend internals of the stack declaratively --from cache provider to SQL log format to DataSource lookup strategy and much more;
  • improved ORM modularity to allow  projects to be included in libraries without assumptions about the target use. Different aspects of an application can now be modeled in separate mapping projects and combined in runtime as needed. As a result Cayenne projects can be included in libraries that make no assumptions about the target use;
  • extended persistent events model from simple per-object events to more higher-level "workflows" that can be configured with app-specific annotations on persistent classes. Cayenne ships with "cayenne-lifecycle" module that provides a few common examples of such workflows activated on data changes: data modifications audit, precision cache invalidation, etc.; and
  • performance optimizations for improved overall concurrency

"Developers who are seeking an alternate to EJB/Hibernate might find Cayenne's graphical modeler, reverse database engineering, easy to use query API and flexible context model a joy to work with," said Aristedes Maniatis, member of the Apache Cayenne Project Management Committee and CEO of ish.

"We use Apache Cayenne as the ORM for a large and complex budgeting project for around twenty government organizations," said Daniel Abrams, CEO of MassLight. "Cayenne is used to access and persist exhibit data, business validation rules, and account information, and has simplified the development process. A single Cayenne method call evaluates all changes in the user's context and generates all statements required to commit their changes within a single transaction without the developer having to write code to track the changes -- Cayenne does all the work. Since switching to Cayenne, there haven't been any faulting errors that tended to plague the previous version of the application because of the complex data model. This was one of the principal reasons for the switch to Cayenne and the data model has become significantly more complex now."

"We use Cayenne in our system to collect, quality control and distribute world coverage nautical charts to navies, pilots, inspectors and several thousand vessels," said Tore Halset, Development Manager at Electronic Chart Centre and PRIMAR. "We have been happy users of Apache Cayenne since 2005 and are now on version 3.1."

"Apache Cayenne is a core service in Avoka Transact, an engagement platform for multi-channel sales and service transactions," said Malcolm Edgar, Vice President of Engineering at Avoka.

"We use Apache Cayenne to support the Oracle, MySQL, and SQL Server databases. Cayenne provides the right blend of ORM capabilities and low level JDBC access when required. It has been a rock-solid technology for us."

In addition, Apache Cayenne's HTML documentation and tutorials have been completely revised and available in PDF for the first time.

"Our comprehensive documentation and vibrant, helpful user community are just what you need when you have questions about the internals of Cayenne or the best way to achieve your goals," added Adamchik.

Availability and Oversight
Cayenne v3.1 is available immediately as a free download from http://cayenne.apache.org/download.html. As with all Apache products, Apache Cayenne software is released under the Apache License v2.0, and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. For documentation and ways to become involved with Apache Cayenne, visit http://cayenne.apache.org/ and @ApacheCayenne on Twitter.

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than two hundred leading Open Source projects, including Apache HTTP Server --the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 450 individual Members and 4,000 Committers successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Budget Direct, Citrix, Cloudera, Comcast, Facebook, Google, Hortonworks, HP, Huawei, IBM, InMotion Hosting, Matt Mullenweg, Microsoft, Pivotal, Produban, WANdisco, and Yahoo. For more information, visit http://www.apache.org/ or follow @TheASF on Twitter.

© The Apache Software Foundation. "Apache", "Apache Cayenne", "Cayenne", "ApacheCon", and the Apache Cayenne logo are trademarks of The Apache Software Foundation. All other brands and trademarks are the property of their respective owners.

# # #

Friday May 30, 2014

The Apache Software Foundation Announces Apache™ Spark™ v1.0

Open Source large-scale, flexible, "Hadoop Swiss Army Knife" cluster computing framework offers enhanced data analysis and richer integration with other Apache projects

Forest Hill, MD –30 May 2014– The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 170 Open Source projects and initiatives, announced today the availability of Apache Spark v1.0, the super-fast, Open Source large-scale data processing and advanced analytics engine.

Apache Spark has been dubbed a "Hadoop Swiss Army knife" for its remarkable speed and ease of use, allowing developers to quickly write applications in Java, Scala, or Python, using its built-in set of over 80 high-level operators. With Spark, programs can run up to 100x faster than Apache Hadoop MapReduce in memory.

"1.0 is a huge milestone for the fast-growing Spark community. Every contributor and user who's helped bring Spark to this point should feel proud of this release," said Matei Zaharia, Vice President of Apache Spark.

Apache Spark is well-suited for machine learning,  interactive queries, and stream processing. It is 100% compatible with Hadoop's Distributed File System (HDFS), HBase, Cassandra, as well as any Hadoop storage system, making existing data immediately usable in Spark. In addition, Spark supports SQL queries, streaming data, and complex analytics such as machine learning and graph algorithms out-of-the-box.

New in v1.0, Apache Spark offers strong API stability guarantees (backward-compatibility throughout the 1.X series), a new Spark SQL component for accessing structured data, as well as richer integration with other Apache projects (Hadoop YARN, Hive, and Mesos).

Patrick Wendell, software engineer at Databricks and Apache Spark 1.0 release manager explained, "In addition to providing long-term stability for Spark's core APIs, this release contains a several new features. Spark 1.0 adds a unified submission tool for deploying applications on a local machine, Mesos, YARN, or a dedicated cluster. We've added a new module, Spark SQL, to provide schema-aware data modeling and SQL language support in Spark. Spark's machine learning library, MLLib, has been enhanced with several new algorithms. Spark’s streaming and graph libraries have also seen major updates. Across the board, we've focused on building tools to empower the data scientists, statisticians and engineers who must grapple with large data sets every day."

Spark was originally developed at UC Berkeley AMP Lab, and its ease of use has made it a go-to solution for both small and large enterprise environments across a wide range of industries, including Alibaba, ClearStory Data, Cloudera, Databricks, IBM, Intel, MapR, Ooyala, and Yahoo, among others. Not only are organizations rapidly adopting and deploying Apache Spark, many contributors are committing code to the project as well.

"Apache Spark is an important big data technology in delivering a high performance analytics solution for the IT industry and satisfying the fast-growing customer demand," said Michael Greene, Vice President and General Manager of System Technologies and Optimization at Intel. "Intel is proud to participate in its development and we congratulate the community on this release."

"At NASA, we're really excited to leverage Spark and its highly interactive analytic capabilities and the speedups offered by 1.0 along with Spark SQL are going to help out critical projects looking at measurement of Snow in the Western US and also on projects related to Regional Climate Modeling and in Model Evaluation for the U.S. National Climate Assessment related Activities," said Chris Mattmann, an ASF Director, Chief Architect, Instrument and Science Data Systems Section at NASA JPL, and Adjunct Associate Professor at the University of Southern California. "I'm looking forward to designing Spark-related projects in my Software Architectures and in my Search Engines courses at USC as well. The community is one of our most active at the ASF and the interest has really peaked and these guys are doing a great job."

"We're continuing to see very fast growth — 102 individuals have contributed patches to this release over the past four months, which is our highest number of contributors ever," added Zaharia.

Availability and Oversight
As with all Apache products, Apache Spark software is released under the Apache License v2.0, and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project’s day-to-day operations, including community development and product releases. For documentation and ways to become involved with Apache Spark, visit http://spark.apache.org/

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than one hundred and seventy leading Open Source projects, including Apache HTTP Server --the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 400 individual Members and 3,500 Committers successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Budget Direct, Citrix, Cloudera, Comcast, Facebook, Google, Hortonworks, HP, Huawei, IBM, InMotion Hosting, Matt Mullenweg, Microsoft, Pivotal, Produban, WANdisco, and Yahoo. For more information, visit http://www.apache.org/ or follow @TheASF on Twitter.

"Apache", "Spark", "Apache Spark", and "ApacheCon" are trademarks of The Apache Software Foundation. All other brands and trademarks are the property of their respective owners.

 # # #

Tuesday Oct 04, 2011

The Apache Software Foundation Announces Apache TomEE Certified as Java EE 6 Web Profile Compatible

Groundbreaking, lightweight, scalable, all-Apache stack ideal for use in enterprise-grade Cloud applications

The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of nearly 150 Open Source projects and initiatives, today announced that Apache TomEE has obtained certification as Java EE 6 Web Profile Compatible Implementation.

Making its certification debut at JavaOne, Apache TomEE (pronounced "Tommy") is the Java Enterprise Edition of Apache Tomcat (Tomcat + Java EE = TomEE) that unites several quality Java enterprise projects including Apache OpenEJB, Apache OpenWebBeans, Apache OpenJPA, Apache MyFaces and more.

"It is with great pride that we're announcing Apache TomEE as a certified implementation of the Java EE 6 Web Profile," said David Blevins, Vice President of Apache OpenEJB and original co-developer of TomEE. "Apache TomEE is the newest addition to the Java EE server space, standing alongside the likes of GlassFish, JBoss, and Apache Geronimo."

Developers build applications using Java EE-certified products to ensure portability across Java Enterprise Edition-compatible solutions. Apache TomEE is one of only six certified implementations available to the industry today.

Redefining Enterprise Cloud; Unifying Communities

The three core design objectives for TomEE were: 1) do not alter Tomcat; 2) maintain simplicity; and 3) avoid architecture overhead. This enables developers to quickly and easily build highly performant lightweight enterprise solutions using leading Apache projects without the need for complex modifications or customization. Apache TomEE's integration of Apache OpenWebBeans, Apache MyFaces, Apache ActiveMQ, Apache OpenJPA, and Apache CXFis simple, to-the-point, and focused on the singular task of delivering the Java EE 6 Web Profile in a minimalist fashion.

The simple, all-Apache stack is both incredibly light and fully embeddable, making it ideal for testing and usage in today's evolution of the enterprise Cloud, where the key to scalability is hundreds of tiny servers, as opposed to the traditional definition of how large your servers. Apache TomEE boasts groundbreaking performance in the following areas:

- Size: exceptionally small (about 24MB for the entire Web profile), consumes very little resources;

- Memory: TCK (Technology Compatibility Kit) passed with no additional memory settings beyond the default – a first in Java EE; and

- Speed: runs exceptionally fast in embedded mode: start/deploy/test/undeploy/stop in 2-3 seconds.

"No longer do developers have to ask 'Do we use Tomcat or Java EE?' at the start of a project, as has been the case for the last 10 years," explained Blevins. "These two camps have historically been separate, and certification is a major step in unifying these communities. With TomEE, developers can now retire untested legacy stacks and use a reliable product that doesn't deviate from the Tomcat that they know and love."

Blevins and members of the Apache OpenEJB community will be presenting several sessions, including "TomEE – Tomcat with a Kick", in the "Servers/Tomcat & Geronimo" track at ApacheCon, 7-11 November 2011, in Vancouver, Canada. To register, visit http://apachecon.com/

Availability and Oversight
Apache TomEE software is released under the Apache License v2.0, and is overseen by the Apache OpenEJB Project Management Committee (PMC) that guides the Project's day-to-day operations, community development, and product releases. Apache TomEE is certified on Amazon EC2 t1.micro, m1.small, and m1.large 32bit images; certification on 64bit EC2 images and other Cloud platforms are in the Project's future plans. Those Cloud vendors wishing to donate resources for TomEE to be certified on their platforms are encouraged to contact the Apache OpenEJB Project for information on how to participate. Apache TomEE source code, documentation, mailing lists, and related resources are available at http://openejb.apache.org/.

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees nearly one hundred fifty leading Open Source projects, including Apache HTTP Server -- the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 350 individual Members and 3,000 Committers successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(3)(c) not-for-profit charity, funded by individual donations and corporate sponsors including AMD, Basis Technology, Cloudera, Facebook, Google, HP, Hortonworks, IBM, Matt Mullenweg, Microsoft, PSW Group, SpringSource/VMware, and Yahoo!. For more information, visit http://www.apache.org/.

"Apache", "Apache OpenEJB", and "Apache TomEE" are trademarks of The Apache Software Foundation. All other brands and trademarks are the property of their respective owners.

# # #

Tuesday Sep 27, 2011

The Apache Software Foundation Announces 10th Anniversary of Apache Lucene

Powers smart search and indexing solutions for AOL, Apple, Comcast, Disney, IBM, LinkedIn, Twitter, Wikipedia, and more.

Forest Hill, MD – 27 September 2011 – The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of nearly 150 Open Source projects and initiatives, today announced the 10th anniversary of Apache Lucene.

The Lucene information retrieval software was first developed in 1997, entered the ASF as a sub-project of the Apache Jakarta project in 2001, and became a standalone, Top-Level Project (TLP) in 2005. Apache Top-Level Projects and their communities demonstrate that they are well-governed under the Foundation’s meritocratic, consensus-driven process and principles.

"Ten years ago, Apache provided Lucene a home where it could build a solid community. Today we can see the fruit of that community, both through the wide breadth of Lucene-based applications deployed, and through the depth of improvements to Lucene made in the past decade," said Doug Cutting, ASF Chairman and original Lucene creator.

Apache Lucene powers smart search and indexing for eCommerce, financial services, business intelligence, travel, social networking, libraries, publishing, government, and defense solutions.

"Lucene has changed the world by opening doors that didn't exist before it arrived on the Open Source scene,” said ASF Member and Apache Lucene Committer Erik Hatcher. “Lucene has massively disrupted the enterprise/proprietary search market, with wide adoption around the globe in every industry.”

Highly performant, Apache Lucene is in use across an array of applications, from mobile to Internet scale, and powers enterprise-grade search solutions for AOL, Apple, IBM (including its artificial intelligence-driven supercomputer Watson), LinkedIn, Netflix, Wikipedia, Zappos, and many other global organizations.

"When it arrived to ASF, Lucene immediately made a huge impact --Lucene was one of those technologies that made a whole generation of businesses possible-- it was fast, easy to use, free, and had a growing community of users and developers. Apache Lucene can be found in an amazing number of products and services we all know and use, as well as in products and services we have never heard of,” said ASF Member and Apache Lucene Committer Otis Gospodnetic.

"While it's been six years since I joined the Lucene community, the last two were certainly the most exciting,” said Simon Willnauer, Vice President of Apache Lucene.

Current Apache Lucene sub-projects are PyLucene and Open Relevance; other sub-projects, including Droids, Lucene.Net, and Lucy, have spun out of the project and are undergoing further development in the Apache Incubator with the intention of becoming standalone TLPs. Solr, the high-speed Open Source enterprise search platform, has merged into the Lucene project itself, whilst former Lucene sub-projects Hadoop, Mahout, Nutch, and Tika have all successfully graduated as autonomous Apache Hadoop, Apache Mahout, Apache Nutch, and Apache Tika TLPs.

Originally written in Java, Apache Lucene is available in many programming languages such as Perl, C#, C++, PHP, Python, and Ruby. “Now, 10 years later, Apache Lucene is backed by a large community of users, contributors and developers with incredible energy poured into Lucene every hour of every day of the year," said Gospodnetic, who is also co-author of Lucene in Action, and founder of Sematext International.

“Even after 10 years, it seems this blazing community and codebase hasn't reached its potential yet,” added Willnauer. “I'm proud to be part of this community and look forward to another decade of Open Source Search."

Hatcher, who is also co-author of Lucene in Action and co-founder of Lucid Imagination, added, “if you need search (and you do!), Lucene is the best core technology choice."

Hatcher, Willnauer, and other members of the Apache Lucene community will be presenting sessions on data handling and analytics –a.k.a. “Lucene and Friends”-- including what's upcoming in Apache Lucene 4.0 (with performance improvements up to 20,000% from previous versions and more) at ApacheCon, 7-11 November 2011, in Vancouver, Canada. To register, visit http://apachecon.com/

Availability and Oversight

Apache Lucene software is released under the Apache License v2.0, and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project’s day-to-day operations, including community development and product releases. Apache Lucene source code, documentation, mailing lists, and related resources are available at http://lucene.apache.org/.

About The Apache Software Foundation (ASF)

Established in 1999, the all-volunteer Foundation oversees nearly one hundred fifty leading Open Source projects, including Apache HTTP Server -- the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 350 individual Members and 3,000 Committers successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(3)(c) not-for-profit charity, funded by individual donations and corporate sponsors including AMD, Basis Technology, Cloudera, Facebook, Google, HP, Hortonworks, IBM, Matt Mullenweg, Microsoft, PSW Group, SpringSource/VMware, and Yahoo!. For more information, visit http://www.apache.org/.

"Apache" and “Apache Lucene” are trademarks of The Apache Software Foundation. All other brands and trademarks are the property of their respective owners.

# # #

Calendar

Search

Hot Blogs (today's hits)

Tag Cloud

Categories

Feeds

Links

Navigation