The Apache Software Foundation Blog

Tuesday March 26, 2019

20 Years of Open Source Innovation, The Apache Way

by Jim Jagielski and Sally Khudairi

As the world's largest and one of the most influential Open Source foundations, The Apache Software Foundation (ASF) is home to more than 350 community-led projects and initiatives. The ASF's 731 individual Members and more than 7,000 Committers are global, diverse, and often embodies a case of collective humility. We've assembled a list of 20 ubiquitous and up-and-coming Apache projects to celebrate the ASF's 20th Anniversary on 26 March 2019, applaud our all-volunteer community, and thank the billions of users who benefit from their Herculean efforts.


1. Apache HTTP Server
Web/Servers. http://httpd.apache.org/

The most popular Open Source HTTP server on the planet shot to fame just 13 months from its inception in 1995, and remains so today due to its ability to provide a secure, efficient and extensible server that provides HTTP services observing the latest HTTP standards. Serving modern operating systems including UNIX, Microsoft Windows, and Mac OS/X, the Apache HTTP Server played a key role in the initial growth of the World Wide Web; its rapid adoption over all other Web servers combined was also instrumental to the wide proliferation of eCommerce sites and solutions. The Apache HTTP Server project was the ASF's flagship project at its launch, and served as the basis upon which future Apache projects emulated with its open, community-driven, merit-based development process known as "The Apache Way".


2. Apache Incubator
Innovation. http://incubator.apache.org/

The Apache Incubator is the ASF's nexus for innovation, serving as the entry path for projects and codebases wishing to officially become part of the efforts at The Apache Software Foundation. All code donations from external organizations and existing external projects go through the incubation process to ensure all donations are in accordance with the ASF legal standards, and develop diverse communities that adhere to the ASF's guiding principles. Incubation is required of newly accepted projects until their infrastructure, communications, and decision making process have stabilized in a manner consistent with other successful ASF projects. Whilst incubation is neither a reflection of the completeness or stability of the code, nor does it indicate that the project has yet to be fully endorsed by the ASF, its rigorous process of mentoring projects and their communities according to "The Apache Way" has led to the graduation of nearly 200 projects in the Incubator's 16-year history. Today 51 "podlings" are undergoing development in the Apache Incubator across an array of categories, including annotation, artificial intelligence, Big Data, cryptography, data science/storage/visualization, development environments, Edge and IoT, email, JavaEE, libraries, machine learning, serverless computing, and more.


3. Apache Kafka
Big Data. https://kafka.apache.org/

The Apache footprint as the foundation of the Big Data ecosystem continues to grow, from Accumulo to Hadoop to ZooKeeper, with fifty active projects to date and two dozen more in the Apache Incubator. Apache Kafka's highly-performant distributed, fault tolerant, real-time publish-subscribe messaging platform powers Big Data solutions at Airbnb, LinkedIn, MailChimp, Netflix, The New York Times, Oracle, PayPal, Pinterest, Spotify, Twitter, Uber, Wikimedia Foundation, and countless other businesses.


4. Apache Maven
Build Management. http://maven.apache.org/

Spinning out of the Apache Turbine servlet framework project in 2004, Apache Maven has risen to the top as the hugely popular build automation tool that helps Java developers build and release software. Stable, flexible, and feature-rich, Maven streamlines continuous builds, integration, testing, and delivery processes with an impressive central repository and robust plug-in ecosystem, making it the go-to choice for developers who want to easily manage a project’s build, reporting, and documentation.


5. Apache CloudStack
Cloud Computing. http://cloudstack.apache.org/

Super-quick to deploy, well-documented, and with an easy production environment, one of the biggest draws to Apache CloudStack is that it "just works". Powering some of the industry's most visible Clouds –from global hosting providers to telcos to the Fortune 100 top 5% and more– the CloudStack community is cohesive, agile, and focused, leveraging 11 years of Cloud success to enable users to rapidly and affordably build fully featured clouds.


6. Apache cTAKES
Content. http://ctakes.apache.org/

Developed from real-world use at the Mayo Clinic in 2006, cTAKES was created by a team of physicians, computer scientists and software engineers seeking a natural language processing system for extraction of information from electronic medical record clinical free-text. Today Apache cTAKES is an integral part of the Mayo Clinic's electronic medical records and has processed more than 80 million clinical notes. Apache cTAKES is a growing standard for clinical data management infrastructure across hospitals and academic institutions that include Boston Children’s Hospital, Cincinnati Children’s Hospital, Massachusetts Institute of Technology, University of Colorado Boulder, University of Pittsburgh, and University of California San Diego, as well as companies such as Wired Informatics.


7. Apache Ignite
Data Management. https://ignite.apache.org/

Apache Ignite is used for transactional, analytical, and streaming workloads at petabyte scale for the likes of American Airlines, ING, Yahoo Japan and countless others on premises, on cloud platforms, or in hybrid environments. Apache Ignite's in-memory data fabric provides an in-memory data grid, compute grid, streaming, and acceleration solutions across the Apache Big Data system ecosystem, including Apache Cassandra, Apache Hadoop, Apache Spark, and more.


8. Apache CouchDB
Databases. http://couchdb.apache.org/

Thousands of organizations such as the BBC, GrubHub, and the Large Hadron Collider use Apache CouchDB for seamless data flow between every imaginable computing environment, from globally-distributed server clusters to mobile devices to Web browsers. Its Couch Replication Protocol allows you to store, retrieve, and replicate data safely on premises or on the Cloud with very high performance reliability. Apache CouchDB does all the heavy lifting so you can sit back and relax.


9. Apache Edgent (incubating)
Edge computing. http://edgent.incubator.apache.org/

The boom of IoT –personal assistants, smart phones, smart homes, connected cars, Industry 4.0 and beyond– is producing an ever-growing amount of data streaming from millions of systems, sensors, equipment, vehicles and more. The demand for reliable, efficient real-time data has driven the need for the "Empowered Edge", where data collection and analysis is optimized by moving away from centralized sources towards the edges of of the networks, where much of the data originates. Companies like IBM and SAP are leveraging Apache Edgent to accelerate analytics at the edge across the IoT ecosystem. Apache Edgent can be used in conjunction with many Apache data analytics solutions such as Apache Flink, Apache Kafka, Apache Samza, Apache Spark, Apache Storm, and more.


10. Apache OFBiz
Enterprise Resource Planning (ERP). https://ofbiz.apache.org/

Whereas most of the ASF projects are about running or creating infrastructure, we also realize the importance of running and handling a business. Apache OFBiz is a comprehensive suite of business applications from accounting and CRM through Warehousing and Inventory control. The Java based framework provides the power and the flexibility to serve as the core of one's B2B and B2C business management and is easily expandable and customizable. Apache OFBiz is a complete ERP solution, flexible, free, and fully Open Source and services users from United Airlines to Cabi.


11. Apache SIS (Spatial Information System)
Geospatial. http://sis.apache.org/

The US National Oceanic and Atmospheric Administration, Vietnamese National Space Center, numerous spatial agencies, governments, and others rely on Apache SIS to create their own intelligent, standards-based interoperable geospatial applications. The Apache SIS toolkit handles spatial data, location awareness, geospatial data representation, and provides a unified metadata model for file formats used for real-time smart city visualization, geospatial dataset discovery, state-of-the-art location-enabled emergency management, earth observation, as well as information modeling for extra-terrestrial bodies such as Mars and asteroids.


12. Apache Syncope
Identity Management. http://syncope.apache.org/

Apache Syncope manages digital identity data in enterprise applications and environments to handle user information such as username, password, first name, last name, email address, etc. Identity management involves considering user attributes, roles, resources and entitlements that control who access to what data, when, how, and why. Apache Syncope users include the Italian Army, the University of Helsinki, University of Milan, and the SWITCH Swiss university network.


13. Apache PLC4X (incubating)
Internet of Things (IoT). http://plc4x.incubator.apache.org/

Connectivity and integration across many Industrial IoT edge gateways is often impossible with closed-source, proprietary legacy systems with incompatible protocols. Apache PLC4X provides a universal protocol adapter for creating Industrial IoT applications through a set of libraries that allow unified access to any type of industrial programmable logic controllers (PLCs) using a variety of protocols with a shared API. In addition, the project is planning integrations modular to Apache IoT projects that include Apache Brooklyn, Apache Camel, Edgent, Apache Kafka, Apache Mynewt, and Apache NiFi.


14. Apache Commons
Libraries. http://commons.apache.org/

With 42%+ of Apache projects written in Java (that's 62+ million lines of code), having a set of stable, reusable Open Source Java software components available to all Apache projects and external users is both helpful and necessary. Apache Commons provides a suite of dozens of stable, reusable, easily deployed Java components, and a workspace for Commons contributors to collaborate on the development of new components.


15. Apache Spark
Machine Learning. http://spark.apache.org/

Big Data is growing exponentially each year, accelerated by industries such as agriculture, big business, FinTech, healthcare, IoT, manufacturing, mobile advertising and more. Apache Spark's unified analytics engine for processing and analyzing large-scale data processing helps data scientists apply machine learning insights and an array of libraries to improve responsiveness more accurate results. Apache Spark runs workloads 100x faster on Apache Hadoop, Apache Mesos, Kubernetes, whether standalone or in the cloud, and to access diverse data sources, from Apache Cassandra, Apache Hadoop HDFS, Apache HBase, Apache Hive, and hundreds of others.


16. Apache Cordova
Mobile. https://cordova.apache.org/

Apache Cordova is the popular developer tool used to easily build cross-platform, cross-device mobile apps using a Write-Once-Run-Anywhere solution, which enabling developers to create a single app that will appear the same across multiple mobile device platforms. Apache Cordova acts as an extensible container, and serves as the base that most mobile application development tools and frameworks are built upon, including mobile development platforms and commercial software products by Blackberry, Google, IBM, Intel, Microsoft, Oracle, Salesforce, and many others.


17. Apache Tomcat
Java/Servers. https://tomcat.apache.org/

Starting off as the Apache JServ project, designed to allow for Java "servlets" to be run in a Web environment, Tomcat grew to being a full-fledged, comprehensive Java Application server and was the de-facto reference implementation for the Java specifications. Since 2005, Apache Tomcat has formed, and still forms, the foundation of numerous Java-based web infrastructures such as eBay, E*Trade, WalMart, and The Weather Channel.


18. Apache Lucene/Solr
Search. http://lucene.apache.org/solr/

Adobe, AOL, Apple, AT&T, Bank of America, Bloomberg, Cisco, Disney, eTrade, Ford, The Guardian, Homeland Security, Instagram, MTV Networks, NASA Planetary Data System, Netflix, SourceForge, Verizon, Walmart, whitehouse.gov, Zappos, and countless others turn to Apache Lucene Solr to quickly and reliably index and search multiple sites and enterprise data such as documents and email. Popular features include near real-time indexing, automated failover and recovery, rich document parsing and indexing, user-extensible caching, design for high-volume traffic, and much more. 


19. Apache Wicket
Web Framework. http://wicket.apache.org/

The Apache Wicket component-based Web application framework is prized by many followers for its "Plain Old Java Object" (POJO) data model and markup/logic separation not common in most frameworks. Developers have been using Apache Wicket since 2004 to quickly create powerful, reusable components using object oriented methodology with Java and HTML. Wicket powers thousands of applications and sites for governments, stores, universities, cities, banks, email providers, and more, including Apress, DHL, SAP, Vodafone, and Xbox.com.


20. Apache Daffodil (incubating)
XML. http://daffodil.apache.org/

Governments handle massive amounts of complex and legacy data across security boundaries every day. In order for such data to be consumed, it must be inspected for correctness and sanitized of malicious data. Whilst traditional inspection methods are often proprietary, incomplete, and poorly maintained, Apache Daffodil streamlines the process with an Open Source implementation of the Data Format Description Language specification (DFDL) that fully describes a wide array of complex and legacy file formats down to the bit level. Daffodil can parse data to XML or JSON to allow for validation, sanitization, and transformation, and also serialize or ''unparse'' back to the original file format, effectively mitigating a large variety of common vulnerabilities.

The Apache Software Foundation is a leader in community-driven open source software and continues to innovate with dozens of new projects and their communities. Apache projects are managing exabytes of data, executing teraflops of operations, and storing billions of objects in virtually every industry. Apache software is an integral part of nearly every end user computing device, from laptops to tablets to phones. The commercially-friendly and permissive Apache License v2.0 has become an open source industry standard. As the demand for quality open source software continues to grow, the collective Apache community will continue to rise to the challenge of solving current problems and ideate tomorrow’s opportunities through The Apache Way of open development. Learn more at http://apache.org/

# # # 

Thursday March 21, 2019

The Apache Software Foundation Announces Apache® Unomi™ as a Top-Level Project

Powerful Open Source Customer Data Platform in use at Al-Monitor, Altola, Jahia, and Yupiik, among others. 

Wakefield, MA —21 March 2019— The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 350 Open Source projects and initiatives, announced today Apache® Unomi™ as a Top-Level Project (TLP).

Apache Unomi is a standards-based, Customer Data Platform (CDP) that manages online customer, leads, and visitor information to provide personalized experiences that adheres to visitor privacy rules such as GDPR and “Do Not Track” preferences. The project was originally developed at Jahia, and was submitted to the Apache Incubator in October 2015.

"I am truly thankful to our community, especially our mentors, who have helped us achieve this milestone," said Serge Huber, Vice President of Apache Unomi. "The original vision behind Unomi was to ensure true privacy by making the technologies handling customer data completely Open Source and independent. Since it was submitted to the Apache Incubator, developing Unomi using the Apache Way will ensure the project grows its community to be more diverse and welcome new users and developers."

Apache Unomi is versatile, and features privacy management, user/event/goal tracking, reporting, visitor profile management, segmentation, personas, A/B testing, and more. It can be used as:

  • a personalization service for a Web CMS;

  • an analytics service for  native mobile applications;

  • a centralized profile management system with segmentation capabilities; and

  • a consent management hub

Apache Unomi is the industry's first reference implementation of the upcoming OASIS CDP specification (established by the OASIS CXS Technical Committee, which sets standards as a core technology for enabling the delivery of personalized user experiences). As a reference implementation, Apache Unomi serves as a real world example of how the standard will be stable, and is quickly gaining traction by those interested in truly open and transparent customer data privacy. Apache Unomi is in use at organizations such as Al-Monitor, Altola, Jahia, Yupiik, and many others to create and deliver consistent personalized experiences across channels, markets, and systems.

"When Serge and I announced the launch of the Apache Unomi project at the 2015 ApacheCon Budapest, Apache Unomi, at that time, was the first proposal among the rising Customer Data Platform industry's segment, positioned as an 'ethical data-driven marketing' product that would respect the privacy of customers while leveraging the power of unified customers data," said Elie Auvray, Head of Business Development at Jahia. "Jahia's digital experience management solutions are based on Apache Unomi, and we can't wait to see how the project will now evolve with its growing community. Seeing today Apache Unomi becoming a Top-Level Project is a great reward for us as Open Source software believers. We are proud of this milestone, grateful to the Apache Software Foundation and our mentors, and we know it's only the beginning of a new –hopefully long and successful– journey."

"Under development at OASIS, the Customer Data Platform specification –for which Apache Unomi aims to be the reference implementation– lies at the crossroads of many solutions providers needs such as WCM, CRM, Big Data Platforms, Machine Learning, IoT and Digital Marketing," said Laurent Liscia, CEO of OASIS. "At a time when client data interoperability and built-in data privacy are mandatory foundations for legal, consistent, and personalized experiences across channel markets and systems, the CDP specification, together with Apache Unomi, is a clear and welcome answer to end-user concerns."

"Apache Unomi is the perfect solution to implement a user profile platform," said Jean-Baptiste Onofré, Fellow at Talend. "It fully addresses the user trust and privacy needs, allowing to easily create user profile and Web marketing features. As Unomi is powered by Apache Karaf, it's also a great platform for several use cases, such as digital marketing in Web applications, managing user profiles on IoT devices, and more."

"Apache Unomi enables Al-Monitor readers to be driven towards additional personalized content that corresponds, via content tags profiling and related automated segmentations, to what they have already accessed," said Valerie Voci, Head of Digital Strategy and Marketing at Al-Monitor. "This data follows our customers where they go, so it's a consistent experience whether they are getting these recommendations in their inbox or on the Website or both. And if a change takes place on one, that change is immediately reflected on the other. It helps us create a very cohesive marketing message and a great overall digital experience."

"As we were developing a progressive web app (PWA) for a client, we were looking for a Customer Data Platform (CDP) to store customer insights, such as behavioral and explicit customer data," said Lars Petersen, Co-Founder at Altola. "Privacy was table stake for us, along with the flexibility to customize data schema and open API. We selected Apache Unomi based on these parameters, we had it up and running on AWS in less than 30 min. and are very impressed with the maturity of the platform, its privacy by design and how easy it was to work with."

"In a digital world, customer data is very important to offer a better experience to users. However, data privacy and trust is not an option for users," said François Papon, CTO at Yupiik. "Apache Unomi is the best solution for our clients because it's an Open Source project managed by an independent foundation, there is no vendor lock-in. It's also based on other solutions like Apache Karaf that made it ready for modularity, scalability, cloud, devops, and more." 

"Apache Unomi is poised to disrupt the Customer Data Platform market," said Thomas Sigdestad, CTO at Enonic, and co-chair, with Serge Huber, of the CDP standards work at OASIS open. "The CDP marketplace is lacking from a standard way of exchanging data, and the vendor space is over-represented by closed source and proprietary cloud offerings. This effectively limits the potential and adoption of CDP in general. Apache Unomi is not merely Open Source, but also the reference implementation of the imminent CDP standard from OASIS. Companies using Unomi will benefit from faster and simpler integrations without locking their customer data into yet another proprietary silo." 

"Graduating as an Apache Top-Level Project is only the beginning," added Huber. "Unomi has a lot of potential that it still to be developed, and is a perfect opportunity for those interested in Customer Data Privacy to participate through our mailing lists and Slack channel, and to learn more about the project on our Website and presentations."

Catch Apache Unomi in action at ApacheCon North America (9-12 September 2019 in Las Vegas, Nevada), and ApacheCon Europe (22-24 October 2019 in Berlin, Germany) http://apachecon.com/ .

Availability and Oversight
Apache Unomi software is released under the Apache License v2.0 and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. For downloads, documentation, and ways to become involved with Apache Unomi, visit http://unomi.apache.org/

About the Apache Incubator
The Apache Incubator is the entry path for projects and codebases wishing to become part of the efforts at The Apache Software Foundation. All code donations from external organizations and existing external projects seeking to join the ASF enter through the Incubator to: 1) ensure all donations are in accordance with the ASF legal standards; and 2) develop new communities that adhere to our guiding principles. Incubation is required of all newly accepted projects until a further review indicates that the infrastructure, communications, and decision making process have stabilized in a manner consistent with other successful ASF projects. While incubation status is not necessarily a reflection of the completeness or stability of the code, it does indicate that the project has yet to be fully endorsed by the ASF. For more information, visit http://incubator.apache.org/

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than 350 leading Open Source projects, including Apache HTTP Server --the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 730 individual Members and 7,000 Committers across six continents successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Aetna, Alibaba Cloud Computing, Anonymous, ARM, Baidu, Bloomberg, Budget Direct, Capital One, Cerner, Cloudera, Comcast, Facebook, Google, Handshake, Hortonworks, Huawei, IBM, Indeed, Inspur, Leaseweb, Microsoft, ODPi, Pineapple Fund, Pivotal, Private Internet Access, Red Hat, Target, Tencent, Union Investment, Workday, and Verizon Media. For more information, visit http://apache.org/ and https://twitter.com/TheASF

© The Apache Software Foundation. "Apache", "Unomi", "Apache Unomi", and "ApacheCon" are registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. All other brands and trademarks are the property of their respective owners.

Wednesday March 20, 2019

The Apache Software Foundation Announces Apache® CloudMonkey® v6.0

Popular Open Source Command Line Interface tool that simplifies Apache CloudStack configuration and management now faster and easier to use.

Wakefield, MA —20 March 2019— The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 350 Open Source projects and initiatives, announced today Apache® CloudStack® CloudMonkey v6.0, the latest version of the turnkey enterprise Cloud orchestration platform's command line interface tool. 

Apache CloudStack is the proven, highly scalable, and easy-to-deploy IaaS platform used for rapidly creating private, public, and hybrid Cloud environments. Thousands of large-scale public Cloud providers and enterprise organizations use Apache CloudStack to enable billions of dollars worth of business transactions annually across their clouds.

Apache CloudMonkey v6.0.0 is the latest major release since the previous major 5.x release in September 2013.  CloudMonkey v6.0.0 is a rewrite of the original tool in Go programming language, and can be used both as an interactive shell and as a command line tool that simplifies CloudStack configuration and management.

Some of the new features and major changes include:
  • Rewrite in Go, ships as single binary for Linux, Mac, and Windows
  • Drop-in replacement for legacy Python-based cloudmonkey
  • About 5-20x faster than legacy Python-based cloudmonkey
  • Interactive UX for parameter and arg completion and selection
  • JSON is the default output format
  • New column based output
  • Enable debug mode using set debug true option, file-based logging removed
  • Per server profile based API cache
  • New syntax arg=@/path/to/file to pass the content of file as API argument value similar to curl
  • Improve help docs using -h argument
  • Removed: XML output, coloured output, several set options

"This release is the work of over one year of effort and driven by the people operating CloudStack clouds," said Rohit Yadav, Apache CloudStack CloudMonkey v6.0 author, and release manager. "I would like to thank the contributors across all of these organizations for supporting this release, which reflects both the user-driven nature of our community and the Apache CloudStack project's commitment to continue to be the most stable, easily deployable, scalable Open Source platform for IaaS. Along with ease of installation, usage and availability of cross-platform dependency-free builds including Windows builds, v6.0 brings many changes and optimizations such as more interactive shell for parameter completion, faster API requests processing, server profile specific API caching, improved API help docs and a new syntax to pass content of files as API parameter argument." More on the background and story behind the CloudMonkey 6.0 effort can be found at https://blogs.apache.org/cloudstack/entry/what-s-coming-in-cloudmonkey

"Apache CloudStack is a significant part of our Cloud portfolio right now – we run large deployments all over the world, often supporting critical customer applications," said Robert van der Meulen, Product Strategy Lead at Leaseweb Global B.V. "CloudMonkey is an invaluable tool for interacting with CloudStack-based clouds, and it's the go-to tool that we recommend to our customers when they want to use command-line interaction with our CloudStack platforms."

"CloudMonkey is an effective tool for the operators of CloudStack environments and  it becomes essential in large-scale CloudStack deployments," said Giles Sirett, CEO of ShapeBlue. "It's great to see this new version of CloudMonkey: having a CLI that can run on Windows desktops as well as Linux and Mac is important as we see more enterprise adoption of Apache CloudStack."

"CloudMonkey is now written in Golang, and with version v6.0 loading, speed has been drastically improved (accessing the CLI in under 0.5s)," said Pierre-Luc Dion, Cloud Architect at Cloud.ca. "This simplifies installation, deployments, updates, and operational efficiency."

"After many years of managing production Apache CloudStack deployments, I consider CloudMonkey a core tool in anyone's CloudStack toolkit, and now also being available for Windows makes me really happy," said Andrija Panic, Apache CloudStack Committer. "I can certainly see major speed improvements, but also having backward compatibility is what is so great with this new release."

Catch Apache CloudStack in action at ApacheCon 9-12 September 2019 in Las Vegas, Nevada, and at numerous Meetups worldwide, held throughout the year.

Downloads and Documentation
The official source code for CloudMonkey v6.0.0 can be downloaded from http://cloudstack.apache.org/downloads.html. The community-maintained builds are available at the project's Github release page at https://github.com/apache/cloudstack-cloudmonkey/releases . CloudMonkey's usage is documented at https://github.com/apache/cloudstack-cloudmonkey/wiki

Availability and Oversight
Apache CloudStack software is released under the Apache License v2.0 and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. For downloads, documentation, and ways to become involved with Apache CloudStack, visit http://cloudstack.apache.org/ and https://twitter.com/CloudStack

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than 350 leading Open Source projects, including Apache HTTP Server —the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 730 individual Members and 7,000 Committers across six continents successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Aetna, Alibaba Cloud Computing, Anonymous, ARM, Baidu, Bloomberg, Budget Direct, Capital One, Cerner, Cloudera, Comcast, Facebook, Google, Handshake, Hortonworks, Huawei, IBM, Indeed, Inspur, Leaseweb, Microsoft, ODPi, Pineapple Fund, Pivotal, Private Internet Access, Red Hat, Target, Tencent, Union Investment, Workday, and Verizon Media. For more information, visit http://apache.org/ and https://twitter.com/TheASF


© The Apache Software Foundation. "Apache", "CloudStack", "Apache CloudStack", and "ApacheCon" are registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. All other brands and trademarks are the property of their respective owners.

# # #

Tuesday March 19, 2019

The Apache Way to Sustainable Open Source Success

As Open Source software continues to grow in importance, it seems appropriate to reflect upon the ongoing success of The Apache Software Foundation (ASF) as it approaches its 20th anniversary. The Apache Way of community-driven development continues to gain momentum despite the compounding challenges of building software in the greater Open Source ecosystem.

This approach, The Apache Way, was defined over 24 years ago by the original Apache Group, prior to the establishment of the Foundation. It has led to our success as a foundation and we believe it has been fundamental to the triumph of Open Source as a whole.

While The Apache Way has been refined over the years, it remains true to the original goals of transparent, community-driven collaboration in a vendor-neutral environment that is accessible to all.

The Apache Way defines Open Source in terms of both a legal and a social framework for collaboration. It helps others understand what makes Open Source powerful and how participants are expected to behave. In this post we will examine The Apache Way in the context of the Foundation's mission:

"The mission of the Apache Software Foundation (ASF) is to provide software for the public good. We do this by providing services and support for many like-minded software project communities consisting of individuals who choose to participate in ASF activities." 

Let's dissect this mission statement. 

"Provide Software for the Public Good"

Key points in this section: 

  • We produce software that is non-excludable and non-rivalrous

  • Use of the software in any context does not reduce its availability to others

  • Users and contributors have no committed responsibility to the foundation, our projects or our communities

  • Use of a license that conforms to the Open Source Definition is necessary but not sufficient to deliver on our mission 

Investopedia defines a public good as "a product that one individual can consume without reducing its availability to another individual, and from which no one is excluded." On the surface, this is a good definition for our use of the term. However, there is a nuance in our use. Our mission is not to produce "public goods" but to "provide software for the public good". 

To understand why this is important, one needs to think about what motivates the ASF to produce software that is a public good.

Open Source software can be digitally copied and reused in an unlimited number of ways. Every user can modify it for their specific needs. They can combine it with other software. They can design innovative new products and services using it and can make a living from the proceeds. This is all possible without impacting other people's use of the software. As such, the ASF produces software that can be used for the public good in many different ways.

To allow us to deliver on this part of the mission, it is critical that we adopt a license that uses the law to protect the software curated here at the Foundation. For us that license is the Apache License, Version 2. In addition, we adopt an inbound licensing policy that defines which licenses are allowable on software reused within Apache projects. This policy can be summarized as: 

  • The license must meet the Open Source Definition (OSD).

  • The license, as applied in practice, must not impose significant restrictions beyond those imposed by the Apache License 2.0.

This means that you can be assured that software curated by projects within The Apache Software Foundation is both a public good and for the public good. You can use Apache software for any purpose and you have no responsibility to the Foundation or the project to contribute back (though as addressed in the next section, it is often in your interests to do so). 

It is important to recognize that there are software projects out there that adopt our license but do not adopt our inbound licensing policy. Such projects may bring restrictions that are not covered by our license; therefore, it is important to carefully examine the licensing policies of these projects. Using the Apache License alone may not provide you with the same options a Foundation project provides. 

Apache projects are successful, in large part, because of our diligence with respect to clearly-defined licensing policies. Such diligence makes it much easier for downstream users to understand what they can and cannot do with Apache software. The Apache License is deliberately permissive to ensure that everyone has an opportunity to participate in Open Source within the ASF or elsewhere. Modifications of our license are allowed, but modified licenses are neither the Apache License nor affiliated with or endorsed by The Apache Software Foundation. No modified license can be represented as such. Modified licenses that use the Apache name are strictly disallowed, as they are both confusing to users and undermine the Apache brand.

While we recognize that there are many ways to license software, whether Open Source or otherwise, we assert that only projects that use both our license (unmodified) and our inbound licensing policy truly follow and adhere to The Apache Way. 

While an OSD-approved license and associated policies are necessary for successful Open Source production, they are not sufficient. They provide a legal framework for the production of Open Source, but they do not provide a social framework, which brings us to the second sentence of our mission:

"The mission of the Apache Software Foundation is to provide software for the public good. We do this by providing services and support for many like-minded software project communities of individuals who choose to contribute to Apache projects."

"Like-Minded Software Project Communities of Individuals"

Key points in this section: 

  • The Apache Way provides a governance model designed to create a social framework for collaboration

  • The Apache Software Foundation develops communities, and those communities develop software

  • ASF project communities develop and reuse software components that in turn may be reused in products

  • Users of ASF software often build products and services using our software components

  • Our model, and others like it, have produced some of the largest and longest-lived Open Source projects that have literally revolutionized the industry 


There is a lot packed into these few words. It is an understanding of these words that makes the difference between software that is under an Open Source license and software that reaches sustainability through The Apache Way. These words underscore the fact that the Foundation does not directly produce software. That's right, The Apache Software Foundation, with upwards of $8Bn of software code, does not directly produce software. Rather than focus on software, we focus on the creation of and support of collaborative communities; the software is an intentional by-product. 

Our like-minded project communities come together because they share common problems that can be addressed in software. As the saying goes, "a problem shared is a problem halved". By bringing together individuals with their unique ideas and skills, we break down barriers to collaboration. 

The Apache Way is carefully crafted to create a social structure for collaboration, which complements the legal framework discussed above. Where the legal framework ensures an equal right to use the software, The Apache Way ensures an equal ability to contribute to the software. This is critically important to the long term sustainability of Open Source software projects. This social structure for collaboration is missing from many non-Apache projects, yet a robust social structure is invariably a key component in long-term successful projects outside of the ASF.

The Apache Way is fully inclusive, open, transparent and consensus-based. It promotes vendor neutrality to prevent undue influence (or control) from a single company. It ensures that any individual with a valuable contribution is empowered, and it seeks to assure that a project remains sustainable despite inevitable changes in community membership over time.

Apache projects typically produce software components that can be combined with other software (of any license) in different ways to solve different problems. This provides plenty of opportunity for participants to collaborate within a given software project independent of their relationship outside the Foundation. This is very different from the idea of licensing your product as a whole under an Open Source license. Our model offers more opportunities for reuse which, in turn, increase the pool of individuals likely to contribute to the project.

In addition, our merit-based system seeks to ensure that as people come and go, for whatever reason, there is always someone to take their place. As a result, some ubiquitous Apache projects have existed for over 20 years and helped commercialize the World Wide Web; while dozens of newer projects have defined industry segments such as Big Data and IoT (Internet of Things). 

A core tenet of The Apache Way is "Community Over Code", which encapsulates our deep belief that a healthy community is a far higher priority than good code. A strong community can always rectify a problem with the code, whereas an unhealthy community will likely struggle to maintain a codebase in a sustainable manner. Healthy communities ensure the Foundation has the stability to thrive for the next 20 years and beyond. Apache projects do not have the problem of scaling that others, who focus only on the legal frameworks of Open Source, suffer from. If you look around at projects that have grown up alongside the Apache projects, you will see a similar focus on scaling the governance model. This is no accident. 

Why this is Important

Software is a critical part of any modern economy. It touches every part of every life in the developed world, and is increasingly transforming everyday life, from womb to grave, everywhere.

At The Apache Software Foundation, we believe that every developer has their personal motivations for building software. We celebrate their right to choose when and how they build their software, including their right to use a non-open license. 

We will not dictate what is best for developers or for the software industry.

We care about the provision of software that enables our users, our contributors, and the general public to decide what is best for them.

We welcome you to use our software and contribute to our projects -- or not. It's up to you. 

We ask that you leave commercial interests at the door.

Countless organizations are proving that their team members who collaborate in a vendor-neutral environment often apply Open Innovation processes (such as The Apache Way) to their work. This helps create internal efficiencies and lays the groundwork for new external opportunities that may provide additional added benefits.

Bringing only your intention of contributing what best serves the greater Apache community reinforces trust in the people and projects behind the Apache brand, and helps us realize our mission of providing software for the public good. 

We learn together and work together to deliver the best software we can. 

Apache software is available for all.

The freedom to choose is what makes the Foundation and Apache projects so strong.

Summary

The software industry has changed and continues to change. The ways software is delivered to end users have changed. Some of the leaders in our industry have retired and new leaders have emerged. But some things have not changed. Our model of collaborative software development, through a combination of a licensing and social framework, remains one of the most successful models of software production.

Increasing the number of users, even those who do not contribute to code, should be seen as a benefit, not a problem, in Open Source. More users present an opportunity. At Apache, more users means more success since they are our future contributors.

As a US 501(c)(3) public charitable organization, The Apache Software Foundation helps individuals and organizations understand how Open Source at scale works in a highly competitive market. For more than two decades our focus has not been on producing software, but rather mentoring communities who produce software. The Apache Way advances sustainable Open Source communities: everything we do is Open Source so all kinds of users can benefit from our experience. Apache is for everyone.

# # #

Monday March 04, 2019

MEDIA ALERT: The Apache Software Foundation to present on Apache software projects in government, cybersecurity, and more at Roadshow and Open Source Job Fair at George Mason University

Wakefield, MA —4 March 2019— ApacheCon, the official conference series of The Apache Software Foundation (ASF), announced today the Apache Roadshow/DC, held in partnership with George Mason University (GMU).

WHO
The Apache Software Foundation is the world's largest Open Source foundation. The all-volunteer ASF develops, stewards, and incubates more than 350 enterprise-grade projects that serve as the backbone for some of the most visible and widely used applications in computing today. From artificial intelligence and deep learning, Big Data, build management, Cloud computing, content management, DevOps, IoT and Edge computing, mobile, servers, Web frameworks, and many other categories, all Apache software products are available to the public-at-large at 100% no cost, and benefit billions of users around the world.

George Mason University, Virginia’s largest and most diverse public research university, enrolls 37,000 students from 130 countries and all 50 states, who attend on campuses in Fairfax, Arlington, Manassas, and Loudoun, Virginia, as well as in Songdo, South Korea.

The Apache Roadshow is held in cooperation with the Center for Assurance Research & Engineering (CARE) of the Volgenau School of Engineering at George Mason University. The multidisciplinary school maintains a dual pre-eminence in both information technology and engineering, and seeks to graduate students who will take initiative, step up, and leave the planet better than they found it.

WHAT
Apache Roadshows are part of the official ASF conference series: they are smaller events with a more focused program and regional community engagement.

The DC Roadshow program features two dozen presentations on Apache projects across the following tracks: government, cybersecurity, CARE projects, and career development, in tandem with one of the first Open Source Software job fairs.

Topics include The Apache Way, analytics, Big Data, Cloud, cybersecurity, governance, healthcare, machine learning and natural language processing (NLP), and security, featuring Apache projects such as Airflow, Cassandra, cTAKES, Daffodil (incubating), Drill, Fortress, Hadoop, Ignite, NiFi, Rya, Tomcat, and more.

Speakers include representatives from The Apache Software Foundation, Apifocal, Benefits Data Trust, BMC Software, Capital One, Cloudera, ConSensys, DataNexus, Deloitte, Deutsche Bank, Fannie Mae, George Mason University, GridGain, IEEE, MITRE, Symas, Thomson Reuters, Total Child Health, Tresys, US Naval Academy, Verizon Media, and WSO2, among others.

The Roadshow's opening keynote, "Why Open Source is Vital in IT", will be presented by Jim Jagielski, co-founder of the ASF, Project Management Committee member of Apache HTTP Server and Apache Tomcat, and Open Source Chef at ConSensys.

The Open Source Job Fair will include representatives from AWS, Bloomberg, dito, Google, GridGain, Linode, and Security University, among others.


WHEN
Monday 25 March 2019

     Breakfast and Registration: 8AM ET
     Kickoff and Keynote: 9AM ET
     Open Source Job Fair: 10AM ET
     Post-Event Happy Hour: 5.15PM ET

WHERE
Dewberry Hall within the Johnson Center at George Mason University Main Fairfax Campus.

HOW
For more information and to register, visit http://www.apachecon.com/usroadshowdc19/

Tickets are $25 for standard admission; $15 for students; and free of charge for the Career Fair (pre-registration required).

About ApacheCon
ApacheCon is the official global conference series of The Apache Software Foundation. Since 1998 ApacheCon has been drawing participants at all levels to explore ”Tomorrow’s Technology Today” across 300+ Apache projects and their diverse communities. ApacheCon showcases the latest developments in ubiquitous Apache projects and emerging innovations through hands-on sessions, keynotes, real-world case studies, trainings, hackathons, community events, and more. For more information, visit http://apachecon.com/ and https://twitter.com/ApacheCon .

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than 350 leading Open Source projects, including Apache HTTP Server --the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 730 individual Members and 7,000 Committers across six continents successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Aetna, Alibaba Cloud Computing, Anonymous, ARM, Baidu, Bloomberg, Budget Direct, Capital One, Cerner, Cloudera, Comcast, Facebook, Google, Handshake, Hortonworks, Huawei, IBM, Indeed, Inspur, Leaseweb, Microsoft, ODPi, Pineapple Fund, Pivotal, Private Internet Access, Red Hat, Target, Tencent, Union Investment, Workday, and Verizon Media. For more information, visit http://apache.org/ and https://twitter.com/TheASF

© The Apache Software Foundation. "Apache", "Airflow", "Cassandra", "cTAKES", "Daffodil (incubating)", "Drill", "Fortress", "Hadoop", "Ignite, "NiFi", "Rya (incubating)", "Tomcat", "Apache Airflow", "Apache Cassandra", "Apache cTAKES", "Apache Daffodil (incubating)", "Apache Drill", "Apache Fortress", "Apache Hadoop", "Apache Ignite, "Apache NiFi", "Apache Rya (incubating)", "Apache Tomcat", and "ApacheCon" are registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. All other brands and trademarks are the property of their respective owners.

# # #

Friday March 01, 2019

The Apache Software Foundation Operations Summary: November 2018 - January 2019

FOUNDATION OPERATIONS SUMMARY

Third Quarter, Fiscal Year 2019 (November 2018 - January 2019)

"Happy to be a sponsor of one of the most important foundations in the world!"
Randy Abernethy, Managing Partner, RX-M LLC (ASF Bronze Sponsor)

> Conferences and Events: During this period, the Events team was mostly focused on preparations for upcoming events, as reported in our previous quarterly report, so there's not much to add. Our current status, as of 31 January, is as below:

  • Apache Roadshow, DC
    After several setbacks, the DC roadshow has been rescheduled for March 25th at George Mason University. Event details, and the full schedule of talks, may be found at http://apachecon.com/usroadshowdc19

    The DC roadshow will also feature a job fair.

  • Apache Roadshow, Chicago
    The planning for the Chicago roadshow is moving along nicely. It will be held May 13-14 in Logan Square, and details may be found at http://apachecon.com/chiroadshow19/index.html

  • ApacheCon North America 2019
    During the period of this report, we signed agreements to hold ApacheCon North America 2019 in Las Vegas, Nevada, September 9-12, 2019. This event celebrates the 20th anniversary of the Apache Software Foundation. The call for presentations will open soon, and sponsor opportunities will also be available soon.

  • ApacheCon Europe 2019
    Planning for ApacheCon Europe 2019 is well underway, and you can expect announcements and further details very soon.


> Community Development
One of the main themes raised this quarter was around ways to encourage new contributors by lowering the barriers to entry and other techniques that could help onboard them.

Some new contributor onboarding material tailored specifically tailored around Apache and participation in Apache projects is currently being developed and the community feedback has been very positive.

Discussion has also started about how to recognize non technical contributions. This is an area where Community Development could really help in the development of materials and processes, as well as providing guidance to projects on how it could be implemented.

In December, we participated at the Paris Open Source Summit, one of the main Open Source events in France. We had an ASF booth and over the two day summit saw lots of attendees who were keen to know more about the ASF. 

We were also invited to participate in a workshop to talk about how the administrative sector could help increase collaboration with Open Source foundations and communities.

The ASF has once again has applied to be a mentoring organisation for GSoC. In preparation, we have asked all ASF projects interested in participating to start recording their ideas.

We are continually receiving requests to participate at events and would like to be more organised in selecting and planning our event participation. As a first step we have asked our community for their feedback on which events they think are the most important ones for us to be involved. Based on this we hope to create a high level event participation plan for 2019.

Our mailing list subscriptions have increased this quarter and even though we have had a lot of active discussions, the traffic has decreased, probably due to the December holiday period.

> Committers and Contributions: Over the past quarter, 1,615 contributors committed 45,054 changes that amount to 18,612,331 lines of code across Apache projects. The top 5 contributors during this timeframe were: Andrea Cosentino (1,142 commits), Jean-Baptiste Onofré (747 commits), Tilman Hausherr (599 commits), Mark Thomas (594 commits), and Tellier Benoit (512 commits).


All individuals who are granted write access to the Apache repositories must submit an Individual Contributor License Agreement (ICLA). Corporations that have assigned employees to work on Apache projects as part of an employment agreement may sign a Corporate CLA (CCLA) for contributing intellectual property via the corporation. Individuals or corporations donating a body of existing software or documentation to one of the Apache projects need to execute a formal Software Grant Agreement (SGA) with the ASF. 

During Q3 FY2019, the ASF Secretary processed 232 ICLAs, 3 CCLAs, and 10 Software Grants. History of Apache committer growth can be seen at https://projects.apache.org/timelines.html

> Brand Management: Operations — The work of the Brand Management team falls broadly into one of three categories:

  • trademark transfers and registrations
  • granting permission to use our marks
  • addressing potential infringements of our marks

The volume of work fell slightly this quarter which was probably due to the end of year holiday period. The budget request for next financial year was finalized and submitted.

This quarter has seen the usual collection of requests to use Apache marks for user groups, events, merchandise and publications with nearly all requests being granted, subject to our Trademark Usage Policy. 

Registrations — The US registration of OPENOFFICE completed this quarter and we started the process to register IGNITE in the EU and China. We also worked with counsel to respond to office actions for a number of in-progress registrations.

With the assistance of counsel, we addressed a number of external registrations that overlap with our own with those external registrations being amended to avoid the overlap.

Infringements — Potential infringements are brought to our attention from both internal and external sources. The majority of infringements we see are accidental and our project communities are able to resolve these quickly and informally with occasional input from the Brand Management team. A small number of issues take longer to resolve. After last quarter's successes in resolving two of these more complex issues, this quarter has seen a small uptick in the number of complex issues we are tracking.

We saw an increase in projects reaching out to speakers and conference organizers this quarter to ensure that Apache projects were referred to correctly in conference session titles and abstracts.

And finally…

The Brand Management team welcomes your comments and suggestions as well as any questions you might have. Please see https://www.apache.org/foundation/marks/contact for our contact details.


> Infrastructure:
 It has been a very busy quarter for the Infrastructure team. We have been migrating many services to new approaches and systems, including gitbox, backups, and configuration management.

We have been reporting on the move to "Gitbox" for a while now, and this quarter the testing, development, and evolution is nearly at a close. This quarter, many hundreds of repositories have migrated from our old git provisioning to our new gitbox system. A mere hundred repositories are left, and will be completed in February, allowing us to shut down old hardware, old systems, and reduce our maintenance efforts. Meanwhile, the many tools that GitHub brings to development will be available to our projects.

This quarter has also seen a revamp of our backup systems to a simpler, straightforward mechanism across our multitude of virtual machines. We had a mix of commercial backup and in-house systems built on standard F/OSS tools. By concentrating on the in-house tooling, we have dramatically reduced costs and simplified our ongoing maintenance.

Our final, large system migration involves our use of Puppet for configuration management. For the past several years, we've been migrating our very old, organically-constructed systems onto Puppet version 3. While that process of bringing modern configuration management into our operations, Puppet has released many versions, now up to version 6. This quarter, we have begun to deploy new systems using this v6 setup. Looking forward, we'll begin shifting our operations to this new platform, to improve our operations.

In addition to these larger system moves, the team has kept busy with larger scale planning. Hiring more staff, longer term migration of our core email handling, five year budget planning, monitoring improvements, and decommissioning of our old hardware.

> Financial Statement:



> Fundraising:
 Fundraising has continued to go well this past quarter. We are pleased to have had several renewals from our awesome sponsors and we've even grown the ASF family of Sponsors and Targeted Sponsors! Welcome Amazon Web Services at the Platinum level, and RX-M LLC and Online Holland Casino at the Bronze level. We've recently begun our quarterly cycles of outreach via our Sponsor Ambassadors and look forward to hearing from our sponsors.

Work on reviewing the concept of a potential endowment or endowment-like project for the ASF has been transferred to VP Finance. The Fundraising committee remains engaged and eager to support however possible.

Fundraising support for ASF events (ApacheCon and Apache Roadshow) has picked up with the development of event prospectuses and assignments of Event Sponsor Ambassadors. We have also established and documented a new model for handling event fundraising.

It was wonderful to spend some time with our ASF family and friends at FOSDEM and look forward to doing the same at Apache Roadshow DC in March.

Internally the Fundraising committee continues to focus on improvements. Processes, documentation, record keeping and tracking is being reviewed and bolstered across the board. The team also thanks Kevin A McGrail for his hard work and dedication (VP and Co-VP Fundraising) over the past two years as he steps away from Fundraising.

= = = 

Thank you to all our Sponsors!

  • PLATINUM: Amazon Web Services, Cloudera, Comcast, Facebook, Google, LeaseWeb, Microsoft, Pineapple Fund, Oath, Tencent
  • GOLD: Anonymous, ARM, Bloomberg, Handshake, Hortonworks, Huawei, IBM, Indeed, ODPi, Pivotal
  • SILVER: Aetna, Alibaba Cloud Computing, Baidu, Budget Direct, Capital One, Cerner, Inspur, Private Internet Access, Red Hat, Target, Union Investment
  • BRONZE: Airport Rentals, Best VPN, The Blog Starter, Bookmakers, Cash Store, Casino Bonus, Casino2k, Cloudsoft, Emerio, Holland Online Casino, HostChecka.com, HostingAdvice.com, HostPapa Web Hosting, The Linux Foundation, Mobile Slots, RX-M LLC, SCAMS.info, Site Builder Report, Talend, The Best VPN, Twitter, Web Hosting Secret Revealed
  • TARGETED PLATINUM: DLA Piper, Microsoft, Oath, OSU Open Source Labs, Sonatype
  • TARGETED GOLD: Atlassian, The CrytpoFund, Datadog, PhoenixNAP, Quenda
  • TARGETED SILVER: Amazon Web Services, HotWax Systems, Rackspace
  • TARGETED BRONZE: Bintray, Education Networks of America, Google, Hopsie, No-IP, PagerDuty, Peregrine Computer Consultants Corporation, Sonic.net, SURFnet, Virtru

To Sponsor The Apache Software Foundation, visit http://apache.org/foundation/sponsorship.html . To make a one-time or monthly recurring donation, please visit https://donate.apache.org/

# # #

Report prepared by Sally Khudairi, Vice President Marketing & Publicity, with contributions by Rich Bowen, Vice President Conferences; Sharan Foga, Vice President Community Development; Mark Thomas, Vice President Brand Management; Greg Stein, ASF Infrastructure Administrator; Tom Pappas, Vice President Finance; and Daniel Ruggeri, Vice President Fundraising.

For more information, subscribe to the announce@apache.org mailing list and visit http://www.apache.org/, the ASF Blog at http://blogs.apache.org/, the @TheASF on Twitter, and https://www.linkedin.com/company/the-apache-software-foundation.

(c) The Apache Software Foundation 2019.

Tuesday February 19, 2019

The Apache® Software Foundation Announces Apache Arrow™ Momentum

Open Source Big Data in-memory columnar layer adopted by dozens of Open Source and commercial technologies; exceeded 1,000,000 monthly downloads within first three years as an Apache Top-Level Project

Wakefield, MA —19 February 2019— The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 350 Open Source projects and initiatives, today announced momentum with Apache® Arrow™, the Open Source Big Data in-memory columnar layer.

Since the founding of the project in January 2016, Apache Arrow has quickly become the defacto standard for representing and processing analytical data in memory, accelerating analytical processing and interchange by more than 100x.

"When we became a Top-Level Project, we projected that the majority of the world's data will be processed through Arrow within the next decade," said Jacques Nadeau, Vice President of Apache Arrow. "In just three years time, we are proud to see Arrow's substantial industry adoption and increased value across a wide range of analytical, machine learning, and artificial intelligence workloads."

Highlights of Apache Arrow's success include:

Industry Adoption —more than 20 major technologies adopted Arrow to accelerate in-memory analytics, including Apache Spark, NVIDIA RAPIDS, pandas, and Dremio, among others. A list of known Open Source and commercial implementations can be found at https://arrow.apache.org/powered_by/

Millions of Downloads —leveraging and integrating Apache Arrow into many other technologies has bolstered downloads to more than 1,000,000 each month.

New Language Support —as a cross-language development platform, supporting multiple programming languages is paramount. Apache Arrow has grown from supporting one language to eleven different languages today; they include C++, Java, Python, R, C#, Javascript, and Ruby, among others.

Seamless Data Format Support —Arrow supports different data types, both simple and nested, located in arbitrary memory such as regular system RAM, memory-mapped files or on-GPU memory. In addition, it can ingest data from popular storage formats such as Apache Parquet, CSV files, Apache ORC, JSON, and more.

Major Code Donations —Apache Arrow's new features and expanded functionality are due in part to code and component donations that include:
  • C# Library
  • Gandiva LLVM-based Expression Compiler
  • Go Library
  • Javascript Library
  • Plasma Shared Memory Object Store
  • Ruby Libraries (Apache Arrow and Apache Parquet)
  • Rust Libraries (Parquet and DataFusion Query Engine)
Community and Contributor Growth —over the past 12 months, nearly 300 individuals have submitted more than 3,000 contributions that have grown the Apache Arrow code base by 300,000 lines of code. The Arrow community is welcoming approximately 10 new contributors each month.


In January the project announced its most recent release, Apache Arrow 0.12.0, which reflects more than 600 enhancements developed during Q4 2018. The Apache Arrow community is actively working on a number of impactful new initiatives that include solving high performance analytical problems and allowing for more efficient data distribution across entire clusters.

"Apache Arrow's rapid industry adoption and developer community growth supports our original thesis of the importance of a language-independent open standard for columnar data," said Wes McKinney, member of the Apache Arrow Project Management Committee, and creator of Python's pandas project. "Additionally, we are seeing productive collaborations take place not only between programming languages but also between the database systems and data science worlds. We look forward to welcoming more data system developers into our community."

About Apache Arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, C#, Go, Java, JavaScript, MATLAB, Python, R, Ruby, and Rust.

Availability and Oversight
Apache Arrow software is released under the Apache License v2.0 and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. For downloads, documentation, and ways to become involved with Apache Arrow, visit http://arrow.apache.org/

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than 350 leading Open Source projects, including Apache HTTP Server --the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 730 individual Members and 7,000 Committers across six continents successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official global conference series. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Aetna, Alibaba Cloud Computing, Anonymous, ARM, Baidu, Bloomberg, Budget Direct, Capital One, Cerner, Cloudera, Comcast, Facebook, Google, Handshake, Hortonworks, Huawei, IBM, Indeed, Inspur, LeaseWeb, Microsoft, Oath, ODPi, Pineapple Fund, Pivotal, Private Internet Access, Red Hat, Target, Tencent, Union Investment, and Workday. For more information, visit http://apache.org/ and https://twitter.com/TheASF

© The Apache Software Foundation. "Apache", "Arrow", "Apache Arrow", and "ApacheCon" are registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. All other brands and trademarks are the property of their respective owners.

# # #

Monday January 28, 2019

The Apache Software Foundation Welcomes Amazon Web Services (AWS) as its Newest Sponsor

The Apache Software Foundation (ASF) today welcomed Amazon Web Services (AWS), the latest company to sponsor the ASF at the Platinum level.

"We are pleased to have AWS as a Platinum Sponsor," said ASF Chairman Phil Steitz. "In addition to helping support the ASF’s day-to-day operations through a Platinum sponsorship, AWS has been a Silver-level Targeted Sponsor providing the ASF Infrastructure team with AWS Cloud credits over the past two years. More than 350 Apache projects and their communities directly benefit from the generosity of ASF Sponsors."

AWS joins the following Sponsors:

Platinum level --Cloudera, Comcast, Facebook, Google, LeaseWeb, Microsoft, Oath, Pineapple Fund, and Tencent;

Gold level --Anonymous, ARM, Bloomberg, Handshake, Hortonworks, Huawei, IBM, Indeed, Pivotal, and Union Investment;

Silver level --Aetna, Alibaba Cloud Computing, Baidu, Budget Direct, Capital One, Cerner, Inspur, ODPi, Private Internet Access, Red Hat, and Target;

Bronze level --Airport Rentals, The Blog Starter, Bookmakers, Cash Store, Casino Bonus, Casino2k, Cloudsoft, Emerio, Footprints Recruiting, HostChecka.com, HostingAdvice.com, HostPapa Web Hosting, The Linux Foundation, Mobile Slots, Mutuo Kredit, Online Holland Casino, RX-M, SCAMS.info, Site Builder Report, Talend, The Best VPN, Twitter, and Web Hosting Secret Revealed.

For more information on becoming a Sponsor of the ASF, please see http://apache.org/foundation/sponsorship.html

# # #

Wednesday January 23, 2019

The Apache Software Foundation Announces Apache® Hadoop® v3.2.0

Pioneering Open Source distributed enterprise framework powers US$166B Big Data ecosystem

Wakefield, MA —23 January 2019— The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 350 Open Source projects and initiatives, today announced Apache® Hadoop® v3.2.0, the latest version of the Open Source software framework for reliable, scalable, distributed computing.

Now in its 11th year, Apache Hadoop is the foundation of the US$166B Big Data ecosystem (source: IDC) by enabling data applications to run and be managed on large hardware clusters in a distributed computing environment. "Apache Hadoop has been at the center of this big data transformation, providing an ecosystem with tools for businesses to store and process data on a scale that was unheard of several years ago," according to Accenture Technology Labs.

"This latest release unlocks the powerful feature set the Apache Hadoop community has been working on for more than nine months," said Vinod Kumar Vavilapalli, Vice President of Apache Hadoop. "It further diversifies the platform by building on the cloud connector enhancements from Apache Hadoop 3.0.0 and opening it up for deep learning use-cases and long-running apps."

Apache Hadoop 3.2.0 highlights include:
  • ABFS Filesystem connector —supports the latest Azure Datalake Gen2 Storage;
  • Enhanced S3A connector —including better resilience to throttled AWS S3 and DynamoDB IO;
  • Node Attributes Support in YARN —helps to tag multiple labels on the nodes based on its attributes and supports placing the containers based on expression of these labels;
  • Storage Policy Satisfier  —supports HDFS (Hadoop Distributed File System) applications to move the blocks between storage types as they set the storage policies on files/directories; 
  • Hadoop Submarine —enables data engineers to easily develop, train and deploy deep learning models (in TensorFlow) on very same Hadoop YARN cluster;
  • C++ HDFS client —helps to do async IO to HDFS which helps downstream projects such as Apache ORC;
  • Upgrades for long running services —supports in-place seamless upgrades of long running containers via YARN Native Service API (application program interface) and CLI (command-line interface).

"This is one of the biggest releases in Apache Hadoop 3.x line which brings many new features and over 1,000 changes," said Sunil Govindan, Apache Hadoop 3.2.0 release manager. "We are pleased to announce that Apache Hadoop 3.2.0 is available to take your data management requirements to the next level. Thanks to all our contributors who helped to make this release happen."

Apache Hadoop is widely deployed at numerous enterprises and institutions worldwide, such as Adobe, Alibaba, Amazon Web Services, AOL, Apple, Capital One, Cloudera, Cornell University, eBay, ESA Calvalus satellite mission, Facebook, foursquare, Google, Hortonworks, HP, Huawei, Hulu, IBM, Intel, LinkedIn, Microsoft, Netflix, The New York Times, Rackspace, Rakuten, SAP, Tencent, Teradata, Tesla Motors, Twitter, Uber, and Yahoo. The project maintains a list of educational and production users, as well as companies that offer Hadoop-related services at https://wiki.apache.org/hadoop/PoweredBy

Global Knowledge hails, "...the open-source Apache Hadoop platform changes the economics and dynamics of large-scale data analytics due to its scalability, cost effectiveness, flexibility, and built-in fault tolerance. It makes possible the massive parallel computing that today's data analysis requires."

Hadoop is proven at scale: Netflix captures 500+B daily events using Apache Hadoop. Twitter uses Apache Hadoop to handle 5B+ sessions a day in real time. Twitter’s 10,000+ node cluster processes and analyzes more than a zettabyte of raw data through 200B+ tweets per year. Facebook’s cluster of 4,000+ machines that store 300+ petabytes is augmented by 4 new petabytes of data generated each day. Microsoft uses Apache Hadoop YARN to run the internal Cosmos data lake, which operates over hundreds of thousands of nodes and manages billions of containers per day.

Transparency Market Research recently reported that the global Hadoop market is anticipated to rise at a staggering 29% CAGR with a market valuation of US$37.7B by the end of 2023.

Apache Hadoop remains one of the most active projects at the ASF: it ranks #1 for Apache project repositories by code commits, and is the #5 repository by size (3,881,797 lines of code).

"The Apache Hadoop community continues to go from strength to strength in further driving innovation in Big Data," added Vavilapalli. "We hope that developers, operators and users leverage our latest release in fulfilling their data management needs."

Catch Apache Hadoop in action at the Strata conference, 25-28 March 2019 in San Francisco, and dozens of Hadoop MeetUps held around the world, including on 30 January 2019 at LinkedIn in Sunnyvale, California.

Availability and Oversight
Apache Hadoop software is released under the Apache License v2.0 and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. For downloads, documentation, and ways to become involved with Apache Hadoop, visit http://hadoop.apache.org/ and https://twitter.com/hadoop

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than 350 leading Open Source projects, including Apache HTTP Server --the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 730 individual Members and 7,000 Committers across six continents successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official global conference series. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Aetna, Alibaba Cloud Computing, Anonymous, ARM, Baidu, Bloomberg, Budget Direct, Capital One, Cerner, Cloudera, Comcast, Facebook, Google, Handshake, Hortonworks, Huawei, IBM, Indeed, Inspur, LeaseWeb, Microsoft, Oath, ODPi, Pineapple Fund, Pivotal, Private Internet Access, Red Hat, Target, Tencent, and Union Investment. For more information, visit http://apache.org/ and https://twitter.com/TheASF

© The Apache Software Foundation. "Apache", "Hadoop", "Apache Hadoop", and "ApacheCon" are registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. All other brands and trademarks are the property of their respective owners.

# # #

Tuesday January 08, 2019

The Apache Software Foundation Announces Apache® Airflow™ as a Top-Level Project

Open Source Big Data workflow management system in use at Adobe, Airbnb, Etsy, Google, ING, Lyft, PayPal, Reddit, Square, Twitter, and United Airlines, among others.

Wakefield, MA —8 January 2019— The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 350 Open Source projects and initiatives, announced today Apache® Airflow™ as a Top-Level Project (TLP).

Apache Airflow is a flexible, scalable workflow automation and scheduling system for authoring and managing Big Data processing pipelines of hundreds of petabytes. Graduation from the Apache Incubator as a Top-Level Project signifies that the Apache Airflow community and products have been well-governed under the ASF's meritocratic process and principles.

"Since its inception, Apache Airflow has quickly become the de-facto standard for workflow orchestration," said Bolke de Bruin, Vice President of Apache Airflow. "Airflow has gained adoption among developers and data scientists alike thanks to its focus on configuration-as-code. That has gained us a community during incubation at the ASF that not only uses Apache Airflow but also contributes back. This reflects Airflow’s ease of use, scalability, and power of our diverse community; that it is embraced by enterprises and start-ups alike, allows us to now graduate to a Top-Level Project."

Apache Airflow is used to easily orchestrate complex computational workflows. Through smart scheduling, database and dependency management, error handling and logging, Airflow automates resource management, from single servers to large-scale clusters. Written in Python, the project is highly extensible and able to run tasks written in other languages, allowing integration with commonly used architectures and projects such as AWS S3, Docker, Apache Hadoop HDFS, Apache Hive, Kubernetes, MySQL, Postgres, Apache Zeppelin, and more. Airflow originated at Airbnb in 2014 and was submitted to the Apache Incubator March 2016.

Apache Airflow is in use at more than 200 organizations, including Adobe, Airbnb, Astronomer, Etsy, Google, ING, Lyft, NYC City Planning, Paypal, Polidea, Qubole, Quizlet, Reddit, Reply, Solita, Square, Twitter, and United Airlines, among others. A list of known users can be found at https://github.com/apache/incubator-airflow#who-uses-apache-airflow

"Adobe Experience Platform is built on cloud infrastructure leveraging open source technologies such as Apache Spark, Kafka, Hadoop, Storm, and more," said Hitesh Shah, Principal Architect of Adobe Experience Platform. "Apache Airflow is a great new addition to the ecosystem of orchestration engines for Big Data processing pipelines. We have been leveraging Airflow for various use cases in Adobe Experience Cloud and will soon be looking to share the results of our experiments of running Airflow on Kubernetes." 

"Our clients just love Apache Airflow. Airflow has been a part of all our Data pipelines created in past 2 years acting as the ring-master and taming our Machine Learning and ETL Pipelines," said Kaxil Naik, Data Engineer at Data Reply. "It has helped us create a Single View for our client's entire data ecosystem. Airflow's Data-aware scheduling and error-handling helped automate entire report generation process reliably without any human-intervention. It easily integrates with Google Cloud (and other major cloud providers) as well and allows non-technical personnel to use it without a steep learning curve because of Airflow’s configuration-as-a-code paradigm."

"With over 250 PB of data under management, PayPal relies on workflow schedulers such as Apache Airflow to manage its data movement needs reliably," said Sid Anand, Chief Data Engineer at PayPal. "Additionally, Airflow is used for a range of system orchestration needs across many of our distributed systems: needs include self-healing, autoscaling, and reliable [re-]provisioning."

"Since our offering of Apache Airflow as a service in Sept 2016, a lot of big and small enterprises have successfully shifted all of their workflow needs to Airflow," said Sumit Maheshwari, Engineering Manager at Qubole. "At Qubole, not only are we a provider, but also a big consumer of Airflow as well. For example, our whole Insight and Recommendations platform is built around Airflow only, where we process billions of events every month from hundreds of enterprises and generate insights for them on big data solutions like Apache Hadoop, Apache Spark, and Presto. We are very impressed by the simplicity of Airflow and ease at which it can be integrated with other solutions like clouds, monitoring systems or various data sources."

"At ING, we use Apache Airflow to orchestrate our core processes, transforming billions of records from across the globe each day," said Rob Keevil, Data Analytics Platform Lead at ING WB Advanced Analytics. "Its feature set, Open Source heritage and extensibility make it well suited to coordinate the wide variety of batch processes we operate, including ETL workflows, model training, integration scripting, data integrity testing, and alerting. We have played an active role in Airflow development from the onset, having submitted hundreds of pull requests to ensure that the community benefits from the Airflow improvements created at ING.  We are delighted to see Airflow graduate from the Apache Incubator, and look forward to see where this exciting project will be taken in future!"

"We saw immediately the value of Apache Airflow as an orchestrator when we started contributing and using it," said Jarek Potiuk, Principal Software Engineer at Polidea. "Being able to develop and maintain the whole workflow by engineers is usually a challenge when you have a huge configuration to maintain. Airflow allows your DevOps to have a lot of fun and still use the standard coding tools to evolve your infrastructure. This is 'infrastructure as a code' at its best."

"Workflow orchestration is essential to the (big) data era that we live in," added de Bruin. "The field is evolving quite fast and the new data thinking is just starting to make an impact. Apache Airflow is a child of the data era and therefore very well positioned, and is also young so a lot of development can still happen. Airflow can use bright minds from scientific computing, enterprises, and start-ups to further improve it. Join the community, it is easy to hop on!"

Availability and Oversight
Apache Airflow software is released under the Apache License v2.0 and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. For downloads, documentation, and ways to become involved with Apache Airflow, visit http://airflow.apache.org/ and https://twitter.com/ApacheAirflow

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than 350 leading Open Source projects, including Apache HTTP Server --the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 730 individual Members and 7,000 Committers across six continents successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Aetna, Alibaba Cloud Computing, Anonymous, ARM, Baidu, Bloomberg, Budget Direct, Capital One, Cerner, Cloudera, Comcast, Facebook, Google, Handshake, Hortonworks, Huawei, IBM, Indeed, Inspur, LeaseWeb, Microsoft, Oath, ODPi, Pineapple Fund, Pivotal, Private Internet Access, Red Hat, Target, Tencent, and Union Investment. For more information, visit http://apache.org/ and https://twitter.com/TheASF

© The Apache Software Foundation. "Apache", "Airflow", "Apache Airflow", and "ApacheCon" are registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. All other brands and trademarks are the property of their respective owners.

# # #

Tuesday January 01, 2019

Apache in 2018 - By The Digits

It's been a great year for the Apache community at-large. With nearly 200M lines of code under the ASF's stewardship, our ongoing success is the result of community-led development "The Apache Way", executed through the collaborative efforts of more than 300 Apache projects and their communities. Highlights include:

Apache Projects https://projects.apache.org/
  • Total number of projects + sub-projects - 328 (not including Apache Labs initiatives)
  • Top-Level Projects - 198
  • Podlings in the Apache Incubator - 51
  • Other groups, including operations/support - 62

Community/People —http://home.apache.org/
  • Apache Committers - 7,032 (6,693 active)
  • ASF Members (individuals) - 730
  • New Members elected - 44

Apache Projects/Code —https://projects.apache.org/statistics.html

3,208 Apache Committers changed 78,493,228 lines of code over 201,220 commits. We also  welcomed 4,638 new code contributors and 15,861 new issue/pull request contributors.

 Top 5 Apache Code Committers 

  1. Andrea Cosentino (2,508 commits; 237,224 lines changed)
  2. Jean-Baptiste Onofré (2,098 commits; 1,208,851 lines changed)
  3. Duo Zhang (1,956 commits; 809,085 lines changed)
  4. Mark Thomas (1,823 commits; 179,883 lines changed)
  5. Tilman Hausherr (1,736 commits; 81,940 lines changed)

Top 5 Apache Project Repositories by Commits
  1. Hadoop
  2. HBase
  3. Beam
  4. Camel
  5. Flink
Top 5 Apache Project Repositories by Size (Lines of Code)
  1. OpenOffice (7,822,699)
  2. NetBeans (7,741,506)
  3. Flex (whiteboard: 5,233,722; SDK 3,933,522)
  4. Mynewt (documentation: 4,381.072)
  5. Hadoop (3,881,797)

"If it didn't happen on-list, it didn't happen." https://lists.apache.org/
  • Total number of mailing lists 1,131
  • 19,435 authors sent 1,497,005 emails on 505,793 topics
Top 5 most active Apache user@ mailing lists
  1. Flink
  2. Lucene
  3. Ignite
  4. Cassandra
  5. Kafka
Top 5 most active Apache dev@ mailing lists
  1. Beam
  2. Ignite
  3. Kafka
  4. Tomcat
  5. James

Contributor License Agreements and Software Grants —https://www.apache.org/licenses/

We welcomed an average of 387 new code contributors and 1,250 new people filing issues each month. Individuals who are granted write access to the Apache repositories must submit an Individual Contributor License Agreement (ICLA). Corporations that have assigned employees to work on Apache projects as part of an employment agreement may sign a Corporate CLA (CCLA) for contributing intellectual property via the corporation. Individuals or corporations donating a body of existing software or documentation to one of the Apache projects need to execute a formal Software Grant Agreement (SGA) with the ASF. 
  • ICLAs signed - 831
  • CCLAs signed - 35
  • Software Grants submitted - 25

Sponsorship and Individual Support —http://apache.org/foundation/contributing.html

Thank you to our hundreds of individual donors and Sponsors whose generous support helps offset the ASF's day-to-day operating expenses that include Infrastructure, Accounting, Fundraising, Marketing & Publicity, and more.
  • Platinum: Cloudera, Comcast, Facebook, Google, LeaseWeb, Microsoft, Oath, Pineapple Fund, and Tencent Cloud.

  • Gold: Anonymous, ARM, Bloomberg, Handshake, Hortonworks, Huawei, IBM, Indeed, Pivotal, and Union Investment.

  • Silver: Aetna, Alibaba Cloud Computing, Baidu, Budget Direct, Capital One, Cerner, Inspur, ODPi, Private Internet Access, Red Hat, and Target.

  • Bronze: Airport Rentals, Best VPN, The Blog Starter, Bookmakers, Cash Store, Casino Bonus, Casino2k, Cloudsoft, Emerio, Footprints Recruiting, HostChecka.com, HostingAdvice.com, HostPapa Web Hosting, The Linux Foundation, Mobile Slots, Mutuo Kredit AG, Online Holland Casino, RX-M, SCAMS.info, Site Builder Report, Talend, The Best VPN, Twitter, and Web Hosting Secret Revealed.

ASF Targeted Sponsors provide the Foundation with contributions for specific activities or programs.
  • Targeted Platinum: DLA Piper, Microsoft, Oath, OSU Open Source Labs, and Sonatype.

  • Targeted Gold: Atlassian, The CrytpoFund, Datadog, PhoenixNAP, and Quenda.

  • Targeted Silver: Amazon Web Services, HotWax Systems, and Rackspace.

  • Targeted Bronze: Bintray, Education Networks of America, Google, Hopsie, No-IP, PagerDuty, Peregrine Computer Consultants Corporation, Sonic.net, SURFnet, and Virtru.


Together, our Members, Committers, contributors, users, supporters, and sponsors continue to build on our mission of providing Open Source software for the public good and are helping keep Apache software accessible to everyone.

Wishing you the best in 2019!

# # #

Wednesday December 12, 2018

The Apache Software Foundation Announces Apache® Griffin™ as a Top-Level Project

Open Source Big Data quality solution in use at eBay, Expedia, Huawei, JD.com, Meituan, PayPal, Pingan Bank, PPDAI, VIP.com, VMWare, and more.

Wakefield, MA —12 December 2018— The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 350 Open Source projects and initiatives, announced today Apache® Griffin™ as a Top-Level Project (TLP).

Apache Griffin is a robust Open Source Big Data quality solution for distributed data systems at any scale. It provides a unified process to measure data quality from different perspectives, as well as building and validating trusted data assets in both streaming or batch contexts. Griffin originated at eBay and entered the Apache Incubator in December 2016.

"We are very proud of Griffin reaching this important milestone," said William Guo, Vice President of Apache Griffin. "By actively improving Big Data quality, Griffin helps build trusted data assets, therefore boosting your confidence in your business." 

Apache Griffin enables data scientists/analysts to handle data quality issues by:
  • Defining –specifying data quality requirements such as accuracy, completeness, timeliness, profiling, etc.;

  • Measuring –source data ingested into the Griffin computing cluster will apply data quality measurement based on user-defined requirements; and

  • Applying Metrics –data quality reports as metrics will be exported to designated destination.

In addition, Griffin allows users to easily onboard new requirements into the platform and write comprehensive logic to further define their data quality. 

Apache Griffin is in use in high volume, high demand environments at 163.com/Netease, eBay, Expedia, Huawei, JD.com, Meituan, PayPal, Pingan Bank, PPDAI, VIP.com, and VMWare, among others.

"eBay contributed Griffin to the Apache Incubator in December 2016 to ensure its future development in a community-driven manner. It started with the idea on how eBay could address the data quality issue across multiple systems, especially in streaming context," said Vivian Tian, VP of eBay, GM - China Center of Excellence. "Griffin brings data quality solution to data ecosystem and ensure data applications have a solid quality foundation. We are extremely happy to see Griffin graduate as an Apache Top Level Project, and look forward to continued innovation and collaboration with the Apache community."

"We have been using Apache Griffin for about two years, monitoring 1000+ tables with data quality metrics, and are very happy to see it graduate to a Top-Level Project," said Chao Zhu, Senior Director at VIPshop Finance. "Apache Griffin and its data quality DSL can help us easily identify data quality issues instantly on our big data platform. In addition, Apache Griffin's architecture is highly extensible. We are looking forward to using it in real time data quality management system. We also look forward to contribute some of our minor enhancement to Griffin back to the community."

"We appreciate the Griffin project which really helps so much in our daily data jobs.After years of struggling with the complexity of data quality issues, we turned to Apache Griffin for a new platform that would simplify our data quality pipeline," said Jianfeng Liu, Director of Real-time Data Department at PPDAI. "Because of Apache Griffin's unified model for both batch and stream processing, we've been able to replace legacy systems with one solution that works seamlessly in our production environment. Griffin DSLs have allowed us to dramatically simplify our pipeline and to reduce our efforts a lot. I'm very proud and excited to see that the project is graduating."

"Apache Griffin is one of the best data quality solutions which my team has been used so far. It has been an exciting journey seeing the Griffin community evolve rapidly. And many people iteratively adopting it and contributing to newer capabilities," said Austin Sun, Senior Engineering Manager, Enterprise Service Platform at PayPal. "In PayPal risk domain, we benefit a lot from Apache Griffin to provide high quality data to make precise decision and protect our customer. In addition to PayPal risk, I knew there are several other corporates also leverages core capability from Griffin as their data quality solution. It’s my great honor to witness Griffin grows to a top level project. Way to go, Griffin."

"Apache Griffin project is yet another showcase how community over code can work for projects coming out from internal usages of companies into the open source," said Henry Saputra, ASF member and Incubator Mentor for Apache Griffin. "I am proud to be the part of the projects and mentors for the project when it was being contributed from eBay, in addition to several other projects already donated to ASF such as Apache Kylin and Eagle. The team has worked tremendously hard to adapt the Apache Way, and also shown great respect for the open source community in all the processes for design, development, and release processes.As a Top-Level Project I believe the PMC will help lead the project to much more success in the future."

"Graduation is not the end, it is the beginning of another journey. We hope to take Apache Griffin to the next level with a wider set of features and users," added Guo. "We welcome anyone to join our efforts by helping with product design, documentation, code, technical discussions or promoting Apache Griffin in The Apache Way."

Availability and Oversight
Apache Griffin software is released under the Apache License v2.0 and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. For downloads, documentation, and ways to become involved with Apache Griffin, visit http://griffin.apache.org/ and https://twitter.com/apachegriffin

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than 350 leading Open Source projects, including Apache HTTP Server --the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 730 individual Members and 6,800 Committers across six continents successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Aetna, Alibaba Cloud Computing, Anonymous, ARM, Baidu, Bloomberg, Budget Direct, Capital One, Cerner, Cloudera, Comcast, Facebook, Google, Handshake, Hortonworks, Huawei, IBM, Indeed, Inspur, LeaseWeb, Microsoft, Oath, ODPi, Pineapple Fund, Pivotal, Private Internet Access, Red Hat, Target, Tencent, and Union Investment. For more information, visit http://apache.org/ and https://twitter.com/TheASF

© The Apache Software Foundation. "Apache", "Griffin", "Apache Griffin", and "ApacheCon" are registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. All other brands and trademarks are the property of their respective owners.

# # #

Thursday November 29, 2018

The Apache Software Foundation Operations Summary: August - October 2018

FOUNDATION OPERATIONS SUMMARY

Second Quarter, Fiscal Year 2019 (August - October 2018)

"There have been no signs Apache will stop growing."
Nick Ismail, InformationAge

> Conferences and Events: During Q2, we conducted one event, and were in the midst of planning three others.

  • ApacheCon North America 2018, September 24-27 | Montreal, Canada — This event was our 30th ApacheCon, and celebrated the 20th anniversary of the first ApacheCon in San Francisco in 1998. This was an intentionally smaller event, produced mostly in house, with the assistance of volunteer producer Ruth Suehle, as well as a cast of characters from around the Foundation. We had roughly 350 in attendance, and were able to turn a small profit on the entire venture. Video from the event may be viewed at https://s.apache.org/oUA1 (Keynotes) and audio from the event may be found at https://feathercast.apache.org/ (the rest of the presentations). Keynotes included Cliff Schmidt, Myrle Krantz, Bridget Kromhout, and Dave Neuman, and Fred Reiss, all of whom delivered inspiring talks about the role of open source in making the world a better place, with an emphasis on how open source helps the underprivileged of the world. Bertrand Delacretaz delivered the traditional State Of The Feather address on behalf of the Foundation.

  • Apache North America Road Show, December 4 | Fairfax, VA — During this quarter, we were actively working towards the Apache North America Roadshow, which will be held at George Mason University on December 4th of this year. This event is led by Kevin McGrail, in conjunction with GMU. You can read more about this event at http://www.apachecon.com/usroadshow18/

  • ApacheCon North America 2019 — During this quarter we were investigating venues for ApacheCon North America 2019, which will celebrate the 20th anniversary of the ASF. We expect to announce dates and venue in the coming quarter.

  • ApacheCon EU 2019 — During this quarter we were in discussion with a producer in Europe for the production of ApacheCon EU 2019, which is tentatively planned to be in Berlin. Details, dates, and venue should be announced in the coming quarter.


Sponsorship: as always, if you are interested in sponsoring ApacheCon, or other Apache events, we encourage you to get in touch with the sponsorship team, who can provide you with all active prospectuses at that time.

> Community Development: During this quarter our focus was helping prepare for and support ApacheCon NA in Montreal as well as participation in various other events.

During August one of our key activities was coordinating and setting up a central logo page and repository for high resolution logos for every single ASF project http://www.apache.org/logos/ . It was a great way to collaborate with all our projects on achieving a common goal. We now have the most recent versions of all ASF project logos available in various formats. Feedback and participation from projects was extremely positive and we now have a great resource for communities to use for marketing materials as well as reinforcing the Apache brand.

Our main focus in September was to help provide support for ApacheCon NA in Montreal. As usual we coordinated the Apache booth which is always a popular central meeting place throughout the conference. As well as speaking to conference attendees, we also had promotional material and giveaways.

We are continuing to be active in external events. During September we participated at Solutions Hamburg and for the first time had an Apache booth at the event. Once again we received very positive feedback and interest from attendees wanting to learn more about ASF activities. 

It has been nearly two years since we ran the Committer Diversity survey and in October we started discussions around re-running it. The community has been very positive and are very supportive of organizing another survey to understand how and if committer diversity has changed. It is hoped to get this done during the next quarter.

We are still receiving many requests to participate at various events and are looking running smaller Apache Roadshow style events as a way to reach audiences that may not have the capacity to attend ApacheCon.

Our mailing list subscriptions have increased this quarter and traffic is also higher reflecting the continued participation and interest, our discussions have created.

> Committers and Contributions: Over the past quarter, 1,656 contributors committed 50,346 changes that amount to 17,789,994 lines of code across Apache projects. The top 5 contributors during this timeframe were: David Moravek (920 commits), Radu Cotescu (718 commits), Andrea Cosentino (561 commits), Derek Gelinas (484 commits), and Jean-Baptiste Onofré (483 commits).

All individuals who are granted write access to the Apache repositories must submit an Individual Contributor License Agreement (ICLA). Corporations that have assigned employees to work on Apache projects as part of an employment agreement may sign a Corporate CLA (CCLA) for contributing intellectual property via the corporation. Individuals or corporations donating a body of existing software or documentation to one of the Apache projects need to execute a formal Software Grant Agreement (SGA) with the ASF. 

During Q1 FY2019, the ASF Secretary processed 182 ICLAs, 12 CCLAs, and 4 Software Grants. History of Apache committer growth can be seen at https://projects.apache.org/timelines.html

> Brand Management: Operations — The work of the Brand Management team falls broadly into one of three categories:

  • trademark transfers and registrations
  • granting permission to use our marks
  • addressing potential infringements of our marks

The volume of work has remained steady this quarter. Registrations and transfers are lengthy processes and the previous work to put in place a task tracking system continues to pay dividends. 

This quarter has seen the usual collection of requests to use Apache marks for user groups, events, merchandise and publications with nearly all requests being granted, subject to our Trademark Usage Policy. 

Registrations — The transfer of the NETBEANS mark in the EU completed this quarter. The transfer of the US registration is still in progress. We also also working on the agreements to transfer the marks associated with a number of other podlings. These typically span multiple jurisdictions so can take a while to complete.

We are working with counsel to address a number of external registrations that overlap with our own. While progress is being made, this too is a lengthy process and we expect that the issues currently being worked on will not be resolved until well into next calendar year.

Infringements Potential infringements are brought to our attention from both internal and external sources. The majority of infringements we see are accidental and our project communities are able to resolve these quickly and informally with occasional input from the Brand Management team. A small number of issues take longer to resolve. After last quarter's successes in resolving two of these more complex issues, this quarter has seen a small uptick in the number of complex issues we are tracking.

And finally…The Brand Management team  welcomes your comments and suggestions as well as any questions you might have. Please see https://www.apache.org/foundation/marks/contact for our contact details.


> Infrastructure:
 Infrastructure had a great quarter, as this was our yearly gathering at ApacheCon. This year, it was held in Montreal, and we arrived from all corners to discuss work, do some team bonding, and knock out a large number of action items helped by sitting across the table from each other.

Our most interesting item this quarter is that we posted a job opening, and will be working to hire additional teammates. Our last round of hiring was two years ago, yet with the growth of the Foundation, the additional help is very much needed.

The rest of the quarter has been spent with typical activity, across Jenkins, email and gitbox. Jenkins saw a lot of work focusing on disk space issues, the new master we deployed last quarter, and continued upgades to support the many projects using Jenkins. We made significant progress on preparing our new email infrastructure, to move us away from decade-old hardware and a tricky software install. Email is such a critical service to our communities, that we are approaching this migration process with the utmost care. Lastly, we improved many features around our "gitbox" service (close integration with GitHub), such that we hope to migrate all projects within the next quarter or two, and decommission our old git servers.

> Financial Statement:



> Fundraising:
 The Fundraising team has continued building upon the success of previous quarters. We are happy to report that event fundraising is nearly complete for the year and we are ahead of projected budget.

We are delighted to announce Tencent has joined us as a platinum sponsor as well as the return of Alibaba as a silver sponsor this quarter. 

In late September we added three new ambassadors/sponsor relations team members:

  • Ted Liu
  • Craig Russell
  • Bob Paulin

ApacheCon NA 2018 was a wonderful opportunity to meet personally with sponsors and friends of the foundation. We thank those who attended the sponsor lunch as well as everyone who made the event such a success!

We continue to drive out efficiencies in our procedures and systems. The team has begun working on system integrations to automate access to data managed by the Fundraising team. We also continue to watch for and drive out fraudster use of the individual donation platform (Hopsie). Focus on maturing our sponsor relations practice and procedures continues as well.

The recently published targeted sponsorship policy continues to evolve. We have added notes indicating that endorsements are not acceptable per the policy and are working on a draft of a revised policy after feedback and conversation since initial publishing. More to come soon!

The finance team has been considering ideas for securing our financial future via establishing some sort of restricted fund or endowment. Because our sponsors are so important to us, we reached out to several seeking thoughts and opinions. We are happy to report that the feedback was positive and highly valuable. More discussions with sponsors may come as we continue to support the finance team in their evaluations.

= = = 

Thank you to all our Sponsors!

  • PLATINUM: Cloudera, Comcast, Facebook, Google, LeaseWeb, Microsoft, Pineapple Fund, Oath, Tencent
  • GOLD: Anonymous, ARM, Bloomberg, Handshake, Hortonworks, Huawei, IBM, Indeed, ODPi, Pivotal
  • SILVER: Aetna, Alibaba Cloud Computing, Baidu, Budget Direct, Capital One, Cerner, Inspur, Private Internet Access, Red Hat, Target, Union Investment
  • BRONZE: Airport Rentals, Best VPN, The Blog Starter, Bookmakers, Cash Store, Casino Bonus, Casino2k, Cloudsoft, Emerio, HostChecka.com, HostingAdvice.com, HostPapa Web Hosting, The Linux Foundation, Mobile Slots, SCAMS.info, Site Builder Report, Talend, The Best VPN, Twitter, Web Hosting Secret Revealed
  • TARGETED PLATINUM: DLA Piper, Microsoft, Oath, OSU Open Source Labs, Sonatype
  • TARGETED GOLD: Atlassian, The CrytpoFund, Datadog, PhoenixNAP, Quenda
  • TARGETED SILVER: Amazon Web Services, HotWax Systems, Rackspace
  • TARGETED BRONZE: Bintray, Education Networks of America, Google, Hopsie, No-IP, PagerDuty, Peregrine Computer Consultants Corporation, Sonic.net, SURFnet, Virtru 

To Sponsor The Apache Software Foundation, visit http://apache.org/foundation/sponsorship.html . To make a one-time or monthly recurring donation, please visit https://donate.apache.org/

# # #

Report prepared by Sally Khudairi, Vice President Marketing & Publicity, with contributions by Rich Bowen, Vice President Conferences; Sharan Foga, Vice President Community Development; Mark Thomas, Vice President Brand Management; Greg Stein, ASF Infrastructure Administrator; Tom Pappas, Vice President Finance; and Daniel Ruggeri and Kevin McGrail, Vice Presidents Fundraising.

For more information, subscribe to the announce@apache.org mailing list and visit http://www.apache.org/, the ASF Blog at http://blogs.apache.org/, the @TheASF on Twitter, and https://www.linkedin.com/company/the-apache-software-foundation.

(c) The Apache Software Foundation 2018.

Tuesday November 27, 2018

Support Apache: Individual Giving and Corporate Gifts Campaigns

Giving Tuesday is here, kicking off our end-of-year Individual Giving and Corporate Gifts campaigns.


It's one of our favorite times of the year, as giving back feels so good. Here's how you can help:

Individual Donations.

  • Giving to Apache is easy! Click https://donate.apache.org/ to make a one-time or monthly recurring donation using a debit/credit card, ACH electronic transfer, or PayPal. You'll also receive a receipt for your tax-deductible contribution via email.

  • Purchasing Programs. Those of you who shop from Amazon can start your retail journey at https://smile.amazon.com/ so a portion of your purchase will be donated to the ASF. Now that's smart shopping!

  • Additional Options. If you'd rather mail us a check or send a Bitcoin donation or explore other contribution options, please visit http://apache.org/foundation/contributing.html

Corporate Gifts.

Employers of all sizes can feel good as well! This includes companies such as:

  • Bloomberg, whose employee volunteer and giving program provides numerous service opportunities year-round for a wide variety of philanthropic activities. Employees who have earned 25 or 50 hours of volunteer time can request that a donation of $2,500 or $5,000 be made to their select charity with whom has an established relationship Bloomberg Philanthropies. This is the second year that the ASF is among the candidate organizations that employees can donate their hours to as part of their "Dollars For Your Hours" program.

  • Microsoft, who works through volunteer matching platform Benevity to allow their developers to contribute to Open Source projects and communities that they are passionate about. The program donates $25 per volunteer hour, which enables Microsoft developers to contribute back to the ASF while logging hours towards for matching. The Apache Software Foundation has benefited from this and their "Tech Talent for Good" program for several years.

Some organizations contribute to the ASF in the form of a cash donation --whether it's a one-time gift or recurring monthly made at https://donate.apache.org/ , or a matching gifts program, where the company matches the financial donation of an employee to the ASF. We have received contributions that reflected dollar-for-dollar matches, sometimes multiple times the contribution made by employees.

Corporate Sponsorship is also a great way to support the ASF on an annual basis. Those wishing to quickly become a Sponsor using a credit card, ACH transfer, or PayPal may easily do so at https://donate.apache.org/ ; otherwise we invite interested parties to review our Sponsorship program at http://apache.org/foundation/sponsorship.html or contact us at fundraising(at)apache(dot)org.

We thank you in advance for your generous consideration. Through your support, the ASF is able to continue to develop, incubate, and steward 350+ Open Source projects and their communities and keep Apache for everyone. For more information on the ASF's operations and how we put your donated dollars at work, please see our Annual Report for FY2018 https://s.apache.org/FY2018AnnualReport .

# # # 

Wednesday October 31, 2018

The Apache Software Foundation Announces Apache® Subversion® v1.11.0

Popular Open Source version control system used by millions across an array of applications worldwide.

Wakefield, MA —31 October 2018— The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 350 Open Source projects and initiatives, announced today Apache® Subversion® v1.11.0, the latest version of the popular centralized software version control system.

Apache Subversion is characterized by its reliability as a safe haven for valuable data; the simplicity of its model and usage; and its ability to support the needs of a wide variety of users and projects, from individuals to large-scale enterprise operations.

"Subversion 1.11 is the first release which follows our new six-month release cycle," said Stefan Sperling, Vice President of Apache Subversion. "Subversion 1.11 provides iterative improvements of features released in Subversion 1.10, and will be supported for six months."

Apache Subversion v1.11.0 highlights include:

  • More robust shelving of changes in the working copy;
  • Interactive conflict resolution supports more conflict situations involving moved files and directories; and
  • New command to write out a view specification describing the current working copy shape.


Based on the new release cycle, Apache Subversion 1.10 is now considered a long-term support (LTS) release, with a planned support period of four years. The next LTS release will be Subversion 1.14, which is anticipated 18 months from now.

"Our new release plan gives users who value stability plenty of time to schedule upgrades between LTS releases, while users who prefer quick access to new features will be able to get updated approximately every 6 months," added Sperling. "Several new client-side features are marked ‘experimental’ in this 1.11 release. We encourage all our users to test these features and provide feedback."

Availability and Oversight
Apache Subversion software is released under the Apache License v2.0 and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. For downloads, documentation, and ways to become involved with Apache Subversion, visit http://subversion.apache.org/

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than 350 leading Open Source projects, including Apache HTTP Server --the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 730 individual Members and 6,800 Committers across six continents successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Aetna, Anonymous, ARM, Bloomberg, Budget Direct, Capital One, Cerner, Cloudera, Comcast, Facebook, Google, Handshake, Hortonworks, Huawei, IBM, Indeed, Inspur, LeaseWeb, Microsoft, Oath, ODPi, Pineapple Fund, Pivotal, Private Internet Access, Red Hat, Target, and Union Investment. For more information, visit http://apache.org/ and https://twitter.com/TheASF

© The Apache Software Foundation. "Apache", "Subversion", "Apache Subversion", and "ApacheCon" are registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. All other brands and trademarks are the property of their respective owners.

# # #

Calendar

Search

Hot Blogs (today's hits)

Tag Cloud

Categories

Feeds

Links

Navigation