Entries tagged [opensource]

Wednesday April 24, 2019

The Apache Software Foundation Announces Apache® SkyWalking™ as a Top-Level Project

Open Source Application Performance Monitor (APM) tool in use at Alibaba, China Eastern Airlines, Huawei, and WeBank, among others.

Wakefield, MA —24 April 2019— The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 350 Open Source projects and initiatives, announced today Apache® SkyWalking™ as a Top-Level Project (TLP).

Apache SkyWalking is an application performance monitor (APM) tool that provides an automatic, highly efficient way to instrument microservices, cloud native, and container-based applications. The project was originally developed in 2015, and entered the Apache Incubator in December 2017.

"This is a special day for the SkyWalking project and its community. We thank our mentors, contributors, and the Apache Incubator for helping us achieve this goal," said Sheng Wu, Vice President of Apache SkyWalking. "The original agenda behind SkyWalking was to help newcomers understand what is distributed tracing, and the community has grown bigger and stronger since we entered the Apache Incubator. Through The Apache Way, SkyWalking has a very active and diverse community, is used by over 70 companies, and has over 100 source contributors from dozens of different organizations."

Apache SkyWalking provides tracing, service mesh telemetry analysis, metric aggregation and visualization for the distributed system. The project landscape has expanded from a pure tracing system, to an observability analysis platform, and application performance management/monitoring system. Features include:

  • Distributed tracing-based APM: 100% traces collected with low payload for original system;
  • Cloud-native friendly: observe distributed system powered by service mesh, Istio and Envoy;
  • Automated source code change: multiple language agents provided, especially with auto instrumentation supported, in Java, .NET and Nodejs;
  • Easy to operate: doesn’t require Big Data in monitoring large scale distributed system; and
  • Advanced visualization: used in traces, metrics and topology map.

Apache SkyWalking is in use at dozens of organizations that include 5i5j Group, Alibaba, autohome.com, China Eastern Airlines, China Merchants Bank, Daocloud, dangdang.com, guazi.com, Huawei, ke.com, iFLYTEK, primeton.com, Sinolink Securities, tetrate.io, tuhu.cn, tuya.com, WeBank, Yonghui Superstores, youzan.com, and more.

"Instrumentation is unquestionably the most time-consuming part of establishing a distributed tracing solution into an existing platform. I had the chance to code with some of the SkyWalking community earlier on and could see the quality being invested back then," said Mick Semb Wever, ASF Member and Apache SkyWalking Incubating Mentor. "When they were looking for mentors and a champion to help them create a proposal to become an Apache project, I was excited at the opportunity to help bring the project to the Apache Incubator, and was pleasantly surprised to see how prepared, and ASF-like, the SkyWalking community and project had already become. As was the case with Apache Kylin, SkyWalking has not only been a model project during the incubation process, they have also become ambassadors on open development The Apache Way to the greater Open Source community in China. Congratulations on graduating as an Apache Top-Level Project."

"SkyWalking is one of the only Open Source tracing systems where usability and user interface have been a focus, something missing in most Open Source projects," said Jonah Kowall, CTO at Kentik, and former VP Research at Gartner. "Making tracing and APM more easily used by developers and operations team is a key goal which makes Apache Skywalking a project to watch."

"Apache SkyWalking has done a lot of work in spreading modern cloud native observability in China and across the world," said Chris Aniczszyk, CTO and COO of the Cloud Native Computing Foundation. "We are happy to see Apache SkyWalking become a TLP and look forward to their community growing and collaborating with CNCF projects like Kubernetes, Envoy, Jaeger and more."

"I hear regularly from users that observability is the most important feature they're getting out of their service mesh," said Zack Butcher, Core Contributor to Istio. "By integrating Apache SkyWalking with Istio, the SkyWalking team has brought their incredible tools for deeply understanding system behavior to the mesh. We've already seen great results, and I can't wait to see what further insights users unlock using Apache SkyWalking together with Istio to observe and manage their deployments."

"At WeBank, we use different banking architectures, from distributed architecture to Open Source technologies. We’ve built a messaging bus called WeMQ based on Apache RocketMQ that fully utilizes the benefits of messaging by implementing various messaging techniques in different scenarios, such as message exchanges, pub/sub and request/reply models," said Eason Chen, WeBank Tech Specialist, and Apache RocketMQ Contributor. "However, after adding different messaging services that are critical to our business, we realized there is a need for a universal visual traceable system for the distributed message to help us to diagnosis problem of applications. We believe Apache SkyWalking can address our current challenges, and we look forward to contributing to its efforts."

"I am very glad to see SkyWalking has been promoted as Apache Top-Level Project," said Lie Mao, Architect at China Eastern Airlines IT Solution Department. "Apache SkyWalking is integrated into the China Eastern Airlines microservice architecture support platform. SkyWalking provides practical features and visualization capabilities about topology map and distributed tracing, to help us understand the distributed system. I hope the Open Source community can contribute more plugins to Apache SkyWalking to enhance its role in the multi-language hybrid architecture."

"I found SkyWalking in 2017. In two years, it has grown very fast, and the community is very active," said DongXue Si, Senior Software Engineer at CloudWise Inc. "The project is adopted by many companies, and is attracting a lot of developers. Apache SkyWalking makes application performance monitoring easier and more convenient. I believe it will be better and better powered by its diversity community: Bless it."

"As early adopters of SkyWalking, we are very glad to see it graduate as an Apache Top-Level Project," said Liang Zhang, Architect at JD.com, Podling Project Management Committee member of Apache ShardingSphere (incubating), and former Architect at dangdang.com. "Dangdang.com adopted SkyWalking much earlier before it joined the Apache Incubator: we have witnessed its development, new features, and community growth. It is a very good example for Apache ShardingSphere (incubating). I look forward to our projects cooperating on observability in databases, and building a better Open Source ecosystem together."

"Congratulations to SkyWalking for becoming an Apache Top Level project," said Yuqi Zhou, Middleware Development Manager at Sinolink Securities Co. "Apache SkyWalking’s elegant design and good performance solves the our tracing and monitoring needs. Thanks to the Open Source community for bringing us such an awesome project: I wish it continued success."

"In helping enterprise customers transform their business application from traditional architecture to a Microservices architecture, one of the most important aspects of the microservices governance platform is its observability to obtain invocation relationships between components, as well as inside service itself, and to generate statistics based on these data, including SLA of services provided to the outside world," said Grissom Wang, Chief Architect at DaoCloud. "We surveyed a number of similar Open Source technologies and eventually chose Apache SkyWalking as one of the core components of DaoCloud Microservices platform because of its openness, extendibility, high performance, excellent code quality, active community, and forward-looking integration with Istio."

"Congrats SkyWalking being an Apache TLP," said Niangang Xu, co-founder of Yonghui Cloud Computing. "Apache SkyWalking helps us to improve the design of microservice, and has been enabling us to manage and observe a lot of distributed systems at scale!"

"SkyWalking is on its way to becoming a world wide Open Source project," added Wu. "We welcome everyone to participate on our mailing lists, GitHub, and Slack channels, and to learn more through our events, presentations, Website, and documents."

Catch Apache SkyWalking in action at SkyWalking DevCon (Shanghai; 11 May 2019), GIAC (Shenzhen; 21-23 June 2019), KubeCon + CloudNativeCon China (Shanghai; 25-26 June 2019), ApacheCon North America  (Las Vegas; 9-12 September 2019), and DevOps Stage (Kiev; 18-19 October 2019).

Availability and Oversight
Apache SkyWalking software is released under the Apache License v2.0 and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. For downloads, documentation, and ways to become involved with Apache SkyWalking, visit http://skywalking.apache.org/ and https://twitter.com/ASFSkyWalking

About the Apache Incubator
The Apache Incubator is the entry path for projects and codebases wishing to become part of the efforts at The Apache Software Foundation. All code donations from external organizations and existing external projects wishing to join the ASF enter through the Incubator to: 1) ensure all donations are in accordance with the ASF legal standards; and 2) develop new communities that adhere to our guiding principles. Incubation is required of all newly accepted projects until a further review indicates that the infrastructure, communications, and decision making process have stabilized in a manner consistent with other successful ASF projects. While incubation status is not necessarily a reflection of the completeness or stability of the code, it does indicate that the project has yet to be fully endorsed by the ASF. For more information, visit http://incubator.apache.org/

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than 350 leading Open Source projects that provide $20B+ worth of Apache Open Source software to the public at 100% no cost. Through the ASF's merit-based process known as "The Apache Way," more than 730 individual Members and 7,000 Committers across six continents successfully collaborate to develop freely available enterprise-grade software, benefiting billions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Aetna, Alibaba Cloud Computing, Anonymous, ARM, Baidu, Bloomberg, Budget Direct, Capital One, Cerner, Cloudera, Comcast, Facebook, Google, Handshake, Hortonworks, Huawei, IBM, Indeed, Inspur, Leaseweb, Microsoft, ODPi, Pineapple Fund, Pivotal, Private Internet Access, Red Hat, Target, Tencent, Union Investment, Workday, and Verizon Media. For more information, visit http://apache.org/ and https://twitter.com/TheASF

© The Apache Software Foundation. "Apache", "SkyWalking", "Apache SkyWalking", "Kylin", "Apache Kylin", "RocketMQ", "Apache RocketMQ", and "ApacheCon" are registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. All other brands and trademarks are the property of their respective owners.

# # #

Tuesday March 26, 2019

20 Years of Open Source Innovation, The Apache Way

by Jim Jagielski and Sally Khudairi

As the world's largest and one of the most influential Open Source foundations, The Apache Software Foundation (ASF) is home to more than 350 community-led projects and initiatives. The ASF's 731 individual Members and more than 7,000 Committers are global, diverse, and often embodies a case of collective humility. We've assembled a list of 20 ubiquitous and up-and-coming Apache projects to celebrate the ASF's 20th Anniversary on 26 March 2019, applaud our all-volunteer community, and thank the billions of users who benefit from their Herculean efforts.


1. Apache HTTP Server
Web/Servers. http://httpd.apache.org/

The most popular Open Source HTTP server on the planet shot to fame just 13 months from its inception in 1995, and remains so today due to its ability to provide a secure, efficient and extensible server that provides HTTP services observing the latest HTTP standards. Serving modern operating systems including UNIX, Microsoft Windows, and Mac OS/X, the Apache HTTP Server played a key role in the initial growth of the World Wide Web; its rapid adoption over all other Web servers combined was also instrumental to the wide proliferation of eCommerce sites and solutions. The Apache HTTP Server project was the ASF's flagship project at its launch, and served as the basis upon which future Apache projects emulated with its open, community-driven, merit-based development process known as "The Apache Way".


2. Apache Incubator
Innovation. http://incubator.apache.org/

The Apache Incubator is the ASF's nexus for innovation, serving as the entry path for projects and codebases wishing to officially become part of the efforts at The Apache Software Foundation. All code donations from external organizations and existing external projects go through the incubation process to ensure all donations are in accordance with the ASF legal standards, and develop diverse communities that adhere to the ASF's guiding principles. Incubation is required of newly accepted projects until their infrastructure, communications, and decision making process have stabilized in a manner consistent with other successful ASF projects. Whilst incubation is neither a reflection of the completeness or stability of the code, nor does it indicate that the project has yet to be fully endorsed by the ASF, its rigorous process of mentoring projects and their communities according to "The Apache Way" has led to the graduation of nearly 200 projects in the Incubator's 16-year history. Today 51 "podlings" are undergoing development in the Apache Incubator across an array of categories, including annotation, artificial intelligence, Big Data, cryptography, data science/storage/visualization, development environments, Edge and IoT, email, JavaEE, libraries, machine learning, serverless computing, and more.


3. Apache Kafka
Big Data. https://kafka.apache.org/

The Apache footprint as the foundation of the Big Data ecosystem continues to grow, from Accumulo to Hadoop to ZooKeeper, with fifty active projects to date and two dozen more in the Apache Incubator. Apache Kafka's highly-performant distributed, fault tolerant, real-time publish-subscribe messaging platform powers Big Data solutions at Airbnb, LinkedIn, MailChimp, Netflix, The New York Times, Oracle, PayPal, Pinterest, Spotify, Twitter, Uber, Wikimedia Foundation, and countless other businesses.


4. Apache Maven
Build Management. http://maven.apache.org/

Spinning out of the Apache Turbine servlet framework project in 2004, Apache Maven has risen to the top as the hugely popular build automation tool that helps Java developers build and release software. Stable, flexible, and feature-rich, Maven streamlines continuous builds, integration, testing, and delivery processes with an impressive central repository and robust plug-in ecosystem, making it the go-to choice for developers who want to easily manage a project’s build, reporting, and documentation.


5. Apache CloudStack
Cloud Computing. http://cloudstack.apache.org/

Super-quick to deploy, well-documented, and with an easy production environment, one of the biggest draws to Apache CloudStack is that it "just works". Powering some of the industry's most visible Clouds –from global hosting providers to telcos to the Fortune 100 top 5% and more– the CloudStack community is cohesive, agile, and focused, leveraging 11 years of Cloud success to enable users to rapidly and affordably build fully featured clouds.


6. Apache cTAKES
Content. http://ctakes.apache.org/

Developed from real-world use at the Mayo Clinic in 2006, cTAKES was created by a team of physicians, computer scientists and software engineers seeking a natural language processing system for extraction of information from electronic medical record clinical free-text. Today Apache cTAKES is an integral part of the Mayo Clinic's electronic medical records and has processed more than 80 million clinical notes. Apache cTAKES is a growing standard for clinical data management infrastructure across hospitals and academic institutions that include Boston Children’s Hospital, Cincinnati Children’s Hospital, Massachusetts Institute of Technology, University of Colorado Boulder, University of Pittsburgh, and University of California San Diego, as well as companies such as Wired Informatics.


7. Apache Ignite
Data Management. https://ignite.apache.org/

Apache Ignite is used for transactional, analytical, and streaming workloads at petabyte scale for the likes of American Airlines, ING, Yahoo Japan and countless others on premises, on cloud platforms, or in hybrid environments. Apache Ignite's in-memory data fabric provides an in-memory data grid, compute grid, streaming, and acceleration solutions across the Apache Big Data system ecosystem, including Apache Cassandra, Apache Hadoop, Apache Spark, and more.


8. Apache CouchDB
Databases. http://couchdb.apache.org/

Thousands of organizations such as the BBC, GrubHub, and the Large Hadron Collider use Apache CouchDB for seamless data flow between every imaginable computing environment, from globally-distributed server clusters to mobile devices to Web browsers. Its Couch Replication Protocol allows you to store, retrieve, and replicate data safely on premises or on the Cloud with very high performance reliability. Apache CouchDB does all the heavy lifting so you can sit back and relax.


9. Apache Edgent (incubating)
Edge computing. http://edgent.incubator.apache.org/

The boom of IoT –personal assistants, smart phones, smart homes, connected cars, Industry 4.0 and beyond– is producing an ever-growing amount of data streaming from millions of systems, sensors, equipment, vehicles and more. The demand for reliable, efficient real-time data has driven the need for the "Empowered Edge", where data collection and analysis is optimized by moving away from centralized sources towards the edges of of the networks, where much of the data originates. Companies like IBM and SAP are leveraging Apache Edgent to accelerate analytics at the edge across the IoT ecosystem. Apache Edgent can be used in conjunction with many Apache data analytics solutions such as Apache Flink, Apache Kafka, Apache Samza, Apache Spark, Apache Storm, and more.


10. Apache OFBiz
Enterprise Resource Planning (ERP). https://ofbiz.apache.org/

Whereas most of the ASF projects are about running or creating infrastructure, we also realize the importance of running and handling a business. Apache OFBiz is a comprehensive suite of business applications from accounting and CRM through Warehousing and Inventory control. The Java based framework provides the power and the flexibility to serve as the core of one's B2B and B2C business management and is easily expandable and customizable. Apache OFBiz is a complete ERP solution, flexible, free, and fully Open Source and services users from United Airlines to Cabi.


11. Apache SIS (Spatial Information System)
Geospatial. http://sis.apache.org/

The US National Oceanic and Atmospheric Administration, Vietnamese National Space Center, numerous spatial agencies, governments, and others rely on Apache SIS to create their own intelligent, standards-based interoperable geospatial applications. The Apache SIS toolkit handles spatial data, location awareness, geospatial data representation, and provides a unified metadata model for file formats used for real-time smart city visualization, geospatial dataset discovery, state-of-the-art location-enabled emergency management, earth observation, as well as information modeling for extra-terrestrial bodies such as Mars and asteroids.


12. Apache Syncope
Identity Management. http://syncope.apache.org/

Apache Syncope manages digital identity data in enterprise applications and environments to handle user information such as username, password, first name, last name, email address, etc. Identity management involves considering user attributes, roles, resources and entitlements that control who access to what data, when, how, and why. Apache Syncope users include the Italian Army, the University of Helsinki, University of Milan, and the SWITCH Swiss university network.


13. Apache PLC4X (incubating)
Internet of Things (IoT). http://plc4x.incubator.apache.org/

Connectivity and integration across many Industrial IoT edge gateways is often impossible with closed-source, proprietary legacy systems with incompatible protocols. Apache PLC4X provides a universal protocol adapter for creating Industrial IoT applications through a set of libraries that allow unified access to any type of industrial programmable logic controllers (PLCs) using a variety of protocols with a shared API. In addition, the project is planning integrations modular to Apache IoT projects that include Apache Brooklyn, Apache Camel, Edgent, Apache Kafka, Apache Mynewt, and Apache NiFi.


14. Apache Commons
Libraries. http://commons.apache.org/

With 42%+ of Apache projects written in Java (that's 62+ million lines of code), having a set of stable, reusable Open Source Java software components available to all Apache projects and external users is both helpful and necessary. Apache Commons provides a suite of dozens of stable, reusable, easily deployed Java components, and a workspace for Commons contributors to collaborate on the development of new components.


15. Apache Spark
Machine Learning. http://spark.apache.org/

Big Data is growing exponentially each year, accelerated by industries such as agriculture, big business, FinTech, healthcare, IoT, manufacturing, mobile advertising and more. Apache Spark's unified analytics engine for processing and analyzing large-scale data processing helps data scientists apply machine learning insights and an array of libraries to improve responsiveness more accurate results. Apache Spark runs workloads 100x faster on Apache Hadoop, Apache Mesos, Kubernetes, whether standalone or in the cloud, and to access diverse data sources, from Apache Cassandra, Apache Hadoop HDFS, Apache HBase, Apache Hive, and hundreds of others.


16. Apache Cordova
Mobile. https://cordova.apache.org/

Apache Cordova is the popular developer tool used to easily build cross-platform, cross-device mobile apps using a Write-Once-Run-Anywhere solution, which enabling developers to create a single app that will appear the same across multiple mobile device platforms. Apache Cordova acts as an extensible container, and serves as the base that most mobile application development tools and frameworks are built upon, including mobile development platforms and commercial software products by Blackberry, Google, IBM, Intel, Microsoft, Oracle, Salesforce, and many others.


17. Apache Tomcat
Java/Servers. https://tomcat.apache.org/

Starting off as the Apache JServ project, designed to allow for Java "servlets" to be run in a Web environment, Tomcat grew to being a full-fledged, comprehensive Java Application server and was the de-facto reference implementation for the Java specifications. Since 2005, Apache Tomcat has formed, and still forms, the foundation of numerous Java-based web infrastructures such as eBay, E*Trade, WalMart, and The Weather Channel.


18. Apache Lucene/Solr
Search. http://lucene.apache.org/solr/

Adobe, AOL, Apple, AT&T, Bank of America, Bloomberg, Cisco, Disney, eTrade, Ford, The Guardian, Homeland Security, Instagram, MTV Networks, NASA Planetary Data System, Netflix, SourceForge, Verizon, Walmart, whitehouse.gov, Zappos, and countless others turn to Apache Lucene Solr to quickly and reliably index and search multiple sites and enterprise data such as documents and email. Popular features include near real-time indexing, automated failover and recovery, rich document parsing and indexing, user-extensible caching, design for high-volume traffic, and much more. 


19. Apache Wicket
Web Framework. http://wicket.apache.org/

The Apache Wicket component-based Web application framework is prized by many followers for its "Plain Old Java Object" (POJO) data model and markup/logic separation not common in most frameworks. Developers have been using Apache Wicket since 2004 to quickly create powerful, reusable components using object oriented methodology with Java and HTML. Wicket powers thousands of applications and sites for governments, stores, universities, cities, banks, email providers, and more, including Apress, DHL, SAP, Vodafone, and Xbox.com.


20. Apache Daffodil (incubating)
XML. http://daffodil.apache.org/

Governments handle massive amounts of complex and legacy data across security boundaries every day. In order for such data to be consumed, it must be inspected for correctness and sanitized of malicious data. Whilst traditional inspection methods are often proprietary, incomplete, and poorly maintained, Apache Daffodil streamlines the process with an Open Source implementation of the Data Format Description Language specification (DFDL) that fully describes a wide array of complex and legacy file formats down to the bit level. Daffodil can parse data to XML or JSON to allow for validation, sanitization, and transformation, and also serialize or ''unparse'' back to the original file format, effectively mitigating a large variety of common vulnerabilities.

The Apache Software Foundation is a leader in community-driven open source software and continues to innovate with dozens of new projects and their communities. Apache projects are managing exabytes of data, executing teraflops of operations, and storing billions of objects in virtually every industry. Apache software is an integral part of nearly every end user computing device, from laptops to tablets to phones. The commercially-friendly and permissive Apache License v2.0 has become an open source industry standard. As the demand for quality open source software continues to grow, the collective Apache community will continue to rise to the challenge of solving current problems and ideate tomorrow’s opportunities through The Apache Way of open development. Learn more at http://apache.org/

# # # 

Thursday March 21, 2019

The Apache Software Foundation Announces Apache® Unomi™ as a Top-Level Project

Powerful Open Source Customer Data Platform in use at Al-Monitor, Altola, Jahia, and Yupiik, among others. 

Wakefield, MA —21 March 2019— The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 350 Open Source projects and initiatives, announced today Apache® Unomi™ as a Top-Level Project (TLP).

Apache Unomi is a standards-based, Customer Data Platform (CDP) that manages online customer, leads, and visitor information to provide personalized experiences that adheres to visitor privacy rules such as GDPR and “Do Not Track” preferences. The project was originally developed at Jahia, and was submitted to the Apache Incubator in October 2015.

"I am truly thankful to our community, especially our mentors, who have helped us achieve this milestone," said Serge Huber, Vice President of Apache Unomi. "The original vision behind Unomi was to ensure true privacy by making the technologies handling customer data completely Open Source and independent. Since it was submitted to the Apache Incubator, developing Unomi using the Apache Way will ensure the project grows its community to be more diverse and welcome new users and developers."

Apache Unomi is versatile, and features privacy management, user/event/goal tracking, reporting, visitor profile management, segmentation, personas, A/B testing, and more. It can be used as:

  • a personalization service for a Web CMS;

  • an analytics service for  native mobile applications;

  • a centralized profile management system with segmentation capabilities; and

  • a consent management hub

Apache Unomi is the industry's first reference implementation of the upcoming OASIS CDP specification (established by the OASIS CXS Technical Committee, which sets standards as a core technology for enabling the delivery of personalized user experiences). As a reference implementation, Apache Unomi serves as a real world example of how the standard will be stable, and is quickly gaining traction by those interested in truly open and transparent customer data privacy. Apache Unomi is in use at organizations such as Al-Monitor, Altola, Jahia, Yupiik, and many others to create and deliver consistent personalized experiences across channels, markets, and systems.

"When Serge and I announced the launch of the Apache Unomi project at the 2015 ApacheCon Budapest, Apache Unomi, at that time, was the first proposal among the rising Customer Data Platform industry's segment, positioned as an 'ethical data-driven marketing' product that would respect the privacy of customers while leveraging the power of unified customers data," said Elie Auvray, Head of Business Development at Jahia. "Jahia's digital experience management solutions are based on Apache Unomi, and we can't wait to see how the project will now evolve with its growing community. Seeing today Apache Unomi becoming a Top-Level Project is a great reward for us as Open Source software believers. We are proud of this milestone, grateful to the Apache Software Foundation and our mentors, and we know it's only the beginning of a new –hopefully long and successful– journey."

"Under development at OASIS, the Customer Data Platform specification –for which Apache Unomi aims to be the reference implementation– lies at the crossroads of many solutions providers needs such as WCM, CRM, Big Data Platforms, Machine Learning, IoT and Digital Marketing," said Laurent Liscia, CEO of OASIS. "At a time when client data interoperability and built-in data privacy are mandatory foundations for legal, consistent, and personalized experiences across channel markets and systems, the CDP specification, together with Apache Unomi, is a clear and welcome answer to end-user concerns."

"Apache Unomi is the perfect solution to implement a user profile platform," said Jean-Baptiste Onofré, Fellow at Talend. "It fully addresses the user trust and privacy needs, allowing to easily create user profile and Web marketing features. As Unomi is powered by Apache Karaf, it's also a great platform for several use cases, such as digital marketing in Web applications, managing user profiles on IoT devices, and more."

"Apache Unomi enables Al-Monitor readers to be driven towards additional personalized content that corresponds, via content tags profiling and related automated segmentations, to what they have already accessed," said Valerie Voci, Head of Digital Strategy and Marketing at Al-Monitor. "This data follows our customers where they go, so it's a consistent experience whether they are getting these recommendations in their inbox or on the Website or both. And if a change takes place on one, that change is immediately reflected on the other. It helps us create a very cohesive marketing message and a great overall digital experience."

"As we were developing a progressive web app (PWA) for a client, we were looking for a Customer Data Platform (CDP) to store customer insights, such as behavioral and explicit customer data," said Lars Petersen, Co-Founder at Altola. "Privacy was table stake for us, along with the flexibility to customize data schema and open API. We selected Apache Unomi based on these parameters, we had it up and running on AWS in less than 30 min. and are very impressed with the maturity of the platform, its privacy by design and how easy it was to work with."

"In a digital world, customer data is very important to offer a better experience to users. However, data privacy and trust is not an option for users," said François Papon, CTO at Yupiik. "Apache Unomi is the best solution for our clients because it's an Open Source project managed by an independent foundation, there is no vendor lock-in. It's also based on other solutions like Apache Karaf that made it ready for modularity, scalability, cloud, devops, and more." 

"Apache Unomi is poised to disrupt the Customer Data Platform market," said Thomas Sigdestad, CTO at Enonic, and co-chair, with Serge Huber, of the CDP standards work at OASIS open. "The CDP marketplace is lacking from a standard way of exchanging data, and the vendor space is over-represented by closed source and proprietary cloud offerings. This effectively limits the potential and adoption of CDP in general. Apache Unomi is not merely Open Source, but also the reference implementation of the imminent CDP standard from OASIS. Companies using Unomi will benefit from faster and simpler integrations without locking their customer data into yet another proprietary silo." 

"Graduating as an Apache Top-Level Project is only the beginning," added Huber. "Unomi has a lot of potential that it still to be developed, and is a perfect opportunity for those interested in Customer Data Privacy to participate through our mailing lists and Slack channel, and to learn more about the project on our Website and presentations."

Catch Apache Unomi in action at ApacheCon North America (9-12 September 2019 in Las Vegas, Nevada), and ApacheCon Europe (22-24 October 2019 in Berlin, Germany) http://apachecon.com/ .

Availability and Oversight
Apache Unomi software is released under the Apache License v2.0 and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. For downloads, documentation, and ways to become involved with Apache Unomi, visit http://unomi.apache.org/

About the Apache Incubator
The Apache Incubator is the entry path for projects and codebases wishing to become part of the efforts at The Apache Software Foundation. All code donations from external organizations and existing external projects seeking to join the ASF enter through the Incubator to: 1) ensure all donations are in accordance with the ASF legal standards; and 2) develop new communities that adhere to our guiding principles. Incubation is required of all newly accepted projects until a further review indicates that the infrastructure, communications, and decision making process have stabilized in a manner consistent with other successful ASF projects. While incubation status is not necessarily a reflection of the completeness or stability of the code, it does indicate that the project has yet to be fully endorsed by the ASF. For more information, visit http://incubator.apache.org/

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than 350 leading Open Source projects, including Apache HTTP Server --the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 730 individual Members and 7,000 Committers across six continents successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Aetna, Alibaba Cloud Computing, Anonymous, ARM, Baidu, Bloomberg, Budget Direct, Capital One, Cerner, Cloudera, Comcast, Facebook, Google, Handshake, Hortonworks, Huawei, IBM, Indeed, Inspur, Leaseweb, Microsoft, ODPi, Pineapple Fund, Pivotal, Private Internet Access, Red Hat, Target, Tencent, Union Investment, Workday, and Verizon Media. For more information, visit http://apache.org/ and https://twitter.com/TheASF

© The Apache Software Foundation. "Apache", "Unomi", "Apache Unomi", and "ApacheCon" are registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. All other brands and trademarks are the property of their respective owners.

Tuesday March 19, 2019

The Apache Way to Sustainable Open Source Success

As Open Source software continues to grow in importance, it seems appropriate to reflect upon the ongoing success of The Apache Software Foundation (ASF) as it approaches its 20th anniversary. The Apache Way of community-driven development continues to gain momentum despite the compounding challenges of building software in the greater Open Source ecosystem.

This approach, The Apache Way, was defined over 24 years ago by the original Apache Group, prior to the establishment of the Foundation. It has led to our success as a foundation and we believe it has been fundamental to the triumph of Open Source as a whole.

While The Apache Way has been refined over the years, it remains true to the original goals of transparent, community-driven collaboration in a vendor-neutral environment that is accessible to all.

The Apache Way defines Open Source in terms of both a legal and a social framework for collaboration. It helps others understand what makes Open Source powerful and how participants are expected to behave. In this post we will examine The Apache Way in the context of the Foundation's mission:

"The mission of the Apache Software Foundation (ASF) is to provide software for the public good. We do this by providing services and support for many like-minded software project communities consisting of individuals who choose to participate in ASF activities." 

Let's dissect this mission statement. 

"Provide Software for the Public Good"

Key points in this section: 

  • We produce software that is non-excludable and non-rivalrous

  • Use of the software in any context does not reduce its availability to others

  • Users and contributors have no committed responsibility to the foundation, our projects or our communities

  • Use of a license that conforms to the Open Source Definition is necessary but not sufficient to deliver on our mission 

Investopedia defines a public good as "a product that one individual can consume without reducing its availability to another individual, and from which no one is excluded." On the surface, this is a good definition for our use of the term. However, there is a nuance in our use. Our mission is not to produce "public goods" but to "provide software for the public good". 

To understand why this is important, one needs to think about what motivates the ASF to produce software that is a public good.

Open Source software can be digitally copied and reused in an unlimited number of ways. Every user can modify it for their specific needs. They can combine it with other software. They can design innovative new products and services using it and can make a living from the proceeds. This is all possible without impacting other people's use of the software. As such, the ASF produces software that can be used for the public good in many different ways.

To allow us to deliver on this part of the mission, it is critical that we adopt a license that uses the law to protect the software curated here at the Foundation. For us that license is the Apache License, Version 2. In addition, we adopt an inbound licensing policy that defines which licenses are allowable on software reused within Apache projects. This policy can be summarized as: 

  • The license must meet the Open Source Definition (OSD).

  • The license, as applied in practice, must not impose significant restrictions beyond those imposed by the Apache License 2.0.

This means that you can be assured that software curated by projects within The Apache Software Foundation is both a public good and for the public good. You can use Apache software for any purpose and you have no responsibility to the Foundation or the project to contribute back (though as addressed in the next section, it is often in your interests to do so). 

It is important to recognize that there are software projects out there that adopt our license but do not adopt our inbound licensing policy. Such projects may bring restrictions that are not covered by our license; therefore, it is important to carefully examine the licensing policies of these projects. Using the Apache License alone may not provide you with the same options a Foundation project provides. 

Apache projects are successful, in large part, because of our diligence with respect to clearly-defined licensing policies. Such diligence makes it much easier for downstream users to understand what they can and cannot do with Apache software. The Apache License is deliberately permissive to ensure that everyone has an opportunity to participate in Open Source within the ASF or elsewhere. Modifications of our license are allowed, but modified licenses are neither the Apache License nor affiliated with or endorsed by The Apache Software Foundation. No modified license can be represented as such. Modified licenses that use the Apache name are strictly disallowed, as they are both confusing to users and undermine the Apache brand.

While we recognize that there are many ways to license software, whether Open Source or otherwise, we assert that only projects that use both our license (unmodified) and our inbound licensing policy truly follow and adhere to The Apache Way. 

While an OSD-approved license and associated policies are necessary for successful Open Source production, they are not sufficient. They provide a legal framework for the production of Open Source, but they do not provide a social framework, which brings us to the second sentence of our mission:

"The mission of the Apache Software Foundation is to provide software for the public good. We do this by providing services and support for many like-minded software project communities of individuals who choose to contribute to Apache projects."

"Like-Minded Software Project Communities of Individuals"

Key points in this section: 

  • The Apache Way provides a governance model designed to create a social framework for collaboration

  • The Apache Software Foundation develops communities, and those communities develop software

  • ASF project communities develop and reuse software components that in turn may be reused in products

  • Users of ASF software often build products and services using our software components

  • Our model, and others like it, have produced some of the largest and longest-lived Open Source projects that have literally revolutionized the industry 


There is a lot packed into these few words. It is an understanding of these words that makes the difference between software that is under an Open Source license and software that reaches sustainability through The Apache Way. These words underscore the fact that the Foundation does not directly produce software. That's right, The Apache Software Foundation, with upwards of $8Bn of software code, does not directly produce software. Rather than focus on software, we focus on the creation of and support of collaborative communities; the software is an intentional by-product. 

Our like-minded project communities come together because they share common problems that can be addressed in software. As the saying goes, "a problem shared is a problem halved". By bringing together individuals with their unique ideas and skills, we break down barriers to collaboration. 

The Apache Way is carefully crafted to create a social structure for collaboration, which complements the legal framework discussed above. Where the legal framework ensures an equal right to use the software, The Apache Way ensures an equal ability to contribute to the software. This is critically important to the long term sustainability of Open Source software projects. This social structure for collaboration is missing from many non-Apache projects, yet a robust social structure is invariably a key component in long-term successful projects outside of the ASF.

The Apache Way is fully inclusive, open, transparent and consensus-based. It promotes vendor neutrality to prevent undue influence (or control) from a single company. It ensures that any individual with a valuable contribution is empowered, and it seeks to assure that a project remains sustainable despite inevitable changes in community membership over time.

Apache projects typically produce software components that can be combined with other software (of any license) in different ways to solve different problems. This provides plenty of opportunity for participants to collaborate within a given software project independent of their relationship outside the Foundation. This is very different from the idea of licensing your product as a whole under an Open Source license. Our model offers more opportunities for reuse which, in turn, increase the pool of individuals likely to contribute to the project.

In addition, our merit-based system seeks to ensure that as people come and go, for whatever reason, there is always someone to take their place. As a result, some ubiquitous Apache projects have existed for over 20 years and helped commercialize the World Wide Web; while dozens of newer projects have defined industry segments such as Big Data and IoT (Internet of Things). 

A core tenet of The Apache Way is "Community Over Code", which encapsulates our deep belief that a healthy community is a far higher priority than good code. A strong community can always rectify a problem with the code, whereas an unhealthy community will likely struggle to maintain a codebase in a sustainable manner. Healthy communities ensure the Foundation has the stability to thrive for the next 20 years and beyond. Apache projects do not have the problem of scaling that others, who focus only on the legal frameworks of Open Source, suffer from. If you look around at projects that have grown up alongside the Apache projects, you will see a similar focus on scaling the governance model. This is no accident. 

Why this is Important

Software is a critical part of any modern economy. It touches every part of every life in the developed world, and is increasingly transforming everyday life, from womb to grave, everywhere.

At The Apache Software Foundation, we believe that every developer has their personal motivations for building software. We celebrate their right to choose when and how they build their software, including their right to use a non-open license. 

We will not dictate what is best for developers or for the software industry.

We care about the provision of software that enables our users, our contributors, and the general public to decide what is best for them.

We welcome you to use our software and contribute to our projects -- or not. It's up to you. 

We ask that you leave commercial interests at the door.

Countless organizations are proving that their team members who collaborate in a vendor-neutral environment often apply Open Innovation processes (such as The Apache Way) to their work. This helps create internal efficiencies and lays the groundwork for new external opportunities that may provide additional added benefits.

Bringing only your intention of contributing what best serves the greater Apache community reinforces trust in the people and projects behind the Apache brand, and helps us realize our mission of providing software for the public good. 

We learn together and work together to deliver the best software we can. 

Apache software is available for all.

The freedom to choose is what makes the Foundation and Apache projects so strong.

Summary

The software industry has changed and continues to change. The ways software is delivered to end users have changed. Some of the leaders in our industry have retired and new leaders have emerged. But some things have not changed. Our model of collaborative software development, through a combination of a licensing and social framework, remains one of the most successful models of software production.

Increasing the number of users, even those who do not contribute to code, should be seen as a benefit, not a problem, in Open Source. More users present an opportunity. At Apache, more users means more success since they are our future contributors.

As a US 501(c)(3) public charitable organization, The Apache Software Foundation helps individuals and organizations understand how Open Source at scale works in a highly competitive market. For more than two decades our focus has not been on producing software, but rather mentoring communities who produce software. The Apache Way advances sustainable Open Source communities: everything we do is Open Source so all kinds of users can benefit from our experience. Apache is for everyone.

# # #

Tuesday February 19, 2019

The Apache® Software Foundation Announces Apache Arrow™ Momentum

Open Source Big Data in-memory columnar layer adopted by dozens of Open Source and commercial technologies; exceeded 1,000,000 monthly downloads within first three years as an Apache Top-Level Project

Wakefield, MA —19 February 2019— The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 350 Open Source projects and initiatives, today announced momentum with Apache® Arrow™, the Open Source Big Data in-memory columnar layer.

Since the founding of the project in January 2016, Apache Arrow has quickly become the defacto standard for representing and processing analytical data in memory, accelerating analytical processing and interchange by more than 100x.

"When we became a Top-Level Project, we projected that the majority of the world's data will be processed through Arrow within the next decade," said Jacques Nadeau, Vice President of Apache Arrow. "In just three years time, we are proud to see Arrow's substantial industry adoption and increased value across a wide range of analytical, machine learning, and artificial intelligence workloads."

Highlights of Apache Arrow's success include:

Industry Adoption —more than 20 major technologies adopted Arrow to accelerate in-memory analytics, including Apache Spark, NVIDIA RAPIDS, pandas, and Dremio, among others. A list of known Open Source and commercial implementations can be found at https://arrow.apache.org/powered_by/

Millions of Downloads —leveraging and integrating Apache Arrow into many other technologies has bolstered downloads to more than 1,000,000 each month.

New Language Support —as a cross-language development platform, supporting multiple programming languages is paramount. Apache Arrow has grown from supporting one language to eleven different languages today; they include C++, Java, Python, R, C#, Javascript, and Ruby, among others.

Seamless Data Format Support —Arrow supports different data types, both simple and nested, located in arbitrary memory such as regular system RAM, memory-mapped files or on-GPU memory. In addition, it can ingest data from popular storage formats such as Apache Parquet, CSV files, Apache ORC, JSON, and more.

Major Code Donations —Apache Arrow's new features and expanded functionality are due in part to code and component donations that include:
  • C# Library
  • Gandiva LLVM-based Expression Compiler
  • Go Library
  • Javascript Library
  • Plasma Shared Memory Object Store
  • Ruby Libraries (Apache Arrow and Apache Parquet)
  • Rust Libraries (Parquet and DataFusion Query Engine)
Community and Contributor Growth —over the past 12 months, nearly 300 individuals have submitted more than 3,000 contributions that have grown the Apache Arrow code base by 300,000 lines of code. The Arrow community is welcoming approximately 10 new contributors each month.


In January the project announced its most recent release, Apache Arrow 0.12.0, which reflects more than 600 enhancements developed during Q4 2018. The Apache Arrow community is actively working on a number of impactful new initiatives that include solving high performance analytical problems and allowing for more efficient data distribution across entire clusters.

"Apache Arrow's rapid industry adoption and developer community growth supports our original thesis of the importance of a language-independent open standard for columnar data," said Wes McKinney, member of the Apache Arrow Project Management Committee, and creator of Python's pandas project. "Additionally, we are seeing productive collaborations take place not only between programming languages but also between the database systems and data science worlds. We look forward to welcoming more data system developers into our community."

About Apache Arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, C#, Go, Java, JavaScript, MATLAB, Python, R, Ruby, and Rust.

Availability and Oversight
Apache Arrow software is released under the Apache License v2.0 and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. For downloads, documentation, and ways to become involved with Apache Arrow, visit http://arrow.apache.org/

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than 350 leading Open Source projects, including Apache HTTP Server --the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 730 individual Members and 7,000 Committers across six continents successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official global conference series. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Aetna, Alibaba Cloud Computing, Anonymous, ARM, Baidu, Bloomberg, Budget Direct, Capital One, Cerner, Cloudera, Comcast, Facebook, Google, Handshake, Hortonworks, Huawei, IBM, Indeed, Inspur, LeaseWeb, Microsoft, Oath, ODPi, Pineapple Fund, Pivotal, Private Internet Access, Red Hat, Target, Tencent, Union Investment, and Workday. For more information, visit http://apache.org/ and https://twitter.com/TheASF

© The Apache Software Foundation. "Apache", "Arrow", "Apache Arrow", and "ApacheCon" are registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. All other brands and trademarks are the property of their respective owners.

# # #

Wednesday January 23, 2019

The Apache Software Foundation Announces Apache® Hadoop® v3.2.0

Pioneering Open Source distributed enterprise framework powers US$166B Big Data ecosystem

Wakefield, MA —23 January 2019— The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 350 Open Source projects and initiatives, today announced Apache® Hadoop® v3.2.0, the latest version of the Open Source software framework for reliable, scalable, distributed computing.

Now in its 11th year, Apache Hadoop is the foundation of the US$166B Big Data ecosystem (source: IDC) by enabling data applications to run and be managed on large hardware clusters in a distributed computing environment. "Apache Hadoop has been at the center of this big data transformation, providing an ecosystem with tools for businesses to store and process data on a scale that was unheard of several years ago," according to Accenture Technology Labs.

"This latest release unlocks the powerful feature set the Apache Hadoop community has been working on for more than nine months," said Vinod Kumar Vavilapalli, Vice President of Apache Hadoop. "It further diversifies the platform by building on the cloud connector enhancements from Apache Hadoop 3.0.0 and opening it up for deep learning use-cases and long-running apps."

Apache Hadoop 3.2.0 highlights include:
  • ABFS Filesystem connector —supports the latest Azure Datalake Gen2 Storage;
  • Enhanced S3A connector —including better resilience to throttled AWS S3 and DynamoDB IO;
  • Node Attributes Support in YARN —helps to tag multiple labels on the nodes based on its attributes and supports placing the containers based on expression of these labels;
  • Storage Policy Satisfier  —supports HDFS (Hadoop Distributed File System) applications to move the blocks between storage types as they set the storage policies on files/directories; 
  • Hadoop Submarine —enables data engineers to easily develop, train and deploy deep learning models (in TensorFlow) on very same Hadoop YARN cluster;
  • C++ HDFS client —helps to do async IO to HDFS which helps downstream projects such as Apache ORC;
  • Upgrades for long running services —supports in-place seamless upgrades of long running containers via YARN Native Service API (application program interface) and CLI (command-line interface).

"This is one of the biggest releases in Apache Hadoop 3.x line which brings many new features and over 1,000 changes," said Sunil Govindan, Apache Hadoop 3.2.0 release manager. "We are pleased to announce that Apache Hadoop 3.2.0 is available to take your data management requirements to the next level. Thanks to all our contributors who helped to make this release happen."

Apache Hadoop is widely deployed at numerous enterprises and institutions worldwide, such as Adobe, Alibaba, Amazon Web Services, AOL, Apple, Capital One, Cloudera, Cornell University, eBay, ESA Calvalus satellite mission, Facebook, foursquare, Google, Hortonworks, HP, Huawei, Hulu, IBM, Intel, LinkedIn, Microsoft, Netflix, The New York Times, Rackspace, Rakuten, SAP, Tencent, Teradata, Tesla Motors, Twitter, Uber, and Yahoo. The project maintains a list of educational and production users, as well as companies that offer Hadoop-related services at https://wiki.apache.org/hadoop/PoweredBy

Global Knowledge hails, "...the open-source Apache Hadoop platform changes the economics and dynamics of large-scale data analytics due to its scalability, cost effectiveness, flexibility, and built-in fault tolerance. It makes possible the massive parallel computing that today's data analysis requires."

Hadoop is proven at scale: Netflix captures 500+B daily events using Apache Hadoop. Twitter uses Apache Hadoop to handle 5B+ sessions a day in real time. Twitter’s 10,000+ node cluster processes and analyzes more than a zettabyte of raw data through 200B+ tweets per year. Facebook’s cluster of 4,000+ machines that store 300+ petabytes is augmented by 4 new petabytes of data generated each day. Microsoft uses Apache Hadoop YARN to run the internal Cosmos data lake, which operates over hundreds of thousands of nodes and manages billions of containers per day.

Transparency Market Research recently reported that the global Hadoop market is anticipated to rise at a staggering 29% CAGR with a market valuation of US$37.7B by the end of 2023.

Apache Hadoop remains one of the most active projects at the ASF: it ranks #1 for Apache project repositories by code commits, and is the #5 repository by size (3,881,797 lines of code).

"The Apache Hadoop community continues to go from strength to strength in further driving innovation in Big Data," added Vavilapalli. "We hope that developers, operators and users leverage our latest release in fulfilling their data management needs."

Catch Apache Hadoop in action at the Strata conference, 25-28 March 2019 in San Francisco, and dozens of Hadoop MeetUps held around the world, including on 30 January 2019 at LinkedIn in Sunnyvale, California.

Availability and Oversight
Apache Hadoop software is released under the Apache License v2.0 and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. For downloads, documentation, and ways to become involved with Apache Hadoop, visit http://hadoop.apache.org/ and https://twitter.com/hadoop

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than 350 leading Open Source projects, including Apache HTTP Server --the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 730 individual Members and 7,000 Committers across six continents successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official global conference series. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Aetna, Alibaba Cloud Computing, Anonymous, ARM, Baidu, Bloomberg, Budget Direct, Capital One, Cerner, Cloudera, Comcast, Facebook, Google, Handshake, Hortonworks, Huawei, IBM, Indeed, Inspur, LeaseWeb, Microsoft, Oath, ODPi, Pineapple Fund, Pivotal, Private Internet Access, Red Hat, Target, Tencent, and Union Investment. For more information, visit http://apache.org/ and https://twitter.com/TheASF

© The Apache Software Foundation. "Apache", "Hadoop", "Apache Hadoop", and "ApacheCon" are registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. All other brands and trademarks are the property of their respective owners.

# # #

Tuesday January 08, 2019

The Apache Software Foundation Announces Apache® Airflow™ as a Top-Level Project

Open Source Big Data workflow management system in use at Adobe, Airbnb, Etsy, Google, ING, Lyft, PayPal, Reddit, Square, Twitter, and United Airlines, among others.

Wakefield, MA —8 January 2019— The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 350 Open Source projects and initiatives, announced today Apache® Airflow™ as a Top-Level Project (TLP).

Apache Airflow is a flexible, scalable workflow automation and scheduling system for authoring and managing Big Data processing pipelines of hundreds of petabytes. Graduation from the Apache Incubator as a Top-Level Project signifies that the Apache Airflow community and products have been well-governed under the ASF's meritocratic process and principles.

"Since its inception, Apache Airflow has quickly become the de-facto standard for workflow orchestration," said Bolke de Bruin, Vice President of Apache Airflow. "Airflow has gained adoption among developers and data scientists alike thanks to its focus on configuration-as-code. That has gained us a community during incubation at the ASF that not only uses Apache Airflow but also contributes back. This reflects Airflow’s ease of use, scalability, and power of our diverse community; that it is embraced by enterprises and start-ups alike, allows us to now graduate to a Top-Level Project."

Apache Airflow is used to easily orchestrate complex computational workflows. Through smart scheduling, database and dependency management, error handling and logging, Airflow automates resource management, from single servers to large-scale clusters. Written in Python, the project is highly extensible and able to run tasks written in other languages, allowing integration with commonly used architectures and projects such as AWS S3, Docker, Apache Hadoop HDFS, Apache Hive, Kubernetes, MySQL, Postgres, Apache Zeppelin, and more. Airflow originated at Airbnb in 2014 and was submitted to the Apache Incubator March 2016.

Apache Airflow is in use at more than 200 organizations, including Adobe, Airbnb, Astronomer, Etsy, Google, ING, Lyft, NYC City Planning, Paypal, Polidea, Qubole, Quizlet, Reddit, Reply, Solita, Square, Twitter, and United Airlines, among others. A list of known users can be found at https://github.com/apache/incubator-airflow#who-uses-apache-airflow

"Adobe Experience Platform is built on cloud infrastructure leveraging open source technologies such as Apache Spark, Kafka, Hadoop, Storm, and more," said Hitesh Shah, Principal Architect of Adobe Experience Platform. "Apache Airflow is a great new addition to the ecosystem of orchestration engines for Big Data processing pipelines. We have been leveraging Airflow for various use cases in Adobe Experience Cloud and will soon be looking to share the results of our experiments of running Airflow on Kubernetes." 

"Our clients just love Apache Airflow. Airflow has been a part of all our Data pipelines created in past 2 years acting as the ring-master and taming our Machine Learning and ETL Pipelines," said Kaxil Naik, Data Engineer at Data Reply. "It has helped us create a Single View for our client's entire data ecosystem. Airflow's Data-aware scheduling and error-handling helped automate entire report generation process reliably without any human-intervention. It easily integrates with Google Cloud (and other major cloud providers) as well and allows non-technical personnel to use it without a steep learning curve because of Airflow’s configuration-as-a-code paradigm."

"With over 250 PB of data under management, PayPal relies on workflow schedulers such as Apache Airflow to manage its data movement needs reliably," said Sid Anand, Chief Data Engineer at PayPal. "Additionally, Airflow is used for a range of system orchestration needs across many of our distributed systems: needs include self-healing, autoscaling, and reliable [re-]provisioning."

"Since our offering of Apache Airflow as a service in Sept 2016, a lot of big and small enterprises have successfully shifted all of their workflow needs to Airflow," said Sumit Maheshwari, Engineering Manager at Qubole. "At Qubole, not only are we a provider, but also a big consumer of Airflow as well. For example, our whole Insight and Recommendations platform is built around Airflow only, where we process billions of events every month from hundreds of enterprises and generate insights for them on big data solutions like Apache Hadoop, Apache Spark, and Presto. We are very impressed by the simplicity of Airflow and ease at which it can be integrated with other solutions like clouds, monitoring systems or various data sources."

"At ING, we use Apache Airflow to orchestrate our core processes, transforming billions of records from across the globe each day," said Rob Keevil, Data Analytics Platform Lead at ING WB Advanced Analytics. "Its feature set, Open Source heritage and extensibility make it well suited to coordinate the wide variety of batch processes we operate, including ETL workflows, model training, integration scripting, data integrity testing, and alerting. We have played an active role in Airflow development from the onset, having submitted hundreds of pull requests to ensure that the community benefits from the Airflow improvements created at ING.  We are delighted to see Airflow graduate from the Apache Incubator, and look forward to see where this exciting project will be taken in future!"

"We saw immediately the value of Apache Airflow as an orchestrator when we started contributing and using it," said Jarek Potiuk, Principal Software Engineer at Polidea. "Being able to develop and maintain the whole workflow by engineers is usually a challenge when you have a huge configuration to maintain. Airflow allows your DevOps to have a lot of fun and still use the standard coding tools to evolve your infrastructure. This is 'infrastructure as a code' at its best."

"Workflow orchestration is essential to the (big) data era that we live in," added de Bruin. "The field is evolving quite fast and the new data thinking is just starting to make an impact. Apache Airflow is a child of the data era and therefore very well positioned, and is also young so a lot of development can still happen. Airflow can use bright minds from scientific computing, enterprises, and start-ups to further improve it. Join the community, it is easy to hop on!"

Availability and Oversight
Apache Airflow software is released under the Apache License v2.0 and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. For downloads, documentation, and ways to become involved with Apache Airflow, visit http://airflow.apache.org/ and https://twitter.com/ApacheAirflow

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than 350 leading Open Source projects, including Apache HTTP Server --the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 730 individual Members and 7,000 Committers across six continents successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Aetna, Alibaba Cloud Computing, Anonymous, ARM, Baidu, Bloomberg, Budget Direct, Capital One, Cerner, Cloudera, Comcast, Facebook, Google, Handshake, Hortonworks, Huawei, IBM, Indeed, Inspur, LeaseWeb, Microsoft, Oath, ODPi, Pineapple Fund, Pivotal, Private Internet Access, Red Hat, Target, Tencent, and Union Investment. For more information, visit http://apache.org/ and https://twitter.com/TheASF

© The Apache Software Foundation. "Apache", "Airflow", "Apache Airflow", and "ApacheCon" are registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. All other brands and trademarks are the property of their respective owners.

# # #

Monday April 16, 2018

The Apache Software Foundation Announces Apache® Subversion® v1.10.0

Open Source version control system ranked among leaders in $970MM+ market

Wakefield, MA —16 April 2018— The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 350 Open Source projects and initiatives, announced today Apache® Subversion® v1.10.0, the popular centralized software version control system.

With Apache Subversion, files and directories can be edited, copied, deleted, merged, and tagged, with each such operation leaving a permanent entry in the version control system's history record. Files can be locked for exclusive access. The system supports a command-line interface and several third-party graphical interfaces, and can also be scripted in Python, Perl, Ruby, or Java. Subversion is very portable and runs on virtually all general-purpose operating systems in use today. The software versioning and revision control system initiated in 2000, entered the Apache Incubator in 2009, and graduated as an Apache Top-Level Project in 2010.

"Dealing with merge conflicts is one of the most complicated aspects of version control. While merge conflicts in a text file are usually best resolved by editing the file directly, merge conflicts can also occur when the directory tree structure of a project is changed. Resolving such structural conflicts requires users to manipulate entire collections of files and directories at once, which is time-consuming and error-prone," said Stefan Sperling, Vice President of Apache Subversion. "With Subversion 1.10, structural conflicts can now be resolved with support from a built-in interactive conflict resolver which automates conflict resolution tasks users had to perform manually in the past. This is a major usability improvement and saves users who do a lot of merging a significant amount of time."

Apache Subversion 1.10 is the result of more than three years’ development effort, and features:
  • Numerous bug fixes
  • Improved path-based authorization
  • New interactive conflict resolver
  • LZ4 compression support over the wire and backend storage
  • Shelving (experimental)

The full list of new features can be found in the project release notes at https://subversion.apache.org/docs/release-notes/1.10.html

In its new "Version Control Systems Market: Global Industry Analysis" report, Future Market Insights estimates that the version control systems market will exhibit 11.5% CAGR and reach US$971.8MM by 2027. Apache Subversion has been ranked in the "Market Leader" quadrant in G2 Crowd’s "Best Version Control Systems" for 2018. Time tracking and productivity tool producer Time Doctor recommends, "if you want to have a single master source tree that is being worked on by a small core development group, SVN should be the first system you try as it’s reliable and tailored for that."

Millions of users worldwide rely on Apache Subversion (SVN) to safely and easily manage version control across an array of applications.

"As a large Open Source project with many moving parts, and many connections to other projects, the FreeBSD operating system depends on Apache Subversion to be its single source of truth for all version control operations," said members of the FreeBSD Core team. "Subversion has proved to be a reliable, stable, and overall usable system for the project for many years and we appreciate the high quality of the work done in designing and maintaining SVN."  

"Apache Subversion is a reliable and robust centralized version control system that is well suited for enterprises," said Michael Diers, Technical Director of elego Software Solutions GmbH. "Over the years, we have had very little problems with Subversion deployments that we setup and maintain for our customers. Its development community is very competent, friendly and welcoming."

"One of the usually overlooked advantages of Apache Subversion is that it works natively on all modern platforms, including Windows," said Ivan Zhakov, Technical Director of VisualSVN Software Ltd. "Subversion is based on Apache Portable Runtime and does not impose dependencies on Cygwin or its replacements. Nowadays, this advantage is of extreme importance for enterprise users who work in heterogeneous environment in the vast majority of cases."

"Assembla customers rely on Apache Subversion to build leading edge technologies and products in a secure and scalable environment," said Jacek Materna, Assembla CTO. "With more than 4,000 Assembla users running SVN, we are excited to have contributed directly to Apache Subversion 1.10.0 and the improved performance and key features it will bring. LZ4 Compression, improved path-based authorization and shelving are just a few of the updates that represent significant innovation for SVN."

"Over 18 years of its development, Subversion has grown into a very mature and solid version control package supported by a friendly and healthy Open Source project community with long-term productivity and success," added Sperling. "Subversion's community and wider ecosystem is an exemplary example of how the collaborative Open Source development model can work to the benefit of its users, its developers, and its sponsors."

Availability and Oversight
Apache Subversion software is released under the Apache License v2.0 and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. For downloads, documentation, and ways to become involved with Apache Subversion, visit http://subversion.apache.org/

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than 350 leading Open Source projects, including Apache HTTP Server --the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 680 individual Members and 6,500 Committers across six continents successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Aetna, Alibaba Cloud Computing, ARM, Baidu, Bloomberg, Budget Direct, Capital One, Cash Store, Cerner, Cloudera, Comcast, Facebook, Google, Hortonworks, Huawei, IBM, Inspur, iSIGMA, ODPi, LeaseWeb, Microsoft, PhoenixNAP, Pivotal, Private Internet Access, Red Hat, Target, Union Investment, and Yahoo. For more information, visit http://apache.org/ and https://twitter.com/TheASF

© The Apache Software Foundation. "Apache", "Subversion", "Apache Subversion", and "ApacheCon" are registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. All other brands and trademarks are the property of their respective owners.

# # #

Tuesday January 30, 2018

The Apache Software Foundation Announces Apache® Kibble™ as a Top-Level Project

Open Source tools used for collecting, aggregating and visualizing software project activity.

Wakefield, MA —30 January 2018— The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 350 Open Source projects and initiatives, announced today Apache® Kibble™ as a Top-Level Project (TLP).

Apache Kibble is an activity reporting platform created to collect, aggregate, analyze, and visualize activity in software projects and communities. With Kibble, users can track a project's code, discussions, issues, and individuals through detailed views mapped across specified time periods.

"We are passionate about solving hard problems, particularly as they relate to defining and measuring a project's success," said Rich Bowen, Vice President of Apache Kibble. "As doing so is notoriously difficult, we want to provide a set of tools that allow a project to define success, and track their progress towards that success, in terms that make the most sense for their community. Apache Kibble is a way to make this happen."

Apache Kibble is the latest project to enter the ASF directly as a Top-Level Project, bypassing the Apache Incubator (the official entry path for projects and codebases wishing to become part of the efforts at The Apache Software Foundation). As part of its eligibility, Apache Kibble had to meet the many requirements of the Apache Maturity Model http://s.apache.org/O4p that include a project’s code, copyright, licenses, releases, consensus building, independence, and more.

Kibble is the Open Source edition of Snoot, the enterprise project and community reporting platform used by dozens of Apache projects as well as by the ASF for its official reports including the ASF Annual Report.

"By gaining an in-depth view into the ASF's operations through 1,433 Apache project repositories, we are able to obtain performance metrics for more than 300 Apache projects and nearly 900 million code line changes by more than 6,500 contributors," said Sally Khudairi, Vice President of Marketing and Publicity at The Apache Software Foundation. "We are excited to share the ability to provide insight with projects of all kinds, and help their communities identify trends and advance their impact."

"We're getting input and data from both a wide range of Apache projects as well as projects from outside of the foundation," added Bowen. "We're also collecting historical metrics from older projects with their rich history of successes and mistakes. They have a great deal of history and passion around measuring their communities, and hearing from disparate projects is helping to refine that vision. We would love to hear from more projects about what metrics are important to track, and invite their communities to join our mailing lists to discuss how we can help one another."

Catch Apache Kibble in action at FOSDEM, 3-4 February 2018 in Brussels.

Availability and Oversight
Apache Kibble software is released under the Apache License v2.0 and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. For downloads, documentation, and ways to become involved with Apache Kibble, visit http://kibble.apache.org/ and https://twitter.com/ApacheKibble

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than 350 leading Open Source projects, including Apache HTTP Server --the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 680 individual Members and 6,500 Committers across six continents successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Aetna, Alibaba Cloud Computing, ARM, Baidu, Bloomberg, Budget Direct, Capital One, Cash Store, Cerner, Cloudera, Comcast, Facebook, Google, Hortonworks, Huawei, IBM, Inspur, iSIGMA, ODPi, LeaseWeb, Microsoft, PhoenixNAP, Pivotal, Private Internet Access, Red Hat, Target, Union Investment, and Yahoo. For more information, visit http://apache.org/ and https://twitter.com/TheASF

© The Apache Software Foundation. "Apache", "Kibble", "Apache Kibble", and "ApacheCon" are registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. All other brands and trademarks are the property of their respective owners.

# # #

Thursday December 14, 2017

The Apache Software Foundation Announces Apache® Hadoop® v3.0.0 General Availability

Ubiquitous Open Source enterprise framework maintains decade-long leading role in $100B annual Big Data market

Forest Hill, MD —14 December 2017— The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 350 Open Source projects and initiatives, today announced Apache® Hadoop® v3.0.0, the latest version of the Open Source software framework for reliable, scalable, distributed computing.

Over the past decade, Apache Hadoop has become ubiquitous within the greater Big Data ecosystem by enabling firms to run and manage data applications on large hardware clusters in a distributed computing environment.

"This latest release unlocks several years of development from the Apache community," said Chris Douglas, Vice President of Apache Hadoop. "The platform continues to evolve with hardware trends and to accommodate new workloads beyond batch analytics, particularly real-time queries and long-running services. At the same time, our Open Source contributors have adapted Apache Hadoop to a wide range of deployment environments, including the Cloud."

"Hadoop 3 is a major milestone for the project, and our biggest release ever," said Andrew Wang, Apache Hadoop 3 release manager. "It represents the combined efforts of hundreds of contributors over the five years since Hadoop 2. I'm looking forward to how our users will benefit from new features in the release that improve the efficiency, scalability, and reliability of the platform."

Apache Hadoop 3.0.0 highlights include:
  • HDFS erasure coding —halves the storage cost of HDFS while also improving data durability;
  • YARN Timeline Service v.2 (preview) —improves the scalability, reliability, and usability of the Timeline Service;
  • YARN resource types —enables scheduling of additional resources, such as disks and GPUs, for better integration with machine learning and container workloads;
  • Federation of YARN and HDFS subclusters transparently scales Hadoop to tens of thousands of machines;
  • Opportunistic container execution improves resource utilization and increases task throughput for short-lived containers. In addition to its traditional, central scheduler, YARN also supports distributed scheduling of opportunistic containers; and 
  • Improved capabilities and performance improvements for cloud storage systems such as Amazon S3 (S3Guard), Microsoft Azure Data Lake, and Aliyun Object Storage System.

Hadoop 3.0.0 has already undergone extensive testing and integration with the broader Open Source ecosystem at The Apache Software Foundation. With this release, its community of developers and users promote this release series out of beta.

Apache Hadoop is widely deployed at numerous enterprises and institutions worldwide, such as Adobe, Alibaba, Amazon Web Services, AOL, Apple, Capital One, Cloudera, Cornell University, eBay, ESA Calvalus satellite mission, Facebook, foursquare, Google, Hortonworks, HP, Hulu, IBM, Intel, LinkedIn, Microsoft, Netflix, The New York Times, Rackspace, Rakuten, SAP, Tencent, Teradata, Tesla Motors, Twitter, Uber, and Yahoo. The project maintains a list of known users at https://wiki.apache.org/hadoop/PoweredBy

"It's tremendous to see this significant progress, from the raw tool of eleven years ago, to the mature software in today's release," said Doug Cutting, original co-creator of Apache Hadoop. "With this milestone, Hadoop better meets the requirements of its growing role in enterprise data systems.  The Open Source community continues to respond to industrial demands."

Apache Hadoop's diverse community enjoys continued growth amongst the ASF's most active projects, and remains at the forefront of more than three dozen Apache Big Data projects.

Apache Hadoop committer history

Apache Hadoop has received countless awards, including top prizes at the Media Guardian Innovation Awards and Duke's Choice Awards, and has been hailed by industry analysts:

"...the lifeblood of organizational analytics…" —Gartner

"Hadoop Is Here To Stay" —Forrester

"...today Hadoop is the only cost-sensible and scalable open source alternative to commercially available Big Data management packages. It also becomes an integral part of almost any commercially available Big Data solution and de-facto industry standard for business intelligence (BI)." —MarketAnalysis.com/Market Research Media

"...commanding half of big data’s $100 billion annual market value...Hadoop is the go-to big data framework." —BigDataWeek.com

"Hadoop, and its associated tools, is currently the 'big beast' of the big data world and the Hadoop environment is undergoing rapid development..." —Bloor Research


"The opportunity to effect meaningful, even fundamental change in the Apache Hadoop project remains open," added Douglas. "Our new contributors uprooted the project from its historical strength in Web-scale analytics by introducing powerful, proven abstractions for data management, security, containerization, and isolation. Apache Hadoop drives innovation in Big Data by growing its community. We hope this latest release continues to draw developers, operators, and users to the ASF."

Catch Apache Hadoop in action at the Strata Data Conference in San Jose, CA, 5-8 March 2018, and at dozens of Hadoop Meetups held around the world.

Availability and Oversight
Apache Hadoop software is released under the Apache License v2.0 and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. For downloads, documentation, and ways to become involved with Apache Hadoop, visit http://hadoop.apache.org/

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than 350 leading Open Source projects, including Apache HTTP Server —the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 680 individual Members and 6,300 Committers successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Alibaba Cloud Computing, ARM, Bloomberg, Budget Direct, Capital One, Cash Store, Cerner, Cloudera, Comcast, Facebook, Google, Hortonworks, Huawei, IBM, Inspur, iSIGMA, ODPi, LeaseWeb, Microsoft, PhoenixNAP, Pivotal, Private Internet Access, Red Hat, Serenata Flowers, Target, Union Investment, WANdisco, and Yahoo. For more information, visit http://www.apache.org/ and https://twitter.com/TheASF

© The Apache Software Foundation. "Apache", "Hadoop", "Apache Hadoop", and "ApacheCon" are registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. All other brands and trademarks are the property of their respective owners.

# # #

Tuesday October 31, 2017

The Apache Software Foundation Announces Apache® Juneau™ as a Top-Level Project

Open Source framework for quickly and easily creating Java-based REST microservices and APIs in use at IBM, The Open Group, and Salesforce, among others.

Forest Hill, MD –31 October 2017– The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 350 Open Source projects and initiatives, announced today that Apache® Juneau™ has graduated from the Apache Incubator to become a Top-Level Project (TLP), signifying that the project's community and products have been well-governed under the ASF's meritocratic process and principles.

Apache Juneau is a cohesive framework that allows developers to marshal POJOs (Plain Old Java Objects) and develop REST (Representational State Transfer) microservices and APIs. Marshalling is used to transform an object’s memory representation to a data format suitable for moving between different parts of a computer program (or across programs), and to simplify communications to remote objects with an object.

"We've worked hard on making the Apache Juneau code as simple and easy to use as possible," said James Bognar, Vice President of Apache Juneau. "We packed Juneau with rich features and functionality, and have successfully directed our efforts on building a diverse community that will help drive the project’s future. We’re very proud to graduate as an Apache Top-Level Project."

Apache Juneau consists of:

  1. A universal toolkit for marshalling POJOs to a wide variety of content types using a common cohesive framework;
  2. A universal REST server API for creating self-documenting REST interfaces using POJOs, simply deployed as one or more top-level servlets in any Servlet 3.1.0+ container;
  3. A universal REST client API for interacting with Juneau or 3rd-party REST interfaces using POJOs and proxy interfaces; and
  4. A REST microservice API that combines all the features above with a simple configurable Jetty server for creating lightweight standalone REST interfaces that start up in milliseconds.


Apache Juneau is in use at IBM, The Open Group, and Salesforce, among others. The Apache Streams project began incorporating Apache Juneau libraries in late 2016.

"Removing Dropwizard and Jackson in favor of Apache Juneau simplified our dependency tree, increased the performance of our APIs, and added several features, especially HTML rendering, that have been a huge hit," said Steve Blackmon, Vice President of Apache Streams. "An on-going collaboration between our projects continues to expand the capabilities of Juneau's Remoteable library. As Apache Streams adds additional data provider Java SDKs powered by Juneau, the variety of HTTP interfaces that can be modeled and integrated with Juneau has expanded."

"We were able to replace existing home-grown REST interfaces on top of EMF objects with ones based on Apache Juneau and dramatically reduced the size of our codebase," said Craig Chaney, former Jazz Repository team lead at IBM. "We also used it as the basis for our Docker-based microservices in our CLM-as-a-Service offering."

"I have used Apache Juneau on projects where I need to work with Web Services," said David Goddard, Executive IT Specialist at IBM. "Juneau has saved us many development hours, enabling me to easily consume third-party REST APIs and construct my own Web Services far more quickly than I would otherwise be able to. Juneau also aids the development of robust, maintainable applications with clear logical code structure."

"When The Apache Software Foundation moved the JSON.org license to Category X, successors for JSON processing were needed," said John D. Ament, Vice President of the Apache Incubator, and Apache Juneau incubation mentor. "Apache Juneau was identified as a clean solution. It provides an easy to use API, great performance and a large number of features that made it a strong recommendation for others to leverage."

"As Apache Juneau grows, we welcome new contributors to join the project and take an active role in its development," added Bognar. "Whether reviewing user code, helping with feedback, or contributing code changes through the mailing list, we look forward to learning more about usage patterns to further improve the product."

Meet members of the Apache Juneau community at the Salesforce Dreamforce 2017 conference 6-9 November 2017 in San Francisco.

Availability and Oversight
Apache Juneau software is released under the Apache License v2.0 and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. For downloads, documentation, and ways to become involved with Apache Juneau, visit http://juneau.apache.org/ and https://twitter.com/ApacheJuneau

About the Apache Incubator
The Apache Incubator is the entry path for projects and codebases wishing to become part of the efforts at The Apache Software Foundation. All code donations from external organizations and existing external projects wishing to join the ASF enter through the Incubator to: 1) ensure all donations are in accordance with the ASF legal standards; and 2) develop new communities that adhere to our guiding principles. Incubation is required of all newly accepted projects until a further review indicates that the infrastructure, communications, and decision making process have stabilized in a manner consistent with other successful ASF projects. While incubation status is not necessarily a reflection of the completeness or stability of the code, it does indicate that the project has yet to be fully endorsed by the ASF. For more information, visit http://incubator.apache.org/

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than 350 leading Open Source projects, including Apache HTTP Server --the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 680 individual Members and 6,300 Committers across six continents successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Alibaba Cloud Computing, ARM, Bloomberg, Budget Direct, Capital One, Cash Store, Cerner, Cloudera, Comcast, Facebook, Google, Hewlett Packard, Hortonworks, Huawei, IBM, Inspur, iSIGMA, ODPi, LeaseWeb, Microsoft, PhoenixNAP, Pivotal, Private Internet Access, Red Hat, Serenata Flowers, Target, WANdisco, and Yahoo. For more information, visit http://apache.org/ and https://twitter.com/TheASF

© The Apache Software Foundation. "Apache", "Juneau", "Apache Juneau", "Streams", "Apache Streams", and "ApacheCon" are registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. All other brands and trademarks are the property of their respective owners.

# # #

Tuesday October 24, 2017

The Apache Software Foundation Announces Apache® PredictionIO™ as a Top-Level Project

Open Source Machine Learning server used to manage and deploy production-ready predictive services at ActionML, BizReach, LiftIQ, Pluralsight, and Salesforce, among others.

Forest Hill, MD –24 October 2017– The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 350 Open Source projects and initiatives, announced today that Apache® PredictionIO™ has graduated from the Apache Incubator to become a Top-Level Project (TLP), signifying that the project's community and products have been well-governed under the ASF's meritocratic process and principles.

Apache PredictionIO is an Open Source Machine Learning Server that enables developers to manage and deploy production-ready predictive services for various kinds of Machine Learning tasks. 

"PredictionIO was started with the goal of democratizing Machine Learning, by providing a high-degree of customization through templates, using an integrated stack of proven technologies provided by other Apache and Open Source projects," said Donald Szeto, Vice President of Apache PredictionIO and Principal Data Engineer for Einstein at Salesforce. "It has been inspiring to see the project going through incubation, with a growing user and developer community who provided invaluable feedback and contribution. We are excited about our graduation, and look forward to continuing the project's goal with the help from the community."

Apache PredictionIO focuses on enabling developers to quickly develop and deploy production-ready Machine Learning pipelines. The project features an engine template gallery, where developers can pick a template, and quickly ramp up a complete setup for their Machine Learning use cases. Each template in the gallery is designed for a specific Machine Learning scenario.

Apache PredictionIO is in use at ActionML, BizReach, LiftIQ, Pluralsight, and Salesforce, among others.

"We are very interested in PredictionIO for solving any Machine Learning tasks," said Shinsuke Sugaya, Chief Scientist at BizReach, Inc. "At BizReach, using PredictionIO, we have built a data-analysis platform for HR, which fits learning models from about 5 million job descriptions and recommends preferred items from them to users everyday. PredictionIO has accelerated our analysis and development tasks for data scientists and developers, and simplified infrastructure from data management to prediction server."

"It was indeed an honor to be asked to mentor PredictionIO through its successful graduation out of the Apache Incubator," said Suneel Marthi, ASF Member and Apache PredictionIO Incubation Mentor. "Apache PredictionIO is the platform that fills the gap between academic research and productionizing Machine Learning-as-a-Service. As a long-time practitioner of Machine Learning involving large scale analytics, and Apache Mahout project committer for many years, I've enjoyed working with PredictionIO team, and can see myself coming back to this community for help with questions when using PredictionIO on the job."

"I'm excited to see Apache PredictionIO begin to gain the recognition it has truly earned," said Cody Kimball, Machine Learning Engineer at Pluralsight. "I was fascinated with the growing field of Machine Learning, but had no idea how to get started given my limited development experience. I had the opportunity at work to spearhead some marketing-related Machine Learning efforts, with a 9-month plan to get a working POC up and running. After only 12 weeks, using PredictionIO, I was able to build a fully functioning recommendation engine on our externally-facing Website. We soon saw a 29% increase in forms being filled out, which resulted in a 29% increase in new qualified sales leads, and projected $1,333 increase in MRR. We rolled out this POC test to just 10% of the Web traffic, with much more areas to improve on. This has opened up so many opportunities that never would have been possible had it not been for the availability and reliability of the PredictionIO platform!"

"Apache PredictionIO is a strategic platform that Data Scientists around the globe should learn to master!” said Shane Johnson, Founder and CEO at LiftIQ. “Our team of developers use PredictionIO at the core of our product architecture, and to power our Lift Intelligence Platform (LiftIQ, an app on Salesforce App Exchange). We have been super impressed with the flexibility of the framework: PredictionIO is built on a solid, progressive foundation and cuts Machine Learning development time in half. It allows developers to stay focused on tuning models and integrating Machine Learning with existing apps. The contributors and community are extremely active and helpful. We have had multiple challenges along our path to proving out our product. Each time we have reached out, we received responses from the community within minutes. Thank you PredictionIO team and community and congratulations on becoming an Apache Top-Level Project!"

"ActionML has been obsessed with Machine Learning for years. Some of us have been committers to Apache Mahout, for instance. Apache PredictionIO proved the missing link in putting ML into production for our more demanding clients, several of which are Fortune 500 companies," said Pat Ferrel, Chief Consultant at ActionML. "PredictionIO plays a key part in our story of 'Success at Apache' https://s.apache.org/l9OO "

"Salesforce is committed to making machine learning more accessible and empowering business users from companies of all industries and sizes to work smarter and be more productive. After donating PredictionIO's Open Source code to ASF, we've seen collaboration from several of our teams, as well as customers, ISVs and a wider community,” said Simon Chan, Senior Director, Product Management, Einstein. "Apache PredictionIO reaching Top-Level Project status will unlock the power of AI for companies large and small, empowering them to combine machine learning with their CRM to deliver smarter, more productive customer experiences."

"We welcome anyone who is passionate about our mission of bringing Machine Learning to the masses to join our effort," added Szeto. "Any feedback or contribution is invaluable to the project. Join the discussion on our user and development mailing lists."

Catch Apache PredictionIO in action at the Salesforce Dreamforce 2017 conference 6-9 November 2017 in San Francisco.

Availability and Oversight
Apache PredictionIO software is released under the Apache License v2.0 and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. For downloads, documentation, and ways to become involved with Apache PredictionIO, visit http://predictionio.apache.org/ and https://twitter.com/PredictionIO

About the Apache Incubator
The Apache Incubator is the entry path for projects and codebases wishing to become part of the efforts at The Apache Software Foundation. All code donations from external organizations and existing external projects wishing to join the ASF enter through the Incubator to: 1) ensure all donations are in accordance with the ASF legal standards; and 2) develop new communities that adhere to our guiding principles. Incubation is required of all newly accepted projects until a further review indicates that the infrastructure, communications, and decision making process have stabilized in a manner consistent with other successful ASF projects. While incubation status is not necessarily a reflection of the completeness or stability of the code, it does indicate that the project has yet to be fully endorsed by the ASF. For more information, visit http://incubator.apache.org/

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than 350 leading Open Source projects, including Apache HTTP Server --the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 680 individual Members and 6,300 Committers across six continents successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Alibaba Cloud Computing, ARM, Bloomberg, Budget Direct, Capital One, Cash Store, Cerner, Cloudera, Comcast, Facebook, Google, Hewlett Packard, Hortonworks, Huawei, IBM, Inspur, iSIGMA, ODPi, LeaseWeb, Microsoft, PhoenixNAP, Pivotal, Private Internet Access, Red Hat, Serenata Flowers, Target, WANdisco, and Yahoo. For more information, visit http://apache.org/ and https://twitter.com/TheASF

© The Apache Software Foundation. "Apache", "Mahout", "Apache Mahout", "PredictionIO", "Apache PredictionIO", and "ApacheCon" are registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. All other brands and trademarks are the property of their respective owners.

# # #

Thursday October 19, 2017

The Apache Software Foundation Announces Five Years of Apache® OpenOffice™ as a Top-Level Project

Latest, secure version of leading Open Source office application and personal productivity suite for Windows, Linux, and Mac now available in 41 languages.

Forest Hill, MD —19 October 2017— The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 350 Open Source projects and initiatives, announced today the five-year anniversary of Apache® OpenOffice™, the leading Open Source office document productivity suite.

"OpenOffice has been downloaded by millions of users since becoming an Apache project five years ago," said Marcus Lange, Vice President of Apache OpenOffice. "We are extremely proud of our community of loyal users and developers who are committed to the future of OpenOffice. We are inspired by their encouragement and thank them by making the next version of the world's leading Open Source productivity suite even better."

With more than 225 million downloads, Apache OpenOffice includes the following applications:
  1. "Writer" - a word processor;
  2. "Calc" - a spreadsheet tool;
  3. "Impress" - a presentation editor;
  4. "Draw" - a vector graphics editor; 
  5. "Math" - a mathematical formula editor; and 
  6. "Base" - a database management program. 

Apache OpenOffice is available in 41 languages on Windows, macOS and Linux.

In celebration of OpenOffice's triple anniversary this month —17 years as an Open Source project, 6 years at the ASF, and 5 years as an ASF Top-Level Project— the Apache OpenOffice Project Management Committee also announced the immediate availability of Apache OpenOffice 4.1.4, which reflects changes that include:
  • Several updates for language dictionaries
  • Some translation fixes in the UI
  • Bug fixes
  • Security improvements
  • Updated graphics/logos (new Apache feather)
  • Enhancements to the build tools (for developers)

The complete list of changes and new features is available at https://s.apache.org/AOO-414changes ; users are encouraged to download the official version from https://www.openoffice.org/download/

Apache OpenOffice is used by millions of organizations, institutions, and individuals around the world. OpenOffice also plays an integral role in many governments, in response to their mandates to use files in the ISO/IEC standard Open Document Format (ODF). OpenOffice supports localized versions in more than 120 languages (those that are 100% translated and maintained are officially released).

As with all Apache projects, Apache OpenOffice is available as a free download to all users at no cost, charge, or fees of any kind. OpenOffice is Open Source software: its C++ source code is readily available for anyone who wishes to enhance the applications.

Availability and Oversight
Apache OpenOffice software is released under the Apache License v2.0 and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. For project data, documentation, and more information on Apache OpenOffice, visit https://openoffice.apache.org/

Download
The project strongly recommends that users download OpenOffice only from the official site https://www.openoffice.org/download/ to ensure that they receive the original software in the correct and most recent version. The project also recommends users review the Release Notes https://s.apache.org/AOO-414releasenotes for important updates and remarks concerning any known issues with this version and their workarounds.

Get Involved!
Apache OpenOffice welcomes contributions and community participation through mailing lists as well as attending face-to-face MeetUps, developer trainings, and user events. Those wishing to get involved in the project can find out more at https://openoffice.apache.org/get-involved.html

About Apache OpenOffice
Originally created as "StarOffice" by StarDivision and after further expansion as an Open Source product under the name "OpenOffice.org" at Sun Microsystems, the project continued development after Oracle Corporation acquired Sun Microsystems in 2010. OpenOffice entered the Apache Incubator in 2011 and graduated as an Apache Top-level Project in October 2012. 9 releases have been made under the auspices of the ASF, with more than 225 million downloads recorded to date. Visit https://openoffice.apache.org/ and https://twitter.com/ApacheOO for more information.

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than 350 leading Open Source projects, including Apache HTTP Server -- the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 680 individual Members and 6,300 Committers successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Alibaba Cloud Computing, ARM, Bloomberg, Budget Direct, Capital One, Cash Store, Cerner, Cloudera, Comcast, Confluent, Facebook, Google, Hortonworks, HP, Huawei, IBM, InMotion Hosting, iSigma, LeaseWeb, Microsoft, ODPi, PhoenixNAP, Pivotal, Private Internet Access, Produban, Red Hat, Serenata Flowers, Target, WANdisco, and Yahoo. For more information, visit http://www.apache.org/ and https://twitter.com/TheASF
© The Apache Software Foundation. "Apache", "OpenOffice", "Apache OpenOffice", and "ApacheCon" are registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. All other brands and trademarks are the property of their respective owners.

# # #

Thursday October 12, 2017

Apache Is Open.

"The Apache Software Foundation is a cornerstone of the modern open source software ecosystem – supporting some of the most widely used and important software solutions powering today's Internet economy."
— Mark Driver, Research Vice President, Gartner

Lauded among the most successful influencers in Open Source, The Apache Software Foundation's commitment to collaborative development has long served as a model for producing consistently high quality software that advances the future of open development. Apache projects power half the Internet, manage exabytes of data, execute teraflops of operations, and store billions of objects in virtually every industry. Apache software projects are an integral part of nearly every end-user computing device, from laptops to tablets to phones.


Open Source.
One of the greatest disruptors to enterprise software, Open Source solutions provide many benefits, including:
  • Lowered costs
  • Higher quality software
  • Freedom from vendor lock-in and proprietary solutions

Open Development.
Organizations of all sizes that embrace open development methodologies benefit from improved speed of development and gain business advantage through:
  • Reduced investment in re-architecting applications
  • Active community support
  • Access to common federation on the leading edge of technology

Enter Apache.

In 1995, eight individuals produced the first public release of a new server software named "Apache", and called themselves the "Apache Group". 22 years after its inception, the Apache HTTP Web Server remains the most popular Web server on the planet.


Incorporation of the ASF.
In 1999, the Apache Group formed The Apache Software Foundation (ASF) with the mission of providing software for the public good. 

  • Membership-based, US 501(c)(3) not-for-profit corporation
  • Ensures Apache projects continue to exist beyond the participation of individual volunteers
  • Establishes role as an Open Source incubator to foster new technologies


Since its inception, the ASF has long been recognized as a leading source for Open Source software that meets the demand for mission-critical, enterprise-grade interoperable, adaptable, and sustainable solutions. 

Open Leadership.

"The Apache Software Foundation has set the standard for modern application and infrastructure software as well as the open source collaborative processes through which it is developed."
— Matt Aslett, Research Director, 451 Research


Today the ASF develops, stewards, and incubates more than 350 Open Source projects and initiatives through its leadership, robust community, and meritocratic process known as the "Apache Way".
  • "Flat" organization: Apache projects and their communities drive development
  • Project development and leadership driven entirely by individual volunteers
  • Provides organizational, legal, and financial support

Open To All.
All Apache software —project downloads, documentation, updates, patches, and more— can be downloaded and used entirely free of any license fees or charge of any kind.
  • Can be used by anyone for any purpose
  • Free of restrictions on installation or deployment
  • Distributed under the flexible, business-friendly Apache License 2.0

Open Participation.
Code for all Apache projects is written by more than 6,000 volunteer individuals and employees of corporations across six continents and contributed to the ASF at no cost. The ASF is governed by the community it most directly serves —the people collaborating within its projects. The ASF's meritocratic processes serve as best practices widely embraced by organizations and individuals alike.
  • Contributions include code, patches, and documentation
  • Select contributors earn "Committer" status, enabling them to commit/write directly to the code repository, vote on community-related decisions, and propose active users for Committership
  • Committers who demonstrate merit in the Foundation's growth, evolution, and progress may be nominated for ASF Membership by existing members

Open Community.
ASF Community Development helps newcomers learn about Apache projects, governance, and activities, and provides guidance on becoming part of the meritocratic, all-volunteer Apache community.
  • "Community Over Code" is the cornerstone of the Foundation's core tenets
  • The ASF has served as a Google Summer of Code mentoring organization each year the since the program's creation in 2005
  • More than 6,300 Apache Committers help grow and maintain the health of the Apache community

Open Project Oversight.
The ASF does not lead the technical direction of Apache projects, but rather provides operational support for projects to self-govern. All Apache projects are overseen by a self-selected team of active contributors.
  • Apache Project Management Committees (PMCs) guide day-to-day operations, including community development and product releases
  • The ASF Board appoints a Vice President to serve as Chair of the PMC
  • Vice President/PMC Chair role is administrative, and carries no additional weight or influence on a project (one vote on project matters just like other PMC members)

Open Innovation.
All code donations, established projects, and communities intending to become fully-fledged Apache projects do so through the Apache Incubator. To graduate as an Apache Top-Level Project, candidate podlings must meet the Apache Maturity Model's rigorous requirements for code integrity, copyright, licenses, releases, consensus building, and independence, among others.
  • 187 Project Management Committees oversee 312 Apache projects
  • 54 new podlings undergoing development in the Apache Incubator
  • Recognized leadership across numerous categories, such as Big Data, libraries, servers and more

Open Communication.
All official communications at the ASF are conducted via mailing lists. Asynchronous communications are required to accommodate geographically-distributed groups across time zones, as is the case for nearly all Apache communities.
  • "If it didn't happen on-list, it didn't happen."
  • Built upon the transparency-oriented culture of the Apache Group, whose collaboration took place on email lists
  • Since the ASF's founding, 340,000+ authors wrote 17.5M+ emails on 7.5M topics, which are archived on 1,247 Apache publicly-accessible mailing lists

Open Opportunity.

"... unlike other open source organizations, the strength of the ASF is its independence from corporate interests … this independence has created a safe haven for a burgeoning open source developer population."
— Matt Asay, InfoWorld

Apache projects must be governed independently of commercial influence. As a vendor-neutral, not-for-profit organization, the ASF and all Apache projects do not take sides, nor endorse or support any particular vendor over other vendors.
  • The ASF does not discourage the development of "competing" products
  • Third parties are free to pursue almost any for-profit or not-for-profit business model based on Apache projects
  • The commercially-friendly and permissive Apache License v2 has become an industry standard within the Open Source world

Continuing Growth.
The ASF has scaled more than 35,000% over 18 years with very limited resources. The ASF is responsible for millions of lines of code by countless contributors across the Open Source landscape: each day millions of people across the globe access the ASF's two dozen servers and 75 distinct hosts.
  • The ASF has grown from an inaugural membership of 21 individuals to 680 individual Members and 6,300 Committers
  • The ASF oversees 150M+ lines of code (valued at US$7B+), developed over 65,000 person-years, with an average of 18,000 Apache code commits each month
  • Nearly 300 new code contributors and 300-400 new people file issues each month

Apache Committers have the responsibility to the collective community to help create a product that will outlive the interest of any particular volunteer, and that the code committed should be clear enough that others not involved in its current development will be able to maintain and extend it.

How You Can Help.

The ASF is funded through tax-deductible contributions from corporations, foundations, and private individuals. You can help the greater Apache community by contributions in the form of:

  • Code and documentation for Apache Projects
  • Funds —become a Sponsor or Individual donor
  • Corporate matching gift program —increase your donation with your employer’s support

Approximately 75% of the ASF's US$1.5MM annual budget is dedicated to running critical infrastructure support services, including bandwidth, connectivity, servers, and hardware: the ASF Infrastructure team keep Apache services running 24x7x365 at near 100% uptime on an annual budget of less than US$5,000 per project. Donations to the ASF also helps offset day-to-day operating expenses such as legal and accounting services, brand management and public relations, general office expenditures, and support staff.

Join the hundreds of donors who have helped support the ASF this year. Every dollar counts! http://apache.org/foundation/contributing.html



# # #

Thursday June 29, 2017

The Apache® Software Foundation Announces Annual Report for 2017 Fiscal Year

Apache's community-led projects bring billions in value to users, developers, and critical applications; organization poised to foster continued growth

Forest Hill, MD —29 June 2017— The Apache® Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 350 Open Source projects and initiatives, announced today the availability of the annual report for its 2017 fiscal year, which ended 30 April 2017. 

Now in its 18th year, the ASF's operational highlights include:
  1. 35M page views per week across apache.org;
  2. Web requests received from every Internet-connected country on the planet;
  3. ASF's 100M functional lines of code (out of 150M+ total) have been developed over 65,000 person years, valued at US$7B;
  4. 65+M lines of code committed over the past year;
  5. 20% of all Apache lines of code are comments (nearly 3x the entire Linux codebase);
  6. 3,300 Apache code Committers made 214,398 commits;
  7. More than half of all contributions made by individuals new to Apache;
  8. Nearly 300 new code contributors and 300-400 new people filing issues each month;
  9. 25,154 authors sent 2,105,992 emails on 834,045 topics over the past year;
  10. Approximately 9M source code downloads served from Apache mirrors on a yearly basis (excluding convenience binaries);
  11. 64 new individual ASF Members elected, bringing the total to 684;
  12. Exceeded 6,000 code Committers;
  13. Remains all-volunteer for all code/development-related activities;
  14. 182 Top-Level communities overseeing 300+ Apache projects and sub-projects;
  15. Dozens of Apache projects continue to dominate the enterprise Big Data ecosystem
  16. Record 64 "podlings" undergoing incubation during FY2017 (59 at end of FY2017);
  17. New innovations include IoT, Microfinance, Machine Learning, and Cryptography;
  18. 22nd anniversary of the Apache HTTP Server (18 years under the ASF umbrella);
  19. Apache OpenOfficeTM exceeded 200M downloads (value to users $25M+ per day);
  20. Apache GroovyTM downloaded 12M times during the first 4 months of 2017;
  21. 976 Individual Contributor License Agreements (CLAs) signed;
  22. 42 Corporate Contributor License Agreements signed (totalling 384);
  23. 30 Software Grant Agreements signed;
  24. Apache License remains one of the most popular Open Source licenses;
  25. Apache Infrastructure services running 24x7x365 at near 100% uptime on an annual budget of less than US$5,000 per project;
  26. New "Gitbox" service launched to allow communities to host their read/write Git repositories on GitHub;
  27. Continued reduction in Infrastructure costs by moving select services to the Cloud;
  28. Improved buildbot and Jenkins build farms for continuous integration, testing, and automated Website generation;
  29. Increased trademarks, brand management, and legal support for dozens of projects;
  30. Launched new Apache Community survey, newsletter, and social media resources;
  31. ASF serves as a mentoring organization in Google Summer of Code for 12th consecutive year;
  32. Participated in hundreds of events globally, including ApacheCon North America and Europe;
  33. Increased corporate backing, with 51 ASF Sponsors and 11 Infrastructure partners;
  34. Improved individual giving program and outreach tactics;
  35. Completed first-time budgetary planning with five-year projections;
  36. FY17 ended with a 15.3-month cash reserve (more than double the industry average);
  37. ASF operational areas (Brand Management, Fundraising, Marketing and Publicity, Infrastructure, Conferences, and Travel Assistance) now supported by professional staff overseen by appointed ASF Members;
  38. FY17 tapped as an investment year, with resources committed to core Infrastructure and Marketing & Publicity services.

The full report is available online at https://s.apache.org/FY2017AnnualReport

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than 350 leading Open Source projects, including Apache HTTP Server --the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 680 individual Members and 6,000 Committers successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Alibaba Cloud Computing, ARM, Bloomberg, Budget Direct, Capital One, Cash Store, Cerner, Cloudera, Comcast, Confluent, Facebook, Google, Hortonworks, HP, Huawei, IBM, InMotion Hosting, iSigma, LeaseWeb, Microsoft, ODPi, PhoenixNAP, Pivotal, Private Internet Access, Produban, Red Hat, Serenata Flowers, Target, WANdisco, and Yahoo. For more information, visit https://www.apache.org/ and https://twitter.com/TheASF

# # #

© The Apache Software Foundation. "Apache", "Apache Groovy", "Groovy", "Apache HTTP Server", "Apache OpenOffice", "OpenOffice", and "ApacheCon", are registered trademarks or trademarks of The Apache Software Foundation. All other brands and trademarks are the property of their respective owners.

Calendar

Search

Hot Blogs (today's hits)

Tag Cloud

Categories

Feeds

Links

Navigation