Entries tagged [apache]

Friday Mar 13, 2015

The Apache News Round-up: week ending 13 March 2015

It's certainly been a productive week! Here's what's happened within the Apache community:

ApacheCon™ –the official conference series of The Apache Software Foundation.
 - Special Registration Rates for ApacheCon North America available through 21 March http://apachecon.com/

ASF Infrastructure –our distributed team on four continents keeps the ASF's infrastructure running around the clock.
 - Whilst currently undergoing maintenance, uptime over the past week was a stellar 99.987% http://status.apache.org/

ASF Community Development –our team of volunteers that help guide newcomers to The Apache Software Foundation, provide insight and advice on The Apache Way of meritocratic development, including how to contribute to our many projects and to the Apache community at-large.
 - The ASF has been accepted as a Google Summer of Code mentoring organization for the 11th consecutive year http://community.apache.org/gsoc.html

Apache Incbuator™ –the entry path into The Apache Software Foundation (ASF) for projects and codebases wishing to become part of the Foundation's efforts. All code donations from external organisations and existing external projects wishing to join Apache enter through the Incubator.
 - AsterixDB, CommonsRDF, Myriad, and SAMOA are new podlings http://incubator.apache.org/projects/index.html
 - Groovy Submitted to Become a Project at The Apache Software Foundation http://s.apache.org/zj7

Apache Hive™ –data warehouse software facilitates querying and managing large datasets residing in distributed storage; built on top of Apache Hadoop.
 - Apache Hive 1.1.0 released https://hive.apache.org/downloads.html

Apache Jackrabbit™ –scalable, high-performance hierarchical content repository designed for use as the foundation of modern world-class Web sites and other demanding content applications.
 - Apache Jackrabbit Oak 1.1.7 released http://jackrabbit.apache.org/downloads.html

Apache Kafka™ –high-throughput, publish-subscribe messaging system rethought of as a distributed commit log.
 - Apache Kafka 0.8.2.1 released http://kafka.apache.org/downloads.html

Apache Tajo™ –mature Open Source relational Big Data warehousing solution provides advanced "SQL-on-Hadoop" functionality and support.
 - The Apache Software Foundation Announces Apache™ Tajo™ v0.10.0 http://s.apache.org/mwx

Apache UIMA™ –component architecture and framework for the analysis of unstructured content like text, video and audio data.
 - Apache UIMA Java SDK 2.7.0 released http://uima.apache.org

Did You Know?

 - Did you know the top 5 Apache projects in terms of number of committers are: Apache OpenOffice (140), Apache Commons (121), Apache HTTP Server (111), Apache Hadoop (108), and Apache CloudStack (100)?

 - Did you know that Apache Spark helps analyze and process more than 2.5 petabytes of data per day for some of the world's largest telcos? 

 - Did you know that we're always looking for Apache success stories that we can share with the community? Send newsworthy metrics/users/datapoints of Apache products to <press@apache.org>!


Apache Community Notices:

 - SparkSummit East will be held 18-19 March in New York http://spark-summit.org/

 - Apache OFBiz Community Day will be held 21 March https://cwiki.apache.org/confluence/display/OFBIZ/OFBiz+Community+Days

 - Apache Mesos User Group MeetUp will take place 25 March in London http://www.meetup.com/London-Mesos-User-Group/events/220602962/

 - Apache Flink MeetUp will take place 31 March in Berlin http://www.meetup.com/Apache-Flink-Meetup/events/221037302/

 - The Apache CouchDB community published guidelines on running a CouchDB Day http://blog.couchdb.org/2015/02/25/how-to-organise-your-own-couchdb-day-its-easy/

 - The Apache Mesos community is planning an initial MeetUp in Shanghai in Q2 http://www.weibo.com/u/3068800961

 - Apache Calcite will be featured in Enterprise Data World's New Data Management Technologies/NoSQL track 2 April in Washington DC http://edw2015.dataversity.net/

 - The Women of ASF Luncheon will be held on 14 April at ApacheCon http://events.linuxfoundation.org/events/apachecon-north-america/attend/women-asf-luncheon

 - CloudStack Days will take place 16 April during ApacheCon (future events planed for Budapest, Dublin, Seattle, and Tokyo) http://cloudstack.apache.org/

 - Apache Spark Mini-Summit will take place 16 April in tandem with ApacheCon http://s.apache.org/ojh

 - Apache TrafficServer Spring Summit will take place 16-17 April in Austin https://cwiki.apache.org/confluence/display/TS/Spring+2015+Summit

 - Apache Lucene-Solor MeetUp will take place 21 April in London http://www.meetup.com/Apache-Lucene-Solr-London-User-Group

 - HBaseCon, the fourth annual event for the Apache HBase contributor/developer/admin/user community, will take place 7 May in San Francisco http://hbasecon.com/

 - Apache Usergrid will be featured in CODESHOW SE on 21 May in Charleston https://www.codeshowse.com/

 - The next Berlin Buzzwords conference will feature Apache projects such as Lucene, Spark, Solr and Cassandra and will take place 31 May-3 June in Berlin http://berlinbuzzwords.de

 - Apache Kafka and Samza will be featured at the GeekOut conference 11-12 June in Estonia http://2015.geekout.ee/

 - Are your software solutions Powered by Apache? Download & use our "Powered By" logos http://www.apache.org/foundation/press/kit/#poweredby

= = =

For real-time updates, sign up for Apache-related news by sending mail to announce-subscribe@apache.org and follow @TheASF on Twitter. For a broader spectrum from the Apache community, https://twitter.com/PlanetApache provides an aggregate of both Project activities and the personal blogs of select ASF Committers.

# # #

Wednesday Mar 11, 2015

Groovy Submitted to Become a Project at The Apache Software Foundation

The Apache Software Foundation (ASF) has confirmed today that the Groovy dynamic object-oriented programming language has formally submitted itself to the Apache Incubator in order to become part of the Foundation's efforts. [1]

The incubation of Groovy is the first step to becoming an ASF Top-Level Project. The Groovy community will join more than 350 Open Source initiatives overseen by the ASF --including 35 projects in the Apache Incubator-- and will benefit from the Foundation's widely-emulated community-driven process, stewardship, infrastructure, outreach, and events.

"We are happy to welcome Groovy to the Apache Incubator," said ASF Vice Chairman Greg Stein. "Groovy has a diverse and active community that will find 'The Apache Way' of meritocratic development a perfect complement to their existing recognition of the value and benefits of the Apache License, under which their code is released. The ASF's proven framework will offer Groovy the organizational, legal, financial, and infrastructure support needed to continue to be available to its established user base and millions of developers worldwide."

The Apache Incubator is the entry path into The Apache Software Foundation (ASF) for projects and codebases wishing to become part of the Foundation's efforts. All code donations from external organizations and existing external projects wishing to join Apache enter through the Incubator. Since its creation in 2002, the Apache Incubator has mentored 220 projects: 148 have graduated as Apache Top-level Projects, 35 are currently under incubation, and 37 were retired. Further information on the Apache Incubator is available at http://incubator.apache.org/.

"After seeing (Apache Member and Groovy creator) James Strachan's Groovy talk at OSCON in 2004, I find it wonderful serendipity to see it arrive at the Foundation a decade later," added Stein.

[1] http://s.apache.org/KWE

Monday Mar 09, 2015

The Apache Software Foundation Announces Apache™ Tajo™ v0.10.0

Mature, robust, Open Source relational Big Data warehousing solution provides advanced "SQL-on-Hadoop®" functionality and support.

Forest Hill, MD —9 March 2015— The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 350 Open Source projects and initiatives, announced today the availability of Apache™ Tajo™ v0.10.0, the latest version of the advanced Open Source data warehousing system in Apache Hadoop®.

Apache Tajo is used for low-latency and scalable ad-hoc queries, online aggregation, and ETL (extract-transform-load process) on large data sets stored on HDFS (Hadoop Distributed File System) and other data sources. By supporting SQL standards and leveraging advanced database techniques, Tajo allows direct control of distributed execution and data flow across a variety of query evaluation strategies and optimization opportunities.

"Tajo has evolved over the last couple of years into a mature 'SQL-on-Hadoop' engine," said Hyunsik Choi, Vice President of Apache Tajo. "The improved JDBC driver in this release allows users to easily access Tajo as if users use traditional RDBMSs. We have verified new JDBC driver on many commercial BI solutions and various SQL tools. It was easy and works successfully."

Tajo v0.10.0 reflects dozens of new features and improvements, including:

  • Oracle and PostgreSQL catalog store support
  • Direct JSON file support
  • HBase storage integration (allowing users to directly access HBase tables through Tajo)
  • Improved JDBC driver for easier use of JDBC application
  • Improved Amazon S3 support


A complete overview of all new enhancements can be found in the project release notes at https://dist.apache.org/repos/dist/dev/tajo/tajo-0.10.0-rc1/relnotes.html

Described as "a dark horse in the race for mass adoption" by GigaOM, Tajo is in use at numerous organizations worldwide, including Gruter, Korea University, Melon, NASA JPL Radio Astronomy and Airborne Snow Observatory projects, and SK Telecom for processing Web-scale data sets in real time.

Byeong Hwa Yun, Project Leader at Melon, said "Congratulations on 0.10.0 release! Melon is the biggest music streaming service company in S. Korea. We use Tajo as an ETL tool as well as an analytical processing system. We have experienced that Tajo makes our ETL jobs faster 1.5x-10x than Hive does. Besides, HBase storage integration in this release enables our analytic pipeline simpler. We hope that Tajo has a large role to play in the Apache Hadoop ecosystem." 

"I'm very happy with that Tajo has rapidly developed in recent years," said Jihoon Son, member of the Apache Tajo Project Management Committee. "One of the most impressive parts is the improved support on Amazon S3. Thanks to the EMR bootstrap, users can exploit Tajo's advanced SQL functionalities on AWS with just a few clicks."

Availability and Oversight
Apache Tajo software is released under the Apache License v2.0 and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. For downloads, documentation, and ways to become involved with Apache Tajo, visit http://tajo.apache.org/ and https://twitter.com/ApacheTajo

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than 350 leading Open Source projects, including Apache HTTP Server --the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 500 individual Members and 4,500 Committers successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Budget Direct, Cerner, Citrix, Cloudera, Comcast, Facebook, Google, Hortonworks, HP, IBM, InMotion Hosting, iSigma, Matt Mullenweg, Microsoft, Pivotal, Produban, WANdisco, and Yahoo. For more information, visit http://www.apache.org/ and follow https://twitter.com/TheASF

# # #

© The Apache Software Foundation. "Apache", "Tajo", "Apache Tajo", "Hadoop", "Apache Hadoop", and the Apache Tajo logo are registered trademarks or trademarks of The Apache Software Foundation. All other brands and trademarks are the property of their respective owners.

Friday Mar 06, 2015

The Apache News Round-up: week ending 6 March 2015

The Apache community has been working hard this week! Here's what's happened:

ApacheCon™ –the official conference series of The Apache Software Foundation.
 - Special Registration Rates for ApacheCon North America available through 21 March http://apachecon.com/

ASF Infrastructure –our distributed team on four continents keeps the ASF's infrastructure running around the clock.
 - Global uptime across all Apache services is a cracking 99.07% http://status.apache.org/

Apache Archiva™ –an application for managing one or more remote repositories, including administration, artifact handling, browsing and searching.
 - Apache Archiva 2.2.0 released http://archiva.apache.org/

Apache Buildr™ –build system for Java-based applications, including support for Scala, Groovy and a growing number of JVM languages and tools.
 - Apache Buildr 1.4.22 released http://buildr.apache.org/

Apache Hama™ –a High-Performance BSP computing engine, which can be used to perform compute-intensive general scientific BSP applications, Google's Pregel-like graph applications, and machine learning algorithms.
 - Apache Hama now supports Hadoop YARN; in use at Samsung Electronics http://s.apache.org/nBl

Apache Lens (Incubating) –a unified analytics interface that aims to cut the data analytics silos by providing a single view of data across multiple tiered data stores and optimal execution environment for the analytical query.
 - Apache Lens 2.0.1-beta-incubating release http://lens.incubator.apache.org/

Apache Lucene™ –a high-performance, full-featured text search engine library written entirely in Java.
 - Apache Lucene 4.10.4 released http://www.apache.org/dyn/closer.cgi/lucene/java/4.10.4

Apache Olingo™ –a Java library that enables developers to implement OData producers and OData consumers.
 - Apache Olingo 2.0.3  released http://olingo.apache.org/doc/odata2/download.html

Apache POI™ –well-known in the Java field as a library for reading and writing Microsoft Office file formats, such as Excel, PowerPoint, Word, Visio, Publisher and Outlook.
 - Apache POI 3.12-beta1 released http://poi.apache.org/download.html

Apache Solr™ –the popular, blazing fast, open source NoSQL search platform from the Apache Lucene project.
 - Apache Solr 4.10.4 released http://www.apache.org/dyn/closer.cgi/lucene/solr/4.10.4


Did You Know?

 - Did you know that the top 5 Apache Project categories are: Library (90 projects), Network-Server (37 projects), XML (29 projects), Web Framework (25 projects), and Big Data (24 projects)?

 - Did you know that The ASF has been accepted as a Google Summer of Code mentoring organization for the 11th consecutive year? (GSoC is overseen by Apache Community Development http://community.apache.org/ )

 - Did you know that Apache Hama now supports Hadoop YARN and is in use at Samsung Electronics?


Apache Community Notices:

 - The next Apache Mesos User Group MeetUp will take place 25 March in London http://www.meetup.com/London-Mesos-User-Group/events/220602962/

 - The Women of ASF Luncheon will be held on 14 April at ApacheCon http://events.linuxfoundation.org/events/apachecon-north-america/attend/women-asf-luncheon

 - CloudStack Days will take place 16 April during ApacheCon (CFP open for all 5 CloudStack Days events: Austin, Budapest, Dublin, Seattle, and Tokyo) http://cloudstack.apache.org/

 - Apache Spark Mini-Summit will take place 16 April in tandem with ApacheCon http://s.apache.org/ojh

 - Apache TrafficServer Spring Summit will take place 16-17 April in Austin https://cwiki.apache.org/confluence/display/TS/Spring+2015+Summit

 - HBaseCon, the fourth annual event for the Apache HBase contributor/developer/admin/user community, will take place 7 May in San Francisco http://hbasecon.com/

 - The next Berlin Buzzwords conference will feature Apache projects such as Lucene, Spark, Solr and Cassandra and will take place 31 May-3 June in Berlin http://berlinbuzzwords.de

 - Are your software solutions Powered by Apache? Download & use our "Powered By" logos http://www.apache.org/foundation/press/kit/#poweredby

= = =

For real-time updates, sign up for Apache-related news by sending mail to announce-subscribe@apache.org and follow @TheASF on Twitter. For a broader spectrum from the Apache community, https://twitter.com/PlanetApache provides an aggregate of both Project activities and the personal blogs of select ASF Committers.

# # #

Friday Feb 27, 2015

The Apache News Round-up: week ending 27 February 2015

Here's what our community of 4,000+ Committers have been busily working on this week:

ApacheCon™ –the official conference series of The Apache Software Foundation.
 - The ApacheCon program is now available; Early Bird registration ends 28 February http://apachecon.com/

ASF Infrastructure –our distributed team on four continents keeps the ASF's infrastructure running 24x7x365.
 - Global uptime across all Apache services is a stellar 99.73% http://status.apache.org/
 - Notice of server maintenance policy change https://blogs.apache.org/infra/entry/towards_a_redeployable_future_or

ASF Security –a team comprising ASF Members dedicated to helping and advising Apache projects on security issues, as well as co-ordinating the handling of security vulnerabilities.
 - [IMPORTANT] Security Notice: CVE-2015-0254 XXE and RCE via XSL extension in JSTL XML tags http://mail-archives.apache.org/mod_mbox/www-announce/201502.mbox/%3C82207A16-6348-4DEE-877E-F7B87292576A%40apache.org%3E

Apache Allura™ –Open Source implementation of a software "forge", a web site that manages source code repositories, bug reports, discussions, wiki pages, blogs and more for any number of individual projects.
 - Apache Allura 1.2.1 released http://www.apache.org/dyn/closer.cgi/allura/allura-1.2.1.tar.gz

Apache Commons™ DBCP –Open Source software library implements database connection pooling.
 - Apache Commons DBCP 2.1 released http://commons.apache.org/dbcp/download_dbcp.cgi

Apache HBase™ –Open Source, distributed Big Data store for Apache Hadoop.
 - The Apache Software Foundation Announces Apache™ HBase™ v1.0 http://s.apache.org/zW

Apache Log4j™ –well known framework for logging application behavior.
 - Apache Log4j 2.2 released http://logging.apache.org/log4j/2.x/

Apache Lucene™ –a high-performance, full-featured text search engine library written entirely in Java.
 - Apache Lucene 5.0.0 released http://lucene.apache.org/core/mirrors-core-latest-redir.html

Apache MRQL (Incubating) –query processing and optimization system for large-scale, distributed data analysis, built on top of Apache Hadoop, Hama, Spark, and Flink.
 - Apache MRQL 0.9.4-incubating released http://www.apache.org/dyn/closer.cgi/incubator/mrql

Apache Openmeetings™ –video conferencing, instant messaging, white board, collaborative document editing and other groupware tools using API functions of the Red5 Streaming Server for Remoting and Streaming.
 - Apache Openmeetings 3.0.4 released http://openmeetings.apache.org/downloads.html

Apache Parquet (Incubating) –the common definition of Parquet column-oriented file format for Apache Hadoop.
 - Apache Parquet Format 2.3.0 (Incubating) released (first release of Parquet Format with org.apache.parquet artifacts and package names) https://www.apache.org/dyn/closer.cgi/incubator/parquet/parquet-format-2.3.0-incubating/

Apache Solr™ –popular, blazing fast, Open Source NoSQL search platform from the Apache Lucene project.
 - Apache Solr 5.0.0 and Reference Guide for Solr 5.0 http://lucene.apache.org/solr/mirrors-solr-latest-redir.html

Apache Sqoop™ –a tool designed for efficiently transferring bulk data between Apache Hadoop and structured datastores, such as relational databases.
 - Apache Sqoop 1.99.5 released http://www.apache.org/dyn/closer.cgi/sqoop/1.99.5

Apache Maven™ –a project development management and comprehension tool.
 - Apache Source Release Assembly Descriptor 1.0.5 Released http://maven.apache.org/apache-resource-bundles/

Apache Tomcat™ –Open Source software implementation of the Java Servlet, JavaServer Pages, Java Unified Expression Language and Java WebSocket technologies.
 - Apache Tomcat 8.0.20 released http://tomcat.apache.org/download-80.cgi
 - Apache Tomcat Apache Standard Taglib 1.2.3 released http://tomcat.apache.org/download-taglibs.cgi


Did You Know?

 - Did you know that the top 5 Apache Project categories are: Library (90 projects), Network-Server (37 projects), XML (29 projects), Web Framework (25 projects), and Big Data (24 projects)?

 - Did you know that in 2002 the Apache Incubator was the fourth project at the ASF, created to mentor projects and codebases wishing to become part of the Foundation's efforts? (Bonus: the first projects to enter the Apache Incubator were Apache Ant, Avalon, DB, Forrest, HC, POI, and TCL.)

 - Did you know that Apache Tajo is part of the Apache Hadoop ecosystem at Gruter, Korea University, NASA JPL's Radio Astronomy and Airborne Snow Observatory projects, and SK Telecom, among others?


Apache Community Notices:

 - Final days for the annual Future of Open Source Survey: have you shared your thoughts? http://s.apache.org/0UW

 - HBaseCon, the fourth annual event for the Apache HBase contributor/developer/admin/user community, will take place 7 May in San Francisco http://hbasecon.com/

 - The next Apache Mesos User Group MeetUp will take place 25 March in London http://www.meetup.com/London-Mesos-User-Group/events/220602962/

 - The Women of ASF Luncheon will be held on 14 April at ApacheCon http://events.linuxfoundation.org/events/apachecon-north-america/attend/women-asf-luncheon

 - Apache TrafficServer Spring Summit will take place 16-17 April in Austin https://cwiki.apache.org/confluence/display/TS/Spring+2015+Summit

 - The Apache Marmotta community are being proactive and beginning to collect ideas for Google Summer of Code 2015 http://markmail.org/message/4nowycutyx2m2rj3

 - The next Berlin Buzzwords conference will feature Apache projects such as Lucene, Spark, Solr and Cassandra and will take place 31 May-3 June in Berlin http://berlinbuzzwords.de

 - Are your software solutions Powered by Apache? Download & use our "Powered By" logos http://www.apache.org/foundation/press/kit/#poweredby

= = =

For real-time updates, sign up for Apache-related news by sending mail to announce-subscribe@apache.org and follow @TheASF on Twitter. For a broader spectrum from the Apache community, https://twitter.com/PlanetApache provides an aggregate of both Project activities and the personal blogs of select ASF Committers.

# # #

Tuesday Feb 24, 2015

The Apache Software Foundation Announces Apache™ HBase™ v1.0

Stable version of Open Source, distributed Big Data store for Apache Hadoop features improved performance, ease of use, new availability guarantees, and future release compatibility.

Forest Hill, MD –24 February 2015– The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 350 Open Source projects and initiatives, announced today the availability of Apache™ HBase™ v1.0, the distributed, scalable, database for Apache™ Hadoop™ and HDFS™.

"Apache HBase v1.0 marks a major milestone in the project's development," said Michael Stack, Vice President of Apache HBase. "It is a monumental moment that the army of contributors who have made this possible should all be proud of. The result is a thing of collaborative beauty that also happens to power key, large-scale Internet platforms."

Dubbed the "Hadoop Database", HBase is used on top of Apache Hadoop and HDFS (Hadoop Distributed File System) for random, real-time read/write access for Big Data (billions of rows X millions of columns) across clusters of commodity hardware. HBase is used by Apple, Facebook, FINRA, Flipboard, Flurry, Pinterest, RocketFuel, Salesforce, Xiaomi, and Yahoo!, among many other organizations. 

Apache HBase has also fostered a healthy ecosystem of projects that run on top of it, such as Apache Phoenix, a SQL layer over HBase, and OpenTSDB, a time series database that uses HBase as its backing store.

"Medium- and high- scale services at hundreds of enterprises and some of the largest Internet companies today are backed by Apache HBase," explained Andrew Purtell, member of the Apache HBase Project Management Committee. "Chances are when using your computer or mobile device you interact with a system built with HBase many times daily without ever knowing it. The HBase 1.0 release appropriately acknowledges a maturity already achieved by the Apache HBase community and software both, and is a great occasion to learn more about HBase, how it can help you solve your scale data challenges, and the growing ecosystem of Open Source and commercial software that chooses HBase as foundation."

Apache HBase v1.0 is the result of 7 years of development, and reflects more than 1,500 changes and upgrades over the previous major release, Apache HBase 0.98.0. Notable new features include:

  • Improved performance without sacrificing stability;
  • Introduction of new APIs and reorganization of select client-side APIs;
  • Read availability using timeline consistent region replicas for new availability guarantees;
  • Online configuration change to enable reloading a subset of the server configuration without restarting the region servers; and
  • New look, enhanced usability, and radically revamped documentation.

(Please see the accompanying technical fact sheet at https://blogs.apache.org/hbase/entry/start_of_a_new_era for details on new functionality). 

Lars Hofhansl, Principal Architect at Salesforce.com, and member of the HBase Project Management Committee, said, "Over 13,000 JIRA issues were filed to get HBase where it is now. Going forward we have a clear compatibility story between major and minor versions."

"This is a very exciting moment for Apache HBase, and goes to show how far we have come as a community in stabilizing and maturing Apache HBase", said Francis Liu, Development Lead for Apache HBase at Yahoo. "HBase is an integral part of our technology stack powering numerous critical offstage processing use cases across our business in online advertising, search, communication, content personalization and targeting, and social, mobile and emerging products. Today, we operate some of the largest HBase clusters across a 3,000 server footprint, and look forward to working with the community with a stable release as a base to scale individual HBase clusters to millions of regions soon." 

"Hearty congratulations to the HBase community," said Ishan Chhabra, Lead of all things HBase at Rocketfuel Inc. "Apache HBase already powers our critical online applications and data pipelines over thousands of machines globally, and the community's relentless focus on stability and performance gives us the confidence to continue making it an integral part of our data stack as we scale to 10,000+ machines."

"Apache HBase is a critical data storage system at Pinterest, where we run it across thousands of nodes doing close to 10 million operations every second," said Raghavendra Prabhu, Head of Infrastructure at Pinterest. "HBase is the underlying technology behind Pinterest's Zen graph storage service, which powers key product features like the home feed, messages, notifications, network news and our interest graph. We are eagerly looking forward to the improvements in availability and reliability in HBase 1.0 and will continue to work with the community on improving it for large scale user facing workloads."

"HBase has been the cornerstone of our customer analytics platform Lily since late 2008", says Steven Noels, CTO of NGDATA. "Granted it was an adventurous choice at that time, but since then HBase has evolved and matured, reconfirming that choice time and time again. Seeing 1.0 (finally) shipping is a sign of confirmed adoption throughout all layers of the industry, from internet companies up to (in our case) large financial institutions, telcos and media companies. We are thankful to be part of such a strong, persistent and vibrant community development endeavour." 

"HBase-1.0.0 is the start of a new era," said Enis Söztutar, HBase v1.0 release manager and member of the Apache HBase Project Management Committee. "We have marked it as the next stable version of Apache HBase, and encourage all new users to start using this version."

"There is no rest for the wickedly talented set of contributors who made HBase 1.0," added Stack. "HBase 2.0 is already taking form in our master branch. Users can look forward to new orders of read/write and node count scaling and this time around they won't have to wait seven years on it shipping; HBase 2.0 will be out later this year."

Availability and Oversight
Apache HBase v1.0 is available immediately as a free download from http://www.apache.org/dyn/closer.cgi/hbase/ As with all Apache products, Apache HBase software is released under the Apache License v2.0, and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. For documentation and ways to become involved with Apache HBase, visit http://hbase.apache.org/ and @HBase on Twitter.

Get Involved!
The HBase community welcomes contributions and participation through various mailing lists as well as attending face-to-face MeetUps, trainings, and events. Catch Apache HBase in action at HBaseCon, taking place 7 May 2015 in San Francisco http://hbasecon.com/

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than 350 leading Open Source projects, including Apache HTTP Server --the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 500 individual Members and 4,500 Committers successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Budget Direct, Cerner, Citrix, Cloudera, Comcast, Facebook, Google, Hortonworks, HP, IBM, InMotion Hosting, iSigma, Matt Mullenweg, Microsoft, Pivotal, Produban, WANdisco, and Yahoo. For more information, visit http://www.apache.org/ and follow https://twitter.com/TheASF

© The Apache Software Foundation. "Apache", "Apache Hadoop", "Hadoop", "Apache HBase", "HBase", "Apache HDFS", "HDFS", "Apache Phoenix", "Phoenix", and "ApacheCon", are trademarks of The Apache Software Foundation. All other brands and trademarks are the property of their respective owners. 

Tuesday Feb 17, 2015

The Apache Software Foundation Welcomes Cloudera as a Platinum-level Sponsor

Sponsorship boosts operational support for Apache projects and community

Forest Hill, MD –17 February 2015– The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 350 Open Source projects and initiatives, announced today that Cloudera has renewed its sponsorship of the ASF with an upgrade to the Platinum level.

"We are grateful for Cloudera's continued generous support, which, in turn, helps advance the efforts of the Apache community at-large," said ASF President Ross Gardler. "Sponsoring the ASF helps provide critical infrastructure and support services that keep the Foundation running on a day-to-day basis."

As a United States 501(c)(3) not-for-profit charitable organization, the ASF relies on individual donors and corporate sponsors to help bolster existing projects, incubate new initiatives, and promote meritocratic, community-driven development "The Apache Way".

An ASF Sponsor since 2011, Cloudera is recognized as a champion and active contributor to numerous Apache projects, including Avro, Hadoop, HBase, Hive, Pig, Whirr, and Zookeeper. 

"Open source software has proven a powerful engine for innovation," said Mike Olson, Founder and Chief Strategy Officer of Cloudera. "It harnesses the talent and creativity of the global community of developers. Participation in that community is critical. Cloudera has long been a proud member of the ASF. Developers on the Cloudera payroll have created and contributed to ASF projects, working closely with their peers in the Apache Hadoop ecosystem. Our platinum sponsorship of the ASF complements our active contribution of code. The organization provides essential governance and community support on which our business relies. This donation, and our ongoing commitment, allows the ASF to deepen its investment in community, collaboration and innovation."

Cloudera joins Platinum level Sponsors Citrix, Facebook, Google, Matt Mullenweg, Microsoft, and Yahoo!; Gold Sponsors Comcast, HP, Hortonworks, and IBM; Silver Sponsors Budget Direct, Cerner, InMotion Hosting, iSIGMA, Pivotal, Produban, and WANdisco; and Bronze Sponsors Accor, Basis Technology, Bluehost, Cloudsoft Corporation, Samsung, Talend, and Twitter. For more information on ASF Sponsorship, please visit http://apache.org/foundation/sponsorship.html

About Cloudera
Cloudera is revolutionizing enterprise data management by offering the first unified Platform for Big Data, an enterprise data hub built on Apache Hadoop. Cloudera offers enterprises one place to store, access, process, secure, and analyze all their data, empowering them to extend the value of existing investments while enabling fundamental new ways to derive value from their data. Cloudera’s open source Big Data platform is the most widely adopted in the world, and Cloudera is the most prolific contributor to the open source Hadoop ecosystem. As the leading educator of Hadoop professionals, Cloudera has trained over 27,000 individuals worldwide. Over 1,400 partners and a seasoned professional services team help deliver greater time to value. Finally, only Cloudera provides proactive and predictive support to run an enterprise data hub with confidence. Leading organizations in every industry plus top public sector organizations globally run Cloudera in production. For more information about Cloudera, visit http://www.cloudera.com/content/cloudera/en/about/company-profile.html

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than 350 leading Open Source projects, including Apache HTTP Server --the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 500 individual Members and 4,500 Committers successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Budget Direct, Cerner, Citrix, Cloudera, Comcast, Facebook, Google, Hortonworks, HP, IBM, InMotion Hosting, iSigma, Matt Mullenweg, Microsoft, Pivotal, Produban, WANdisco, and Yahoo. For more information, visit http://www.apache.org/ or follow https://twitter.com/TheASF

© The Apache Software Foundation. "Apache", "Apache Avro", "Avro", "Apache Hadoop", "Hadoop", "Apache HBase", "HBase", "Apache Hive", "Hive", "Apache Pig", "Pig", "Apache Whirr", "Whirr", "Apache Zookeeper", "Zookeeper", and "ApacheCon", are trademarks of The Apache Software Foundation. All other brands and trademarks are the property of their respective owners.

Cloudera, Cloudera's Platform for Big Data, Cloudera Enterprise Data Hub Edition, Cloudera Enterprise Flex Edition, Cloudera Enterprise Basic Edition and CDH are trademarks or registered trademarks of Cloudera Inc. in the United States, and in jurisdictions throughout the world. All other company and product names may be trade names or trademarks of their respective owners.

# # #

Friday Feb 13, 2015

The Apache News Round-up: week ending 13 February 2015

In addition to what's happened at the Foundation over the past week, we're curious: "Did You Know?", and are also sharing some community announcements:

ASF Infrastructure –our distributed team on four continents keeps the ASF's infrastructure running 24x7x365.
 - The Infra team's 2 million weekly checks report 94.19% global uptime across all services http://status.apache.org/

Apache Jackrabbit™ –scalable, high-performance hierarchical content repository designed for use as the foundation of modern world-class Web sites and other demanding content applications.
 - Apache Jackrabbit 2.9.1 and Apache Jackrabbit Oak 1.1.6 released http://jackrabbit.apache.org/downloads.html

Apache OODT™ –a software framework, and an architectural style for the rapid construction of scientific data systems.
 - Apache OODT 0.8.1 released http://www.apache.org/dyn/closer.cgi/oodt/

Apache SIS™ –spatial framework that enables better representation of coordinates for searching, data clustering, archiving, or any other relevant spatial needs.
 - Apache SIS 0.5 released http://sis.apache.org/downloads.html

Apache Tomcat™ –Open Source software implementation of the Java Servlet, JavaServer Pages, Java Expression Language and Java WebSocket technologies.
 - Important Security Notice: CVE-2014-0227 Request Smuggling http://mail-archives.apache.org/mod_mbox/www-announce/201502.mbox/%3C54D87A0F.7010400%40apache.org%3E
 - Apache Tomcat 7.0.59 released http://tomcat.apache.org/download-70.cgi

Did You Know?

 - Did you know that select ASF Members are "Apache Quill" guest bloggers for OpenSource.com? http://opensource.com/search/apachesolr_search/quill

 - Did you know that the Big Brother voting infrastructure runs on Apache CloudStack?

 - Did you know DARPA Memex helps fund developers of Apache Tika, Nutch and Solr to deal with humanitarian issues such as Human Trafficking?

Apache Community Notices:

 - The ASF is participating again this year in the annual Future of Open Source Survey. We appreciate your input! https://www.surveymonkey.com/s/FoOS-Apache

 - The Apache Mesos MeetUp & Fireside Chat will take place 18 February at Offerpop in New York http://www.meetup.com/Apache-Mesos-NYC-Meetup/events/220352664/

 - The Apache Usergrid (incubating) + ElasticSea­rch Meetup will take place 18 February at Apigee in San Jose http://www.meetup.com/apache-usergrid-bay-area/events/220486635/?a=ea1_grp&rv=ea1&_af_eid=220486635&_af=event

 - The Apache Hive User Group MeetUp will take place 19 February at LinkedIn in Mountain View http://www.meetup.com/Hive-User-Group-Meeting/events/219794523/

 - The Apache Aurora MeetUp and Mesos User Group will take place 19 February at Twitter in San Francisco https://www.eventbrite.com/e/apache-aurora-meet-up-tickets-15618171371

 - Catch Apache TomEE in action on 21 February at the ceJug/JVM Users Group in Ceará, Brazil https://twitter.com/cejug

 - The first ever Apache Phoenix MeetUp will take place 24 February at Salesforce in San Francisco http://www.meetup.com/San-Francisco-Apache-Phoenix-Meetup/events/220009583/

 - The Women of ASF Luncheon will be held on 14 April at ApacheCon http://events.linuxfoundation.org/events/apachecon-north-america/attend/women-asf-luncheon

 - Apache TrafficServer Spring Summit will take place 16-17 April in Austin https://cwiki.apache.org/confluence/display/TS/Spring+2015+Summit

 - The Apache Marmotta community are being proactive and beginning to collect ideas for Google Summer of Code 2015 http://markmail.org/message/4nowycutyx2m2rj3

 - Are your software solutions Powered by Apache? Download & use our "Powered By" logos http://www.apache.org/foundation/press/kit/#poweredby

= = =

For real-time updates, sign up for Apache-related news at announce@apache.org and follow @TheASF on Twitter. For a broader spectrum from the Apache community, https://twitter.com/PlanetApache provides an aggregate of both Project activities and the personal blogs of select ASF Committers.

# # #

Friday Feb 06, 2015

The Apache News Round-up: week ending 6 February 2015

February marks the 20th Anniversary of the ubiquitous Apache HTTP Server, the world's most popular Web server since 1996, and the flagship project behind the creation of the ASF. Here's what happened at the Foundation over the past week:

ApacheCon™ –the official conference series of The Apache Software Foundation. Events in 2015: North America (13-17 April/Austin) and Europe (28 September-1 October/Budapest)
 - FINAL CALL: Travel Assistance applications close on 6 February http://www.apache.org/travel/

ASF Infrastructure –our distributed team on four continents keeps the ASF's infrastructure running 24x7x365.
 - The Infra team's 2 million weekly checks yields 94.48% global uptime across all services http://status.apache.org/

ASF Operations –the day-to-day Foundation-wide activities that support Apache projects and the overall ASF mission.
 - The Apache Software Foundation Operations Summary: October-December 2014 http://s.apache.org/fOb

Apache Flex™ – a highly productive, Open Source application framework for building and maintaining expressive applications that deploy consistently on all major browsers, desktops and devices, including smartphones, tablets, and TVs.
 - Apache Flex SDK 4.14.0 released http://www.apache.org/dist/flex/4.14.0/RELEASE_NOTES

Apache HttpComponents Client™ –a library for client-side HTTP communication built on HttpCore. It provides connection management, cookie management, and authentication.
 - Apache HttpComponents Client 4.4 GA released https://www.apache.org/dist/httpcomponents/httpclient/RELEASE_NOTES-4.4.x.txt

Apache Jackrabbit™ –scalable, high-performance hierarchical content repository designed for use as the foundation of modern world-class Web sites and other demanding content applications.
 - Apache Jackrabbit Oak 1.0.10 and 1.0.11 released http://jackrabbit.apache.org/downloads.html

Apache Kafka™ –high-throughput, publish-subscribe messaging system rethought of as a distributed commit log.
 - Apache Kafka 0.8.2.0 released https://archive.apache.org/dist/kafka/0.8.2.0/RELEASE_NOTES.html

Apache PDFBox™ –an Open Source Java tool for working with Portable Document Format (PDF) documents.
 - Apache PDFBox named an Open Source Partner Organization of the PDF Association http://s.apache.org/Wsf

Apache Streams (Incubating) –a lightweight server for ActivityStreams.
 - Apache Streams 0.1-incubating released http://streams.incubator.apache.org

Apache Syncope™ –an Open Source system for managing digital identities in enterprise environments, implemented in JEE technology.
 - Apache Syncope 1.2.2 released http://syncope.apache.org/downloads.html

Are your software solutions Powered by Apache?
 - Download & use our "Powered By" logos http://www.apache.org/foundation/press/kit/#poweredby

= = =

For real-time updates, sign up for Apache-related news at announce@apache.org and follow @TheASF on Twitter. For a broader spectrum from the Apache community,https://twitter.com/PlanetApache provides an aggregate of both Project activities and the personal blogs of select ASF Committers.

# # #

Tuesday Feb 03, 2015

Apache™ PDFBox™ named an Open Source Partner Organization of the PDF Association

Liaison helps enterprise users benefit from enhanced PDF technology and serves as a foundation to other software applications.

Forrest Hill, MD —03 February 2015— Apache PDFBox™, a Top-Level Project of The Apache Software Foundation, today announced the project has been named as a Partner Organization of the PDF Association. 

The Apache PDFBox™ library is an Open Source Java tool for working with Portable Document Format (PDF) documents. It allows for the creation of new PDF documents, manipulation, rendering, signing of existing documents and the ability to extract content from documents. Apache PDFBox also includes several command line utilities.

"We are proud to be recognized by the PDF Association as a driver of ISO-­standardized PDF technology for electronic documents," said Andreas Lehmkühler, Vice President of Apache PDFBox. "Our liaison will help further strengthen the development of Apache PDFBox by providing access to knowledge and resources around PDF technology."

PDF was first released by Adobe Systems in 1993, became an ISO International Standard - ISO 32000-1 in 2008.

Founded in 2006, the PDF Association (http://www.pdfa.org/) is an international organization promoting awareness and adoption of open standards in digital document applications using PDF technology. The association facilitates education, networking, communication, and sharing of expertise and experience with interested parties worldwide. It offers its membership of over 150 enterprises and individual subject-matter experts from more than 20 countries direct contact with PDF technology experts and access to documents from ISO working groups, including release candidates for PDF upcoming standards.

The PDF Association's Partner Organizations are international associations concerned with document management, enterprise content management, long-term archiving and accessibility. Apache PDFBox the PDF Association's first Open Source Partner Organization. The PDF Association delivers vital information about implementing PDF technology to software developers and IT decision makers, and helps document management and ECM implementers understand and leverage ISO-standardized PDF technology. In turn, enterprise systems implementers and end-users benefit from enhanced PDF technology.

"With Apache PDFBox, the first Open Source organization is joining our Association," said Thomas Zellmann, Managing Director of the PDF Association. "This is our contribution to support the growth of freely available PDF solutions and their functionality to further expand the market penetration of the PDF standard."

"Being part of the PDF Association recognizes our commitment to making ISO standardized electronic document technology easily available by leveraging Apache PDFBox as a foundation for other software applications," added Lehmkühler.

About Apache PDFBox
The Apache PDFBox™ library is an Open Source Java tool that allows users to create new PDF documents, manipulate existing documents, extract content, digitally sign, print, and validate files against the PDF/A-1b standard. It also includes several command line utilities, including encrypt, decrypt, overlay, debugger, merger, PDFToImage, and TextToPDF. Apache PDFBox software is released under the Apache License v2.0, and is overseen by a self-selected volunteer team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. For downloads, documentation, and ways to become involved with Apache PDFBox, visit http://pdfbox.apache.org/

# # #


Monday Feb 02, 2015

The Apache Software Foundation Operations Summary: October-December 2014

FOUNDATION OPERATIONS SUMMARY
October-December 2014

"The Apache Software Foundation is a cornerstone of the modern open source software ecosystem – supporting some of the most widely used and important software solutions powering today’s Internet economy..." –Mark Driver, Research Vice President, Gartner Group

> The Place for Open Innovation and Collaboration: A recent recount of the the ASF's projects and initiatives revealed that the number exceeds 350, including 160 Top-level Projects, 110 sub-projects (not in the original census), 36 podlings in the Apache Incubator, and 39 technical initiatives in the Apache Labs. New projects that have entered the Apache Incubator are: Corinthia (CMS), Htrace (Big Data), Ignite (Big Data), Kylin (Big Data), Lens (Big Data), NiFi (Cloud Computing), SAMOA (Big Data), Tamaya (Java Tools/Frameworks), Taverna (Workflow Management), and Zeppelin (Big Data Analytics). Podlings that graduated the Incubator as Apache Top-Level Projects are: DeviceMap, Drill, Falcon, and MetaModel. Hadoop Development Tools retired from the Incubator. We distributed formal press releases to support the above milestones, and have conducted regular outreach to members of the media and analyst community, and to the community at-large through the new weekly Apache News Round-Up reports via email to announce@apache.org. To help developers and users identify their affiliation with products and initiatives under the Apache umbrella, we provided new "Powered by Apache" graphical assets for all Apache projects http://www.apache.org/foundation/press/kit/#poweredby

> Apache Community: "Community Over Code" is the cornerstone of the Foundation's core tenets, underscoring open collaboration amongst the ASF's 500+ Members and 4,500+ Committers through respectful, honest, technically-focused interaction. We participated in Google Summer of Code for the 9th consecutive year: of the initial 42 students accepted, 36 passed the midterm evaluations and 35 passed the final evaluations. ApacheCon, the Foundation's official user conference, trainings, and expo was held in 17-21 November 2014 in Budapest, Hungary. Preparations are underway for ApacheCon North America (13-17 April 2015/Austin, Texas), including soliciting presentations and processing travel assistance applications. In addition, the Lucene/Solr and CloudStack communities will be co-locating conference events during ApacheCon.

> Contributions and License Agreements: We received and filed 198 Individual CLAs, 18 Corporate CLAs, and 8 software grants.

> Infrastructure: Over the past quarter, the ASF Infrastructure team has largely been focused on automation and paying down some long standing technical debt. Much of the focus has been on covering the spread of configuration management over our core services. Thanks to several Infrastructure sponsors, we have been able to move move a number of services into the public Cloud. Of course, there are a number of project and user facing changes taking place as well; we've debuted a code signing service to give our projects' end users a better installation experience while making them more secure. We've also dramatically improved our CI platform, adding much additional capacity and capability in the form of more static slaves, the ability to spin up on-demand slaves as well as Docker containers. Apache Infrastructure contractors work tirelessly around the globe and around the clock to ensure our distributed project teams can continue to work without worrying about the services they depend upon. ASF Infrastructure averaged 99.39% overall uptime, as detailed at http://s.apache.org/uptime

> Budget: As ASF sponsorship funds are a critical part of ensuring we can support all Apache projects, Infrastructure accounts for our largest expense at 63% of our current budget. 10% of our budget is allocated to Marketing & Publicity, where our contractor ensures prompt and appropriate responses to all inquiries. An additional 10% is spent on general administration, comprising legal and bank fees, insurances, executive assistant, accounting services, and associated expenses. 5% is dedicated to ASF Brand Management to ensure Apache brands remain independent of any individual commercial interests through trademark registration and related activities.


For more information, subscribe to the announce@apache.org mailing list and visit http://www.apache.org/, the ASF Blog at http://blogs.apache.org/, and the @TheASF feed on Twitter.

# # #

Report prepared by Sally Khudairi, Vice President Marketing & Publicity
(c) The Apache Software Foundation 2015

Friday Jan 30, 2015

The Apache News Round-up: week ending 30 January 2015

It's hard to believe that we're wrapping up the first month of the new year! Here's what happened over the past week:

ASF Infrastructure –our distributed team (10 rotating volunteers and 4 paid staff) on four continents responsible for keeping the ASF's infrastructure running 24x7x365.
 - 94.47% global uptime across all services this week http://status.apache.org/

FINAL CALL for ApacheCon™ –the official conference series of The Apache Software Foundation
 - CFP closes on 1 February http://events.linuxfoundation.org/events/apachecon-north-america/program/cfp
 - Travel Assistance applications close on 6 February http://www.apache.org/travel/

Apache BookKeeper™ – a reliable replicated log service that can be used to turn any standalone service into a highly available replicated service.
 - The Apache Software Foundation Announces Apache BookKeeper as a Top-Level Project http://s.apache.org/3BK

Apache HTTP Server™ – Open Source HTTP server for modern operating systems including UNIX, Microsoft Windows, Mac OS/X and Netware; the most popular Web server since 1996.
 - Apache HTTP Server 2.4.12 Released http://httpd.apache.org/docs/trunk/new_features_2_4.html

Apache MyFaces™ Core –a JavaServer Faces 2.0 implementation as specified by JSR-314. MyFaces Core has passed Sun's JSR-314 TCK and is 100% compliant with the JSR-314 specification.
 - Apache MyFaces Core v2.0.23, v. 2.1.17, and v2.2.7 released http://myfaces.apache.org/download.html

Apache Qpid™ –implements the latest AMQP specification, the first open standard for enterprise messaging, and provides transaction management, queuing, distribution, security, management, clustering, federation and heterogeneous multi-platform support and a lot more.
 - Security Advisory: Apache Qpid qpidd can be crashed by unauthenticated user http://mail-archives.apache.org/mod_mbox/www-announce/201501.mbox/%3C54C60497.5060504%40apache.org%3E
 - Security Advisory: anonymous access to Apache Qpid qpidd cannot be prevented http://mail-archives.apache.org/mod_mbox/www-announce/201501.mbox/%3C54C604A8.5090505%40apache.org%3E

Apache Samza™ –provides a system for processing stream data from publish-subscribe systems such as Apache Kafka.
 - The Apache Software Foundation Announces Apache Samza as a Top-Level Project http://s.apache.org/8pU

Are your software solutions Powered by Apache?
 - Download & use our "Powered By" logos today! http://www.apache.org/foundation/press/kit/#poweredby

= = =

For real-time updates, sign up for Apache-related news at announce@apache.org and follow @TheASF on Twitter. For a broader spectrum from the Apache community, https://twitter.com/PlanetApache provides an aggregate of both Project activities and the personal blogs of select ASF Committers.

# # #

Tuesday Jan 27, 2015

The Apache Software Foundation Announces Apache™ Samza™ as a Top-Level Project

Open Source Big Data distributed stream processing framework used in business intelligence, financial services, healthcare, mobile applications, security, and software development, among other industries.

Forest Hill, MD –27 January 2015– The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 350 Open Source projects and initiatives, announced today that Apache™ Samza™ has graduated from the Apache Incubator to become a Top-Level Project (TLP), signifying that the project's community and products have been well-governed under the ASF's meritocratic process and principles.

"The incubation process at Apache has been great. It has helped us cultivate a strong community, and provided us with the support and infrastructure to make Samza grow," said Chris Riccomini, Vice President of Apache Samza.

Apache Samza is a distributed stream processing framework, designed to handle fault tolerance, stateful processing, message durability, and scalability. Samza helps users to write light-weight processors that consume streams of data from messaging systems such as Apache Kafka. These processors empower organizations to understand and react to their data in real-time. In addition, Samza uses Apache Hadoop YARN to provide fault tolerance, processor isolation, security, and resource management.

Samza represents a different approach to stream processing. It has been purpose-built first and foremost as a production-grade system with operability and scalability in mind. Samza integrates tightly with Apache Kafka, which makes it a natural fit to those already running Kafka in their data pipeline. The framework also introduces the concept of stateful processing and aggregation as a first-class feature. Stateful processing gives Samza developers a completely new paradigm for aggregating stream data. These features help organizations do high performance stream processing at scale.

Created to process tracking data, service log data, and for data ingestion pipelines for realtime services, Samza originated at LinkedIn, and was submitted to the Apache Incubator in July 2013. 

"LinkedIn is thrilled to see Apache Samza experience such strong adoption and now graduate to a Top-Level Project. Samza was developed to help solve some of LinkedIn's  toughest stream processing challenges and has become a central piece of our infrastructure," said Kevin Scott, Senior Vice President of Engineering and Operations at LinkedIn.

Apache Samza is used in an array of industries, applications, and organizations, including:
  • DoubleDutch, developers of mobile apps for events and conferences, uses Samza to power their analytics platform and stream data live into an event dashboard for real-time insights;
  • Forstcales' Big Data security analytics solutions use Samza to processes security events log as part of the data ingestion pipelines and on-line machine learning models creation process;
  • Happy Pancake, Northern Europe's largest internet dating service, uses Samza for all event handlers and data replication;
  • Advertising technology provider Improve Digital uses Samza as the foundation of a realtime processing capability performing data analytics and as the basis for an alerting system;
  • Jack Henry & Associates uses Samza to process user activity data across its Banno suite of products for financial institutions;
  • MobileAware uses Samza as a foundation for two mobile network products: real time analytics and multi channel notification (push, text message and HTML5);
  • Technology startup Project Florida uses Samza for real-time monitoring of data streams from wearable sensors, for preventative healthcare purposes;
  • Quantiply, providers of Cloud-based micro-applications, uses Samza to bring together user event, system performance, and business operational data for real-time visibility and decision support; and
  • Social media business intelligence solution VinTank uses Samza to power their analysis and natural language processing (NLP) pipeline.


"We've had great experiences with Samza at Improve Digital where it has enabled us to  build out our streaming data platform," said Garry Turkington, CTO of Improve Digital. "It's fantastic to see it graduate to a top-level project."

Jay Kreps, CEO of Confluent, said "Samza is a fantastic piece of infrastructure, and a great complement to Apache Kafka. We at Confluent are really excited to see it added as a top-level Apache project."

"Fortscale has been using Apache Samza successfully to build online machine learning algorithms and detect insider threats," said Dotan Patrich, Software Architect at Fortscale. "It's been a great experience building large scale streaming solution and using Samza's and enjoying it's unique state management architecture. It's fantastic to see it graduate to a Top-Level Project."

"I've been involved in Apache Samza's community since its inception. It's been thrilling to watch the community grow, and I'm very proud and excited to see that the project is graduating. Samza has a bright future, and I'm looking forward to what's to come," added Riccomini.

Availability and Oversight
As with all Apache products, Apache Samza software is released under the Apache License v2.0, and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. For documentation and ways to become involved with Apache Samza, visit http://samza.apache.org/ and @SamzaStream on Twitter

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than 350 leading Open Source projects, including Apache HTTP Server --the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 500 individual Members and 4,500 Committers successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Budget Direct, Cerner, Citrix, Cloudera, Comcast, Facebook, Google, Hortonworks, HP, Huawei, IBM, InMotion Hosting, iSigma, Matt Mullenweg, Microsoft, Pivotal, Produban, WANdisco, and Yahoo. For more information, visit http://www.apache.org/ or follow https://twitter.com/TheASF.

© The Apache Software Foundation. "Apache", "Apache Samza", "Samza", "Apache Hadoop", "Hadoop", "Hadoop YARN", "Apache Kafka", "Kafka", "ApacheCon", and the Apache Samza logo are trademarks of The Apache Software Foundation. All other brands and trademarks are the property of their respective owners.

# # #

The Apache Software Foundation Announces Apache™ BookKeeper™ as a Top-Level Project

Open Source distributed Big Data logging service and publish/subscribe system used to reliably log streams of records

Forest Hill, MD –27 January 2015– The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 350 Open Source projects and initiatives, announced today that Apache™ BookKeeper™ has graduated to become a Top-Level Project (TLP), signifying that the project's community and products have been well-governed under the ASF's meritocratic process and principles.

Apache BookKeeper was established in 2011 as a sub-project of Apache ZooKeeper™ (Open Source API for highly reliable distributed coordination) to reliably log streams of records. It serves as a building block for reliable system consistency and recovery, and can be used to turn any standalone service into a highly available replicated service.

With disk/server failure rates up to 10% annually, replication is a must in today's always-on Cloud and Big Data services. One way to build a replicated service is to ensure that all write operations to the service are copied to all replicas; Apache BookKeeper's replicated logging service is well suited for this purpose. A database may have two replicas to ensure availability: if one crashes, the other can continue to serve traffic. However, ensuring that the data in these two replicas is consistent is not an easy problem to solve. Unlike naive solutions that run into problems like deadlock and inconsistency when one or both of the replicas fail, BookKeeper uses a combination of quorum writes, fencing, and, when necessary, outsourcing of consensus to ZooKeeper to ensure no state will be lost in the case of a replica failure. BookKeeper can similarly be applied to different classes of systems, such as messaging systems, filesystems and transaction processing systems.

Apache BookKeeper is highly available (no single point of failure), and scales horizontally as more storage nodes are added. BookKeeper is used in production in many web scale companies. At Yahoo, it is used as the persistence layer for its Cloud messaging infrastructure, which delivers tens of billions of messages in a day. BookKeeper is used at Twitter as the replicated persistence backend for different messaging use cases, and is also used by Huawei as a shared storage in their solution for HDFS Namenode High Availability. 

"We're very proud to have BookKeeper become a Top-Level Project. It is a testament to the hard work that my fellow committers have put in over the years that the ASF would give us their stamp of approval," said Ivan Kelly, Vice President of Apache BookKeeper. "We hope that the increased exposure will bring even more contributions and use cases to the community."

Availability and Oversight
As with all Apache products, Apache BookKeeper software is released under the Apache License v2.0, and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. For documentation and ways to become involved with Apache BookKeeper, visit http://bookkeeper.apache.org and https://twitter.com/asfbookkeeper

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than 350 leading Open Source projects, including Apache HTTP Server --the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 500 individual Members and 4,500 Committers successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Budget Direct, Citrix, Cloudera, Comcast, Facebook, Google, Hortonworks, HP, Huawei, IBM, InMotion Hosting, Matt Mullenweg, Microsoft, Pivotal, Produban, WANdisco, and Yahoo. For more information, visit http://www.apache.org/ or follow @TheASF on Twitter.

© The Apache Software Foundation. "Apache", "Apache BookKeeper", "BookKeeper", ApacheCon", and the Apache BookKeeper logo are trademarks of The Apache Software Foundation. All other brands and trademarks are the property of their respective owners.

# # #

Friday Jan 23, 2015

The Apache News Round-up: week ending 23 January 2015

This week's highlights from The Apache Software Foundation's 350+ projects and initiatives include:

ASF Legal Affairs Committee –responsible for establishing and managing legal policies based on the advice of legal counsel and the interests of the Foundation.
 - The Apache Software Foundation subpoenaed regarding Patent Claim https://blogs.apache.org/foundation/entry/the_apache_software_foundation_subpoenaed1

FINAL CALL for ApacheCon™ –the official conference series of The Apache Software Foundation
 - CFP closes on 1 February http://events.linuxfoundation.org/events/apachecon-north-america/program/cfp
 - Travel Assistance applications close on 6 February http://www.apache.org/travel/

Apache Bookkeeper™ – distributed logging service called BookKeeper and a distributed publish/subscribe system build on top of BookKeeper called Hedwig.
 - Apache BookKeeper 4.2.4 released http://bookkeeper.apache.org/releases.html

Apache Directory™ LDAP client API –an ongoing effort to provide an enhanced LDAP API, as a replacement for JNDI and the existing LDAP API (jLdap and Mozilla LDAP API) – provides the building blocks for both client side validation and server side data validation
 - Apache Directory LDAP API 1.0.0-M28 released http://directory.apache.org/api/downloads.html

Apache Falcon™ –data processing and management solution for Apache Hadoop™, designed for data motion, coordination of data pipelines, lifecycle management, and data discovery.
 - The Apache Software Foundation Announces Apache Falcon as a Top-Level Project http://s.apache.org/GT2

Apache Flink™ –a system for distributed batch and real-time streaming data analysis that offers familiar collection-based programming APIs in Java and Scala
 - Apache Flink 0.8.0 released http://flink.apache.org/downloads.html

Apache HttpComponents™ Client for Android –can be deployed on Google Android in parallel to the outdated version shipped with platform while remaining partially API compatible with Apache HttpClient 4.3.
 - HttpComponents Client for Android 4.3.5.1 released http://hc.apache.org/downloads.cgi

Apache Tomcat™ –Open Source software implementation of the Java Servlet, JavaServer Pages, Java Unified Expression Language and Java WebSocket technologies.
 - Apache Tomcat 8.0.17 available http://tomcat.apache.org/download-80.cgi

Apache Traffic Server™ –fast, scalable and extensible HTTP/1.1 compliant caching proxy server; can be used as a reverse, forward or even transparent HTTP proxy.
 - Apache Traffic Server 5.2.0 released http://trafficserver.apache.org/downloads

Apache Incbuator™ –the entry path into The Apache Software Foundation (ASF) for projects and codebases wishing to become part of the Foundation's efforts. All code donations from external organisations and existing external projects wishing to join Apache enter through the Incubator.
 - OpenAz and TinkerPop accepted as new podlings this month http://incubator.apache.org/projects/index.html

Are your software solutions Powered by Apache?
 - Download & use our "Powered By" logos today! http://www.apache.org/foundation/press/kit/#poweredby

= = =

For real-time updates, sign up for Apache-related news at announce@apache.org and follow @TheASF on Twitter. For a broader spectrum from the Apache community,https://twitter.com/PlanetApache provides an aggregate of both Project activities and the personal blogs of select ASF Committers.

# # #

Calendar

Search

Hot Blogs (today's hits)

Tag Cloud

Categories

Feeds

Links

Navigation