Entries tagged [cassandra]

Thursday September 11, 2014

The Apache Software Foundation Announces Apache™ Cassandra™ v2.1

Highest performing "NoSQL" distributed Big Data database now faster, with improved analytics and ease-of-use.

Forest Hill, MD –11 September 2014– The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 200 Open Source projects and initiatives, today announced at the Cassandra Summit the availability of Apache™ Cassandra™ v2.1, the highly-performant Big Data distributed database.

"Every release reinforces why Cassandra is the database of choice for growing enterprises," said Jonathan Ellis, Vice President of Apache Cassandra and CTO of DataStax. "With 2.1 delivering over 50% better performance over 2.0's already-strong numbers, Cassandra 2.1 lets our users continuously improve their engagement with their customers at the highest speeds to date."

Apache Cassandra is an Open Source, "NoSQL" distributed database management system designed to handle large amounts of data across many commodity servers quickly and reliably, whether running in the Cloud or in a hybrid data store. Apache Cassandra has consistently led the NoSQL market in performance: its fully-distributed architecture provides unparalleled fault tolerance to ensure applications will not go offline, and its linear scalability allows them to reach massive sizes while successfully handling thousands of requests per second with no single point of failure. In addition, v2.1 is the first Cassandra release suitable for production use on Windows.

Apache Cassandra powers hundreds of applications across dozens of industries that demand high performance at scale, and is in use at Adobe, Comcast, eBay, Eventbrite, GE, GoDaddy, HP, IBM, Intuit, Netflix, Pearson, Safeway, Sky, Sony, Spotify, Travelocity, The Weather Channel, and Zoosk, among others.

"Cassandra 2.0 introduced critical features and functions that that let us build out a real-time analytics engine," said Brian O'Neill, CTO at Health Market Science. "The 2.1 release tightens the nuts and bolts and drops nitro into that engine to make those analytics blazing fast."

"Cassandra 2.1 represents an important milestone," said Duyhai Doan, Cassandra Expert at Orange. "For us developers, the biggest game changer in 2.1 is the introduction of CQL3 tuple and user defined type (UDT) as both pave the way for new data model patterns and usage. In addition, the ease of use core value for Cassandra is fulfilled in this release."

Learn More Today at Cassandra Summit 2014
Leading-edge companies including Sony, ING, Target, Google, Credit-Suisse, Microsoft, and Instagram will share how Apache Cassandra has transformed business and accelerated growth at the fifth annual Cassandra Summit hosted by DataStax. Following Cassandra Summit, the inaugural Cassandra Boot Camp workshops will take place on 12-13 September. For more information and to register, visit http://planetcassandra.org/events/san-francisco-cassandra-summit-2014/

Availability and Oversight
As with all Apache products, Apache Cassandra software is released under the Apache License v2.0, and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. For documentation and ways to become involved with Apache Cassandra, visit http://cassandra.apache.org/

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than two hundred leading Open Source projects, including Apache HTTP Server --the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 450 individual Members and 4,000 Committers successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Budget Direct, Citrix, Cloudera, Comcast, Facebook, Google, Hortonworks, HP, Huawei, IBM, InMotion Hosting, Matt Mullenweg, Microsoft, Pivotal, Produban, WANdisco, and Yahoo. For more information, visit http://www.apache.org/ or follow @TheASF on Twitter.

"Apache", "Apache Cassandra", "Cassandra", and "ApacheCon" are trademarks of The Apache Software Foundation. All other brands and trademarks are the property of their respective owners.

# # #

Wednesday April 09, 2014

The Apache Software Foundation Announces the 5th Anniversary of Apache™ Cassandra™

Highest performing "NoSQL" distributed Big Data database in use at hundreds of organizations including Adobe, CERN, Comcast, Disney, eBay, GE, GitHub, GoDaddy, HP, Hulu, IBM, Instagram, Intuit, Netflix, Plaxo, Polyvore, Sony, and The Weather Channel

Forest Hill, MD –08 April 2014–
The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of more than 170 Open Source projects and initiatives, announced today the 5th Anniversary of Apache™ Cassandra™, the highly-performant Big Data distributed database.

"I am so proud to see what the Apache Cassandra community has been able to achieve in five short years," said Jonathan Ellis, Vice President of Apache Cassandra and DataStax CTO. "We've come such a long way since the early days, and it is a testament to Cassandra's rapid maturation that it has been deployed in over 1,500 global critical production environments."

Apache Cassandra is an Open Source, "NoSQL" distributed database management system designed to handle large amounts of data across many commodity servers quickly and reliably without compromising performance, whether running in the Cloud or in a hybrid data store. Cassandra offers robust support for clusters spanning multiple datacenters, and provides high availability with no single point of failure.

Originally developed at Facebook in 2008 to power their Inbox Search feature, Cassandra entered the Apache Incubator in 2009 and graduated as an Apache Top-level Project in February 2010.

Apache Cassandra has consistently led the NoSQL market in performance: its fully-distributed architecture provides unparalleled fault tolerance to ensure applications will not go offline, and its linear scalability allows them to reach massive sizes while successfully handling thousands of requests per second.

Cassandra Evolution 5 Years On: Under the Hood
During the past five years, Apache Cassandra has resolved over 6,000 JIRA issues and added more than 250 contributors, making today’s Cassandra significantly more performant, resilient, feature-complete, and easier to both operate and develop against. Some of the milestones along the way include:

The Cassandra Query Language, which offers a more intuitive data model and a performant native protocol while retaining backwards compatibility with data created under the old Apache Thrift API;
Lightweight transactions, an industry first that allows users and applications to opt into a linearly consistent world view as necessary;
An innovative virtual node design that allows expanding a cluster in increments as small as a single machine, and across heterogeneous hardware;
A powerful log-structured storage engine featuring advanced compaction, compression, and SSD support;
Thousands of enhancements from running the world's most demanding applications at scale, informing better performance, better drivers, and better management tools.


Improved Performance in Real-world Situations
Apache Cassandra powers hundreds of applications across dozens of industries that demand high performance at scale. By addressing the needs of different workloads, Cassandra has evolved beyond its initial niche in social media into a truly general purpose solution.

Apache Cassandra is used by many highly-visible organizations including: Adobe, Comcast, Disney, eBay, Eventbrite, GE, GoDaddy, HP, IBM, Instagram, Intuit, Netflix, Pearson, Safeway, Sky, Sony, Spotify, Travelocity, The Weather Channel, and Zoosk, among others. Additional organizations using Apache Cassandra can be found at http://planetcassandra.org/companies/

"We knew Apache Cassandra could perform linear scaling of reads and writes with consistent performance."
--David Weinstein, Director of Software Development, Adobe

"Before adopting Cassandra, we could not monitor every malicious site and IP forever – the data volumes were just too great. No other database was ready for what we needed to do."
--Michael Kjellman, Software Engineer, Barracuda Networks

"Apache Cassandra provides us with an easy to use backend and lets us focus on our implementation and features."
--Andreas Wagner, Lead Developer, CumulusRDF at the Karlsruhe Institute of Technology (KIT)

"Apache Cassandra is critical for being able to look up historical behavior data quickly, so that we can do these model updates with low latency."
--Thomas Pinckney, Senior Director of Engineering, eBay

"Apache Cassandra provides us an easy way of providing a highly available store and not have to worry about things like sharding, multi-datacenter support and things like that. Cassandra is an amazing store. Some of the features that you get out of the box are pretty incredible. I think one of the reasons why Cassandra has taken off and is doing so well is because of its awesome community."
--Vipul Sharma, Director of Data Engineering, Eventbrite 

"Apache Cassandra embodies in its core the resilience and availability we need to continue serving our enterprise and internal customers even in the face of transient outages. Most of the time we forget about Cassandra and it keeps on running."
--Michael Rose, Senior Platform Engineer, FullContact

 "The Google Cloud Platform was able to sustain one million Cassandra writes per second at a cost of $0.05 USD per million writes."
--Ivan Santa Maria Filho, Performance Engineering Lead, Google Cloud Performance Team

"Apache Cassandra offers good performance, near linear scalability for our data model, and geo-replication all with minimal maintenance requirements."
--Andres Rangel, Senior Software Engineer, Hulu

"We’re helping the world save water, which is a very laudable target for an innovative and commercial business. We’re using quite a lot of innovative technology to help us do that, of which Cassandra plays a major role."
--Mike Williams, Software Director, i2O Water

"At Instagram we've either replaced or are replacing every use case of Redis with Apache Cassandra. Its operational robustness and ability to exploit solid-state disks are the primary drivers for these efforts. We've also used Cassandra as part of major new feature efforts for its ability to scale up, scale down, and transparently work in a multi-region environment."
--Rick Branson, Infrastructure Software Engineer, Instagram

"The data was stored originally in Oracle in such a way that it does not scale at all … We chose Cassandra because of its ability to scale easily, the operational simplicity, but also because it met our requirements."
--DuyHai Doan, Senior Developer, Libon, a Division of Orange

"We absolutely love the data model and scalability that Cassandra offers us. Coupled with ease of use and the ability to get it up and running quickly in development means that we can experiment quickly and have code flow from development into production in a short amount of time."
--Hisham Mardam Bey, CTO, Mate1

"During Hurricane Sandy, we lost an entire data center. Completely. Lost. It. Our application fail-over resulted in us losing just a few moments of serving requests for a particular region of the country, but our data in Cassandra never went offline."
--Nathan Milford, US Operations Manager, Outbrain

"Portugal Telecom is now expanding to Brazil, which is a market that is roughly 25 times larger than ours, so we believe with the help of Cassandra we’ll store huge loads of information."
--Ivo Jesus, Tech Lead, Portugal Telecom

"We are the number one source for weather on all platforms: TV, web, mobile, and API.  Our digital group provides backend services across all our platforms —billions of requests per day— and is responsible for delivering scheduled and severe weather alerts. We make extensive use of Cassandra under the hood for our high-throughput use cases, and we support nearly every imaginable type of content.  We get about 100M transactions per day on average against our busiest Cassandra-backed service, with a heavy day seeing more like 180-200M transactions. Apache Cassandra has never failed us."
--Robbie Strickland, Software Development Manager, The Weather Channel

Availability and Oversight
As with all Apache products, Apache Cassandra software is released under the Apache License v2.0, and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. For documentation and ways to become involved with Apache Cassandra, visit http://cassandra.apache.org/

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than one hundred and seventy leading Open Source projects, including Apache HTTP Server --the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 400 individual Members and 3,500 Committers successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including Budget Direct, Citrix, Cloudera, Comcast, Facebook, Google, Hortonworks, HP, Huawei, IBM, InMotion Hosting, Matt Mullenweg, Microsoft, Pivotal, Produban, WANdisco, and Yahoo. For more information, visit http://www.apache.org/ or follow @TheASF on Twitter.

"Apache", "Apache Cassandra", "Cassandra", and "ApacheCon" are trademarks of The Apache Software Foundation. All other brands and trademarks are the property of their respective owners.

# # #

Wednesday September 04, 2013

The Apache Software Foundation Announces Apache™ Cassandra™ v2.0

Highly-performant, Open Source Big Data distributed database in use at Adobe, CERN, Comcast, eBay, GoDaddy, HP, IBM, Instagram, Netflix, Plaxo, and Sony, among others, to create modern, data-driven applications

Forest Hill, MD –4 September 2013– The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of nearly 150 Open Source projects and initiatives, today announced Apache Cassandra v2.0, the latest version of the highly-scalable, Big Data distributed database.

Apache Cassandra powers massive data sets quickly and reliably without compromising performance, whether running in the Cloud or partially on-premise in a hybrid data store. Its fully distributed architecture provides unparalleled fault tolerance to ensure applications will not go offline, and its linear scalability allows them to reach massive sizes while successfully handling thousands of requests per second.

"In five years, Apache Cassandra has grown into one of the most widely used NoSQL databases in the world and serves as the backbone for some of today’s most popular applications," said Jonathan Ellis, Vice President of Apache Cassandra.

Under the Hood

New features in Apache Cassandra v2.0 include lightweight transactions, triggers, and CQL (Cassandra Query Language) enhancements that increase productivity in creating modern, data-driven applications.

"Cassandra 2.0 makes it easier than ever for developers to migrate from relational databases and become productive quickly," added Ellis.

New features and improvements include:

  • Lightweight transactions allow ensuring operation linearizability similar to the serializable isolation level offered by relational databases, which prevents conflicts during concurrent requests
  • Triggers, which enable pushing performance-critical code close to the data it deals with, and simplify integration with event-driven frameworks like Storm
  • CQL enhancements such as cursors and improved index support
  • Improved compaction, keeping read performance from deteriorating under heavy write load
  • Eager retries to avoid query timeouts by sending redundant requests to other replicas if too much time elapses on the original request
  • Custom Thrift server implementation based on LMAX Disruptor that achieves lower message processing latencies and better throughput with flexible buffer allocation strategies
Strong Community Engagement and Adoption

The Apache Cassandra developer community includes some of the brightest minds in Big Data. Hundreds of organizations, from startups to large-scale enterprises such as Adobe, Cisco and IBM, rely on Cassandra to power their mission-critical applications online.

"At Ooyala, we're building some of our most ambitious projects to date on top of Apache Cassandra," said Al Tobey, Tech Lead, Compute and Data Services, Ooyala. "The maturation of CQL3, vnodes, and new features such as the PAXOS-backed compare-and-set (CAS) added in Cassandra 2.0 will help us build and deploy those projects confidently."

Apache Cassandra is used by many highly-visible organizations such as Accenture, CERN, Cloudkick, Comcast, Constant Contact, Dell, Digg, Ericsson, Eventbrite, GoDaddy, Houghton Mifflin Harcourt, HP, Instagram, Intuit, Mahalo, Microsoft MetricsHub, Morningstar, NASA, Netflix, Nextag, OpenWave, PBS Kids, Pitney Bowes, Plaxo, Polyvore, Real Networks, Reddit, Sony Network Entertainment, SoundCloud, Spotify, Squidoo, Stormpath, Symantec, Twitter, Wildfire, WSO2, and ZoomInfo. A listing of where Apache Cassandra is used and deployment details can be found at http://planetcassandra.org/Company/ViewCompany?IndustryId=-1

"We are excited about the future Apache Cassandra 2.0 makes possible. Paying down a lot of the technical debt accumulated over 5 years of intense Open Source development, and solidifying the Native Binary Transport for CQL 3, has put the project on a great footing," said Aaron Morton, Apache Cassandra committer and Co-Founder & Principal Consultant of The Last Pickle. "The addition of Lightweight 'Compare-and-Set' Transactions and Cursors brings another set of features that make it easier for developers to harness the performance and scale of Cassandra. And the experimental Trigger support will allow Open Source contributors to provide feedback for this often requested feature."

"It'll be really helpful to have conditional updates built into Cassandra," explained Jon Haddad, Senior Architect at Shift. "Right now there's a few places where we have to use external locking to manage isolation, and having built in support in the database will be amazing."

"We feel this release will delight existing users and tempt those on the sidelines," added Morton.

Availability and Oversight
As with all Apache products, Apache Cassandra v2.0 is released under the Apache License v2.0, and is overseen by a self-selected team of active contributors to the project. A Project Management Committee (PMC) guides the Project's day-to-day operations, including community development and product releases. Apache Cassandra source code, documentation, and related resources are available at http://cassandra.apache.org/.

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees nearly one hundred fifty leading Open Source projects, including Apache HTTP Server --the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 400 individual Members and 3,500 Committers successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(c)(3) charitable organization, funded by individual donations and corporate sponsors including AMD, Basis Technology, Budget Direct, Citrix, Cloudera, Comcast, Facebook, Go Daddy, Google, HP, Hortonworks, Huawei, IBM, InMotion Hosting, Matt Mullenweg, Microsoft, PSW Group, Pivotal, WANdisco, and Yahoo!. For more information, visit http://www.apache.org/ or follow @TheASF on Twitter.

"Apache", "Cassandra", "Apache Cassandra", and "ApacheCon" are trademarks of The Apache Software Foundation. All other brands and trademarks are the property of their respective owners.

# # #

Contact:
Sally Khudairi
Vice President
The Apache Software Foundation
pressATapacheDOTorg
+1 617 921 8656

</o:p>

Tuesday April 24, 2012

The Apache Software Foundation Announces Apache Cassandra™ v1.1

High-performance, fault-tolerant Open Source "Big Data"  powerhouse scales petabytes of data at Adobe, Cisco, IBM, Morningstar, Netflix, PBS, Rackspace, Twitter, US Government, and more. 

Forest Hill, MD –24 April 2012– The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of nearly 150 Open Source projects and initiatives, today announced Apache Cassandra v1.1. The highly-scalable, distributed database quickly and reliably handles massive data sets across community machines, large server clusters, and data centers without compromising performance –whether running in the Cloud or partially on-premise in a hybrid data store.

"Apache Cassandra is the leading scalable NoSQL database in terms of production installations --the 1.0 release was a huge milestone," said Jonathan Ellis, Vice President of Apache Cassandra. "v1.1 improves on that foundation with many features and enhancements that developers and administrators have been asking for."

Apache Cassandra is rapidly garnering accolades as a best of breed "NoSQL" solution for its ease of use, powerful data model, enterprise-grade reliability, tunable performance, and incremental scalability with no single point of failure. Super-efficient, Cassandra accommodates high query volumes at exceptional speed (sub-millisecond writes) with low latency, and handles petabytes of data across formats and applications in real time.

Apache Cassandra v1.1 features improved caching, revised query language (CQL –Cassandra Query Language– a subset of SQL), storage control, schema/structure, Hadoop integration/output, data directory control, and scalability.

Successfully handling thousands of requests per second, Apache Cassandra is deployed at Adobe, Appscale, Appssavvy, Backupify, Cisco, Clearspring, Cloudtalk, Constant Contact, Digg, Digital River, Formspring, IBM, Mahalo.com, Morningstar, Netflix, Openwave, OpenX, Palantir, PBS, Plaxo, Rackspace, Reddit, RockYou, Shazam, SimpleGeo, Spotify, Twitter, Urban Airship, US Government, Walmart Labs, Yakaz, and more.

The largest Cassandra production cluster to date exceeds 300 terabytes of data over 400 machines.

"The v1.1 release shows how rapidly Apache Cassandra has matured. The focus has clearly shifted to usability which is the sign of a solid system. I look forward to getting it into production right away," said Patrick McFadin, Chief Architect of Hobsons. "With features like Row-level isolation and Composite keys, Apache Cassandra v1.1 is really addressing user driven needs with innovative solutions. Well done to all contributors for making this a great release."

Jeffrey Abbruzzi, Director of eCommerce Engineering and Operations at Williams-Sonoma, added, "In Apache Cassandra, the Williams-Sonoma, Inc. eCommerce team has found a platform that addresses fundamental challenges we faced in modernizing the foundation of our gift registry systems: the need for top-flight performance, fast and reliable replication among geographically distributed nodes, and lack of any single point of failure. We're excited about the release and how it can help us continue to evolve the world-class websites of our Williams-Sonoma, Pottery Barn, PBTeen, Pottery Barn Kids, and West Elm brands."

"We applaud the continued hard work of all of the contributors to the Apache Cassandra project, who continue to make Cassandra the most scalable, easy-to-use, high-performance NoSQL solution available," said Robin Schumacher, VP of Products for DataStax. "We look forward to including Cassandra v1.1, with all the improvements it brings, into the next version of our DataStax Community Edition, as well as upcoming versions of our DataStax Enterprise Edition."

Cassandra entered the Apache Incubator in 2009, and graduated as an Apache Top-Level Project (TLP) in February 2010. Apache Cassandra v1.0 was released in October 2011.

Availability and Oversight
Apache Cassandra software is released under the Apache License v2.0, and is overseen by a Project Management Committee (PMC) that guides the Project's day-to-day operations, community development, and product releases. Apache Cassandra source code, downloads, documentation, mailing lists, and related resources are available at http://cassandra.apache.org/.

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees nearly one hundred fifty leading Open Source projects, including Apache HTTP Server — the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 350 individual Members and 3,000 Committers successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(3)(c) not-for-profit charity, funded by individual donations and corporate sponsors including AMD, Basis Technology, Citrix, Cloudera, Facebook, GoDaddy, Google, IBM, HP, Hortonworks, Huawei, Matt Mullenweg, Microsoft, PSW Group, SpringSource, and Yahoo!. For more information, visit http://www.apache.org/.

"Apache", "Apache Cassandra", and "ApacheCon" are trademarks of The Apache Software Foundation. All other brands and trademarks are the property of their respective owners.

#  #  #

Tuesday October 18, 2011

The Apache Software Foundation Announces Apache Cassandra™ v1.0

Highly-scalable, Open Source "NoSQL" Distributed Database Handles Massive Workloads for Cisco, Constant Contact, DataStax, Digg, IBM, Netflix, Rackspace, Twitter, Walmart Labs, and more.

Forest Hill, MD – 18 October 2011 – The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of nearly 150 Open Source projects and initiatives, today announced Apache Cassandra™ v1.0. The highly-scalable, distributed 
"NoSQL" database plays a key role in Cloud computing by quickly handling massive workloads in real time with minimal disruption to services or systems.

"Dealing with very large amounts of data in realtime is a must for most businesses today," said Jonathan Ellis, Vice President of Apache Cassandra. "Cassandra accommodates high query volumes, provides enterprise-grade reliability, and scales easily to meet future growth requirements – while using fewer resources than traditional solutions."

Apache Cassandra is successfully used by large scale organizations such as Cisco, Cloudkick, Digg, Rackspace, Reddit, Twitter, and Walmart Labs to affordably process massive data sets in real-time across large server clusters. The largest Cassandra production cluster to date exceeds 300 terabytes of data over 400 machines.

"As the most-widely deployed mobile rich media advertising platform, Medialets uses Apache Cassandra™ for handling time series based logging from our production operations infrastructure," said Joe Stein, Chief Architect of Medialets. "We store contiguous counts for data points for each second, minute, hour, day, month so we can review trends over time as well as the current real time set of information for tens of thousands of data points. Cassandra makes it possible for us to manage this intensive data set and the release of 1.0 makes it that much easier."

Deployed across an array of applications, from barcode scanning and geospatial databases to storing user account information and activity logs, Apache Cassandra is easily scalable, efficient, and  performant, typically handling over 5,000 requests per second per core. Innovative uses of Apache Cassandra include:

  • AppScale – back-end for Google App Engine applications
  • Clearspring – tracking URL sharing and serving over 200 million daily view requests
  • Cloudtalk –  creating messaging applications
  • Constant Contact –  powering social media marketing applications
  • Formspring – counting/storing social graph data for 26 million accounts with 10 million daily responses
  • Mahalo.com – recording user Q & A activity logs and topics 
  • Netflix – streaming services back-end database
  • Openwave – distributed storage mechanism for next generation messaging platform
  • OpenX – storing and replicating advertisements and targeting data for ad delivery over 130 nodes
  • Plaxo – analyzing 3 billion contacts against public data sources and identifying 600 million unique contacts
  • RockYou – recording every single click in real time for 50 million online gaming users
  • Urban Airship – mobile service hosting for over 160 million application installs across 80 million unique devices 
  • Yakaz – storing millions of images and social data


Matthew Conway, CTO of Backupify said, "Apache Cassandra™ makes it possible for us to build a business around really high write loads in a scalable fashion without having to build and operate our own sharding layer. The release of Cassandra 1.0 is an exciting milestone for the project and we look forward to exploring the new features and performance enhancements."

"We utilize Apache Cassandra™ to deliver DataStax Enterprise, a distributed data platform that makes it easy for customers to build, deploy, and operate elastically scalable on-premise and cloud-optimized applications," explained Billy Bosworth, CEO of DataStax. "We chose Cassandra to power this platform because of it's real-time scalability, operational simplicity, and above all, its active community of dedicated developers. Version 1.0 is the culmination of their efforts and we look forward to seeing Cassandra 1.0 power our customers applications."

Originally developed at Facebook in 2008, Cassandra entered the Apache Incubator in 2009, and graduated as an Apache Top-Level Project (TLP) in February 2010. Apache Cassandra v1.0 will be featured in the "Data Handling & Analytics" track at ApacheCon, 7-11 November 2011, in Vancouver, Canada. To register, visit http://apachecon.com/.

Availability and Oversight
Apache Cassandra software is released under the Apache License v2.0, and is overseen by a Project Management Committee (PMC) that guides the Project's day-to-day operations, community development, and product releases. Apache Cassandra source code, downloads, documentation, mailing lists, and related resources are available at http://cassandra.apache.org/.

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees nearly one hundred fifty leading Open Source projects, including Apache HTTP Server -- the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 350 individual Members and 3,000 Committers successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is a US 501(3)(c) not-for-profit charity, funded by individual donations and corporate sponsors including AMD, Basis Technology, Cloudera, Facebook, Google, HP, Hortonworks, IBM, Matt Mullenweg, Microsoft, PSW Group, SpringSource/VMware, and Yahoo!. For more information, visit http://www.apache.org/.

"Apache", "Apache Cassandra", and "ApacheCon" are trademarks of The Apache Software Foundation. All other brands and trademarks are the property of their respective owners.

# # #

Tuesday January 11, 2011

The Apache Software Foundation Announces Apache Cassandra 0.7


Highly-scalable Open Source Distributed Database for Handling Large Amounts of Data is a Key Component in Cloud Computing

Forest Hill, MD – 11 January 2011 – The Apache Software Foundation (ASF), the all-volunteer developers, stewards, and incubators of nearly 150 Open Source projects and initiatives, today announced Apache Cassandra v0.7, the highly-scalable, second generation Open Source distributed database.

"Apache Cassandra is a key component in cloud computing and other applications that deal with massive amounts of data and high query volumes," said Jonathan Ellis, Vice President of Apache Cassandra. "It is particularly successful in powering large web sites with sharp growth rates."

Apache Cassandra is successfully deployed at organizations with active data sets and large server clusters, including Cisco, Cloudkick, Digg, Facebook, Rackspace, and Twitter. The largest Cassandra cluster to date contains over 400 machines.

"Running any large website is a constant race between scaling your user base and scaling your infrastructure to support it," said David King, Lead Developer at Reddit. "Our traffic more than tripled this year, and the transparent scalability afforded to us by Apache Cassandra is in large part what allowed us to do it on our limited resources. Cassandra v0.7 represents the real-life operations lessons learned from installations like ours and provides further features like column expiration that allow us to scale even more of our infrastructure."

Among the new features in Apache Cassandra v0.7 are:

- Secondary Indexes, an expressive, efficient way to query data through node-local storage on the client side;

- Large Row Support, up to two billion columns per row;

- Online Schema Changes – automated online schema changes from the client API allow adding and modifying object definitions without requiring a cluster restart.

Oversight and Availability
Apache Cassandra is available under the Apache Software License v2.0, and is overseen by a Project Management Committee (PMC), who guide its day-to-day operations, including community development and product releases.

Apache Cassandra v0.7 downloads, documentation, and related resources are available at http://cassandra.apache.org/.

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees nearly one hundred fifty leading Open Source projects, including Apache HTTP Server — the world's most popular Web server software. Through the ASF's meritocratic process known as "The Apache Way," more than 300 individual Members and 2,500 Committers successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation's official user conference, trainings, and expo. The ASF is funded by individual donations and corporate sponsors including AMD, Basis Technology, Cloudera, Facebook, Google, IBM, HP, Matt Mullenweg, Microsoft, SpringSource, and Yahoo!. For more information, visit http://www.apache.org/.

# # #

Tuesday May 04, 2010

The Apache Software Foundation Announces New Top-Level Projects

Record Number of Projects Launched via Apache Incubator and Current Initiatives

FOREST HILL, MD – 4 May, 2010 – The Apache Software Foundation (ASF) –-the all-volunteer developers, stewards, and incubators of 143 Open Source projects and initiatives-- today announced the creation of six new Top-Level Projects (TLPs), setting an all-time record of the most new TLPs launched in a single month.

A Top-Level Project signifies that a Project's community and products have been well-governed under the ASF's meritocratic, consensus-driven process and principles. Whilst a project is developing within the Apache Incubator or as a sub-project of an existing TLP, it benefits from hands-on mentoring from other Apache contributors, as well as the Foundation’s widely-emulated process, stewardship, outreach, support, and community events.

"Becoming a Top-Level Project is a vote of confidence from the Foundation at-large, demonstrating a project has proven its ability to be properly self-governed," said ASF Chairman Jim Jagielski. "We are proud of our Committers' dedication in building robust communities under the ASF process known as 'The Apache Way'."

All Apache Projects are overseen by a self-selected team of active contributors to the project. Upon a Project's maturity to a TLP, a Project Mangement Committee (PMC) is formed to oversee its day-to-day operations, including community development and product releases.

The six new TLPs include both a graduating project from the Apache Incubator as well as sub-projects of existing TLPs. They are:

Graduating from the Apache Incubator
- Apache Traffic Server is a richly-featured, fast, scalable, and extensible HTTP/1.1 compliant caching proxy server. Formerly a commercial product, Yahoo! submitted Traffic Server to the Apache Incubator in 2009. Traffic Server is widely recognized as an “edge” service in cloud computing; an example of its use is to serve static content such as images and JavaScript, CSS, and HTML files, and route requests for dynamic content to a Web server such as the Apache HTTP Server. Highly performant, Apache Traffic Server has been benchmarked to handle in excess of 75,000 requests per second (RPS), and is used in production in large-scale deployments such as Yahoo!, where it handles 400 terrabytes of traffic per day, and serves more than 30 billion objects daily across its various properties including the Yahoo! homepage, and its Sports, Mail, and Finance sites.

Former Sub-projects of Existing Top-Level Projects
- Apache Mahout provides scalable implementations of machine learning algorithms on top of Apache Hadoop and other technologies. It offers collaborative filtering, clustering, classification, feature reduction, data mining algorithms, and more. Begun as a sub-project of Lucene in 2008, Mahout's team of nearly a dozen contributors is now actively working towards release 0.4.

- Apache Tika is an embeddable, lightweight toolkit for content detection, and analysis. Powering by MIME standards from IANA, advanced language detection features and on the ability to rapidly unify existing parser libraries, Tika provides a one-stop shop for navigating the modern information landscape. Tika entered the Incubator in 2007 and graduated to a Lucene sub-project in 2008. Tika is used in a broad range of Lucene products ranging from Solr, to Nutch and Mahout and is in deployment at NASA, Day Software, the Internet Archive, and at a number of Web startups including Bixo labs.

- Apache Nutch is a highly-modular, Web searching engine based on Lucene Java with added Web-specifics, such as a crawler, a link-graph database, and parsers for HTML and other document formats. Its architecture allows developers to create plugins for media-type parsing, data retrieval, querying, clustering, and more. Following a successful 100 million page demo system, the project graduated the Apache Incubator in 2005 to become a sub-project of Apache Lucene.

- Apache Avro is a fast data serialization system that includes rich and dynamic schemas in all its processing. A sub-project of Apache Hadoop, Avro features rich data structures; a compact, fast, binary data format; a container file to store persistent data; remote procedure call (RPC); and simple integration with dynamic languages. Not only is code generation not required to read or write data files nor to use or implement RPC protocols, it is an optional optimization, only worth implementing for statically typed languages.

- Apache HBase is a distributed database modeled after Google's Bigtable. The project started at Powerset and became a sub-project of Apache Hadoop in 2007. Apache HBase adds random read/write access to the Hadoop stack, extending offline processing capabilities and enabling realtime serving of very large datasets. The project's goal is the hosting of big tables -- billions of rows X millions of columns -- running atop commodity hardware. HBase has been successfully deployed at Adobe, Flurry, Meetup, Mozilla, StumbleUpon, Trend Micro, and Twitter, among others, to perform analytics and as a datastore for live Websites.

Additional New Top-Level Projects Created in 2010
- Apache UIMA (Unstructured Information Management Architecture) is a framework for analyzing unstructured information, such as natural language text. It supports the writing, deployment and reuse of analysis components in a wide variety of settings. Created at IBM and submitted to the Apache Incubator in 2006, UIMA has been adopted as the de-facto enabling platform by a significant part of the natural language processing community. Apache UIMA graduated from the Apache Incubator in March 2010.

- Apache Cassandra is an advanced, second-generation “NoSQL” distributed data store that has a shared-nothing architecture. The Cassandra decentralized model provides massive scalability, and is highly available with no single point of failure even under the worst scenarios. Originally developed at Facebook and submitted to the ASF Incubator in 2009, the Project has added more than a half-dozen new committers, and is deployed by dozens of high-profile users such as Cisco WebEx, Cloudkick, Digg, Facebook, Rackspace, Reddit, and Twitter. Apache Cassandra graduated from the Apache Incubator in March 2010.

- Apache Subversion is a widely-used versioning control system. The project was initated at CollabNet in 2000 and was accepted into the Apache Incubator in 2009; many of the people who founded Subversion also actively contribute to various initiatives at the ASF. All of the ASF's projects use Subversion for source code version control, and Subversion itself relies on many Apache projects such as Apache Portable Runtime (APR) and HTTP Web Server. For nearly a decade, both communities have benefited from open feedback channels, where requirements from the Subversion project have helped drive new features to various Apache projects, and vice versa. Apache Subversion is used in Bounty Source, CodePlex, Django, ExtJS, Free BSD, FreePascal, GCC, Google Code, MediaWiki, Mono, PHP, Ruby, SourceForge, and Tigris.org, as well as numerous corporations. Apache Subversion graduated from the Apache Incubator in February 2010.

- Apache Click is a modern Java EE Web application framework that provides a natural, rich client style programming model. Apache Click's intuitive design makes it very easy to learn and use, with most developers getting up and running within a day. As opposed to traditional component oriented Web frameworks, Click is stateless by design although stateful pages are supported. Click exposes few abstractions to learn and understand; the Java Servlet API is fully exposed to the developer to ease the upgrade path from an action-based framework to a component-based one to alleviate developers from maintaining redundant markup. Apache Click entered the Apache Incubator in 2008 and graduated in February 2010.

- Apache Shindig is an OpenSocial container and helps you to start hosting OpenSocial apps quickly by providing the code to render gadgets, proxy requests, and handle REST and RPC requests. By providing a language-neutral infrastructure for those wishing to host OpenSocial applications on their Websites, Apache Shindig allows new sites to start hosting social apps in under an hour. Originally created as a port of Google's iGoogle gadget container for hosting OpenSocial compatible widgets in any Website, Shindig entered the Apache Incubator in 2007, and graduated in January 2010.

"The ASF has been at the center of innovation over the past 10 years, providing key pieces for much of the software and services we rely on every day," said RedMonk analyst Michael Coté. "These new Top-Level Projects are another example of that in action: projects that aim to help developers and organizations build the next round of useful applications."

Availability
All Apache products are released under the Apache Software License v2.0. Downloads, documentation, and related resources are available at http://www.apache.org/.

About the Apache Incubator and Incubation Process
The Apache Incubator is the entry path for projects and codebases wishing to become part of the efforts at The Apache Software Foundation. All code donations from external organisations and existing external projects wishing to join the ASF enter through the Incubator to: 1) ensure all donations are in accordance with the ASF legal standards; and 2) develop new communities that adhere to our guiding principles. Incubation is required of all newly accepted projects until a further review indicates that the infrastructure, communications, and decision making process have stabilized in a manner consistent with other successful ASF projects. While incubation status is not necessarily a reflection of the completeness or stability of the code, it does indicate that the project has yet to be fully endorsed by the ASF.

About The Apache Software Foundation (ASF)
Established in 1999, the all-volunteer Foundation oversees more than one hundred leading Open Source projects, including Apache HTTP Server — the world's most popular Web server software. Through The ASF's meritocratic process known as "The Apache Way," more than 300 individual Members and 2,300 Committers successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation’s official user conference, trainings, and expo. The ASF is funded by individual donations and corporate sponsors including Facebook, Google, HP, Microsoft, Progress Software, SpringSource/VMware, and Yahoo! For more information, visit http://www.apache.org/.

# # #

Tuesday April 13, 2010

The Apache Software Foundation Announces Apache Cassandra Release 0.6

Newest version of leading Open Source, NoSQL distributed database management system now available.

FOREST HILL, MD – 13 April, 2010 – The Apache Software Foundation (ASF) --developers, stewards, and incubators of 138 Open Source projects-- today announced Apache Cassandra version 0.6, the Project's latest release first since its graduation from the ASF Incubator in February 2010.

Apache Cassandra is an advanced, second-generation “NoSQL” distributed data store that has a shared-nothing architecture. The Cassandra decentralized model provides massive scalability, and is highly available with no single point of failure even under the worst scenarios.

"In response to the growing need for scalable, high-throughput databases, we are pleased to release Cassandra 0.6," said Jonathan Ellis, Apache Cassandra Project Management Committee Chair. "It's fantastic seeing the Project's community at the ASF grow to match the promise of the technology."

Originally developed at Facebook and submitted to the ASF Incubator in 2009, the Project has added more than a half-dozen new committers, and is deployed by dozens of high-profile users such as Cisco WebEx, Cloudkick, Digg, Facebook, Rackspace, Reddit, and Twitter, among others.

"The services we provide to customers are only as good the systems they are built on," said Eric Evans, Apache Cassandra committer and Systems Architect at The Rackspace Cloud. "With Cassandra, we get the fault-tolerance and availability our customers demand, and the scalability we need to make things work."

Cassandra 0.6 features include:

- Support for Apache Hadoop: this allows running analytics queries with the leading map/reduce framework against data in Cassandra.

"Digg is very excited to see Cassandra mature in the last year and graduate to a top-level Apache project. Cassandra is powering our next generation infrastructure, and allowing us to run in an environment that demands data access in datacenters around the world," said Chris Goffinet, Performance and Availability Architect at social news website Digg.

- Integrated row cache: this eliminates the need for a separate caching layer, thereby simplifying architectures.

Powering more than 10 billion pages, Twitter switched to Apache Cassandra because it can run on large server clusters and is capable of taking in very large amounts of data at a time. Storage Team Technical Lead Ryan King explained, "At Twitter, we're deploying Cassandra to tackle scalability, flexibility and operability issues in a way that's more highly available and cost effective than our current systems."

- Increased speed: this builds on Cassandra's highly-launded ability to process thousands of writes per second, allowing solutions of all kinds to cope with increasing write loads.

"Apache Cassandra 0.6 is 30% faster across the board, building on our already-impressive speed," added Ellis. "It achieves scale-out without making the kind of design compromises that result in operations teams getting paged at 2 AM."

Availability

Released under the Apache Software License v2.0, Apache Cassandra 0.6 can be downloaded at http://cassandra.apache.org/.

About The Apache Software Foundation (ASF)

Established in 1999, the all-volunteer Foundation oversees more than seventy leading Open Source projects, including Apache HTTP Server — the world's most popular Web server software. Through The ASF's meritocratic process known as "The Apache Way," nearly 300 individual Members and 2,000 Committers successfully collaborate to develop freely available enterprise-grade software, benefiting millions of users worldwide: thousands of software solutions are distributed under the Apache License; and the community actively participates in ASF mailing lists, mentoring initiatives, and ApacheCon, the Foundation’s official user conference, trainings, and expo. The ASF is funded by individual donations and corporate sponsors including Facebook, Google, HP, Microsoft, Progress Software, SpringSource/VMware, and Yahoo! For more information, visit http://www.apache.org/.

# # #

Calendar

Search

Hot Blogs (today's hits)

Tag Cloud

Categories

Feeds

Links

Navigation