Tag: apache cassandra

Examining the Lifecycle of Tombstones in Apache Cassandra

This post is the first part of a series of blog posts regarding the lifecycle and management of tombstones. Deleting and expiring data in Cassandra is something that you should carefully plan. Especially if you’re about to delete a massive…

Read More >

So you have a broken Cassandra SSTable file?

Every few months I have a customer come to me with the following concern: my compactions for one of my Cassandra tables are stuck or my repairs fail when referencing one of the nodes in my Cassandra cluster. I take…

Read More >

Backup strategies in Cassandra

Cassandra is a distributed, decentralized, fault-tolerant system. Data is replicated throughout multiple nodes (centers) across various data centers. The fact that Cassandra is decentralized means that it can survive single or even multi-node failures without losing any data. With Cassandra,…

Read More >

Which Cassandra version should I use (2018)?

If you’re starting new or in the 3.0.x series: 3.11.2 Apache Cassandra 3.0 is supported until 6 months after 4.0 release (date TBD) If you’re in 2.x, update to the latest in the series (2.1.20, 2.2.12) Apache Cassandra 2.2 is supported…

Read More >

Understanding the Java Virtual Machine (JVM) architecture part 1

What is garbage collection? Garbage collection (GC) is a form of automatic memory management. In essence what the garbage collection does is to attempt to reclaim garbage, ergo memory occupied by objects that are no longer relevant for the active…

Read More >

Pythian at Cassandra Summit 2016

Vanessa Simmons, Director of Business Development at Pythian, provides a sneak peek of Pythian programming at Cassandra Summit 2016.

Read More >

Two easy ways to poll apache Cassandra Metrics using JMX Http Bridge

Yabin Meng, Datastax Enterprise and Apache Cassandra Consultant at Pythian, explores two simple but useful methods to poll Cassandra metrics through a JMX Http Bridge.

Read More >