Batches are one of the most misunderstood features of Apache Cassandra. They rarely improve performance. In fact, while using batches, performance may degrade. To set the stage, let’s take a look at how Cassandra handles individual mutations. Individual mutations…
Read More >Occasionally, clients reach out to us with authentication issues when a node is down. While this scenario shouldn’t happen in a high availability database management system (DBMS), it can if you miss a couple of very relevant lines in the…
Read More >This post is the continuation of the previous post, Cassandra 101: Understanding What Cassandra Is, in which I’ll highlight a series of topics related to Cassandra for beginners. Replication Factor The replication factor in Cassandra can be…
Read More >One of the many things to love about Cassandra is how operationally simple it is to add, remove or even replace nodes in a cluster. Replacing a node in Cassandra is as easy as setting your configuration files…
Read More >What is this about? Well, as the title implies, I will show how to set up Liquibase to manage changes in Snowflake as part of a DataOps practice. Why? Because I am going deep-dive into DataOps around Snowflake and I’ve…
Read More >Got too many tombstones? This blog post will talk about how to deal with tombstones once you already have them. For more information about tombstones, check out this post: Examining the Lifecycle of Tombstones in Apache Cassandra. Verifying the presence…
Read More >User-defined compactions allow us to manually select which files should be compacted. This enables us to reclaim space and limit the size of compaction so it can fit into the remaining space. These compactions are relevant only for SizeTieredCompactionStrategy (STCS)…
Read More >High latency values may indicate a cluster at the edge of its processing capacity, issues with the data model—such as poor choice of partition key or high levels of tombstones—or issues with the underlying infrastructure. Below are some major reasons…
Read More >At Pythian, we pride ourselves in assisting and empowering our clients. We strive to understand the needs of each customer and to teach customers to make the most of their resources. Experience has shown that the best way to learn…
Read More >On September 1, 2020, Apache disclosed a security vulnerability for Apache Cassandra. Summary: It’s possible for a local attacker without access to the Apache Cassandra process or configuration files, to manipulate the RMI registry to perform a man-in-the-middle attack and…
Read More >