Author: John Schulz

How to build your very own Cassandra 4.0 release

Over the last few months, I have been seeing references to Cassandra 4.0 and some of its new features. When that happens with a technology I am interested in, I go looking for the preview releases to download and test….

Read More >

So you have a broken Cassandra SSTable file?

Every few months I have a customer come to me with the following concern: my compactions for one of my Cassandra tables are stuck or my repairs fail when referencing one of the nodes in my Cassandra cluster. I take…

Read More >

Proposal for a new Cassandra cluster key compaction strategy

Cassandra storage is generally described as a log-structured merge tree (LSM). LSM storage typically handles writes, updates and deletes much faster than reads. As a general rule, a write in Cassandra is an order of magnitude faster…

Read More >

The life of a Pythian road warrior

I’m sitting here on March 16th, the day before Saint Patrick’s Day, and I am thinking about what I will need for my trip to Salt Lake City next week. I work for Pythian in the Principal Consulting Group. We are…

Read More >

Why column stores?

Introduction I’ve been working with data in many forms for my entire career. During this time, I have occasionally needed to build or query existing databases to get statistical data. Traditional databases are usually designed to query specific data from…

Read More >

Cassandra as a time series database

When I think of time series databases, my first thoughts go to my own experiences using a relational database to store time series data. The mapping tends to be fairly straightforward. Each row stores data for a single event…

Read More >

Cassandra use cases: when to use and when not to use Cassandra

Introduction I have a database server that has these features: Highly available by design. Can be globally distributed. Allows applications to write to any node anywhere, anytime. Linearly scalable by simply adding more nodes to the cluster. Automatic workload and…

Read More >

The case against auto increment in MySQL

Introduction In my travels to visit many customers over the last few years, I often see my customers creating many or all of their MySQL InnoDB tables using auto-increment primary keys. Many Object Relational Mappers do this by default on…

Read More >

A Cassandra consistency use case

I recently completed a project where I worked with a company using Cassandra to keep metadata about objects stored in an Object Store. The application keeps track of individual objects as rows within a partition based on user id….

Read More >