GigaOm x Pythian: Realities and Risks of Enterprise AI
In a recent interview on the Business Disruptions in Tech podcast, Howard Holton and ...

Creating Cassandra_installation playbook Deploying a Cassandra database manually on hundreds of machines might be a ...

Some clients have asked us to change num_tokens as their requirements change. For example, a lower number of ...

Spark Overview Spark was created in 2009 as a response to difficulties with map-reduce in Hadoop, particularly in ...
Every few months I have a customer come to me with the following concern: my compactions for one of my Cassandra tables ...
In this blog, I’m going to give a detailed guide on how to monitor a Cassandra cluster with Prometheus and Grafana. For ...
Intro First, let me start by saying I do not hate Cassandra. I love Cassandra. In its place, Cassandra is a powerful ...

Introduction While working on a few tasks in the past few weeks, I faced a problem that involved setting a "filter" for ...

Because incremental repairs can significantly reduce the time and IO cost of performing a repair, they can seem like a ...

A client recently asked us to migrate a Cassandra cluster running in Docker containers to Elassandra, with the data ...
Here's a short and sweet post. "In Deploy mode, AutoUpgrade also confirms that the upgrade has succeeded, and moves ...

Cassandra nodetool provides several types of commands to manage your Cassandra cluster. See my previous post about ...
Over the last few months, I have been seeing references to Cassandra 4.0 and some of its new features. When that ...
On September 1, 2020, Apache disclosed a security vulnerability for Apache Cassandra. Summary: It's possible for a ...
Context: Every now and then I find myself looking for a couple of commands I use often. In some other ...
Cassandra support for transactions Unlike commonly used relational database platforms, Cassandra does not support ACID ...


Cassandra nodetool provides several types of commands to manage your Cassandra cluster. See my previous posts for an ...
This post is the first part of a series of blog posts regarding the lifecycle and management of tombstones. Deleting ...
User-defined compactions allow us to manually select which files should be compacted. This enables us to reclaim space ...
High latency values may indicate a cluster at the edge of its processing capacity, issues with the data model—such as ...

Occasionally, clients reach out to us with authentication issues when a node is down. While this scenario shouldn’t ...
Got too many tombstones? This blog post will talk about how to deal with tombstones once you already have them. For ...

This post is the continuation of the previous post, Cassandra 101: Understanding What Cassandra Is, in which I’ll ...

Apache Cassandra promises linear scalability and workload distribution, among many other features—and rightly so. ...

Batches are one of the most misunderstood features of Apache Cassandra. They rarely improve performance. In fact, while ...

One of the many things to love about Cassandra is how operationally simple it is to add, remove or even replace nodes ...

When I started learning Cassandra, I noticed some concepts were not well explained, and it might be helpful for ...

Introduction Cassandra is an open-source NoSQL distributed database used for processing large volumes of data with high ...

Spark is an open-source, distributed processing system used to manage big data workloads. Spark uses in-memory caching ...