The State of MySQL High Availability Going in to 2018

High availability for MySQL has become increasingly relevant given the ever increasing rate of adoption and implementation. It’s no secret to anyone in the community that the popularity of MySQL has become noteworthy. I still remember my start with MySQL…

Read More >

Dipping your toes into building an Analytics Platform on Google Cloud Platform

“We have many disparate data sources and we’re having a hard time getting a global view of all our data across our organization.” “Our data is currently all in <enter data warehouse name here> and we want to migrate it…

Read More >

Microsoft Azure IaaS SQL Server Best Practices

Modern cloud IAAS offerings are sold as a fast and convenient way to setup a server without having to worry about purchasing hardware, networking, and basic installation/configuration work. For the most part, I believe this is true, and they’ve fulfilled…

Read More >

Minimal Twitter to Google Pub/Sub example with Scala

Recently I was looking for a simple Twitter to Pub/Sub streaming pipeline and ended up with own implementation in Scala. I tried to make it as compact as possible. So I chose the dispatch and Google Pub/Sub client libraries for…

Read More >

Replicating in Google Cloud SQL using Tungsten

While investigating alternatives to migrate to Google Cloud SQL, I encountered a lack of support for external masters. However, it’s possible to overcome this limitation by replicating into Google Cloud SQL using Tungsten replicator. Cloud SQL is Google’s database-as-a-service solution,…

Read More >

Apache Beam pipelines with Scala: Part 3 – Dynamic processing

In the third part of the series we will develop a pipeline to transform messages from “data” Pub/Sub using messages from the “control” topic as source code for our data processor. The idea is to utilize Scala toolBox. It’s much…

Read More >

Apache Beam Pipelines With Scala: Part 2 – Side Input

In the second part of this series we will develop a pipeline to transform messages from “data” Pub/Sub topic with the ability to control the process via “control” topic. How to pass effectively non-immutable input into DoFn, is not obvious,…

Read More >

Apache Beam Pipelines With Scala: Part 1 – Template

In this 3-part series I’ll show you how to build and run Apache Beam pipelines using Java API in Scala. In the first part we will develop the simplest streaming pipeline that reads jsons from Google Cloud Pub/Sub, convert them…

Read More >

When to use Amazon Athena

Amazon Athena enables you to access data present in flat files stored in S3 (Simple Storage Service) as if it were in a table in the database. And you don’t have to set up a server or any other software…

Read More >

Cosmos DB Geo-replication – SQL On The Edge Episode 14

As Microsoft Azure’s NoSQL service offering, Cosmos DB has received a lot of investment and development effort. Microsoft considers Cosmos as a “ring zero” service, which means that it is available by default from all regions as soon as they…

Read More >
Page 1 of 1112345...10...Last Page »