Tag: Big Data

Three reasons you need a customer data platform right now

A Twitter user recently turned to the platform to issue an appeal to her bank: “Please don’t send me emails asking if I’m ready to buy a house ten minutes after emailing me an overdraft notice.” That one tweet neatly…

Read More >

Data modeling for cloud DW

In this blog post, I would like to share some options that you can consider to model your cloud DW for better query performance.  With a traditional EDW, we would either come up a STAR, Snowflake or similar schemas. These…

Read More >

Big Data on Microsoft Azure – HDInsight

Introduction   The best definition you going to find for data is that data is the new oil in today’s world. Starting from that, we can define a new horizon and a new way of looking at how we treat…

Read More >

Streaming Oracle to Kafka – Stories from the Message Bus Stop

Fascinated by streaming data pipelines, I have been looking at different ways to get data out of a relational database like Oracle and into Apache Kafka. I have presented about this topic at a number of conferences. There is a…

Read More >

Dipping your toes into building an Analytics Platform on Google Cloud Platform

“We have many disparate data sources and we’re having a hard time getting a global view of all our data across our organization.” “Our data is currently all in <enter data warehouse name here> and we want to migrate it…

Read More >

Building a custom routing NiFi processor with Scala

In this post we will build a toy example NiFi processor which is still quite efficient and has powerful capabilities. Processor logic is straightforward: it will read incoming files line by line, apply given function to transform each line into…

Read More >

Updating Elasticsearch indexes with Spark

With the extensive adoption of Elasticsearch as a search and analytics engine, more often we build data pipelines that interact with Elasticsearch. And apparently, most often the processing framework of choice is Apache Spark. Although reading data from Elasticsearch and…

Read More >

How do machines learn?

Artificial intelligence, machine learning and data science are all terms that get thrown around a lot these days. While it’s easy to get into hair-splitting arguments about the distinctions between them, really they refer to the same thing: teaching machines…

Read More >

Architecting a Modern Data Warehouse – Live Webinar

Join Pythian and DBTA for a live roundtable webinar Architecting a Modern Data Warehouse Live Roundtable Webinar Thursday, November 16, 2017 11:00 am PT / 2:00 pm ET REGISTER TODAY Today, the world of decision-making, along with the data sources…

Read More >

Why You Should Consider the Cloud for a Modern Data Warehouse

  Your data warehouse modernization strategy can make or break your ability to derive value from traditional data sources (for example, your operations and financial data) as well as emerging data sources such as IoT data from devices and sensors,…

Read More >
Page 1 of 1012345...10...Last Page »