Tag: Big Data

Streaming Oracle to Kafka – stories from the message bus stop

Fascinated by streaming data pipelines, I have been looking at different ways to get data out of a relational database like Oracle and into Apache Kafka. I have presented on this topic at a number of conferences. There is a…

Building a custom routing NiFi processor with Scala

In this post we will build a toy example NiFi processor which is still quite efficient and has powerful capabilities. The processor logic is straightforward: it will read incoming files line by line, apply a given function to transform each line into…

Updating Elasticsearch indexes with Spark

With the extensive adoption of Elasticsearch as a search and analytics engine, we increasingly build data pipelines that interact with Elasticsearch. And most often, it seems, the processing framework of choice is Apache Spark. Although reading data from Elasticsearch and…

How do machines learn?

Artificial intelligence, machine learning and data science are all terms that get thrown around a lot these days. While it’s easy to get into hair-splitting arguments about the distinctions between them, really they refer to the same thing: teaching machines…

Architecting a Modern Data Warehouse – Live Webinar

Join Pythian and DBTA for a live roundtable webinar, Architecting a Modern Data Warehouse, on Thursday, November 16, 2017, at 11:00 am PT / 2:00 pm ET. Today, the world of decision-making, along with the data sources…

Why should you consider the Cloud for a modern data warehouse?

Your data warehouse modernization strategy can make or break your ability to derive value from traditional data sources (for example, your operations and financial data) as well as emerging data sources such as IoT data from devices and sensors,…

Join Pythian and DBTA on August 24, 2017 for a live roundtable webinar: harnessing the Hadoop ecosystem

Harnessing the Hadoop Ecosystem, a live roundtable on Thursday, August 24 at 11:00 am PT / 2:00 pm ET. With a stake at the center of how organizations are consuming and leveraging big data, Hadoop adoption in the enterprise is growing…

Datascape podcast episode 3 – all about data lakes with Danil Zburivsky

I started hearing the term ‘data lake’ a few years ago but didn’t pay a ton of attention to it. Today, the term’s still around and so is the hype. According to this article on Wikipedia the term is poorly…

SQL On The Edge #11 – Azure data lake fundamentals

Warner Chaves, Principal Consultant at Pythian and Microsoft MVP, explores and explains the fundamentals of Azure Data Lake.

The value of data in business today

Alejandro Cordero, Lead Database Consultant at Pythian, discusses the rising importance of business data and how it adds value to an organization.
