Tag: Hadoop

How to Deploy Machine Learning on Google Cloud Platform

Editor’s Note: Because our bloggers have lots of useful tips, every now and then we update and bring forward a popular post from the past. Today’s post was originally published on August 15, 2019. In this post, I’ll describe a…

Read More >

Can you run Hadoop in the cloud?

As a solutions architect at Pythian, I often get questions from clients about the many solutions available to them to address their big data needs. Between Hadoop, cloud-based, and hybrid solutions, finding the best option for their unique needs can…

Read More >

Hadoop to the cloud: Is it time to make the move?

When Hadoop was introduced, it promised a faster time to insight because of simpler modeling using cheaper hardware. But over time, organizations have been finding that Hadoop’s complexity and need for specialized skills are adding cost and headaches. Now instead…

Read More >

Big Data on Microsoft Azure – HDInsight

Introduction   The best definition you going to find for data is that data is the new oil in today’s world. Starting from that, we can define a new horizon and a new way of looking at how we treat…

Read More >

Join Pythian and DBTA on August 24, 2017 for a live roundtable webinar: harnessing the Hadoop ecosystem

Harnessing the Hadoop Ecosystem Live Roundtable Thursday, August 24 at 11:00 am PT / 2:00 PM ET REGISTER With a stake at the center of how organizations are consuming and leveraging big data, Hadoop adoption in the enterprise is growing…

Read More >

Datascape podcast episode 10 – getting transactional with Hadoop

In this episode we discuss using Hadoop as the data store for a public facing, web based application. We talk about some of the challenges and how they were overcome.

Read More >

Hadoop as part of your big data strategy

If you’re thinking about moving from a traditional relational database management system (RDBMS), you should consider Apache™ Hadoop®—because your competitors probably are. According to Gartner, Hadoop joined the mainstream in 2016. And Allied Research says the Hadoop market will likely…

Read More >

Datascape podcast episode 3 – all about data lakes with Danil Zburivsky

I started hearing the term ‘data lake’ a few years ago but didn’t pay a ton of attention to it. Today, the term’s still around and so is the hype. According to this article on Wikipedia the term is poorly…

Read More >

SQL On The Edge #11 – Azure data lake fundamentals

Warner Chaves, Principal Consultant at Pythian and Microsoft MVP explores and explains the basic fundamentals of Azure Data Lake.

Read More >

7 tips for managing Infrastructure with Terraform

Kevin Pedersen, Team Manager, Service Reliability Engineering at Pythian, provides some considerations to keep your Terraform environment manageable.

Read More >
Page 1 of 41234