I started hearing the term ‘data lake’ a few years ago but didn’t pay a ton of attention to it. Today, the term’s still around and so is the hype. According to this article on Wikipedia the term is poorly defined and has morphed significantly since it was coined by James Dixon.
Does it mean Hadoop? Does it mean we put everything in one spot? Do we need to document what’s there?
My Google search yielded all sorts of contradictory articles about ‘data lakes’ so I decided to talk to my colleague, Danil Zburivsky, to find out what’s real, what isn’t and what people are using it for.
Join Danil and I for the third episode of the Datascape Podcast as we sift through the murky term ‘data lake’ for some clarity.