Tag: Big Data

GoldenGate 12.2 Big Data Adapters: part 1 – HDFS

Gleb Otochkin, Principal Consultant and Certified Oracle Expert at Pythian, discusses the HDFS adapter for the newest version of GoldenGate.

Read More >

Google Cloud Dataproc in ETL pipeline – Part 1 (Logging)

Pythian’s Big Data Principal Consultant at Pythian, Vladimir Stoyak talks about Google Cloud Dataproc, and provides an in depth look at logging in this technical blog post.

Read More >

Configure High Availability – Load Balancing for Hiveserver2

Manoj Kukreja, Pythian Big Data Consultant, provides you with the right steps to ensure that you have a smooth and available Hive system, performing under increased workloads.

Read More >

How to Deploy a Cluster

Zunaira Jamil, Pythian co-op student in our big data practice, walks you through her lessons learned and solutions when installing a cluster using Cloudera manager for the first time.

Read More >

Big Data Co-op Experience at Pythian

Zunaira Jamil, Pythian co-op student, describes her experiences while fulfilling her experience working with one of our Big Data teams and what she learned.

Read More >

Issues with Triggers in Cloudera Manager

Valentin Nikotin explains why triggers in Cloudera Manager is a very useful feature, as well as how you can set them up to monitor tons of available metrics using tsquery language.

Read More >

Recursion in Hive – Part 1

In Part 1 of this series, Valentin Nikotin, will teach you about migrating from RDBMS to Hive, while maintaining the simplicity and flexibility of a SQL approach.

Read More >

What is Big Data and Do You Really Need it?

Are you interested in transforming your business potential, but stuck asking the same old question, “What is Big Data? Pythian’s CTO and Oracle Ace, Alex Gorbachev, explains everything you need to know.

Read More >

Magic of “\d” in Vertica

A quick neat way to list down important and oft-needed information like names of databases, schemas, users, tables, projections etc. We can also use patterns with the ‘\d’ to narrow down the results. Let’s see it in action:

Read More >

Mongostat – A Nifty Tool for Mongo DBAs

One of the main MongoDB DBA’s task is to monitor the usage of MongoDB system and it’s load distribution. This could be needed for proactive monitoring, troubleshooting during performance degradation, root cause analysis, or capacity planning. Mongostat is a nifty…

Read More >
Page 2 of 912345...Last Page »