Pythian has been working with clients on big data consulting projects, and when it comes to MongoDB and Hadoop, we are often asked two questions.
- What is the differences between Hadoop and MongoDB?
- Can Hadoop and MongoDB be used together?
Alex Gorbachev, Pythian’s CTO, recorded this short video to share his insight and answer these two questions.
Discover more about our expertise in Hadoop
15 Comments. Leave new
here is a question, i am working on a project for DDOS mitigation system.
and i am looking into mitigating other kinds of attacks as well.
n i am planning to do this by monitoring the patterns of attacks.
so here, i wanted to know what will be useful for storing the database of the patterns: mongodb or hadoop ?
awaiting your reply…
Sounds like MongoDB is the best option and then once go beyond few TB , let’s say 3 or 5 TB then You can integrate hadoop to process large data.
Hi Alex,
I’m interested in exploring a bit more Hadoop and I’d like to have some suggestions on where to starting from… any website? book? etc
Thanks in advance for your help.
Regards,
-Felipe
I would suggest go through some videos on youtube and start experimenting with Hadoop
1. Collection of Videos on Hadoop/Eco-System components
https://www.youtube.com/channel/UCLKzuy6OgQwCJ5XaSFn5yRQ
2. https://www.youtube.com/watch?v=ziqx2hJY8Hg
I do sorta disagree with one item that he stated. He said that mongo is used with values like dozens or hundreds of rows. In my experience, it works very well with very large numbers of documents. We are successfully using data in our website that has nearly 3 million rows. Obviously we are paging the data, but retrieving the data is still relatively pretty fast for the amount of data that we have. We have several other collections with 100,000’s of documents also.
Hi,
I am database administrator and would like to know between Hadoop and MongoDB which one serve me better in my career. I have doing some reading and it seems like there is very specific parts in Hadoop that has to do with the data management itself. Can you point me towards the correct one?
How About Both.
Go for Hadoop framwork
Hi,
I am database administrator and would like to know between Hadoop and MongoDB which one serve me better in my career. I have doing some reading and it seems like there is very specific parts in Hadoop that has to do with the data management itself. Can you point me towards the correct one?which one is easy to learn.
they are all document database. it is the concern that we have to have a TB space that needs to be planning way ahead. My last MongoDB project had suffered base on the cost of server disk space.
As we are in March 2016, is still MongoDB the solution to real-time pseudo-BigData (as the dataset may be smaller than in Hadoop) analytics? Are there any other DBs than can be used for real-time BigData?
Big Data related softwares can get very complex and it makes it hard to decide which one is best for you and your company , The video explanation is great to understand the differences between mongodb and hadoop and what should be used based on the amount of data you are going to work with.
Excellent post. I was checking continuously this weblog and I am impressed!
Very helpful information particularly the final sectkon :) I care for such info much.
I was looking for this certain info for a very long time.
Thank you and best of luck.
I was looking out for certain points to distinguish when to use MongoDB versus Hadoop, Above discussion was really helpful . thanks guys!!
Hi.
I would like to ask a question. Now I am working on a system which needs a data querying(including aggregation) and export to file. so which one should I choose? should I use sparkR to get statistical value?and which one provide faster performance?