OFFICIAL PYTHIAN BLOG

Industry insight and analysis covering Big Data, Hadoop, MongoDB, Oracle, SQL Server, MySQL, and more.

ALL POSTS

Are you ready to be a private cloud service provider?

When defining what a cloud service is, we need to know that it is not a technology per se, but its an architectural and operational paradigm. It is a self-service computing environment offering the ability to create, consume, and pay for services. In this architecture, computing resources are elastically supplied from a shared pool and charged based…

Git for Beginners

git, simplified Perhaps you’ve come across a great cache of publicly available SQL scripts that would be very useful in monitoring your databases, and these scripts are hosted on github.  Getting those scripts is as simple as clicking the Download button. What if, however, you wish to contribute to the script library? Or perhaps you…

Thoughts on Google Cloud Dataflow

Google Cloud Dataflow is a data processing tool developed by Google that runs in the cloud. Dataflow is an easy to use, flexible tool that delivers completely automated scaling. It is deeply tied to the Google cloud infrastructure, making it a very powerful for projects running in Google Cloud. Dataflow is an attractive resource management…

Mongostat – A Nifty Tool for Mongo DBAs

One of the main MongoDB DBA’s task is to monitor the usage of MongoDB system and it’s load distribution. This could be needed for proactive monitoring, troubleshooting during performance degradation, root cause analysis, or capacity planning. Mongostat is a nifty tool which comes out of the box with MongoDB which provides wealth of information in…

Partitioning in Hive Tables

Partitioning a large table is general practice for a few reasons: Improving query efficiency by avoiding to transfer and process unnecessary data. Improving data lineage by isolating batches of ingestion, so if a ingestion batch failed for some reason and introduces some corrupted data, it’s safe to re-ingest the data With that being said this practice…

Why Move to Cassandra?

2000px-Cassandra_logo.svg

Nowadays Cassandra is getting a lot of attention, and we’re seeing more and more examples of companies moving to Cassandra. Why is this happening? Why are companies with solid IT structures and internal knowledge shifting, not only to a different paradigm (Read: NoSQL vs SQL), but also to completely different software? Companies don’t simply move…

Page 1 of 28412345...10...Last Page »