Posts Categorized: Pythian

Small Files on MapR-FS

One of the well-known best practices for HDFS is to store data in few large files, rather than a large number of small ones. There are a few problems related to using many small files but the ultimate HDFS killer is that the memory consumption on the name node is proportional to the number of…

Cloudera Challenge 2014

Yesterday, Cloudera released the score reports for their Data Science Challenge 2014 and I was really ecstatic when I received mine with a “PASS” score! This was a real challenge for me and I had to put a LOT of effort into it, but it paid off in the end! Note: I won’t bother you…

Comparing CPU Throughput of Azure and AWS EC2

After observing CPU core sharing with Amazon Web Services EC2, I thought it would be interesting to see if Microsoft Azure platform exhibits the same behavior. Signing up for Azure’s 30-day trial gives $200 in credit to use over the next 30-day period: more than enough for this kind of testing. Creating a new virtual…

Critical MySQL 5.6 bug: GRANTs and replication

Critical MySQL 5.6 bug: any user with GRANT privileges can unwillingly cause all replicas to break The latest major release of MySQL brought us a lot of new and exciting features. As always, new features come with brand new bugs waiting to bite you in the least expected way. I was implementing a monitoring system…

Essential Hadoop Concepts for Systems Administrators

Of course, everyone knows Hadoop as the solution to Big Data. What’s the problem with Big Data? Well, mostly it’s just that Big Data is too big to access and process in a timely fashion on a conventional enterprise system. Even a really large, optimally tuned, enterprise-class database system has conventional limits in terms of…

Pythian Acquires Blackbird.io

Today, we officially announced the fact that Blackbird.io has been acquired by Pythian. I first met its founder, Paul Vallee, in 2007. Paul reached out to me about joining Pythian to found a San Francisco presence. At the time, I was one year into PalominoDB, and was having such a good time being my own…

Page 16 of 230« First...10...1415161718...304050...Last »