Posts by André Araújo

Small Files on MapR-FS

One of the well-known best practices for HDFS is to store data in a few large files rather than in a large number of small ones. Using many small files causes several problems, but the ultimate HDFS killer is that memory consumption on the name node is proportional to the number of…

Cloudera Challenge 2014

Yesterday, Cloudera released the score reports for their Data Science Challenge 2014 and I was really ecstatic when I received mine with a “PASS” score! This was a real challenge for me and I had to put a LOT of effort into it, but it paid off in the end! Note: I won’t bother you…

How to Download Oracle Software Using WGET or CURL

This is yet another blog post with tips and tricks to help you (and me) download Oracle software without the help of a Web browser. If you Google “how to download Oracle with wget” you’ll find heaps of posts with useful tips. I decided to write this post to compile the methods I’m aware of…

Connection Resets When Importing from Oracle with Sqoop

I’ve been using Sqoop to load data into HDFS from Oracle. I’m using version 1.4.3 of Sqoop, running on a Linux machine and using the Oracle JDBC driver with JDK 1.6. I was getting intermittent connection resets when trying to import data. After much troubleshooting, I eventually found the problem to be related to a…

SQL for Pattern Matching in Oracle 12c

Oracle 12c is out! And with it, a myriad of new features that we’ll be learning and playing with in the months and years to come. Paraphrasing Iggy Fernandez’s blog: “So many Oracle manuals, so little time…” The new features abound, and we need to cherry pick some interesting ones to delve into. There are…

My Pythian Volunteer Day: Blood Donation

Every employee at Pythian can use one workday per year to spend on activities to help the community. We call it a Pythian Volunteer Day. It’s up to the employee to decide what to use it for and when. This year I used my volunteer day to donate blood, and this post is a short story about my experience.

Pythian MoBros United!!

Movember is coming to an end… and what a ball we had at Pythian this month! The laughs, pictures and performances will stay in our memories for a long time, and on Facebook, YouTube, Twitter, etc…

Flashback and Forth

One of the readers of my previous post about Oracle’s Flashback Database posted a comment noting that the Oracle documentation doesn’t make any reference to the ability of the FLASHBACK DATABASE command to roll the database forward. I knew from experience that the database can be rolled back and forth using the FLASHBACK DATABASE command, so I decided to have a closer look.

My First Experience with Cassandra – Part 1

The traditional NoCOUG SQL challenge has been launched this year with a twist: In the wake of the “BigData” trend/buzz, it’s now been upgraded to a “SQL and/vs. NoSQL” challenge. I took on the challenge, treading through my SQL comfort zone and thinking of ways I could bend relational algebra to solve the wicked puzzle suggested this year.

War Story: Applying Bundle Patch 1 to EM 12 Cloud Control

We have a few clients already using Enterprise Manager 12c Cloud Control. In my opinion, the interface and navigation have improved a lot from the 11g version. However, as with any new release of anything, quite a few bugs still need to be fixed. After working with Oracle on some of these bugs last week, we were asked to apply Bundle Patch 1 (BP1) to one of our clients’ installations. The first thing I noticed when I started looking for information about BP1 was the number of warnings from different people in MOS and around the internet.
