Tag: Pythian

Which Partitions Does the SQL Statement Access?

This is a short blog post on how one can prove that a particular partition of the table or index is accessed by a specific query.

Read More >

Deploying Cloudera Impala on EC2 with Example Live Demo

Pythian Big Data Impala Implementation

A little while ago I blogged about (and open sourced) an Impala-powered soccer visualization demo, designed to demonstrate just how responsive Impala queries can be. Since not everyone has the time or resources to run the project themselves, we’ve decided to host it ourselves on an EC2 instance.

Read More >

COLLABORATE 13 à la Pythian

Like many previous IOUG/OAUG/Quest shows, Pythian will be in Denver next week!

Read More >

How to Get the Most Out of Collaborate! Talking from Experience

First, the most important advice I can give you is to relax and have FUN!

Read More >

Using Ansible to Secure Cloudera Manager Installation on a Hadoop Cluster

Building a secure Hadoop cluster requires protecting a number of services which comprise Hadoop infrastructure. If you are using CDH distribution, then Cloudera Manager (CM) is one of the components that needs to be secured. There is a good step by step guide in CM documentation, and it’s easy to follow for one server, but what when you have hundreds of them? There are different approaches to the problem of managing server’s configuration at scale, but I’d like to focus on Ansible which is a neat framework for parallel commands execution and complex rollouts.

Read More >

Performance Settings of Concurrent Managers

This is the second article in a series about internals and performance of concurrent managers. In this post, we’ll take a look at three important settings that affect the performance of the concurrent managers: number of processes, “sleep seconds”, and “cache size”. This article might be a bit on the theoretical side, but it should provide a good understanding of how these settings actually affect the behavior and performance of concurrent managers.

Read More >

The Pythian Chalkboard Wall Comes Together

Read More >

Oracle Trace Files in User Directories

How to resolve unusual situation where Oracle was writing trace files into the user directory instead of the usual oracle diagnostic destinations.

Read More >

How to Tune Using v$mystat

When reviewing the performance of some queries, it is sometimes useful to review the sessions statistics for each execution of the query. I had a situation that required to look at these stats so I could see why one query would run fast and sometimes much slower. I wrote a simple wrapper ksh shell script for the query. It saves the session statistics in a table before and after the execution of the query and then prints out the statistics in a pivot report. This turned out to be very handy to me and therefore I chose to share it with the world :)

Read More >

Pythian Days

Read More >
Page 10 of 40« First Page...89101112...20...Last Page »