Category Archives: Uncategorized

Using R to improve your fantasy football team

So I’ve started playing around with R and this week decided to see if I could more intelligently add a player to my team from the ranks of free agency.  The position I needed to fill?  The kicker. The first … Continue reading

Posted in Uncategorized | 1 Comment

“Error occurred while loading translation library” when connecting R to IBM Netezza

When connecting my R-2.15 client to IBM Netezza v7 (NZA 2.5.4) for the first time, I got the error above.  Here was the connect call: >nzConnectDSN(“NZSQL”) Error in odbcDriverConnect(“DSN=VirtualNZ”) :   (converted from warning) [RODBC] ERROR: state HY000, code 45, … Continue reading

Posted in Uncategorized | Leave a comment

Pig workflow optimization: splitting data flows

Pig supports the concept of non-linear data flows, where you have a single input but multiple outputs.  Pig’s optimizer is smart enough to recognize when the same input is referenced multiple times and implicitly splits those data flows.  You can … Continue reading

Posted in Uncategorized | Leave a comment

To copy or move: Implications of loading Hive managed table from HDFS versus local filesystem

When using the load function to populate a Hive table, it’s important to understand what Hive does with the actual data files when the input data resides on your local file system or on the HDFS file system. For example, … Continue reading

Posted in hadoop, hive, scripting, Uncategorized | Tagged , , | Leave a comment

Configuring pig to work with a remote Hadoop cluster

1. First, download a stable release of Pig from here. 2. As root (or some other privileged user), untar the pig tarball to /usr/local; this will create a sub-directory like /usr/local/pig.0.11.1. 3. Create a symbolic link (to make things easier) … Continue reading

Posted in hadoop, pig, scripting, Uncategorized | Tagged , , | 1 Comment

Getting to a blades console from IBM’s IMM (and then getting out!)

Because I don’t do this often, I always forget how to get out of console mode via IBM’s IMM (integrated management module).  I should add that there is a way to do this via the web-based GUI but I’m a … Continue reading

Posted in Uncategorized | Leave a comment

Accessing Facebook Graphs search from Jaql

To use Facebook’s graph search, you need to register first as a developer and then create an app on the Facebook developer site.  Once you have done this, you need to identify the following values for your app: App ID … Continue reading

Posted in Uncategorized | Tagged , , | Leave a comment