Category Archives: analytics

Correlation Analysis with IBM’s PureData for Analytics (Netezza)

IBM PureData systems come with a lengthy list of in-database analytic functions that you can run without having to export the data.  One of the easiest functions to understand and implement is correlation analysis. Correlation is an association between … Continue reading

Posted in analytics, Netezza, PureData for Analytics

Enabling Netezza Analytics for use in a database

Many of Netezza’s analytic functions require the presence of some metadata tables/views in order to work.  These tables/views are created when you initialize the analytic libraries in a particular database.
SYSTEM(ADMIN)=> call nza..kmeans('intable=t1,id=int1,k=10,model=kmeans,outtable=out,maxiter=5');
ERROR:  The metadata tables are not initialized. … Continue reading

Posted in administration, analytics, General

Dropping a Netezza Analytics model

The result of most of Netezza’s analytic functions is a series of tables.  The tables produced vary from function to function.  We schedule jobs to run the analytic functions and want to reuse the same model name so that our … Continue reading

Posted in administration, analytics, General

Netezza in-database modeling with SPSS Modeler 14.2: K-Means 35x faster

Leveraging Netezza’s in-database analytic capabilities can significantly reduce the amount of time required to execute SPSS streams.  By pushing the analytics to the data, we eliminate the need to pull the data out of the table and onto our SPSS … Continue reading

Posted in analytics, General, Uncategorized

Error 126 when connecting SPSS Modeler to Netezza using 64bit ODBC driver

I installed IBM SPSS Modeler 14.1 today and the first thing I wanted to do was connect it to Netezza and compare performance with and without in-database analytics. It didn’t take long for me to run into my first issue … Continue reading

Posted in analytics, Uncategorized

IBM Netezza analytics to analyze query history table usage

Using Netezza’s in-database analytics package FPGROWTH, database administrators can identify the most commonly used combinations of tables and the performance of the queries that reference those sets of tables. First, let’s see the most commonly used combinations of tables.  Today, … Continue reading

Posted in administration, analytics, Performance | 2 Comments

Linear Regression with IBM Netezza’s Analytics package

The IBM Netezza Analytics suite includes a Linear Regression function that you can use (very easily) to identify the intercept and slope between a dependent variable and any number of independent variables.  I’m no math nut, but the presentation of this … Continue reading

Posted in analytics, General