Guy Harrison

Working with Cassandra 0.7

Friday, January 14, 2011 at 4:21PM

In an earlier post, I experimented with inserting data from Oracle into Cassandra column families using Hector. Unfortunately, that code isn’t compatible with the latest Cassandra 0.7 release, so I had to rework it. The new version uses the addInsertion method of the Mutator object and, while not totally intuitive, didn’t take long to get working. Here are the key changes:

// Note: stringSerializer is not declared in this excerpt; it is assumed to be
// a class-level field holding StringSerializer.get().
private static void insertSales(Connection oracleConn, Keyspace keyspace,
        String cfName) throws SQLException {
    int rows = 0;
    ColumnPath cf = new ColumnPath(cfName);
    Statement query = oracleConn.createStatement();

    String sqlText = "SELECT cust_id, cust_first_name,  cust_last_name, prod_name, "
            + "           SUM (amount_sold) sum_amount_sold,sum(quantity_sold) sum_quantity_sold "
            + "          FROM sh.sales    "
            + "          JOIN sh.customers USING (cust_id) "
            + "          JOIN sh.products  USING (prod_id)  "
            + "         GROUP BY cust_id, cust_first_name,  cust_last_name,  prod_name "
            + "         ORDER BY cust_id, prod_name ";
    ResultSet results = query.executeQuery(sqlText);
    int rowCount = 0;
    int lastCustId = -1;
    while (results.next()) { // For each customer/product row
        Integer custId = results.getInt("CUST_ID");
        String keyValue = custId.toString();

        if (rowCount++ == 0 || custId != lastCustId) { // New Customer
            String custFirstName = results.getString("CUST_FIRST_NAME");
            String custLastName = results.getString("CUST_LAST_NAME");
            System.out.printf("%s %s\n", custFirstName, custLastName);
            // Create a supercolumn for customer details (first, lastname)
            Mutator<String> mutator = HFactory.createMutator(keyspace,
                    stringSerializer);
            mutator.addInsertion(keyValue, cfName, HFactory
                    .createSuperColumn("CustomerDetails", Arrays
                            .asList(HFactory.createStringColumn(
                                    "customerFirstName", custFirstName)),
                            StringSerializer.get(), StringSerializer.get(),
                            StringSerializer.get()));
            mutator.addInsertion(keyValue, cfName, HFactory
                    .createSuperColumn("CustomerDetails", Arrays
                            .asList(HFactory.createStringColumn(
                                    "customerLastName", custLastName)),
                            StringSerializer.get(), StringSerializer.get(),
                            StringSerializer.get()));

            mutator.execute();
        }
        // Insert product sales total for that customer
        String prodName = results.getString("PROD_NAME");
        Float SumAmountSold = results.getFloat("SUM_AMOUNT_SOLD");
        Float SumQuantitySold = results.getFloat("SUM_QUANTITY_SOLD");
        // Supercolumn name is the product name
        Mutator<String> mutator = HFactory.createMutator(keyspace,
                stringSerializer);
        mutator.addInsertion(keyValue, cfName, HFactory.createSuperColumn(
                prodName, Arrays.asList(HFactory.createStringColumn(
                        "AmountSold", SumAmountSold.toString())),
                StringSerializer.get(), StringSerializer.get(),
                StringSerializer.get()));
        mutator.addInsertion(keyValue, cfName, HFactory.createSuperColumn(
                prodName, Arrays.asList(HFactory.createStringColumn(
                        "QuantitySold", SumQuantitySold.toString())),
                StringSerializer.get(), StringSerializer.get(),
                StringSerializer.get()));
        mutator.execute();
        lastCustId = custId;
        rows++;
    }
    System.out.println(rows + " rows loaded into " + cf.getColumn_family());
}
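To make the resulting layout easier to picture, here is a minimal sketch that models the super column family as nested maps (row key, then supercolumn, then column). It makes no Cassandra or Hector calls; the class name SuperColumnLayout, its insert method, and the sample values are all invented for illustration. Each insert call plays the role of one addInsertion above, and two insertions into the same supercolumn end up side by side under it.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Plain-Java model of a super column family: row key -> supercolumn -> column -> value. */
public class SuperColumnLayout {

    // Hypothetical in-memory stand-in for the column family (not a Hector type).
    private final Map<String, Map<String, Map<String, String>>> rows = new LinkedHashMap<>();

    /** Stores a single column under rowKey / superColumn, creating the levels as needed. */
    public void insert(String rowKey, String superColumn, String column, String value) {
        rows.computeIfAbsent(rowKey, k -> new LinkedHashMap<>())
            .computeIfAbsent(superColumn, k -> new LinkedHashMap<>())
            .put(column, value);
    }

    public Map<String, Map<String, Map<String, String>>> rows() {
        return rows;
    }

    /** Builds the layout for one customer with two products, using made-up data. */
    public static SuperColumnLayout sample() {
        SuperColumnLayout cf = new SuperColumnLayout();
        // Customer details go into a fixed-name supercolumn, once per customer.
        cf.insert("101", "CustomerDetails", "customerFirstName", "Ann");
        cf.insert("101", "CustomerDetails", "customerLastName", "Lee");
        // Each product becomes its own supercolumn, named after the product.
        cf.insert("101", "Widget", "AmountSold", "25.5");
        cf.insert("101", "Widget", "QuantitySold", "3.0");
        cf.insert("101", "Gadget", "AmountSold", "10.0");
        cf.insert("101", "Gadget", "QuantitySold", "1.0");
        return cf;
    }

    public static void main(String[] args) {
        System.out.println(sample().rows());
    }
}
```

The point of the model is that a single row key ends up holding both the customer attributes and a variable number of product supercolumns, which is why the loader only writes CustomerDetails when the customer id changes.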

I wanted to do this so that I could play with Cassandra using our (relatively) new Toad for Cloud Databases Eclipse client. Toad for Cloud Databases lets you work with non-relational data sources such as Cassandra, HBase, and SimpleDB using SQL.

Here’s how it works. We select the column family we want to map from the Cassandra server.

That column family contains data loaded from both the Oracle CUSTOMERS and SALES tables. Toad recognizes that the data in that single column family is best represented by two normalized tables, and gives us the opportunity to specify the names for the primary and foreign keys. We can also rename the “tables” (more like views really) that Toad will create.
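As a rough illustration of what "two normalized tables" means for this data, the sketch below splits a nested-map model of the column family into a customer table and a sales table. This is not how Toad implements the mapping; the class NormalizeSketch, the key name cust_id, the column name prod_name, and the sample data are assumptions made up for the example.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Splits a nested-map model of the column family into two flat "tables". */
public class NormalizeSketch {
    static final String DETAILS = "CustomerDetails";

    /** One record per row key, built from the CustomerDetails supercolumn. */
    public static List<Map<String, String>> customers(
            Map<String, Map<String, Map<String, String>>> cf, String keyName) {
        List<Map<String, String>> table = new ArrayList<>();
        for (Map.Entry<String, Map<String, Map<String, String>>> row : cf.entrySet()) {
            Map<String, String> record = new LinkedHashMap<>();
            record.put(keyName, row.getKey()); // row key acts as the primary key
            Map<String, String> details = row.getValue().get(DETAILS);
            if (details != null) {
                record.putAll(details);
            }
            table.add(record);
        }
        return table;
    }

    /** One record per (row key, product supercolumn) pair. */
    public static List<Map<String, String>> sales(
            Map<String, Map<String, Map<String, String>>> cf,
            String keyName, String nameColumn) {
        List<Map<String, String>> table = new ArrayList<>();
        for (Map.Entry<String, Map<String, Map<String, String>>> row : cf.entrySet()) {
            for (Map.Entry<String, Map<String, String>> sc : row.getValue().entrySet()) {
                if (DETAILS.equals(sc.getKey())) {
                    continue; // customer attributes belong to the other table
                }
                Map<String, String> record = new LinkedHashMap<>();
                record.put(keyName, row.getKey());   // row key acts as the foreign key
                record.put(nameColumn, sc.getKey()); // supercolumn name becomes a column
                record.putAll(sc.getValue());
                table.add(record);
            }
        }
        return table;
    }

    /** Made-up sample: two customers, three product supercolumns in total. */
    public static Map<String, Map<String, Map<String, String>>> sample() {
        Map<String, Map<String, Map<String, String>>> cf = new LinkedHashMap<>();
        put(cf, "101", DETAILS, "customerFirstName", "Ann");
        put(cf, "101", DETAILS, "customerLastName", "Lee");
        put(cf, "101", "Widget", "AmountSold", "25.5");
        put(cf, "101", "Gadget", "AmountSold", "10.0");
        put(cf, "102", DETAILS, "customerFirstName", "Bob");
        put(cf, "102", "Widget", "AmountSold", "7.0");
        return cf;
    }

    private static void put(Map<String, Map<String, Map<String, String>>> cf,
            String key, String superColumn, String column, String value) {
        cf.computeIfAbsent(key, k -> new LinkedHashMap<>())
          .computeIfAbsent(superColumn, k -> new LinkedHashMap<>())
          .put(column, value);
    }

    public static void main(String[] args) {
        System.out.println(customers(sample(), "cust_id"));
        System.out.println(sales(sample(), "cust_id", "prod_name"));
    }
}
```

In this sketch the row key shows up in both outputs, which is what lets the two "tables" be joined back together in SQL.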

The resulting tables look similar to the tables that we originally loaded from Oracle, and we can issue SQL queries against them just as we could have with Oracle. The queries get translated from SQL to Thrift calls against the underlying Cassandra server.

I definitely find it easier to issue SQL than to write a 200-line Java program to do the same thing! Of course, I'm not much of a Java programmer, but at a minimum, having Toad to query the Cassandra data is invaluable when checking that your program did what it was intended to do.

Guy Harrison | 2 Comments | tagged Oracle, cassandra in TCD blog post

Reader Comments (2)

Hello Guy, I see that this application is using the Eclipse platform. Will Toad for Oracle be using the Eclipse Rich Client Platform in the near future?

Thanks,
Seb

January 15, 2011 | Seb

Hi Seb.

There is a Toad for Oracle Eclipse extension... you can get it here: http://toadextensions.com/index.jspa?product=eclipse. The extension does not replace the traditional Toad client, but will provide the richest Oracle experience for those working in the Eclipse IDE.

Regards,
Guy

January 17, 2011 | Guy Harrison
