Available options to connect to Hive and Impala from a .NET application

A customer recently asked whether ODBC or JDBC is the better choice for connecting to HiveServer2 (HS2) or Impala from a .NET application. I will briefly summarise my findings below. Both ODBC and .NET originated at Microsoft, so it is natural that they work well together. And according to …

Column Stats Shows Incorrect Stats Information in Impala

I identified another Impala bug today while helping a customer troubleshoot a strange issue. The problem is that the “SHOW COLUMN STATS” command in Impala shows incorrect statistics: it either shows “-1” for the number of distinct values, or the number does not match the actual distinct count. query: show column stats test +--------+------------+------------------+--------+----------+----------+ …

Timestamp stored in Parquet file format in Impala Showing GMT Value

This article explains why Impala and Hive return different timestamp values for the same table when the table was created and populated from Hive. It also outlines the steps to force Impala to apply local time zone conversion when reading timestamp fields stored in the Parquet file format. When Hive stores a timestamp …