Impala Query Failed with ERROR “AnalysisException: ORDER BY expression not produced by aggregation output”

Recently, I discovered a bug in Impala where, if you use an expression in the ORDER BY clause, the query can fail with the error message below:

ERROR: AnalysisException: ORDER BY expression not produced by aggregation output (missing from GROUP BY clause?): (CASE WHEN TRUE THEN 1 ELSE a END)

The customer's query was very complicated, but I managed to simplify it to something like the following:

DROP TABLE IF EXISTS test;
CREATE TABLE test (a int);

SELECT (
    CASE
       WHEN (1 = 1)
       THEN 1
       ELSE a
    END) AS b
FROM test
GROUP BY 1
ORDER BY (
    CASE
       WHEN (1 = 1)
       THEN 1
       ELSE a
    END);

This can be reproduced from CDH 5.13.x onward. Since I could also reproduce it in the latest CDH 5.15.x at the time of writing, I went ahead and created a bug report in the upstream JIRA: IMPALA-7083.

As mentioned in the JIRA, the workaround is to disable ENABLE_EXPR_REWRITES via:

SET ENABLE_EXPR_REWRITES=false;

This option is on by default in recent releases.

Another workaround, which in my opinion is the better approach, is to replace the ORDER BY expression with its ordinal position:

DROP TABLE IF EXISTS test;
CREATE TABLE test (a int);

SELECT (
    CASE
       WHEN (1 = 1)
       THEN 1
       ELSE a
    END) AS b
FROM test
GROUP BY 1
ORDER BY 1;

This also makes the query simpler and easier to read.
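If the SQL is generated by a tool you can post-process, this rewrite can even be automated. Below is a minimal Python sketch (a hypothetical helper, not part of Impala) that replaces any ORDER BY expression textually matching a SELECT-list expression with its 1-based ordinal:

```python
import re

def rewrite_order_by_with_ordinals(select_exprs, order_by_exprs):
    """Replace each ORDER BY expression that matches a SELECT-list
    expression with its 1-based ordinal position.

    Comparison ignores whitespace and case, which is a simplification;
    real SQL expression normalization is more involved.
    """
    def normalize(expr):
        return re.sub(r"\s+", " ", expr).strip().lower()

    positions = {normalize(e): i + 1 for i, e in enumerate(select_exprs)}
    return [
        str(positions[normalize(e)]) if normalize(e) in positions else e
        for e in order_by_exprs
    ]

select_list = ["(CASE WHEN (1 = 1) THEN 1 ELSE a END)"]
order_by = ["(CASE WHEN (1 = 1) THEN 1 ELSE a END)"]
print(rewrite_order_by_with_ordinals(select_list, order_by))  # ['1']
```

Expressions that do not appear in the SELECT list are left untouched, since no ordinal exists for them.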

However, lots of users run queries through third-party software like SAS and have no control over the query generation; in such cases, setting ENABLE_EXPR_REWRITES to false is the way to go.

Hope the above helps.

How to Control Impala Daemon’s Memory Limit

This article explains Impala daemon’s processes and how to control the maximum memory each process can use.

An Impala daemon runs two different processes. One is written in C++ and used by the backend, mainly for query processing. The other is written in Java and used by the frontend, for query compilation, storing metadata information, etc.; it is embedded into the backend's C++ process, so the two share the same process ID. As a result, the way to control how much memory each process can use is quite different between the two.

Memory Limit for C++ Process:

To control the memory limit for the C++ backend process, so that each Impala daemon does not overcommit memory when running queries, Cloudera Manager provides a native configuration option. Simply go to Cloudera Manager Home Page > Impala > Configuration > Impala Daemon Memory Limit, as shown in the screenshot below:

Just update the value, save, and then restart Impala. To confirm the change takes effect, navigate to the Impala daemon's web UI at http://<impalad-host>:25000/varz and search for "mem_limit".
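The same check can be scripted against the /varz page. The snippet below is a sketch that extracts a startup flag from /varz-style text; the sample content is an assumption for illustration, so in practice you would fetch the real page from your Impala daemon's host:

```python
import re

# Sample lines as they might appear on an Impala daemon's /varz page
# (assumed format for illustration; fetch the real page in practice).
varz_text = """
--mem_limit=17179869184
--be_port=22000
"""

def get_flag(text, name):
    """Extract a startup flag value from /varz-style output."""
    match = re.search(r"--%s=(\S+)" % re.escape(name), text)
    return match.group(1) if match else None

mem_limit = int(get_flag(varz_text, "mem_limit"))
print(mem_limit / 1024 ** 3)  # bytes -> GB, here 16.0
```

Comparing this value against what you set in Cloudera Manager confirms the restart picked up the change.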

Memory Limit for Java Process:

By default, Impala uses a quarter of the host's physical memory, or 32GB, whichever is smaller, for its frontend Java process, which is mainly used for query compilation and storing metadata information.

Normally you do not need to change this. However, if you think it is using too much memory, or not enough, you can change it using the following steps:

1. Go to Cloudera Manager Home Page > Impala > Configuration > Impala Daemon Environment Advanced Configuration Snippet (Safety Valve)
2. Enter the following into the text box:

JAVA_TOOL_OPTIONS=-Xmx?g

Where "?" is the number of GB of memory you want to allocate to Impala's Java process.

3. Save the change and then restart Impala

To confirm that the change takes effect, run the command below on the Impala daemon's host:

sudo -u impala jcmd $(pgrep -f IMPALAD) VM.flags

You might need to add the path to Java's bin directory if the "jcmd" command returns a "command not found" error.

Sample output looks like the following:

12821:
-XX:CICompilerCount=2 -XX:InitialHeapSize=62914560 -XX:MaxHeapSize=994050048 \
-XX:MaxNewSize=331350016 -XX:MinHeapDeltaBytes=524288 -XX:NewSize=20971520 \
-XX:OldSize=41943040 -XX:+UseCompressedClassPointers -XX:+UseCompressedOops -XX:+UseParallelGC

You can compare the value of -XX:MaxHeapSize with the value you set in JAVA_TOOL_OPTIONS to make sure they match.
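Since -XX:MaxHeapSize is reported in bytes while -Xmx is usually set in GB, a quick conversion helps with the comparison. A small sketch, using the sample jcmd output above:

```python
import re

# Sample jcmd VM.flags output from above; MaxHeapSize is in bytes.
jcmd_output = (
    "-XX:CICompilerCount=2 -XX:InitialHeapSize=62914560 "
    "-XX:MaxHeapSize=994050048 -XX:MaxNewSize=331350016"
)

def max_heap_gb(vm_flags):
    """Return -XX:MaxHeapSize converted from bytes to GB."""
    match = re.search(r"-XX:MaxHeapSize=(\d+)", vm_flags)
    return int(match.group(1)) / 1024 ** 3

print(round(max_heap_gb(jcmd_output), 2))  # 0.93
```

In this sample the heap is about 0.93GB, i.e. no explicit -Xmx was applied; after setting JAVA_TOOL_OPTIONS=-Xmx?g you would expect this number to match your chosen value.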

Oozie Spark Actions Fail with Error "Spark config without '=': --conf"

Currently, Oozie provides an easy interface for Spark1 jobs via the Spark1 action, so that users do not have to embed spark-submit into a shell action. However, I recently discovered a bug in Oozie's parsing of Spark configurations, which caused it to generate an incorrect spark-submit command when submitting Spark jobs. Checking the Oozie launcher's stderr.log, I found the error below:

Error: Spark config without '=': --conf
Run with --help for usage help or --verbose for debug output
Intercepting System.exit(1)
Failing Oozie Launcher, Main class [org.apache.oozie.action.hadoop.SparkMain], exit code [1]

Also, checking stdout.log, I could see the incorrect Spark command below:

  --conf
  spark.yarn.security.tokens.hive.enabled=false
  --conf
  --conf
  spark.executor.extraClassPath=/opt/cloudera/parcels/CDH/lib/hive/lib/*:$PWD/*
  --conf
  spark.driver.extraClassPath=$PWD/*

You can see that Oozie generated a double "--conf" for the Spark command. This explains the error we saw earlier: "Spark config without '=': --conf".
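The failure mode is easy to reproduce with a few lines of Python that mimic, in simplified form, spark-submit's validation of --conf arguments (this is a sketch of the check, not the real SparkSubmit code):

```python
def check_spark_confs(args):
    """Mimic spark-submit's argument check: every --conf must be
    followed by a key=value token. Returns the first offending token,
    or None if all confs are well formed."""
    i = 0
    while i < len(args):
        if args[i] == "--conf":
            if i + 1 >= len(args) or "=" not in args[i + 1]:
                return args[i + 1] if i + 1 < len(args) else None
            i += 2  # consume the flag and its key=value
        else:
            i += 1
    return None

# The argument list Oozie generated, with the duplicated --conf:
bad_args = [
    "--conf", "spark.yarn.security.tokens.hive.enabled=false",
    "--conf", "--conf",
    "spark.executor.extraClassPath=/opt/cloudera/parcels/CDH/lib/hive/lib/*",
]
print(check_spark_confs(bad_args))  # --conf
```

The second "--conf" is consumed as the value of the first, contains no "=", and triggers exactly the "Spark config without '='" error.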

This is caused by a known issue reported upstream: OOZIE-2923.

This is a bug on the Oozie side: it wrongly parses the following configs:

--conf spark.executor.extraClassPath=...
--conf spark.driver.extraClassPath=...

The workaround is to remove the "--conf" in front of the first instance of spark.executor.extraClassPath, so that it will be added by Oozie. For example, if you have the following:

<spark-opts>
--files /etc/hive/conf/hive-site.xml 
--driver-memory 4G 
--executor-memory 2G 
... 
--conf spark.yarn.security.tokens.hive.enabled=false 
--conf spark.executor.extraClassPath=/opt/cloudera/parcels/CDH/lib/hive/lib/*
</spark-opts>

Simply remove the first "--conf" before spark.executor.extraClassPath, so it becomes:

<spark-opts>
--files /etc/hive/conf/hive-site.xml 
--driver-memory 4G 
--executor-memory 2G 
... 
--conf spark.yarn.security.tokens.hive.enabled=false  
spark.executor.extraClassPath=/opt/cloudera/parcels/CDH/lib/hive/lib/*
</spark-opts>

This will allow you to avoid the issue.

However, the downside is that if you later upgrade to a version of CDH that contains the fix for this issue, you will need to re-add the "--conf".

OOZIE-2923 affects CDH 5.10.x, CDH 5.11.0, and CDH 5.11.1. CDH 5.11.2, CDH 5.12.x, and later releases contain the fix.