Impala JDBC Driver

May 15, 2019 17:05
Edited

Hello everyone,

We tried to install the Impala JDBC Driver at Database Driver tab, but the problem comes when i have to select the Database Driver Template

Which Database Driver Template should be used?

We've tried some of them, but we got errors when we created a new Import Job using the connection.

Caused by: java.sql.SQLException: [Cloudera][ImpalaJDBCDriver](500605) Error occurred while opening a session with the server. No additional detail from the server regarding this error is available. Please ensure that the driver configuration is compatible with the server configuration. This type of error can also occur when the server is too busy to handle the request. Please try again later.

Are there any reasons to select between Hive or Impala when a Data Link is created?

We have to use Impala instead of Hive, because its a recommendation from our Cluster provider for no batch processes

Thanks in advance.

Arturo.

Comments

3 comments

Official comment
Brian Junio

May 15, 2019 22:04
Arturo,

Good afternoon!

First, we'll need to ensure that we've selected the correct JDBC driver from Cloudera's site:

https://www.cloudera.com/downloads/connectors/impala/jdbc/2-6-9.html

Next, we can review Cloudera's JDBC configuration doc:

https://www.cloudera.com/documentation/enterprise/5-9-x/topics/impala_jdbc.html

Now that we've got a solid base, lets address your questions.

>Which Database Driver Template should be used?

The templates are based around general expectations for connection strings throughout several offerings. As you may be aware, you are allowed to enter a custom string rather than relying on the existing templates.

My advice would be to review the "impala_jdbc" configuration doc above and model the connection string after their suggestion. Additional details can be found under the header "Using the Cloudera JDBC Connector".

>Are there any reasons to select between Hive or Impala when a Data Link is created?

This is a scenario where one format may offer advantages over the other in terms of cluster performance. There are no notable differences from the Datameer side when interacting with either Impala or Hive.

Do note, a JDBC Connection will suffer greatly in performance when compared to our native Hive Connector. I would suggest you create both Connection types and work with a uniform data set between the two services. A few quick use case tests should reveal to you which path is best for your configuration.

Cheers,

Brian
Comment actions Permalink
Arturo Huacre

June 10, 2019 20:36
Thanks for your response, Brian.

Now i have a Impala Connection created, but when i try to import a table, i'm getting below error in the Data Details tab:
- Can not parse input: [Cloudera][ImpalaJDBCDriver](500051) ERROR processing query/statement. Error Code: 0, SQL state: TStatus(statusCode:ERROR_STATUS, sqlState:HY000, errorMessage:AnalysisException: Syntax error in line 1: SELECT * FROM "bcp_edv_ods"."de_importeper... ^ Encountered: STRING LITERAL Expected: DEFAULT, IDENTIFIER CAUSED BY: Exception: Syntax error ), Query: SELECT * FROM "bcp_edv_ods"."de_importepercentildolpcte" WHERE ROWNUM <= 10
I am using a kerberos authentication.

Thanks in advance.

Arturo.
0

Comment actions Permalink
Alan

June 11, 2019 18:25
Hi Arturo,

We received your case in Zendesk #16304.

We will follow up with you within this ticket.
0

Comment actions Permalink

Please sign in to leave a comment.

Comments

Didn't find what you were looking for?