Flink JDBC Driver #
Flink JDBC Driver is a Java library for connecting and submitting SQL statements to SQL Gateway as the JDBC server.
Usage #
Before using Flink JDBC driver, you need to start a SQL Gateway as the JDBC server and binds it with your Flink cluster. We now assume that you have a gateway started and connected to a running Flink cluster.
Dependency #
All dependencies for JDBC driver have been packaged in flink-sql-jdbc-driver-bundle
, you can download and add the jar file in your project.
Group Id | Artifact Id | JAR |
---|---|---|
org.apache.flink |
flink-sql-jdbc-driver-bundle |
Download |
You can also add dependency of Flink JDBC driver in your maven or gradle project.
Maven Dependency
<dependency>
<groupId>org.apache.flink</groupId>
<artifactId>flink-sql-jdbc-driver-bundle</artifactId>
<version>{VERSION}</version>
</dependency>
Use with a JDBC Tool #
Use with Beeline #
Beeline is the command line tool for accessing Apache Hive, but it also supports general JDBC drivers. To install Hive and beeline, see Hive documentation.
- Download flink-jdbc-driver-bundle-{VERSION}.jar from download page and add it to
$HIVE_HOME/lib
. - Run beeline and connect to a Flink SQL gateway. As Flink SQL gateway currently ignores user names and passwords, just leave them empty.
beeline> !connect jdbc:flink://localhost:8083
- Execute any statement you want.
Sample Commands
Beeline version 3.1.3 by Apache Hive
beeline> !connect jdbc:flink://localhost:8083
Connecting to jdbc:flink://localhost:8083
Enter username for jdbc:flink://localhost:8083:
Enter password for jdbc:flink://localhost:8083:
Connected to: Flink JDBC Driver (version 1.18-SNAPSHOT)
Driver: org.apache.flink.table.jdbc.FlinkDriver (version 1.18-SNAPSHOT)
0: jdbc:flink://localhost:8083> CREATE TABLE T(
. . . . . . . . . . . . . . . > a INT,
. . . . . . . . . . . . . . . > b VARCHAR(10)
. . . . . . . . . . . . . . . > ) WITH (
. . . . . . . . . . . . . . . > 'connector' = 'filesystem',
. . . . . . . . . . . . . . . > 'path' = 'file:///tmp/T.csv',
. . . . . . . . . . . . . . . > 'format' = 'csv'
. . . . . . . . . . . . . . . > );
No rows affected (0.108 seconds)
0: jdbc:flink://localhost:8083> INSERT INTO T VALUES (1, 'Hi'), (2, 'Hello');
+-----------------------------------+
| job id |
+-----------------------------------+
| da22010cf1c962b377493fc4fc509527 |
+-----------------------------------+
1 row selected (0.952 seconds)
0: jdbc:flink://localhost:8083> SELECT * FROM T;
+----+--------+
| a | b |
+----+--------+
| 1 | Hi |
| 2 | Hello |
+----+--------+
2 rows selected (1.142 seconds)
0: jdbc:flink://localhost:8083>
Use with SqlLine #
SqlLine is a lightweight JDBC command line tool, it supports general JDBC drivers. You need to clone the codes from github and compile the project with mvn first.
- Download flink-jdbc-driver-bundle-{VERSION}.jar from download page and add it to
target
directory of SqlLine project. Notice that you need to copy slf4j-api-{slf4j.version}.jar totarget
which will be used by flink JDBC driver. - Run SqlLine with command
bin/sqlline
and connect to a Flink SQL gateway. As Flink SQL gateway currently ignores user names and passwords, just leave them empty.sqlline> !connect jdbc:flink://localhost:8083
- Execute any statement you want.
Sample Commands
sqlline version 1.12.0
sqlline> !connect jdbc:flink://localhost:8083
Enter username for jdbc:flink://localhost:8083:
Enter password for jdbc:flink://localhost:8083:
0: jdbc:flink://localhost:8083> CREATE TABLE T(
. . . . . . . . . . . . . . .)> a INT,
. . . . . . . . . . . . . . .)> b VARCHAR(10)
. . . . . . . . . . . . . . .)> ) WITH (
. . . . . . . . . . . . . . .)> 'connector' = 'filesystem',
. . . . . . . . . . . . . . .)> 'path' = 'file:///tmp/T.csv',
. . . . . . . . . . . . . . .)> 'format' = 'csv'
. . . . . . . . . . . . . . .)> );
No rows affected (0.122 seconds)
0: jdbc:flink://localhost:8083> INSERT INTO T VALUES (1, 'Hi'), (2, 'Hello');
+----------------------------------+
| job id |
+----------------------------------+
| fbade1ab4450fc57ebd5269fdf60dcfd |
+----------------------------------+
1 row selected (1.282 seconds)
0: jdbc:flink://localhost:8083> SELECT * FROM T;
+---+-------+
| a | b |
+---+-------+
| 1 | Hi |
| 2 | Hello |
+---+-------+
2 rows selected (1.955 seconds)
0: jdbc:flink://localhost:8083>
Use with Tableau #
Tableau is an interactive data visualization software. It supports Other Database (JDBC) connection from version 2018.3. You’ll need Tableau with version >= 2018.3 to use Flink JDBC driver. For general usage of Other Database (JDBC) in Tableau, see Tableau documentation.
- Download flink-jdbc-driver-(VERSION).jar from the download page and add it to Tableau driver path.
- Windows:
C:\Program Files\Tableau\Drivers
- Mac:
~/Library/Tableau/Drivers
- Linux:
/opt/tableau/tableau_driver/jdbc
- Windows:
- Select Other Database (JDBC) under Connect and fill in the url of Flink SQL gateway. Select SQL92 dialect and leave user name and password empty.
- Hit Login button and use Tableau as usual.
Use with other JDBC Tools #
Any tool supporting JDBC API can be used with Flink JDBC driver and Flink SQL gateway. See the documentation of your desired tool on how to use a JDBC driver.
Use with Application #
Use with Java #
Flink JDBC driver is a library for accessing Flink clusters through the JDBC API. For the general usage of JDBC in Java, see JDBC tutorial.
- Add the following dependency in pom.xml of project or download flink-jdbc-driver-bundle-{VERSION}.jar and add it to your classpath.
- Connect to a Flink SQL gateway in your Java code with specific url.
- Execute any statement you want.
Sample.java
public class Sample {
public static void main(String[] args) throws Exception {
try (Connection connection = DriverManager.getConnection("jdbc:flink://localhost:8083")) {
try (Statement statement = connection.createStatement()) {
statement.execute("CREATE TABLE T(\n" +
" a INT,\n" +
" b VARCHAR(10)\n" +
") WITH (\n" +
" 'connector' = 'filesystem',\n" +
" 'path' = 'file:///tmp/T.csv',\n" +
" 'format' = 'csv'\n" +
")");
statement.execute("INSERT INTO T VALUES (1, 'Hi'), (2, 'Hello')");
try (ResultSet rs = statement.executeQuery("SELECT * FROM T")) {
while (rs.next()) {
System.out.println(rs.getInt(1) + ", " + rs.getString(2));
}
}
}
}
}
}
Output
1, Hi
2, Hello
Besides DriverManager
, Flink JDBC driver supports DataSource
and you can also create connection from it.
DataSource.java
public class Sample {
public static void main(String[] args) throws Exception {
DataSource dataSource = new FlinkDataSource("jdbc:flink://localhost:8083", new Properties());
try (Connection connection = dataSource.getConnection()) {
try (Statement statement = connection.createStatement()) {
statement.execute("CREATE TABLE T(\n" +
" a INT,\n" +
" b VARCHAR(10)\n" +
") WITH (\n" +
" 'connector' = 'filesystem',\n" +
" 'path' = 'file:///tmp/T.csv',\n" +
" 'format' = 'csv'\n" +
")");
statement.execute("INSERT INTO T VALUES (1, 'Hi'), (2, 'Hello')");
try (ResultSet rs = statement.executeQuery("SELECT * FROM T")) {
while (rs.next()) {
System.out.println(rs.getInt(1) + ", " + rs.getString(2));
}
}
}
}
}
}
Use with Others #
In addition to java, Flink JDBC driver can be used by any JVM language such as scala, kotlin and ect, you can add the dependency of Flink JDBC driver in your project and use it directly.
Most applications may use data access frameworks to access data, for example, JOOQ, MyBatis and Spring Data. You can config Flink JDBC driver in them to perform Flink queries on an exist Flink cluster, just like a regular database.