This documentation is for an out-of-date version of Apache Flink. We recommend you use the latest stable version.
Table API & SQL #
Apache Flink features two relational APIs - the Table API and SQL - for unified stream and batch processing. The Table API is a language-integrated query API for Java, Scala, and Python that allows the composition of queries from relational operators such as selection, filter, and join in a very intuitive way. Flink’s SQL support is based on Apache Calcite which implements the SQL standard. Queries specified in either interface have the same semantics and specify the same result regardless of whether the input is continuous (streaming) or bounded (batch).
The Table API and SQL interfaces integrate seamlessly with each other and Flink’s DataStream API.
You can easily switch between all APIs and libraries which build upon them.
For instance, you can detect patterns from a table using MATCH_RECOGNIZE
clause
and later use the DataStream API to build alerting based on the matched patterns.
Table Program Dependencies #
You will need to add the Table API as a dependency to a project in order to use Table API & SQL for defining data pipelines.
For more information on how to configure these dependencies for Java and Scala, please refer to the project configuration section.
If you are using Python, please refer to the documentation on the Python API
Where to go next? #
- Concepts & Common API: Shared concepts and APIs of the Table API and SQL.
- Data Types: Lists pre-defined data types and their properties.
- Streaming Concepts: Streaming-specific documentation for the Table API or SQL such as configuration of time attributes and handling of updating results.
- Connect to External Systems: Available connectors and formats for reading and writing data to external systems.
- Table API: Supported operations and API for the Table API.
- SQL: Supported operations and syntax for SQL.
- Built-in Functions: Supported functions in Table API and SQL.
- SQL Client: Play around with Flink SQL and submit a table program to a cluster without programming knowledge.