public class HBaseTableSource extends Object implements BatchTableSource<Row>, ProjectableTableSource<Row>
HBaseTableSource
construction.
Use addColumn(String, String, Class)
to specify the family, qualifier, and type of columns to scan.
The TableSource returns Row
with nested Rows for each column family.
The HBaseTableSource is used as shown in the example below.
HBaseTableSource hSrc = new HBaseTableSource(conf, "hTable");
hSrc.addColumn("fam1", "col1", byte[].class);
hSrc.addColumn("fam1", "col2", Integer.class);
hSrc.addColumn("fam2", "col1", String.class);
tableEnv.registerTableSource("hTable", hSrc);
Table res = tableEnv.sql("SELECT t.fam2.col1, SUM(t.fam1.col2) FROM hTable AS t GROUP BY t.fam2.col1");
Constructor and Description |
---|
HBaseTableSource(org.apache.hadoop.conf.Configuration conf,
String tableName)
The HBase configuration and the name of the table to read.
|
Modifier and Type | Method and Description |
---|---|
void |
addColumn(String family,
String qualifier,
Class<?> clazz)
Adds a column defined by family, qualifier, and type to the table schema.
|
String |
explainSource()
Describes the table source
|
DataSet<Row> |
getDataSet(ExecutionEnvironment execEnv)
Returns the data of the table as a
DataSet . |
TypeInformation<Row> |
getReturnType()
Returns the
TypeInformation for the return type of the TableSource . |
HBaseTableSource |
projectFields(int[] fields)
Creates a copy of the
TableSource that projects its output on the specified fields. |
void |
setCharset(String charset)
Specifies the charset to parse Strings to HBase byte[] keys and String values.
|
public HBaseTableSource(org.apache.hadoop.conf.Configuration conf, String tableName)
conf
- hbase configurationtableName
- the tableNamepublic void addColumn(String family, String qualifier, Class<?> clazz)
family
- the family namequalifier
- the qualifier nameclazz
- the data type of the qualifierpublic void setCharset(String charset)
charset
- Name of the charset to use.public TypeInformation<Row> getReturnType()
TableSource
TypeInformation
for the return type of the TableSource
.getReturnType
in interface TableSource<Row>
public DataSet<Row> getDataSet(ExecutionEnvironment execEnv)
BatchTableSource
DataSet
.
NOTE: This method is for internal use only for defining a TableSource
.
Do not use it in Table API programs.
getDataSet
in interface BatchTableSource<Row>
execEnv
- (undocumented)public HBaseTableSource projectFields(int[] fields)
ProjectableTableSource
TableSource
that projects its output on the specified fields.
projectFields
in interface ProjectableTableSource<Row>
fields
- The indexes of the fields to return.TableSource
that projects its output.public String explainSource()
TableSource
explainSource
in interface TableSource<Row>
Copyright © 2014–2018 The Apache Software Foundation. All rights reserved.