Interface | Description |
---|---|
RegexTokenizerParams<T> |
Params for
RegexTokenizer . |
Class | Description |
---|---|
RegexTokenizer |
A Transformer which converts the input string to lowercase and then splits it by white spaces
based on regex.
|
RegexTokenizer.RegexTokenizerUdf |
The main logic of $
RegexTokenizer , which converts the input string to an array of
tokens. |
Copyright © 2019–2023 The Apache Software Foundation. All rights reserved.