Text driver does depends on additional libraries and is generally faster than CSV or Velocity driver.
Note: The driver doesn't use SQL syntax
| Driver class: | scriptella.driver.text.Driver |
| URL: | Text file URL. URIs are resolved relative to a script file directory.
If url has no value the output is read from/printed to the console (System.out). |
| Runtime dependencies: | None |
| Name | Description | Required |
|---|---|---|
| encoding | Specifies charset encoding of Text files. | No, the system default encoding is used. |
| eol | End-Of-Line suffix. Only valid for <script> elements. |
No, the default value is \n. |
| trim | Value of true specifies that the leading and trailing
whitespaces in text file lines should be omitted.
| No, the default value is true. |
| flush | Value of true specifies that the outputted content should flushed immediately when
the <script> element completes.
| No, the default value is false. |
| skip_lines | The number of lines to skip before start reading. | No, the default value is 0 (no lines are skipped). |
| null_string | Specifies string token to represent Java null literal.
When querying a text file, regex group equal to null_string is returned as Java null. Specify an empty string ( |
No, by default strings are preserved, i.e. empty strings are not converted to nulls and null variables references are not expanded in the output, i.e. ${nullvalue}. |
If a line or a part of it matches the pattern this match produces a virtual row in a result set.
The column names in a virtual result set correspond to matched regex group names.
For example query foo(.*) matches foobar line and the produced
result set row contains two columns(groups): 0-foobar, 1-bar. These columns
can be referenced in child script or query elements by a numeric name or by a string name columnN.
It also possible to specify more than one regular expressions to match file content. Specify each regular expression on a separate line to match them using OR condition.
The Text driver uses java.util.regex implementation for pattern matching. See java.util.Pattern
for supported syntax Javadoc.
Additional notes:
0(zero) column name in the produced result set contains the matched line.
<query>
^ERROR: (.*)
WARNING: (.*Failed.*)
([\d]+) errors?
</query>
This query consists of 3 regular expressions:
ERROR: prefixWARNING lines having Failed substring
Log file started...
INFO: INIT
WARNING: CPU is slow
WARNING: Failed to increase heap size
ERROR: Process interrupted
Operation completed with 1 error.
As the result of query execution the following set of rows is produced:
| 0 | 1 |
|---|---|
| WARNING: Failed to increase heap size | Failed to increase heap size |
| ERROR: Process interrupted | Process interrupted |
| 1 error | 1 |
Additional notes:
eol connection property.
<script>
Inserted a record with ID=$id. Table=${table}
</script>
For id=1 and table=system this script produces the following output:
Inserted a record with ID=1. Table=system
NOTE:
By default NULL variables and expressions are preserved, usenull_string connection property to specify
a string token for nulls.
For example setting null_string to empty string in the connection properties section will enable parsing
empty strings as nulls:
<connection driver="csv" url="report.csv">
null_string=
</connection>
Scriptella properties substitution engine cannot distinguish null value from unused variable or some random usage of
$var syntax,
therefore we've chosen to preserve these blocks until user explicitly specify the value of null_string.
<connection id="in" driver="text" url="data.csv">
</connection>
<connection id="out" driver="text" url="report.csv">
</connection>
<script connection-id="out">
ID;Priority;Summary;Status
</script>
<query connection-id="in">
<script connection-id="out">
$rownum;$column0;$column1;$column2
</script>
</query>
Copies rows from data.csv file to report.csv, additionally the ID column is added.
The result file is semicolon separated.