You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: pipeline/inputs/tail.md
+15-1Lines changed: 15 additions & 1 deletion
Original file line number
Diff line number
Diff line change
@@ -37,9 +37,23 @@ The plugin supports the following configuration parameters:
37
37
|`Static_Batch_Size`| Set the maximum number of bytes to process per iteration for the monitored static files (files that already exist upon Fluent Bit start). |`50M`|
38
38
|`File_Cache_Advise`| Set the `posix_fadvise` in `POSIX_FADV_DONTNEED` mode. This reduces the usage of the kernel file cache. This option is ignored if not running on Linux. |`On`|
39
39
|`Threaded`| Indicates whether to run this input in its own [thread](../../administration/multithreading.md#inputs). |`false`|
40
+
|`Unicode.Encoding`| Set the encoding which the origin of character encoding. Currently, UTF-16LE, UTF-16BE, and auto is supported. |_none_|
41
+
42
+
{% hint style="info" %} If the database parameter `DB` isn't
43
+
specified, by default the plugin reads each target file from the
44
+
beginning. This might cause unwanted behavior. For example, when a
45
+
line is bigger than `Buffer_Chunk_Size` and `Skip_Long_Lines` isn't
46
+
turned on, the file will be read from the beginning of each
47
+
`Refresh_Interval` until the file is rotated. {% endhint %}
40
48
41
49
{% hint style="info" %}
42
-
If the database parameter `DB` isn't specified, by default the plugin reads each target file from the beginning. This might cause unwanted behavior. For example, when a line is bigger than `Buffer_Chunk_Size` and `Skip_Long_Lines` isn't turned on, the file will be read from the beginning of each `Refresh_Interval` until the file is rotated.
50
+
Note that `Unicode.Encoding` depends on simdutf library which is written in C++11 or above.
51
+
So, the older platforms are not supported for this feature.
52
+
In addition, `Unicode.Encoding auto` is not covered for the all of the usages.
53
+
This is because sometimes this auto-detecting for character encodings makes a mistake to guess the correct encoding.
54
+
We recommend to use `UTF-16LE` or `UTF-16BE` if the target file encoding is pre-determined or known beforehand.
55
+
In details, this parameter requests to use 2-bytes aligned chunk and buffer sizes.
56
+
If they are not aligned for 2 bytes, Fluent Bit will use 2-bytes alignments automatically to avoid character breakages on consuming boundaries.
0 commit comments