The Regular expression parser has the following options.
flags()
Type: | assume-utf8 , check-hostname , check-program , dont-store-legacy-msghdr , empty-lines , expect-hostname , kernel , no-hostname , no-multi-line , no-parse , sanitize-utf8 , store-legacy-msghdr , store-raw-message , syslog-protocol , validate-utf8 |
Default: | empty set |
Description: Specifies the log parsing options of the source. For example:
source { network(flags(check-hostname, check-program)); };
assume-utf8
The assume-utf8
flag assumes that the incoming messages are UTF-8 encoded, but does not verify the encoding. If you explicitly want to validate the UTF-8 encoding of the incoming message, use the validate-utf8
flag.
check-hostname
When receiving messages, AxoSyslog can check whether the hostname contains valid characters.
Valid characters are:
- alphanumeric characters (A-Z, a-z, 0-9)
- the dash (
-
) and underscore (_
) characters - the dot (
.
) and the colon (:
) characters - the
@
and slash (/
)
If the hostname contains invalid characters, AxoSyslog sets the syslog.invalid_hostname
tag for the message, and doesn’t parse the ${HOST}
field from the message.
check-program
Available in version 4.10 and later.
If the check-program
flag is enabled, AxoSyslog validates the ${PROGRAM}
field for RFC3164-formatted messages. Valid program names meet the following criteria:
- Contain only these characters:
[a-zA-Z0-9-_/().]
- Include at least one alphabetical character.
If the program name fails validation, it is considered to be part of the log message.
dont-store-legacy-msghdr
By default, AxoSyslog stores the original incoming header of the log message. This is useful if the original format of a non-syslog-compliant message must be retained (AxoSyslog automatically corrects minor header errors, for example, adds a whitespace before msg
in the following message: Jan 22 10:06:11 host program:msg
). If you do not want to store the original header of the message, enable the dont-store-legacy-msghdr
flag.
empty-lines
Use the empty-lines
flag to keep the empty lines of the messages. By default, AxoSyslog removes empty lines automatically.
exit-on-eof
If this flag is set on a source, AxoSyslog stops when an EOF (end of file) is received. Available in version 4.9 and later.
expect-hostname
If the expect-hostname
flag is enabled, AxoSyslog will assume that the log message contains a hostname and parse the message accordingly. This is the default behavior for TCP sources. Note that pipe sources use the no-hostname
flag by default.
guess-timezone
Attempt to guess the timezone of the message if this information is not available in the message. Works when the incoming message stream is close to real time, and the timezone information is missing from the timestamp.
kernel
The kernel
flag makes the source default to the LOG_KERN | LOG_NOTICE
priority if not specified otherwise.
no-header
The no-header
flag triggers AxoSyslog to parse only the PRI
field of incoming messages, and put the rest of the message contents into $MSG
.
Its functionality is similar to that of the no-parse
flag, except the no-header
flag does not skip the PRI
field.
no-header
flag signals AxoSyslog that the syslog
header is not present (or does not adhere to the conventions / RFCs), so the entire message (except from the PRI
field) is put into $MSG
.
The following example illustrates using the no-header
flag with the syslog-parser()
parser:
parser p_syslog {
syslog-parser(
flags(no-header)
);
};
no-hostname
Enable the no-hostname
flag if the log message does not include the hostname of the sender host. That way AxoSyslog assumes that the first part of the message header is ${PROGRAM} instead of ${HOST}. For example:
source s_dell {
network(
port(2000)
flags(no-hostname)
);
};
no-multi-line
The no-multi-line
flag disables line-breaking in the messages: the entire message is converted to a single line. Note that this happens only if the underlying transport method actually supports multi-line messages. Currently the file()
and pipe()
drivers support multi-line messages.
no-parse
By default, AxoSyslog parses incoming messages as syslog messages. The no-parse
flag completely disables syslog message parsing and processes the complete line as the message part of a syslog message. The AxoSyslog application will generate a new syslog header (timestamp, host, and so on) automatically and put the entire incoming message into the MESSAGE part of the syslog message (available using the ${MESSAGE}
macro). This flag is useful for parsing messages not complying to the syslog format.
If you are using the flags(no-parse)
option, then syslog message parsing is completely disabled, and the entire incoming message is treated as the ${MESSAGE} part of a syslog message. In this case, AxoSyslog generates a new syslog header (timestamp, host, and so on) automatically. Note that even though flags(no-parse)
disables message parsing, some flags can still be used, for example, the no-multi-line
flag.
sanitize-utf8
When using the sanitize-utf8
flag, AxoSyslog converts non-UTF-8 input to an escaped form, which is valid UTF-8.
Prior to version 4.6, this flag worked only when parsing RFC3164 messages. Starting with version 4.6, it works also for RFC5424 and raw messages.
store-legacy-msghdr
By default, AxoSyslog stores the original incoming header of the log message, so this flag is active. To disable it, use the dont-store-legacy-msghdr
flag.
store-raw-message
Save the original message as received from the client in the ${RAWMSG}
macro. You can forward this raw message in its original form to another AxoSyslog node using the syslog-ng()
destination, or to a SIEM system, ensuring that the SIEM can process it. Available only in 3.16 and later.
syslog-protocol
The syslog-protocol
flag specifies that incoming messages are expected to be formatted according to the new IETF syslog protocol standard (RFC5424), but without the frame header. Note that this flag is not needed for the syslog
driver, which handles only messages that have a frame header.
validate-utf8
The validate-utf8
flag enables encoding-verification for messages.
Prior to version 4.6, this flag worked only when parsing RFC3164 messages. Starting with version 4.6, it works also for RFC5424 and raw messages.
For RFC5424-formatted messages, if the BOM character is missing, but the message is otherwise UTF-8 compliant, AxoSyslog automatically adds the BOM character to the message.
The byte order mark (BOM) is a Unicode character used to signal the byte-order of the message text.
patterns()
Synopsis: | patterns(“pattern1” “pattern2”) |
Mandatory: | yes |
Description: The regular expression patterns that you want to find a match. regexp-parser()
supports multiple patterns, and stops the processing at the first successful match.
prefix()
Synopsis: | prefix() |
Description: Insert a prefix before the name part of the parsed name-value pairs to help further processing. For example:
-
To insert the
my-parsed-data.
prefix, use theprefix(my-parsed-data.)
option. -
To refer to a particular data that has a prefix, use the prefix in the name of the macro, for example,
${my-parsed-data.name}
. -
If you forward the parsed messages using the IETF-syslog protocol, you can insert all the parsed data into the SDATA part of the message using the
prefix(.SDATA.my-parsed-data.)
option.
Names starting with a dot (for example, .example
) are reserved for use by AxoSyslog. If you use such a macro name as the name of a parsed value, it will attempt to replace the original value of the macro (note that only soft macros can be overwritten, see Hard versus soft macros for details). To avoid such problems, use a prefix when naming the parsed values, for example, prefix(my-parsed-data.)
This parser does not have a default prefix. To configure a custom prefix, use the following format:
parser p_regexp{
regexp-parser(
patterns( ... )
prefix("myprefix.")
);
};
template()
Synopsis: | template("${<macroname>}") |
Description: The macro that contains the part of the message that the parser will process. It can also be a macro created by a previous parser of the log path. By default, the parser processes the entire message (${MESSAGE}
).