References to Email policy content are only available when an Email Gateway is included in the peer group. |
You use the New Expression dialog to create lexical expressions.
You can create your own lexical expressions directly, or you can use text entities to build them. A text entity is a pre-defined or user-defined component or building block that you can include in a lexical expression.
There are three types of text entity:
Predefined Entities are pre-configured, standard lexical patterns which are frequently used. For example predefined entities can match against Credit Card Numbers or identification numbers from different regions (identity card, driving license, passport and Japanese My Number). Predefined entities are fixed patterns and cannot be edited.
You can configure your own reusable text entities. These user-defined entities are displayed in a list and are available for use in lexical expressions.
Lexical expression qualifiers are specific values you might want to detect, rather than general lexical patterns. For example, you might have a particular list of identification numbers you want to redact or block. You can import the list as a set of qualifiers and then use them in a lexical expression. See Lexical Expression Qualifiers for more information.
When creating plain-text lexical expressions, you should be aware that the
Punctuation is ignored in the text unless it is specified in the expression. Partial words are not matched.
Expression | Text | Match? |
---|---|---|
here and there | here and there | yes |
here and there | here, and there | yes |
here and there | here-and-there | yes |
don't | don't | yes |
dont | don't | no |
don't | dont | no |
don t | don't | yes |
don't | don t | no |
butt | butter | no |
swear | menswear | no |
pig | spigot | no |
Weighting score for lexical expressions. This value determines how the expression impacts your security policy.
A Detect Lexical Expression content rule named Cakes and Pastries has been designed to detect the names of confectionery products.
An expression list (below) has been configured with a total weight Threshold of 10.
Custom expressions for each of your products are added as follows. Each expression is given a Weight.
If a communication contains the expressions 'Doughnuts' and 'Iced buns', the combined weight (6+5=11) outweighs the threshold (10) and the content rule will trigger. However, if 'iced buns' and 'cookies' are detected, the combined weight (5+3=8) is not sufficient to trigger the content rule.
Expressions with Instant weighting (such as 'Cake' in this example) will always trigger the content rule, regardless of the threshold.
Expression Lists can also be configured to count the weight of expressions only once in each part of the message. For example, a message with the subject: 'Cookies, Cookies, Cookies!' would add a combined weight of +3 if this option is activated. |
You can use logical operators to construct lexical Expressions. Operators must be placed between keywords, phrases or text entities.
Operator | Description |
---|---|
AND | Both keywords or phrases must be present. |
OR | One or both keywords or phrases must be present. |
XOR | One of the keywords or phrases must be present, but not both. |
BEFORE | Both keywords or phrases must be present and the keyword or phrase that precedes the operator must occur before the keyword or phrase that follows the operator. |
AFTER | Both keywords or phrases must be present and the keyword or phrase that precedes the operator must occur after the keyword or phrase that follows the operator. |
FOLLOWEDBY=x | Both keywords or phrases must be present, and the keyword or phrase that follows the operator must be within x words of the one that precedes the operator. |
NEAR | Both keywords or phrases must be present, and they must be within ten words of one another. The expressions may occur in either order. |
ANDNOT | The keyword or phrase that precedes the operator must be present, and the keyword or phrase that follows the operator must not be present. |
If you want to use several operators, use brackets to ensure the correct logical behavior. The |
The Case sensitive check box applies case-sensitive matching to your lexical expression.
Lexical Expressions can be set to Redact enabling them for possible redaction.
Use a weight of Instant for redactable lexical expressions where possible. |
The Summary box can be used to display any additional notes associated with your lexical expression. You can enter up to 180 characters.
In the Expressions section, click New.
The New Expression dialog is displayed.
Enter the expression. You can build your own lexical expression by typing it into the Expression box, or you can build an expression using text entities:
Predefined
Select the If matched weighting score for your new expression.
I want to detect any credit numbers that may be accidentally included in messages.
Create (or Edit) a Lexical Expression List which includes a lexical expression with the Predefined Text Entity: Credit Card.
Select Credit Card from the list of available options.
Click Add. The expression is added to your expression list.
You can configure a Detect Lexical Expression or a Redact Text content rule which uses the expression list you have created. |
I want to create a text entity which detects Employee ID Numbers in the format (AB1234), where AB represents a two-letter department code and 1234 represents any four digit number up to 9999.
Create a user-defined entity which is a regular expression. This can be configured using POSIX/PERL regular expressions to match your required format.
Configure the Overview section to name your expression and add any relevant Notes.
A User Defined Entity will match any string using the specified format, provided it is not adjacent to another alphanumeric character. For example, GSD7892 would not be detected but G SD7892 would produce a match. |
Use Lexical Expression Qualifiers to import a specific list of values (for example, your full list of Employee ID Numbers). See Lexical Expression Qualifiers for more information. |
Click Save. The expression is added to your list of User Defined Entities for future use.
I need an expression that detects US Social Security Numbers and Employee ID Numbers.
Create a Lexical Expression that uses an AND operator to detect US Social Security Numbers (predefined text entity) and Employee ID Numbers (user defined entity).
Click the Predefined tab to view the list of predefined text entities. Select Social Security Number (US).
Click the AND operator.
All entities in a custom expression must be separated by operators. |
Click Add to include the expression in your expression list.
You can configure a Detect Lexical Expression Content Rule which uses the expression list you have created. |
I need an expression that detects the name of my product, "Email Analyzer". I want this term to be redacted when sent from inside my organization.
Create a Lexical Expression and enter the text in the Expression box. Set the expression to case sensitive and enable redaction.
Set If matched (weighting score) to a suitable weighting.
Give redactable terms a high or Instant weight, where possible. |
Enter "Email Analyzer" into the Expression box.
You might want to also detect a variant spelling, such as "Email Analyser". You can use a regular expression: .PERL.Email Analy(z|s)er to achieve this, or your could create a user defined entity for each variant and apply them to the expression using an .OR. operator. |
Select Redact.
Selecting Redact enables redaction of the selected expression. To apply redaction policy, the expression list must be used by a Redact Text content rule. | |
Expression Syntax To match the parentheses characters '(', ')' or period/full stop '.', the character must be preceded by '\'. For example: to match the text (CAT), including the brackets, the expression should be defined as: \(CAT\). |
I want to detect Business Identifier Codes but avoid detecting any false-positives.
Create a Lexical Expression using the BIC predefined text entity. Apply a Lexical Expression Qualifier to narrow the search to specific values. This avoids the possibility of detecting false-positives.
Select BIC (Business Identifier Code) from the Predefined tab.
Insert the cursor to the left of the closing period character.
.PATTERN=BIC|.
Select a Qualifier from the Qualifiers tab.
Qualifiers specify unique values, stored in an external data source. For more information, see Lexical Expression Qualifiers.
.PATTERN=BIC?qualifier-name.
Set the If matched (weighting score).
You cannot change the case sensitivity of a predefined text entity. |
Click Add. The expression is added to your expression list.
You can configure a Detect Lexical Expression or a Redact Text content rule which uses the expression list you have created. |
© 1995–2018 Clearswift Ltd.