Rule Types and Modifiers
In CHARTextract, each label has a set of primary rules, and each primary rule has a corresponding score.
Primary Rules
A primary rule consists of a regular expression and a score. A primary rule may have secondary rules attached for further refinement. Primary rules are checked for each label. If a primary rule does not match, its secondary rules are not checked.
If there is a match in the primary rule, the corresponding label will be assigned the primary rule’s score.
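The behavior described above can be sketched in a few lines of Python. This is only an illustration, not CHARTextract's actual implementation: the names PrimaryRule and score_labels are hypothetical, the example patterns are invented, and the choice to sum the scores of all matching rules for a label is an assumption, since the text above does not say how several matching rules are combined.

```python
import re
from dataclasses import dataclass


@dataclass
class PrimaryRule:
    """Hypothetical container: a regular expression plus a score."""
    pattern: str
    score: int


def score_labels(sentence, rules_by_label):
    """Check every label's primary rules against one sentence.

    Assumption: scores of all matching rules for a label are summed.
    A rule that does not match contributes nothing.
    """
    scores = {}
    for label, rules in rules_by_label.items():
        scores[label] = sum(
            rule.score
            for rule in rules
            if re.search(rule.pattern, sentence, re.IGNORECASE)
        )
    return scores


# Invented example rules for two labels.
rules_by_label = {
    "smoker": [PrimaryRule(r"\bsmok(es|er|ing)\b", 2)],
    "non-smoker": [PrimaryRule(r"\bnever smoked\b", 2)],
}

print(score_labels("Patient smokes daily.", rules_by_label))
```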
Secondary Rules
A secondary rule consists of a regular expression, a score, and a rule type. Secondary rules are always attached to a primary rule. If a secondary rule’s primary rule does not match, the secondary rule is never triggered.
If there is a match in the primary rule, each secondary rule is checked in order. Secondary rules can either replace the primary rule’s score or ignore the primary rule.
Rule Types
Rule Type: Replace
The Replace rule type replaces the score of the primary rule with the secondary rule’s score if the secondary rule matches.
Rule Type: Ignore
The Ignore rule type ignores the score of the primary rule if the secondary rule matches.
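As a rough illustration of how the two rule types might interact with a primary rule, consider the sketch below. It is not taken from CHARTextract's code: the class and function names are hypothetical, the patterns are invented, and two details are assumptions, namely that an ignored primary rule contributes a score of 0 and that a later matching Replace rule overrides an earlier one.

```python
import re
from dataclasses import dataclass, field


@dataclass
class SecondaryRule:
    """Hypothetical container for a secondary rule."""
    pattern: str
    rule_type: str  # "replace" or "ignore"
    score: int = 0  # only meaningful for "replace"


@dataclass
class PrimaryRule:
    """Hypothetical container for a primary rule and its secondary rules."""
    pattern: str
    score: int
    secondary: list = field(default_factory=list)


def evaluate(rule, sentence):
    """Return the score one primary rule contributes for one sentence."""
    if re.search(rule.pattern, sentence, re.IGNORECASE) is None:
        return 0  # no primary match: secondary rules are never checked
    score = rule.score
    for sec in rule.secondary:  # secondary rules are checked in order
        if re.search(sec.pattern, sentence, re.IGNORECASE):
            if sec.rule_type == "ignore":
                return 0  # assumption: an ignored rule contributes nothing
            if sec.rule_type == "replace":
                score = sec.score  # assumption: the last matching Replace wins
    return score


# Invented example: a smoking rule refined by two secondary rules.
rule = PrimaryRule(
    pattern=r"smok",
    score=2,
    secondary=[
        SecondaryRule(r"\b(never|denies)\b", "replace", score=-1),
        SecondaryRule(r"\bfamily history\b", "ignore"),
    ],
)

for text in ("Patient smokes daily.",
             "Patient denies smoking.",
             "Family history of smoking."):
    print(text, evaluate(rule, text))
```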
Rule Modifiers
Rule modifiers can be used with the secondary rules and allow for further rule refinement.
Rule Modifier: Before
The Before modifier can be used with the Replace and Ignore rule types. The Before modifier will only match a secondary rule in the sentence segment before the primary rule match.
Rule Modifier: After
The After modifier can be used with the Replace and Ignore rule types. The After modifier will only match a secondary rule in the sentence segment after the primary rule match.
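One way to picture the Before and After modifiers is as a restriction on which slice of the sentence the secondary rule is searched in. The sketch below is an assumption-laden illustration rather than CHARTextract's implementation: segment_for and secondary_matches are hypothetical names, the modifier is passed as a plain string, and only the first primary match in the sentence is considered.

```python
import re


def segment_for(modifier, sentence, primary_match):
    """Return the part of the sentence a secondary rule may search.

    "before" -> text preceding the primary match
    "after"  -> text following the primary match
    anything else (e.g. None) -> the whole sentence
    """
    if modifier == "before":
        return sentence[:primary_match.start()]
    if modifier == "after":
        return sentence[primary_match.end():]
    return sentence


def secondary_matches(secondary_pattern, modifier, sentence, primary_pattern):
    """Check a secondary rule, honoring its modifier.

    Assumption: only the first primary match is used to split the sentence.
    """
    primary_match = re.search(primary_pattern, sentence, re.IGNORECASE)
    if primary_match is None:
        return False  # secondary rules never trigger without a primary match
    segment = segment_for(modifier, sentence, primary_match)
    return re.search(secondary_pattern, segment, re.IGNORECASE) is not None


sentence = "No cough, but reports fever."
print(secondary_matches(r"\bno\b", "before", sentence, r"cough"))
print(secondary_matches(r"\bno\b", "after", sentence, r"cough"))
```

In this example the negation "No" appears before the primary match "cough", so a secondary rule with the Before modifier would match it, while the same rule with the After modifier would not.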