Swear-word token enricher.
This enricher adds
swear boolean metadata property to the token
instance if word it represents is in a swear word dictionary, i.e. the swear dictionary contains this word's
stem. The value
true of the metadata property indicates that this word's stem is found in the dictionary,
false value indicates otherwise.
- Value parameters:
Relative path, absolute path, classpath resource or URL to the dictionary. The dictionary should have a simple plain text format with one lemma per line, empty lines are skipped, duplicates ignored, lines starting with # symbol will be treated as comments and ignored. Note that the search in the dictionary is implemented using words' stem and case is ignored.
Stemmer implementation for the language used in the supplied swear-word dictionary.
- See also: