Merative Annotator for Clinical Data Container Edition

Spell Check

This annotator identifies misspelled words and phrases in a document and suggests corrections. You can use the spell check annotator as a standalone preprocessing step for your data or it can be used as part of a larger annotator flow.

Spell check can also be configured to recognize and correct to surface forms in custom dictionaries if the dictionaries are enabled for spell check in the cartridge configuration.

Configurations

ConfigurationValuesDescription
debugtrue/falseWhen true, the spell check annotator will provide an additional field with a human-readable rendering of the corrections that were applied to the source document.
spell_check_profiledefault/ocrA spell check profile defines the basics about the behavior of the spell check service. The default profile which is suitable for common human typos. The ocr profile is a more aggressive profile that tries to correct errors that are introduced by optical character recognition systems.
apply_spell_correctionstrue/falseWhen true, spell check applies high confidence corrections to the container text.

Annotation Types

  • spellCorrectedText
  • spellingCorrections
  • suggestions

spellCorrectedText

FeatureDescription
correctedTextThe document text with spelling corrections applied.
debugTextA debug version of the spell corrected document text that shows a where the corrections were applied in the original text.

spellingCorrections

FeatureDescription
beginThe start position of the misspelled word as a character offset into the text. The smallest possible start position is 0.
end<The end position of the misspelled word as a character offset into the text. The end position points to the first character after the spelling correction, such that end-begin equals the length of the coveredText.
coveredTextThe text of the misspelled word.

suggestions

FeatureDescription
appliedWhen true, this indicates that this correction was applied in the correctedText version of the document.
textThe text of the spelling suggestion.

Sample Response

Sample response from the spell check annotator for the text: The patient had an ovariactomy.

{
"unstructured": [
{
"text": "The patient had an ovariactomy",
"data": {
"spellCorrectedText": [
{
"correctedText": "The patient had an ovariectomy"
}