A categorization profile contains the categories and subcategories that the Conversation Analyzer feature uses to categorize transcripts of call recordings. The profile also contains any substitution and redaction rules you provide, which Conversation Analyzer uses to refine the transcribed text.

The categorization profile applies to the associated account. For information about where you can view the categorized recordings and refined transcripts, see Listening to and commenting on a call recording.

...

  • Substitution rules replace commonly mis-transcribed words and correct the spelling of words. You will most likely need these rules for proper nouns, such as place, company, or product names. For example, Conversation Analyzer may transcribe 'Basingstoke' as 'Beijing spoke', or 'Vonage Contact Center' as 'new voice media'. Create rules that replace the incorrect word or words with the correct text.
  • Redaction rules replace sensitive information such as credit card details. A redaction rule is a specific type of substitution rule: instead of refining and clarifying phrases in the transcript output, it obscures the content. Use a redaction rule to replace specified text with text such as '(redacted)', '(removed)', or 'xxxxxxxxxxxxxx'.

...

{
    "name": "Subs_example",
    "language": "en-us",
    "categories": [ ],
    "substitution": [
        {
            "party": "agent",
            "find": "new voice media",
            "replace": "Vonage Contact Center"
        },
        {
            "party": "either",
            "find": "Beijing spoke",
            "replace": "Basingstoke"
        },
        {
            "party": "customer",
            "find": "my card number is *",
            "replace": "xxxx xxxx xxxx xxxx"
        }
    ]
}
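
To make the effect of such rules concrete, the following Python sketch applies a list of rules to one line of transcript. It is an illustration only and does not describe how Conversation Analyzer is implemented. It assumes that matching is case-insensitive and on whole words, that a party value of "either" applies to both speakers, and that rules are applied in order. It skips any rule whose find value contains '*', because wildcard behavior is not described here. The function name apply_substitutions and the second rule in the usage example are hypothetical.

```python
import re


def apply_substitutions(text, speaker, rules):
    """Apply each matching rule to text and return the result.

    speaker is "agent" or "customer". All matching behavior here is an
    assumption made for illustration, not the product's documented logic.
    """
    for rule in rules:
        # Assumption: "either" means the rule applies to both speakers.
        if rule["party"] not in (speaker, "either"):
            continue
        # Assumption: wildcard rules are out of scope for this sketch.
        if "*" in rule["find"]:
            continue
        pattern = re.compile(
            r"\b" + re.escape(rule["find"]) + r"\b", re.IGNORECASE
        )
        replacement = rule["replace"]
        # A function is used so the replacement is inserted literally.
        text = pattern.sub(lambda _match: replacement, text)
    return text


rules = [
    {"party": "either", "find": "Beijing spoke", "replace": "Basingstoke"},
    # Hypothetical redaction rule that applies to the agent only.
    {"party": "agent", "find": "pass word", "replace": "(redacted)"},
]

print(apply_substitutions("I live in beijing spoke", "customer", rules))
print(apply_substitutions("your pass word", "agent", rules))
```

Under these assumptions, the first call rewrites the customer's line because the rule's party is "either", and the second rule changes only lines spoken by the agent.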


...

Valid expression and find values contain only alphanumeric, apostrophe and space characters; that is, values can contain spaces (U+0020), apostrophes (U+0027), and characters from the following Unicode categories:

| Unicode Category Name | Description |
|---|---|
| Ll | Letter, Lowercase. For example, a-z, ᵯ, ḅ, ṥ, ở, ﬓ |
| Lu | Letter, Uppercase. For example, A-Z, Ý, Ŧ, Ǣ, Щ, 𝕐 |
| Lt | Letter, Titlecase. For example, Dž, ᾎ, ᾟ, ᾭ |
| Lo | Letter, Other. For example, ª, ܗ, 爨. The Mongolian letter "Manchu Ali Gali Lha" (U+18AA) is not allowed within expression and find values. This character is used internally within the categorization engine. If the character appears within spoken text, Conversation Analyzer treats the character as an apostrophe. |
| Lm | Letter, Modifier. For example, ʰ, ᵓ, 〲, ꟹ |
| Mn | Mark, Nonspacing. For example, ុ, ᜴ |
| Nd | Number, Decimal Digit. For example, 0-9, ۳, ૮, ๗ |
| Pc | Punctuation, Connector. For example, _, ‿, ⁀, ⁔, ︳, ︴, ﹍, ﹎, ﹏, ＿. This category includes ten characters; the most commonly used is the LOW LINE character (_), U+005F. |
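
The character rule above can be checked before you add a value to a profile. The following Python sketch uses the standard unicodedata module to report characters that fall outside the listed categories. It covers only the categories named in this section, plus the space, the apostrophe, and the U+18AA exclusion; the function name invalid_characters is hypothetical. It also reports characters such as the '*' used in the example profile, because this section does not describe how such characters are treated.

```python
import unicodedata

# Unicode general categories listed in this section.
ALLOWED_CATEGORIES = {"Ll", "Lu", "Lt", "Lo", "Lm", "Mn", "Nd", "Pc"}

# Mongolian letter Manchu Ali Gali Lha, excluded although it is in Lo.
RESERVED_CHARACTER = "\u18aa"


def invalid_characters(value):
    """Return the characters in value that the rule above does not allow."""
    invalid = []
    for ch in value:
        # Spaces (U+0020) and apostrophes (U+0027) are always allowed.
        if ch in (" ", "'"):
            continue
        if ch == RESERVED_CHARACTER:
            invalid.append(ch)
        elif unicodedata.category(ch) not in ALLOWED_CATEGORIES:
            invalid.append(ch)
    return invalid


print(invalid_characters("Beijing spoke"))
print(invalid_characters("card-number"))
```

An empty list means every character in the value satisfies the rule as stated here.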

...