Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

A categorization profile contains categories and subcategories for the Conversation Analyzer feature. Conversation Analyzer uses the profile to categorize transcripts of call recordings. The profile also contains any substitution and redaction rules you provide. Using the substitution and redaction rules, Conversation Analyzer refines the transcribed text.

The categorization profile applies to the associated account. For information about where you can view the categorized recordings and refined transcripts, see Listening to and commenting on a call an audio recording.

Categorization profiles are written in JavaScript Object Notation (JSON). For information about JSON, see https://www.json.org/.

...

  • name (a name/value pair)
  • language (a name/value pair)
  • skipCallsUnder (an integer)

  • categories (an array of category objects).
    Each category consists of the following:
    • name (a name/value pair)
    • rules (an array of one or more categorization rule objects).

      Info

      Not used.


    • subcategories (an array of one or more subcategory objects).
      Each subcategory consists of the following:
      • name (a name/value pair)
      • rules (an array of one or more categorization rule objects).
        Each categorization rule object consists of the following:
        • party (a name/value pair)
        • expression (a name/value pair)
      • subcategories (an array of one or more nested subcategory objects).

        Info

        Not used.


    For more information about categorization rules, see see Categorization rules.
  • substitution (an array of substitution rule objects).
    Each substitution rule consists of the following:
    • party (a name/value pair)
    • find (a name/value pair)
    • replace (a name/value pair)
    For more information about substitution rules, see Substitution and redaction rules.
Panel
borderColor#eeeeee
bgColorwhite
titleColorwhite
borderWidth1
titleBGColor#232323
borderStylesolid
titleIn this page

Table of Contents
depth2

...

Note
titleAnalyzing transcripts

Conversation Analyzer analyzes transcripts in several steps:

  1. Conversation Analyzer identifies characters in the transcripts. Characters are either word or non-word characters. Characters from the Unicode categories (see expression and find value validation), plus apostrophes, are word characters. Other characters are non-word characters and act as word separators. Non-word characters include !, £, $, %, ^, &, *, (, ), and -.
  2. Conversation Analyzer uses findings from step 1 to identify the individual words in the transcripts.
  3. Conversation Analyzer looks for words in the transcripts that match the rules in the categorization profile:
    1. Conversation Analyzer applies substitution rules first, replacing text if found.
    2. Conversation Analyzer tags the processed transcripts with the corresponding categories if found.

...

The categorization expression language describes the format of an expression. The language supports simple expressions where the presence of the exact word or phrase would result in a match. For information about the categorization expression language, see Categorization expression language.

Substitution and redaction rules

...

  • Substitution rules replace commonly mis-transcribed words and improve the spelling of words. You will most likely require these rules for proper nouns, such as place, company or product names. For example, Conversation Analyzer may transcribe 'Basingstoke' as 'Beijing spoke', or 'Vonage Contact Center' as 'new voice media'. Create rules that replace the incorrect word or words.
  • Redaction rules replace sensitive information such as credit card details. Redaction rules are specific type of substitution rules in that instead of using them to refine and clarify phrases in the transcript output, you use them to obscure the content. Use a redaction rule to replace specified text with text such as '(redacted)', '(removed)', or 'xxxxxxxxxxxxxx'.

...

The categorization expression language describes the format of the value in the find name/value pair. The language supports simple values where the presence of the exact word or phrase would result in a match. For information about the categorization expression language, see see Categorization expression language.

replace

The replace name/value pair in a rule defines the text that will replace the found text.

Applying substitution and redaction rules result in Conversation Analyzer modifying transcript text. Because of this, you must take extra care when writing your rules. For more information about substitution rules, see Substitution and redaction rules continued.

Categorization expression language

...

Valid expression and find values contain only alphanumeric, apostrophe and space characters; that is, values can contain spaces (U+0020), apostrophes (U+0027), and characters from the following Unicode categories:

Unicode Category Name
Description
Ll

Letter, Lowercase.

For example, a-z, ᵯ, ḅ, ṥ, ở, ﬓ

Lu

Letter, Uppercase.

For example, A-Z, Ý, Ŧ, Ǣ, Щ, 𝕐

LtLetter, Titlecase.

For example, Dž, ᾎ, ᾟ, ᾭ

Lo

Letter, Other (e.g. ª, ܗ, 爨)

The Mongolian Letter "Manchu Ali Gali Lha" (U+18AA,) is not allowed within expression and find values. This character is used internally within the categorisation engine. If the character appears within spoken text, conversation analyzer treats the character as an apostrophe.

LmLetter, Modifier.

For example, ʰ, ᵓ, 〲, ꟹ

MnMark, Nonspacing.

For example, ុ, ᜴

NdNumber, Decimal Digit.

For example, 0-9, ۳, ૮, ๗

Pc

Punctuation, Connector.

For example, _, ‿, ⁀, ⁔, ︳, ︴, ﹍, ﹎, ﹏, _

This category includes ten characters; the most commonly used is the LOW LINE character (_), u+005F.

...

Wildcard
Description
Example expressions
Details
?Wildcard representing one character
Each ? represents one character.
wh?

The following words will match the example expression: "who" and "why". For an example of an expression using the ? wildcard, see see Example 2. Expression using the ? character wildcard.

wh??The following words will match the example expression: "what", "when", "whom". For an example of an expression using the ?? wildcard, see see Example 5. Expression using the ?? wildcard.
*
Anchor
Characters
Characters
Wildcard representing zero to many characters
sit*

The following words will match the example expression: "sit", "sits", "sitting". For an example of an expression using the * wildcard, see Example 3. Expression using the * character wildcard.

Note

To use * to represent a character or characters, ensure that the * is contiguous with the characters in the containing word.

You can also use * to represent a word or words. For information, see Wildcard representing zero to many words.


#Wildcard representing one numeric character ###

Only digits will match the example expression, not text.

Text containing "123" will match the example expression but text containing "one two three" will not.

For an example of an expression using the # wildcard, see Example 4. Expression using the # character wildcard.

*
Anchor
Words
Words
Wildcard representing zero to many words
cat * mat

The following phrases will match the example expression: "cat mat", "cat sits on the mat", and "cat always sits happily on the mat".

For an example of an expression using the * wildcard, see Example see Example 6. Expression using the * word wildcard.

Note

To use * to represent a word or words, type a space between the * and any other characters in the expression.

You can also use * to represent a character or characters. For information, see Wildcard representing zero to many characters.


~NRepresents the number of words that can appear between the specified words in a phrasecat mat ~4

A phrase that contains N or fewer words between the specified words will match the example expression.

The following phrases will match the example expression: "cat mat", "cat sits on the mat", and "cat always sits on the mat".

For an example of an expression using the ~N wildcard, see Example 7. Expression using the ~N wildcard.

Note

If used, the expression must appear at the end of the phrase.

If the expression contains more than two words, ~N applies to the number of words between any of the specified words.

For an example of an expression using using the ~N wildcard with more than two words, see Example 8. Expression using the ~N wildcard.


expression examples 

Example 1. Simple expression

...