All Collections
7taps Help Center
How to manage AI videos
How to manage AI videos

Occasionally AI system mispronounces words or makes strange pauses. Below are some quick tips to get AI videos sound just right.

Updated over a week ago

Quick Note

Accents and voice expressions are available only on 7taps Enterprise.

Quick Tips

๐Ÿšซ Don't mix languages: eg. don't use English words in a Spanish script.

๐Ÿ“ Spell words correctly: make sure you have used correct spelling in your script.

๐Ÿ’ฌ Insert breaks if needed: you can add additional breaks into the script by inserting break tags - for example <break time="2s" />

โœ๏ธ Use punctuation marks: a script without proper use of commas and periods would sound too fast and hard to listen to. Use periods, commas, hyphens, question marks to help our AI system sound as you would want it to sound.

๐Ÿ—ฃ๏ธ Fix pronunciation of words, acronyms and numbers if needed: for example, it's useful to sometimes split words with a hyphen sign to help our AI system pronounce them correctly. So an example would to write "con-tent" instead of normal "content".

Most importantly - use breaks and hyphens: Improving voice results is a matter of creatively using periods/commas and sometimes rearranging sentence structure. We highly recommend getting used to sometimes inserting break tags and influencing specific word pronunciation by splitting words with hyphens.

Add additional breaks to the video

Our voices support a so-called SSML markup language. This markup has quite a few different tags but, for now, the most important one is the ability to instruct the voice to create breaks.

Wherever you want an additional break in your text, simply input (2s is an example here, you can specify time in seconds or milliseconds):

<break time="2s" />

For example, I have the following text:

Hey John! How are you doing today?

Let's say I'm not happy with the default break after "John!". Breaks are especially useful to better separate sentences.
I can now simply input the following markup to add a break:

Hey John!<break time="50ms"/>How are you doing today?

Adding gestures

The default AI avatar (Anna) is now able to perform gestures, which add emphasis and emotion to your videos. The Avatar can perform four key gestures:

  • Nod

  • Head yes

  • Head no

  • Eyebrows up

To add any of these gestures to your script, use the corresponding tag:

  • <gesture label="nod"/>

  • <gesture label="head yes"/>

  • <gesture label="head no"/>

  • <gesture label="eyebrows up"/>

Correcting pronunciation

Pronouncing company names, acronyms, business terms or slang can sometimes be difficult for the AI because they are ambiguous. Getting the pronunciation right is a matter of inserting hyphens or spelling the word phonetically.


Try inserting hyphens to make the word sound like you want. Example:

[ Content โ†’ con-tent ]

Alternatively, you can also help the system by using phonetic spelling of the words. You can read more on this below.


if you want them to be pronounced like a word, try spelling it like it would sound. Examples:

[ AI โ†’ a-eye ]โ€

[ AWS โ†’ a-"double you"-s ]โ€

If you want the acronym to be pronounced word by word make space between the letters: [ NYC โ†’ N Y C ]


Change how you spell them depending on how you you want them to sound. Examples:

[ Ten eighty-nine -> 10 89 ]

[ Two five eight six -> 2 5 8 6 ]

[ One hundred and forty eight: -> 148 ]

Using punctuation marks

If you are having issues with the rhythm of the sentence, try adding commas/periods, quotes or re-arranging the sentence:

Commas will add shorter breaks than a period

Periods will add a longer break and downwards inflection

"Quotes" will add emphasis to that part of the sentence

For example these two examples will result in different rhythm and pauses:

[ Hereโ€™s a demonstration of how a sentence without any breaks or commas at all compare to a sentence that has as you can see the video without can be difficult to follow because there are no breaks or pauses in it. ]


[ Hereโ€™s a demonstration of how a sentence, without any breaks or commas at all, compare to a sentence that has. As you can see, the video without can be difficult to follow, because there are no breaks or pauses in it. ]

Advanced: fix pronunciation by using phonetic spelling

You can sometimes fix word pronunciation by using their phonetic spelling. Below we've included a handy table to help you replace letters with phonetic alternatives. Example:

[ Desert โ†’ de-zert ]

5f2aa75714ff3e0847b6ca23_Screenshot 2020-08-04 at 20.07.48

Supported languages

Arabic - Natural

Bengali - Original

Bulgarian - Natural

Catalan - Natural

Chinese (CN) - Natural

Chinese (HK) - Natural

Croatian - Natural

Czech - Natural

Danish - Natural

Dutch - Natural

English (AU) - Natural

English (CA)

English (GB) - Narration

English (IE) - Natural

English (IN) - Natural

English (US) - Professional

Filipino - Default

Finnish - Natural

French (CA) - Natural

French (CH) - Natural

French (FR) - Natural

German (AT) - Natural

German (CH) - Natural

German (DE) - Natural

Greek - Natural

Hebrew - Natura

Hindi - Natural

Hungarian - Natura

Indonesian - Default

Italian - Natural

Japanese - Natural

Korean - Natura

Latvian - Natural

Malay - Natural

Norwegian - Natural

Polish - Natural

Portuguese (BR) - Natural

Portuguese (PT) - Natural

Romanian - Natural

Russian - Natural

Slovak - Natural

Slovenian - Natural

Spanish (ES) - Natura

Spanish (LA) - Original

Spanish (MX) - Natural

Swedish - Natural

Tamil (IN) - Natural

Telugu (IN) - Natural

Thai - Natural

Turkish - Natural

Ukrainian - Default

Vietnamese - Natural

Did this answer your question?