How to reduce IDML file segment tag numbers

Discover ways to streamline IDML file segment tags for faster document processing.

 

If you upload an IDML file to Smartcat for translation, you might see segments with too many tags. If this happens, you can change tag settings to reduce their number. 

Here is an example of the settings screen that you will see, using a simulation file upload.

Understanding the “Simplify text formatting” options

There are three options to choose from.

Option 1: Off 
This is the default option. Choose this mode if you do not want to simplify formatting. It does not merge any CharacterStyleRange tags.

Option 2: Moderate 
This option applies the following use thresholds.

  • Kerning: [-50..50]
  • Tracking: [-50..50]
  • Baseline Shift: [-2..2]

Option 3: Aggressive
Use at 2x option two's moderate thresholds.

At CharacterStyleRange processing time, if any of the kerning/tracking/baseline shift parameters are within a selected threshold, they are ignored – considered to be equal to zero. Then the standard logic applies: if there are two or more consecutive CharacterStyleRange elements with the same style, they are merged together.

Understanding the “Ignore empty applied languages” checkbox

This checkbox is switched off by default. It's used to ignore specific attributes around applied languages. 

If turned on, two consecutive CharacterStyleRange elements that differ with this attribute will merge together. The first CharacterStyleRange value in the sequence wins.

 

Did this article help you find the answer you were looking for? If not or if you have further questions, please contact our support team.

Was this article helpful?

Do you need a human-assisted guidance? 🙌

Request a demo