This functionality can be used to protect various character/number sequences such as dates, combinations of numbers and units of measurement, times, or article numbers following a specific character pattern.
What is to be protected is defined using a regular expression. The function is implemented in such a way that the occurrences of the protectable sequences in the source and target segments recognized via regular expressions are protected as a tag. It is also possible to configure rules that automatically transfer certain protected source language sequences to the target language according to a predefined pattern. translate5 offers an extensive selection of such regular expressions, which can be used out of the box or customized to your individual needs. And of course you can also set up more rules. The format is recorded for each source and each target language, i.e. not for language combinations.
The rules are created in the preferences.
The settings for the content protection functionality can be found in the preferences under “Content protection”.
There are three tabs here in which different settings can be made:
Active Input Rules:
Here, you can specify which rules should apply to which source language by activating the rules from the “Rules for content” tab for specific source languages.
An input rule only applies if it is combined with an output rule. Exception: “keep content” rules. |
If content protection rules have been created and activated for certain source and target languages, the corresponding number/character sequences in the projects created using these rules are automatically protected by tags and – if defined accordingly – converted into a different format for the target language.
For example:
Jira Issue ID is protected:
translate5 as brand name is protected:
ISBN-10 number is protected:
translate5 automatically recognizes when a translation memory no longer matches the Content protection settings and signals this in the language resources overview with a symbol.
The affected translation memories can be converted by clicking on this button so that they once again correspond to the Content protection settings.
The translation memory will not be accessible during the conversion. It is possible that converting a translation memory takes a lot of time. | |
The conversion of the translation memory is applied immediately. This means that the matches can also be influenced by the conversion in ongoing tasks the concerned TM is linked to. |
The following procedure is recommended for very large and/or important projects that were started before the conversion but are not yet completed at the time of the conversion:
During the import of the .tmx file into the main TM, all active rules of the Content Protection settings are applied to the imported segments. This procedure for the new matches is therefore equivalent to converting the entire main TM again, but depending on the size of the TM (and thus the amount of time required for the conversion) it is the faster way. | |
In the language resources view, you can use a filter to display only those translation memories that have not (yet) been converted: |