Version und Versionierung

Includes functions of the application up to version	6.2.2
Current translate5 version	7.12.0

Version	Published	Changed By	Comment
CURRENT (v. 2)	May 20, 2024 13:29	Marion Gubler	updated to version 7.4.1
v. 5	May 20, 2024 12:40	Marion Gubler	updated to version 7.3.2 - change wording for false-positive
v. 4	Apr 11, 2024 12:17	Marion Gubler	updated to version 7.2.4

Go to Page History

In the editor, you can mark text or complete segments for quality management purposes. For example, you can mark segments with stylistic or grammatical errors and then carry out statistical analyses using this data. You can assign the error categories in the “Quality assurance” section of the right-hand editor panel.

Types of quality assurance in translate5

translate5 distinguishes betweenautomated quality assuranceandmanual quality assurance:

During manual quality assurance, a user manually assigns an error category to a complete segment or part of it.
- Manual QA complete segment: Here, segments can be marked with various individually configured error categories.
- Manual QA in the segment: Here, parts of a segment can be commented on and assigned an error category in accordance with theMultidimensional Quality Metrics (MQM) frameworkor with individual error categories.
Automatic quality assurance assigns an error category and/or comment to a complete segment or part of it.
- Automated QA complete segment covers cases such as consistency (equal source segments translated differently) or empty segments, length check or (missing) processing of 100% or fuzzy matches.
- Automated QA in the segment covers cases such as numbers differing between source and target segment, incorrect thousands separators and errors found by the spell checker.

Overview

Four editor areas are available for quality assurance:

The left-hand editor panel “qa_stats“, where you can filter by specific manually or automatically assigned QS categories.
TheTerminologie-, Stil- und Rechtschreibprüfung” section of the right-hand editor panel: Terminology recognized by the TermCollection is displayed here.
The “Quality assurance” section of the right-hand editor panel, where you can set and control the QA categories at segment and content level that can be filtered in the left-hand editor panel. This section is subdivided into:
“false_“: You can mark or unmark errors found by the quality assurance as false positives.
“manual_qa_complete_segment“: Here, quality problems at segment level can be pointed out using, various predefined categories.
“manual_qa_in_segment“: Here, quality problems within segments can be marked manually by means of comments and severity, or using the predefined QA categories.
“status“: Here, a status can be set per segment, e.g. stating that a segment needs to be checked again.

QA statistics

Here you can see the list of error categories that are checked during quality assurance and either assigned automatically or manually at segment level. You can filter and display the segments based on these categories by ticking the categories in the filter field. If you select one of the main categories, all subcategories of this category will also be selected. The same applies to deselecting a main category. The very first line “All categories” can be used to select or deselect all main categories and their subcategories. Error categories that do not currently apply to any of the segments cannot be selected. Using the drop-down above the error categories, you can choose whether you want to

Show all
Only errors
OnlyQuality Assurance & Auto QA

. You can combine the drop-down selection with the selected categories for filtering.

	False positives are bits of text that were flagged as false by the automated quality assurance, but are in fact not errors. This happens, for example, when there is a number in the source segment that is translated using the corresponding word. This segment would be marked with a numbers error, because the system cannot find the number in the target segment, and would therefore be a false positive.
	Error categories whose counter is a “0” as there are no segments to which this category applies cannot be selected.
	The categories that refer to errors that block the export of the task, such as structural tag errors, are marked with red font colour.

False positives

This section of the right-hand editor panel shows all error categories that have been assigned to the currently open segment. The individual error categories can be set to “False positive” here, which means that this error category was applied incorrectly.

Marking reported errors as false positives individually

Directly in the segment	In the right-hand editor panel
Right-click on the relevant part of the segment. A small pop-up appears where you can place a tick in the “Yes” column. The reported error is now marked as a false positive.	In the “Quality assurance” section of the right-hand editor panel, the error categories assigned to the currently open segment are displayed. Place a tick in the “Yes” column of the list for categories that you want to mark as false positive.

	Please note that setting the “false positive” tick will be saved immediately, even if editing of the segment is cancelled without saving.

Marking entire error categories as false positive

You can set complete error categories reported by LanguageTool and terminology recognition to false positive. To do this, first place a tick in the “Yes” column of the relevant category in the “Quality assurance” section of the right-hand editor panel. Then click on the blue number that appears in the right-hand column, which indicates the number of errors in this error category. The number will no longer appear with blue highlighting. This can be undone by clicking on the number again.

	Please note that setting the “false positive” tick will be saved immediately, even if editing of the segment is cancelled without saving.

Manual QA (complete segment)

Here you can assign error categories on segment level.

	Please note that the assignment of error categories is saved immediately, even if editing of the segment is cancelled without saving.
	These error categories can be defined and adjusted individually for each task in the system configuration.

Manual QA (inside segment)

During manual quality assurance in the segment, a user can assign an error category for one or several words within a segment. To do so, proceed as follows:

Mark the relevant bit of text in the segment.
Select the severity of the error from the drop-down (optional).
Add a comment (optional).
Select an error category from the “Add manual QA” drop-down.

The “Add Manual QA” drop-down is multi-dimensional. If you want to display the submenu of one of the categories, just move the cursor over it. However, you can also directly assign one of the error categories appearing on the first level.

The selected segment passage is then surrounded by a pair of tags with an orange background.

The system remembers most recently used error categories and offers them for direct selection among the categories on the first drop-down level.

	The categories displayed here can be defined and adjusted individually for each task in the “QM-Subsegment_Issues.xml” file in the import zip.
	If no text is marked before selecting the error category, no category will be assigned.
	The error category assignments inside the segment are retained only when the segment is saved.
	For more efficiency, the error categories can also be assigned via a keyboard shortcuts, see Tastenkombinationen im Editor#mqm_shortcuts.

Status

In this area, a manual quality assurance status can be assigned to segments. A status can, e.g. indicate to the reviser or second proofreader which segments should be given special attention to.

	The statuses displayed here can be defined in the configuration of your translate5 system.

Error categories

The following section gives an overview of the error categories.

Consistency

Inconsistent source	The target segment contains the same translation that was used for a deviating source segment.
Inconsistent target	The target segment has been translated differently from another target segment whose source segment content matches the current one.

Internal tags

Internal tags have an incorrect structure	The order of the tags in the target segment is incorrect in one of the following ways e.g. a closing tag has been inserted before the opening tag: or the tags were inserted incorrectly nested or are overlapping:
Internal tags have been added	The target segment contains tags that are not present in the corresponding source segment.
Internal tags missing	Not all tags from the source segment are present in the target segment.
Whitespace has been added	The target segment contains at least one additional space, tab or line break before/after a tag compared to the source segment. It makes no difference whether the space is missing before or after the tag.
Whitespace has been removed	The target segment contains at least one space, tab or line break less before/after a tag compared to the source segment. It makes no difference whether the space is missing before or after the tag.

LanguageTool

Grammar	The text contains a grammatical error (including errors in syntax and morphology).
Spelling	The text contains a spelling mistake.
Typographical	The text has typographical errors, such as missing/incorrect punctuation, incorrect capitalization, etc.
General > Characters	The target segment contains symbols that are wrong, in the wrong order, or that are not used in the target language.
General > Duplication	The target segment contains at least one word twice.
General > Inconsistency	The text is inconsistent in that it uses different translations for the same term in different places (not coupled with terminology recognition).
General > Legal	The text is legally problematic (e.g. it refers to the wrong legal system, or uses expressions and phrases that are not common for the target language’s legal area).
General > Uncategorized	The identified error is either uncategorized or cannot be categorized (due to incomprehensible strings).
Style > Register	The translation is written in the wrong language register or uses slang or other language variants that are not appropriate for the target text.
Style > Locale-specific content	The translation contains passages that do not apply to the target language culture or country.
Style > Locale-violation	The text violates the standards for the intended locale.
Style > General style	The text contains stylistic errors.
Style > Pattern problem	The text does not match a pattern that defines acceptable content (or matches a pattern that defines non-acceptable content).
Style > Whitespace	The kind or number of white spaces in the source and target segments do not match, or the target segment violates certain rules for the use of white space.
Style > Terminology	A wrong term, or a term from the wrong area was used, or terminology was used inconsistently.
Style > Internationalization	There is a problem related to the internationalization of content.

Empty segments

Empty segments	There are segments that have not been translated yet.

White space at the beginning/end

Line break at the beginning	The segment contains an unnecessary line break right at the beginning.
Line break at the end	The segment contains an unnecessary line break at the end.
Space after line break	The segment contains an unnecessary space after the line break.
Non-breaking space at the beginning	The segment contains a non-breaking space right at the beginning.
Non-breaking space at the end	The segment contains an unnecessary non-breaking space at the end.
Space before line break	The segment contains an unnecessary space before the line break.
Segment ends with a space followed by a tag	The segment contains a tag at the end that is preceded by an unnecessary space.
Tab at the beginning	The segment contains an unnecessary tab stop right at the beginning.
Tab at the end	The segment contains an unnecessary tab stop at the end.
Segment begins with a tag followed by a space	The segment contains a tag at the beginning that is followed by an unnecessary space.

Length check

Segment relevantly shorter than allowed (more than 20% shorter or at least 100 pixels shorter or at least 20 characters shorter)	Indicates that the segment's character count is significantly below layout-dependent length specification limits.
Segment longer than allowed	The segment content exceeds the permissible length specification.
Segment has too many lines	The segment contains more line breaks than permitted (e.g. relevant for subtitle files).
Segment shorter than allowed.	The segment content falls short of the permissible length specification.

Manual QA (complete segment)

The categories displayed here can be configured individually. The following categories are available by default:

Demo QM error 1
Mistranslation
Terminology problem
Fluency issue
Inconsistency

Manual QA (in the segment)

These default quality categories follow theMultidimensional Quality Metric (MQM)model that is based on the LISA QA model, but structured modally, as developed and defined by QTLaunchpad. QTLaunchpad was an EU supported project by theGerman Research Center for Artificial Intelligence (DFKI),Dublin City University,University of Sheffieldand theAthena Institute for Language and Speech Processing. The categories are as follows:

Accuracy
- Mistranslation
  - Terminology
- Omission
- Addition
- Untranslated
Fluency
- Register
- Style
- Inconsistency
- Spelling
- Typography
- Grammar
- Locale violation
- Unintelligible
Verity
- Completeness
- Legal requirements
- Locale applicability

	The categories displayed here can be defined and adjusted individually for each task in the “QM-Subsegment_Issues.xml” file in the import zip.

Usage of TM matches

Edited 100% match	Indicates that a 100% match from the translation memory has been edited.
Unedited fuzzy match	Indicates that a fuzzy match from the translation memory has not been edited.

Terminology

Forbidden in source	Indicates that a term has been used that is flagged as forbidden for the source language.
Forbidden in target	Indicates that a term has been used that is flagged as forbidden for the target language.
Not defined in target terminology	Indicates that the segment contains a term that is only defined in the source language, but not yet in the target language of the corresponding TermCollection.
Not found in target	Indicates that there is a match with the TermCollection in the source segment, but not in the target segment.

Numbers

Numbers source ≠ target	Indicates that the numbers contained in the target segment do not match those in the source segment.
Format alteration (ordinal numbers, leading zero etc.)	Indicates that the notation or format of the numbering/ordinals in the target segment differs from the one used in the source segment.
Different character/formatting for number interval	Indicates that the notation or format of a number interval in the target segment differs from the one used in the source segment.
1000 separator not allowed	Indicates that a thousands separator that is not permitted for the target language has been used in the target segment.
Dubious number from source unchanged in target	Indicates that a possibly incorrect number was detected in the source segment that was transferred to the target segment and not edited.
Alphanumeric character sequences: Differences	Indicates differences between the source and target segments regarding alphanumeric strings, e.g. article names that have not been transferred correctly.
Format alteration (date information etc.)	Indicates that the notation or format of dates in the target segment is not the same as in the source segment.
Separator(s) not localised	Indicates that the thousands separator was transferred from the source segment to the target segment and not edited to match the target language’s requirements.
Formatting of 1000s number altered	Indicates that the notation or format of the thousands numbers in the target segment differs from the one in the source segment.
Numeral from source found as number in target	Indicates that the source segment contains a numeral (one, two, three, etc.), for which a number (1, 2, 3, etc.) was used in the target segment.
Number from source found as numeral in target.	Indicates that the source segment contains a number (1, 2, 3, etc.), for which a numeral (one, two, three, etc.) was used in the target segment.

Page tree

Quality assurance

Version und Versionierung

Types of quality assurance in translate5

Overview

QA statistics

False positives

Marking reported errors as false positives individually

Marking entire error categories as false positive

Manual QA (complete segment)

Manual QA (inside segment)

Status

Error categories

Consistency

Internal tags

LanguageTool

Empty segments

White space at the beginning/end

Length check

Manual QA (complete segment)

Manual QA (in the segment)

Usage of TM matches

Terminology

Numbers