翻译记忆

A Translation Memory (TM) is a database that stores translation units, which are segments of source text paired with their corresponding translations in different languages. It improves and speeds up the translation process by providing suggestions for identical or similar strings in your projects. 每个 Crowdin 企业版项目都会自动创建一个项目翻译记忆库,并且项目内进行的每次翻译都会自动添加到此翻译记忆库。

注意:相同语言之间的译文不会添加到翻译记忆库(譬如英语源语言到英语目标语言的译文)。

创建翻译记忆

In addition to the project TMs that are automatically created with each project, you can create separate TMs and populate them with content by uploading your existing TMs in TMX, XLSX, or CSV formats. These TMs can then be assigned to the relevant projects as needed.

创建翻译记忆库的步骤如下:

  1. 打开您组织的工作区,然后在左侧边栏中选择翻译记忆
  2. 在右下角,点击添加创建
    创建翻译记忆
  3. In the appeared dialog, name your TM and select a default language that will be displayed first in the table of translation units.
  4. (可选)单击选择文件上传您现有的翻译记忆库。 您可以跳过此步骤,稍后上传翻译记忆库。
  5. 单击 创建
    Create TM Dialog

对于需要根据已翻译好的 Crowdin 企业版项目创建翻译记忆库的情况,我们建议使用翻译记忆生成器应用。

Managing Translation Units and Segments

You can create translation units from scratch, edit and delete existing translation units or segments of a particular TM or all available TMs.

Creating Translation Units

To create a translation unit, follow these steps:

  1. 打开您组织的工作区,然后在左侧边栏中选择翻译记忆。 或者,打开您的项目并前往设置 > 翻译记忆
  2. 单击所需的翻译记忆库。
  3. Click Add Translation Unit.
    Add Translation Unit
  4. In the appeared dialog, select the language from the drop-down menu and type the translation of the segment.
  5. Click add Add segment to add more translations for the translation unit.
  6. 单击 保存
    Add Translation Unit Dialog

Viewing and Filtering Translation Units and Segments

Once you open a translation memory, you can view and filter its translation units and segments using either the Translation Units or Segments pages.

在“翻译单元”页面上,您可以查看分组为翻译单元的翻译记忆内容(每行一个翻译单元,每个句段显示在单独的语言列中)。
Translation Units View

On the Segments page, you can view TM content as individual segments (one segment per row) with the following details:

  • Segment – contains either source or target language text.
  • Language – specifies the language of the segment.
  • Usage count – specifies the number of times a segment has been used.
  • Created – specifies the date of the segment creation.
  • Last modified – specifies the date of the last modification of the segment.
  • Author – specifies the user who created the segment.

Segments View

By default, all translation units and segments are displayed in the Translation Units and Segments pages. To filter the translation units or segments displayed, click and use the available filter option:

  • Modified: All, Custom Range.
  • Author: All, particular user.
  • Languages (Specific to the Segments page): All, particular language.

To sort translation units or segments, click Sort by and select one of the available options:

  • Text
  • Last modified
  • Created

Editing Translation Units

You can edit both the source and translation parts of the existing translation unit.

To edit a translation unit, follow these steps:

  1. 打开您组织的工作区,然后在左侧边栏中选择翻译记忆。 或者,打开您的项目并前往设置 > 翻译记忆
  2. 单击所需的翻译记忆库。 Alternatively, click All Records to view translation units of all available TMs in one list.
  3. 将鼠标悬停在翻译单元上,然后单击 将其打开。
  4. In the appeared dialog, edit or delete the segments of the needed languages.
  5. 单击 保存
    Edit Translation Unit Dialog

除了通过 翻译记忆 页面编辑翻译单元外,还可以阅读有关 在编辑器中编辑翻译记忆建议 的更多信息。

Replacing in Segments

You can easily find and replace translations in segments within a selected TM using the Find & Replace feature.

To replace current translations with the new ones, follow these steps:

打开您组织的工作区,然后在左侧边栏中选择翻译记忆。 或者,打开您的项目并前往设置 > 翻译记忆

  1. 单击所需的翻译记忆库。
  2. Click .
  3. In the appeared dialog, select the language in which you want to search. (Optional) Use filters if necessary.
  4. Enter the word, phrase, or sentence you want to substitute and the text to replace it with. (Optional) Use the Match case and Exact match options to refine the search results.
  5. 单击 查找 预览将被替换的句段。
    Preview
  6. Select the segments you want to replace and click Replace Selected to finish.

Deleting Translation Units and Segments

You can delete one, multiple, or all the translation units or segments at once.

To delete all the translation units from TM, follow these steps:

  1. On the Translation Units page, select the top checkbox above the translation unit list.
  2. Confirm the selection of all translation units.
  3. 点击 Deleting Translation Units

To delete all the segments from TM for only one particular language, follow these steps:

  1. On the Segments page, click and select the needed language.
  2. Select the top checkbox above the segment list.
  3. Confirm the selection of all segments.
  4. 点击 Deleting Segments

When dealing with the removal of translation units and translations, there could be three possible outcomes:

  • 从翻译记忆中删除翻译单元时,Crowdin 企业版项目中的字符串的相关译文不会被删除。
  • When you cancel the translation activity for a string via the Activity tab, the translation for a string will be deleted, but the related translation unit will be preserved in TM.
  • When deleting a translation for a string in the Editor, both the translation and the related translation unit will be deleted.

For cases when you need to clean your TM from duplicates and outdated translation units, we recommend using the TM Cleaner app.

下载和上传翻译记忆

若要下载或上传翻译记忆,请遵循以下步骤:

  1. 打开您组织的工作区,然后在左侧边栏中选择翻译记忆。 或者,打开您的项目并前往设置 > 翻译记忆
  2. 单击所需的翻译记忆库。
  3. 点击

所有者、管理员(Admin)和管理员(Manager)可以以下列文件格式下载和上传翻译记忆库:TMX、XLSX 或 CSV。

下载和上传按钮

如果使用 CSV 或 XLS/XLSX 文件格式上传翻译记忆库,请在配置对话框中将对应语言的列匹配起来。Configuring Columns for CSV/XLSX TMs

翻译记忆的 CSV 和 XLSX 文件格式的自动列识别

一旦您以 CSV 或 XLSX 格式上传翻译记忆库文件,系统会根据第一行指定的列名称自动检测文件架构。 The identification is performed in a case-insensitive manner. Columns that weren’t detected automatically will be left as Not chosen for manual configuration. 当您上传包含多种语言的翻译记忆库电子表格时,自动列识别会特别有用。

为了充分利用自动列识别功能,我们建议您使用以下值命名 CSV 或 XLSX 翻译记忆库文件中的语言列:

  • 语言名称(例如:乌克兰语)
  • Crowdin 语言代码(例如:uk)
  • Locale (e.g., uk-UA)
  • 带下划线的区域设置(例如:uk_UA)
  • Language code ISO 639-1 (e.g., uk)
  • Language code ISO 639-2/T (e.g., ukr)

要重新检测翻译记忆库文件架构,请单击检测配置

注意:如果自动架构检测识别上传的翻译记忆库文件中的所有列,则会自动选中不导入首行(头部)选项。

下载翻译记忆进行离线管理

从 Crowdin 企业版下载 TMX 格式的翻译记忆时,您可以得到一些额外的元数据,这些元数据可能对离线工具的不同使用场景有用。

以 TMX 格式下载的翻译记忆提供的其他翻译记忆属性:

x-crowdin-metadata – 字符串标识符哈希值。
creationid – Crowdin 企业版中的译文作者的全名和用户名。
creationdate – 译文创建日期。
changeid – 更新译文的人员的全名和用户名。
changedate – 译文更新日期。
usagecount – Crowdin 企业版中的翻译建议的使用次数。
lastusagedate – Crowdin 企业版中使用翻译建议的最后日期。

通常,在 Crowdin 企业版工作的翻译供应商会从项目中导出翻译记忆,以便在各种桌面应用程序中为他们的客户进行管理(例如,清除不相关译文的翻译记忆并进一步重新导入回 Crowdin 企业版)。 上面列出的翻译记忆属性允许根据不同的标准更好地导航和筛选翻译记忆句段。 Also, you might use cleaned and refreshed TMs to train MT engines only on product-specific data to ensure a higher quality of translations as a result.

分配翻译记忆

To assign a TM to your project, follow these steps:

  1. Open your project and go to Settings > Translation memories.
  2. In the Assigned translation memories section, select the needed TMs from the list.

分配翻译记忆

Prioritizing TM

When you assign a few TMs to the project, you can set the needed priority for each of them. As a result, TM suggestions from the TM with the higher priority will be displayed in the first place.

The default TM priority value is set to 1. A higher number has a higher priority (for example, 5 has a higher priority than 1). For example, if you assigned four TMs to your project, you can set the priority of 4 to the most important TM, the one that should be used in the first place. And respectively set lower priorities to other TMs.

To set the priority for TMs, follow these steps:

  1. Open your project and go to Settings > Translation memories.
  2. In the Priority column, set the preferred priority for assigned TMs.

Prioritizing TM

Changing Default TM

To change your project’s default TM, follow these steps:

  1. Open your project and go to Settings > Translation memories.
  2. In the Assigned translation memories section, click the star icon next to the needed TM from the list.

Changing Default TM

Sharing TMs

Using the shared TMs, you can pre-translate any of the projects in your organization. Also, TM suggestions from all TMs will appear in the Editor.

Note: If you want to share a TM between projects, the source language of both projects must match.

To share TMs between all of the projects in your organization, follow these steps:

  1. 打开您组织的工作区,然后在左侧边栏中选择翻译记忆
  2. Select Share Translation Memories.
    Sharing TMs

通过预翻译应用翻译记忆

Pre-translation via TM allows you to leverage a configurable (40% to 100% match ratio) and Perfect matches.

Read more about how TM matches are calculated.

Read more about Pre-translation.

Prioritizing TM Suggestions during the Pre-translation via TM

During the pre-translation via TM, the system considers multiple parameters to select the most relevant TM suggestion. If the system finds only one suitable TM suggestion for a string, it will be applied during the pre-translation via TM. If the system finds two or more TM suggestions for one string, they will be sorted based on multiple parameters and applies the most suitable one.

The following parameters are listed in the order the system uses them to decide which TM suggestion works better. If the decision can’t be made using the first parameter (i.e., two TM suggestions with 100% match), the system will use the next parameter until the decision is made.

  1. Relevance – also known as TM match. Read more about TM Match Calculation.
  2. Auto-Substitution usage – verifying whether the TM suggestion was improved by the auto-substitution. Read more about Auto-substitution.
  3. Assigned TM Priority – the priority of the TM a TM suggestion is stored in. Read more about Prioritizing TM.
  4. Primary or dialect language – the primary or dialect language usage in TM suggestion’s source text (e.g., a TM suggestion from English will have a higher priority than English, Canada).
  5. TM suggestion creation date – the date a TM suggestion was created (a TM suggestion with a more recent creation date will have a higher priority).

To better understand how TM suggestions are prioritized during the pre-translation via TM, let’s go through a few hypothetical scenarios. Let’s imagine you have an untranslated string in your project with the following source text Welcome!. Once you run the pre-translation via TM, the system starts to search for TM suggestions in your TMs.

  • The system finds two TM suggestions with the source text Welcome and Welcome!. The translation from the Welcome! TM suggestion will be used since it has a higher TM match.
  • The system finds two TM suggestions: Welcome! and Welcome!. Both have the same source text, so the system checks whether the auto-substitution was used to improve these TM suggestions and picks the one that wasn’t improved by the auto-substitution.
  • The system finds two TM suggestions: Welcome! and Welcome!. Both have the same source text, and both weren’t improved by the auto-substitution. Then the system checks the priority of the TMs these TM suggestions are stored in and picks the one stored in the TM with higher priority.
  • The system finds two TM suggestions: Welcome! and Welcome!. Both have the same source text, both weren’t improved by the auto-substitution, and both are stored in the TMs with the same priority. Then the system checks the source languages of the TM suggestions and picks the one that uses the primary language.
  • The system finds two TM suggestions: Welcome! and Welcome!. Both have the same source text, both weren’t improved by the auto-substitution, both are stored in the TMs with the same priority, and both use primary source languages. Then the system checks the TM suggestion creation date and picks the one with the latest date.

In rarer cases, there could be a situation when two or more TM suggestions are identical based on all the parameters listed above. In this case, the system picks the first one among identical.

TM Match Calculation

Crowdin Enterprise calculates the TM match by comparing the source string to be translated and TM’s existing segments.

There are three main types of TM matches:

  • Perfect Match - TM segment’s text and context completely match the source string
  • 100% Match - TM segment’s text matches the source string, but the context is different
  • Fuzzy Match (99% and less) - TM segment’s text is different to a certain extent compared to the source string

If the calculations for Perfect and 100% TM match is relatively straightforward, the fuzzy matches’ calculation may not be so obvious.

There are multiple different factors that affect the calculation of fuzzy matches, for example:

  • 词序
  • 标点
  • 格式化标签
  • 匹配长度比源字符串长

TM Suggestions Fuzzy Match

本文是否有帮助?