Word Counter

Crowdin counts words according to the following principles:

  • A word is a combination of letters, punctuation marks, and/or special characters (e.g., @ # $ % ^ & * – _ ` ‘ “) followed by a space.
  • A sequence consisting only of punctuation marks or special characters is not considered a word.
  • HTML tags are counted as separate words in most formats, except the following: HTML, Front Matter HTML, HAML, MD, Front Matter MD, XML, WEBXML, IDML, XLIFF, ADOC, DOCX, MIF, DITA.
  • URLs (e.g., https://crowdin.com) and email addresses (e.g., support@crowdin.com) are each counted as one word.
  • In Chinese, Japanese, and other hieroglyphic languages, each hieroglyph is counted as one word. For example, “ライフ・イン・トウキョウ。” is counted as ten words: the ten kana characters count, while the punctuation marks ・ and 。 do not.

Other examples of how words are counted:

String | Words
Number is -123.45 | 3
<a href="{0}" target="_parent">here</a> | 1 / 7 (if a non-HTML-based format is used)
{0} – {1} at {2} | 4
two-in-one | 1
2-in-one | 1
two-in-1 | 1
%file_type% | 1
hello?world | 1
hello ? world | 2
☂ ☃ ☀⚤ | 0
© %company% | 1
01/01/1980 | 3
Monday, August 8, 2011 | 4
https://ka-graphie.example.com/6d8b.png | 1
Let’s look | 2
Let's look | 3 (another type of apostrophe is used)
Word(s) | 2
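
The principles and examples above can be approximated with a short Python sketch. This is not Crowdin's actual implementation: the function name count_words, the URL/email pattern, the set of characters that split a token, and the Unicode ranges used to detect hieroglyphs are all assumptions, chosen only because they reproduce the counts listed above for formats that are not HTML-based.

```python
import re
import unicodedata

# Assumption: a token of this shape is treated as a single word.
URL_OR_EMAIL = re.compile(r"(?:https?://\S+|[^\s@]+@[^\s@]+\.[^\s@]+)")

# Assumption: characters that split one token into several words. This set
# was picked only because it reproduces the documented examples.
SPLITTER = re.compile("[" + re.escape("/()'<>=\"") + "]")


def is_cjk_letter(ch):
    """True for kana and CJK ideographs, excluding punctuation such as the middle dot."""
    cp = ord(ch)
    in_block = (
        0x3040 <= cp <= 0x30FF      # Hiragana and Katakana
        or 0x3400 <= cp <= 0x4DBF   # CJK Unified Ideographs Extension A
        or 0x4E00 <= cp <= 0x9FFF   # CJK Unified Ideographs
    )
    return in_block and unicodedata.category(ch).startswith("L")


def count_words(text):
    """Heuristic word count for a string in a non-HTML-based format."""
    total = 0
    for token in text.split():
        # URLs and email addresses count as one word each.
        if URL_OR_EMAIL.fullmatch(token):
            total += 1
            continue
        for piece in SPLITTER.split(token):
            # Each CJK character counts as one word.
            total += sum(1 for ch in piece if is_cjk_letter(ch))
            # The remainder counts once if it contains a letter or a digit,
            # so runs of punctuation or symbols alone add nothing.
            rest = [ch for ch in piece if not is_cjk_letter(ch)]
            if any(ch.isalnum() for ch in rest):
                total += 1
    return total
```

Under these assumptions, count_words("01/01/1980") returns 3, count_words("Let’s look") returns 2, count_words("Let's look") returns 3, and count_words("☂ ☃ ☀⚤") returns 0, which agrees with the examples above. Strings outside those examples may be counted differently by Crowdin.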

Translatable HTML Attributes

When working with HTML-based file formats, some HTML attribute values are considered translatable and others are not, depending on the file structure. The table below lists the attributes and the situations in which their values are considered translatable.

Attribute | Details | Example
title | translatable if contained in any HTML tag | <p title="value">Text</p>
src | translatable if contained in img, iframe, embed, video, audio, source*, track* tags (* if nested in a video or audio parent tag) | <audio src="sound.ogg" controls>Text</audio>
href | translatable if contained in the a tag | <a href="https://www.site.com">Site name</a>
data | translatable if contained in the object tag | <object data="image.jpg"></object>
value | translatable if contained in input, button tags | <input value="Text">
placeholder | translatable if contained in input, textarea tags | <textarea placeholder="Enter some text here..."></textarea>
alt | translatable if contained in the img tag | <img src="image.jpg" alt="Image">
label | translatable if contained in optgroup, track* tags (* if nested in a video or audio parent tag) | <optgroup label="Text">
content | translatable if contained in the meta tag | <meta name="description" content="Text">
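
The rules in the table can be expressed as a small lookup, sketched below in Python. The names TRANSLATABLE_ATTRS, NEEDS_MEDIA_PARENT, and is_translatable are hypothetical and not part of any Crowdin API. The sketch encodes only what the table states, so any further conditions that depend on the file structure are not modeled.

```python
# Attribute -> tags in which its value is translatable (None means any tag).
TRANSLATABLE_ATTRS = {
    "title": None,
    "src": {"img", "iframe", "embed", "video", "audio", "source", "track"},
    "href": {"a"},
    "data": {"object"},
    "value": {"input", "button"},
    "placeholder": {"input", "textarea"},
    "alt": {"img"},
    "label": {"optgroup", "track"},
    "content": {"meta"},
}

# (attribute, tag) pairs marked with * in the table: they qualify only when
# the tag is nested in a video or audio parent tag.
NEEDS_MEDIA_PARENT = {("src", "source"), ("src", "track"), ("label", "track")}


def is_translatable(attr, tag, parent=None):
    """Return True if the table marks attr on tag (inside parent) as translatable."""
    if attr not in TRANSLATABLE_ATTRS:
        return False
    allowed_tags = TRANSLATABLE_ATTRS[attr]
    if allowed_tags is not None and tag not in allowed_tags:
        return False
    if (attr, tag) in NEEDS_MEDIA_PARENT:
        return parent in {"video", "audio"}
    return True
```

For instance, is_translatable("src", "source", parent="video") is True, while is_translatable("src", "source") and is_translatable("href", "link") are both False.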

