World Library  
Flag as Inappropriate
Email this Article

Left-to-right mark

Article Id: WHEBN0008605218
Reproduction Date:

Title: Left-to-right mark  
Author: World Heritage Encyclopedia
Language: English
Subject: Right-to-left mark, Byte order mark, Unicode anomaly, Signatures, Old Turkic (Unicode block)
Collection:
Publisher: World Heritage Encyclopedia
Publication
Date:
 

Left-to-right mark

The left-to-right mark (LRM) is a control character or invisible formatting character, used in the computerized typesetting of text that contains mixed left-to-right scripts (such as English and Russian) and right-to-left scripts (such as Arabic, Persian and Hebrew). It is used to set the way adjacent characters are grouped with respect to text direction.

Unicode

In Unicode, LRM is encoded U+200E left-to-right mark (HTML: ). UTF-8 is E2 80 8E. Usage is prescribed in the Unicode Bidi (bidirectional) algorithm.

Example of use in HTML

Suppose the writer wishes to inject a run of English text (i.e. left-to-right) text into an Arabic or Hebrew paragraph, with non-alphabetic characters at the end of the English text (on the right). "The language C++ is a programming language used..." in Arabic, but with the "C++" in English renders as follows:

‫ لغة C++ هي لغة برمجة تستخدم...

With an LRM entered in the HTML after the ++, it renders as follows:

‫ لغة C++‎ هي لغة برمجة تستخدم...

Standards-compliant browsers will render the ++ on the left in the first example, and on the right in the second. This happens because the browser recognizes that the paragraph is in a RTL script (Arabic), and applies punctuation, which is neutral as to its direction, in coordination with the more prominent (paragraph level) adjacent text. The LRM causes the punctuation to be adjacent to only LTR text – the "C" and the LRM – and hence position as if it were in left-to-right text, i.e., to the right of the preceding text. or may be required by some software rather than the invisible Unicode character itself; the actual invisible character would also make copy editing difficult.

See also

External links

  • Unicode standard annex #9: The bidirectional algorithm
  • Unicode character (U+200E)


This article was sourced from Creative Commons Attribution-ShareAlike License; additional terms may apply. World Heritage Encyclopedia content is assembled from numerous content providers, Open Access Publishing, and in compliance with The Fair Access to Science and Technology Research Act (FASTR), Wikimedia Foundation, Inc., Public Library of Science, The Encyclopedia of Life, Open Book Publishers (OBP), PubMed, U.S. National Library of Medicine, National Center for Biotechnology Information, U.S. National Library of Medicine, National Institutes of Health (NIH), U.S. Department of Health & Human Services, and USA.gov, which sources content from all federal, state, local, tribal, and territorial government publication portals (.gov, .mil, .edu). Funding for USA.gov and content contributors is made possible from the U.S. Congress, E-Government Act of 2002.
 
Crowd sourced content that is contributed to World Heritage Encyclopedia is peer reviewed and edited by our editorial staff to ensure quality scholarly research articles.
 
By using this site, you agree to the Terms of Use and Privacy Policy. World Heritage Encyclopedia™ is a registered trademark of the World Public Library Association, a non-profit organization.
 


Copyright © World Library Foundation. All rights reserved. eBooks from Project Gutenberg are sponsored by the World Library Foundation,
a 501c(4) Member's Support Non-Profit Organization, and is NOT affiliated with any governmental agency or department.