INTERNET-DRAFT                               Charles H. Lindsey
Usenet Format Working Group                  University of Manchester
                                             July 2001

4.4.2. Character Sets within Article Bodies

Previous Up Next
4.4.2.  Character Sets within Article Bodies
   Within article bodies, characters are represented as octets according
   to the encoding scheme implied by any Content-Transfer-Encoding and
   Content-Type headers [RFC 2045].  In the absence of such headers,
   reading agents cannot be relied upon to display correctly more than
   the US-ASCII characters.

        NOTE: Observe that reading agents are not forbidden to "guess",
        or to interpret as UTF-8 regardless, which would be the simplest
        course for them to take.

        NOTE: It is not expected that reading agents will necessarily be
        able to present characters in all possible character sets,
        although they MUST be able to present all US-ASCII characters.
        For example, a reading agent might be able to present only the
        ISO-8859-1 (Latin 1) characters [ISO 8859], in which case it
        Ought to present undisplayable characters using some distinctive
        glyph, or by exhibiting a suitable warning. Older reading agents
        that do not understand MIME headers or UTF-8 should be able to
        display bodies in US-ASCII (with some loss of human
        comprehensibility) except possibly when the Content-Transfer-
        Encoding is "8bit".

   Followup agents MUST be careful to apply appropriate encodings to the
   outbound followup. A followup to an article containing non-ASCII
   material is very likely to contain non-ASCII material itself.

Previous Up Next
Previous draft (04): 4.4.2. Character Sets within Article Bodies

Diffs to previous draft

--- {draft-04}	Wed Jul 11 21:55:16 2001
+++ {draft-05}	Wed Jul 11 21:55:16 2001
@@ -1,6 +1,4 @@
 
4.4.2.  Character Sets within Article Bodies
-
-
    Within article bodies, characters are represented as octets according
    to the encoding scheme implied by any Content-Transfer-Encoding and
    Content-Type headers [RFC 2045].  In the absence of such headers,
@@ -18,7 +16,7 @@
         ISO-8859-1 (Latin 1) characters [ISO 8859], in which case it
         Ought to present undisplayable characters using some distinctive
         glyph, or by exhibiting a suitable warning. Older reading agents
-        that do not understand Mime headers or UTF-8 should be able to
+        that do not understand MIME headers or UTF-8 should be able to
         display bodies in US-ASCII (with some loss of human
         comprehensibility) except possibly when the Content-Transfer-
         Encoding is "8bit".