INTERNET-DRAFT                               Charles H. Lindsey
Usenet Format Working Group                  University of Manchester
                                             July 2001

4.1. Syntax of News Articles

Previous Up Next
4.1.  Syntax of News Articles
   The overall syntax of a news article is:

      article           = 1*header separator body
      header            = header-name ":" 1*SP header-content CRLF
      header-name       = 1*name-character *( "-" 1*name-character )
      name-character    = ALPHA / DIGIT
      header-content    = USENET-header-content
                               *( [CFWS] ";" header-parameter ) /
                          other-header-content
      USENET-header-content
                        = 
      other-header-content
                        = 
      header-parameter  = USENET-header-parameter /
                          other-header-parameter
      USENET-header-parameter
                        = 
      other-header-parameter
                        = attribute "=" value
      attribute         = USENET-token / iana-token / x-token
      value             = token / quoted-string
      USENET-token      = 
      iana-token        = 
      x-token           = [CFWS] "x-" token-core [CFWS]
      token             = [CFWS] token-core [CFWS]
      token-core        = 1*
      tspecials         = "(" / ")" / "<" / ">" / "@" /
                          "," / ";" / ":" / "\" / DQUOTE /
                          "/" / "[" / "]" / "?" / "="
      separator         = CRLF
      body              = *( *998text CRLF )

   An article consists of some headers followed by a body. An empty line
   separates the two. The headers contain structured information about
   the article and its transmission. A header begins with a header-name
   identifying it, and can be continued onto subsequent lines as
   described in section 4.2.3.  The body is largely unstructured text
   significant only to the poster and the readers.

        NOTE: Terminology here follows the current custom in the news
        community, rather than the [RFC 2822] convention of referring to
        what is here called a "header" as a "header-field" or "field".


   Note that the separator line must be truly empty, not just a line
   containing white space. Further empty lines following it are part of
   the body, as are empty lines at the end of the article.

        NOTE: The syntax above defines the canonical form of a news
        article as a sequence of lines each terminated by CRLF. This
        does not prevent serving agents or transport agents from storing
        or handling the article in other formats (e.g. using a single LF
        in place of CRLF) so long as the overall effects achieved are as
        defined by this standard when operating on the canonical form.

Previous Up Next
Previous draft (04): 4.1. Syntax of News Articles

Diffs to previous draft

--- {draft-04}	Wed Jul 11 21:55:04 2001
+++ {draft-05}	Wed Jul 11 21:55:05 2001
@@ -49,8 +49,9 @@
    significant only to the poster and the readers.
 
         NOTE: Terminology here follows the current custom in the news
-        community, rather than the [MESSFOR] convention of referring to
+        community, rather than the [RFC 2822] convention of referring to
         what is here called a "header" as a "header-field" or "field".
+
 
    Note that the separator line must be truly empty, not just a line
    containing white space. Further empty lines following it are part of