Internet DRAFT - draft-chung-imax
draft-chung-imax
Edmon Chung & Jim Lam
Internet Draft Neteka Inc.
<draft-chung-imax-01.txt>
February 2003
Internationalized Mail Address eXtensions (IMAX)
STATUS OF THIS MEMO
This document is an Internet-Draft and is in full conformance with
all provisions of Section 10 of RFC2026.
Internet-Drafts are working documents of the Internet Engineering
Task Force (IETF), its areas, and its working groups. Note that
other groups may also distribute working documents as Internet-
Drafts. Internet-Drafts are draft documents valid for a maximum of
six months and may be updated, replaced, or obsoleted by other
documents at any time. It is inappropriate to use Internet-Drafts as
reference material or to cite them other than as "work in progress."
The reader is cautioned not to depend on the values that appear in
examples to be current or complete, since their purpose is primarily
educational. Distribution of this memo is unlimited.
The list of current Internet-Drafts can be accessed at
http://www.ietf.org/ietf/1id-abstracts.txt
The list of Internet-Draft Shadow Directories can be accessed at
http://www.ietf.org/shadow.html.
Abstract
This document describes a set of extension mechanisms for mail
servers and clients to handle the transportation and negotiation of
multilingual email addresses, utilizing the standard SMTP and POP
extension mechanisms.
In other words, the mechanism discussed in this document promotes the
use of multilingual email addresses that is immediately deployable by
interested parties without affecting or breaking any other existing
systems.
Table of Contents
1. Introduction....................................................2
1.1 Terminology....................................................2
2. SMTP Extensions for Multilingual Addresses......................2
2.1 Framework for Multilingual Extension...........................2
2.2 IMAX Compliant Sessions........................................3
2.3 CHARSET Parameter..............................................4
2.4 Client Side Fallback Strategy..................................4
3. Considerations for Inscription of Message Headers.Error! Bookmark
not defined.
Chung & Lam [Page 1]
IMAX February 2003
3.1 Denotation for Multilingual Header Fields....Error! Bookmark not
defined.
3.2 Encoding Schemes....................Error! Bookmark not defined.
3.3 Implementation......................Error! Bookmark not defined.
3.4 Compatibility Issues................Error! Bookmark not defined.
4. POP Capabilities Extension for Multilingual User Names..........5
4.1 IMAX Capability................................................5
4.2 The IMAX process...............................................6
4.3 Fallback Strategy..............................................7
5. Stringprep, Nameprep and Charprep Considerations................7
6. IANA & Security Considerations..................................7
6.1 SMTP Service Extension Considerations..........................7
6.2 Message Header Considerations.......Error! Bookmark not defined.
6.3 POP Capability Considerations..................................7
6.4 Security Considerations........................................8
1. Introduction
This document outlines a set of extension mechanisms that would
enable the use of multilingual email addresses without disrupting the
services of the existing servers. In brief, two main areas have been
enhanced to deal with multilingual addresses: SMTP and POP.
All enhancements follow the guidelines of standard extension
mechanisms, thus should be deployable without affecting the
interoperability of mail services.
1.1 Terminology
The key words "MUST", "SHALL", "REQUIRED", "SHOULD", "RECOMMENDED",
and "MAY" in this document are to be interpreted as described in RFC
2119 [RFC2119].
Throughout this document, multilingual characters intended to be 8-
bit will be presented in the format (charset=hexvalue=hexvalue),
while 16-bit UCS-2 characters will be in the form (U+hex).
2. SMTP Extensions for Multilingual Addresses
Following the SMTP Service Extension specifications defined in RFC
1869, a new EHLO keyword is introduced to facilitate the use of
multilingual email addresses for the transportation of messages.
Optional parameters are included to define the charset(s) supported
by the server and that of the multilingual address. To promote and
maintain the simplicity however, by default, a compliant server MUST
support UTF-8 as well as the IETF ASCII Compatible Encoding (ACE)
scheme as common for multilingual domain names (Section 5).
2.1 Framework for Multilingual Extension
The IMA extensions are defined as follows:
Chung & Lam [Page 2]
IMAX February 2003
(1) The textual name of the service extension defined is
ôInternationalized Mail Addressesö;
(2) The EHLO Keyword value associated with the extension is ôIMAXö;
(3) The IMA EHLO Keyword supports as an optional parameter a space
separated list of charsets registered with IANA for use with
MIME, plus any ACE formats supported by the DNS;
(4) No new SMTP verb is introduced;
(5) A parameter using the keyword "CHARSET" is added to the MAIL
FROM and RCPT TO commands.
(6) The clients and servers session would not be lengthened yet
multilingual characters could be used as mailbox identifiers.
2.2 IMAX Compliant Sessions
Besides the new IMAX function, another service extension that would
be used to determine the handling of multilingual addresses is the
8bit-MIMEtransport extension [RFC1952]. If the server supports both,
the client SHOULD send a multilingual address in UTF-8. If however
the server only supports IMAX or only supports 8BITMIME, then an ACE
formatted name MUST be used instead.
For example, a session between fully compliant clients and servers
would look like (C: = Client, S: = Server)(Please also be reminded
that characters intended to be 8-bit will be presented in the format:
charset=hexvalue=hexvalue):
S: <wait for connection on TCP port 25>
C: <open connection to server>
S: 220 mail.neteka.com -- Server SMTP (NeBOX v3.0)
C: EHLO mail.toronto.edu
S: 250-mail.neteka.com
S: 250-8BITMIME
S: 250 IMAX
C: MAIL FROM:<(UTF-8=E4=E8=AD=E6=96=87)@toronto.edu> CHARSET=UTF-8
S: 250 Address Ok.
C: RCPT TO:<edmon@neteka.com>
S: 250 edmon@neteka.com OK
C: DATA
...
If the server does not support 8BITMIME or if it does not support
IMAX, then a 7-bit format must be used:
C: EHLO mail.toronto.edu
S: 250-mail.neteka.com
S: 250 IMAX
C: MAIL FROM:<xn--fiq228c@toronto.edu> CHARSET=ACE
S: 250 Address Ok.
Chung & Lam [Page 3]
IMAX February 2003
...
Should there be any negotiations on the supported CHARSET, any
unsupported CHARSET would result in a 504 error response, indicating
that the command parameter is not implemented.
C: EHLO mail.toronto.edu
S: 250-mail.neteka.com
S: 250-8BITMIME
S: 250 IMAX UTF-8 GB2312
C: MAIL FROM:<(Big5=A4=A4=A4=E5)@neteka.com> CHARSET=Big5
S: 504 command parameter not implemented
C: MAIL FROM:<(UTF-8=E4=E8=AD=E6=96=87)@neteka.com> CHARSET=UTF-8
S: 250 Address Ok.
...
It is recommended in this document that the default encoding schemes
supported SHOULD be UTF-8 and an ACE format (that used by the DNS),
and both MUST be implemented for any server returning the IMAX EHLO
Keyword.
2.3 CHARSET Parameter
While the CHARSET parameter for the IMAX EHLO Keyword is optional,
any IMAX compliant client MUST specify the charset used by including
the CHARSET parameter after the MAIL FROM or RCPT TO commands. This
avoids the confusion caused by multiple conflicting character
encoding schemes. It also further signifies that the client is
attempting to transmit an IMA using the IMAX extensions.
If no charset is specified, the server SHOULD assume that the client
is not IMAX compliant.
2.4 Client Side Fallback Strategy
There are different scenarios and level of compliancy for servers.
The basic one would be encountering a server that doesnÆt support
EHLO keywords or any SMTP service extensions. In this case, the
client should attempt to send any multilingual names in an ACE
format.
S: <wait for connection on TCP port 25>
C: <open connection to server>
S: 220 mail.example.com -- Server SMTP
C: EHLO mail.neteka.com
S: 500 Command not recognized: EHLO
C: HELO mail.neteka.com
S: 250 mail.example.com hello
C: MAIL FROM:<xn--fiq228c@neteka.com>
S: 250 Address Ok.
...
Chung & Lam [Page 4]
IMAX February 2003
If a server supports EHLO but does not indicate that it also supports
IMAX, essentially the client should handle multilingual user names
the same way as encountering a non-EHLO aware server.
S: 220 mail.example.com -- Server SMTP
C: EHLO mail.neteka.com
S: 250-mail.neteka.com
S: 250 HELP
C: MAIL FROM:<xn--fiq228c@neteka.com>
S: 250 Address Ok.
...
The use of ACE names is intended to be for transitional purposes to
ensure a smooth and transparent migration towards multilingual
enabled mail address handling. Eventually, mail clients and servers
should attempt to use the IMAX scheme with UTF8 for increased
efficiency and reduced confusion.
3. POP Capabilities Extension for Multilingual User Names
While SMTP takes care of the transportation of messages POP
essentially handles the retrieval of mail objects from the server by
a client.
In order to use multilingual user names for the retrieval of messages
from a mail server using the POP protocol, a new capability is
introduced following the POP3 extension mechanism [RFC 2449].
3.1 IMAX Capability
(1) CAPA tag:
IMAX
(2) Arguments:
CHARSET û a space separated list of charsets registered with
IANA for use with MIME
(3) Standard commands affected:
USER and APOP
(4) Announced states / possible differences:
AUTHENTICATION / none
(5) Commands valid in states:
AUTHENTICATION
(6) Specification Reference:
This document
(7) Discussion:
The IMAX capability indicates that the POP server is
Chung & Lam [Page 5]
IMAX February 2003
multilingual aware and is able to handle multilingual user
addresses. Further discussion in Section 4.2.
3.2 The IMAX process
The IMAX capability enables the POP server to successfully handle
multilingual user names, which form a crucial part of a multilingual
email address. The IMAX capability supports optional arguments
specifying the charsets supported by the POP server.
By default, similar to that specified in Section 2.2, any POP server
advertising that it supports IMAX MUST at least support UTF-8 and the
ACE format used by the DNS. Other charsets may be included and could
be advertised using a space-separated list as an argument for the
IMAX CAPA keyword.
For example, a POP session between IMAX compliant servers and clients
would follow:
S: <wait for connection on TCP port 110>
C: <open connection to server>
S: +OK POP3 server ready
C: CAPA
S: +OK Capability list follows
S: TOP
S: USER
S: IMAX
S: .
Again, there will be different levels of compliance for POP servers.
If the POP3 extension mechanism is not at all supported, then the
CAPA command would yield a -ERR response, which indicates that this
is not an IMAX aware server. Since it should be necessary for POP
servers to be IMAX aware to host multilingual addresses however, a
client could assume that the POP server is non-compliant if a CAPA
command results in an ûERR response.
An IMAX compliant client MUST include the additional CHARSET
parameter with the USER command if the name in question contains
extended characters.
When confronted with a charset that is not implemented, the server
will generate a -ERR response:
S: +OK POP3 server ready
C: CAPA
S: +OK Capability list follows
S: TOP
S: USER
S: IMAX UTF-8 GB2312
S: .
C: USER (Big5=A4=A4=A4=E5)@neteka.com CHARSET=Big5
S: -ERR CHARSET=big5 not implemented
C: USER (UTF-8=E4=E8=AD=E6=96=87)@neteka.com CHARSET=UTF-8
Chung & Lam [Page 6]
IMAX February 2003
S: +OK welcome
...
The client must then fallback to using UTF-8 or those advertised by
the POP server as supported charsets.
3.3 Fallback Strategy
Although it is a safe assumption to maintain that for a mail server
to be willing to handle multilingual addresses, it should be prepared
to update its POP servers to be IMAX aware, and thus there really
should be little concern for any fallback strategy.
Nevertheless, it is possible to create provisions for this situation.
Should an IMAX aware client encounter a non-compliant POP server, it
could use an ACE formatted user name for login.
In essence, it means that it is possible to have a pure client based
solution without changing the mail servers at all. Please find
further discussion in the IMAA (Internationalized Mail Addresses in
Applications) paper.
4. Stringprep, Nameprep and Charprep Considerations
Stringprep, Nameprep or other string preparation considerations for
matching multilingual names are not discussed in this document.
Character equivalence preparations (Charprep) are also not discussed.
Please refer to the relevant documents for further information.
While this document does not explicitly discuss the different
matching preparation considerations, it recommends that the client
SHOULD perform the relevant preparations before transportation, and
the server MUST perform the relevant preparations.
5. IANA & Security Considerations
Within this document, there are a number of new commands and
parameters that will have to be included into their respective IANA
registry.
5.1 SMTP Service Extension Considerations
For the SMTP service extensions, a new EHLO keyword ôIMAXö is
introduced with optional parameters being the supported charsets. An
additional parameter ôCHARSETö is also added to the SMTP commands
ôMAIL FROMö and ôRCPT TOö to indicate the charset used during the
transportation.
5.2 POP Capability Considerations
A new Capability is added as a POP extension. The CAPA tag ôIMAXö is
introduced to indicate that it is multilingual compliant, with
optional parameters being, again the supported charsets. Similar to
Chung & Lam [Page 7]
IMAX February 2003
the SMTP arrangements, an additional parameter ôCHARSETö is also
appended to the POP commands ôUSERö and ôAPOPö.
5.3 Security Considerations
This document does not discuss any security issues and is not
believed to raise any extra security problems not already existing
within the email systems. In fact the promotion of the adoption of
the service extension mechanisms for both SMTP and POP could in turn
enhance the overall security of email messaging.
References
[RFC821] J. Postel, "Simple Mail Transfer Protocol", RFC 821,
USC/Information Sciences Institute, August 1982.
[RFC1035] Mockapetris, P., "Domain Names - Implementation and
Specification," STD 13, RFC 1035, USC/ISI, November 1987
[RFC1652] J. Klensin et. al., "SMTP Service Extension for 8bit-
MIMEtransport", RFC 1952, July 1994
[RFC1869] J. Klensin et. al., "SMTP Service Extensions", RFC 1896,
November 1995
[RFC2026] S. Bradner, "The Internet Standards Process -- Revision
3", Harvard University, RFC 2026, October 1996
[RFC2047] K. Moore, "MIME Part Three: Message Header Extensions for
Non-ASCII Text", University of Tennessee, RFC 2449,
November 1996
[RFC2119] S. Bradner, "Key words for use in RFCs to Indicate
Requirement Levels", RFC 2119, March 1997
[RFC2449] R. Gellens, et. al., "POP3 Extension Mechanism", RFC 2449,
November 1998
Authors:
Edmon Chung
Neteka Inc.
Suite 100, 243 College St.
Toronto, Ontario, Canada M5T 1R5
edmon@neteka.com
Jim Lam
Neteka Inc.
Suite 100, 243 College St.
Toronto, Ontario, Canada M5T 1R5
jimmy@neteka.com
Chung & Lam [Page 8]