Internet DRAFT - draft-arrouye-idn-ie5-resolution

draft-arrouye-idn-ie5-resolution



INTERNET-DRAFT                                                Y. Arrouye
June 27, 2001                                            RealNames Corp.
Expires December 27, 2001

       IDN Resolution in Windows Internet Explorer 5.0 and Above
                draft-arrouye-idn-ie5-resolution-00.txt

Status of this Memo

   This document is an Internet-Draft and is in full conformance with
   all provisions of Section 10 of RFC2026.

   Internet-Drafts are working documents of the Internet Engineering
   Task Force (IETF), its areas, and its working groups. Note that other
   groups may also distribute working documents as Internet-Drafts.

   Internet-Drafts are draft documents valid for a maximum of six months
   and may be updated, replaced, or obsoleted by other documents at any
   time. It is inappropriate to use Internet-Drafts as reference
   material or to cite them other than as "work in progress."

   The list of current Internet-Drafts can be accessed at
   http://www.ietf.org/1id-abstracts.html

   The list of Internet-Draft Shadow Directories can be accessed at
   http://www.ietf.org/shadow.html

Abstract

   This document describes how internationalized domain names (IDNs) are
   being resolved in the Windows Internet Explorer (IE) Web browser
   version 5.0 and above. The resolution that is described in this
   document is currently available on a limited basis for an existing
   test bed. This document focuses on the different steps that are taken
   after a user enters an IDN in the address bar of IE, up to when the
   relevant Web page is displayed in the user's browser.

1.  Introduction

   Registration of IDNs in the .com, .org, and .net top-level domains
   has been available as a test bed since November 10, 2000. This test
   bed now contains nearly one million registered IDNs. The test bed
   consists of multiple phases. While the first phases only consisted in
   registration and hosting of an informational Web page by VeriSign, it
   is today possible for IDNs owners to assign them to one of their
   hosts, and to control the content that is delivered from these hosts,
   making IDNs useful for the first time.




Arrouye                                                         [Page 1]

draft-arrouye-idn-ie5-resolution-00.txt                     27 June 2001


   The greatest hurdle in the deployment of this IDN test bed has been
   the lack of support from applications. Users had to download a plug-
   in for their Web browser or their Windows operating system in order
   to be able to use IDNs. This requirement meant that availability of
   IDNs was severely limited, as people do not usually download such
   plug-ins, or are simply not aware of their existence.

   RealNames (Keyword: RealNames, http://www.realnames.com/) recently
   added IDN resolution to its resolution services. Thanks to the use of
   these services by Internet Explorer, IE users of version 5.0 and
   above on Windows have access to IDNs without the need to use a plug-
   in or do any special configuration. This document explains how this
   IDN resolution works.

2.  Overview of the VeriSign Test Bed

   Information about the VeriSign IDN test bed can be found at
   http://www.verisign-grs.com/idn/ so this section will simply give
   some basic information about it.

   IDNs can be subscribed in the .com, .org, and .net top-level domains
   through accredited registrars. The registrars do perform the Nameprep
   [NAMEPREP] step, as well as a RACE-encoding [RACE] step, on the
   names, and submit the resulting encoded string to the VeriSign GRS
   through RRP [RFC2832]. The subscribed names then go into a test bed,
   by placing them in a secondary level domain name. This is achieved by
   replacing the top-level domain TLD by .mltdb.TLD. The RACE-encoded
   subscription bq--gdvkf26n7tqlu.com is actually accessible only as
   bq--gdvkf26n7tqlu.mltbd.com. This prevents ACE-encoded strings to be
   visible in the top-level domain zone files. A later phase of the test
   bed may remove this restriction.

   The names in the .mltbd.com, .mltbd.org, and .mltbd.net test bed
   zones can still be managed by their buyers through delegation of
   their resolution from VeriSign. This is what is happening currently,
   in phase 3.2 of the test bed.

3.  Name Resolution in Internet Explorer

   The process of name resolution, or more correctly user input
   resolution, in IE, is as follows:

   - The user types some address in the IE address bar.

   - IE checks whether the address looks like a valid URI [RFC2396] with
   a scheme. If that is the case, it tries to resolve that URI.

   - If the address does not look like a valid URI, but like a domain



Arrouye                                                         [Page 2]

draft-arrouye-idn-ie5-resolution-00.txt                     27 June 2001


   name, IE first does a DNS lookup of the domain name after encoding it
   in UTF-8. If the lookup is successful, IE assumes that the user wants
   to do an HTTP [RFC2616] request for the document / on this host, and
   tries to fetch it.

   - If the DNS lookup failed, or if the address does not look like a
   valid URI, IE sends the full user input, in UTF-8, to
   auto.search.msn.com, a server that implements IE's Autosearch
   functionality.

   - Autosearch is faced with two kinds of inputs.  Genuine Autosearch
   queries are handled by MSN, which is the integration point for both
   search and the RealNames Keywords navigation system. Failed DNS
   queries are sent directly to the RealNames servers for further
   handling, as explained in the section "RealNames IDN Resolution
   Service."

4.  RealNames IDN Resolution Service

   The RealNames IDN resolution service is embedded in the RealNames
   Keywords routers [RNS], systems dedicated to the task of name
   resolution for the Internet. One of the available interfaces is HTTP-
   based, and works by returning HTTP redirects (302) as answers to an
   HTTP GET queries for name resolution. As a result, the user's HTTP
   client will get the desired named resource.

   The RealNames routers map a name to a destination. They were
   initially built to simply handle Keywords, and have been recently
   extended to handle different kinds of names. These names are called
   multilingual identifiers (MLIs) in this document. MLIs are segregated
   into different categories, or types, and may be resolved differently
   depending on their type. The initial type of MLI was a Keyword. In
   this document, we are interested into another type, the IDN.

   When an MLI is passed to a RealNames router, it first needs to be
   categorized. IDNs are being recognized by doing a syntax check as
   well as testing whether they can be encoded successfully for
   consumption by the existing DNS. IDNs are also being checked against
   a table of valid TLDs for IDNs, since they are not available
   everywhere as of this writing. Names that do not belong to the right
   TLDs are handled as Keywords (the default fall-back identifier
   category in the RealNames system).

   Once an MLI has been categorized as an IDN,  is encoded by following
   the steps described in the IDNA [IDNA] Internet-Draft: the Nameprep
   algorithm is applied to the name, and then it is encoded using an
   ACE.




Arrouye                                                         [Page 3]

draft-arrouye-idn-ie5-resolution-00.txt                     27 June 2001


   In order to be able to support different IDN deployment environments,
   the ACE used for the encoding of the name after Nameprep does depend
   on the TLD of the IDN. In our example of the VeriSign test bed, it is
   RACE. For the same reason, any post-processing of the name is also
   keyed off the TLD. In our example, the post-processing is the
   insertion of the .mltbd secondary-level domain in the now DNS-
   friendly name.

   Once a DNS-friendly name has been produced, the RealNames routers
   simply redirect to it.

   The following is a step-by-step illustration of this process. The
   notation \uNNNN is used to represent the Unicode code point U+NNNN.

   - A RealNames router is asked to resolve the name
   \u30EA\u30A2\u30EB\u30FC\u30E0\u30BA.COM This request comes as an
   HTTP GET request from auto.search.msn.com, as discussed before.

   - The name is categorized as an IDN.

   - The Nameprep algorithm is applied to this name, and its output is
   \u30EA\u30A2\u30EB\u30FC\u30E0\u30BA.com (uppercase letters have been
   mapped to lowercase ones).

   - The name is then encoded by applying RACE-encoding to each of its
   internationalized parts. The result of this encoding is
   bq--gdvkf26n7tqlu.com which is a DNS-friendly name.

   - The post-processing for .com names is now applied to this DNS-
   friendly name. As we have said before, this means that .mltbd is
   inserted as a secondary-level domain. The name is now
   bq--gdvkf26n7tqlu.mltbd.com.

   - Finally, the RealNames router sends an HTTP redirect to
   http://bq--gdvkf26n7tqlu.mltbd.com as its answer to the name
   resolution query.

   At that point, the name resolution is complete, and the user of the
   Web client that made the original request will get the desired
   content.

   Note that in order to enhance user experience, the RealNames routers
   accept partial URIs (URIs without a scheme) as input and correctly
   categorize them based on the host in the URI. So for example, the
   input \u30EA\u30A2\u30EB\u30FC\u30E0\u30BA.COM/foo.html will bring
   http://bq--gdvkf26n7tqlu.mltbd.com/foo.html as a user would expect.
   (To see a real page instead of an error, replace foo.html with
   Virtual.asp?page=JP_Corporate_PressListing in the above example.)



Arrouye                                                         [Page 4]

draft-arrouye-idn-ie5-resolution-00.txt                     27 June 2001


5.  Other Considerations

   The technology that is described here is limited to the resolution of
   IDNs in a Web browsing context. Moreover, it is limited to IDNs being
   entered by the user in the browser's address line, not as part of a
   valid URI, whether it be in the browser's address line or in a
   document manipulated by the browser. The goal of this setup is to
   enhance the user experience of people that type domain names, and now
   internationalized ones, as Web addresses, by giving them access to a
   new world of names they have been requesting.

   RealNames technology can be used through various protocols, such as
   HTTP and CNRP [CNRP]. With Microsoft IE 5.0 or above for Windows, its
   HTTP interface can be leveraged out of the box to offer name
   resolution services oriented towards user's navigation needs. The
   current setup, which adds IDN resolution to RealNames Keywords
   technology, is a good example of how flexible our platform is, and
   how IDN resolution can be offered today, even if different TLDs
   require different setups.

   Note that RealNames IDN resolution technology is not enabled for
   every single TLD. One of the reason for this is that different TLDs
   may use different resolution mechanims, or test beds. The other
   reason is that this is currently run as a commercial service, and
   activation requires a business agreement between the registry of a
   given TLD and RealNames Corporation.

6.  Conclusion

   Since internationalized domain names have been available for sale in
   the .com, .org, and .net top-level domains, the biggest hurdle to
   their use has been the lack of an application that supports them out
   of the box. RealNames, through its partnership with Microsoft, offers
   a solution for the user of Windows Internet Explorer 5.0 or above,
   that works without any necessary modification to one's environment,
   removing the biggest barrier to the use of internationalized domain
   names by these users.

7.  References

   [CNRP] N. Popp, M. Mealling, and M. Moseley, Common Name Resolution
   Protocol (CNRP), draft-ietf-cnrp-10.txt, June 2001.

   [IDNA] P. Falstrom and P. Hoffman, Internationalizing Host Names in
   Applications (IDNA), draft-ietf-idn-idna-02.txt, June 2001.

   [NAMEPREP] P. Hoffman and M. Blanchet, Preparation of
   Internationalized Host Names, draft-ietf-idn-nameprep-03.txt, January



Arrouye                                                         [Page 5]

draft-arrouye-idn-ie5-resolution-00.txt                     27 June 2001


   2001.

   [RACE] P. Hoffman, RACE: Row-based ASCII Compatible Encoding for IDN,
   draft-ietf-idn-race-03.txt, November 2000.

   [RFC2396] T. Berners-Lee, R. Fielding, L. Masinter, Uniform Resource
   Identifiers (URI): Generic Syntax, RFC 2396, August 1998.

   [RFC2616] R. Fielding, J. Gettys, J. Mogul, H. Frystyk, L. Masinter,
   P. Leach, and T. Berners-Lee, Hypertext Transfer Protocol - HTTP/1.1,
   June 1999.

   [RFC2832] S. Hollenbeck and M. Srivastava, NSI Registry Registrar
   Protocol (RRP), Version 1.1.0, RFC 2832, May 2000.

   [RNS] Y. Arrouye, The RealNames System - An International Human-
   Friendly Web Navigation System,
   http://www.internetkeywords.org/iuc/realnames-iuc16-paper.htm,
   Proceedings of the 16th International Unicode Conference, Amsterdam,
   The Netherlands, March 2000.

   [UNICODE] The Unicode Consortium, The Unicode Standard - Version 3.0,
   ISBN 0-201-61633-5, January 2000. (Version 3.1 of the standard is
   available at http://www.unicode.org/unicode/reports/tr27/ and was
   published in May 2001.)

Author's Address

   Yves Arrouye
   RealNames Corporation
   150 Shoreline Drive
   Redwood City, CA 94065

   Phone: (650) 486-5503

   E-mail: yves@realnames.com

   Keyword: RealNames
   Web: http://www.realnames.com/












Arrouye                                                         [Page 6]