TOC 
Network Working GroupD. Ewell, Ed.
Internet-DraftConsultant
Intended status: InformationalSeptember 27, 2008
Expires: March 31, 2009 


Update to the Language Subtag Registry
draft-ietf-ltru-4645bis-06

Status of this Memo

By submitting this Internet-Draft, each author represents that any applicable patent or other IPR claims of which he or she is aware have been or will be disclosed, and any of which he or she becomes aware will be disclosed, in accordance with Section 6 of BCP 79.

Internet-Drafts are working documents of the Internet Engineering Task Force (IETF), its areas, and its working groups. Note that other groups may also distribute working documents as Internet-Drafts.

Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as “work in progress.”

The list of current Internet-Drafts can be accessed at http://www.ietf.org/ietf/1id-abstracts.txt.

The list of Internet-Draft Shadow Directories can be accessed at http://www.ietf.org/shadow.html.

This Internet-Draft will expire on March 31, 2009.

Abstract

This memo defines the procedure used to update the IANA Language Subtag Registry in conjunction with the publication of RFC 4646bis [RFC EDITOR NOTE: replace with actual RFC number], for use in forming tags for identifying languages. As an Internet-Draft, it also contained a complete replacement of the contents of the Registry to be used by IANA in updating it. To prevent confusion, this material was removed before publication.



Table of Contents

1.  Introduction
2.  Updating the Registry
    2.1.  Starting Point
    2.2.  New Language Subtags
    2.3.  Modified Language Subtags
    2.4.  New Region Subtags
    2.5.  Grandfathered and Redundant Tags
    2.6.  Preferred-Value Changes
    2.7.  Additional Changes
3.  Updated Registry Contents
4.  Security Considerations
5.  IANA Considerations
6.  Changes
7.  References
    7.1.  Normative References
    7.2.  Informative References
Appendix A.  Acknowledgements
§  Author's Address
§  Intellectual Property and Copyright Statements




 TOC 

1.  Introduction

[RFC4646] (Phillips, A. and M. Davis, “Tags for Identifying Languages,” September 2006.) provided for a Language Subtag Registry and described its format. The initial contents of the Registry and rules for determining them were specified in [RFC4645] (Ewell, D., “Initial Language Subtag Registry,” September 2006.).

[draft‑ietf‑ltru‑4646bis‑17] (Phillips, A., Ed. and M. Davis, Ed., “Tags for Identifying Languages,” September 2008.) expands on [RFC4646] (Phillips, A. and M. Davis, “Tags for Identifying Languages,” September 2006.) by adding support for approximately 7,500 new primary and extended language subtags based on [ISO639‑3] (International Organization for Standardization, “ISO 639-3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages, first edition,” 2007.) and [ISO639‑5] (International Organization for Standardization, “ISO 639-5:2008. Codes for the representation of names of languages -- Part 5: Alpha-3 code for language families and groups,” May 2008.) alpha-3 code elements, and seven new region subtags based on [ISO3166‑1] (International Organization for Standardization, “ISO 3166-1:2006. Codes for the representation of names of countries and their subdivisions -- Part 1: Country codes,” November 2006.) exceptionally reserved code elements. This memo describes the process of updating the Registry to include these additional subtags, and to make secondary changes to the Registry that result from adding the new subtags and from other decisions made by the Language Tag Registry Update (LTRU) Working Group.

In its initial phase as an Internet-Draft, this memo also contained a complete replacement of the contents of the Language Subtag Registry to be used by the Internet Assigned Numbers Authority (IANA) in updating it. This content was deleted from this memo prior to publication as an RFC.

The format of the Language Subtag Registry, and the definition and intended purpose of each of the fields, are described in [draft‑ietf‑ltru‑4646bis‑17] (Phillips, A., Ed. and M. Davis, Ed., “Tags for Identifying Languages,” September 2008.).

The Registry is expected to change over time, as new subtags are registered and existing subtags are modified or deprecated. The process of updating the Registry is described in [draft‑ietf‑ltru‑4646bis‑17] (Phillips, A., Ed. and M. Davis, Ed., “Tags for Identifying Languages,” September 2008.), Section 3. In its Internet-Draft phase, this memo did not define the permanent contents of the Registry and should not be represented as having done so.

Many of the subtags defined in the Language Subtag Registry are based on code elements defined in [ISO639‑1] (International Organization for Standardization, “ISO 639-1:2002. Codes for the representation of names of languages -- Part 1: Alpha-2 code,” 2002.), [ISO639‑2] (International Organization for Standardization, “ISO 639-2:1998. Codes for the representation of names of languages -- Part 2: Alpha-3 code, first edition,” 1998.), [ISO639‑3] (International Organization for Standardization, “ISO 639-3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages, first edition,” 2007.), [ISO639‑5] (International Organization for Standardization, “ISO 639-5:2008. Codes for the representation of names of languages -- Part 5: Alpha-3 code for language families and groups,” May 2008.), [ISO3166‑1] (International Organization for Standardization, “ISO 3166-1:2006. Codes for the representation of names of countries and their subdivisions -- Part 1: Country codes,” November 2006.), [ISO15924] (International Organization for Standardization, “ISO 15924:2004. Information and documentation -- Codes for the representation of names of scripts,” January 2004.), and [UN_M.49] (Statistics Division, United Nations, “Standard Country or Area Codes for Statistical Use,” June 1999.). The Registry is not a mirror of the code lists defined by these standards and should not be used as one.

The key words “MUST”, “MUST NOT”, “REQUIRED”, “SHALL”, “SHALL NOT”, “SHOULD”, “SHOULD NOT”, “RECOMMENDED”, “MAY”, and “OPTIONAL” in this document are to be interpreted as described in [RFC2119] (Bradner, S., “Key words for use in RFCs to Indicate Requirement Levels,” March 1997.).



 TOC 

2.  Updating the Registry

This section describes the process for determining the updated contents of the Language Subtag Registry.



 TOC 

2.1.  Starting Point

The version of the Language Subtag Registry that was current at the time of IESG approval of this memo served as the starting point for this update. This version was created according to the process described in [RFC4645] (Ewell, D., “Initial Language Subtag Registry,” September 2006.) and maintained according to the process described in [RFC4646] (Phillips, A. and M. Davis, “Tags for Identifying Languages,” September 2006.).

The source data for [ISO639‑3] (International Organization for Standardization, “ISO 639-3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages, first edition,” 2007.) used for this update consisted of three files, available from the official site of the ISO 639-3 Registration Authority. [RFC EDITOR NOTE: these files may be updated before approval of this memo.]

The source data for [ISO639‑5] (International Organization for Standardization, “ISO 639-5:2008. Codes for the representation of names of languages -- Part 5: Alpha-3 code for language families and groups,” May 2008.) used for this update was Table 3, “Language families and groups ordered by identifier,” from the PDF version of the standard.

Language code elements that were already retired in [ISO639‑3] (International Organization for Standardization, “ISO 639-3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages, first edition,” 2007.) prior to IESG approval of this memo were not listed in these files, and consequently were not considered in this update.

The values of the File-Date field, the Added date for each new subtag record, and the Deprecated date for each existing grandfathered or redundant tag deprecated by this update were set to a date as near as practical to the date of IESG approval of this memo. [RFC EDITOR NOTE: these dates are initially set to 2029-09-09 for easy recognition, and MUST be updated during AUTH48.]



 TOC 

2.2.  New Language Subtags

For each language in [ISO639‑3] (International Organization for Standardization, “ISO 639-3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages, first edition,” 2007.) that was not already represented by a language subtag in the Language Subtag Registry, a new language subtag was added to the Registry, using the [ISO639‑3] (International Organization for Standardization, “ISO 639-3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages, first edition,” 2007.) code element as the value for the Subtag field and each of the non-inverted [ISO639‑3] (International Organization for Standardization, “ISO 639-3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages, first edition,” 2007.) names as a separate Description field. The [ISO639‑3] (International Organization for Standardization, “ISO 639-3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages, first edition,” 2007.) reference name was represented by the first Description field.

If the language was encompassed by one of the [ISO639‑3] (International Organization for Standardization, “ISO 639-3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages, first edition,” 2007.) macrolanguages ‘ar’ (Arabic), ‘kok’ (Konkani), ‘ms’ (Malay), ‘sw’ (Swahili), ‘uz’ (Uzbek), or ‘zh’ (Chinese), as determined by [iso‑639‑3‑macrolanguages_20080218] (International Organization for Standardization, “ISO 639-3 Macrolanguage Mappings,” January 2008.), an extended language subtag was also added, with the primary language subtag of the macrolanguage as the value for the Prefix field. These macrolanguage subtags were already present in the Language Subtag Registry and were determined by the LTRU Working Group to have been used to represent a single dominant language as well as the macrolanguage as a whole, making the extended language mechanism suitable for languages encompassed by the macrolanguage.

If the name of the language included the words “Sign Language”, an extended language subtag was added, with the string “sgn” as the value for the Prefix field. This is a special case that treats the existing primary language subtag for “Sign languages” as if it were a macrolanguage encompassing all sign languages.

All extended language subtags were added with a Preferred-Value equal to the corresponding primary language subtag.

If the language was encompassed by a macrolanguage, as determined by [iso‑639‑3‑macrolanguages_20080218] (International Organization for Standardization, “ISO 639-3 Macrolanguage Mappings,” January 2008.), a Macrolanguage field was added for the encompassed language, with a value equal to the subtag of the macrolanguage. (Note that ‘sgn’ is defined as a “collection code” by [ISO639‑3] (International Organization for Standardization, “ISO 639-3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages, first edition,” 2007.) and hence is not included in that standard; therefore, no Macrolanguage field was added for sign-language subtags.)

If the language was assigned a “Scope” value of ‘M’ (Macrolanguage) in [iso‑639‑3_20080902] (International Organization for Standardization, “ISO 639-3 Code Set,” September 2008.), a Scope value of “macrolanguage” was added for the language. Otherwise, if the language was assigned a “Scope” value of ‘S’ (Special), a Scope value of “special” was added. Most languages in [ISO639‑3] (International Organization for Standardization, “ISO 639-3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages, first edition,” 2007.) have scope ‘I’ (Individual) and thus were not assigned a Scope value in the Registry.

For each language in [ISO639‑5] (International Organization for Standardization, “ISO 639-5:2008. Codes for the representation of names of languages -- Part 5: Alpha-3 code for language families and groups,” May 2008.) that was not already represented by a language subtag in the Language Subtag Registry, a new language subtag was added to the Registry, using the [ISO639‑5] (International Organization for Standardization, “ISO 639-5:2008. Codes for the representation of names of languages -- Part 5: Alpha-3 code for language families and groups,” May 2008.) code element as the value for the Subtag field and the “English name” field as the Description field. Each of these languages was assigned a Scope value of “collection” in the Registry.

All subtags were added to the Registry maintaining alphabetical order within each type of subtag: all 2-letter “language” subtags first, then all 3-letter “language” subtags, and finally all “extlang” subtags. Some existing records were moved to ensure this order.



 TOC 

2.3.  Modified Language Subtags

For each language in [ISO639‑3] (International Organization for Standardization, “ISO 639-3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages, first edition,” 2007.) that was already represented by a language subtag in the Language Subtag Registry, Description fields were added as necessary to reflect all non-inverted names listed for that language in [iso‑639‑3_Name_Index_20080902] (International Organization for Standardization, “ISO 639-3 Language Names Index,” September 2008.). Any existing Description fields which reflected inverted names, or which represented a strict subset of the information provided by the [ISO639‑3] (International Organization for Standardization, “ISO 639-3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages, first edition,” 2007.) name, were deleted. An example of the latter was the name “Ainu” for the subtag ‘ain’, which provided less information than the [ISO639‑3] (International Organization for Standardization, “ISO 639-3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages, first edition,” 2007.) name “Ainu (Japan)”.

The order of Description fields was adjusted to ensure that the reference name from [ISO639‑3] (International Organization for Standardization, “ISO 639-3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages, first edition,” 2007.) was listed first, followed by other names from [ISO639‑3] (International Organization for Standardization, “ISO 639-3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages, first edition,” 2007.) in the order presented by that standard, followed by any other names already existing in the Registry. In some cases this resulted in a reordering of Description fields for existing entries, even when no new values were added.

For each language that was encompassed by a macrolanguage in [ISO639‑3] (International Organization for Standardization, “ISO 639-3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages, first edition,” 2007.), a Macrolanguage field was added, with a value equal to the subtag of the macrolanguage.

For each language in [ISO639‑5] (International Organization for Standardization, “ISO 639-5:2008. Codes for the representation of names of languages -- Part 5: Alpha-3 code for language families and groups,” May 2008.) that was already represented in the Language Subtag Registry, the Description field was adjusted as necessary to match the “English name” field in [ISO639‑5] (International Organization for Standardization, “ISO 639-5:2008. Codes for the representation of names of languages -- Part 5: Alpha-3 code for language families and groups,” May 2008.). Each of these languages was assigned a Scope value of “collection”. Language subtags corresponding to code elements that were identified in [ISO639‑3] (International Organization for Standardization, “ISO 639-3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages, first edition,” 2007.) as representing collection codes, but not listed in [ISO639‑5] (International Organization for Standardization, “ISO 639-5:2008. Codes for the representation of names of languages -- Part 5: Alpha-3 code for language families and groups,” May 2008.), were also assigned a Scope value of “collection”.

Note in particular that the change from [ISO639‑2] (International Organization for Standardization, “ISO 639-2:1998. Codes for the representation of names of languages -- Part 2: Alpha-3 code, first edition,” 1998.) names such as “Afro-Asiatic (Other)” to [ISO639‑5] (International Organization for Standardization, “ISO 639-5:2008. Codes for the representation of names of languages -- Part 5: Alpha-3 code for language families and groups,” May 2008.) names such as “Afro-Asiatic languages” implies a broadening of scope for some of these subtags, designated “remainder groups” in [ISO639‑5] (International Organization for Standardization, “ISO 639-5:2008. Codes for the representation of names of languages -- Part 5: Alpha-3 code for language families and groups,” May 2008.).

As a result of discussions with officials from the [ISO639‑2] (International Organization for Standardization, “ISO 639-2:1998. Codes for the representation of names of languages -- Part 2: Alpha-3 code, first edition,” 1998.) and [ISO639‑3] (International Organization for Standardization, “ISO 639-3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages, first edition,” 2007.) registration agencies, it was determined that the existing language subtag ‘car’ would retain its existing Description, “Galibi Carib”, and would continue to be identified as an individual language (i.e. no Scope value), even though the code element was also present in [ISO639‑5] (International Organization for Standardization, “ISO 639-5:2008. Codes for the representation of names of languages -- Part 5: Alpha-3 code for language families and groups,” May 2008.) as a collection code with the name “Carib languages”.

A Scope value of “private-use” was added for the unique record with Subtag value ‘qaa..qtz’. This record has a Description of “Private use” (changed from “PRIVATE USE”) and corresponds to a range of code elements that is reserved for private use in [ISO639‑2] (International Organization for Standardization, “ISO 639-2:1998. Codes for the representation of names of languages -- Part 2: Alpha-3 code, first edition,” 1998.). The Description fields for script and region private-use subtags were also capitalized as “Private use”.



 TOC 

2.4.  New Region Subtags

[draft‑ietf‑ltru‑4646bis‑17] (Phillips, A., Ed. and M. Davis, Ed., “Tags for Identifying Languages,” September 2008.) expands the scope of region subtags by adding subtags based on code elements defined as “exceptionally reserved” in [ISO3166‑1] (International Organization for Standardization, “ISO 3166-1:2006. Codes for the representation of names of countries and their subdivisions -- Part 1: Country codes,” November 2006.). These code elements are reserved by the ISO 3166 Maintenance Agency “at the request of national ISO member bodies, governments and international organizations.” At the time of IESG approval of this memo, ISO 3166/MA had defined nine exceptionally reserved code elements, all of which were added to the Language Subtag Registry except for the following:



 TOC 

2.5.  Grandfathered and Redundant Tags

As stated in [draft‑ietf‑ltru‑4646bis‑17] (Phillips, A., Ed. and M. Davis, Ed., “Tags for Identifying Languages,” September 2008.), “grandfathered” and “redundant” tags are complete tags in the Language Subtag Registry that were registered under [RFC1766] (Alvestrand, H., “Tags for the Identification of Languages,” March 1995.) or [RFC3066] (Alvestrand, H., “Tags for the Identification of Languages,” January 2001.) and remain valid. Grandfathered tags cannot be generated from a valid combination of subtags, while redundant tags can be.

Under certain conditions, registration of a subtag under [draft‑ietf‑ltru‑4646bis‑17] (Phillips, A., Ed. and M. Davis, Ed., “Tags for Identifying Languages,” September 2008.) may cause a grandfathered tag to be reclassified as redundant. It may also enable the creation of a generative tag with the same meaning as a grandfathered or redundant tag; in that case, the grandfathered or redundant tag is marked as Deprecated, and the generative tag (including the new subtag) becomes its Preferred-Value.

As a result of adding the new subtags in this update, each of the following grandfathered tags became composable and was reclassified as redundant, and was deprecated with the indicated generative tag serving as the Preferred-Value:

zh-cmn (Preferred-Value: cmn)

zh-cmn-Hans (Preferred-Value: cmn-Hans)

zh-cmn-Hant (Preferred-Value: cmn-Hant)

zh-gan (Preferred-Value: gan)

zh-wuu (Preferred-Value: wuu)

zh-yue (Preferred-Value: yue)

The following grandfathered tags were deprecated, with the indicated generative tag serving as the Preferred-Value:

i-ami (Preferred-Value: ami)

i-bnn (Preferred-Value: bnn)

i-pwn (Preferred-Value: pwn)

i-tao (Preferred-Value: tao)

i-tay (Preferred-Value: tay)

i-tsu (Preferred-Value: tsu)

zh-hakka (Preferred-Value: hak)

zh-min (no Preferred-Value; see below)

zh-min-nan (Preferred-Value: nan)

zh-xiang (Preferred-Value: hns)

The tag “zh-min”, originally registered under [RFC1766] (Alvestrand, H., “Tags for the Identification of Languages,” March 1995.), is a special case: it represents a small class of Chinese languages, but is not a true macrolanguage. The string “min” could not ever be used to tag these languages since the [ISO639‑3] (International Organization for Standardization, “ISO 639-3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages, first edition,” 2007.) code element ‘min’ is assigned to an individual language (Minangkabau) that is not related to Chinese (‘zh’). Because it is not believed to represent a useful linguistic entity for tagging purposes, it was deprecated without a Preferred-Value.

The following grandfathered and redundant sign-language tags were deprecated, with the indicated generative tag serving as the Preferred-Value:

sgn-BE-FR (Preferred-Value: sfb)

sgn-BE-NL (Preferred-Value: vgt)

sgn-BR (Preferred-Value: bzs)

sgn-CH-DE (Preferred-Value: sgg)

sgn-CO (Preferred-Value: csn)

sgn-DE (Preferred-Value: gsg)

sgn-DK (Preferred-Value: dsl)

sgn-ES (Preferred-Value: ssp)

sgn-FR (Preferred-Value: fsl)

sgn-GB (Preferred-Value: bfi)

sgn-GR (Preferred-Value: gss)

sgn-IE (Preferred-Value: isg)

sgn-IT (Preferred-Value: ise)

sgn-JP (Preferred-Value: jsl)

sgn-MX (Preferred-Value: mfs)

sgn-NI (Preferred-Value: ncs)

sgn-NL (Preferred-Value: dse)

sgn-NO (Preferred-Value: nsl)

sgn-PT (Preferred-Value: psr)

sgn-SE (Preferred-Value: swl)

sgn-US (Preferred-Value: ase)

sgn-ZA (Preferred-Value: sfs)

No change was made to the Description field(s) for any of the grandfathered or redundant tags. For example, the redundant tag “sgn-US” continues to carry the Description “American Sign Language”. The sign language tags registered prior to [RFC4646] (Phillips, A. and M. Davis, “Tags for Identifying Languages,” September 2006.) remain an exception to the general principle that the meaning of a non-grandfathered tag can be derived from its component subtags.

In previous versions of the Registry, grandfathered tags that had been deprecated as a result of adding an ISO 639-based language subtag included a Comments field, with a value of the form “replaced by ISO code xxx”, where ‘xxx’ represented the new language subtag. These comments duplicated the information contained within the Preferred-Value field, and were deleted as part of this update. No changes were made to other Comments fields.



 TOC 

2.6.  Preferred-Value Changes

[draft‑ietf‑ltru‑4646bis‑17] (Phillips, A., Ed. and M. Davis, Ed., “Tags for Identifying Languages,” September 2008.), Section 3.1.7 provides for the value of Preferred-Value fields to be updated as necessary to reflect changes in one of the source standards. Accordingly, the Preferred-Value fields for the following deprecated tags were changed:

i-hak (changed from zh-hakka to hak)

zh-guoyu (changed from zh-cmn to cmn)

This makes it unnecessary for consumers of the Language Subtag Registry to follow a “chain” of Preferred-Values in order to arrive at a non-deprecated subtag.



 TOC 

2.7.  Additional Changes

For consistency with the handling of alternative names in language subtags, Description fields for script subtags taken from [ISO15924] (International Organization for Standardization, “ISO 15924:2004. Information and documentation -- Codes for the representation of names of scripts,” January 2004.) that represent alternative names were converted to multiple Description fields. For example, the Description “Han (Hanzi, Kanji, Hanja)” was converted to four separate Description fields. Some Description fields for script subtags contained parenthetical material that was explanatory, rather than identifying alternative names; these fields were not altered.

This situation does not apply to region subtags taken from [ISO3166‑1] (International Organization for Standardization, “ISO 3166-1:2006. Codes for the representation of names of countries and their subdivisions -- Part 1: Country codes,” November 2006.) and [UN_M.49] (Statistics Division, United Nations, “Standard Country or Area Codes for Statistical Use,” June 1999.) because those standards do not provide freely available alternative names for code elements.

The capitalization of the Subtag fields for certain grandfathered and redundant tags (sgn-BE-fr, sgn-BE-nl, sgn-CH-de, yi-latn) was modified to conform with the capitalization conventions described in [draft‑ietf‑ltru‑4646bis‑17] (Phillips, A., Ed. and M. Davis, Ed., “Tags for Identifying Languages,” September 2008.), Section 2.1.1. This has no effect on the validity or meaning of these tags.

The Description field for subtag ‘sgn’ was capitalized as “Sign languages”, to match the capitalization used for other languages in [ISO639‑5] (International Organization for Standardization, “ISO 639-5:2008. Codes for the representation of names of languages -- Part 5: Alpha-3 code for language families and groups,” May 2008.), even though this capitalization does not exactly match that used for code element ‘sgn’ in any of the ISO 639 parts.

The Deprecated field for the region subtag TP was modified from 2002-11-15 to 2002-05-20, to correct a clerical error. The corrected date reflects the actual date the code element TP was withdrawn in [ISO3166‑1] (International Organization for Standardization, “ISO 3166-1:2006. Codes for the representation of names of countries and their subdivisions -- Part 1: Country codes,” November 2006.).

The order of fields within records in the Registry was adjusted as necessary to match the order in which these fields are described in [draft‑ietf‑ltru‑4646bis‑17] (Phillips, A., Ed. and M. Davis, Ed., “Tags for Identifying Languages,” September 2008.), Section 3.1.2. This ordering is not required by [draft‑ietf‑ltru‑4646bis‑17] (Phillips, A., Ed. and M. Davis, Ed., “Tags for Identifying Languages,” September 2008.) and may not necessarily be reflected in future additions or modifications to the Registry.



 TOC 

3.  Updated Registry Contents

The remainder of this section specified the updated set of records for the Language Subtag Registry. This material was deleted before publication of this memo, to avoid any potential confusion with the Registry itself. The IANA Language Subtag Registry can be found at http://www.iana.org/numbers.html under “Language Tags”.

[RFC EDITOR NOTE: the remainder of this section is to be deleted upon publication.]

The updated contents of the Language Subtag Registry follow. This data is intended as a complete replacement for the current contents of the Registry. The Registry begins with the line that starts with the string “File-Date” and continues to the end of this section. Headers, footers, line breaks, and other vertical whitespace introduced by the RFC process are not significant. Leading horizontal whitespace relative to the “File-Date” line indicates a continued line in the record-jar format, and must not be deleted.

The Language Subtag Registry uses the UTF-8 character encoding. Non-ASCII characters represented in this document by XML hex entities (such as ´) MUST be converted by IANA to Unicode characters in the UTF-8 encoding when the updated Registry is created.



 TOC 

4.  Security Considerations

For security considerations relevant to the Language Subtag Registry and the use of language tags, see [draft‑ietf‑ltru‑4646bis‑17] (Phillips, A., Ed. and M. Davis, Ed., “Tags for Identifying Languages,” September 2008.).



 TOC 

5.  IANA Considerations

In its initial phase as an Internet-Draft, this memo contained a complete replacement of the contents of the Language Subtag Registry to be used by IANA in updating it. As an RFC, it contains a pointer to the Registry, which is maintained by IANA. The Language Subtag Registry can be found at http://www.iana.org/numbers.html under “Language Tags”. For details on the procedures for the format and ongoing maintenance of this Registry, see [draft‑ietf‑ltru‑4646bis‑17] (Phillips, A., Ed. and M. Davis, Ed., “Tags for Identifying Languages,” September 2008.).



 TOC 

6.  Changes

[RFC EDITOR NOTE: This section is provided for the convenience of reviewers and will be removed from the final document.]

This memo is a new work, not an incremental update of [RFC4645] (Ewell, D., “Initial Language Subtag Registry,” September 2006.). The procedure for populating the original Language Subtag Registry, specified by the earlier [RFC4646] (Phillips, A. and M. Davis, “Tags for Identifying Languages,” September 2006.), is included by reference to [RFC4645] (Ewell, D., “Initial Language Subtag Registry,” September 2006.). Therefore, no changes from [RFC4645] (Ewell, D., “Initial Language Subtag Registry,” September 2006.) are listed in this section.

Changes between draft-ietf-ltru-4645bis-05 and this version are:

Changes between draft-ietf-ltru-4645bis-04 and draft-ietf-ltru-4645bis-05 were:

Changes between draft-ietf-ltru-4645bis-03 and draft-ietf-ltru-4645bis-04 were:

Changes between draft-ietf-ltru-4645bis-02 and draft-ietf-ltru-4645bis-03 were:

Changes between draft-ietf-ltru-4645bis-01 and draft-ietf-ltru-4645bis-02 were:

Changes between draft-ietf-ltru-4645bis-00 and draft-ietf-ltru-4645bis-01 were:



 TOC 

7.  References



 TOC 

7.1. Normative References

[ISO639-3] International Organization for Standardization, “ISO 639-3:2007. Codes for the representation of names of languages - Part 3: Alpha-3 code for comprehensive coverage of languages, first edition,” 2007.
[ISO639-5] International Organization for Standardization, “ISO 639-5:2008. Codes for the representation of names of languages -- Part 5: Alpha-3 code for language families and groups,” May 2008.
[RFC2119] Bradner, S., “Key words for use in RFCs to Indicate Requirement Levels,” BCP 14, RFC 2119, March 1997 (TXT, HTML, XML).
[draft-ietf-ltru-4646bis-17] Phillips, A., Ed. and M. Davis, Ed., “Tags for Identifying Languages,” September 2008.
[iso-639-3-macrolanguages_20080218] International Organization for Standardization, “ISO 639-3 Macrolanguage Mappings,” January 2008.
[iso-639-3_20080902] International Organization for Standardization, “ISO 639-3 Code Set,” September 2008.
[iso-639-3_Name_Index_20080902] International Organization for Standardization, “ISO 639-3 Language Names Index,” September 2008.


 TOC 

7.2. Informative References

[ISO15924] International Organization for Standardization, “ISO 15924:2004. Information and documentation -- Codes for the representation of names of scripts,” January 2004.
[ISO3166-1] International Organization for Standardization, “ISO 3166-1:2006. Codes for the representation of names of countries and their subdivisions -- Part 1: Country codes,” November 2006.
[ISO639-1] International Organization for Standardization, “ISO 639-1:2002. Codes for the representation of names of languages -- Part 1: Alpha-2 code,” 2002.
[ISO639-2] International Organization for Standardization, “ISO 639-2:1998. Codes for the representation of names of languages -- Part 2: Alpha-3 code, first edition,” 1998.
[RFC1766] Alvestrand, H., “Tags for the Identification of Languages,” RFC 1766, March 1995 (TXT).
[RFC2629] Rose, M., “Writing I-Ds and RFCs using XML,” RFC 2629, June 1999 (TXT, HTML, XML).
[RFC3066] Alvestrand, H., “Tags for the Identification of Languages,” RFC 3066, January 2001 (TXT).
[RFC4645] Ewell, D., “Initial Language Subtag Registry,” RFC 4645, September 2006 (TXT).
[RFC4646] Phillips, A. and M. Davis, “Tags for Identifying Languages,” BCP 47, RFC 4646, September 2006 (TXT).
[UN_M.49] Statistics Division, United Nations, “Standard Country or Area Codes for Statistical Use,” UN Standard Country or Area Codes for Statistical Use, Revision 4 (United Nations publication, Sales No. 98.XVII.9), June 1999.


 TOC 

Appendix A.  Acknowledgements

This memo is a collaborative work of the Language Tag Registry Update (LTRU) Working Group. All of its members have made significant contributions to this memo and to its predecessor, [RFC4645] (Ewell, D., “Initial Language Subtag Registry,” September 2006.).

Specific contributions to this memo were made by Stéphane Bortzmeyer, John Cowan, Mark Davis, Martin Duerst, Frank Ellermann, Debbie Garside, Kent Karlsson, Addison Phillips, and Randy Presuhn.

This document was written with the xml2rfc tool described in [RFC2629] (Rose, M., “Writing I-Ds and RFCs using XML,” June 1999.).



 TOC 

Author's Address

  Doug Ewell (editor)
  Consultant
Email:  doug@ewellic.org
URI:  http://www.ewellic.org


 TOC 

Full Copyright Statement

Intellectual Property