Network Working Group	A. Phillips, Ed.
Internet-Draft	Lab126
Obsoletes: 4646 (if approved)	M. Davis, Ed.
Intended status: BCP	Google
Expires: November 18, 2008	May 17, 2008

Tags for Identifying Languages
draft-ietf-ltru-4646bis-14

Status of this Memo

By submitting this Internet-Draft, each author represents that any applicable patent or other IPR claims of which he or she is aware have been or will be disclosed, and any of which he or she becomes aware will be disclosed, in accordance with Section 6 of BCP 79.

Internet-Drafts are working documents of the Internet Engineering Task Force (IETF), its areas, and its working groups. Note that other groups may also distribute working documents as Internet-Drafts.

Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as “work in progress.”

The list of current Internet-Drafts can be accessed at http://www.ietf.org/ietf/1id-abstracts.txt.

The list of Internet-Draft Shadow Directories can be accessed at http://www.ietf.org/shadow.html.

This Internet-Draft will expire on November 18, 2008.

Abstract

This document describes the structure, content, construction, and semantics of language tags for use in cases where it is desirable to indicate the language used in an information object. It also describes how to register values for use in language tags and the creation of user-defined extensions for private interchange.

Table of Contents

1. Introduction
2. The Language Tag
    2.1. Syntax
    2.2. Language Subtag Sources and Interpretation
        2.2.1. Primary Language Subtag
        2.2.2. Extended Language Subtags
        2.2.3. Script Subtag
        2.2.4. Region Subtag
        2.2.5. Variant Subtags
        2.2.6. Extension Subtags
        2.2.7. Private Use Subtags
        2.2.8. Grandfathered Registrations
        2.2.9. Classes of Conformance
3. Registry Format and Maintenance
    3.1. Format of the IANA Language Subtag Registry
        3.1.1. File Format
        3.1.2. Record Definitions
        3.1.3. Subtag and Tag Fields
        3.1.4. Description Field
        3.1.5. Deprecated Field
        3.1.6. Preferred-Value Field
        3.1.7. Prefix Field
        3.1.8. Suppress-Script Field
        3.1.9. Macrolanguage Field
        3.1.10. Comments Field
    3.2. Language Subtag Reviewer
    3.3. Maintenance of the Registry
    3.4. Stability of IANA Registry Entries
    3.5. Registration Procedure for Subtags
    3.6. Possibilities for Registration
    3.7. Extensions and the Extensions Registry
    3.8. Update of the Language Subtag Registry
4. Formation and Processing of Language Tags
    4.1. Choice of Language Tag
        4.1.1. Tagging Encompassed Languages
    4.2. Meaning of the Language Tag
    4.3. Lists of Languages
    4.4. Length Considerations
        4.4.1. Working with Limited Buffer Sizes
        4.4.2. Truncation of Language Tags
    4.5. Canonicalization of Language Tags
    4.6. Considerations for Private Use Subtags
5. IANA Considerations
    5.1. Language Subtag Registry
    5.2. Extensions Registry
6. Security Considerations
7. Character Set Considerations
8. Changes from RFC 4646
9. References
    9.1. Normative References
    9.2. Informative References
Appendix A. Acknowledgements
Appendix B. Examples of Language Tags (Informative)
Appendix C. Examples of Registration Forms
Authors' Addresses
Intellectual Property and Copyright Statements

Authors' Addresses

Addison Phillips (editor)
Lab126
Email: addison@inter-locale.com
URI: http://www.inter-locale.com

Mark Davis (editor)
Google
Email: mark.davis@google.com
