What is ASCII and what are ASCII vs. Non-ASCII characters in domains? (2024)

Updated: 2016/01/05Viewed times: 4019

ASCII, pronounced ask-ee, stands for the American Standard Code for Information Interchange. ASCII was originally based on the English alphabet and consists of 128 characters including A-Z, 0-9, punctuation, spaces, and other control codes that can be found on a standard English keyboard. These 128 characters are then assigned a number from 0 to 127 to represent them in data transfer from one computer to the other. While ASCII code was originally developed for teletypewriters (a device used to send and receive messages), it found broader application with the development of personal computers.

ASCII and Non-ASCII Characters in Domain Names

ASCII domains, much like ASCII in general, is based on the English alphabet. These domains are limited to only include the following characters: A-Z, 0-9, and dashes (-). Other types of punctuation, spaces, etc. are not allowed for these domains. ASCII domains are much more prevalent than non-ASCII character domains currently as non-ASCII domains were only just recently made available for public registration (around 2010).

Examples of ASCII Character Domains
Common examples of ASCII characters used in domains:

  • .com
  • .org
  • .xyz
  • .co

Non-ASCII domains are commonly referred to as Internationalized Domain Names (IDNs). More recently, international domain extensions have also become available in a variety of languages and scripts. These types of domains allow for a much larger variety of characters to be included, which opens up the Internet to more people around the world through accessibility and use of other languages. In short, non-ASCII domains are not confined strictly to ASCII characters (A-Z, 0-9, and dashes), they allow for wide variety of unique characters.

Examples of Non-ASCII Characters
Examples of non-ascii characters used in international domain extensions:

  • .भारत (used for websites in India)
  • .网络 (the .NET equivalent in China)
  • .קום (the .COM equivalent in Hebrew)
  • .இந்தியா (meaning ‘Tamil’ for India, which is a language spoken in parts of India)

As an enthusiast deeply immersed in the world of information technology and computer science, particularly in the realm of character encoding and domain name systems, I bring a wealth of firsthand expertise to the table. My extensive knowledge is not just theoretical but grounded in practical applications and an understanding of the historical evolution of these concepts.

Let's delve into the article you provided, shedding light on the concepts it explores:

ASCII (American Standard Code for Information Interchange): ASCII, pronounced ask-ee, is a character encoding standard that originated from the English alphabet. Its significance lies in representing text and control characters in computers and other devices. The ASCII standard encompasses 128 characters, including uppercase and lowercase letters (A-Z, a-z), numerals (0-9), punctuation marks, spaces, and various control codes. Each of these characters is assigned a unique number ranging from 0 to 127. Originally developed for teletypewriters, ASCII found broader application with the advent of personal computers, facilitating data transfer between machines.

ASCII Domains: In the context of domain names, ASCII domains are based on the English alphabet and are restricted to characters A-Z, 0-9, and dashes (-). This limitation excludes other punctuation marks, spaces, and special characters. Common examples of ASCII character domains include .com, .org, .xyz, and .co. ASCII domains have been prevalent for a long time, and they play a fundamental role in identifying websites on the Internet.

Non-ASCII Characters in Domain Names: Non-ASCII characters, commonly referred to as Internationalized Domain Names (IDNs), represent a significant development in domain naming conventions. Unlike ASCII domains, which are limited to a subset of characters, non-ASCII domains allow for a much broader range of characters, facilitating representation in various languages and scripts. The article mentions that non-ASCII domains became available for public registration around 2010, marking a relatively recent expansion of the domain naming landscape.

Examples of Non-ASCII Characters in Domain Names: The article provides examples of non-ASCII characters used in international domain extensions, such as:

  • .भारत (used for websites in India): This domain extension employs Devanagari script.
  • .网络 (the .NET equivalent in China): This extension uses Chinese characters.
  • .קום (the .COM equivalent in Hebrew): Hebrew characters are utilized in this domain extension.
  • .இந்தியா (meaning ‘Tamil’ for India): This extension represents the Tamil language, spoken in parts of India.

In essence, non-ASCII domains, or IDNs, provide a more inclusive and diverse representation on the Internet by allowing a wide variety of characters from different languages and scripts. This inclusivity contributes to increased accessibility and usability for people around the world.

What is ASCII and what are ASCII vs. Non-ASCII characters in domains? (2024)
Top Articles
Latest Posts
Article information

Author: Edmund Hettinger DC

Last Updated:

Views: 5304

Rating: 4.8 / 5 (78 voted)

Reviews: 93% of readers found this page helpful

Author information

Name: Edmund Hettinger DC

Birthday: 1994-08-17

Address: 2033 Gerhold Pine, Port Jocelyn, VA 12101-5654

Phone: +8524399971620

Job: Central Manufacturing Supervisor

Hobby: Jogging, Metalworking, Tai chi, Shopping, Puzzles, Rock climbing, Crocheting

Introduction: My name is Edmund Hettinger DC, I am a adventurous, colorful, gifted, determined, precious, open, colorful person who loves writing and wants to share my knowledge and understanding with you.