Opening UTF-8 Text Files (2024)

Unicode Tutorials - Herong's Tutorial Examples

Using Notepad as a Unicode Text Editor

Opening UTF-8 Text Files

This section provides a tutorial example on how to open a UTF-8 text file correctly with Notepad by selecting the UTF-8 encoding option in the open file dialog box.

According to the Notepad help information, Notepad supports 3 Unicode encodings: Unicode (which is UTF-16LE), UTF-8, and big-endian Unicode (which is UTF-16BE). Let's first try to use Notepad to open the UTF-8 text file, hello.utf-8, created in the previous chapter.
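To make the difference between the three options concrete, here is a minimal Python sketch that encodes the same text in each of them and prints the resulting bytes. The sample text is hypothetical, since the content of hello.utf-8 is not shown in this section, and the mapping from Notepad option names to Python codec names (utf-16-le, utf-8, utf-16-be) is an assumption made for illustration. The sketch also leaves out any byte order mark that Notepad may write.

```python
# Hypothetical sample text; the real content of hello.utf-8 is not shown here.
sample = "Hello \u4e16\u754c"

# Assumed mapping from Notepad's encoding options to Python codec names.
notepad_options = {
    "Unicode": "utf-16-le",
    "UTF-8": "utf-8",
    "Unicode big endian": "utf-16-be",
}

def encode_all(text):
    """Return a dict of option name -> space-separated hex bytes of the encoded text."""
    return {
        name: text.encode(codec).hex(" ")
        for name, codec in notepad_options.items()
    }

# Print the byte sequence produced by each option for the same text.
for name, hex_bytes in encode_all(sample).items():
    print(f"{name:20} {hex_bytes}")
```

The same characters produce three different byte sequences, which is why the encoding selected when opening a file has to match the one used when the file was saved.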

1. Run Notepad and click menu File > Open. The open file dialog box comes up.

2. Select the hello.utf-8 text file and select the UTF-8 option in the Encoding field. See the picture below:

Picture 1: Opening UTF-8 Text Files - the open file dialog box with UTF-8 selected in the Encoding field

3. Click the Open button. The UTF-8 file opens in the editor correctly. See the picture below:

Picture 2: Opening UTF-8 Text Files - the UTF-8 file displayed correctly in the Notepad editor

Very nice. This shows that Notepad can open a UTF-8 text file correctly if the UTF-8 encoding option is selected.
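The same idea, naming the encoding explicitly when opening a file, can be sketched in Python. This is only an illustration and says nothing about how Notepad works internally. The helper name read_utf8 and the sample text are hypothetical, and the sketch writes its own stand-in file to a temporary directory instead of using the real hello.utf-8 from the previous chapter.

```python
import os
import tempfile

def read_utf8(path):
    """Read a text file, decoding its bytes as UTF-8."""
    with open(path, encoding="utf-8") as f:
        return f.read()

# Hypothetical stand-in for hello.utf-8, written as raw UTF-8 bytes.
sample = "Hello \u4e16\u754c"
path = os.path.join(tempfile.mkdtemp(), "hello.utf-8")
with open(path, "wb") as f:
    f.write(sample.encode("utf-8"))

# Decoding with the matching encoding gives back the original text.
text = read_utf8(path)
print(text == sample)
```

Passing encoding="utf-8" plays the same role as selecting UTF-8 in the Encoding field: it tells the reader how to turn the bytes in the file back into characters.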

If you select a different encoding, like Unicode, the bytes in the UTF-8 text file will be interpreted as a different encoding and the file will be displayed incorrectly. Try it out yourself.
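What goes wrong with a mismatched encoding can be sketched in Python by decoding the same UTF-8 bytes twice, once as UTF-8 and once as UTF-16LE. The sample text and the helper name decode_as are hypothetical. What Notepad actually displays may differ, for example if it detects a byte order mark, so this only illustrates the byte reinterpretation itself.

```python
# Hypothetical sample text encoded as UTF-8 (12 bytes in total).
sample = "Hello \u4e16\u754c"
utf8_bytes = sample.encode("utf-8")

def decode_as(data, codec):
    """Decode bytes with the given codec, replacing undecodable sequences."""
    return data.decode(codec, errors="replace")

right = decode_as(utf8_bytes, "utf-8")       # matching encoding
wrong = decode_as(utf8_bytes, "utf-16-le")   # mismatched encoding

# ascii() escapes non-ASCII characters so the output is safe on any console.
print(ascii(right))
print(ascii(wrong))
print(right == sample, wrong == sample)
```

With the mismatched codec, every pair of bytes is read as one 16-bit code unit, so the 12 UTF-8 bytes turn into 6 unrelated characters instead of the original 8.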


Author: Roderick King
