Normalize Unicode Text - Online Unicode Tools (2024)

World's simplest unicode tool

This browser-based utility converts fancy Unicode text back to regular text. All Unicode glyphs that you paste or enter in the text area as the input automatically get converted to simple ASCII characters in the output. It normalizes Unicode letters, numbers, punctuation marks, ligatures, as well as Emojis that contain letters. You can configure this tool to output some of the characters as-is and also control the case of the output text. Created by encoding gurus from team Browserling.

we're on twitter! tweet tweet

Love what we do? You can follow us on Twitter for updates!

A link to this tool, including input, options and all chained tools.

Import from file

Save as...

Copy to clipboard

Download

Export to Pastebin

Can't convert.

An error has occured.

Chain with...

Remove chain

Save as...

Copy to clipboard

Download

Export to Pastebin

Remove no tools?

This tool cannot be chained.

Skip Text Symbols

Enter Unicode letters, numbers,and marks that you don't wantto normalize.

Normalized Text Case

Preserve the input case fromthe input Unicode glyphs.

Reformat the output to usea proper sentence case.

Convert all letters in theoutput to capital letters.

Convert all letters in theoutput to lowercase letters.

What is a unicode text normalizer?

learn more about this tool

This online web application normalizes Unicode text. It converts all typographic Unicode glyphs to readable English characters from the ASCII charset. It supports over twenty different alphabets, a dozen of Unicode fonts, and it's also capable of recognizing letters by their shape. The tool transforms all letter-like symbols into letters of the English alphabet. It can normalize ligatures, for example, "ใŽž" as "km" and "ใ‘" as "ln", and word glyphs, for example, "๐Ÿ†•" as "NEW" and "๐Ÿ†—" as "OK". It also works with many numeric and math fonts and converts Unicode digits into regular Latin numerals 0 to 9 from the code position range U+0030 to U+0039. It can also normalize multi-digit numbers, for example, "โ‘ซ" to "12" and "ใ‰" to "60", as well as fractions, for example, "ยฝ" to "1/2" and "โ…—" to "3/5". It replaces fake punctuation marks with ASCII punctuation marks, for example, an emoji question mark "โ“" (U+2753) gets converted to a regular question mark "?" (U+003F), and a sine wave "โˆฟ" (U+223F) gets normalized to an ordinary tilde "~" (U+007E). It also normalizes all Unicode spaces and removes spaces of zero width. This app can also clear the combining characters in any text, as well as remove the Zalgo effect from the text and return clean symbols. If you want to preserve certain Unicode characters, you can enter them in the option skip symbols option field. The listed glyphs will then remain unchanged. You can also control the case of text letters in the output by letting the program automatically determine the case from the input, capitalizing only the first letter of each sentence, or turning all characters into upper or lower case.

Unicode text normalizer examples

Click to use

The Penguins of Madagascar

In this example, we normalize a dialogue from the cartoon The Penguins of Madagascar. Skipper's lines are written in a monospace font and Kowalski's replies use a bold-italic sans-serif font. In the input dialogue, there are also many weird commas, dots, dashes, and question marks. The program splits the text into individual graphemes and then for each grapheme finds the corresponding character in the ASCII table. All output characters are in the range from U+0000 to U+007F and have the same case as the input Unicode glyphs.

โ€” ๐™บ๐š˜๐š ๐šŠ๐š•๐šœ๐š”๐š’ูซ ๐š˜๐š™๐š๐š’๐š˜๐š—๐šœ๊“ธโ€” ๐˜ผ ๐™จ๐™ฉ๐™ง๐™–๐™ฉ๐™š๐™œ๐™ž๐™˜ ๐™ง๐™š๐™ฉ๐™ง๐™š๐™–๐™ฉอต ๐™Ž๐™ ๐™ž๐™ฅ๐™ฅ๐™š๐™งโ”โ€” ๐™ด๐šก๐š™๐š•๐šŠ๐š’๐š—๊“ธโ€” ๐™„๐™ฉสผ๐™จ ๐™ก๐™ž๐™ ๐™š ๐™ง๐™ช๐™ฃ๐™ฃ๐™ž๐™ฃ๐™œ ๐™–๐™ฌ๐™–๐™ฎ ๐™—๐™ช๐™ฉ ๐™ข๐™–๐™ฃ๐™ก๐™ž๐™š๐™งโ€ค

- Kowalski, options.- A strategic retreat, Skipper?- Explain.- It's like running away but manlier.

Required options

These options will be used automatically if you select this example.

Enter Unicode letters, numbers,and marks that you don't wantto normalize.

Preserve the input case fromthe input Unicode glyphs.

A Recipe For Happiness

In this example, we introduce a simple and useful recipe for everyday happiness. We use many bright and extraordinary Unicode characters here. Many letters contain combining marks, emoticons, as well as typographical ligatures. Unicode numbers use various shapes, fonts, and even fractional glyphs. Punctuation marks have a variety of styles and colors. We turn each Unicode symbol to plain text. We use the "Sentence Case" mode to properly capitalize only the first letter of each sentence and convert the rest of the text to lowercase.

เผ ๐˜ข lฬคษ๏ฝ’๐š๐—ฒ hฬคฬˆ๐’†๐•๐Ÿ…ฟ๐—ถ๐—‡๏ฝ‡ ๐จ๐™› แน•osฬคฬˆ๐ข๐Ÿ‡นโ“˜๐˜ท๐˜ช๐™ฉโ“จโโˆ— โถ ๐™ก๐šŠrฬคฬˆฯฑ๐Ÿ„ด ฦจmฬˆ๐•š๐•๐—ฒโเผ โ‘ก ๐Ÿ‡จ๐•ฆpฬคโ’ฎ ๐—ˆ๐—ณ ฦจ๐”€๏ฝ…๏ฝ…๐“ฝ๐•Ÿวsฬค๐—Œโธตโ‹† ยพ ใŽ ๐Ÿ…พ๐Ÿ‡ซ ๐–Œ๐™คoฬคฬˆdฬ ๐Ÿ…ข๐˜ฆโ’ฉโ“ข๐Ÿ„ด ๐—ˆ๐Ÿ…ต โ‚•แต˜โ‚˜๐˜ฐ๐Ÿ†อพเผ ยฝ แถœuฬค๏ฝ ๐Ÿ„พfฬค ฦจวlโ“•-๏ฝ…ล›แบ—โ‚‘๏ฝ…โ“œโธตโ‹† ๐Ÿ“โ“ช๐Ÿข ใŽค ๐Ÿ…ž๐“ฏ ๐Ÿ‡นสณ๐Ÿ†„๐“ฎ ๐–‹๐–บ๐’Š๐š๐—โโˆ— ๐Ÿ™ ๐—Œpฬˆ๐˜ฐoโฟfฬคแต˜๐Ÿ„ป แต’๐Ÿ„ต ๐ โ“ž๐™ค๐“ญแบ…๐˜ช๐š•โ’งอพเผ แ˜” ๐—‰๐ข๐˜ฏ๊œ€๐กว๐Ÿ†‚ ๏ฝ๐•— ๐Ÿ„ด๐–†๐—Œโ“จ ๐”คโ“ž๐š’๐Ÿ†–โโ‹† โ“โ’ฉdฬคฬˆ ๐Ÿ…ฐ ๐Ÿ„ท๐–พaฬคฬˆโ’ญ๐ญ โ“•uฬคฬˆ๐”ฉ๐Ÿ…› ๐™คfฬค ๏ฝŒ๐—ˆvฬ๐˜ฆ๊“ธ๐˜ฎรญโ‚“ ๐—โ’ชโ“–๐•–๐’•โ‚•รซษน ๐Ÿ…ฐ๐’๐—ฑ ๐˜ดสœษ’โ“ก๐”ขwฬค๐“ฒ๐”ฑ๐™ ๐”ฃ๐Ÿ‡ฆmฬค๐—‚๐š•โ“จ โ’œ๏ฝŽ๐ fฬrฬครฏ๐˜ฆโ’ฉ๐šsฬคโ—โ•

* A large helping of positivity;* 1 large smile;* 2 cups of sweetness;* 3/4 kg of good sense of humor;* 1/2 cup of self-esteem;* 500 cm^3 of true faith;* 1 spoonful of goodwill;* 2 pinches of easy going;* and a heart full of love.Mix together and sharewith family and friends!!

Required options

These options will be used automatically if you select this example.

Enter Unicode letters, numbers,and marks that you don't wantto normalize.

Reformat the output to usea proper sentence case.

Dinosaur Language

This example translates words from the Dinosaur language into English. The Dinosaur language contains many decorating Unicode symbols and even some Zalgo. The utility outputs symbols in clean text format so that it is easy to read the message. It preserves three Unicode characters: "โ‹—", "โ‹–", and "๐Ÿ…พ", by using the "Skip Text Symbols" option. Thus, we get an easy-to-read phrase in the output, with three decorating Unicode preserved.

แณแณแณแณแณโ‹—โ€œโ“‡โ“„โ’ถโ“‡โ€โ‹–แธแธแธแธแธแธแณแณแณแณแณโ‹—๐”ช๐”ข๐”ž๐”ซ๐”ฐโ‹–แธแธแธแธแธแธแธแณโ‹—โ€œ๐Ÿ…ธ ๐Ÿ…ป๐Ÿ…พ๐Ÿ†…๐Ÿ…ด ๐Ÿ†ˆ๐Ÿ…พ๐Ÿ†„โ€โ‹–แธแณแณแณแณแณแณแณโ‹—ใŒโ‹–แธแธแธแธแธแธแธแธแณแณแณแณแณโ‹—DอฌฬŒฬบฬ—ฬฎiอ‚ฬฬญอ…อ–nแท†อ’ฬžอšฬปoแท‡แท‡อšฬคฬบsอฆอฬœอœฬ˜aอ„อŒฬฆฬฃอ”uฬ“อ’ฬปแทŠฬจrอจอ‹ฬขฬงฬปโ‹–แธแธแธแธ

>>>>>โ‹—"ROAR"โ‹–<<<<<<>>>>>โ‹—meansโ‹–<<<<<<<>โ‹—"I L๐Ÿ…พVE Y๐Ÿ…พU"โ‹–<>>>>>>>โ‹—inโ‹–<<<<<<<<>>>>>โ‹—Dinosaurโ‹–<<<<

Required options

These options will be used automatically if you select this example.

Enter Unicode letters, numbers,and marks that you don't wantto normalize.

Preserve the input case fromthe input Unicode glyphs.

Pro tips Master online unicode tools

You can pass input to this tool via

?input

query argument and it will automatically compute output. Here's how to type it in your browser's address bar. Click to try!

https://onlineunicodetools.com/normalize-unicode-text

?input

=%E2%80%94%20%F0%9D%99%BA%F0%9D%9A%98%F0%9D%9A%A0%F0%9D%9A%8A%F0%9D%9A%95%F0%9D%9A%9C%F0%9D%9A%94%F0%9D%9A%92%D9%AB%20%F0%9D%9A%98%F0%9D%9A%99%F0%9D%9A%9D%F0%9D%9A%92%F0%9D%9A%98%F0%9D%9A%97%F0%9D%9A%9C%EA%93%B8%0A%E2%80%94%20%F0%9D%98%BC%20%F0%9D%99%A8%F0%9D%99%A9%F0%9D%99%A7%F0%9D%99%96%F0%9D%99%A9%F0%9D%99%9A%F0%9D%99%9C%F0%9D%99%9E%F0%9D%99%98%20%F0%9D%99%A7%F0%9D%99%9A%F0%9D%99%A9%F0%9D%99%A7%F0%9D%99%9A%F0%9D%99%96%F0%9D%99%A9%CD%B5%20%F0%9D%99%8E%F0%9D%99%A0%F0%9D%99%9E%F0%9D%99%A5%F0%9D%99%A5%F0%9D%99%9A%F0%9D%99%A7%E2%9D%94%0A%E2%80%94%20%F0%9D%99%B4%F0%9D%9A%A1%F0%9D%9A%99%F0%9D%9A%95%F0%9D%9A%8A%F0%9D%9A%92%F0%9D%9A%97%EA%93%B8%0A%E2%80%94%20%F0%9D%99%84%F0%9D%99%A9%CA%BC%F0%9D%99%A8%20%F0%9D%99%A1%F0%9D%99%9E%F0%9D%99%A0%F0%9D%99%9A%20%F0%9D%99%A7%F0%9D%99%AA%F0%9D%99%A3%F0%9D%99%A3%F0%9D%99%9E%F0%9D%99%A3%F0%9D%99%9C%20%F0%9D%99%96%F0%9D%99%AC%F0%9D%99%96%F0%9D%99%AE%20%F0%9D%99%97%F0%9D%99%AA%F0%9D%99%A9%20%F0%9D%99%A2%F0%9D%99%96%F0%9D%99%A3%F0%9D%99%A1%F0%9D%99%9E%F0%9D%99%9A%F0%9D%99%A7%E2%80%A4&ignore-letters=&autocase=true

All unicode tools

Didn't find the tool you were looking for? Let us know what tool we are missing and we'll build it!

Quickly find code positions of all Unicode values.

Quickly decode code positions to Unicode values.

Quickly encode Unicode values to UTF-8 encoding.

Quickly encode Unicode values to UTF-16 encoding.

Quickly encode Unicode values to UTF-32 encoding.

Quickly create a picture from Unicode symbols.

Quickly generate random Unicode text in a given range.

Quickly generate all Unicode values from the given code point interval.

Quickly filter Unicode symbols that are within the given code point interval.

Quickly split Unicode data into graphemes.

Quickly sort Unicode glyphs in increasing or decreasing order.

Quickly find the length of Unicode text.

Quickly increase Unicode code point values.

Quickly decrease Unicode code point values.

Quickly reverse the order of symbols in Unicode text.

Quickly rotate Unicode characters to the left and right.

Quickly create multiple copies of Unicode text.

Quickly extract all characters from Unicode text.

Quickly split Unicode data into pieces.

Quickly split Unicode text into chunks of constant length.

Quickly merge Unicode snippets together.

Quickly shorten Unicode text to the given length.

Quickly left-pad Unicode text with any character.

Quickly right-pad Unicode text with any character.

Quickly align Unicode data to the center.

Quickly align Unicode data to the right.

Quickly convert ordinary numbers to fancy Unicode numbers.

Quickly convert Unicode numbers back to regular numbers.

Quickly convert ordinary letters to Unicode letters in various fonts.

Quickly convert Unicode letters back to regular Latin letters.

Quickly convert ordinary text to fancy Unicode text.

Quickly convert fancy Unicode text back to regular text.

Quickly combine input Unicode with diacritical marks.

Quickly delete diacritical marks from the Unicode text.

Quickly spoof regular text using Unicode hom*oglyphs.

Quickly circularly rearrange Unicode symbols.

Quickly convert Unicode data to escape sequences.

Quickly encode Unicode data to HTML entities.

Quickly URL-escape Unicode symbols.

Quickly encode Unicode values to base64.

Quickly encode Unicode values to a data URI.

Quickly convert Unicode characters to raw bytes.

Quickly convert Unicode data to base-2 (binary).

Quickly convert Unicode data to base-8 (octal).

Quickly convert Unicode data to base-10 (decimal).

Quickly convert Unicode data to base-16 (hexadecimal).

Quickly convert Unicode symbols to raw ASCII bytes.

Quickly convert ASCII bytes to Unicode symbols.

Quickly convert Unicode text to a string literal.

Quickly create a picture from Unicode emojis.

Quickly release Zalgo on your Unicode text.

Quickly make Zalgo text readable again.

Coming soon These unicode tools are on the way

Edit Unicode

View and edit Unicode in a browser-based editor.

Name Unicode Symbols

Spell out the names of Unicode characters in the input text.

URL-decode Unicode

URL-unescape Unicode text.

Convert Binary to Unicode

Convert base-2 data to Unicode encoding.

Convert Octal to Unicode

Convert base-8 data to Unicode encoding.

Convert Decimal to Unicode

Convert base-10 data to Unicode encoding.

Convert Hex to Unicode

Convert base-16 data to Unicode encoding.

Convert Unicode to Any Base

Convert Unicode text to any radix.

Convert Any Base to Unicode

Convert any radix data to Unicode.

Convert Unicode to Latin1

Convert Unicode text to ISO-8859-1 encoding.

Convert Latin1 to Unicode

Convert ISO-859-1 encoded data to Unicode.

Convert Unicode to Latin2

Convert Unicode text to ISO-8859-2 encoding.

Convert Latin2 to Unicode

Convert ISO-8859-2 encoded data to Unicode.

Convert Unicode to Ecoji

Convert Unicode text to Ecoji encoding.

Convert Ecoji to Unicode

Convert Ecoji encoded data to Unicode.

Convert Bytes to Unicode

Convert raw bytes to Unicode.

Check Unicode Version

Check the Unicode version of the given Unicode characters.

Validate Unicode

Check if the given Unicode has valid encoding.

Convert Unicode to Punycode

Encode Unicode text to Punycode encoding.

Convert Punycode to Unicode

Decode Punycode encoding to Unicode.

Decode Base64 to Unicode

Convert base64 data to Unicode text.

Encode Unicode to Data URI

Convert Unicode to a valid data URL.

Decode Data URI to Unicode

Convert a valid data URL to Unicode text.

Convert HTML to Unicode

Decode HTML entities to Unicode data.

Convert UTF8 to Unicode

Decode UTF8 encoding to Unicode.

Convert UTF16 to Unicode

Decode UTF16 encoding to Unicode.

Convert UTF32 to Unicode

Decode UTF32 encoding to Unicode.

Convert Unicode to Uppercase

Convert all Unicode characters to uppercase.

Convert Unicode to Lowercase

Convert all Unicode characters to lowercase.

Print All Unicode Flags

Generate a list of all country flag icons.

Print All Unicode Arrows

Generate a list of all Unicode arrows.

Print All Unicode Animals

Generate a list of all Unicode animals.

Print All Unicode Flowers and Plants

Generate a list of all Unicode flowers and plants.

Print All Unicode Blocks

Generate a list of all Unicode block elements.

Print All Unicode Hieroglyphs

Generate a list of all Egyptian hieroglyphs.

Print All Unicode Currencies

Generate a list of all currency symbols.

Generate a Unicode Rainbow

Use Unicode colors to generate a rainbow.

Generate a Lenny Face

Create a smiley face from Unicode symbols.

Generate Random Emoji

Generate a list of random emojis.

Convert Unicode to Randomcase

Randomize case of all Unicode characters.

Convert Unicode to Lowercase

Convert all Unicode characters to lowercase.

JSON Stringify Unicode

Encode Unicode to JSON.

JSON Parse Unicode

Decode JSON to Unicode.

Shuffle Unicode Symbols

Randomly rearrange the order of input graphemes.

Convert Unicode to Alt Code

Generate Alt codes for Unicode characters.

Convert Alt Code to Unicode

Generate Unicode glyphs from Alt codes.

Analyze Unicode

Print statistics about Unicode data and code points.

Slice Unicode

Extract a part from Unicode data.

Draw Unicode Waves

Generate waves with Unicode symbols.

Draw Unicode Sparklines

Generate graphs using Unicode symbols.

Normalize Unicode Text - Online Unicode Tools (2024)
Top Articles
Latest Posts
Article information

Author: Zonia Mosciski DO

Last Updated:

Views: 5828

Rating: 4 / 5 (51 voted)

Reviews: 82% of readers found this page helpful

Author information

Name: Zonia Mosciski DO

Birthday: 1996-05-16

Address: Suite 228 919 Deana Ford, Lake Meridithberg, NE 60017-4257

Phone: +2613987384138

Job: Chief Retail Officer

Hobby: Tai chi, Dowsing, Poi, Letterboxing, Watching movies, Video gaming, Singing

Introduction: My name is Zonia Mosciski DO, I am a enchanting, joyous, lovely, successful, hilarious, tender, outstanding person who loves writing and wants to share my knowledge and understanding with you.