DevKit Labs

String Inspector (Unicode & Bytes)

Inspect a string's length, encoding and every character's code point — 100% in your browser.

String
0
Characters · code points
0
Graphemes · visible units
0
UTF-16 units · string.length
0
UTF-8 bytes · encoded size
0
Words
0
Lines

About String Inspector (Unicode & Bytes)

Paste a string to see exactly what it's made of: the number of Unicode code points, visible graphemes (which differ for emoji and combined characters), UTF-16 code units (JavaScript's string.length) and UTF-8 byte size. A per-character table shows each character's glyph, code point (U+XXXX), decimal value, UTF-8 byte count and category.

It's handy for debugging encoding issues, invisible characters and emoji that take more than one code unit. Everything runs locally in your browser; nothing is uploaded.

Why length looks wrong

Input
👨‍👩‍👧
Output
1 grapheme · 5 code points · 8 UTF-16 units

Emoji built from several code points explain surprising length values.

Frequently asked questions

Why is the character count different from string.length?

string.length counts UTF-16 code units; characters above U+FFFF (like many emoji) use two units. This tool also shows code points and graphemes so you can see the difference.

What is a grapheme?

A grapheme is a single visible character as a person perceives it. An emoji family or an accented letter can be one grapheme made of several code points.

Can it show UTF-8 byte size?

Yes. It reports the total UTF-8 byte length and the byte count of each individual character.

Is my text uploaded anywhere?

No. All analysis happens locally in your browser.

Related tools