How To Count Strings With Emojis In JavaScript

white yellow and green round plastic toy

I love emojis. Who doesnโ€™t?

I was polishing off a highly intellectual X post a few days ago when I realized something.

X post showing the problem with counting emojis.

Emojis arenโ€™t counted the same as regular characters

When typing out emojis in the new post section of X, you can see how regular characters count less than emojis.

After a quick search, I found out it has something to do with how they are encoded in the Unicode system.

Essentially, emojis are made of multiple code points, and length only counts code points, not characters.

Regardless of why it happens, I thought about all the text counters Iโ€™ve created and how many exist in SaaS land.

Emojis are not getting their fair shake ๐Ÿ˜ข.

Simply taking the length of the string isnโ€™t an accurate count. Take, for example, something like this:

JavaScript
import { useState } from "react";

export default function App() {
  const [text, setText] = useState("");

  function countString() {
    return text.length;
  }

  function handleChange(e) {
    setText(e.target.value);
  }

  return (
    <div className="App">
      <h1>Make the emojis count ๐Ÿ‘</h1>
      <textarea value={text} onChange={handleChange} />
      <small>Characters: {countString()}</small>
    </div>
  );
}

This is a simple React component that tracks the characters typed into a text field. It is the most common implementation of this feature.

But the output gives us the same problem as my X post:

Sample app showing how using length to count strings in JavaScript doesn't work.

Modern web development makes it easy to count characters accurately

You can use a built-in object called Intl.Segmenter.

There is a much broader use case for the object, but it essentially breaks down strings into more meaningful items like words and sentences based on a locale you provide. It offers more granularity than simply using code points.

To fix our example above, all we have to do is update our countString function like this:

JavaScript
import { useState } from "react";

export default function App() {
  const [text, setText] = useState("");

  function countString() {
    return Array.from(new Intl.Segmenter().segment(text)).length;
  }

  function handleChange(e) {
    setText(e.target.value);
  }

  return (
    <div className="App">
      <h1>Make the emojis count ๐Ÿ‘</h1>
      <textarea value={text} onChange={handleChange} />
      <small>Characters: {countString()}</small>
    </div>
  );
}

We create a new instance of the Intl.Segmenter object and pass our text to it. We put that output into an array and then finally take the length, which will be far more accurate than simply taking the length of the original string.

Here is the result:

Sample app showing how using Intl.Segmenter to count strings in JavaScript is the correct way to count characters.

So why doesnโ€™t X count an emoji correctly?

Short answer: I have no idea.

Iโ€™ve been programming far too long to delude myself into thinking there is a simple answer.

But Intl.Segmenter has good browser support, and any performance or memory constraints would be negligible.

My best guess is that the codebase is so large and so old that it isnโ€™t worth the side effects of a refactor.

Iโ€™d be happy to learn more if anyone has better insight into this.

I hope this helps ๐Ÿ˜„.

Happy coding ๐Ÿค™.

Picture of Jeff Jakinovich

Jeff Jakinovich

๐Ÿ‘จโ€๐Ÿ’ป Full-stack developer & PM โ€ข ๐Ÿš€ My mission is to simplify work and spread curiosity in motion.