Count the number of occurrences of a character in a string in Javascript

C

Corey

I have updated this answer. I like the idea of using a match better, but it is slower:

console.log(("str1,str2,str3,str4".match(/,/g) || []).length); //logs 3 console.log(("str1,str2,str3,str4".match(new RegExp("str", "g")) || []).length); //logs 4

Use a regular expression literal if you know what you are searching for beforehand, if not you can use the RegExp constructor, and pass in the g flag as an argument.

match returns null with no results thus the || []

The original answer I made in 2009 is below. It creates an array unnecessarily, but using a split is faster (as of September 2014). I'm ambivalent, if I really needed the speed there would be no question that I would use a split, but I would prefer to use match.

Old answer (from 2009):

If you're looking for the commas:

(mainStr.split(",").length - 1) //3

If you're looking for the str

(mainStr.split("str").length - 1) //4

Both in @Lo's answer and in my own silly performance test split comes ahead in speed, at least in Chrome, but again creating the extra array just doesn't seem sane.

test shows Firefox is a lot faster than any other browser when splitting. jsperf.com/count-the-number-of-occurances-in-string

Uh, I just tested vsync's jsperf and the regex was slower in Chrome, Firefox and IE. 68%, 100%, and 14% respectively. I have an i7 2600.

I really dislike the idea of using a regex because "you like it better". Regexes have their purpose, but generally when there's a simple non-regex solution it's a better choice. Also note that both methods create an array, so that's not a reason to use a regex either.

I like it better in this case for a reason. Splitting a string into an array to get a number of occurrences is a round about way to get that information. Splitting an array is only faster because of implementation details, something that can change, whereas getting the number of matches is an improvement in readability, the intent is obvious and doesn't create and fill an unused data structure.

split() is a basic tool in javascript, conceptually simple, and counting the splits gives clear intention and is totally readable.

u

user229044

There are at least five ways. The best option, which should also be the fastest (owing to the native RegEx engine) is placed at the top.

Method 1

("this is foo bar".match(/o/g)||[]).length;
// returns 2

Method 2

"this is foo bar".split("o").length - 1;
// returns 2

Split not recommended as it is resource hungry. It allocates new instances of 'Array' for each match. Don't try it for a >100MB file via FileReader. You can observe the exact resource usage using Chrome's profiler option.

Method 3

    var stringsearch = "o"
       ,str = "this is foo bar";
    for(var count=-1,index=-2; index != -1; count++,index=str.indexOf(stringsearch,index+1) );
// returns 2

Method 4

Searching for a single character

    var stringsearch = "o"
       ,str = "this is foo bar";
    for(var i=count=0; i<str.length; count+=+(stringsearch===str[i++]));
     // returns 2

Method 5

Element mapping and filtering. This is not recommended due to its overall resource preallocation rather than using Pythonian 'generators':

    var str = "this is foo bar"
    str.split('').map( function(e,i){ if(e === 'o') return i;} )
                 .filter(Boolean)
    //>[9, 10]
    [9, 10].length
    // returns 2

Share: I made this gist, with currently 8 methods of character-counting, so we can directly pool and share our ideas - just for fun, and perhaps some interesting benchmarks :)

It took me a little while to realize what ||[] was doing but this answer is great! For anyone else scratching their heads, match() returns null if no matches are found and ||[] will return a 0 length array if the match() returns null, meaning length() will return 0 instead of producing a type error.

Nathan, to my defense, I did elaborate on that prior to writing the code above: gist.github.com/2757164 . I want to avoid blog-posts of small code-pieces, which however would have allowed you instant access through google-search. Gist as a snippet repository is very sparsely indexed and less than ideal. PS: I too hate unclear syntactic idiosyncrasies.

Lo Sauer, no need to defend yourself, the code is solid and I learned something on my own by figuring out how it worked :) I prefer this method over what is actually marked as the answer. There should be no need to split a string if we're not going to use the results.

Your third method (also, unfortunately, the fastest), will miss any match at index 0 in the haystack. You could fix it by using a do...while loop instead: var strsearch = "o", str = "othis is foo bar", index = -1, count = -1; do { index = str.indexOf(strsearch, index+1); count++; } while (index != -1); count

It suffices to set the start index = -2, but thanks a lot @Augustus

P

Philippe Boissonneault

Add this function to sting prototype :

String.prototype.count=function(c) { 
  var result = 0, i = 0;
  for(i;i<this.length;i++)if(this[i]==c)result++;
  return result;
};

usage:

console.log("strings".count("s")); //2

what about "stringsstringstrings".count("str") ?

@Toskan Please look at OPs question, which is about "a character". I guess that is why the parameter is called "c" and not "s" - for "[c]haracter", not "[s]tring". Most of the other answers are HORRIBLE, people throw around arrays and extra objects for this extremely simple task as if memory allocation is free.

a

angabriel

Simply, use the split to find out the number of occurrences of a character in a string.

mainStr.split(',').length // gives 4 which is the number of strings after splitting using delimiter comma

mainStr.split(',').length - 1 // gives 3 which is the count of comma

u

user253751

A quick Google search got this (from http://www.codecodex.com/wiki/index.php?title=Count_the_number_of_occurrences_of_a_specific_character_in_a_string#JavaScript)

String.prototype.count=function(s1) { 
    return (this.length - this.replace(new RegExp(s1,"g"), '').length) / s1.length;
}

Use it like this:

test = 'one,two,three,four'
commas = test.count(',') // returns 3

error on * char (SyntaxError: nothing to repeat)

the argument has to be a regular expression. So if you want to count 's, you need to send in '[*]'

Y

Yosvel Quintero Arguelles

You can also rest your string and work with it like an array of elements using

Array.prototype.filter()

const mainStr = 'str1,str2,str3,str4'; const commas = [...mainStr].filter(l => l === ',').length; console.log(commas);

Or

Array.prototype.reduce()

const mainStr = 'str1,str2,str3,str4'; const commas = [...mainStr].reduce((a, c) => c === ',' ? ++a : a, 0); console.log(commas);

Is this an attempt to solve a simple problem with as much (actual) effort as possible?

I mean, it's the more sensible option. I don't know about performance, but it's a lot easier to understand "go filter this string's letters and count how many are left" than to understand "split this string every time you see this letter, then count how many there are, and subtract one."

V

Vencovsky

Here is a similar solution, but it uses Array.prototype.reduce

function countCharacters(char, string) {
  return string.split('').reduce((acc, ch) => ch === char ? acc + 1: acc, 0)
}

As was mentioned, String.prototype.split works much faster than String.prototype.replace.

h

halfbit

ok, an other one with regexp - probably not fast, but short and better readable then others, in my case just '_' to count

key.replace(/[^_]/g,'').length

just remove everything that does not look like your char but it does not look nice with a string as input

G

Geng Jiawen

If you are using lodash, the _.countBy method will do this:

_.countBy("abcda")['a'] //2

This method also work with array:

_.countBy(['ab', 'cd', 'ab'])['ab'] //2

V

Valera Rozuvan

I have found that the best approach to search for a character in a very large string (that is 1 000 000 characters long, for example) is to use the replace() method.

window.count_replace = function (str, schar) {
    return str.length - str.replace(RegExp(schar), '').length;
};

You can see yet another JSPerf suite to test this method along with other methods of finding a character in a string.

It is obvious that that if your code somehow iterates over a million characters 500000 times a second, my CPU is running at at least 100GHz (assuming no SIMD; even then it would be at least 40GHz). Thus I do not believe that this benchmark is correct.

C

Clive Paterson

Performance of Split vs RegExp

var i = 0; var split_start = new Date().getTime(); while (i < 30000) { "1234,453,123,324".split(",").length -1; i++; } var split_end = new Date().getTime(); var split_time = split_end - split_start; i= 0; var reg_start = new Date().getTime(); while (i < 30000) { ("1234,453,123,324".match(/,/g) || []).length; i++; } var reg_end = new Date().getTime(); var reg_time = reg_end - reg_start; alert ('Split Execution time: ' + split_time + "\n" + 'RegExp Execution time: ' + reg_time + "\n");

D

Dendromaniac

I made a slight improvement on the accepted answer, it allows to check with case-sensitive/case-insensitive matching, and is a method attached to the string object:

String.prototype.count = function(lit, cis) {
    var m = this.toString().match(new RegExp(lit, ((cis) ? "gi" : "g")));
    return (m != null) ? m.length : 0;
}

lit is the string to search for ( such as 'ex' ), and cis is case-insensitivity, defaulted to false, it will allow for choice of case insensitive matches.

'I love StackOverflow.com'

'o'

var amount_of_os = 'I love StackOverflow.com'.count('o');

amount_of_os would be equal to 2.

var amount_of_os = 'I love StackOverflow.com'.count('o', true);

This time, amount_of_os would be equal to 3, since the capital O from the string gets included in the search.

A

Ankur Choraywal

Easiest way i found out...

Example-

str = 'mississippi';

function find_occurences(str, char_to_count){
    return str.split(char_to_count).length - 1;
}

find_occurences(str, 'i') //outputs 4

concise! Thanks!

M

Md. Jamal Uddin

Here is my solution. Lots of solution already posted before me. But I love to share my view here.

const mainStr = 'str1,str2,str3,str4';

const commaAndStringCounter = (str) => {
  const commas = [...str].filter(letter => letter === ',').length;
  const numOfStr = str.split(',').length;

  return `Commas: ${commas}, String: ${numOfStr}`;
}

// Run the code
console.log(commaAndStringCounter(mainStr)); // Output: Commas: 3, String: 4

Here you find my REPL

N

NuSkooler

I just did a very quick and dirty test on repl.it using Node v7.4. For a single character, the standard for loop is quickest:

Some code:

// winner!
function charCount1(s, c) {
    let count = 0;
    c = c.charAt(0); // we save some time here
    for(let i = 0; i < s.length; ++i) {
        if(c === s.charAt(i)) {
            ++count;
        }
    }
    return count;
}

function charCount2(s, c) {
    return (s.match(new RegExp(c[0], 'g')) || []).length;
}

function charCount3(s, c) {
    let count = 0;
    for(ch of s) {
        if(c === ch) {
            ++count;
        }
    }
    return count;
}

function perfIt() {
    const s = 'Hello, World!';
    const c = 'o';

    console.time('charCount1');
    for(let i = 0; i < 10000; i++) {
        charCount1(s, c);
    }
    console.timeEnd('charCount1');
    
    console.time('charCount2');
    for(let i = 0; i < 10000; i++) {
        charCount2(s, c);
    }
    console.timeEnd('charCount2');
    
    console.time('charCount3');
    for(let i = 0; i < 10000; i++) {
        charCount2(s, c);
    }
    console.timeEnd('charCount3');
}

Results from a few runs:

perfIt()
charCount1: 3.301ms
charCount2: 11.652ms
charCount3: 174.043ms
undefined

perfIt()
charCount1: 2.110ms
charCount2: 11.931ms
charCount3: 177.743ms
undefined

perfIt()
charCount1: 2.074ms
charCount2: 11.738ms
charCount3: 152.611ms
undefined

perfIt()
charCount1: 2.076ms
charCount2: 11.685ms
charCount3: 154.757ms
undefined

Update 2021-Feb-10: Fixed typo in repl.it demo

Update 2020-Oct-24: Still the case with Node.js 12 (play with it yourself here)

Nice work @NuSkooler! A few caviats: Try it with longer strings, (like looking for '\n'-s a pop song), and try and run all the counts on different s. Say charCountN(s+i, c). Also, you have a typo: charCount3 is mistyped as charCount2, therefore the similar results. If you try it with my modifications, you'll find that split is the winner with regex match coming in as second.

Thanks @BarneySzabolcs! Updated the repl demo and re-ran the results. I did leave the strings the same -- as you say, YMMV. I'm sure some other tricks that work in lower level languages could also work here especially for longer strings.

w

wlf

s = 'dir/dir/dir/dir/'
for(i=l=0;i<s.length;i++)
if(s[i] == '/')
l++

J

Jakub Wawszczyk

I was working on a small project that required a sub-string counter. Searching for the wrong phrases provided me with no results, however after writing my own implementation I have stumbled upon this question. Anyway, here is my way, it is probably slower than most here but might be helpful to someone:

function count_letters() {
var counter = 0;

for (var i = 0; i < input.length; i++) {
    var index_of_sub = input.indexOf(input_letter, i);

    if (index_of_sub > -1) {
        counter++;
        i = index_of_sub;
    }
}

http://jsfiddle.net/5ZzHt/1/

Please let me know if you find this implementation to fail or do not follow some standards! :)

UPDATE You may want to substitute:

    for (var i = 0; i < input.length; i++) {

With:

for (var i = 0, input_length = input.length; i < input_length; i++) {

Interesting read discussing the above: http://www.erichynds.com/blog/javascript-length-property-is-a-stored-value

Yes, and it would work for substring, not only subchars. However, you need to add the parameters to the function :)

J

Joe Lapp

It's amazing that in 13 years, this answer hasn't shown up. Intuitively, it seems like it should be fastest:

const s = "The quick brown fox jumps over the lazy dog.";
const oCount = s.length - s.replaceAll('o', '').length;

If there are only two kinds of character in the string, then this is faster still:


const s = "001101001";
const oneCount = s.replaceAll('0', '').length;

u

user2296195

What about string.split(desiredCharecter).length-1

Example:

var str = "hellow how is life"; var len = str.split("h").length-1; will give count 2 for character "h" in the above string;

z

zoran404

The fastest method seems to be via the index operator:

Or as a prototype function:

String.prototype.charOccurances = function (char) { for (var c = 0, i = 0, len = this.length; i < len; ++i) { if (this[i] == char) { ++c; } } return c; } console.log( 'example/path/script.js'.charOccurances('/') ); // 2

D

DerpyCoder

function len(text,char){

return text.innerText.split(string).length
}

console.log(len("str1,str2,str3,str4",","))

This is a very short function.

U

User_coder

I know I am late to the party here but I was rather baffled no one answered this with the most basic of approaches. A large portion of the answers provided by the community for this question are iteration based but all are moving over strings on a per-character basis which is not really efficient.

When dealing with a large string that contains thousands of characters walking over each character to get the occurance count can become rather extraneous not to mention a code-smell. The below solutions take advantage of slice, indexOf and the trusted traditional while loop. These approaches prevent us having to walk over each character and will greatly speed up the time it takes to count occurances. These follow similar logic to that you'd find in parsers and lexical analyzers that require string walks.

Using with Slice

In this approach we are leveraging slice and with every indexOf match we will move our way through the string and eliminate the previous searched potions. Each time we call indexOf the size of the string it searches will be smaller.

function countChar (char: string, search: string): number {
  
  let num: number = 0;
  let str: string = search;
  let pos: number = str.indexOf(char);
  
  while(pos > -1) {
    str = str.slice(pos + 1);
    pos = str.indexOf(char);
    num++;
  }

  return num;

}

// Call the function
countChar('x', 'foo x bar x baz x') // 3

Using with IndexOf from position

Similar to the first approach using slice but instead of augmenting the string we are searching it will leverage the from parameter in indexOf method.

function countChar (char: string, str: string): number {
  
  let num: number = 0;
  let pos: number = str.indexOf(char);
  
  while(pos > -1) {
    pos = str.indexOf(char, pos + 1);
    num++;
  }

  return num;

}

// Call the function
countChar('x', 'foo x bar x baz x') // 3

Personally, I go for the second approach over the first, but both are fine and performant when dealing with large strings but also smaller sized ones too.

J

Jonathan Fingland

The following uses a regular expression to test the length. testex ensures you don't have 16 or greater consecutive non-comma characters. If it passes the test, then it proceeds to split the string. counting the commas is as simple as counting the tokens minus one.

var mainStr = "str1,str2,str3,str4";
var testregex = /([^,]{16,})/g;
if (testregex.test(mainStr)) {
  alert("values must be separated by commas and each may not exceed 15 characters");
} else {
  var strs = mainStr.split(',');
  alert("mainStr contains " + strs.length + " substrings separated by commas.");
  alert("mainStr contains " + (strs.length-1) + " commas.");
}

M

Marc K.

I'm using Node.js v.6.0.0 and the fastest is the one with index (the 3rd method in Lo Sauer's answer).

The second is:

function count(s, c) { var n = 0; for (let x of s) { if (x == c) n++; } return n; }

D

Damion Dooley

And there is:

function character_count(string, char, ptr = 0, count = 0) {
    while (ptr = string.indexOf(char, ptr) + 1) {count ++}
    return count
}

Works with integers too!

A

Andrew

Here's one just as fast as the split() and the replace methods, which are a tiny bit faster than the regex method (in Chrome and Firefox both).

let num = 0;
let str = "str1,str2,str3,str4";
//Note: Pre-calculating `.length` is an optimization;
//otherwise, it recalculates it every loop iteration.
let len = str.length;
//Note: Don't use a `for (... of ...)` loop, it's slow!
for (let charIndex = 0; charIndex < len; ++charIndex) {
  if (str[charIndex] === ',') {
    ++num;
  }
}

Wow! This turns out to be like 10X slower than the split().length - 1 approach for me in FF! This makes me mad!

Haha. I hate needing a performance optimization - but 10x means you need it! The FF JIT must be horrible. Too bad its my preferred browser... Hang in there - you never know what gets optimized and what doesn't.

Well it's probably just that split() is implemented at a lower level whereas storing and accessing characters of a string is at high level JS. I wonder what might prevent JITing your code... Hmm...

Yay! I figured it out! It doesn't like your for loop. I'll update your answer; with the regular for loop, it's just as fast as split(). I believe the reasons why are the same as listed here: stackoverflow.com/a/43821929/1599699

Andrew - you aren't supposed to change the answer, just comment on it. My original post provided an alternative that was optimized (at the time) in chrome, but not firefox. It was a different form that perhaps someone might have a use for. Your version matches other answers here making my answer hidden and useless. I'm glad you got your algorithm set. However, I'm not invested in Javascript enough to change my answer. This note is a historical comment for others. Check the edits for the original version.

T

Tyler Forsythe

var mainStr = "str1,str2,str3,str4"; var splitStr = mainStr.split(",").length - 1; // subtracting 1 is important! alert(splitStr);

Splitting into an array gives us a number of elements, which will always be 1 more than the number of instances of the character. This may not be the most memory efficient, but if your input is always going to be small, this is a straight-forward and easy to understand way to do it.

If you need to parse very large strings (greater than a few hundred characters), or if this is in a core loop that processes large volumes of data, I would recommend a different strategy.

While this code may provide a solution to the question, it's better to add context as to why/how it works. This can help future users learn and eventually apply that knowledge to their own code. You are also likely to have positive-feedback/upvotes from users, when the code is explained.

G

Gere

My solution:

function countOcurrences(str, value){
   var regExp = new RegExp(value, "gi");
   return str.match(regExp) ? str.match(regExp).length : 0;  
}

This will not work as String.prototype.match returns null without matches. That means no reference to an object with a length attribute. In other words: String.prototype.match.call('willnotwork', /yesitwill/) === null

V

Viscount Wathika

I know this might be an old question but I have a simple solution for low-level beginners in JavaScript.

As a beginner, I could only understand some of the solutions to this question so I used two nested FOR loops to check each character against every other character in the string, incrementing a count variable for each character found that equals that character.

I created a new blank object where each property key is a character and the value is how many times each character appeared in the string(count).

Example function:-

function countAllCharacters(str) {
  var obj = {};
  if(str.length!==0){
    for(i=0;i<str.length;i++){
      var count = 0;
      for(j=0;j<str.length;j++){
        if(str[i] === str[j]){
          count++;
        }
      }
      if(!obj.hasOwnProperty(str[i])){
        obj[str[i]] = count;
      }
    }
  }
  return obj;
}

J

Jack G

I believe you will find the below solution to be very short, very fast, able to work with very long strings, able to support multiple character searches, error proof, and able to handle empty string searches.

function substring_count(source_str, search_str, index) {
    source_str += "", search_str += "";
    var count = -1, index_inc = Math.max(search_str.length, 1);
    index = (+index || 0) - index_inc;
    do {
        ++count;
        index = source_str.indexOf(search_str, index + index_inc);
    } while (~index);
    return count;
}

Example usage:

console.log(substring_count("Lorem ipsum dolar un sit amet.", "m ")) function substring_count(source_str, search_str, index) { source_str += "", search_str += ""; var count = -1, index_inc = Math.max(search_str.length, 1); index = (+index || 0) - index_inc; do { ++count; index = source_str.indexOf(search_str, index + index_inc); } while (~index); return count; }

The above code fixes the major performance bug in Jakub Wawszczyk's that the code keeps on looks for a match even after indexOf says there is none and his version itself is not working because he forgot to give the function input parameters.

Count the number of occurrences of a character in a string in Javascript

Follow WeChat

Want to stay one step ahead of the latest teleworks?

相似问题

Platform

Support

Contact US