strlen: how does it work?

How does strlen() work internally?

Answers


strlen usually works by counting the characters in a string until a \0 character is found. A canonical implementation would be:

size_t strlen (char *str) {
    size_t len = 0;
    while (*str != '\0') {
        str++;
        len++;
    }
    return len;
}

There is no inherent bug in the function, it works exactly as documented.

That's not to say it doesn't have problems, to wit:

  • if you pass it a "string" that doesn't have a \0 at the end, you may run into problems but technically, that's not a C string (a) and it's your own fault.
  • you can't put \0 characters in your string but, again, it wouldn't be a C string in that case.
  • it's not the most efficient way - you could store a length up front so you could get the length much quicker.

But none of those are bugs, they're just consequences of a design decision.

See also this excellent article by Joel Spolsky where he discusses various string formats and their characteristics, including normal C strings, Pascal strings and the combination of the two, null terminated Pascal strings, though he has a more, shall we say, "colorful" term for them :-)


(a) A C string is defined as a series of non-terminator characters (ie, any other than \0) followed by that terminator. Hence this definition disallows both embedded terminators within the sequence and sequences without such a terminator.

Or, putting it more succinctly (as per the ISO standard):

A string is a contiguous sequence of characters terminated by and including the first null character.


Need Your Help