Free tech support / small logo


Shell script to convert lowercase to title case?

As part of a project I'm working on, I find myself deep in a Linux shell script, needing to have a subroutine that converts a sentence of all lowercase to title case. You know, from "this is a test case" to "This is a Test Case". Not every word, just the right ones. Doable?


Dave's Answer:

That's an interesting project to chew on, actually, because there are a couple of very different ways to address the problem. Actually, that's pretty typical of Linux shell script programming because there are so many different commands in Linux.

The first thought I had was to use "sed" in a loop, basically doing something like "find [ ][a-z]" and replace it with "[ ][A-Z]" but that both strikes me as inefficient and a solution that'll make it just about impossible to skip words that shouldn't be capitalized.

Instead, the more logical solution seems to be breaking the sentence down into individual words, testing the words against the "skip" words, then fixing each of the remaining words before the fixed sentence is reassembled.

The trickiest part is to fix the first letter of the word, right? Or is it...

In fact, here's an easy way to break a word down into the first letter and the remaining letters:

firstletter=$(echo $word | cut -c1 | tr '[[:lower:]]' '[[:upper:]]')
otherletters=$(echo $word | cut -c2-)

It's one of my favorite commands, "cut", and we're using its ability to chop up what it's given character-by-character. Then "tr" transliterates lowercase to uppercase.

The main loop is pretty straightforward:

for word in $*
do
  per-word code goes here
done

The most interesting part is perhaps how to skip words that shouldn't be capitalized. After thinking about a couple of possibilities, here's what I came up with:

case $word in
  the|and|an|or|a|of) /bin/echo -n "$word "; continue; ;;
esac

The problem? If the first word is one of these stop words, it still needs to be capitalized, so it's a bit more nuanced: these words should only be skipped if they aren't the first word that appears in the sentence. Which means we're going to need to keep track of how many words we've scanned...

Easily done, though. Super easy. Just add a conditional around the "case" statement:

if [ $wordcount -gt 0 ] ; then ...

The fastest way to keep incrementing the counter variable is to use the shell's built-in mathematical capabilities:

wordcount=$(( $wordcount + 1 ))

That's 95% of things, so I'll let you put all the pieces together properly to get it to work...


Also check out:








Subscribe!
Never miss another Q&A article! Click to subscribe: Add to Google Reader Add to My Yahoo! Subscribe in NewsGator RDF XML
Comments · Add Comment
Starbucks coffee cup I do have a lot to say, and questions of my own for that matter, but first I'd like to say thank you for all your efforts on this Web site by buying you a cup of coffee!
Rather amazingly, there are no comments on this article yet.

I do have a comment, now that you mention it!











Remember personal info?


Please note that I will never send you any unsolicited email. Ever.

While I'm at it, please note that by submitting a question or comment you're agreeing to my terms of service, which are: you relinquish any subsequent rights of ownership to your material by submitting it on this site.









Recent Entries


Search
I Need Help!
Need Help? Ask Dave Taylor!


© 2002 - 2012 by Dave Taylor. All Rights Reserved.

Note: This web site is for the purpose of disseminating information for educational purposes, free of charge, for the benefit of all visitors. We take great care to provide quality information. However, we do not guarantee, and accept no legal liability whatsoever arising from or connected to, the accuracy, reliability, currency or completeness of any material contained on this web site or on any linked site.

[whiteboard marker tray]
"Ask Dave Taylor®" is a registered trademark of Intuitive Systems, LLC.