Orangwutang Word Cloud

I thought it might be interesting to create a word cloud out of my whole blog to see which words I’d used the most. It was a little more involved than I thought it would be – and frankly not that interesting.

First, I had to export my blog as an XML file, which is easy to do using WordPress. Then I converted that XML file into a PDF using www.blogbooker.com, which produced a 1,869-page PDF of my whole blog. (Pretty handy, actually, for backing up my blog in case WordPress ever goes down or whatever. I am emailing myself a copy of that PDF file, so I will always have a copy of it on Gmail. I will try to remember to back it up every three months or so.) Then I had to convert that PDF into TXT – I downloaded a small freeware program to do that. Finally, I copied and pasted the text into www.wordle.net to create the word cloud. If you click on the thumbnail below, it will show you a larger version of the cloud.

The only thing I don’t like about it is that when I converted the blog into PDF and then into text, the conversion inserted a lot of formatting-type words like “embedwebsite,” “page,” and “authkey.” I wish there were a way to tell Wordle not to include certain words – otherwise I would have to go through the text file and delete them all manually, I guess. Not worth it.
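For what it's worth, the manual deleting could probably be handed off to a short script. Here is a minimal Python sketch of the idea, not something I actually ran on the blog: it strips a list of junk words out of the text before it goes into the word cloud, and counts the most common words that remain. The names `JUNK_WORDS`, `remove_junk`, and `top_words` are made up for this example, and the list only holds the three junk words I noticed. One catch: dropping “page” this way would also drop every place I genuinely wrote the word “page.”

```python
import re
from collections import Counter

# Hypothetical junk list: just the three formatting words mentioned above.
JUNK_WORDS = {"embedwebsite", "page", "authkey"}


def remove_junk(text, junk=JUNK_WORDS):
    """Return text with every whole-word, case-insensitive match of a junk word removed."""
    alternatives = "|".join(re.escape(word) for word in sorted(junk))
    pattern = re.compile(r"\b(?:" + alternatives + r")\b", re.IGNORECASE)
    cleaned = pattern.sub("", text)
    # Collapse the runs of spaces and tabs left behind by the removals.
    return re.sub(r"[ \t]{2,}", " ", cleaned)


def top_words(text, n=10):
    """Return the n most frequent words in text as (word, count) pairs."""
    words = re.findall(r"[a-z']+", text.lower())
    return Counter(words).most_common(n)
```

To use it on the real thing, I would read the converted text file into a string, run it through `remove_junk`, save the result, and paste that into Wordle instead of the raw text.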

Anyway, for what it’s worth, here you go:

Wordle: Orangwutang