I’m helping score the Analytical Writing Placement Exams for incoming UC freshmen this week. For those of you not familiar with the exam, 17 and 18 year olds who have been admitted to a UC wake up at some ungodly early hour of a Saturday morning, sit down, read a passage, and write an essay. It’s my job to look for things like sentence variety, organization and structure, arguments, analysis, and a general understanding of the prompt. Of course, multiple grammatical errors, poor variety in vocabulary, and numerous misspellings can hurt, but I can be forgiving for one or two misspelled words. After all, they don’t have access to spellcheck or wikipedia, and no one writes perfectly well without a chance to edit, especially on a Saturday morning when you’re 17.

The prompt this year has to do with socializing with strangers. One “error” I’ve seen in many essays (of a variety of skill levels, including those scoring “clearly competent”) is the use of conversate instead of have a conversation or converse. But is this really an error?

More »

My friend recently asked me how I make word clouds for presentations. Wordle is definitely a good choice. WordPress automatically makes word clouds out of my tags in the sidebar. But sometimes you can’t or don’t want to upload your data to places like WordPress or Wordle and you just want to use R (because you use R for everything else, so why not? Or is that just me?).

In a typical word cloud, word frequency is what determines the size of the word. As of this writing, the word cloud in my side bar (over there ) has “linguistics” and “programming” as clearly the largest words. Tags like “video games,” “language,” and “education” are also pretty big. There are also really small words like “Navajo” and “handwriting.” This reflects the frequency of each tag. Bigger tags are more frequent, so I write about linguistics a lot but not so much about Navajo in particular.

More »