Tuesday, January 04, 2011

Ad Hoc Data Analysis From The Unix Command Line - Wikibooks, open books for an open world

"Once upon a time, I was working with a colleague who needed to do some quick data analysis to get a handle on the scope of a problem. He was considering importing the data into a database or writing a program to parse and summarize that data. Either of these options would have taken hours at least, and possibly days. I wrote this on his whiteboard:
Your friends: cat, find, grep, wc, cut, sort, uniq
These simple commands can be combined to quickly answer the kinds of questions for which most people would turn to a database, if only the data were already in a database. You can quickly (often in seconds) form and test hypotheses about virtually any record oriented data source."
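
To make the quoted idea concrete, here is a rough sketch of how a few of those commands might be chained together. The sample file, its comma-separated layout (user,action,status), and the two questions asked of it are made up for illustration and do not come from the Wikibooks text; `head` and `mktemp` are not on the whiteboard list either and are used here only for convenience.

```shell
# Hypothetical sample data, one record per line: user,action,status
tmp=$(mktemp)
cat > "$tmp" <<'EOF'
alice,login,ok
bob,login,fail
alice,upload,ok
carol,login,ok
bob,login,fail
alice,login,ok
EOF

# Question 1: how many records have a failed status?
# grep -c counts the lines that match the pattern.
failures=$(grep -c ',fail$' "$tmp")

# Question 2: which user appears in the most records?
# cut keeps field 1, sort puts equal values next to each other,
# uniq -c counts each run, sort -rn ranks the counts highest first.
top_line=$(cut -d, -f1 "$tmp" | sort | uniq -c | sort -rn | head -n 1)

echo "failed records: $failures"
echo "busiest user (count, name): $top_line"

rm -f "$tmp"
```

The `sort | uniq -c | sort -rn` chain plays roughly the role that GROUP BY with a count and ORDER BY would play in a database, which is the kind of substitution the quote is describing.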

2 comments:

Anonymous said...

Certainly, I adore it: interesting and well-founded thoughts. You need to write more interesting articles.

Mobile home movers said...

Nice effort, very informative, this will help me to complete my task.