Favstar gets even better if you sign in.
Data Scientist (n.): Person who is better at statistics than any software engineer and better at software engineering than any statistician.
Riots breaking out in Noe Valley: two strollers overturned and on fire at 24th and Sanchez.
Great post by the eng team at @cerner on building composable MapReduce pipelines on #hadoop with #crunch: https://engineering.cerner.com/2013/02/composable-mapreduce-with-hadoop-and-crunch/ …
Just got an email from LinkedIn that my profile was among the least viewed 1% in 2012!!!
Some command line tools I wrote with Crunch that make Mahout easier to use for new data scientists: http://blog.cloudera.com/blog/2013/03/cloudera_ml_data_science_tools/ …
This is one of my favorite techniques for building random forests and other models on #hadoop: http://blog.cloudera.com/blog/2013/02/how-to-resample-from-a-large-data-set-in-parallel-with-r-on-hadoop/ … via @laserson
I have never been prouder to be a part of the @cloudera team. Introducing our first data science course: http://blog.cloudera.com/blog/2012/10/data-science-training/ …
Excel files are the dark matter of data science.
Data Scientist @Cloudera, VP of Apache Crunch. I mostly tweet about #hadoop and postmodern lit. Yeah, I know.