Keeping a Sharp Analytical Mind

Summary: To stay on top of your personal development, try learning something new, such as a programming language, an instrument, or an unfamiliar field (e.g. biology or accounting). Exposure to new ideas helps you avoid confirmation bias and increases your willingness to explore your analysis further.

Optimal Toilet Paper Placement

Test accuracy from using rpart in parallel foreach

Overview of Parallel Processing in R

Summary: The foreach package provides parallel operations for many packages (including randomForest). Packages like gbm and caret have parallelization built into their functions. Other tools like bigmemory and ff make large datasets manageable through memory management.
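
As a rough illustration of the foreach pattern mentioned above, here is a minimal sketch. It assumes the foreach and doParallel packages are installed; the two-worker cluster, the bootstrap task, and the variable names are illustrative choices, not taken from the post.

```r
library(foreach)
library(doParallel)

# Start a small local cluster; 2 workers is an arbitrary choice for this sketch.
cl <- makeCluster(2)
registerDoParallel(cl)

# Illustrative data: fuel economy from the built-in mtcars dataset.
mpg <- mtcars$mpg

# Each iteration draws a bootstrap resample of mpg and returns its mean.
# .combine = c collects the per-iteration results into one numeric vector.
boot_means <- foreach(i = 1:4, .combine = c) %dopar% {
  resample <- sample(mpg, replace = TRUE)
  mean(resample)
}

# Release the workers when done.
stopCluster(cl)
```

Swapping `%dopar%` for `%do%` runs the same loop sequentially, which can be handy for debugging before going parallel.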


DMA Analytics Challenge 2016 Recap

My friend, Josh Jacquet, and I competed in the DMA’s 2016 Analytics Challenge (powered by EY) and placed 4th out of the 50 entrants. Given that the majority of the other contestants were agencies vying for a little exposure, I think we did well.

Different Model Combinations for the DMA Analytics Challenge

Clean Code and The Pragmatic Programmer Book Covers

Book Review(s): Writing Quality Code

Summary: Clean Code is a clear guide to writing better software, and its advice also applies to data mining and SQL queries. The Pragmatic Programmer is a more detailed but slightly more dated book. Code Complete is a massive tome full of specific guidance.


Get US Census Data with R

Summary: The US Census provides an API that lets you query any of their datasets. The data include population by race, gender, age, and more, broken down by zip code, state, congressional district, and a few other geographies.
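
To give a sense of what a query looks like, here is a minimal sketch that builds a request URL in base R. The helper `build_census_url` is hypothetical (it is not from the post), and the example assumes the 2010 decennial summary file (`dec/sf1`) with the total-population variable `P001001`; other datasets use different paths and variable codes.

```r
# Hypothetical helper: assemble a Census API query URL from its parts.
build_census_url <- function(year, dataset, variables, geography, key = NULL) {
  url <- paste0(
    "https://api.census.gov/data/", year, "/", dataset,
    "?get=", paste(variables, collapse = ","),
    "&for=", geography
  )
  # An API key is optional for light use; append it only when supplied.
  if (!is.null(key)) url <- paste0(url, "&key=", key)
  url
}

# Total population and name for every state (assumed dataset and variable).
url <- build_census_url(2010, "dec/sf1", c("P001001", "NAME"), "state:*")

# Fetching needs network access and the jsonlite package, so it is left
# commented out here. The response is a JSON array whose first row holds
# the column names.
# result <- jsonlite::fromJSON(url)
```

Changing the `geography` argument (for example to `"county:*"`) is how the same query would be pointed at a different level of geography.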

Select Census Geographies include State, Zip Code, MSPA, Congressional District and More.