Summary: Keep analyses organized with a directory that supports exploratory and production results (scripts, visualizations, and models) and keep data immutable. Use a tool like CookieCutter Data Science to automatically build a directory structure.
Summary: The best training sessions gather information on the audience before and after by using a survey. There are no surprises in the class because handouts detail everything that is taught. Lastly, analogies are used and students participate in order to cement the knowledge.
Summary: Kaggle competitors spend their time exploring the data, building training set samples to build their models on representative data, explore data leaks, and use tools like Python, R, XGBoost, and Multi-Level Models.
Summary: R offers a handful of packages to automate building models. rpart, randomForest, MASS, and forecast packages help you search through a hypothesis space. The caret package helps crawl through the hyper parameter space.
Summary: The passcode riddle asks for three three whole positive numbers with each one being equal to or larger than the next. Turns out there are only a handful of numbers this could possibly work for. Browsing YouTube one morning, I came across the video from TED-Ed and I was intrigued! I’ll be honest, I […]