I walk through how to do matrix multiplication from scratch, starting with Python, and working up to einsum. I also test the computational speed differences between CPU and GPU on a 50k dataset.
This article walks you through how to use a bash script to scrape glassdoor and combine several company's reviews into one dataframe. In future articles I will walk through analyzing both the text and data.
In this article I walk you through how to recreate common SPSS outputs like descriptive statistics, crosstabs, and a correlation matrix. I also introduce you to a powerful package for quick EDA (Exploratory Data Analysis) called Pandas Profiling.
We recently collected data for the 2019 SIOP Machine Learning Competition and I had to write a quick script to ensure we were only including "high quality data". In this article I walk through the approach I took to cleaning out the low quality responses. As you will see I was able to do it in 15 lines of code.
In this article I briefly discuss why I think it's important for fellow social scientists to jump on the open-source programming train. I give 8 reasons why I think you should seriously consider learning an open source language (preferably Python!).
© N. Koenig 2016
Powered by Pelican