Matrix Multiplication from scratch in Python

I walk through matrix multiplication from scratch, starting with Python and working up to einsum. I also compare computation speed on CPU and GPU using a 50k dataset.

Category: data science

Scraping Glassdoor For Company Reviews

This article walks you through using a bash script to scrape Glassdoor and combine several companies' reviews into one dataframe. In future articles I will walk through analyzing both the text and the data.

Category: programming

Examining Data in Python via Pandas

In this article I walk you through how to recreate common SPSS outputs like descriptive statistics, crosstabs, and a correlation matrix. I also introduce you to a powerful package for quick EDA (Exploratory Data Analysis) called Pandas Profiling.

Category: data science

Data Wrangling Survey Data in Python

We recently collected data for the 2019 SIOP Machine Learning Competition, and I had to write a quick script to ensure we only included "high-quality data". In this article I walk through the approach I took to cleaning out the low-quality responses. As you will see, I was able to do it in 15 lines of code.

Category: data science

Why should you care about learning Python or R?

In this article I briefly discuss why I think it's important for fellow social scientists to jump on the open-source programming train. I give 8 reasons why you should seriously consider learning an open-source language (preferably Python!).

Category: programming

© N. Koenig 2016