Technical Blog from our Data Science Team

Read the latest technical blogs from our data science team.

Showing posts written by Caterina Constantinescu

Back to all technical blogs»

Data guidelines: A set of recommendations for clean and usable data

The extent to which a dataset follows a set of commonly expected guidelines will often determine how much time you have left to spend thinking about your analysis. Ideally, you might intend to spend 20% of your time cleaning the data for a project, and 80% planning and carrying out your actual analysis. But often,

Read More »

LA maps of crime: Using R to map criminal activity in LA since 2010

I’ve recently come across data.gov — a huge resource for open data. At the time of writing, there are close to 17,000 freely available datasets stored there, including this one offered by the LAPD. Interestingly, this dataset includes almost 1.6M records of criminal activity occurring in LA since 2010 — all of them described according to a variety of measures (you can

Read More »