Data Exploration with Python, Part 3
District Data Labs
by
5y ago
This is the third post in our Data Exploration with Python series. Before reading this post, make sure to check out Part 1 and Part 2! Preparing yourself and your data like we have done thus far in this series is essential to analyzing your data well. However, the most exciting part of Exploratory Data Analysis (EDA) is actually getting in there, exploring the data, and discovering insights. That's exactly what we are going to start doing in this post. We will begin with the cleaned and prepped vehicle fuel economy data set that we ended up with at the end of the last post. This version of ..read more
Visit website
Basics of Entity Resolution
District Data Labs
by
5y ago
Entity resolution (ER) is the task of disambiguating records that correspond to real world entities across and within datasets. The applications of entity resolution are tremendous, particularly for public sector and federal datasets related to health, transportation, finance, law enforcement, and antiterrorism. Unfortunately, the problems associated with entity resolution are equally big — as the volume and velocity of data grow, inference across networks and semantic relationships between entities becomes increasingly difficult. Data quality issues, schema variations, and idiosyncratic da ..read more
Visit website
Data Exploration with Python, Part 2
District Data Labs
by
5y ago
This is the second post in our Data Exploration with Python series. Before reading this post, make sure to check out Data Exploration with Python, Part 1! Mise en place (noun): In a professional kitchen, the disciplined organization and preparation of equipment and food before service begins. When performing exploratory data analysis (EDA), it is important to not only prepare yourself (the analyst) but to prepare your data as well. As we discussed in the previous post, a small amount of preparation will often save you a significant amount of time later on. So let's review where we should b ..read more
Visit website

Follow District Data Labs on FeedSpot

Continue with Google
Continue with Apple
OR