Clerk of Courts Deed Records Webscraping

This project was undertaken to get faster access to sale records of real estate transactions. These records typically reach public-facing GIS apps at least 30 days after they are recorded. To close that gap, a webscraping script written with Selenium automatically cycles through the deed records, filters the data, and loads the results into a data management system.
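
The scrape, filter, and load steps described above could look roughly like the sketch below. It is a minimal illustration, not the project's actual code: the URL pattern, the CSS selector, the column order, the "DEED" document type, and the table schema are all hypothetical, and an in-memory SQLite database stands in for PostgreSQL so the example runs without a server.

```python
import sqlite3
from datetime import date


def scrape_deeds(driver, base_url, record_date):
    """Collect raw result rows for one recording date.

    The query parameter, selector, and column order are hypothetical;
    the real site's markup is not described in this write-up.
    """
    # Imported lazily so the filter/load helpers work without Selenium installed.
    from selenium.webdriver.common.by import By

    driver.get(f"{base_url}?date={record_date.isoformat()}")
    rows = []
    for tr in driver.find_elements(By.CSS_SELECTOR, "table#results tbody tr"):
        cells = [td.text.strip() for td in tr.find_elements(By.TAG_NAME, "td")]
        if len(cells) >= 4:
            rows.append({
                "doc_type": cells[0],
                "grantor": cells[1],
                "grantee": cells[2],
                "amount": cells[3],
            })
    return rows


def filter_deeds(rows):
    """Keep only deed records and convert the sale amount to a number."""
    deeds = []
    for row in rows:
        if row["doc_type"].strip().upper() != "DEED":
            continue  # skip mortgages, liens, and other document types
        try:
            amount = float(row["amount"].replace("$", "").replace(",", ""))
        except ValueError:
            continue  # skip rows whose amount cannot be parsed
        deeds.append({**row, "amount": amount})
    return deeds


def load_deeds(conn, record_date, deeds):
    """Insert filtered deeds into a (hypothetical) deeds table; return the row count."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS deeds ("
        "record_date TEXT, grantor TEXT, grantee TEXT, amount REAL)"
    )
    conn.executemany(
        "INSERT INTO deeds (record_date, grantor, grantee, amount) VALUES (?, ?, ?, ?)",
        [(record_date.isoformat(), d["grantor"], d["grantee"], d["amount"]) for d in deeds],
    )
    conn.commit()
    return len(deeds)
```

In use, a Selenium driver would be passed to scrape_deeds once per recording date, and the rows it returns would go through filter_deeds and then load_deeds. Against PostgreSQL, the same load step would use a PostgreSQL driver's connection and its placeholder style instead of sqlite3.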

Project information

  • Category: Webscraping, Data Sourcing, ETL
  • Objective: To scrape the county Clerk of Courts website and load all of the recorded deeds for each date.
  • Project URL: Available upon request

Toolbox

Python, Selenium, PostgreSQL, PHP, HTML, CSS, JavaScript, Leaflet