A scraper built to pull information from the Senate Judiciary Committee's website.
This tool moves from the homepage of the Judiciary Committee to:
- the meetings page
- each past "Nominations" meeting

From there, it:
- clicks through to and downloads the questionnaire form for each nominee presented at each nominations meeting (see the sketch after this list)
- reads and parses each PDF, checking for inconsistencies
- uses regex to pull key information from each PDF, including name, school, nominated position, pro bono work experience, and more
- pushes this information into a pandas DataFrame
- exports that information to a CSV
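
The download step might look something like the minimal sketch below, which stands in requests and Beautiful Soup for the full Beautiful Soup/Selenium flow; the base URL and the PDF-link selector are assumptions for illustration, not the committee site's actual markup:

```python
import os
import requests
from bs4 import BeautifulSoup

BASE_URL = "https://www.judiciary.senate.gov"  # committee homepage

def download_questionnaires(meeting_url, out_dir="questionnaires"):
    """Find questionnaire PDF links on a meeting page and save each file."""
    os.makedirs(out_dir, exist_ok=True)
    html = requests.get(meeting_url).text
    soup = BeautifulSoup(html, "html.parser")
    # Hypothetical selector: any anchor whose href ends in ".pdf"
    for link in soup.select('a[href$=".pdf"]'):
        pdf_url = requests.compat.urljoin(BASE_URL, link["href"])
        out_path = os.path.join(out_dir, pdf_url.rsplit("/", 1)[-1])
        with open(out_path, "wb") as f:
            f.write(requests.get(pdf_url).content)
```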
Using textual analysis, I've created a categorization system for the types of (required) pro bono work completed by those appointed to the federal courts (a sketch of this step follows the list below). Example categories include:
- Criminal Justice
- Child Protection
- Discrimination and Human Rights
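
Here is a minimal sketch of how such a categorization might work, assuming a simple keyword-matching approach; the keyword lists below are illustrative, not the ones actually used in the project:

```python
# Illustrative keyword lists mapping pro bono descriptions to categories
CATEGORY_KEYWORDS = {
    "Criminal Justice": ["criminal", "defendant", "sentencing", "habeas"],
    "Child Protection": ["child", "juvenile", "custody", "adoption"],
    "Discrimination and Human Rights": ["discrimination", "civil rights", "asylum"],
}

def categorize(description):
    """Return every category whose keywords appear in a pro bono description."""
    text = description.lower()
    matches = [cat for cat, words in CATEGORY_KEYWORDS.items()
               if any(word in text for word in words)]
    return matches or ["Other"]
```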
The final output of this scraping project is a GeoJSON choropleth map that displays pro bono work categories by district.
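
A hedged sketch of the mapping step using folium is below; the file names, column names, and GeoJSON property key are placeholders, not the project's actual artifacts:

```python
import folium
import pandas as pd

# Assumed inputs: a CSV of categorized nominees with a "district" column,
# and a GeoJSON file of federal judicial district boundaries
df = pd.read_csv("nominees.csv")
counts = df.groupby("district").size().reset_index(name="count")

m = folium.Map(location=[39.8, -98.6], zoom_start=4)  # centered on the US
folium.Choropleth(
    geo_data="districts.geojson",
    data=counts,
    columns=["district", "count"],
    key_on="feature.properties.district",  # assumed GeoJSON property name
    fill_color="YlGnBu",
).add_to(m)
m.save("pro_bono_map.html")
```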
The project is split into two notebooks:
- Downloading nominee questionnaires: Using a combination of Beautiful Soup and Selenium, this notebook downloads over 50 PDFs from the Senate Judiciary Committee website. I use pdfminer to parse the PDFs and regex to pull information from each into a DataFrame and CSV (a parsing sketch follows this list).
- Creating pro bono categories and mapping: Using textual analysis, I categorize types of pro bono work, ultimately mapping the categories using GeoJSON.
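
As a rough illustration of the parse-and-extract step, here is a sketch using pdfminer.six and regex; the field patterns below are assumptions for illustration, and real questionnaires would need more robust handling:

```python
import re
import glob
import pandas as pd
from pdfminer.high_level import extract_text

records = []
for path in glob.glob("questionnaires/*.pdf"):
    text = extract_text(path)
    # Hypothetical patterns for fields the questionnaires label explicitly
    name = re.search(r"Name.*?:\s*(.+)", text)
    position = re.search(r"Position.*?:\s*(.+)", text)
    records.append({
        "file": path,
        "name": name.group(1).strip() if name else None,
        "position": position.group(1).strip() if position else None,
    })

# Collect the extracted fields and export them to a CSV
df = pd.DataFrame(records)
df.to_csv("nominees.csv", index=False)
```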
If you have no interest at all in pro bono work, this code can still be used to:
- Scrape the Senate Judiciary Committee's website
- Download specific files
- Pool information about nominees