YC Company Scraper

YC Company Scraper is a Python script that fetches information about Y Combinator companies from specified batches, extracts GitHub repository links from their websites, and prints the data.

Features

  • Retrieve company information from various Y Combinator batches
  • Scrape GitHub repository links from company websites
  • Print the extracted data
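
As a rough illustration of the link-scraping feature, the sketch below pulls GitHub repository URLs out of a page's HTML with a regular expression. The function name `extract_github_links` and the pattern are assumptions made for this example, not necessarily what yc_company_scraper.py does.

```python
import re

# Assumed pattern: https://github.com/<owner>/<repo>. Organisation-only
# links such as https://github.com/<owner> are deliberately not matched.
GITHUB_REPO_RE = re.compile(
    r"https?://github\.com/([A-Za-z0-9_.-]+)/([A-Za-z0-9_.-]+)"
)


def extract_github_links(html):
    """Return the sorted, de-duplicated repository URLs found in html."""
    links = set()
    for owner, repo in GITHUB_REPO_RE.findall(html):
        repo = repo.rstrip(".")        # drop sentence-ending punctuation
        if repo.endswith(".git"):      # normalise clone URLs
            repo = repo[:-4]
        if repo:
            links.add(f"https://github.com/{owner}/{repo}")
    return sorted(links)


sample_html = (
    '<a href="https://github.com/acme/widget">code</a> '
    '<a href="https://github.com/acme/widget.git">clone</a> '
    '<a href="https://github.com/acme">org page</a>'
)
print(extract_github_links(sample_html))
```

In practice the HTML would come from a call such as `requests.get(url, timeout=10).text`; the example uses an inline string so it runs without network access.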

Requirements

  • Python 3.6 or higher
  • requests library

Installation

  1. Clone this repository:
git clone git@github.com:dirkjbreeuwer/yc-scraper.git
cd yc-scraper
  2. Install the requests library:
pip install requests

Usage

  1. Open yc_company_scraper.py and modify the batches list to include the desired Y Combinator batch codes.

  2. Run the script:

python yc_company_scraper.py
  3. The script prints each company's name, website, YC URL, and any GitHub repository links found on its website.
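
To make the steps above more concrete, here is a minimal sketch of what the batches list and the printed output might look like. The batch codes, the dictionary keys, the example company, and the helper `format_company` are all hypothetical; check yc_company_scraper.py for the real names and output format.

```python
# Hypothetical batch codes; replace with the batches you want to fetch.
batches = ["W21", "S21"]


def format_company(company):
    """Build the printable report for one company (assumed dict layout)."""
    lines = [
        f"Name: {company['name']}",
        f"Website: {company['website']}",
        f"YC URL: {company['yc_url']}",
    ]
    github_links = company.get("github_links", [])
    if github_links:
        lines.extend(f"GitHub: {link}" for link in github_links)
    else:
        lines.append("GitHub: none found")
    return "\n".join(lines)


# Made-up record used only to show the output shape.
example_company = {
    "name": "Example Co",
    "website": "https://example.com",
    "yc_url": "https://www.ycombinator.com/companies/example-co",
    "github_links": ["https://github.com/example/example"],
}
print(format_company(example_company))
```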

Disclaimer

This script is for educational purposes only. Always respect the websites' terms of service and robots.txt when scraping data.
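
One way to honor robots.txt is to check each URL with Python's standard `urllib.robotparser` before requesting it. The sketch below parses an inline, made-up robots.txt so it runs offline; the user agent string `yc-company-scraper` and the rules are assumptions for illustration only.

```python
from urllib.robotparser import RobotFileParser

# Made-up robots.txt rules, supplied inline so no network access is needed.
robots_lines = [
    "User-agent: *",
    "Disallow: /private/",
]

parser = RobotFileParser()
parser.parse(robots_lines)

USER_AGENT = "yc-company-scraper"  # hypothetical user agent name


def is_allowed(url):
    """Return True if the parsed rules permit USER_AGENT to fetch url."""
    return parser.can_fetch(USER_AGENT, url)


print(is_allowed("https://example.com/"))
print(is_allowed("https://example.com/private/page"))
```

Against a live site, `parser.set_url("https://example.com/robots.txt")` followed by `parser.read()` would load the real rules instead of the inline list.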