website-inspector

Synopsis

A project consists of 2 main Python scripts which are used to monitor websites availability. The monitor.py periodically checks the state of configured websites (using their URLs, but also has support for a regexp pattern should to be found in the page) and sends the testing results to specified Kafka broker. The db_writer.py receives the results from Kafka and writes them to a PostgreSQL database, as the name suggests.

Standalone or in Containers?

The scripts can be run directly on a host's system or through Docker containers. The project contains a script to build and start necessary containers.

What does Website Inspector actually monitors?

It checks the following:

a response time,
an error code returned (HTTP specific result code or 700 value when it could not connect),
whether the regexp pattern is found on the page (if not, the error code 701 will indicate it).

What goes to a database?

The monitoring results are saved to a PostgreSQL database. 2 tables are used for that purpose (in one-to-many relationship):

SITES (a lookup table containing information about websites checked by the monitor along with a DATEADDED field indicating when was the record added for the first time),
CHECKS (stored check results, including metric described earlier and the check-up time).

Configuration

You must configure the parameters of Kafka broker and a PostgreSQL server, obviously. For that, just update the contents of configs/general.ini configuration file. When needed, please make sure to update additional security files needed by Kafka. I have included the configuration to my servers run in the cloud, configured and launched in seconds thanks to Aiven solutions. Second step is to configure monitored websites through configs/monitored-sites.conf file. The provided file will cause the monitor to check on "olesno.pl" and "pichen.com" websites (the latter also checks for a regexp pattern).

Important note: Due its changeable nature, the configs directory is not added directly to Docker image, instead it is shared with the host file system upon containers start.

Initial setup

Make sure to have Docker installed and running on your Linux and you have access to it. It is super easy to set-up the whole thing using Docker containers.

Go to directory where you cloned the GIT repository.
Run this script first (make sure it has the executable flag set): $ ./create_docker_image_and_test.sh It will build the Docker image and immediately test it using a few test cases.
Create required DB structure in the database (WARNING: it will wipe out existing SITES and CHECKS tables along with their records): $ ./init_db.sh

Running

Whenever you want to start both the monitor and DB writer just use the following script, which will start 2 separate containers:

$ ./start_containers.sh

Use docker ps to see if the containers are running (there should be 2, ie.: wi-monitor and wi-dbwriter).

Use docker logs to see the message reported by scripts.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
configs		configs
libs		libs
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
create_docker_image_and_test.sh		create_docker_image_and_test.sh
db_writer.py		db_writer.py
init_db.py		init_db.py
init_db.sh		init_db.sh
monitor.py		monitor.py
start_containers.sh		start_containers.sh
tests.py		tests.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

website-inspector

Synopsis

Standalone or in Containers?

What does Website Inspector actually monitors?

What goes to a database?

Configuration

Initial setup

Running

About

Releases

Packages

Languages

License

dawidpichen/website-inspector

Folders and files

Latest commit

History

Repository files navigation

website-inspector

Synopsis

Standalone or in Containers?

What does Website Inspector actually monitors?

What goes to a database?

Configuration

Initial setup

Running

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages