Robots.txt

Java library for reading and querying robots.txt files.

Using the library in Java

  1. Parse robots.txt:
RobotsTxt robotsTxt = RobotsTxtReader.read(inputStream);
  2. Query robotsTxt:
Grant grant = robotsTxt.query("GoogleBot", "/path");
boolean canAccess = grant.getAllowed();
if (grant instanceof MatchedGrant) {
  Duration crawlDelay = ((MatchedGrant) grant).getMatchedRuleGroup().getCrawlDelay();
}
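
Step 1 assumes an `inputStream` is already available. Below is a minimal sketch of two ways to obtain one with only the JDK (Java 11+ assumed): from an in-memory string, which is handy in tests, and from a site's conventional `/robots.txt` location. The class `RobotsTxtSource` and its methods are hypothetical helpers for illustration and are not part of this library.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of the robots-txt library.
class RobotsTxtSource {

    // Wraps robots.txt content held in memory, e.g. for unit tests.
    static InputStream fromString(String content) {
        return new ByteArrayInputStream(content.getBytes(StandardCharsets.UTF_8));
    }

    // Resolves the conventional /robots.txt location from any URL on the same site.
    static URI robotsUri(URI anyPageOnSite) {
        return anyPageOnSite.resolve("/robots.txt");
    }

    // Downloads robots.txt. This sketch does not inspect the HTTP status code,
    // and the caller is responsible for closing the returned stream.
    static InputStream fromSite(URI anyPageOnSite) throws IOException, InterruptedException {
        HttpRequest request = HttpRequest.newBuilder(robotsUri(anyPageOnSite)).GET().build();
        HttpResponse<InputStream> response = HttpClient.newHttpClient()
                .send(request, HttpResponse.BodyHandlers.ofInputStream());
        return response.body();
    }
}
```

Either stream can then be passed to `RobotsTxtReader.read(...)` as in step 1. Real crawlers usually also need to decide what a missing or failing robots.txt means, which this sketch leaves out.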

Importing into your project

Maven

Add the JitPack repository to your pom.xml:

<repositories>
  <repository>
    <id>jitpack.io</id>
    <url>https://jitpack.io</url>
  </repository>
</repositories>

Add the dependency to your <dependencies> element, replacing the version placeholder with the latest release:

<dependencies>
  <dependency>
    <groupId>com.github.alturkovic</groupId>
    <artifactId>robots-txt</artifactId>
    <version>[insert latest version here]</version>
  </dependency>
</dependencies>