samclarke

samclarke/robots-parser

NodeJS robots.txt parser with support for wildcard (*) matching.

JavaScript
156
19
MIT License
Total donated
Undistributed
Share with your subscribers:

Recipients

How the donated funds are distributed

Support the dependencies of samclarke/robots-parser

Account's avatar
BDD/TDD assertion library for node.js and the browser. Test framework agnostic.
Account's avatar
simple, flexible, fun test framework
Account's avatar
the Istanbul command line interface

Support the repos that depend on samclarke/robots-parser

Account's avatar
Very straightforward, event driven web crawler. Features a flexible queue interface and a basic cache mechanism with extensible backend.
Account's avatar
A set of shared utilities that can be used by crawlers
Account's avatar
Distributed web crawler powered by Headless Chrome
Account's avatar
Node.js shared library for Franklin bulk operations
Account's avatar
Distributed web crawler powered by Headless Chrome
Account's avatar
Distributed web crawler powered by Headless Chrome
Account's avatar
Check the favicon of a website
Account's avatar
A web crawler. Supercrawler automatically crawls websites. Define custom handlers to parse content. Obeys robots.txt, rate limits and concurrency limits.
Account's avatar
robots.txt agent with cache.
Account's avatar
Distributed web crawler powered by Headless Chrome
Account's avatar
A web crawler written in TypeScript.
Account's avatar
Site report generator
Account's avatar
Fast asynchronous NodeJS module for crawling/scraping a web through worker_threads.
Account's avatar
Automated auditing, performance metrics, and best practices for the web. / Lighthouse 7.3.0
Account's avatar
Distributed web crawler powered by Headless Chrome
Account's avatar
Automated auditing, performance metrics, and best practices for the web.
Account's avatar
Structural analysis tools for complex web sites
Account's avatar
Simple sitemap generator for remote resources
Account's avatar
A node.js library to search, parse and fetch covers from https://www.thecoverproject.net/
Account's avatar
A simple web crawler
Account's avatar
fork from headless-chrome-crawler and update puppeteer to the latest version
Account's avatar
MCP servers exposed as a CLI
Account's avatar
Comprehensive website validation
Account's avatar
A Model Context Protocol server that provides web content fetching capabilities
Account's avatar
Rules related to checking for any SEO issues on the page given
Account's avatar
Very straightforward, event driven web crawler. Features a flexible queue interface and a basic cache mechanism with extensible backend.
Account's avatar
Automated auditing, performance metrics, and best practices for the web.

Top contributors

samclarke's profile
samclarke
77 contributions
dependabot[bot]'s profile
dependabot[bot]
5 contributions
brendankenny's profile
brendankenny
1 contributions
brendonboshell's profile
brendonboshell
1 contributions
danhab99's profile
danhab99
1 contributions
kdzwinel's profile
kdzwinel
1 contributions
SimonC-Audigent's profile
SimonC-Audigent
1 contributions

Recent events

Kivach works on the Obyte network, and therefore you can track all donations.

No events yet