darvinstudio / darvin-crawler-bundle
This bundle provides a console command that detects broken links on your website.
6.0.4
2021-04-19 05:59 UTC
README
Example configuration
# config/packages/dev/darvin_crawler.yaml
darvin_crawler:
    default_uri: https://example.com # Default value of command's "uri" argument
    blacklists:
        parse: # Content from URIs matching these regexes will not be parsed
            - '/\/filtered\//'
        visit: # URIs matching these regexes will not be visited
            - '/\/filtered\//'
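To check which URIs a blacklist entry would catch before running the crawler, the pattern can be tried out in isolation. The following is a minimal Python sketch, not the bundle's own code. It assumes the patterns are PHP-style regexes wrapped in "/" delimiters and that a URI is filtered when the pattern matches anywhere in it; `php_to_python` and `is_blacklisted` are hypothetical helper names introduced for illustration.

```python
import re

# Pattern copied from the example configuration (PHP-style, "/" delimiters).
PHP_PATTERN = r'/\/filtered\//'


def php_to_python(pattern: str) -> re.Pattern:
    """Strip the surrounding "/" delimiters; trailing flags are not handled in this sketch."""
    if len(pattern) < 2 or not (pattern.startswith('/') and pattern.endswith('/')):
        raise ValueError(f"expected a /.../ delimited pattern, got {pattern!r}")
    return re.compile(pattern[1:-1])


def is_blacklisted(uri: str, patterns: list[str]) -> bool:
    """Assumption: a URI is filtered if any pattern matches anywhere in it."""
    return any(php_to_python(p).search(uri) for p in patterns)


if __name__ == "__main__":
    for uri in (
        "https://example.com/filtered/page",
        "https://example.com/about",
        "https://example.com/filtered",
    ):
        print(uri, is_blacklisted(uri, [PHP_PATTERN]))
```

Under these assumptions, the example pattern only catches URIs that contain the path segment "/filtered/" including the trailing slash, so "https://example.com/filtered" would still be visited and parsed.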
Usage
Crawl the default URI
$ bin/console darvin:crawler:crawl
Crawl a specific URI
$ bin/console darvin:crawler:crawl https://example.com
Display all visited links
$ bin/console darvin:crawler:crawl https://example.com -v