darvinstudio/darvin-crawler-bundle

此包提供控制台命令,用于检测您网站上的断链。

6.0.4 2021-04-19 05:59 UTC

This package is auto-updated.

Last update: 2024-09-19 13:02:44 UTC


README

此包提供控制台命令,用于检测您网站上的断链。

示例配置

# config/packages/dev/darvin_crawler.yaml
darvin_crawler:
    default_uri: https://example.com # Default value of command's "uri" argument
    blacklists:
        parse: # Content from URIs matching these regexes will not be parsed
            - '/\/filtered\//'
        visit: # URIs matching these regexes will not be visited
            - '/\/filtered\//'

用法

爬取默认URI

$ bin/console darvin:crawler:crawl

爬取指定URI

$ bin/console darvin:crawler:crawl https://example.com

显示所有访问过的链接

$ bin/console darvin:crawler:crawl https://example.com -v