vladlen/html-parser

Serpstat 测试任务。递归获取域名中的图片链接。保存内部和外部链接。

dev-master 2020-04-24 17:59 UTC

This package is auto-updated.

Last update: 2024-09-25 04:16:36 UTC


README

Serpstat 测试任务。递归获取域名中的图片链接。保存内部和外部链接。

安装

添加到 composer.json
{
    "require": {
        "vladlen/html-parser": "dev-master"
    }
}
或者执行 composer 命令

composer require vladlen/html-parser

用法

namespace Test\Parser;
require_once (__DIR__."/vendor/autoload.php");

require ('htmlParser.php');
require ('filters.php');

$url = "https://google.com/";
$hrefFilter =  new HrefFilter($url);
$htmlCrawler = new HtmlCrawler();

$aHrefOption = new Option('a', null,    // null for no tag filter
    [new Option('href', $hrefFilter)]);

$parser = new HtmlParser($htmlCrawler, [$aHrefOption]);

$logAction = new UserCallActionParam("Test\\Parser\\_log");
$parser->onLog->add($logAction);

$parser->parse($url);

function _log($val)
{
    ...
}