opportus / web-scraper

一个Web爬虫,以URI列表和每个文档上要执行的XPath查询列表作为参数。返回一个DataInterface实例。

v2.0.0 2018-12-29 11:08 UTC

This package is auto-updated.

Last update: 2024-09-29 05:01:10 UTC


README

一个Web爬虫,以URI列表和每个文档上要执行的XPath查询列表作为参数。返回一个DataInterface实例。

安装

$ composer require opportus/web-scraper

使用

Use Opportus\WebScraper\WebScraper;

$uris = [
    'https://en.wikipedia.org/wiki/Web_scraping',
    'https://en.wikipedia.org/wiki/XPath',
];

$queries = [
    '//p[1][node()]',
    '//div[@id="mw-normal-catlinks"]/ul//li[node()]',
];

$scraper = new WebScraper();

$data = $scraper->scrap($uris, $queries); // @see https://github.com/opportus/web-scraper/blob/master/src/DataInterface.php