hokode/search-engine-crawler

A Composer package for crawling search engines

dev-master 2022-07-04 05:33 UTC

This package is not auto-updated.

Last update: 2024-09-23 16:26:48 UTC


README

This PHP package lets you crawl and parse Google search results. It does not use or require any Google API integration.

Installation

You can install search-engine-crawler with Composer.

$ composer require hokode/search-engine-crawler dev-master

Alternatively, add the package to your composer.json file and then run composer install:

    "require": {
        //...
        "hokode/search-engine-crawler": "dev-master"
    }

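View the package on Packagist.com

Simple example

A simple example that returns Google SERP (search engine results page) data as an array for the keywords "sports" and "news".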

<?php

// composer autoload
require_once __DIR__ . '/../vendor/autoload.php';


use hokode\SearchEngineCrawler\SearchEngine;

$client = new SearchEngine();

// Set the search engine; defaults to google.com
$client->setEngine('google.com');
// Set the search depth; defaults to 5
$client->setDepth(10);

// Search for the given keywords
$results = $client->search(['sports','news']);

print_r($results);

?>

Example response

The following is a snapshot of the returned array response (shortened for brevity).

Array
(
    [0] => Array
        (
            [title] => Sports News - The Indian Express
            [url] => https://indianexpress.com/section/sports/
            [rank] => 0
            [ad] => 0
            [keywords] => sports news
        )

    [1] => Array
        (
            [title] => NDTV Sports: Latest Sports News, Live Scores, Results Today's ...
            [url] => https://sports.ndtv.com/
            [rank] => 0
            [ad] => 0
            [keywords] => sports news
        )

    [2] => Array
        (
            [title] => Latest Cricket News, Live Match Scores & Sports ... - Times of India
            [url] => https://timesofindia.indiatimes.com/sports
            [rank] => 0
            [ad] => 0
            [keywords] => sports news
        )

    [3] => Array
        (
            [title] => Sky Sports - Sports News, Transfers, Scores | Watch Live Sport
            [url] => https://www.skysports.com/
            [rank] => 0
            [ad] => 0
            [keywords] => sports news
        )

    [4] => Array
        (
            [title] => Sports News, Latest Sports News, Sports News Headlines Today
            [url] => https://www.hindustantimes.com/sports
            [rank] => 0
            [ad] => 0
            [keywords] => sports news
        )

    [5] => Array
        (
            [title] => Sports News, Latest Sports Updates, Cricket World ... - India Today
            [url] => https://www.indiatoday.in/sports
            [rank] => 0
            [ad] => 0
            [keywords] => sports news
        )



)
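
Each row in the response carries the title, url, rank, ad and keywords fields, so the array can be post-processed with plain PHP. The sketch below is not part of the package: organicHosts() is a hypothetical helper, the two sample rows are copied from the response above, and treating a non-zero ad value as a paid result is an assumption.

```php
<?php

// Sample rows copied from the example response; in real use this would be
// the array returned by $client->search(...).
$results = [
    [
        'title'    => 'Sports News - The Indian Express',
        'url'      => 'https://indianexpress.com/section/sports/',
        'rank'     => 0,
        'ad'       => 0,
        'keywords' => 'sports news',
    ],
    [
        'title'    => 'Sky Sports - Sports News, Transfers, Scores | Watch Live Sport',
        'url'      => 'https://www.skysports.com/',
        'rank'     => 0,
        'ad'       => 0,
        'keywords' => 'sports news',
    ],
];

/**
 * Hypothetical helper (not part of the package): skip rows flagged as ads
 * and return the host name of each remaining result URL.
 */
function organicHosts(array $results): array
{
    $hosts = [];
    foreach ($results as $row) {
        // Assumption: a non-zero "ad" value marks a paid result.
        if (!empty($row['ad'])) {
            continue;
        }
        // parse_url() returns null or false when no host can be extracted.
        $host = parse_url($row['url'] ?? '', PHP_URL_HOST);
        if (is_string($host)) {
            $hosts[] = $host;
        }
    }
    return $hosts;
}

print_r(organicHosts($results));
```

The same loop can be adapted to group rows by keywords or to keep only the title and url fields.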

Supported search engines

google.* (where * can be ae, ke, com, and so on).
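
Because only google.* hosts are supported, it may be useful to check a host name before passing it to setEngine(). The sketch below is an assumption about what a reasonable check looks like; looksLikeGoogleHost() is a hypothetical helper, and the package's own validation rules, if any, are not documented here.

```php
<?php

/**
 * Hypothetical helper (not part of the package): check that a host name
 * looks like a Google domain such as google.com, google.ae or google.co.ke.
 */
function looksLikeGoogleHost(string $engine): bool
{
    // "google." followed by one alphabetic label, optionally followed by
    // a two-letter country label. \z anchors at the very end of the string.
    return preg_match('/^google\.[a-z]{2,}(\.[a-z]{2})?\z/i', $engine) === 1;
}

foreach (['google.com', 'google.ae', 'google.ke', 'bing.com'] as $engine) {
    echo $engine, ': ', looksLikeGoogleHost($engine) ? 'ok' : 'unsupported', PHP_EOL;
}
```

The pattern checks only the shape of the name; it does not confirm that the domain actually exists.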