githzl/xpath

基于xpath的页面提取

v1.0.0 2018-08-30 08:56 UTC

This package is not auto-updated.

Last update: 2024-09-29 06:14:16 UTC


README

1.快速入门

{
    "require": {
        "githzl/xpath": "^1.0"
    }
}

2.使用演示

<?php

require 'vendor/autoload.php';

use GithzlXPath\Lib\XPath;

$html = file_get_contents('https://www.zhipin.com'); // 获取到html  
$xpath = new XPath($html);

echo $title = $xpath->getTitle();  // BOSS直聘-互联网招聘神器!

echo $link = $xpath->query('//*[@id="wrap"]/div[4]/dl/dd/a[1]')->getNodeValue(); // 慧聪网

echo $attr = $xpath->query('//*[@id="wrap"]/div[4]/dl/dd/a[1]')->getAttrValue('href'); // https://www.hc360.com

3.快速获取xpath查询的方法

1.谷歌浏览器安装扩展:XPath Helper

2.页面中打开控制台选择dom模块

3.选择要提取的数据右键copy->copy xpath

###如果对您有帮助或您学到了新知识,请点 Star