funstaff/tika

tika包装器的库

dev-master / 1.x-dev 2016-03-09 12:06 UTC

This package is auto-updated.

Last update: 2024-08-29 03:52:44 UTC


README

主分支: 构建状态

Apache Tika二进制文件的PHP包装器

安装

要使用Composer安装tika,只需运行

$ php composer.phar require funstaff/tika dev-master

用法

<?php

use Funstaff\Tika\Configuration;
use Funstaff\Tika\Document;
use Funstaff\Tika\Wrapper;

$config = new Configuration('/path/to/tika.jar');
$config
    ->setOutputFormat('html')
    ->setOutputEncoding('UTF-8');

$wrapper = new Wrapper($config);
$wrapper
    ->addDocument(new Document('doc.pdf','/path/to/document.pdf'))
    ->addDocument(new Document('doc2.pdf','/path/to/document2.pdf'))
    ->execute();

/* Get All documents */
$documents = $wrapper->getDocument();

/* or only one document */
$document = $wrapper->getDocument('doc.pdf');

/* Get Document content */
$content = $document->getContent();

/* or raw content (output of Tika) */
$content = $document->getRawContent();

/* Get All Metadata for document (only on xml and html output format) */
$metadata = $document->getMetadata();

/* Get Value for metadata */
$author = $metadata->get('author');

致谢

感谢所有提供反馈和提交代码的用户 https://github.com/Funstaff/Tika

© Bertrand Zuchuat - Funstaff