funstaff / tika
tika包装器的库
dev-master / 1.x-dev
2016-03-09 12:06 UTC
Requires
- php: >=5.3.2
- psr/log: >=1.0
- symfony/process: >=2.3
Requires (Dev)
- phpunit/phpunit: ^4.0
This package is auto-updated.
Last update: 2024-08-29 03:52:44 UTC
README
Apache Tika二进制文件的PHP包装器
安装
要使用Composer安装tika,只需运行
$ php composer.phar require funstaff/tika dev-master
用法
<?php use Funstaff\Tika\Configuration; use Funstaff\Tika\Document; use Funstaff\Tika\Wrapper; $config = new Configuration('/path/to/tika.jar'); $config ->setOutputFormat('html') ->setOutputEncoding('UTF-8'); $wrapper = new Wrapper($config); $wrapper ->addDocument(new Document('doc.pdf','/path/to/document.pdf')) ->addDocument(new Document('doc2.pdf','/path/to/document2.pdf')) ->execute(); /* Get All documents */ $documents = $wrapper->getDocument(); /* or only one document */ $document = $wrapper->getDocument('doc.pdf'); /* Get Document content */ $content = $document->getContent(); /* or raw content (output of Tika) */ $content = $document->getRawContent(); /* Get All Metadata for document (only on xml and html output format) */ $metadata = $document->getMetadata(); /* Get Value for metadata */ $author = $metadata->get('author');
致谢
感谢所有提供反馈和提交代码的用户 https://github.com/Funstaff/Tika。
© Bertrand Zuchuat - Funstaff