wrseward / pdf-parser
PHP library for parsing text from PDF files
0.1.0
2015-09-01 00:54 UTC
Requires
- php: >=5.3.3
- symfony/process: ~2.7
Requires (Dev)
- mikey179/vfsstream: ~1.5
- phpspec/phpspec: ~2.2
This package is not auto-updated.
Last update: 2024-09-28 17:43:57 UTC
README
PHP library for parsing PDF files into text. A wrapper around pdftotext.
Installation
Via Composer
composer require wrseward/pdf-parser
pdftotext binary
Debian / Ubuntu
apt-get install poppler-utils
RedHat / CentOS
yum install poppler-utils
OS X
brew install xpdf
Verify the installation / get the path to the binary
which pdftotext
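The same check can be scripted so that a setup or deploy step reports clearly when the binary is missing. This is a minimal sketch, not part of the library: `find_binary` is a hypothetical helper name, and it uses the POSIX `command -v` lookup rather than `which`, which is not available on every system.

```shell
#!/bin/sh
# find_binary is a hypothetical helper (not provided by pdf-parser).
# It prints where a command resolves on PATH and returns non-zero
# when the command cannot be found.
find_binary() {
    command -v "$1" 2>/dev/null
}

if path=$(find_binary pdftotext); then
    # This is the value to pass to the parser's constructor.
    echo "pdftotext found at: $path"
else
    echo "pdftotext not found; install poppler-utils or xpdf first" >&2
fi
```

The printed path is what the usage example passes to the constructor in place of /usr/bin/pdftotext.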
Usage
$parser = new \Wrseward\PdfParser\Pdf\PdfToTextParser('/usr/bin/pdftotext');
$parser->parse('/path/to/file.pdf');
echo $parser->text();
Running tests
./vendor/bin/phpspec run