skyeng / php-lemmatizer
此包已被废弃,不再维护。未建议替代包。
PHP Lemmatizer 是一个词形还原库,用于从英语派生词中获取词干形式。
v1.0.3
2017-06-27 06:51 UTC
Requires
- php: >=5.6
Requires (Dev)
- phpunit/phpunit: ^5.7
This package is not auto-updated.
Last update: 2021-06-06 21:55:42 UTC
README
简述
处理英语词的词形还原。
负责人
- 代码所有者: Дмитрий Южаков
- 产品所有者: Сергей Сафонов
- 团队: Platform Core
PHP Lemmatizer 是一个 PHP 词形还原库,用于从英语的派生词中获取词干形式。
受 JavaScript Lemmatizer 启发,但返回值与其不同。
安装
使用 Composer
$ composer require skyeng/php-lemmatizer
{ "require": { "skyeng/php-lemmatizer": "^1.0" } }
用法
<?php use Skyeng\Lemmatizer; use Skyeng\Lemma; // Require Composer's autoloader require_once __DIR__ . "/vendor/autoload.php"; $lemmatizer = new Lemmatizer(); // retrieve a lemma with a part of speech. // you can assign Lemma::POS_VERB or Lemma::POS_NOUN or Lemma::POS_ADJECTIVE or // POS_ADVERB as a part of speech. $lemmas = $lemmatizer->getLemmas('desks', Lemma::POS_NOUN); // => [ new Lemma('desk', Lemma::POS_NOUN) ] // of course, available for irregular inflected form words. $lemmas = $lemmatizer->getLemmas('went', Lemma::POS_VERB); // => [ new Lemma('go', Lemma::POS_VERB) ] $lemmas = $lemmatizer->getLemmas('better', Lemma::POS_ADJECTIVE); // => [ new Lemma('better', Lemma::POS_ADJECTIVE), new Lemma('good', Lemma::POS_ADJECTIVE) ] // when multiple base forms are found, return all of them. $lemmas = $lemmatizer->getLemmas('leaves', Lemma::POS_NOUN); // => [ new Lemma('leave', Lemma::POS_NOUN), new Lemma('leaf', Lemma::POS_NOUN) ] // retrieve a lemma without a part of speech. $lemmas = $lemmatizer->getLemmas('sitting'); // => [ new Lemma('sit', Lemma::POS_VERB), new Lemma('sitting', Lemma::POS_ADJECTIVE) ] // retrieve only lemmas not including part of speeches in the returned value. $lemmas = $lemmatizer->getOnlyLemmas('desks', Lemma::POS_NOUN); // => [ 'desk' ] $lemmas = $lemmatizer->getOnlyLemmas('coded', Lemma::POS_VERB); // => [ 'code' ] $lemmas = $lemmatizer->getOnlyLemmas('leaves'); // => [ 'leave', 'leaf' ]
限制
// Lemmatizer leaves alone a word not included in it's dictionary index. $lemmas = $lemmatizer->getLemmas('MacBooks'); // => [ new Lemma('MacBooks', Lemma::POS_NOUN) ]
贡献
- 分支(https://github.com/skyeng/php-lemmatizer)
- 创建您的功能分支(git checkout -b my-new-feature)
- 提交您的更改(git commit -am 'Add some feature')
- 推送到分支(git push origin my-new-feature)
- 创建新的拉取请求