codiliateur / text-similarity
基于三元组算法的短语相似度识别函数
v1.0.0
2023-07-17 08:32 UTC
Requires
- php: ^7.1|^8.0
- ext-mbstring: *
This package is not auto-updated.
Last update: 2024-09-24 12:16:41 UTC
README
此包提供基于三元组算法的短语相似度识别函数。
类 Codiliateur\TextSimilarity\Trigram
经典三元组算法的实现。
use Codiliateur\TextSimilarity\Trigram;
Trigram::similarity('one two tree', 'tree two one') // 1.0
Trigram::similarity('one two', 'one two two one') // 1.0
Trigram::similarity('6 cat eat 6 mouse', 'cat eat 6 mouse') // 1.0
Trigram::similarity('one two', 'tree two one') // 0.615385
类 Codiliateur\TextSimilarity\TrigramPlus
带有重复匹配三元组计数的自定义三元组算法实现。
use Codiliateur\TextSimilarity\TrigramPlus;
TrigramPlus::similarity('one two tree', 'tree two one') // 1.0
TrigramPlus::similarity('one two', 'one two two one') // 0.285714
Trigram::similarity('6 cat eat 6 mouse', 'cat eat 6 mouse') // 0.888889
TrigramPlus::similarity('one two', 'tree two one') // 0.666667