Skip to content

gwuhaolin/strsim

Folders and files

NameName
Last commit message
Last commit date

Latest commit

�?

History

5 Commits
�?
�?
�?
�?
�?
�?
�?
�?
�?
�?
�?
�?
�?
�?
�?
�?

Repository files navigation

字符串相似性计算

TFIDF

基于分词+统计概率

package main

import "github.com/gwuhaolin/strsim"

func main() {
	sim := strsim.NewTfidfCompare(`KastKing路亚竿套装全套远投枪柄水滴轮超硬碳�?杆抛竿海竿钓鱼竿`)
	textB := `绑好台钓成品方便线组套装全套鲫鱼钓鱼线钩子线夹八字环渔具用品`
	textC := `迪卡侬海竿路亚竿矶钓竿超轻超硬钓鱼竿手海两用竿长节CAP`
	s1, s2 := sim(textB), sim(textC)
	println(s1, s2)
}

分词基于gse,自定义分词词典:

Jaro

基于编辑距离

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages