R: Fuzzy name match for variable size
I have been working on matching a source set of customer names against a master set. This can be achieved with adist in R, but my source set has 2 million names and my master set has 500k, and I can't use adist there because it does not support long vectors. I have chunked the data into smaller sets, for example 70k source names against 20k master names, but the sizes of the two sets vary, and adist doesn't seem to support sets of different sizes. I have tried various other ways to achieve the same thing, such as amatch, pmatch, and agrep, but they did not help. I have referred to these questions and couldn't find a solution:
- Super fuzzy name checking?
- Faster R code for fuzzy name matching using agrep() for multiple patterns...?
- R: string fuzzy matching using jarowinkler
- Fuzzy string matching in R
I have also tried levenshteinDist, levenshteinSim, and jarowinkler, but I have problems applying them to a huge data frame. Can anyone suggest a solution for data frames, similar to the linked solution using jarowinkler, that works with sets of different sizes?
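
For reference, here is a minimal sketch of the chunked approach described above. It assumes base R's adist() from utils, which is documented to return a matrix with one row per element of x and one column per element of y, so the two vectors may differ in length. The sample names, the match_in_chunks helper, and the chunk_size value are all hypothetical, and the sketch assumes neither vector contains NA.

```r
# Hypothetical example data: the two sets deliberately differ in length.
source_names <- c("Jon Smith", "Acme Corp.", "Mary-Anne Lee",
                  "Globex Inc", "J. Smith")
master_names <- c("John Smith", "ACME Corporation", "Mary Anne Lee")

# Match each source name to its nearest master name, one chunk at a time,
# so that no single distance matrix exceeds chunk_size x length(master).
# Hypothetical helper; assumes no NA values in either vector.
match_in_chunks <- function(source, master, chunk_size = 2L) {
  starts <- seq(1L, length(source), by = chunk_size)
  pieces <- lapply(starts, function(s) {
    idx <- s:min(s + chunk_size - 1L, length(source))
    # adist() gives a length(idx) x length(master) matrix of edit distances.
    d <- adist(source[idx], master, ignore.case = TRUE)
    # Column index of the smallest distance in each row.
    best <- apply(d, 1L, which.min)
    data.frame(
      source = source[idx],
      match = master[best],
      distance = d[cbind(seq_along(idx), best)],
      stringsAsFactors = FALSE
    )
  })
  do.call(rbind, pieces)
}

result <- match_in_chunks(source_names, master_names)
print(result)
```

With real data, chunk_size would need to be chosen so that chunk_size times the length of the master set fits comfortably in memory. Each chunk is independent, so the lapply call could in principle be swapped for a parallel equivalent.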