Package: fuzzylink 0.4.2

fuzzylink: Probabilistic Record Linkage Using Pretrained Text Embeddings
Links datasets through fuzzy string matching using pretrained text embeddings. Produces more accurate record linkage when lexical string distance metrics are a poor guide to match quality (e.g., "Patricia" is more lexically similar to "Patrick" than it is to "Trish"). Capable of performing multilingual record linkage. Methods are described in Ornstein (2025) <doi:10.1017/pan.2025.10016>.
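To make the "Patricia" example concrete, here is a minimal sketch using base R's `utils::adist()` (Levenshtein edit distance). It does not call any fuzzylink function and is not the package's matching procedure; it only shows why a purely lexical metric picks the wrong candidate here.

```r
# Illustration only: base R edit distance, not a fuzzylink function.
target     <- "Patricia"
candidates <- c("Patrick", "Trish")

# adist() returns a matrix of Levenshtein distances (rows: x, columns: y);
# drop() turns the 1 x 2 matrix into a plain vector.
lexical_distance <- drop(utils::adist(target, candidates))
names(lexical_distance) <- candidates

# The candidate with the smallest edit distance is the "best" lexical match.
best_lexical_match <- names(which.min(lexical_distance))

print(lexical_distance)
print(best_lexical_match)
```

Edit distance ranks "Patrick" (2 edits) ahead of "Trish" (6 edits), even though "Trish" is the nickname for "Patricia". This is the kind of case where an embedding-based similarity is meant to help.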
Authors:
fuzzylink_0.4.2.tar.gz
fuzzylink_0.4.2.zip (r-4.7), fuzzylink_0.4.2.zip (r-4.6), fuzzylink_0.4.2.zip (r-4.5)
fuzzylink_0.4.2.tgz (r-4.6-any), fuzzylink_0.4.2.tgz (r-4.5-any)
fuzzylink_0.4.2.tar.gz (r-4.7-any), fuzzylink_0.4.2.tar.gz (r-4.6-any)
fuzzylink_0.4.2.tgz (r-4.6-emscripten)
manual.pdf | manual.html
card.svg | card.png
fuzzylink/json (API)
NEWS
| # Install 'fuzzylink' in R: |
| install.packages('fuzzylink', repos = c('https://joeornstein.r-universe.dev', 'https://cloud.r-project.org')) |
Bug tracker: https://github.com/joeornstein/fuzzylink/issues
Pkgdown/docs site: https://joeornstein.github.io
Last updated from: 1cec92e40b. Checks: 9 OK. Indexed: yes.
| Target | Result | Time |
|---|---|---|
| linux-devel-x86_64 | OK | 144 |
| source / vignettes | OK | 185 |
| linux-release-x86_64 | OK | 116 |
| macos-release-arm64 | OK | 115 |
| macos-oldrel-arm64 | OK | 134 |
| windows-devel | OK | 81 |
| windows-release | OK | 74 |
| windows-oldrel | OK | 77 |
| wasm-release | OK | 105 |
Exports: anthropic_api_key, check_match, dot, fuzzylink, get_embeddings, get_similarity_matrix, mistral_api_key, openai_api_key
Dependencies: askpass, cli, coro, curl, dplyr, ellmer, fastmap, generics, glue, httr, httr2, jsonlite, later, lattice, lifecycle, magrittr, Matrix, mime, openssl, otel, pillar, pkgconfig, plyr, promises, R6, ranger, rappdirs, Rcpp, RcppArmadillo, RcppEigen, RcppParallel, reshape2, Rfast, rlang, S7, stringdist, stringi, stringr, sys, tibble, tidyselect, utf8, vctrs, withr, zigg
Readme and manuals
Help Manual
| Help page | Topics |
|---|---|
| Install an ANTHROPIC API KEY in Your '.Renviron' File for Repeated Use | anthropic_api_key |
| Test whether two strings match with an LLM prompt. | check_match |
| Compute the dot product between two vectors | dot |
| Probabilistic Record Linkage Using Pretrained Text Embeddings | fuzzylink |
| Get pretrained text embeddings | get_embeddings |
| Create matrix of embedding similarities | get_similarity_matrix |
| Install a MISTRAL API KEY in Your '.Renviron' File for Repeated Use | mistral_api_key |
| Install an OPENAI API KEY in Your '.Renviron' File for Repeated Use | openai_api_key |
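
The help topics above mention dot products and a matrix of embedding similarities. As a rough sketch of that idea in base R only: when embedding vectors are scaled to unit length, their dot product equals their cosine similarity, and a matrix product gives all pairwise similarities at once. The vectors below are made-up toy values, and `normalize_rows()` and `dot_product()` are hypothetical helpers written for this example; none of this is the package's own implementation of `dot` or `get_similarity_matrix`.

```r
# Illustration only: toy 3-dimensional "embeddings" with made-up values.
# Real embeddings come from a pretrained model and have many more dimensions.
emb_A <- rbind(
  Patricia = c(0.9, 0.1, 0.4)
)
emb_B <- rbind(
  Patrick = c(0.1, 0.9, 0.4),
  Trish   = c(0.8, 0.2, 0.5)
)

# Hypothetical helper: scale each row to unit length so that a dot product
# between two rows equals their cosine similarity.
normalize_rows <- function(m) m / sqrt(rowSums(m^2))
emb_A <- normalize_rows(emb_A)
emb_B <- normalize_rows(emb_B)

# Hypothetical helper: dot product between two numeric vectors.
dot_product <- function(x, y) sum(x * y)

# Similarity matrix: rows index records in A, columns index records in B.
similarity <- emb_A %*% t(emb_B)

print(similarity)
```

Each cell of `similarity` is the dot product of one row of `emb_A` with one row of `emb_B`, so `similarity["Patricia", "Trish"]` matches `dot_product(emb_A["Patricia", ], emb_B["Trish", ])`.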
