This repository has been archived by the owner on Jan 3, 2023. It is now read-only.
The reconciler returns narrower values as possible matches (e.g. "Orientalizing" returns "Early Orientalizing"), but it does not return broader values (e.g. "Early Uruk" does not return "Uruk" as a possible match). This seems to be because the algorithm only considers candidates that contain at least every word in the original term, including parenthetical statements, spatial adjectives, etc. Thus "Jordanian Chalcolithic" will not return "Chalcolithic", and "Vikingatid (600-1000 AD)" will not return "Vikingatid". I realize this is related to efforts to keep result sets narrow enough to be useful, but it means that either the user has to carry out a major amount of data cleaning in advance, or we accept a lot of false negatives. I would favor broader recall that returns "Uruk" among the possible matches for "Early Uruk", or at least testing this out to see how confusing it makes things.
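To make the difference concrete, here is a minimal sketch of the two behaviors. It is not the reconciler's actual code: it assumes matching is done on lowercase word tokens, and the names `tokens`, `strict_matches`, `broad_matches`, and the `LABELS` list are hypothetical, made up for illustration.

```python
import re

def tokens(label):
    """Lowercase word tokens; parentheses and punctuation are dropped."""
    return set(re.findall(r"\w+", label.lower()))

def strict_matches(query, labels):
    """Candidates that contain every word of the query (the behavior I am seeing)."""
    q = tokens(query)
    return [label for label in labels if q <= tokens(label)]

def broad_matches(query, labels):
    """Also accept candidates whose own words all appear in the query."""
    q = tokens(query)
    results = []
    for label in labels:
        t = tokens(label)
        # Narrower candidate (query words are a subset) or broader candidate
        # (candidate words are a non-empty subset of the query words).
        if q <= t or (t and t <= q):
            results.append(label)
    return results

# Hypothetical label list, assumed only for this example.
LABELS = ["Uruk", "Early Uruk", "Early Orientalizing", "Chalcolithic", "Vikingatid"]
```

With this sketch, `strict_matches("Jordanian Chalcolithic", LABELS)` finds nothing, while `broad_matches("Early Uruk", LABELS)` includes both "Uruk" and "Early Uruk". The obvious cost is that the broader rule will also surface more loosely related candidates, which is the part worth testing.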
atomrab changed the title from "Broader/narrower matches (continues convo in #1)" to "Broader/narrower matches (continues convo in #2)" on Dec 18, 2017