dictionary comprehensiveness #5

Open
CallumBeaney opened this issue Oct 23, 2023 · 2 comments

@CallumBeaney
Member

CallumBeaney commented Oct 23, 2023

Assuming for now that Michigan's MED isn't going to provide access any time soon, our dictionary has a comprehensiveness problem (and the entry info it does have is of limited use). Taking Gawain as a difficult test case, many of its words are missing from our resource, and the resource's usefulness is fundamentally limited for as long as that stays true.

Here are other reputable & comprehensive dictionaries:

Now for that top one: if the available OCR/hOCR can't do it, I am willing to try my hand at manually plugging its entries into a JSON file (copyright permitting). It's only 700 pages long with around 40 definitions per page (roughly 28,000 entries), so it'd maybe take me 30 or 40 years if I put a movie or two on in the background, no biggie.

Unfortunately, the OCR is total ass.

@CallumBeaney
Member Author

RE: that first PDF, I think this is the best next step:

A Concise Dictionary of Middle English, 1888 as HTML

The same, but in the format we want

+A-doun+, _adv._ down, S, S2, C2, C3, G; +adun+, S; +adune+, S.--AS. _of
dúne_, off the hill. (+A-+ 3.)
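
To gauge how much of the plain-text version could be converted to JSON without manual typing, here is a rough sketch of a parser for one entry. The markup rules it relies on (`+...+` for headwords and variant spellings, `_..._` for italics, `--` before the etymology, a trailing parenthetical as a cross-reference) are assumptions inferred only from the single sample entry above, and the output field names are a hypothetical schema, not something the project has agreed on. Definitions and source sigla (S, S2, C2, ...) are left unparsed in the `raw` field.

```python
import json
import re

# Assumed markup, inferred from the one sample entry only:
#   +word+  headword or variant spelling
#   _text_  italics (part of speech, cited forms)
#   --      separates the entry body from the etymology note
BOLD = re.compile(r"\+([^+]+)\+")
ITALIC = re.compile(r"_([^_]+)_")
TRAILING_PAREN = re.compile(r"\s*\(([^()]*)\)\s*$")

SAMPLE = """+A-doun+, _adv._ down, S, S2, C2, C3, G; +adun+, S; +adune+, S.--AS. _of
dúne_, off the hill. (+A-+ 3.)"""


def parse_entry(raw: str) -> dict:
    """Split one plain-text entry into a JSON-friendly dict (hypothetical schema)."""
    # Rejoin lines that the plain-text transcription wrapped mid-entry.
    text = " ".join(raw.split())

    # Peel off a trailing cross-reference such as "(+A-+ 3.)" first,
    # so its bolded prefix is not mistaken for a variant spelling.
    cross_ref = None
    match = TRAILING_PAREN.search(text)
    if match:
        cross_ref = BOLD.sub(r"\1", match.group(1)).strip()
        text = text[: match.start()]

    # Everything before the first "--" is the body; the rest is etymology.
    body, _, etymology = text.partition("--")
    forms = BOLD.findall(body)
    pos = ITALIC.search(body)

    return {
        "headword": forms[0] if forms else None,
        "variants": forms[1:],
        "pos": pos.group(1) if pos else None,
        "etymology": ITALIC.sub(r"\1", etymology).strip() or None,
        "cross_ref": cross_ref,
        "raw": text,  # body + etymology kept verbatim for later passes
    }


if __name__ == "__main__":
    print(json.dumps(parse_entry(SAMPLE), ensure_ascii=False, indent=2))
```

Entries that deviate from the sample (multiple senses, a `--` inside the body, nested parentheses) would need extra handling, so this is only a starting point for checking how regular the text really is.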

@CallumBeaney
Member Author

@alexobviously RE: the 2nd dictionary, see my 2nd comment.
