Skip to content

thorunna/DIMErrorCorpus

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 

Repository files navigation

DIMErrorCorpus

The DIM error corpus is an Icelandic error corpus made from data from the Database of Icelandic Morphology (DIM). Search queries which return no results in the database are used for making the corpus, intended for both general use and for correcting future search queries in the database.

Automatic correction is done using ReynirCorrect and Skrambi and the 1000 most frequent search queries are manually corrected. The resulting corpus combines the manual correction, ReynirCorrect's correction and Skrambi's correction.

The error corpus is licensed under a Creative Commons Attribution 4.0 International License.

Format

The format of the corpus is based on columns. The first column displays the original search query and the second column displays its correction, if it is not currently found in DIM. The third column displays the error type.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published