Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Table Recognition Training - Cell Text Tokens in annotations #11185

Open
johntoma opened this issue Nov 3, 2023 · 1 comment
Open

Table Recognition Training - Cell Text Tokens in annotations #11185

johntoma opened this issue Nov 3, 2023 · 1 comment
Assignees
Labels

Comments

@johntoma
Copy link

johntoma commented Nov 3, 2023

Do these cell text tokens (highlighted in the image below) in the training data annotation have any affect on table recognition training?

image

Is it necessary to include these text values, or can I just leave it blank?

I am confused because the OCR for table recognition is done by a separate model, so I'm not sure if these tokens are important or not. I assume they are not actually necessary for training at all and have no affect on training results and are only there for human readability?

Many of the tables in my training data either don't include these tokens (the "tokens" array is simply left empty in the annotation) or have the incorrect tokens for that cell.

Do I need to review my dataset again and ensure these cell text tokens have the correct values, or am I correct in thinking that these token values are not important?

Thanks for your time.

Copy link
Contributor

This issue is stale because it has been open for 90 days with no activity.

@github-actions github-actions bot added the stale label Dec 31, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants