We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
现在RapidTable的表格识别能力已经足够优秀,但是对于合并单元格的表格,识别效果依旧有所欠缺,对于这些合并单元格的表格,TableStructureRec更具有优势,目前我会使用MinerU项目提取PDF,但我会关闭表格识别,因为即时使用RapidTable或其他两个模型,对于单元格合并的问题依旧有出现错误,然后我会单独对表格重新使用TableStructureRec项目中的wired_table_rec进行提取。
The text was updated successfully, but these errors were encountered:
想了解一下,有没有应对旋转表格的经验,如何识别并旋转归位后再进行表格提取呢?
Sorry, something went wrong.
补充:GOT OCR2.0效果也不错 但是只能将表格转为latex格式 另外 怎么关闭表格识别呢?
补充:GOT OCR2.0效果也不错 但是只能将表格转为 latex格式 另外 怎么关闭表格识别呢? minerU有个magic-pdf.json "table-config": { "model": "rapid_table", "enable": true, "max_time": 1400 }, 把enable改为False.
No branches or pull requests
现在RapidTable的表格识别能力已经足够优秀,但是对于合并单元格的表格,识别效果依旧有所欠缺,对于这些合并单元格的表格,TableStructureRec更具有优势,目前我会使用MinerU项目提取PDF,但我会关闭表格识别,因为即时使用RapidTable或其他两个模型,对于单元格合并的问题依旧有出现错误,然后我会单独对表格重新使用TableStructureRec项目中的wired_table_rec进行提取。
The text was updated successfully, but these errors were encountered: