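I have recently been working on reconstructing scanned documents with OCR (PaddlePaddle layout recognition plus ReportLab to generate the reconstructed PDF). Laying out a PDF is the easier route for now, so my plan is to produce the PDF first and then convert it with pdf2docx. (Writing a layout step driven directly by the OCR results could probably be done as an extension of this project, but I don't have time for that yet.)

Looking at the PDF parsing, several lines can end up in a single paragraph. When that happens, the line height should be split evenly across the lines.

A concrete case that shows the problem: test_7.pdf

Converted with this logic:

With the line height split evenly:

Also, would it be possible to insert blank lines in between, so that the layout stays as close to the original as possible?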
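To make the two requests concrete, here is a minimal sketch of the arithmetic I have in mind. It is not pdf2docx code: the `Paragraph` class and both function names are hypothetical, and it assumes coordinates in points with y growing downward.

```python
from dataclasses import dataclass


@dataclass
class Paragraph:
    """Hypothetical stand-in for a parsed paragraph (not a pdf2docx type)."""
    top: float       # y of the paragraph's upper edge, in points
    bottom: float    # y of the paragraph's lower edge, in points
    line_count: int  # number of text lines merged into this paragraph


def even_line_height(par: Paragraph) -> float:
    """Split the paragraph's total height evenly across its lines."""
    if par.line_count < 1:
        raise ValueError("line_count must be at least 1")
    return (par.bottom - par.top) / par.line_count


def blank_lines_between(prev: Paragraph, nxt: Paragraph) -> int:
    """Estimate how many empty lines fit in the gap between two paragraphs."""
    gap = nxt.top - prev.bottom
    if gap <= 0:
        return 0
    # Assumption: the previous paragraph's per-line height is the unit
    # used to measure the gap.
    return int(gap // even_line_height(prev))


first = Paragraph(top=100.0, bottom=148.0, line_count=3)
second = Paragraph(top=182.0, bottom=198.0, line_count=1)

per_line = even_line_height(first)             # 48 pt over 3 lines
padding = blank_lines_between(first, second)   # 34 pt gap between the two
```

With these sample numbers each line of the first paragraph gets 16 pt, and the 34 pt gap leaves room for two blank lines. Whether rounding down is the right choice for the gap is open; it just keeps the result from overshooting the original page.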