Releases: PaddlePaddle/PaddleOCR
Releases · PaddlePaddle/PaddleOCR
v2.9.1
v2.9.0
What's Changed
- fix: table recognition content is not escaped properly by @GreatV in #13277
- fix bug when layout_predictor is None by @GreatV in #13279
- add url in pyproject, and update version number by @GreatV in #13274
- unifying data types in the SLAHead by @GreatV in #13276
- add PaddleX info to README by @TingquanGao in #13308
- Update expired link in quickstart.md by @ZeddYu in #13253
- optimize func: get_infer_gpuid by @GreatV in #13275
- fix slice op parameters not being passed correctly by @GreatV in #13319
- Solve ModuleNotFoundError: No module named 'tools.infer' by @myhloli in #13348
- Add hardware docs by @nepeplwu in #13329
- add paddlex link by @TingquanGao in #13316
- Fix the dictionary bug in tablerec inference by @Topdu in #13362
- add bn_dict.txt by @taeefnajib in #13373
- add missing docstring in paddleocr.py using copilot by @jzhang533 in #13344
- line 445 program.py by @ManikSinghSarmaal in #13389
- fix layout recovery import error by @GreatV in #13434
- Latexocr paddle by @liuhongen1234567 in #13401
- [doc]add amp train notes for detection train by @andyjiang1116 in #13481
- remove some of the less common dependencies by @GreatV in #13461
- docs: Add a new document site by @SWHL in #13375
- Update mkdocs.yml by @GreatV in #13487
- chore: Update issue template by @SWHL in #13505
- chore: Update bug report template by @SWHL in #13508
- Fix cpp_infer "--enable_mkldnn=false" not effective by @hiroi-sora in #13539
- Enable Main Branch Support for PaddleX by @zhangyubo0722 in #13523
- docs: Update README by @SWHL in #13543
- docs: Update README_en by @SWHL in #13545
- 修改错别字 by @MonkeyBrothers in #13544
- docs: Remove old applications docs by @SWHL in #13551
- fix: 'numpy' has no attribute 'astype' by @laolitou in #13554
- add latexocr docs and fix some typos by @GreatV in #13532
- chore(Issue_template): Add validation of Environment and MPE code by @SWHL in #13559
- skip text files when running test ci by @GreatV in #13561
- fix bug for paddlepaddle3.0 by @changdazhou in #13568
- docs: Update the pdf file path in the operation demonstration by @Gmgge in #13575
- support benchmark for paddlepaddle3.0 by @changdazhou in #13574
- improve the reading experience of some documents by @GreatV in #13562
- update dive into OCR book link by @GreatV in #13581
- docs: Shorten the image path and remove dupliate images by @SWHL in #13585
- docs: Fix docs errors by @SWHL in #13588
- skip text files when running test ci on push by @GreatV in #13582
- docs: Add android_demo docs by @SWHL in #13601
- fix download bug when use multi gpus by @changdazhou in #13610
- disable automatic checks for new version albumentations by @GreatV in #13583
- 修复LaTeXOCR 在paddleX中的一些问题 by @liuhongen1234567 in #13646
- update docs and remove out-of-date event by @GreatV in #13660
- setuptools 72.2.0 result in that MANIFEST.in is invalid by @TingquanGao in #13670
- update docs and remove old docs by @GreatV in #13662
- update docs and fix markdown render error by @GreatV in #13678
- chore: Update issue template by @SWHL in #13679
- cache Python dependencies and PaddleOCR files by @GreatV in #13682
- Add files via upload by @lingskr in #13685
- Update ch_PP-OCRv4_rec_distillation.yml by @jiqirenfeile in #13692
- Remove channel links from docs by @zhangyubo0722 in #13674
- Code Style Unification by @zhangyubo0722 in #13697
- docs: Remove doc/datasets directory and fix docs/datasets documents by @SWHL in #13700
- Provides Vietnamese dictionary and corpus by @lingskr in #13698
- Modify the data processing part of LaTeXOCR and replace the absolute path by a relative path by @liuhongen1234567 in #13702
- use setuptools-scm extracts PaddleOCR versions by @GreatV in #13716
- Repair the bug in the inference script for LaTeX OCR by @liuhongen1234567 in #13750
- fixed: mkldnn -> onednn by @achieve-dream1221 in #13757
- remove unused enumerate by @Kayzwer in #13760
- update applications/overview.md by @GreatV in #13763
- Fix setting of make border epoch by @Sunting78 in #13783
- Fix doc link in docs by @Topdu in #13792
- Add support for Hebrew Language and Alphabet by @johnlockejrr in #13797
- Add Syriac script support by @johnlockejrr in #13800
- update KIE docs by @GreatV in #13799
- fix the CI running errors in tests. by @GreatV in #13846
- Fix pir dy2st train by @0x45f in #13853
- fix SRN algorithm infer error by @GreatV in #13851
- update pretrain for benchmark by @changdazhou in #13820
- fix bugs for SLANet infer by @liu-jiaxuan in #13861
- fix version by @TingquanGao in #13895
- set --image_dir to be required by @GreatV in #13896
- support export after save model by @zhangyubo0722 in #13844
- fix hubserving run error by @GreatV in #13918
- fix lateocr bug by @zhangyubo0722 in #13920
- 1.在ppstructure管道中添加latex_ocr公式识别功能;2.添加pdf转markdown文件功能 by @ztyf-lq in #13868
- updata 2.9, adding new models and supporting all-in-one full developm… by @dyning in #13932
- updata 2.9, adding new models and supporting all-in-one full developm… by @dyning in #13933
- adding new models and supporting all-in-one full development tools by @dyning in #13934
- Update quick_start.md with html, not md by @dyning in #13935
- Update quick_start.md for paddlex by @dyning in #13936
- pdf to markdown document by @ztyf-lq in #13942
- Update algorithm_rec_vitstr_en.md by @GreatV in #13947
- update a live promotion by @Zhiiixin in #13954
- ci: Support multi version docs by @SWHL in #13957
- docs: Add tip of old documents by @SWHL in #13960
- ci: Fix mike error by @SWHL in #13962
- Update README.md, fixed broken quick start link by @Kozmosa in #13965
- fix broken link by @GreatV in #13970
- [NPU] cherrypick13983 by @Wang...
v2.8.1
v2.8.0
What's Changed
- [Cherry-pick] #10515 by @ToddBear in #10537
- [BugFix]compat_pillow by @shiyutang in #10596
- [bug fix] fix none res in recovery by @andyjiang1116 in #10603
- Fix seed passing issue of build_dataloader by @RuohengMa in #10614
- [bug fix]rm invalid params by @andyjiang1116 in #10605
- [Cherry-pick] #10441 #10512 by @moehuster in #10593
- 修改数据增强导致的DSR报错 by @xu-peng-7 in #10662
- onnxruntime support gpu by @WenmuZhou in #10668
- Update VQA to use the updated LayoutLM syntax from PaddleNLP by @sijunhe in #9791
- 实现功能:当--savefile为true时,在--output下以当前图片名称后接“.txt”为文件名保存ocr推理结果,解决了issues: by @WilliamQf-AI in #10628
- Cherrypicking GH-10217 and GH-10216 to PaddlePaddle:dygraph by @UserUnknownFactor in #10654
- fix numpy speed by @wanghuancoder in #10773
- Cherrypicking GH-10251 & GH-10181 to PaddleOCR:dygraph by @itasli in #10710
- rec_r45_abinet.yml add max_length and image_size by @xlg-go in #10744
- ch_PP-OCRv4_rec_distill.yml, fix KeyError: 'NRTRLabelDecode' by @xlg-go in #10761
- 根据推理对三通道的图像需求,以及opencv中imread参数说明IMREAD_COLOR(If set, always convert … by @Gmgge in #10777
- Update algorithm_kie_vi_layoutxlm_en.md by @sagarjgb in #10736
- Add new recognition method "ParseQ" by @ToddBear in #10836
- rm fluid for paddle dev by @tink2123 in #10931
- rec_r45_abinet for export model by @xlg-go in #10892
- fix:修复通道数不匹配造成的PPOCRLabel启动失败问题#10748,根据更新日志发现#10655,由于paddleocr中增加了对… by @Gmgge in #10847
- [New] add rec CPPD model by @Topdu in #10990
- fix
cls_x
andbbox_x
is possibly unbound by @SigureMo in #10991 - add svtr large model by @zhangyubo0722 in #10937
- [WIP]support eval pre epoch by @zhangyubo0722 in #11003
- Update kie_datasets_en.md by @sagarjgb in #10735
- fix import collection for py310 by @tink2123 in #11012
- update ppocrv4_framework by @tink2123 in #11048
- Update how_to_do_kie_en.md by @sagarjgb in #10731
- add cppd u14m train model and doc by @Topdu in #11052
- Fixed bug with "max_text_length" for VisionLAN by @victor30608 in #11025
- Cherrypicking GH-10923 to PaddleOCR:dygraph by @itasli in #11069
- Update quickstart_en.md by @sagarjgb in #10732
- Update README.md by @sagarjgb in #10733
- Update algorithm_overview_en.md by @sagarjgb in #10734
- [Cherry-pick] Cherry-pick from release/2.6 by @shiyutang in #11092
- [TIPC]update tipc scripts by @USTCKAY in #11097
- fix satrn export for paddle2.5 by @tink2123 in #11096
- [BugFix]Fix parseq net by @shiyutang in #11126
- update uygur dict by @hfengzhi in #11125
- Add tipc for "ParseQ" method by @ToddBear in #10843
- fix SAR inference, when batch size>1, norm_img_batch and valid_ratios… by @shiyunalex in #11238
- v4 det cml configs by @sylarwcy in #11258
- 解決recognition的train test分割程式執行後的文檔每行間多出一行空格 by @DingHsun in #11280
- Fix for Ambiguous Boolean Evaluation Error in PaddleOCR with Python 3.11 by @muhammadAgfian96 in #11287
- Dygraph【benchmark】add max_mem_reserved for benchmark by @mmglove in #11284
- Fix bug when running on XPU by @RuohengMa in #11299
- Dygraph by @RuohengMa in #11301
- Dygraph fix max_mem_reserved for benchmark by @mmglove in #11341
- 在check_gpu时增加对当前环境可用设备的检查 by @TracebaK in #11293
- Fixed some bugs that caused PPOCRLabel to crash, added ability to expand checkboxes by @g39088902 in #11236
- fix a bug for rec_postprocess.py by @Ataraxy33 in #11389
- Optimize prediction on long image and deduplicate similar boxes with multiple lables by @marswen in #11366
- doc: add doc for satrn by @wkml in #11397
- Update zeros' comment in rec_abinet_head.py by @YesianRohn in #11374
- Fix QPointF IndexError: list index out of range by @firmament2008 in #11393
- update paddlex of readme by @zhangyubo0722 in #11422
- chore: add notes for docker gpu deploy PP-OCRv4 by @sheiy in #11390
- Fix words by @co63oc in #11448
- [Feature]Complete the ppocrv4_act by @ranchongzhi in #11345
- rm QR code in the document by @tink2123 in #11512
- rm QR code by @tink2123 in #11532
- Fix dead links by @MatKollar in #11520
- cherry-pick for lazy import pymupdf and pre-commit by @tink2123 in #11692
- adapter new type promotion rule for Paddle 2.6 by @zxcd in #11698
- setup a workflow for publishing package to pypi by @jzhang533 in #11804
- update link mentioned at #11763 by @jzhang533 in #11764
- fix AttributeError by @GreatV in #11686
- fix: Correct misuse of
try_import
frompaddle.utils
by @neteroster in #11820 - Update quickstart.md for a better python pdf demo by @qwedc001 in #11927
- Update quickstart_en.md by @qwedc001 in #11934
- Enhance the OCR recognition accuracy of PPStructure. by @RussellLuo in #11916
- add u14m results of cppd by @Topdu in #11943
- use tensor.shape bug not paddle.shape(tensor) by @wanghuancoder in #11919
- add pre-commit workflow by @GreatV in #11973
- docs: Update FAQ.md, delete repeated question by @xu8117 in #11972
- Fix the bug where Python scripts fail to execute PDF text recognition… by @guangyunms in #11994
- 【OCR Issue No.9】以可选形式支持Visualdl by @Liyulingyue in #11947
- fix weird version info by @GreatV in #12003
- 【OCR Issue No.9】移除明确不适合放在ppocr依赖中的依赖项 by @Liyulingyue in #11946
- Burmese Language dict and corpus by @1chimaruGin in #12020
- 面版识别添加onnx支持完善 by @heweisheng in #12068
- Update README.md by @dyning in #12086
- fix readme codestyle by @GreatV in #12095
- fix wrong link for 通用OCR in README.txt by @tackhwa in #12100
- move PPOCRLabel to PFCCLab/PPOCRLabel by @GreatV in #12104
- move StyleText to PFCCLab/StyleText by @GreatV in #12121
- openocr compti code by @Topdu in #12033
- table rec code by @invictuszhao in #11999
- Error with pyclipper inhomogeneous expanded array by @zovelsanj in #12108
- 【OCR Issue No.2】修复训练过程中找不到對應模型和训练时计算精度报错 by @mattheliu in https://github.com/PaddlePaddle/Paddle...
PaddleOCRv2.7.5
fix broken v2.7.4
PaddleOCRv2.7.4
This release contains the missed commits from v2.7.0 to v2.7.1.
fixed : #11824
PaddleOCRv2.7.3
What's Changed
fixed #11808
PaddleOCRv2.7.2
What's Changed
- add finnish language files by @savikko in #10850
- fix
cls_x
andbbox_x
is possibly unbound by @SigureMo in #10973 - update ppocrv4_framework by @tink2123 in #11047
- Update ONNX conversion readme_ch.md by @greyovo in #11030
- [TIPC]update tipc scripts and rm fluid api by @USTCKAY in #11098
- fix a bug for rec_postprocess.py by @Ataraxy33 in #11408
- Modify readme 27 by @zhangyubo0722 in #11424
- fix: layout recovery image:xxx.png,err msg: list index out of range by @santlchogva in #11405
- rm QR code in the document by @tink2123 in #11511
- rm QR code by @tink2123 in #11533
- Update custom.md by @jzhang533 in #11636
- fix AttributeError by @GreatV in #11556
- update pre-commit config by @jzhang533 in #11682
- lazy import PyMuPDF by @jzhang533 in #11685
- setup a workflow for publishing package to pypi, and bump version to … by @jzhang533 in #11800
New Contributors
- @savikko made their first contribution in #10850
- @greyovo made their first contribution in #11030
- @santlchogva made their first contribution in #11405
- @jzhang533 made their first contribution in #11636
Full Changelog: v2.7.0...v2.7.2
PaddleOCRv2.7.1
New Projects
- Add Parseq recognition model.(#10836)
- Add text recognition function to return single character coordinates.(#10515)
New Features
- Add savefile option to save OCR output results.(#10628)
- Add more data preprocessing options to ppocr.py.(#10217)
- A single damaged image does not affect data set inference. (#10216)
- Compatible with fitz version. (#10181)
- Compatible with Pillow10.0 upgrade. (#10405)
- Add Finnish dictionary file. (#10850)
- Onnxruntime supports GPU. (#10668)
- TIPC supports XPU and NPU. (#10658, #10460)
- Add inference on mlu devices. (#10249)
BugFix
- Fixed cannot find the library error when packaged into exe on windows. (#10502)
- Fixed the bug of recognize page is affected by the maximum number of PDF files when recognize multiple PDF files. (#10290)
- Fixed the problem of PPOCRLabel startup failure caused by mismatch in channel number. (#10847)
- Fix memory leak problem of cpp inference. (#10441)
- Modify DSR error caused by data enhancement. (#10662)
- Fix training seed problem. (#10614)
- Fix table_master tipc error. (#10514)
- Fixed the problem of error reporting when ppocr.py uses wandb. (#10251)
- Fix memory leak in predict_rec.py. (#10688)
- Fixed the issue where dis and iou cannot be calculated correctly due to the index error of structure_boxes in the PaddleStructure::rebuild_table function. (#10810)
- Compatible with paddle 2.5 fluid exit. (#10391)
- Fix the performance problem of Tensor.numpy under stride. (#10773)
- Adapt the size of ABINet during export to the size of ABINetRecResizeImg. (#10892)
- ABINet training error. (#10744)
- Fix KeyError in ch_PP-OCRv4_rec_distill.yml. (#10761)
Documentations Fix
- Fix algorithm_kie_vi_layoutxlm_en.md, kie_datasets_en.md, README.md, algorithm_overview.md, how_to_do_kie_en.md document issues. (#10717)
- Update documentation issues with setup.py. (#10749)
- Add pyyaml library in requirements.txt. (#10653)
New Projects
New Features
- 增加savefile选项,保存ocr输出结果。(#10628)
- 增加ppocr.py 更多数据预处理选项。(#10217)
- 单张破损图片不影响整体数据集推理。(#10216)
- 兼容fitz版本。(#10181)
- 兼容Pillow10.0升级。(#10405)
- 增加芬兰语字典文件。(#10850)
- Onnxruntime 支持GPU。(#10668)
- TIPC支持XPU、NPU。(#10658,#10460)
- 增加在mlu设备上的推理。(#10249)
BugFix
- 修复windows打包成exe找不到库的问题。(#10502)
- 修复多次识别pdf,受第一次页面最大数量影响的bug。(#10290)
- 修复通道数不匹配造成的PPOCRLabel启动失败问题。(#10847)
- 修复cpp推理的内存泄漏问题。(#10441)
- 修改数据增强导致的DSR报错。(#10662)
- 修复训练seed传递问题。( #10614)
- 修复 table_master tipc 报错。(#10514)
- 修复ppocr.py使用wandb报错问题。(#10251)
- 修复predict_rec.py中的内存泄漏问题。(#10688)
- 修复PaddleStructure::rebuild_table函数中structure_boxes的索引错误导致dis和iou无法正确计算的问题。(#10810)
- 兼容paddle 2.5 fluid退场。 (#10391)
- 修复stride下Tensor.numpy的性能问题。(#10773)
- ABINet导出时尺寸适应 ABINetRecResizeImg 的尺寸。(#10892)
- ABINet训练报错问题。(#10744)
- 解决ch_PP-OCRv4_rec_distill.yml中的keyError报错问题。(#10761)
Documentations Fix
- 修复 algorithm_kie_vi_layoutxlm_en.md、kie_datasets_en.md 、README.md、algorithm_overview.md、how_to_do_kie_en.md文档问题(#10717)
- 更新setup.py的文档问题(#10749)
- 修复requirements.txt中没有pyyaml库的问题(#10653)
New Contributors
- @RuohengMa made their first contribution in #10614
- @WilliamQf-AI made their first contribution in #10628
- @xlg-go made their first contribution in #10744
- @Gmgge made their first contribution in #10777
- @victor30608 made their first contribution in #11025
Full Changelog: v2.7.0...v2.7.1
PaddleOCRv2.7.0
Release Note
- Release PP-OCRv4, support mobile version and server version
- PP-OCRv4-mobile:When the speed is comparable, the effect of the Chinese scene is improved by 4.5% compared with PP-OCRv3, the English scene is improved by 10%, and the average recognition accuracy of the 80-language multilingual model is increased by more than 8%.
- PP-OCRv4-server:Release the OCR model with the highest accuracy at present, the detection model accuracy increased by 4.9% in the Chinese and English scenes, and the recognition model accuracy increased by 2%
refer quickstart quick use by one line command, At the same time, the whole process of model training, reasoning, and high-performance deployment can also be completed with few code in the General OCR Industry Solution in PaddleX.
- ReleasePP-ChatOCR, a new scheme for extracting key information of general scenes using PP-OCR model and ERNIE LLM.