@OCR-D

OCR-D

DFG coordination project for the further development of Optical Character Recognition methods

Pinned

  1. core Public

    Collection of OCR-related python tools and wrappers from @OCR-D

    Python 118 31

  2. ocrd_all Public

    Master repository which includes most other OCR-D repositories as submodules

    Makefile 71 18

  3. spec Public

    Specification of the @OCR-D technical architecture, interface definitions and data exchange format(s)

    Python 17 5

  4. gt-guidelines Public

    OCR-D guidelines for Ground Truth production

    HTML 6 5

  5. ocrd-webapi-implementation Public

    Python 4

Repositories

Showing 10 of 89 repositories
  • core Public

    Collection of OCR-related python tools and wrappers from @OCR-D

    Python 118 Apache-2.0 31 118 (1 issue needs help) 23 Updated May 31, 2024
  • gt_structure_text Public

    The OCR-D Ground Truth text and structure corpus was created between 2015 and 2017. Since 2017, the corpus has been further curated and supplemented with metadata where appropriate. It consists of PAGE XML files containing annotations of the text and structure.

    5 CC-BY-SA-4.0 3 0 1 Updated May 29, 2024
  • ocrd_kraken Public

    Wrapper for the kraken OCR engine

    Python 10 Apache-2.0 6 3 0 Updated May 29, 2024
  • ocrd_all Public

    Master repository which includes most other OCR-D repositories as submodules

    Makefile 71 MIT 18 22 3 Updated May 27, 2024
  • ocrd_tesserocr Public

    Run tesseract via the tesserocr bindings through @OCR-D's interfaces

    Python 38 MIT 11 12 3 Updated May 25, 2024
  • gt_structure_1_1 Public

    The repo gt_structure_1_1 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.

    0 CC-BY-SA-4.0 1 0 0 Updated May 23, 2024
  • gt-repo-scripts Public

    XSLT and shell scripts for analyzing and creating GitHub pages of a ground truth repository. These are centrally managed and can be used by all repositories created with gt-repo-template (https://github.com/OCR-D/gt-repo-template).

    XSLT 0 CC-BY-SA-4.0 2 0 0 Updated May 22, 2024
  • Python 2 1 0 0 Updated May 22, 2024
  • gt_structure_5_3 Public

    The repo gt_structure_5_3 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.

    0 CC-BY-SA-4.0 0 0 0 Updated May 21, 2024
  • gt_structure_1_2 Public

    The repo gt_structure_1_2 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.

    0 CC-BY-SA-4.0 0 0 0 Updated May 21, 2024