0
I Use This!
Low Activity
Analyzed about 20 hours ago. based on code collected about 20 hours ago.

Project Summary

hOCR is a format for representing OCR output, including layout information, character confidences, bounding boxes, and style information. It embeds this information invisibly in standard HTML. By building on standard HTML, it automatically inherits well-defined support for most scripts, languages, and common layout options. Furthermore, unlike previous OCR formats, the recognized text and OCR-related information co-exist in the same file and survives editing and manipulation. hOCR markup is independent of the presentation.

There is a Public Specification for the hOCR Format.

Tags

fonts hocr layout layoutanalysis multi-lingual ocr opticalcharacterrecognition presentation typesetting

Badges

Apache License 2.0
Permitted

Commercial Use

Modify

Distribute

Place Warranty

Sub-License

Private Use

Use Patent Claims

Forbidden

Hold Liable

Use Trademarks

Required

Include Copyright

State Changes

Include License

Include Notice

These details are provided for information only. No information here is legal advice and should not be used as such.

All Licenses

This Project has No vulnerabilities Reported Against it

Did You Know...

  • ...
    there are over 3,000 projects on the Open Hub with security vulnerabilities reported against them
  • ...
    by exploring contributors within projects, you can view details on every commit they have made to that project
  • ...
    65% of companies leverage OSS to speed application development in 2016
  • ...
    check out hot projects on the Open Hub

30 Day Summary

Jul 24 2019 — Aug 23 2019

12 Month Summary

Aug 23 2018 — Aug 23 2019
  • 41 Commits
    Up + 39 (1950%) from previous 12 months
  • 4 Contributors
    Up + 2 (100%) from previous 12 months

Ratings

1 user rates this project:
5.0
 
5.0/5.0
Click to add your rating
   Spinner
Review this Project!