Optical Character Recognition OCR: Difference between revisions

From WickyWiki
mNo edit summary
mNo edit summary
 
(2 intermediate revisions by the same user not shown)
Line 4: Line 4:
[[Category:202208]]
[[Category:202208]]


Sourcecode, info and links:
= Tesseract OCR =
 
Source code, info and links:
* https://github.com/tesseract-ocr/tesseract
* https://github.com/tesseract-ocr/tesseract


Line 15: Line 17:
</source>
</source>


Greenshot integration:
= Greenshot integration =
 
This will create a .txt -file in addition to the created image-file with OCR results. Feel free to change any parameters to improve the results.
 
Greenshot screen capture:
* https://getgreenshot.org/
 
Go to Greenshot > Preferences > Plugins > External command plugin > Configure > New
 
  Name: Tesseract-OCR
  Command: C:\Program Files\Tesseract-OCR\tesseract.exe
  Arguments "{0}" "{0}" -c paragraph_text_based=0 -l eng+nld --psm 6


Go to Preferences > Plugins > External command plugin > Configure > New
Go to Greenshot > Preferences > Destination


   Tesseract-OCR
   Select "Save directly" and "Tesseract-OCR"
  C:\Program Files\Tesseract-OCR\tesseract.exe
  "{0}" outputbase -l eng+nld --psm 6

Latest revision as of 08:18, 31 August 2022


Tesseract OCR

Source code, info and links:

Binaries, installers:

Commandline usage:

tesseract imagename outputbase [-l lang] [--oem ocrenginemode] [--psm pagesegmode] [configfiles...]

Greenshot integration

This will create a .txt -file in addition to the created image-file with OCR results. Feel free to change any parameters to improve the results.

Greenshot screen capture:

Go to Greenshot > Preferences > Plugins > External command plugin > Configure > New

 Name: Tesseract-OCR
 Command: C:\Program Files\Tesseract-OCR\tesseract.exe 
 Arguments "{0}" "{0}" -c paragraph_text_based=0 -l eng+nld --psm 6

Go to Greenshot > Preferences > Destination

 Select "Save directly" and "Tesseract-OCR"