Optical Character Recognition OCR: Difference between revisions

From WickyWiki
Created page with "Category:Images Category:Windows Category:Linux Category:202208 Sourcecode, info and links: * https://github.com/tesseract-ocr/tesseract Binaries, installers: * https://tesseract-ocr.github.io/tessdoc/Home.html Commandline usage: <source lang=dos> tesseract imagename outputbase [-l lang] [--oem ocrenginemode] [--psm pagesegmode] [configfiles...] </source>"
 
mNo edit summary
 
(4 intermediate revisions by the same user not shown)
Line 4: Line 4:
[[Category:202208]]
[[Category:202208]]


Sourcecode, info and links:
= Tesseract OCR =
 
Source code, info and links:
* https://github.com/tesseract-ocr/tesseract
* https://github.com/tesseract-ocr/tesseract


Line 14: Line 16:
tesseract imagename outputbase [-l lang] [--oem ocrenginemode] [--psm pagesegmode] [configfiles...]
tesseract imagename outputbase [-l lang] [--oem ocrenginemode] [--psm pagesegmode] [configfiles...]
</source>
</source>
= Greenshot integration =
This will create a .txt -file in addition to the created image-file with OCR results. Feel free to change any parameters to improve the results.
Greenshot screen capture:
* https://getgreenshot.org/
Go to Greenshot > Preferences > Plugins > External command plugin > Configure > New
  Name: Tesseract-OCR
  Command: C:\Program Files\Tesseract-OCR\tesseract.exe
  Arguments "{0}" "{0}" -c paragraph_text_based=0 -l eng+nld --psm 6
Go to Greenshot > Preferences > Destination
  Select "Save directly" and "Tesseract-OCR"

Latest revision as of 08:18, 31 August 2022


Tesseract OCR

Source code, info and links:

Binaries, installers:

Commandline usage:

tesseract imagename outputbase [-l lang] [--oem ocrenginemode] [--psm pagesegmode] [configfiles...]

Greenshot integration

This will create a .txt -file in addition to the created image-file with OCR results. Feel free to change any parameters to improve the results.

Greenshot screen capture:

Go to Greenshot > Preferences > Plugins > External command plugin > Configure > New

 Name: Tesseract-OCR
 Command: C:\Program Files\Tesseract-OCR\tesseract.exe 
 Arguments "{0}" "{0}" -c paragraph_text_based=0 -l eng+nld --psm 6

Go to Greenshot > Preferences > Destination

 Select "Save directly" and "Tesseract-OCR"