Skip to content

[question] how to run tesseract4java on ubuntu? #56

Description

@WPFilmmaker

Hi, first of all thank you for this program, tesseract really needs a gui. I followed the usage page in the wiki however I still am unable to process my images.

I am on Lubuntu 18.04 and I installed tesseract from the package manager. Then i downloaded the jar for tesseract4java.

I created a new project and can load images, however it seems that the issue is the traineddata file. However through the package manager /synaptic) I installed "tesseract-ocr language files for English" which according to the description "This package contains the data needed for processing images in English language."

Tesseract4java however can't find that on my system I already have such file, and from the program I can't find a way to load it.

Could you help?

I also have a few questions about the traineddata file.

Even though I am on lubuntu, do I have to use such trainedata file? If yes where do I have to put them?

Is there a way to use the files from my software manager?

Thank you :)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Fields

    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions