SimoneRemoli/Image-Letter-Parser

Java Web App (Tomcat + JSP/Servlet) for Image Parsing

Image Letter Parser is a lightweight OCR-oriented tool that takes an input image, cleans it (binarization and denoising), segments characters, and outputs the extracted letters. It is designed to be simple to run and easy to extend with different OCR backends or preprocessing steps.
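To illustrate the binarization step mentioned above, here is a minimal sketch of fixed-threshold binarization using only the Java standard library. The class name `Binarizer`, the method `binarize`, and the threshold value are assumptions made for illustration. They are not taken from the repository, which may rely on Tesseract's own preprocessing or on a different technique.

```java
import java.awt.image.BufferedImage;

/** Hypothetical helper (not from the repository): fixed-threshold binarization. */
class Binarizer {

    /**
     * Returns a new 1-bit image in which every pixel whose luminance is below
     * the threshold becomes black and every other pixel becomes white.
     */
    static BufferedImage binarize(BufferedImage src, int threshold) {
        BufferedImage out = new BufferedImage(
                src.getWidth(), src.getHeight(), BufferedImage.TYPE_BYTE_BINARY);
        for (int y = 0; y < src.getHeight(); y++) {
            for (int x = 0; x < src.getWidth(); x++) {
                int rgb = src.getRGB(x, y);
                int r = (rgb >> 16) & 0xFF;
                int g = (rgb >> 8) & 0xFF;
                int b = rgb & 0xFF;
                // Integer approximation of the usual luma weights (0.299, 0.587, 0.114).
                int luma = (299 * r + 587 * g + 114 * b) / 1000;
                out.setRGB(x, y, luma < threshold ? 0xFF000000 : 0xFFFFFFFF);
            }
        }
        return out;
    }
}
```

A call such as `Binarizer.binarize(image, 128)` could run before the image is handed to the OCR engine. A fixed threshold is the simplest choice and tends to struggle with uneven lighting, where an adaptive method would likely do better.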

OCR/Tesseract (Faculty of Engineering)

License: GPL v3


Developers and Creators


Simone Remoli

Built With

This project is built with the following technologies:

Java, Tesseract

🚦 Overview

This project uses Tesseract OCR to extract text from images, which is useful for tasks such as document automation and data extraction. The parsing pipeline is modular, so you can customize and extend it to suit your needs.

1. Clone the Repository

git clone https://github.com/SimoneRemoli/Image-Letter-Parser.git

2. Install Tesseract (with Homebrew)

brew install tesseract

3. Add Tess4J to your pom.xml

<dependency>
    <groupId>net.sourceforge.tess4j</groupId>
    <artifactId>tess4j</artifactId>
    <version>5.10.0</version>
</dependency>

4. Add VM options to Tomcat

IntelliJ → Run/Debug Configurations → Smart Tomcat configuration → VM options

-Djna.library.path=/usr/local/lib -Dtessdata.dir=/usr/local/share/tessdata

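Note: Check that the tessdata directory contains eng.traineddata:

ls /opt/homebrew/share/tessdata (or /usr/local/share/tessdata)

The listing should include eng.traineddata. On Apple Silicon Macs, Homebrew installs under /opt/homebrew, so use /opt/homebrew/lib and /opt/homebrew/share/tessdata in the VM options instead of the /usr/local paths.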
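The same check can be done from Java at startup, which makes a wrong VM option fail early. The sketch below reads the `tessdata.dir` system property set by the VM options above and looks for `eng.traineddata`. The class name `TessdataCheck` and the method `hasLanguage` are hypothetical, and how the application itself consumes `tessdata.dir` is an assumption.

```java
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

/** Hypothetical startup check (not from the repository). */
class TessdataCheck {

    /** Returns true if the directory contains "<language>.traineddata" as a regular file. */
    static boolean hasLanguage(Path tessdataDir, String language) {
        return Files.isRegularFile(tessdataDir.resolve(language + ".traineddata"));
    }

    public static void main(String[] args) {
        // Set via the VM option -Dtessdata.dir=... shown above.
        String dir = System.getProperty("tessdata.dir");
        if (dir == null) {
            System.err.println("tessdata.dir is not set; pass -Dtessdata.dir=<path>");
            return;
        }
        Path tessdata = Paths.get(dir);
        if (hasLanguage(tessdata, "eng")) {
            System.out.println("Found eng.traineddata in " + tessdata);
        } else {
            System.err.println("eng.traineddata is missing from " + tessdata);
        }
    }
}
```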

Let’s turn images into actionable data!


Example: A — [15,104,44,48] means “the letter A is inside a bounding box that starts 15 px from the left and 104 px from the top, 44 px wide and 48 px tall.”

Ita -> Esempio: A — [15,104,44,48] significa “la lettera A è dentro un riquadro che inizia a 15 px da sinistra e 104 px dall’alto, largo 44 px e alto 48 px”.

Fra -> Exemple : A — [15,104,44,48] signifie « la lettre A se trouve dans un cadre (bounding box) qui commence à 15 px depuis la gauche et 104 px depuis le haut, d’une largeur de 44 px et d’une hauteur de 48 px. »

Kor -> 예: A — [15,104,44,48] 는 문자 A가 왼쪽에서 15px, 위쪽에서 104px 떨어진 지점에서 시작하는 바운딩 박스 안에 있으며, 너비 44px, 높이 48px임을 의미합니다.

Chin -> 示例:A — [15,104,44,48] 表示“字母 A 位于一个边界框内,该边界框从左侧 15 像素、顶部 104 像素处开始,宽 44 像素,高 48 像素。”.
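The `[x, y, width, height]` layout described above can be turned into a small value type. The sketch below parses one output line of the form shown in the example and derives the right and bottom edges of the box. The class `LetterBox` and its `parse` method are hypothetical names introduced for illustration, and the assumption that every output line follows exactly this format is not confirmed by the repository.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical value type (not from the repository) for one recognized letter. */
final class LetterBox {
    final String letter;
    final int x;       // pixels from the left edge of the image
    final int y;       // pixels from the top edge of the image
    final int width;
    final int height;

    // Matches "<letter> <separator> [x,y,width,height]"; the separator is any non-space token.
    private static final Pattern LINE = Pattern.compile(
            "(\\S+)\\s+\\S+\\s+\\[(\\d+),\\s*(\\d+),\\s*(\\d+),\\s*(\\d+)\\]");

    LetterBox(String letter, int x, int y, int width, int height) {
        this.letter = letter;
        this.x = x;
        this.y = y;
        this.width = width;
        this.height = height;
    }

    /** Parses one line of output; throws IllegalArgumentException on any other format. */
    static LetterBox parse(String line) {
        Matcher m = LINE.matcher(line.trim());
        if (!m.matches()) {
            throw new IllegalArgumentException("Unrecognized line: " + line);
        }
        return new LetterBox(
                m.group(1),
                Integer.parseInt(m.group(2)),
                Integer.parseInt(m.group(3)),
                Integer.parseInt(m.group(4)),
                Integer.parseInt(m.group(5)));
    }

    /** X coordinate just past the right edge of the box. */
    int right() {
        return x + width;
    }

    /** Y coordinate just past the bottom edge of the box. */
    int bottom() {
        return y + height;
    }
}
```

For the example line, `parse` yields x = 15, y = 104, width = 44, height = 48, so the box spans x from 15 to 59 and y from 104 to 152.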

Tess4J provides Java APIs (via JNA) to invoke Tesseract without writing native code. OCR (Optical Character Recognition) is the technology that “reads” text in an image or PDF and converts it into digital text.
