PDF-Extractor

About

PDF-Extractor is a web application that lets users upload PDF files, extract their text, and correct it with language tools. The frontend is built with React and the backend with FastAPI. The application supports English and French.

Getting Started

Prerequisites

Node.js and npm
Python 3.12.3
FastAPI
pdfplumber
language_tool_python
uvicorn
langdetect

Installation

Clone the repository:

git clone https://github.com/arij01/PDF-Extractor.git
cd PDF-Extractor

Install the Python requirements for the backend:

pip install -r requirements.txt

Running the Application

Start the backend server from the repository root (uvicorn listens on http://127.0.0.1:8000 by default):

uvicorn app:app --reload

In a second terminal, install the required npm packages from the frontend directory:

cd frontend
npm install

Start the frontend development server:

npm start

Open your browser and navigate to http://localhost:3000 to view the application.

Usage

A sample PDF file is provided in the sample directory. You can use this file to test the application.

Uploading a PDF

Open the application in your browser.
Drag and drop a PDF file into the designated area or click to select a file.
Click the "Submit" button to upload the file.
Wait for the text extraction and correction process to complete.
The corrected text will be displayed on the screen.
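
The extraction step above can be sketched as follows. This is a minimal illustration built on pdfplumber, not the code in app.py; the helper names extract_text and join_pages are hypothetical.

```python
from typing import Iterable, Optional


def join_pages(page_texts: Iterable[Optional[str]]) -> str:
    """Join per-page text, skipping pages that yielded no text."""
    cleaned = [text.strip() for text in page_texts if text and text.strip()]
    return "\n\n".join(cleaned)


def extract_text(pdf_path: str) -> str:
    """Extract the text of every page in a PDF (hypothetical helper)."""
    # Imported lazily so join_pages works without pdfplumber installed.
    import pdfplumber

    with pdfplumber.open(pdf_path) as pdf:
        # extract_text() can return None or "" for pages without a text layer.
        return join_pages(page.extract_text() for page in pdf.pages)
```

Calling extract_text with the path of a PDF, such as the one in the sample directory, should return the text of all pages separated by blank lines. Scanned PDFs without a text layer would likely produce an empty string, since this sketch does no OCR.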

Correcting Text

The application uses language_tool_python to correct the text extracted from the PDF. The language is automatically detected using langdetect.
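
A rough sketch of how detection and correction can fit together is shown below. It is an assumption about the flow, not the project's actual code: the function names, the mapping table, and the English fallback are all hypothetical.

```python
# Hypothetical mapping from langdetect codes to LanguageTool language codes.
LANGUAGE_TOOL_CODES = {"en": "en-US", "fr": "fr"}
DEFAULT_CODE = "en-US"  # assumption: fall back to English for other languages


def to_language_tool_code(detected: str) -> str:
    """Translate a langdetect code into a LanguageTool code."""
    return LANGUAGE_TOOL_CODES.get(detected, DEFAULT_CODE)


def detect_language(text: str) -> str:
    """Return the langdetect code for text, or "en" if detection fails."""
    from langdetect import detect
    from langdetect.lang_detect_exception import LangDetectException

    try:
        return detect(text)
    except LangDetectException:
        # Raised for input with no detectable features, such as empty text.
        return "en"


def correct_text(text: str) -> str:
    """Detect the language of text and apply LanguageTool's suggestions."""
    import language_tool_python

    tool = language_tool_python.LanguageTool(
        to_language_tool_code(detect_language(text))
    )
    try:
        return tool.correct(text)
    finally:
        tool.close()  # stop the LanguageTool process started for this call
```

Creating a LanguageTool instance per call keeps the sketch short but is slow; a real server would more likely keep one instance per language alive. By default language_tool_python starts a local LanguageTool server, which requires Java to be installed.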
