
paperless-ngx

paperless-ngx/paperless-ngx

paperless-ngx is a document management system for scanning, OCR, indexing, and archiving files.

Forks 2,836
Author paperless-ngx
Language Python
License GPL-3.0
Synced 2026-06-27

What it is

paperless-ngx is an open source document management system. It continues the Paperless idea as a community-supported app for people who want to move beyond folders full of unsearchable scans.

Scans, invoices, contracts, and letters quickly become chaotic without OCR, metadata, sorting rules, and search. This page describes the project as a working tool, with context, typical use cases, and limits, rather than as an entry in a ranking.

What is inside

The repository contains the Python/Django server, web UI, document consumer, OCR, indexing, tags, users, rules, and container setup.

paperless-ngx connects file ingestion, text recognition, metadata extraction, and archive work. The repository layout shows that this is a complete, deployable application rather than a library, a learning course, or a reference guide.

How it is used

A user adds a PDF or image, the system recognizes text, suggests organization, and makes the document searchable.

A good start is to run the small scenario in the Example section, then import a small set of your own documents, configure naming rules, and check OCR quality on your scans.
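The "add a PDF or image" step can be sketched as a small shell helper that copies a scan into the folder the document consumer watches. This is a minimal sketch under assumptions: the function name `queue_document` is hypothetical, and the default `./consume` path is an assumption that should match whatever host folder your compose file mounts as the consumption directory.

```shell
#!/usr/bin/env bash
# Hypothetical helper, not part of paperless-ngx itself.
# CONSUME_DIR is an assumption: point it at the host folder that your
# compose file mounts as the consumption directory.
CONSUME_DIR="${CONSUME_DIR:-./consume}"

queue_document() {
  local src="$1"
  # Refuse anything that is not a regular file.
  if [ ! -f "$src" ]; then
    echo "not a file: $src" >&2
    return 1
  fi
  # Create the folder if needed, then copy the scan into it.
  mkdir -p "$CONSUME_DIR"
  cp "$src" "$CONSUME_DIR/$(basename "$src")"
}
```

Calling `queue_document ~/scans/invoice.pdf` would place the file in the watched folder. Whether and how quickly it is picked up depends on how the consumer is configured in your installation.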

Strengths and limits

The strength is a practical full cycle from incoming file to search and archive organization.

The limitation is operational responsibility: backups, access control, updates, and recognition quality need care.
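The backup responsibility can be made concrete with a short sketch. It assumes you have already exported documents and metadata to a folder (the comment shows the exporter invocation as it is commonly documented for the Docker setup; verify the service name and path against your own installation). The function name `archive_export` and the archive naming scheme are hypothetical.

```shell
#!/usr/bin/env bash
# Step 1 (run manually; assumes the compose service is named `webserver`):
#   docker compose exec -T webserver document_exporter ../export
#
# Step 2: pack the exported folder into a timestamped archive.
# archive_export is a hypothetical helper, not part of paperless-ngx.
archive_export() {
  local export_dir="$1" backup_dir="$2"
  if [ ! -d "$export_dir" ]; then
    echo "missing export dir: $export_dir" >&2
    return 1
  fi
  mkdir -p "$backup_dir"
  local stamp archive
  stamp="$(date +%Y%m%d-%H%M%S)"
  archive="$backup_dir/paperless-export-$stamp.tar.gz"
  # -C keeps paths inside the archive relative to the export folder.
  tar -czf "$archive" -C "$export_dir" . && echo "$archive"
}
```

A call such as `archive_export ./export /mnt/backups` prints the path of the new archive. Keep the backup folder outside the export folder, and store copies off the host if the archive matters.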

The practical value of paperless-ngx is easiest to see in a small verifiable scenario: take the task the project was made for and follow it to a result. paperless-ngx turns a personal or work document archive into a searchable system with OCR, tags, correspondents, and processing rules, so it can be judged by how much manual filing and searching it actually removes.

If paperless-ngx remains in use beyond the first experiment, maintenance, updates, access rules, license terms, and clear ownership become as important as features. That is where the difference between an interesting repository and a durable product dependency usually appears.

paperless-ngx has a concrete audience: people who want to move beyond folders full of unsearchable scans. It also has a typical adoption path, starting from a small trial import, and conditions where it becomes useful or unnecessary.

Example

Minimal launch with Docker Compose

The example shows the general order: start the services, then open the web interface for initial setup.

Language: Bash
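# Start all services from the compose file in the background
docker compose up -d
# Follow the webserver logs until startup finishes, then open the web UI in a browser
docker compose logs -f webserver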