Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
7 changes: 7 additions & 0 deletions .dockerignore
Original file line number Diff line number Diff line change
@@ -0,0 +1,7 @@
.git
__pycache__/
*.pyc
results/
.venv/
.env
.DS_Store
17 changes: 17 additions & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
@@ -1,11 +1,28 @@
# Change Log
All notable changes to this project will be documented in this file.

## Beta - 2026-01-27

### Added
- [x] Added Docker support to run PageIndex in a containerized environment.
- [x] Added `Dockerfile` for building a reproducible CLI image.
- [x] Added `docker-compose.yml` for simplified execution with volume mounting.
- [x] Added `.dockerignore` to optimize Docker image size and build performance.
- [x] Enabled execution of `run_pageindex.py` as the container entrypoint.

### Changed
- [x] Standardized execution environment using Python 3.11 slim image.
- [x] Improved dependency installation reliability for PDF processing libraries.

---

## Beta - 2025-04-23

### Fixed
- [x] Fixed a bug introduced on April 18 where `start_index` was incorrectly passed.

---

## Beta - 2025-04-03

### Added
Expand Down
21 changes: 21 additions & 0 deletions Dockerfile
Original file line number Diff line number Diff line change
@@ -0,0 +1,21 @@
# syntax=docker/dockerfile:1
FROM python:3.11-slim

# Dependencias del sistema.
# DeepWiki menciona que si falla pymupdf, puede requerir libmupdf-dev. :contentReference[oaicite:1]{index=1}
RUN apt-get update && apt-get install -y --no-install-recommends \
libmupdf-dev \
gcc \
&& rm -rf /var/lib/apt/lists/*

WORKDIR /app

# Instala deps primero para aprovechar cache
COPY requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt

# Copia el resto del proyecto
COPY . .

# Por defecto, ejecuta el CLI (puedes pasar flags en docker run)
ENTRYPOINT ["python", "run_pageindex.py"]
Binary file added data/TheComingWave.pdf
Binary file not shown.
9 changes: 9 additions & 0 deletions docker-compose.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,9 @@
services:
pageindex:
build: .
image: pageindex:local
env_file:
- .env
volumes:
- ./data:/data
- ./results:/app/results
Loading