Generated pages/ subfolders for all documents:
- arbeit: 386 pages
- praxis: 297 pages
- EPG: 11 pages
Page numbers are 0-based PDF page indices, matching the book viewer.
Text was extracted using pdftotext.
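
A minimal sketch of how a 0-based page index could map onto a pdftotext call, assuming one text file per page. pdftotext's -f and -l options take 1-based page numbers, so the index is shifted by one. The function name, the output file naming (0000.txt, 0001.txt, ...), and the example paths are hypothetical and not taken from the actual extraction script.

```python
from pathlib import Path


def pdftotext_command(pdf_path: str, page_index: int, out_dir: str) -> list[str]:
    """Build a pdftotext command that extracts a single page.

    page_index is the 0-based index used by the book viewer;
    pdftotext's -f/-l options expect 1-based page numbers.
    """
    if page_index < 0:
        raise ValueError("page_index must be 0 or greater")
    page_number = page_index + 1  # convert 0-based index to 1-based page number
    # Hypothetical naming scheme: zero-padded 0-based index, e.g. pages/0000.txt
    out_file = Path(out_dir) / f"{page_index:04d}.txt"
    return [
        "pdftotext",
        "-f", str(page_number),  # first page to convert
        "-l", str(page_number),  # last page to convert
        pdf_path,
        str(out_file),
    ]
```

The returned list can be passed to subprocess.run(cmd, check=True) on a machine where pdftotext is installed.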
- Extract text from PDF to markdown (576 lines, 11 pages)
- Create page index JSON with line mappings for all 11 pages
- Create metadata.jsonc (validated against BookMetadataSchema)
- Create README.md with document role and cleanup conditions
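
The page index step above can be sketched as follows, assuming the raw pdftotext output is available as one string. pdftotext typically separates pages with a form feed character, which this sketch uses to find page boundaries. The field names (page, start_line, end_line) and the function name are illustrative assumptions, not the real index schema.

```python
import json


def build_page_index(raw_text: str) -> list[dict]:
    """Map each 0-based page to its 1-based line range in the combined text.

    Assumes pages are separated by form feeds and that the combined file
    is the pages concatenated in order with the form feeds removed.
    """
    pages = raw_text.split("\f")
    # A trailing form feed after the last page leaves an empty tail; drop it.
    if pages and pages[-1].strip() == "":
        pages.pop()

    index = []
    next_line = 1  # 1-based line number where the next page starts
    for page_number, page_text in enumerate(pages):
        line_count = len(page_text.splitlines())
        index.append({
            "page": page_number,            # 0-based, matching the viewer
            "start_line": next_line,
            # For an empty page, end_line is start_line - 1 (empty range).
            "end_line": next_line + line_count - 1,
        })
        next_line += line_count
    return index


if __name__ == "__main__":
    sample = "a\nb\n\fc\n\fd\ne\nf\n\f"
    print(json.dumps(build_page_index(sample), indent=2))
```

For an 11-page document this would yield 11 entries, with the last entry's end_line equal to the total line count of the extracted text.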