Homelab Troubleshooting

Tips on running a Linux server and docker.


paperlessng

Differences

This shows you the differences between two versions of the page.

paperlessng [2021/09/01 12:19] – created alexk7110
paperlessng [2021/09/01 12:26] (current) – [Scanservjs] alexk7110
Line 2:
  
 Top tip: put any custom config variables (such as custom file naming) in the docker-compose file under "environment".
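As a rough sketch of what that looks like, here is a docker-compose fragment that sets a custom file naming pattern. The service name ''webserver'' and the image tag are assumptions based on the stock Paperless-ng compose files, and the naming pattern itself is only an illustration, so adjust all three to match your own setup.

```yaml
services:
  webserver:                                  # assumed service name; use your own
    image: jonaharagon/paperless-ng:latest    # assumed image tag
    environment:
      # Illustrative pattern only: files end up as <year>/<correspondent>/<title>
      PAPERLESS_FILENAME_FORMAT: "{created_year}/{correspondent}/{title}"
```

Keeping these variables in the compose file means they travel with the container definition instead of living in a separate env file.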
 +
 +===== PDF cleaning =====
 +
 +I've since found out that my existing OCR'd PDFs have a janked-up OCR layer that Paperless-ng does NOT like: the text content is saved as a single column of characters. Not Paperless-ng's fault, just something to do with the way SwiftScan saved the files.
 +
 +So, after a LOT of hunting, I've eventually settled on [[https://www.pdfshaper.com/|PDF Shaper Free]] for Windows. The free version still allows exporting all images from a PDF. Then I convert all those images back into a fresh, clean PDF (no dirty OCR). This gets dumped in Paperless-ng and job's a good'un.
 +
 +Top tip: experiment with the DPI setting for image exports to get the size/quality you want, as the DPI can be ignored in the import process.
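To get a feel for what a DPI setting means for image size before exporting, the arithmetic is just page size in inches times DPI. Here is a minimal Python sketch of that calculation. It assumes A4 pages (210 x 297 mm), and the function name is made up for illustration; it has nothing to do with PDF Shaper or Paperless-ng themselves.

```python
MM_PER_INCH = 25.4


def pixels_for_page(width_mm, height_mm, dpi):
    """Return (width, height) in pixels for a page exported at the given DPI."""
    width_px = round(width_mm / MM_PER_INCH * dpi)
    height_px = round(height_mm / MM_PER_INCH * dpi)
    return (width_px, height_px)


if __name__ == "__main__":
    # Assumed page size: A4, 210 x 297 mm
    for dpi in (150, 200, 300):
        w, h = pixels_for_page(210, 297, dpi)
        print(f"{dpi} DPI -> {w} x {h} px ({w * h / 1e6:.1f} megapixels)")
```

Doubling the DPI roughly quadruples the pixel count, which is why file sizes climb so quickly at higher settings.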
 +
 +===== Scanservjs =====
 +
 +https://github.com/sbs20/scanservjs
 +
 +Scanservjs is a web UI frontend for your scanner. It allows you to share one or more scanners (using SANE) on a network without the need for drivers or complicated installation.
 +
  
paperlessng.1630487961.txt.gz · Last modified: 2021/09/01 12:19 by alexk7110