[软件服务] PDF2SearchablePDF make pdf searchable

date
Feb 7, 2025
slug
newsoftware-PDF2SearchablePDF
status
Published
summary
新软件-
tags
software
service
type
Post
URL

Summary

  • https://github.com/ElectricRCAircraftGuy/PDF2SearchablePDF

Server

# Tesseract OCR(Optical Character Recognition)
brew install tesseract
tesseract --version
tesseract --list-langs
cd /opt/homebrew/share/tessdata/
git clone https://github.com/tesseract-ocr/tessdata_best
git clone https://github.com/tesseract-ocr/tessdata
cp tessdata/jpn.traineddata .
cp tessdata/chi_sim.traineddata .

# PDF rendering library (based on the xpdf-3.0 code base)
# https://poppler.freedesktop.org/
brew install poppler

cd /data
git clone https://github.com/ElectricRCAircraftGuy/PDF2SearchablePDF
cd PDF2SearchablePDF

# add 日文
./pdf2searchablepdf.sh input.pdf jpn

Client

Raw


© Ying Bun 2021 - 2025