[软件服务] PDF2SearchablePDF make pdf searchable
Summary
- https://github.com/ElectricRCAircraftGuy/PDF2SearchablePDF
Server
# Tesseract OCR(Optical Character Recognition)
brew install tesseract
tesseract --version
tesseract --list-langs
cd /opt/homebrew/share/tessdata/
git clone https://github.com/tesseract-ocr/tessdata_best
git clone https://github.com/tesseract-ocr/tessdata
cp tessdata/jpn.traineddata .
cp tessdata/chi_sim.traineddata .
# PDF rendering library (based on the xpdf-3.0 code base)
# https://poppler.freedesktop.org/
brew install poppler
cd /data
git clone https://github.com/ElectricRCAircraftGuy/PDF2SearchablePDF
cd PDF2SearchablePDF
# add 日文
./pdf2searchablepdf.sh input.pdf jpn