[New Service] ArchiveBox for Saving Web Page Archives

date
Nov 2, 2021
slug
ArchiveBox-WebArchive
status
Published
summary
Saves web pages in formats such as HTML, PDF, and JPG
tags
service
type
Post

Summary

Steps

mkdir /data/archivebox && cd /data/archivebox
mkdir data

# The remote Google Drive is already mounted at /data/gd_stanford
mkdir -p /data/gd_stanford/_service/archivebox/data/archive

# Symlink the archive directory to the Google Drive mount
ln -s /data/gd_stanford/_service/archivebox/data/archive ./data/archive
chown -R 999:999 data && chmod 755 data

curl -O 'https://raw.githubusercontent.com/ArchiveBox/ArchiveBox/master/docker-compose.yml'
vi docker-compose.yml
# Note the port you configure here

# Optional: config for the Sonic full-text search backend
curl -O https://raw.githubusercontent.com/ArchiveBox/ArchiveBox/master/etc/sonic.cfg
vi sonic.cfg

docker-compose run archivebox init --setup
docker-compose run archivebox schedule --every=day --depth=1 'https://www.nine.im'
docker-compose run archivebox config --set PUBLIC_INDEX=False
docker-compose run archivebox config --set PUBLIC_SNAPSHOTS=False
docker-compose run archivebox config --set PUBLIC_ADD_VIEW=False

# Check the state of the archive
docker-compose run archivebox status

docker-compose run archivebox add https://example.com/some/page
# A host path is not visible inside the container, so pipe the file in via stdin
docker-compose run -T archivebox add --depth=1 < ~/Downloads/bookmarks_export.html
docker-compose run archivebox list --sort=timestamp --csv=timestamp,url,is_archived
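The directory setup at the start of the steps fails on a second run (`mkdir` and `ln -s` both complain if the target exists). Here is a minimal sketch of a re-runnable version. `prepare_archive_dirs` is a hypothetical helper, not part of ArchiveBox, and the `999:999` owner is simply carried over from the commands above.

```shell
#!/bin/sh
# Sketch only: a re-runnable version of the mkdir / ln -s / chown steps.
# prepare_archive_dirs is a hypothetical helper name, not an ArchiveBox command.
prepare_archive_dirs() {
    base_dir=$1     # e.g. /data/archivebox
    remote_dir=$2   # e.g. /data/gd_stanford/_service/archivebox/data/archive

    # -p makes both calls safe to repeat.
    mkdir -p "$base_dir/data" "$remote_dir" || return 1

    link="$base_dir/data/archive"
    # Refuse to replace a real directory that may already hold snapshots.
    if [ -e "$link" ] && [ ! -L "$link" ]; then
        echo "refusing to replace existing non-symlink: $link" >&2
        return 1
    fi
    # -f overwrites an old link; -n stops ln from descending into it.
    ln -sfn "$remote_dir" "$link" || return 1

    # Changing ownership only works as root; 999:999 is taken from the steps above.
    if [ "$(id -u)" -eq 0 ]; then
        chown -R 999:999 "$base_dir/data" || return 1
    fi
    chmod 755 "$base_dir/data"
}
```

With the paths from this post, the call would be `prepare_archive_dirs /data/archivebox /data/gd_stanford/_service/archivebox/data/archive`.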

Verification
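One quick way to verify from the command line is to confirm that the web UI answers on the port set in docker-compose.yml. The sketch below assumes the containers have been started (for example with `docker-compose up -d`) and that the port is 8000, which is only a guess at the value you configured. `check_web_ui` is a hypothetical helper name.

```shell
#!/bin/sh
# Sketch only: returns 0 if something answers over HTTP on host:port.
# check_web_ui is a hypothetical helper; port 8000 is an assumption.
check_web_ui() {
    host=${1:-127.0.0.1}
    port=${2:-8000}
    # -f fails on HTTP status >= 400; a redirect to the login page still counts as success.
    curl -fsS -o /dev/null --max-time 5 "http://$host:$port/"
}
```

Running `check_web_ui 127.0.0.1 8000 && echo "ArchiveBox UI is up"` prints the message only when the request succeeds.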


© Ying Bun 2021 - 2022