[新服务]ArchiveBox用于保存网页的存档
date
Nov 2, 2021
slug
ArchiveBox-WebArchive
status
Published
summary
可以保存网页的html, pdf, jpg等格式
tags
service
type
Post
Summary
步骤
mkdir /data/archivebox && cd /data/archivebox
mkdir data
# 已经将远程google drive加载到 /data/gd_stanford
mkdir -p /data/gd_stanford/_service/archivebox/data/archive
# link
ln -s /data/gd_stanford/_service/archivebox/data/archive ./data/archive
chown -R 999:999 data && chmod 755 data
curl -O 'https://raw.githubusercontent.com/ArchiveBox/ArchiveBox/master/docker-compose.yml'
vi docker-compose.yml
# 记住你的端口
# optional
curl -O https://raw.githubusercontent.com/ArchiveBox/ArchiveBox/master/etc/sonic.cfg
vi sonic.cfg
docker-compose run archivebox init --setup
docker-compose run archivebox schedule --every=day --depth=1 www.nine.im
docker-compose run archivebox config --set PUBLIC_INDEX=False
docker-compose run archivebox config --set PUBLIC_SNAPSHOTS=False
docker-compose run archivebox config --set PUBLIC_ADD_VIEW=False
#
docker-compose run archivebox status
docker-compose run archivebox add https://example.com/some/page
docker-compose run archivebox add --depth=1 ~/Downloads/bookmarks_export.html
docker-compose run archivebox list --sort=timestamp --csv=timestamp,url,is_archived