Hayai Kensaku Pukiwiki Crawler & Client
Python 3.x
You can also use the Docker image. Its entrypoint is poetry run python3.
Pukiwiki
- Setup
$ poetry install
$ edit config/pukiwiki.py
- Crawl
$ poetry run python3 pukiwiki-crawler.py crawl
- Create index
$ poetry run python3 pukiwiki-crawler.py add-index
- Clients for dev
# show help
$ poetry run python3 dev-client.py -h
Paragate
- Setup
$ edit config/paragate.py
- Crawl
$ poetry run python3 paragate-crawler.py crawl
- Create index
$ poetry run python3 paragate-crawler.py add-index
Scrapbox
- Setup
$ edit config/scrapbox.py
For SCRAPBOX_CONNECT_SID, copy the value of cookie[connect.sid] from your browser's developer tools.
- Crawl
$ poetry run python3 scrapbox-crawler.py crawl
- Create index
$ poetry run python3 scrapbox-crawler.py add-index
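For orientation, here is a minimal sketch of how a value like SCRAPBOX_CONNECT_SID is typically sent to Scrapbox as the connect.sid cookie. It is not the crawler's actual code: the helper name build_scrapbox_request, the project name, the placeholder cookie value, and the choice of the pages endpoint are all assumptions made for illustration.

```python
import urllib.parse
import urllib.request


def build_scrapbox_request(project: str, connect_sid: str) -> urllib.request.Request:
    """Build (but do not send) a request that carries the connect.sid cookie.

    Hypothetical helper for illustration; the real crawler may work differently.
    """
    # Assumed endpoint: the page list of a Scrapbox project.
    url = "https://scrapbox.io/api/pages/" + urllib.parse.quote(project, safe="")
    # The value copied from the browser is sent back as a plain Cookie header.
    return urllib.request.Request(url, headers={"Cookie": f"connect.sid={connect_sid}"})


# Placeholder values; substitute your own project and cookie value.
req = build_scrapbox_request("example-project", "s%3Aexample")
print(req.full_url)
print(req.get_header("Cookie"))
```

Passing the request to urllib.request.urlopen would perform the actual call. The cookie grants access to your account, so keep config/scrapbox.py out of version control.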
To access the dev app in Kubernetes, forward the service port to your machine:
$ kubectl port-forward service/{svc name} 9200:9200
- els => Elasticsearch
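Once the port-forward is running, a quick reachability check can confirm that Elasticsearch answers on local port 9200. The sketch below is illustrative only: the helper names es_url and cluster_health are hypothetical, and it assumes plain HTTP without authentication, which may not match your cluster.

```python
import json
import urllib.request


def es_url(path: str = "", host: str = "localhost", port: int = 9200) -> str:
    """Build a URL for the forwarded port (assumes plain HTTP, no auth)."""
    return f"http://{host}:{port}/{path.lstrip('/')}"


def cluster_health(timeout: float = 3.0) -> dict:
    """Fetch Elasticsearch's _cluster/health document as a dict.

    Raises OSError (including urllib.error.URLError) if nothing is listening.
    """
    with urllib.request.urlopen(es_url("_cluster/health"), timeout=timeout) as resp:
        return json.load(resp)


# Only builds the URL; no network access happens here.
print(es_url("_cluster/health"))
```

Call cluster_health() while kubectl port-forward is active. An OSError usually means the forward is not running or the service name is wrong.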
See LICENSE for license and DOCKER_NOTICE for Docker image notices.