Puppeteer 与 Chrome Headless —— 从入门到爬虫
https://github.com/csbun/thal

中文译文地址,关于如何利用 Chrome Headless 实现爬虫的初级介绍;
http://csbun.github.io/blog/2017/09/puppeteer/

原文:
Getting started with Puppeteer and Chrome Headless for Web Scraping
Github 项目地址:https://github.com/emadehsan/thal

【譯】Headless Chrome 入門指南
https://segmentfault.com/a/1190000010411069

基础

Python3 实现重构 puppeteer
https://github.com/miyakogi/pyppeteer

深入进阶

爬虫基础篇[Web 漏洞扫描器]
http://blog.fatezero.org/2018/03/05/web-scanner-crawler-01/

A Python Package for the Google Chrome Dev Protocol
https://github.com/fate0/pychrome

使用 Headless Chrome 打造一个Crawler
https://github.com/yujiosaka/headless-chrome-crawler

Chrome DevTools Protocol (CDP)
https://chromedevtools.github.io/devtools-protocol/

Chrome Debugging Protocol interface for Node.js
https://github.com/cyrus-and/chrome-remote-interface

Chromium Docs
https://chromium.googlesource.com/chromium/src/+/master/docs/

Chromium 源码
https://cs.chromium.org/chromium/src/

Chrome Headless 命令行参数 选项 列表
https://peter.sh/experiments/chromium-command-line-switches/
https://cs.chromium.org/chromium/src/content/public/common/content_switches.cc

高阶操作 API 函数
https://chromedevtools.github.io/devtools-protocol/tot/Network

点击收藏 | 3 关注 | 3
登录 后跟帖