Total pages
3,323
across 20 crawls
Every crawl, every byte, every retry. Filter by status or export to CSV.
| URL | Strategy | Status | Extractor | Pages | Bytes | Started |
|---|---|---|---|---|---|---|
github.com/anthropics/anthropic-sdk-python cj_1000 | http | completed | css:article | 220 | 1.5 MB | 23:52 Jul 4 |
docs.anthropic.com/en/api/getting-started cj_100d | browser | completed | xpath://div | 25 | 5.2 MB | 22:52 Jul 4 |
huggingface.co/docs/transformers/index cj_101a | deep | completed | schema:Article | 270 | 7.2 MB | 21:52 Jul 4 |
en.wikipedia.org/wiki/RAG_(information_retrieval) cj_91a3f0 | browser | completed | css:.mw-parser-output p | 1 | 180.6 KB | 21:21 Jul 4 |
arxiv.org/abs/2506.12345 cj_1027 | http | completed | llm:summary | 477 | 1.9 MB | 20:52 Jul 4 |
stackoverflow.com/questions/tagged/playwright cj_40b8d2 | http | completed | xpath://div[@data-qa] | 50 | 2.3 MB | 20:32 Jul 4 |
pypi.org/project/crawl4ai cj_1034 | browser | failed | css:article | 187 | 2.5 MB | 19:52 Jul 4 |
reddit.com/r/LocalLLaMA/top?t=week cj_3ae51c | browser | failed | css:shreddit-post | 12 | 783.3 KB | 19:14 Jul 4 |
huggingface.co/datasets?sort=downloads cj_5d8a99 | http | completed | xpath://article | 20 | 987.7 KB | 18:58 Jul 4 |
nytimes.com/section/technology cj_1041 | deep | completed | xpath://div | 394 | 102.0 KB | 18:52 Jul 4 |
twitter.com/search?q=crawl4ai cj_e1f0aa | browser | rate-limited | css:article | 0 | 0 B | 18:31 Jul 4 |
anthropic.com/research cj_6b2c4d | browser | completed | css:article p | 8 | 597.7 KB | 18:01 Jul 4 |
wired.com/tag/ai cj_104e | http | completed | schema:Article | 114 | 4.4 MB | 17:52 Jul 4 |
theverge.com/ai-artificial-intelligence cj_105b | browser | rate-limited | llm:summary | 475 | 3.4 MB | 16:52 Jul 4 |
platform.openai.com/docs cj_1068 | deep | completed | css:article | 350 | 6.9 MB | 15:52 Jul 4 |
developer.mozilla.org/en-US/docs/Web/API cj_1075 | http | completed | xpath://div | 177 | 5.5 MB | 14:52 Jul 4 |
github.com/anthropics/anthropic-sdk-python cj_1082 | browser | completed | schema:Article | 171 | 6.0 MB | 13:52 Jul 4 |
docs.anthropic.com/en/api/getting-started cj_108f | deep | completed | llm:summary | 89 | 1.4 MB | 12:52 Jul 4 |
huggingface.co/docs/transformers/index cj_109c | http | completed | css:article | 189 | 4.6 MB | 11:52 Jul 4 |
arxiv.org/abs/2506.12345 cj_10a9 | browser | failed | xpath://div | 94 | 7.4 MB | 10:52 Jul 4 |