Tags: unclecode/crawl4ai
Tags
fix: replace litellm with unclecode-litellm due to PyPI supply chain … …compromise litellm 1.82.7-1.82.8 on PyPI were compromised with malicious code. PyPI quarantined the entire package (all versions uninstallable). Switched to unclecode-litellm==1.81.13, a pre-compromise fork published under our own PyPI account. Drop-in replacement — all imports unchanged.
fix: pin redis-tools version to match redis-server in Dockerfile
Release v0.8.5: Anti-Bot Detection, Shadow DOM & 60+ Bug Fixes - Anti-bot detection with 3-tier proxy escalation - Shadow DOM flattening - Deep crawl cancellation - Config defaults API - Consent popup removal (40+ CMP platforms) - Resource filtering (avoid_ads/avoid_css) - Browser recycling & memory-saving mode - GFM table compliance - Critical security fixes (RCE deserialization, Redis CVE-2025-49844) - 60+ bug fixes across browser, proxy, deep crawl, extraction, CLI, Docker - 291-test regression suite
Sync sec-ch-ua with User-Agent and keep WebGL alive in stealth mode Fix a bug where magic mode and per-request UA overrides would change the User-Agent header without updating the sec-ch-ua (browser hint) header to match. Anti-bot systems like Akamai detect this mismatch as a bot signal. Changes: - Regenerate browser_hint via UAGen.generate_client_hints() whenever the UA is changed at crawl time (magic mode or explicit override) - Re-apply updated headers to the page via set_extra_http_headers() - Skip per-crawl UA override for persistent contexts where the UA is locked at launch time by Playwright's protocol layer - Move --disable-gpu flags behind enable_stealth check so WebGL works via SwiftShader when stealth mode is active (missing WebGL is a detectable headless signal) - Clean up old test scripts, add clean anti-bot test
Release v0.8.0: The Crash Recovery & Security Update - Deep Crawl Crash Recovery: on_state_change callback and resume_state for long-running crawls - Prefetch Mode: 5-10x faster URL discovery with prefetch=True - Security Fixes: Hooks disabled by default, file:// URLs blocked on Docker API - Breaking Changes: Docker API users with hooks must set CRAWL4AI_HOOKS_ENABLED=true
PreviousNext