fix: clear DOM cache after scroll to prevent stale data in extract by wangshisan · Pull Request #4632 · browser-use/browser-use

wangshisan · 2026-04-07T13:21:41Z

Summary

After a scroll action, the enhanced_dom_tree cache in DOMWatchdog was not being invalidated. This caused subsequent extract() calls to return stale DOM data from before the scroll, leading agents to see duplicate/unchanged content and incorrectly conclude that no new data was available after scrolling.

Problem

Step 1: extract() → builds DOM → caches as enhanced_dom_tree ("18 posts")
Step 2: scroll(down=1) → browser scrolls, new content loads ✅
         → BUT cache NOT cleared! ❌
Step 3: extract() → returns cached OLD data ("18 posts") ← BUG

The agent then sees "duplicate data" → thinks "cannot get more" → may give up on pagination.

Root Cause

The scroll action in tools/service.py never called dom_watchdog.clear_cache().

Other operations that correctly clear the cache:

read_file (source=page) — service.py:1736
Tab switch (AgentFocusChangedEvent) — session.py:1141

The cache read path at markdown_extractor.py:119-125 returns cached value without any staleness check.

Fix

Add dom_watchdog.clear_cache() call after successful scroll completion:

# After successful scroll, clear DOM cache to ensure next extract gets fresh content
if browser_session._dom_watchdog:
    browser_session._dom_watchdog.clear_cache()

Changes

browser_use/tools/service.py: Added 3 lines — clear_cache() call after successful scroll return

Verification

✅ ruff check passes
✅ ruff format passes
✅ pyright passes (0 errors, 0 warnings)

Summary by cubic

Clear the DOMWatchdog enhanced_dom_tree cache after a successful scroll so the next extract() reads fresh DOM. This prevents stale or duplicate content after pagination.

Bug Fixes
- In browser_use/tools/service.py, call browser_session._dom_watchdog.clear_cache() on successful scroll.
- Ensures extract() doesn’t return pre-scroll cached DOM.

^{Written for commit 4acd164. Summary will update on new commits.}

CLAassistant · 2026-04-07T13:21:49Z

All committers have signed the CLA.

cubic-dev-ai

No issues found across 1 file

After a scroll action, the enhanced_dom_tree cache was not being invalidated, causing subsequent extract() calls to return stale DOM data from before the scroll. This led to agents seeing duplicate/unchanged content and incorrectly concluding that no new data was available after scrolling. Fix: call dom_watchdog.clear_cache() on successful scroll.

laithrw · 2026-04-11T22:06:50Z

Thanks for looking into this! This patches it at the tools.scroll() call site, which is just one entry point. Scroll also fires through the MCP server path and potentially other callers via ScrollEvent. Closing in favor of #4658, which fixes this at the event layer, but I appreciate your work on this.

cubic-dev-ai Bot reviewed Apr 7, 2026

View reviewed changes

wangshisan force-pushed the fix/scroll-clear-dom-cache branch from c3838b1 to d5f6eb6 Compare April 7, 2026 13:53

wangshisan added 2 commits April 8, 2026 19:37

Merge branch 'main' into fix/scroll-clear-dom-cache

0119501

Merge branch 'main' into fix/scroll-clear-dom-cache

4acd164

laithrw closed this Apr 11, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: clear DOM cache after scroll to prevent stale data in extract#4632

fix: clear DOM cache after scroll to prevent stale data in extract#4632
wangshisan wants to merge 3 commits intobrowser-use:mainfrom
wangshisan:fix/scroll-clear-dom-cache

wangshisan commented Apr 7, 2026 •

edited by cubic-dev-ai Bot

Loading

Uh oh!

CLAassistant commented Apr 7, 2026 •

edited

Loading

Uh oh!

cubic-dev-ai Bot left a comment

Uh oh!

laithrw commented Apr 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

wangshisan commented Apr 7, 2026 • edited by cubic-dev-ai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Problem

Root Cause

Fix

Changes

Verification

Summary by cubic

Uh oh!

CLAassistant commented Apr 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cubic-dev-ai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

laithrw commented Apr 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

wangshisan commented Apr 7, 2026 •

edited by cubic-dev-ai Bot

Loading

CLAassistant commented Apr 7, 2026 •

edited

Loading