Use Local Webdriver for UI Tests in CI by Hamms · Pull Request #65064 · code-dot-org/code-dot-org

Hamms · 2025-04-04T21:43:31Z

Specifically, I added a new --first-run-local flag to runner.rb, which tells it to use the local Selenium webdriver for the first run of a given test, and only use the configured webdriver (ie, Saucelabs) for reruns.

We can then use this flag in our CI tests to be a bit more efficient with our Saucelabs credits. We expect most tests to pass just fine with the local webdriver, but Saucelabs is more consistent and provides a better experience for debugging legitimate failures.

Also did a little miscellaneous cleanup while I was in the area:

Replaced a handful of instances of ENV['TEST_LOCAL'] == 'true' with a new helper method
Replaced a couple instances of ENV['CI'] with an existing helper method
Fixed some bugs in our SeleniumBrowser options
Bumped a version number that should have been updated as part of Use Latest Stable Chromedriver in CI #64444

Links

Related work:

Alternative to:

use local selenium directly in drone #64432

Testing story

Relying on CI tests to validate this CI-test-focused change. Note that running the local selenium webdriver alongside the puma server processes does result in slightly slower responses, which increases test flakiness slightly even for the reruns happening in Saucelabs. I deflaked a few tests in response and have been consistently getting green checkmarks since, despite multiple reruns; I think it is reasonably stable. We should be prepared to invest a bit more time in further deflaking if other engineers run into issues, though.

Also verified on Drone that this change does not noticeably impact total runtime of UI tests; see for example this run with the new logic disabled versus this run with it enabled.

Deployment strategy

We should plan to notify developers when this gets merged; warn them about the potential for increased flakiness, and ask them to speak up if they notice anything. If this change does cause issues, we can easily gate it behind a commit message flag while we resolve them.

Follow-up work

I'd like to add some kind of logging or metrics for tracking local runs vs saucelabs runs. In particular, I'd like for there to be a way for us to identify any tests that are consistently failing locally but consistently passing remotely.

… warning

This reverts commit 014ac15.

…driver

…al-selenium-in-drone

…labs rather than local

…e do locally doesn't result in fewer total attempts against sauce

… relevant

In favor of using local selenium for the first run anytime we're in the CI environment

…vironment

This reverts commit 4a8731a.

cat5inthecradle

LGTM. I like the helper function renames. Excited to make this change!

snickell

Yayyy! LGTM, its gonna be wild how much impact this should have on our saucelabs usage. I love resource usage efficiency fixes, I don't know why, but so satisfying.

snickell · 2025-04-10T04:09:59Z

        from_secret: SAUCE_USERNAME
      SAUCE_ACCESS_KEY:
        from_secret: SAUCE_ACCESS_KEY
+    shm_size: '2gb' # necessary to avoid page crashes in Selenium


I wouldn't mind a comment (or a link back to a GH comment on this PR!) showing what these look like, in case somebody in the future needs to re-evaluate and say bump it up to 3gb or whatever.

snickell · 2025-04-10T04:16:15Z

-    options.add_argument('window-size=1280,1024') if [:chrome, :firefox].include?(browser)
-    options.add_argument('headless') if headless
+    options.add_argument('--window-size=1280,1024') if [:chrome, :firefox].include?(browser)
+    options.add_argument('--headless') if headless


Not part of the review, just me being curious: were these not taking effect before without the -- being added?

They were working for Chrome, which I think is what most people use for manual local testing, but not for Firefox

Follow-up to #65064, which added logic to use a local webdriver for the first run of all UI and Eyes tests, only falling back to SauceLabs for any tests which fail that first run. This PR adds a commit flag which can be used to selectively override that behavior if a developer has a specific reason to want even passing tests to run in SauceLabs (for example, if they want to share a video of a newly-added UI test running successfully). [Slack thread](https://codedotorg.slack.com/archives/C08AMQ869QX/p1753933950644269)

* Allow Overriding Local Webdriver Tests Follow-up to #65064, which added logic to use a local webdriver for the first run of all UI and Eyes tests, only falling back to SauceLabs for any tests which fail that first run. This PR adds a commit flag which can be used to selectively override that behavior if a developer has a specific reason to want even passing tests to run in SauceLabs (for example, if they want to share a video of a newly-added UI test running successfully). [Slack thread](https://codedotorg.slack.com/archives/C08AMQ869QX/p1753933950644269) * restore accidentally-removed whitespace * [skip local webdriver] empty commit to test new commit flag

Hamms added 30 commits March 10, 2025 13:25

attempt to run selenium within the ui-tests container; yes, it's gross

f422878

clean up some debug changes

c07a61c

install actual latest chromedriver

586beb0

use local selenium directly rather than trying to host a grid locally

633ace0

always test headlessly in CI

49da92e

Resolve "FromAsCasing: 'as' and 'FROM' keywords' casing do not match"…

c0f7311

… warning

follow documented approach to install Chrome for Testing

5185130

temporarily use feature branch docker image

f296776

use dashes when adding arguments

8452383

don't complicate things with selenium_http_client

450921d

revert to manual installation, but now backed by something more current

c1ced1b

try with --no-sandbox

608a05b

try with --no-sandbox and expanded shm_size

e4fa465

try with just expanded shm_size

45f2a67

try without relying on shm at all

014ac15

Revert "try without relying on shm at all"

dc2d437

This reverts commit 014ac15.

avoid using saucelabs-specific functionality when not using saucelabs

032ebdb

explicitly set CHROME_BIN so karma can find it

1c130fd

revert to original installation method for chrome; just update chrome…

f253627

…driver

set shm_size consistently in both ui test runners

c9c0f11

switch to versioned image and re-sign drone config

6677139

also update test file with new versioned image

fc4aa85

Merge branch 'elijah/chrome-for-testing-in-docker-ci' into elijah/loc…

08cafce

…al-selenium-in-drone

attempt to use an environment variable to rerun failed tests in sauce…

3f49e1c

…labs rather than local

Merge branch 'staging' into elijah/local-selenium-in-drone

23d957d

clean up some no-longer-necessary debug customization

405c0aa

Merge branch 'staging' into elijah/local-selenium-in-drone

57b8047

temporarily bump rerun sup by 1 in drone CI,, so that the first run w…

a96ec2f

…e do locally doesn't result in fewer total attempts against sauce

speculatively add a ridiculous delay to confirm whether or not that's…

dd4fe98

… relevant

add basic 'wait for progress' ui test step

25c4bff

Hamms added 10 commits April 3, 2025 12:34

[first run local] empty commit to exercise new CI test commit flag

3a7a016

Merge branch 'staging' into elijah/local-selenium-in-drone

3710d37

[first run local] empty commit to exercise new CI test commit flag

2fbf8c3

remove 'first run local' option

3ab719d

In favor of using local selenium for the first run anytime we're in the CI environment

sign drone

ccc2607

use CI::Utils.running_on_ci? everywhere we can

6e0582c

implement new logic as an option rather than tying directly to the en…

305b59f

…vironment

rename helper method now that it's simplified

d1e9bc1

temporarily disable first_run_local to verify time delta

4a8731a

Revert "temporarily disable first_run_local to verify time delta"

58fe011

This reverts commit 4a8731a.

Hamms changed the title ~~Always use local selenium in drone~~ Use Local Webdriver for UI Tests in CI Apr 8, 2025

Hamms added the UI Testing label Apr 8, 2025

Hamms requested a review from a team April 8, 2025 23:40

Hamms marked this pull request as ready for review April 9, 2025 18:34

Hamms requested a review from a team as a code owner April 9, 2025 18:34

cat5inthecradle approved these changes Apr 9, 2025

View reviewed changes

snickell approved these changes Apr 10, 2025

View reviewed changes

Hamms added 4 commits April 10, 2025 12:39

add error specifics in comment

bdefcbc

comment new helper method

b15d210

Merge branch 'staging' into elijah/always-use-local-selenium-in-drone

ebcf77a

resign drone again after comment change

00d749d

Hamms merged commit e1d1cfc into staging Apr 14, 2025

Hamms deleted the elijah/always-use-local-selenium-in-drone branch April 14, 2025 23:44

This was referenced Apr 15, 2025

use local selenium directly in drone #64432

Closed

attempt to run selenium within the ui-tests container #64304

Closed

DO NOT MERGE - local selenium testing #64278

Closed

Use Selenium Without Saucelabs in Drone #63606

Closed

Hamms mentioned this pull request Aug 1, 2025

Allow Overriding Local Webdriver Tests #67509

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use Local Webdriver for UI Tests in CI#65064

Use Local Webdriver for UI Tests in CI#65064
Hamms merged 62 commits into
stagingfrom
elijah/always-use-local-selenium-in-drone

Hamms commented Apr 4, 2025 •

edited

Loading

Uh oh!

cat5inthecradle left a comment

Uh oh!

snickell left a comment

Uh oh!

snickell Apr 10, 2025

Uh oh!

Uh oh!

snickell Apr 10, 2025

Uh oh!

Hamms Apr 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Hamms commented Apr 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Links

Testing story

Deployment strategy

Follow-up work

Uh oh!

cat5inthecradle left a comment

Choose a reason for hiding this comment

Uh oh!

snickell left a comment

Choose a reason for hiding this comment

Uh oh!

snickell Apr 10, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

snickell Apr 10, 2025

Choose a reason for hiding this comment

Uh oh!

Hamms Apr 10, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Hamms commented Apr 4, 2025 •

edited

Loading