improve performance for Java "block-to-interface" conversion #9401

kares wants to merge 10 commits
Conversation

headius left a comment
This is basically ok but there's a lot of duplicated code from elsewhere and we could consider using MethodHandle instead of reflection objects in some of these places. The logic seems sound; generate a class for Interface### to Proc dispatch and reuse it. I have some concerns about how these are being cached, for situations where many classes are encountered rarely or only once.
Overall I approve but we can chat through some improvements before merging.
I'd also like to see the benchmark code somewhere so we can continue to audit and profile those cases.
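On the caching concern above (interfaces encountered rarely or only once), one lock-free option is `ClassValue`, whose entries are tied to the lifetime of the key class. The sketch below is hypothetical, not JRuby's actual code: `ImplCache` and the `factory` callback are illustrative stand-ins for "generate an impl class for this interface once and reuse it".

```java
import java.util.function.Function;

// Hypothetical sketch (ImplCache and factory are illustrative, not JRuby's
// actual code): cache one generated artifact per interface via ClassValue,
// so an interface that is seen rarely or only once does not pin its
// generated value forever -- the entry can die with the key class.
public class ImplCache {
    private final ClassValue<Function<Object, Object>> cache;

    public ImplCache(Function<Class<?>, Function<Object, Object>> factory) {
        this.cache = new ClassValue<Function<Object, Object>>() {
            @Override
            protected Function<Object, Object> computeValue(Class<?> iface) {
                // computed on first use; even under a race, ClassValue
                // installs a single winning value per class
                return factory.apply(iface);
            }
        };
    }

    public Function<Object, Object> get(Class<?> iface) {
        return cache.get(iface);
    }
}
```

Unlike a plain `ConcurrentHashMap<Class, ...>`, this needs no explicit eviction and takes no global lock on the lookup path.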
implClass = defineImplClass(loader, interfaceType, implClassName);
}

constructor = (Constructor<? extends BlockInterfaceTemplate>) implClass.getConstructor(RubyProc.class);
Of course this stuff uses java.lang.reflect all over the place (both before and after this change), but we could be using MethodHandles for all of these and skip the overhead of a reflective invocation.
Tried unreflecting the constructor into a handle; no difference with the benchmarks mentioned in the PR.
The handle needs a static root of some kind: either a static final field (not possible here) or an interface implementation generated via LambdaMetafactory. Without one of those it is usually not much faster than reflection, which itself uses unrooted handles internally.
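To make the "static root" point concrete, here is a hypothetical sketch (the `Impl` class is illustrative): the same constructor reached through an unreflected handle held in a local versus a handle stored in a `static final` field. Only the latter is a JIT-visible constant that can inline down to a plain `new`; the former typically performs about like `Constructor.newInstance()`, matching the benchmark observation above.

```java
import java.lang.invoke.MethodHandle;
import java.lang.invoke.MethodHandles;
import java.lang.invoke.MethodType;
import java.lang.reflect.Constructor;

// Hypothetical sketch (Impl is illustrative, not JRuby code).
public class HandleRootingDemo {
    public static class Impl {
        public final String tag;
        public Impl(String tag) { this.tag = tag; }
    }

    // static final root: the JVM can treat this handle as a constant
    static final MethodHandle ROOTED;
    static {
        try {
            ROOTED = MethodHandles.lookup().findConstructor(
                    Impl.class, MethodType.methodType(void.class, String.class));
        } catch (ReflectiveOperationException e) {
            throw new ExceptionInInitializerError(e);
        }
    }

    public static Impl viaUnrootedHandle() throws Throwable {
        Constructor<Impl> ctor = Impl.class.getConstructor(String.class);
        MethodHandle unrooted = MethodHandles.lookup().unreflectConstructor(ctor);
        // correct, but without a constant root it is rarely inlined
        return (Impl) unrooted.invoke("unrooted");
    }

    public static Impl viaRootedHandle() throws Throwable {
        // constant handle: inlinable at the call site
        return (Impl) ROOTED.invokeExact("rooted");
    }
}
```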
/**
 * Loads {@code argIndex} from the Java arg slots and boxes it if primitive.
 */
private static void loadBoxedArg(GeneratorAdapter ga, int argIndex, Class<?> paramType) {
These three utility methods already exist in various forms throughout JRuby. Look at RealClassGenerator for some examples. We don't need to reimplement primitive boxing and unboxing and class literal loading again.
Force-pushed from 128b86d to 7fc73e2
@JIT
@SuppressWarnings("unused")
protected final IRubyObject __ruby_call(final Class<?> returnType) {
    return block.call(runtime.getCurrentContext());
While this seems a little abandoned (it could be part of the generated class), on occasion I wanted to be able to set a breakpoint around the "block-to-interface" execution, in Java; having the "template" super-class allows for that. No hard feelings if the preference is to simply get rid of this.
I get the desire and it's nice to be able to fall back on something debuggable. I wish there were a way to specify __FILE__ and __LINE__ in Java so you could provide the original generated code lines in the stack trace.
Force-pushed from 7fc73e2 to d1ff8e7
Force-pushed from d1ff8e7 to bfb3c62
Passing a Ruby block to a Java method that expects a SAM (single-abstract-method, "functional interface") type has evolved into having very bad performance, despite being one of the very useful JI features available in JRuby. Current behavior:

- creates a `RubyProc` (to wrap the block)
- creates a singleton class (`MetaClass`) for the proc object
- `singletonClass.include(<interfaceModule>)` - `synchronized(runtime.hierarchyLock)`
- the `.extended` callback re-invokes `singletonClass.include(<interfaceModule>)` - a no-op, but `includeModule` still calls `invalidateCacheDescendants` unconditionally, taking the same lock again
- `addMethod("method_missing")` - `synchronized(hierarchyLock)`
- `addMethod("<sam-method>")` - `synchronized(hierarchyLock)`

That is 4 acquisitions per task on a single monitor.
Under load with multiple threads, the lack of caching becomes very noticeable and prevents proper concurrent execution.
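A toy illustration of why this serializes under load (not JRuby code; the names `hierarchyLock` and `task` here only mirror the acquisitions listed in the description): every task takes the same global monitor four times, so additional threads mostly wait rather than make progress.

```java
// Toy illustration, not JRuby code: each task acquires one shared monitor
// four times, mirroring the four hierarchyLock acquisitions per
// block-to-interface conversion described above.
public class MonitorContention {
    static final Object hierarchyLock = new Object();
    static long counter = 0;

    static void task() {
        for (int i = 0; i < 4; i++) {          // 4 acquisitions per task
            synchronized (hierarchyLock) {
                counter++;                     // all threads serialize here
            }
        }
    }

    public static void runThreads(int threads, int tasksPerThread)
            throws InterruptedException {
        Thread[] ts = new Thread[threads];
        for (int t = 0; t < ts.length; t++) {
            ts[t] = new Thread(() -> {
                for (int i = 0; i < tasksPerThread; i++) task();
            });
            ts[t].start();
        }
        for (Thread t : ts) t.join();
    }
}
```

A lock-free cached path removes this shared monitor entirely, which is where the multi-threaded gains come from.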
The fix introduces a specialised, lock-free conversion path for blocks that originate at the Java-integration layer and are known to be used only for block-to-interface execution; existing semantics for user-land `RubyProc`s are preserved.

Performance
Numbers for the included micro benchmark: https://gist.github.com/kares/c4616956570fd58515ad6f0ffd822f8f
The improvement seems to be on the order of 20-30x, and with multi-threaded execution far beyond that (no more contention between threads).