Memoize ruby-macho's internals by woodruffw · Pull Request #942 · Homebrew/ruby-macho

woodruffw · 2026-02-11T04:02:56Z

This is an experiment in using Claude to squeeze more performance out of ruby-macho's internals. The first commits have the raw outcomes (including a mountain of Markdown from Claude); the last commit is me cleaning things up.

The two key constraints here were (1) no public API changes (it kind of failed at this at Copilot notices, but I think these changes are actually fine), and (2) all existing tests must pass. Other than that, I gave Claude free rein to generate its own benchmarks and evaluate them.

The combination of the changes here make us 50-70% faster on common (read-heavy) workloads. The actual impact on Homebrew will probably be slightly smaller, since Homebrew doesn't re-read the computed properties much. Still, not bad for ~80 lines of diff 🙂

Signed-off-by: William Woodruff william@yossarian.net

woodruffw · 2026-02-11T04:10:55Z

(The codecov/patch failure seems to be bogus; the actual coverage check didn't decline at all.)

Copilot

Pull request overview

This PR experiments with improving ruby-macho’s internal performance by memoizing frequently accessed derived values and adding a load-command type index, while aiming to keep the public API unchanged and maintain test compatibility.

Changes:

Added memoization for several derived query methods in MachOFile (e.g., rpaths, linked dylibs, segments, segment alignment).
Built an indexed @load_commands_by_type structure during load command parsing for faster command(...) lookups.
Simplified aggregation in FatFile using flat_map, and updated local tooling metadata (mise + Bundler lockfile version).

Reviewed changes

Copilot reviewed 3 out of 4 changed files in this pull request and generated 4 comments.

File	Description
`mise.toml`	Pins a Ruby toolchain version for `mise`-based development.
`lib/macho/macho_file.rb`	Adds memoization and a type index for load commands to reduce repeated scans.
`lib/macho/fat_file.rb`	Uses `flat_map` for cleaner/faster list aggregation across slices.
`Gemfile.lock`	Updates the Bundler version recorded in the lockfile.

Comments suppressed due to low confidence (1)

lib/macho/macho_file.rb:268

segments is now memoized and returns a cached array. Since this method is public, returning the same mutable array instance can be an observable API change if callers mutate it (previously it returned a fresh array each time). Consider returning a defensive copy of the cached value to keep memoization without changing external behavior.

    # All segment load commands in the Mach-O.
    # @return [Array<LoadCommands::SegmentCommand>] if the Mach-O is 32-bit
    # @return [Array<LoadCommands::SegmentCommand64>] if the Mach-O is 64-bit
    def segments
      @segments ||= if magic32?
        command(:LC_SEGMENT)
      else
        command(:LC_SEGMENT_64)
      end
    end

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

lib/macho/macho_file.rb

Signed-off-by: William Woodruff <william@yossarian.net>

Copilot

Pull request overview

Copilot reviewed 2 out of 4 changed files in this pull request and generated no new comments.

Comments suppressed due to low confidence (1)

lib/macho/macho_file.rb:269

The segments method stores a memoized value but doesn't return a duplicate like the other memoized methods (dylib_load_commands, linked_dylibs, rpaths). Although command already returns a duplicate, that duplicate gets stored in @segments, and then this method returns @segments directly without an additional .dup. This allows callers to mutate the cached value. For consistency and safety, add .dup before returning.

    def segments
      @segments ||= if magic32?
        command(:LC_SEGMENT)
      else
        command(:LC_SEGMENT_64)
      end
    end

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Bo98 · 2026-02-19T23:23:57Z

lib/macho/macho_file.rb

    # @return [Array<LoadCommands::SegmentCommand64>] if the Mach-O is 64-bit
    def segments
-      if magic32?
+      @segments ||= if magic32?


need to dup this I think.

Or alternatively there's not too much to gain here from memoization given command(...) is memoized

woodruffw self-assigned this Feb 11, 2026

woodruffw added the performance label Feb 11, 2026

woodruffw marked this pull request as ready for review February 11, 2026 04:06

Copilot AI review requested due to automatic review settings February 11, 2026 04:06

Copilot started reviewing on behalf of woodruffw February 11, 2026 04:07 View session

Copilot AI reviewed Feb 11, 2026

View reviewed changes

lib/macho/macho_file.rb Outdated Show resolved Hide resolved

lib/macho/macho_file.rb Show resolved Hide resolved

lib/macho/macho_file.rb Show resolved Hide resolved

lib/macho/macho_file.rb Show resolved Hide resolved

woodruffw requested a review from Bo98 February 11, 2026 04:11

woodruffw force-pushed the ww/perf branch from f525c3c to 6529bb6 Compare February 11, 2026 04:15

MikeMcQuaid approved these changes Feb 11, 2026

View reviewed changes

woodruffw added 5 commits February 19, 2026 17:50

Perf: implement (1) and (2)

8967e15

Signed-off-by: William Woodruff <william@yossarian.net>

Recommendation (4)

710fc2a

Signed-off-by: William Woodruff <william@yossarian.net>

Recommendation (5)

6ac309a

Signed-off-by: William Woodruff <william@yossarian.net>

Remove benchmarking artifacts

9c4f686

Signed-off-by: William Woodruff <william@yossarian.net>

Return fresh copies

bd76ef0

Signed-off-by: William Woodruff <william@yossarian.net>

woodruffw force-pushed the ww/perf branch from 5c1af61 to bd76ef0 Compare February 19, 2026 22:50

woodruffw requested a review from Copilot February 19, 2026 22:54

Copilot started reviewing on behalf of woodruffw February 19, 2026 22:54 View session

Copilot AI reviewed Feb 19, 2026

View reviewed changes

Bo98 approved these changes Feb 19, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Comments

Memoize ruby-macho's internals#942

Memoize ruby-macho's internals#942
woodruffw wants to merge 5 commits intomainfrom
ww/perf

woodruffw commented Feb 11, 2026 •

edited

Loading

Uh oh!

woodruffw commented Feb 11, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Bo98 Feb 19, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Comments

Conversation

woodruffw commented Feb 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

woodruffw commented Feb 11, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Bo98 Feb 19, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

woodruffw commented Feb 11, 2026 •

edited

Loading