Optimized Concept Learning in BEN

None, None

Optimized Concept Learning in BEN

Better identifying when transformations apply in a program synthesis agent

Bachelor Thesis (2026)

Author(s)

J.S. Duifs (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

D.Z. Zak – Mentor (TU Delft - Electrical Engineering, Mathematics and Computer Science)

S. Dumančić – Mentor (TU Delft - Electrical Engineering, Mathematics and Computer Science)

A. van Deursen – Graduation committee member (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Faculty

Electrical Engineering, Mathematics and Computer Science

AI Program synthesis Programming by Example ARC

To reference this document use

https://resolver.tudelft.nl/uuid:e09e34d9-a870-4196-9da4-90b09ca6b617

More Info

expand_more

Publication Year

2026

Language

English

Graduation Date

26-06-2026

Awarding Institution

Delft University of Technology

Project

CSE3000 Research Project

Programme

Computer Science and Engineering

Faculty

Electrical Engineering, Mathematics and Computer Science

Downloads counter

6

Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

The Abstraction and Reasoning Corpus (ARC) challenges AI systems to induce general rules from as few as two to four examples. BEN is a program synthesis agent that tackles ARC through a Divide, Align, and Conquer pipeline inspired by analogical reasoning, but its concept learning phase, which synthesizes Boolean rules governing when transformations apply, must frequently choose between candidate rules of equal structural complexity. Because ARC's extreme data scarcity makes such ties highly prevalent, the system's coarse notion of complexity provides insufficient nuance to distinguish between candidates, leaving it vulnerable to overfitting.

We present three optimizations to BEN's concept learner, built on a reimplementation of BEN in Julia. These optimization provide better nuance in complexity at different stages of the pipeline, helping the system more accurately mimic human analogical reasoning. A \emph{Rule Generality} heuristic uses cross-transformation frequency analysis to discount broadly applicable features. A \emph{DNF Model Score} leverages cumulative solver objective values as a semantic tie-breaker between structurally equivalent rules. A \emph{Simple Minimal Transformation Coverage} heuristic extends the greedy set-cover procedure, with secondary criteria based on correspondence ambiguity and transformation complexity. An ablation study on 400 ARC-AGI-1 tasks shows these optimizations increase accuracy from 32 to 36 solved tasks with no regressions and negligible computational overhead. Analysis of rule complexity reveals that generated DNFs consistently require only a single clause, suggesting that the current performance bottleneck lies in the upstream pipeline rather than in concept complexity itself.

Files

Research_paper_6_.pdf

(pdf | 0.652 Mb)

License info not available