#1 - feat: kOREInference — Algorithm 4 iDRegEx with MDL scoring + ensemble integration - tobi/grammar-inference-engine

tobi commented

2026-07-01 12:50:23 +00:00

Owner

What

Implements kOREInference following arXiv 1004.2372 Algorithm 4 (iDRegEx) exactly:

1. For k = 1..kmax, N random trials:
   - iKoa (Algorithm 1) → deterministic k-OA
   - rwr² (Algorithm 3) → k-ORE expression
   - Validate determinism + k-occurrence
2. Score all candidates by MDL (model cost + data cost)
3. Return the best (KOA, expression, k)
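The selection loop above can be sketched roughly as follows. This is a minimal illustration, not the code in `bex/kore.py`: `select_best`, `propose`, `is_valid`, and `cost` are hypothetical names, where `propose` stands in for the iKoa + rwr² step, `is_valid` for the determinism and k-occurrence check, and `cost` for the MDL score.

```python
import math
import random
from typing import Callable, Optional, Sequence, Tuple, TypeVar

Expr = TypeVar("Expr")
Word = Tuple[str, ...]


def select_best(
    sample: Sequence[Word],
    propose: Callable[[Sequence[Word], int, random.Random], Optional[Expr]],
    is_valid: Callable[[Expr, int], bool],
    cost: Callable[[Expr, Sequence[Word]], float],
    kmax: int = 4,
    trials: int = 10,
    seed: int = 0,
) -> Optional[Tuple[Expr, int]]:
    """Try every k in 1..kmax with several random trials; keep the cheapest valid candidate.

    All callables are hypothetical stand-ins supplied by the caller.
    Returns None when no trial yields a valid candidate.
    """
    rng = random.Random(seed)
    best: Optional[Tuple[Expr, int]] = None
    best_cost = math.inf
    for k in range(1, kmax + 1):
        for _ in range(trials):
            expr = propose(sample, k, rng)   # stand-in for iKoa + rwr²
            if expr is None or not is_valid(expr, k):
                continue                     # rewriting failed or validation rejected it
            c = cost(expr, sample)           # stand-in for model cost + data cost
            if c < best_cost:
                best, best_cost = (expr, k), c
    return best
```

Returning `None` when every trial fails mirrors the behaviour reported under "Key finding"; whether the real implementation signals failure the same way internally is an assumption here.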

Changes

| File | Change |
|------|--------|
| `bex/kore.py` | Rewrote. Removed broken PTA→Shrink→Repair→StateElimination approach. Replaced with proper Algorithm 4 pipeline wrapping `ikoa` + `rwr_sq` + MDL |
| `bex/ensemble.py` | Added kOREInference as 3rd algorithm in `infer_ensemble` alongside CRX and iDRegEx. Refactored prefer logic into a clean dispatch table |
| `bex/__init__.py` | Export `kOREInference` and `validate_k_ore` |
| `tests/test_kore.py` | 32 tests: core inference, edge cases (empty, single, 20×identical), validate_k_ore, MDL scoring, paper-faithful assertions |
| `tests/test_ensemble.py` | 19 tests: ensemble runs all three, prefer=crx/idregex/koreinference, edge cases, stochastic stability |

Test count: 79 total (28 existing + 32 kORE + 19 ensemble), all passing.
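For readers unfamiliar with the dispatch-table refactor mentioned for `bex/ensemble.py`, the idea can be sketched as below. This is an illustrative toy, not the actual `infer_ensemble`: `run_ensemble`, the result layout, and the fallback to the first successful algorithm are all assumptions.

```python
from typing import Callable, Dict, Optional, Sequence, Tuple

Word = Tuple[str, ...]
Algorithm = Callable[[Sequence[Word]], Optional[str]]


def run_ensemble(
    sequences: Sequence[Word],
    algorithms: Dict[str, Algorithm],
    prefer: str,
) -> Dict[str, object]:
    """Run every algorithm in the table, then pick the preferred result.

    `algorithms` is the dispatch table: name -> callable. The fallback rule
    (first non-None result in table order) is a hypothetical choice.
    """
    if prefer not in algorithms:
        raise ValueError(f"unknown algorithm: {prefer!r}")
    results = {name: fn(sequences) for name, fn in algorithms.items()}
    if results[prefer] is not None:
        chosen: Optional[str] = prefer
    else:
        # Preferred algorithm produced nothing; fall back to any that did.
        chosen = next((n for n, r in results.items() if r is not None), None)
    return {"results": results, "chosen": chosen}
```

A table like this keeps the `prefer` handling to a single lookup, so adding a fourth algorithm would only require a new entry rather than another branch.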

Key finding

Applied to 3,926 real opencode step-boundary tool sequences, kOREInference returns None: rwr0 cannot handle the interconnectivity. The tool graph (read→bash, read→grep, read→glob, bash→read, etc.) cannot be described by a SORE (Single Occurrence Regular Expression). This is a genuine empirical result: the agent's behavior within steps is probabilistic, not grammatically structured.
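One way to inspect the interconnectivity described above is to build the successor graph of the sample and list symbol pairs that follow each other in both directions. This is a diagnostic sketch only: `successor_graph` and `mutual_pairs` are hypothetical helpers, and back-and-forth edges do not by themselves decide whether a SORE exists for the sample.

```python
from collections import defaultdict
from typing import Dict, List, Sequence, Set, Tuple

Word = Tuple[str, ...]


def successor_graph(sequences: Sequence[Word]) -> Dict[str, Set[str]]:
    """Map each symbol to the set of symbols that directly follow it in some sequence."""
    graph: Dict[str, Set[str]] = defaultdict(set)
    for seq in sequences:
        for a, b in zip(seq, seq[1:]):
            graph[a].add(b)
    return dict(graph)


def mutual_pairs(graph: Dict[str, Set[str]]) -> List[Tuple[str, str]]:
    """Return sorted pairs (a, b) with a < b where a→b and b→a both occur."""
    return sorted(
        (a, b)
        for a, succs in graph.items()
        for b in succs
        if a < b and a in graph.get(b, set())
    )
```

On a toy sample such as `[("read", "bash", "read"), ("read", "grep")]`, `mutual_pairs` reports the `bash`/`read` pair, which is the kind of edge the comment lists.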

Next steps

Merge when ready
The examples/ directory (untracked) has exploratory scripts for the session-data analysis

tobi added 1 commit 2026-07-01 12:50:24 +00:00

feat: implement kOREInference (Algorithm 4) with MDL scoring, add to ensemble, 79 tests

ci/woodpecker/push/woodpecker Pipeline was successful

ci/woodpecker/pr/woodpecker Pipeline was successful

edd6d9d4dd

tobi added 1 commit 2026-07-01 13:09:17 +00:00

feat: core+outlier analysis via min_coverage parameter, 6 new tests

ci/woodpecker/push/woodpecker Pipeline was successful

ci/woodpecker/pr/woodpecker Pipeline was successful

9045769d57

tobi commented

2026-07-01 13:09:23 +00:00

Author

Owner

New: `min_coverage` core + outlier analysis

infer_ensemble(sequences, min_coverage=0.8) now additionally finds the tightest core via iterative CRX + outlier removal:

- Outliers are identified by symbol rarity (sequences containing rare symbols are removed first)
- Removal continues until a min_coverage fraction of the sequences remains (default 1.0 = no filter); the counts in the example (13/15 for 0.9, 10/15 for 0.7) suggest the kept count is rounded down
- The return value includes result['core'] with {grammar, coverage, outlier_count, outliers}
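The rarity-based split can be illustrated with the sketch below. It is a simplification under stated assumptions: it ranks sequences once by their rarest symbol instead of re-running CRX iteratively, it rounds the kept count down, and `split_core_outliers` with its `core` key is a hypothetical name rather than the library's API (the real result carries a `grammar`, which is omitted here).

```python
import math
from collections import Counter
from typing import Dict, List, Sequence, Tuple

Word = Tuple[str, ...]


def split_core_outliers(sequences: Sequence[Word], min_coverage: float = 1.0) -> Dict[str, object]:
    """Drop the sequences whose rarest symbol is least common until the target fraction remains."""
    if not 0.0 < min_coverage <= 1.0:
        raise ValueError("min_coverage must be in (0, 1]")
    n = len(sequences)
    # Assumption: the kept count is floor(n * min_coverage); the epsilon guards float error.
    keep = math.floor(n * min_coverage + 1e-9)
    # Document frequency: in how many sequences does each symbol appear?
    df = Counter(sym for seq in sequences for sym in set(seq))

    def rarity(seq: Word) -> int:
        # A sequence is as rare as its rarest symbol; empty sequences count as common.
        return min((df[s] for s in seq), default=n)

    # Stable sort: rarest first, ties keep input order.
    order = sorted(range(n), key=lambda i: rarity(sequences[i]))
    outlier_idx = set(order[: n - keep])
    core: List[Word] = [s for i, s in enumerate(sequences) if i not in outlier_idx]
    outliers: List[Word] = [s for i, s in enumerate(sequences) if i in outlier_idx]
    return {
        "core": core,
        "coverage": len(core) / n if n else 1.0,
        "outlier_count": len(outliers),
        "outliers": outliers,
    }
```

With `min_coverage=1.0` nothing is removed, which matches the "no filter" default described above.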

Example: 15 Ansible roles

min_coverage=1.0: fail.include_vars.set_fact.package.file.template.service?.npm?.pip?.lineinfile?  (all 15)
min_coverage=0.9: fail.include_vars.set_fact.package.file.template.service?.(lineinfile+npm)?       (13/15, outliers: mysql, docker)
min_coverage=0.8: fail.include_vars.set_fact.package.file.template.service?.npm?                    (12/15, + nginx)
min_coverage=0.7: fail.include_vars.set_fact.package.file.template.service?                         (10/15, + phpmyadmin, apache)

The outliers are the roles with the rarest symbols (npm, pip, lineinfile). An LLM sees: "10/15 roles follow the core. Only those that need specific tools add extras."

Tests

85 tests in total (+6 new for min_coverage), all green.

tobi added 1 commit 2026-07-01 13:16:31 +00:00

docs: add min_coverage to MCP tool + README, include core in output

ci/woodpecker/push/woodpecker Pipeline was successful

ci/woodpecker/pr/woodpecker Pipeline was successful

036a84cc76

tobi commented

2026-07-01 13:16:34 +00:00

Author

Owner

Latest commit: MCP tool + README updated.

- infer_best_grammar now has a min_coverage parameter (default 1.0 = disabled)
- The core + outlier analysis appears in the response only when the agent sets it explicitly (min_coverage=0.8)
- The README documents the parameter and the Algorithm Selection Guide

tobi added 1 commit 2026-07-01 13:18:34 +00:00

docs: update README and SHOWCASE for kOREInference + core/outlier analysis

ci/woodpecker/push/woodpecker Pipeline was successful

ci/woodpecker/pr/woodpecker Pipeline was successful

0886e5f3bc

tobi added 1 commit 2026-07-01 13:21:57 +00:00

docs: fix Go lint description (both optional), format outliers in SHOWCASE usage

ci/woodpecker/push/woodpecker Pipeline was successful

ci/woodpecker/pr/woodpecker Pipeline was successful