grammar-inference-engine/bex
tobjend 0e2aec582b Grammar inference engine: CRX + iDRegEx ensemble with MDL scoring, MCP server, showcase, and blog post
- Ensemble inference (infer_ensemble) runs both CRX and iDRegEx, picks best by MDL
- CRX: CRX algorithm for wide coverage (accepts all sequences, large vocabulary)
- iDRegEx: iDRegEx for minimal core grammar (tightest common pattern)
- MDL scoring: fixed model_cost to count alphabet symbol occurrences, fixed dispatch order in _count_words_fast
- Fixed _match_tokens: rewritten as _match_possible with proper backtracking
- Fixed _parse_parts disjunction: children use _parse_flat_symbol to avoid dot-splitting
- MCP server: infer_best_grammar and infer_grammar tools
- Added prefer parameter (crx/idregex) to skip ensemble
- 28 passing tests
- SHOWCASE.md with Geerlingguy Galaxy demonstration
- blog_post.md with full technical deep-dive
2026-07-01 09:51:41 +02:00
..
__init__.py Grammar inference engine: CRX + iDRegEx ensemble with MDL scoring, MCP server, showcase, and blog post 2026-07-01 09:51:41 +02:00
__main__.py Initial commit: BEX-based grammar inference engine 2026-07-01 08:01:16 +02:00
automaton.py Initial commit: BEX-based grammar inference engine 2026-07-01 08:01:16 +02:00
baum_welch.py Initial commit: BEX-based grammar inference engine 2026-07-01 08:01:16 +02:00
cli.py Initial commit: BEX-based grammar inference engine 2026-07-01 08:01:16 +02:00
crx.py Initial commit: BEX-based grammar inference engine 2026-07-01 08:01:16 +02:00
ensemble.py Grammar inference engine: CRX + iDRegEx ensemble with MDL scoring, MCP server, showcase, and blog post 2026-07-01 09:51:41 +02:00
expr.py Initial commit: BEX-based grammar inference engine 2026-07-01 08:01:16 +02:00
idregex.py Initial commit: BEX-based grammar inference engine 2026-07-01 08:01:16 +02:00
ikoa.py Initial commit: BEX-based grammar inference engine 2026-07-01 08:01:16 +02:00
ilocal.py Initial commit: BEX-based grammar inference engine 2026-07-01 08:01:16 +02:00
koa.py Initial commit: BEX-based grammar inference engine 2026-07-01 08:01:16 +02:00
kore.py Initial commit: BEX-based grammar inference engine 2026-07-01 08:01:16 +02:00
marking.py Initial commit: BEX-based grammar inference engine 2026-07-01 08:01:16 +02:00
mcp_server.py Grammar inference engine: CRX + iDRegEx ensemble with MDL scoring, MCP server, showcase, and blog post 2026-07-01 09:51:41 +02:00
mdl.py Grammar inference engine: CRX + iDRegEx ensemble with MDL scoring, MCP server, showcase, and blog post 2026-07-01 09:51:41 +02:00
pta.py Initial commit: BEX-based grammar inference engine 2026-07-01 08:01:16 +02:00
repair.py Initial commit: BEX-based grammar inference engine 2026-07-01 08:01:16 +02:00
role_grammar.py Initial commit: BEX-based grammar inference engine 2026-07-01 08:01:16 +02:00
rwr0.py Initial commit: BEX-based grammar inference engine 2026-07-01 08:01:16 +02:00
rwrsq.py Initial commit: BEX-based grammar inference engine 2026-07-01 08:01:16 +02:00
shrink.py Initial commit: BEX-based grammar inference engine 2026-07-01 08:01:16 +02:00
soa.py Initial commit: BEX-based grammar inference engine 2026-07-01 08:01:16 +02:00
template.py Initial commit: BEX-based grammar inference engine 2026-07-01 08:01:16 +02:00
tokenizer.py Initial commit: BEX-based grammar inference engine 2026-07-01 08:01:16 +02:00
twotinf.py Initial commit: BEX-based grammar inference engine 2026-07-01 08:01:16 +02:00
yaml_to_seq.py Initial commit: BEX-based grammar inference engine 2026-07-01 08:01:16 +02:00