Tagged: llm

1 post and 2 projects

Posts

How I Benchmark LLMs on AL Code

2026-01-16

An in-depth look at CentralGauge, an open source benchmark for evaluating LLM performance on AL code generation for Business Central, covering task design, scoring methodology, and cross-model comparison results.

alllmbenchmarkbusiness-centraldeveloper-tools

Projects

CentralGauge - AL Code Benchmark for LLMs

Active

An open source benchmark for evaluating LLM performance on AL code generation for Microsoft Dynamics 365 Business Central, with 56 tasks across three difficulty tiers, real compilation, and test execution.

alllmbenchmarkbusiness-centralai

GitHub → Live Site →

AL Train

Active

A fine-tuning pipeline that trains language models to write AL code, using corpus data enhanced with AI-generated descriptions.

albusiness-centralllmai

GitHub →