Tagged: llm

1 post and 2 projects

Posts

How I Benchmark LLMs on AL Code

An in-depth look at CentralGauge, an open source benchmark for evaluating LLM performance on AL code generation for Business Central, covering task design, scoring methodology, and cross-model comparison results.

alllmbenchmarkbusiness-centraldeveloper-tools

Projects

An open source benchmark for evaluating LLM performance on AL code generation for Microsoft Dynamics 365 Business Central, with 56 tasks across three difficulty tiers, real compilation, and test execution.

alllmbenchmarkbusiness-centralai

AL Train

Active

A fine-tuning pipeline that trains language models to write AL code, using corpus data enhanced with AI-generated descriptions.

albusiness-centralllmai