Skip to content

feat(sdk): support multiple MAQL candidates in metric skill evaluation#1665

Merged
zdenekmusil-gd merged 1 commit into
masterfrom
zmu/gdai-1831-metric-skill-multi-maql
Jun 17, 2026
Merged

feat(sdk): support multiple MAQL candidates in metric skill evaluation#1665
zdenekmusil-gd merged 1 commit into
masterfrom
zmu/gdai-1831-metric-skill-multi-maql

Conversation

@zdenekmusil-gd

Copy link
Copy Markdown
Contributor
  • Add _best_maql_match to compare actual MAQL against any candidate list
  • _execute_single_metric_run takes expected_outputs: list[dict] and uses _best_maql_match; primary candidate drives simulated user responses
  • run_agentic_metric_skill and evaluate_agentic_metric_skill accept expected_output: dict | list — single dict is wrapped automatically
  • Error message on failure now lists all candidates

JIRA: GDAI-1831
risk: nonprod

- Add `_best_maql_match` to compare actual MAQL against any candidate list
- `_execute_single_metric_run` takes `expected_outputs: list[dict]` and uses
  `_best_maql_match`; primary candidate drives simulated user responses
- `run_agentic_metric_skill` and `evaluate_agentic_metric_skill` accept
  `expected_output: dict | list` — single dict is wrapped automatically
- Error message on failure now lists all candidates

JIRA: GDAI-1831
risk: nonprod
@codecov

codecov Bot commented Jun 17, 2026

Copy link
Copy Markdown

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 79.21%. Comparing base (3236e41) to head (19d3936).

Additional details and impacted files
@@           Coverage Diff           @@
##           master    #1665   +/-   ##
=======================================
  Coverage   79.21%   79.21%           
=======================================
  Files         232      232           
  Lines       15809    15809           
=======================================
  Hits        12523    12523           
  Misses       3286     3286           

☔ View full report in Codecov by Harness.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

@zdenekmusil-gd zdenekmusil-gd merged commit 653f5bc into master Jun 17, 2026
13 checks passed
@zdenekmusil-gd zdenekmusil-gd deleted the zmu/gdai-1831-metric-skill-multi-maql branch June 17, 2026 12:57
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants