Releases: dylibso/mcpx-eval

v0.4.3

21 May 23:08

v0.4.2

21 May 02:57

Full Changelog: v0.4.1...v0.4.2

v0.4.1

12 May 21:39

Full Changelog: v0.4.0...v0.4.1

v0.4.0

08 May 22:23
06bb428

What's Changed

  • refactor: use MCP protocol to access mcp.run via SSE/stdio by @zshipko in #14

Full Changelog: v0.3.0...v0.4.0

v0.3.0

16 Apr 18:47

What's Changed

New Contributors

Full Changelog: v0.2.1...v0.3.0

v0.2.1

27 Mar 21:57
d6035d0

What's Changed

  • cleanup: more tests, improve output of multiple results by @zshipko in #8

Full Changelog: v0.2.0...v0.2.1

v0.2.0

20 Mar 18:05
cc6df68

What's Changed

  • chore: update to mcp-run client with oauth support by @zshipko in #7

Full Changelog: v0.1.4...v0.2.0

v0.1.4

20 Mar 00:19
e43eb93

What's Changed

  • fix: improve separation of judge and test profiles by @zshipko in #6

Full Changelog: v0.1.3...v0.1.4

v0.1.3

19 Mar 23:47
4c715ab

What's Changed

Full Changelog: v0.1.2...v0.1.3

v0.1.2

19 Mar 19:33
4452070

What's Changed

  • fix: use judge model for judge analysis by @zshipko in #3

Full Changelog: v0.1.1...v0.1.2