feat(agents): generator-as-protocol-adapter and tool coercion by Hartorn · Pull Request #2279 · Giskard-AI/giskard-oss

Hartorn · 2026-02-26T17:08:40Z

Known limitation

The serialize_tools, serialize_messages, and deserialize_response methods on BaseGenerator are currently "advisory" — they exist as override points but are called from inside LiteLLMGenerator._complete_once, not orchestrated by the base class. A new provider subclass could skip them entirely with no enforcement.

Next step: Restructure so BaseGenerator owns the serialize → call → deserialize pipeline, with the subclass implementing only a _send(wire_messages, wire_tools, params) method. This also ties into replacing the mixin-based retry/rate-limiting with a composable middleware pipeline. Tracked as a follow-up.

Add serialize_tools, serialize_messages, deserialize_response to BaseGenerator so provider subclasses own all wire-format translation. LiteLLMGenerator now delegates through these methods instead of calling to_litellm/from_litellm directly.

Tool.run() gains input coercion (TypeAdapter + _params_model) and output serialization (_return_adapter), returning str to eliminate double-quoting in workflow. Workflow _run_tools simplified accordingly.

Description

Related Issue

Type of Change

📚 Examples / docs / tutorials / dependencies update
🔧 Bug fix (non-breaking change which fixes an issue)
🥂 Improvement (non-breaking change which improves an existing feature)
🚀 New feature (non-breaking change which adds functionality)
💥 Breaking change (fix or feature that would cause existing functionality to change)
🔐 Security fix

Checklist

I've read the CODE_OF_CONDUCT.md document.
I've read the CONTRIBUTING.md guide.
I've written tests for all new methods and classes that I created.
I've written the docstring in Google format for all the methods and classes that I used.
I've updated the pdm.lock running pdm update-lock (only applicable when pyproject.toml has been
modified)

Add serialize_tools, serialize_messages, deserialize_response to BaseGenerator so provider subclasses own all wire-format translation. LiteLLMGenerator now delegates through these methods instead of calling to_litellm/from_litellm directly. Tool.run() gains input coercion (TypeAdapter + _params_model) and output serialization (_return_adapter), returning str to eliminate double-quoting in workflow. Workflow _run_tools simplified accordingly.

gemini-code-assist · 2026-02-26T17:09:10Z

Summary of Changes

Hello @Hartorn, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request introduces a significant architectural improvement by formalizing the role of generators as protocol adapters, centralizing the logic for translating between internal data structures and external LLM provider formats. It also enhances the Tool class with automatic input coercion and output serialization, making tool execution more robust and predictable. These changes streamline the integration of new LLM providers and simplify the development of agent workflows by abstracting away low-level data transformations.

Highlights

Generator as Protocol Adapter: The BaseGenerator class has been refactored to act as a protocol adapter, centralizing the responsibility for converting internal Message and Tool objects to and from provider-specific wire formats. This is achieved through new serialize_tools, serialize_messages, and deserialize_response methods, which subclasses can override.
Tool Input Coercion and Output Serialization: The Tool.run() method now includes robust input coercion, validating arguments against a Pydantic model derived from the tool's signature. It also performs output serialization, ensuring that the result is always a JSON-safe string, handling complex types like BaseModel, datetime, and UUID automatically.
LiteLLM Integration Update: The LiteLLMGenerator has been updated to delegate its serialization and deserialization tasks to the new methods on BaseGenerator, removing direct calls to Message.to_litellm() and Message.from_litellm().
Simplified Workflow: The _run_tools method in the workflow has been simplified, as Tool.run() now directly returns the serialized string content, eliminating the need for manual json.dumps calls within the workflow.
Architectural Rules Document: A new markdown document, generator-adapter.mdc, has been added to formally define the architectural principles and rules for generator-based protocol adaptation and tool handling.
Comprehensive Testing: Extensive new test cases have been added to validate the new generator translation methods and the input coercion and output serialization capabilities of the Tool.run() method.

🧠 New Feature in Public Preview: You can now enable Memory to help Gemini Code Assist learn from your team's feedback. This makes future code reviews more consistent and personalized to your project's style. Click here to enable Memory in your admin console.

Changelog

libs/giskard-agents/.cursor/rules/generator-adapter.mdc
- Added new architectural rules document for generator protocol adaptation.
libs/giskard-agents/src/giskard/agents/generators/base.py
- Updated BaseGenerator docstring to describe its role as a protocol adapter.
- Added default serialize_tools, serialize_messages, and deserialize_response methods to BaseGenerator.
libs/giskard-agents/src/giskard/agents/generators/litellm_generator.py
- Removed unused LiteLLMMessage import.
- Updated LiteLLMGenerator docstring.
- Modified _complete_once to use BaseGenerator's new serialization and deserialization methods.
libs/giskard-agents/src/giskard/agents/tools/tool.py
- Imported json, PrivateAttr, and TypeAdapter.
- Added _params_model and _return_adapter private attributes to Tool for coercion and serialization.
- Modified from_callable to capture return annotations and parameter models for TypeAdapter.
- Changed Tool.run() return type to str.
- Updated Tool.run() docstring to reflect input coercion and output serialization.
- Implemented input coercion using Pydantic's model_validate in Tool.run().
- Implemented output serialization using TypeAdapter.dump_python or json.dumps in Tool.run().
libs/giskard-agents/src/giskard/agents/workflow.py
- Modified _run_tools to directly use the serialized string output from tool.run().
libs/giskard-agents/tests/test_generator.py
- Added imports for json, Any, BaseGenerator, Tool, tool, Field, PrivateAttr.
- Added SpyGenerator for testing generator translation method calls.
- Added tests for test_translation_methods_called_during_workflow, test_custom_serialize_messages_override, and test_custom_serialize_tools_override.
libs/giskard-agents/tests/test_tools.py
- Added imports for json, datetime, timezone, UUID, BaseModel.
- Added CityWeather, Address, Person, TimestampedRecord Pydantic models for testing.
- Added tests for Tool.run() output serialization for various types, including BaseModel.
- Added tests for Tool.run() input coercion for BaseModel, optional BaseModel, and lists of BaseModel.
- Added tests for primitive type handling in Tool.run().

Activity

The pull request is currently marked as ON-HOLD, indicating it's a foundational feature that might be awaiting further architectural decisions or related changes.
The author has clearly outlined known limitations and proposed next steps for future restructuring, suggesting a phased approach to these architectural improvements.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request introduces a significant and well-executed architectural refactoring by establishing the BaseGenerator as a protocol adapter. This change, along with the introduction of input coercion and output serialization in Tool.run(), greatly improves the separation of concerns and simplifies the workflow logic. The new architectural rules documented in generator-adapter.mdc are a fantastic addition for long-term maintainability.

I've found one issue related to error handling during argument validation in tools, which I've detailed in a specific comment. Once that is addressed, this will be an excellent contribution to the codebase.

…ware pipeline Replace WithRetryPolicy and WithRateLimiter mixins with a composable CompletionMiddleware pipeline on BaseGenerator. This decouples cross-cutting concerns (retry, rate-limiting) from the generator inheritance hierarchy, making them stackable and independently testable.

…d docs Update requires-python to 3.12, fix stale docstrings and types in base generator, align tests with new middleware API (remove RetryPolicy, with_retries, with_rate_limiter), and document the middleware pipeline in the README.

…o.sleep mock Patch asyncio.sleep instead of overriding _before_sleep to verify exponential backoff and max_delay capping without production hooks.

Narrow _retry_mw helper to LiteLLMRetryMiddleware and assert middleware round-trips correctly in test_chat_workflow_serialization.

…ne' into refactor/generator-as-protocol-adapter Made-with: Cursor # Conflicts: # libs/giskard-agents/src/giskard/agents/generators/litellm_generator.py

Move serialization/deserialization pipeline into BaseGenerator._complete so subclasses only implement _call_model with raw wire types. Remove provider-coupled methods (Message.to_litellm, Message.from_litellm, Tool.to_litellm_function) from domain objects. Extract FinishReason type alias for reuse. Made-with: Cursor

Hartorn · 2026-03-02T09:11:24Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces a significant and well-executed architectural refactoring, turning the BaseGenerator into a protocol adapter. This decouples the core agent logic from provider-specific data formats. The introduction of serialize_tools, serialize_messages, and deserialize_response provides clear extension points for new providers. Additionally, the Tool.run method is enhanced with input coercion and output serialization, ensuring that tools always return a string, which simplifies the workflow logic. The changes are supported by a comprehensive set of new and updated tests. My review found one potential issue with error handling during tool argument validation. Overall, this is a high-quality contribution that greatly improves the extensibility and robustness of the agent framework.

libs/giskard-agents/src/giskard/agents/tools/tool.py

…nce API Extract RetryPolicy config model and give BaseGenerator dedicated retry_policy and rate_limiter fields with guaranteed middleware ordering. Adds with_retries() and with_rate_limiter() convenience methods. Renames middleware -> middlewares for codebase consistency.

…rd-AI/giskard-oss into refactor/generator-as-protocol-adapter

Hartorn · 2026-03-09T13:29:41Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces a significant and valuable architectural refactoring by establishing the generator as a protocol adapter. The introduction of serialize_tools, serialize_messages, and deserialize_response methods in BaseGenerator effectively decouples the core agent logic from provider-specific data formats. The changes to Tool.run() to include input coercion and automatic output serialization are also excellent improvements, simplifying tool implementation and workflow logic. The updates are consistently applied across the codebase, including extensive and well-written tests for the new functionality. I have one suggestion to simplify the argument preparation logic in Tool.run() for better clarity and to remove redundancy. Overall, this is a great step forward for the agent's architecture.

libs/giskard-agents/src/giskard/agents/tools/tool.py

…el stubs - Move Pydantic model_validate into the try/except so ValidationError from malformed LLM arguments is caught by the tool's catch handler. - Re-inject RunContext after coercion since validated dict drops non-model keys. - Resolve merge conflicts: union imports from both branches in base.py, litellm_generator.py, and test_generator_retry.py. - Add _call_model stub to MockGenerator in giskard-checks tests to satisfy the new abstract method on BaseGenerator. Made-with: Cursor

Hartorn · 2026-03-09T13:52:08Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces a significant and beneficial architectural refactoring, positioning the BaseGenerator as a protocol adapter. This change centralizes provider-specific translation logic, improving modularity and maintainability. The addition of input coercion and output serialization for tools is also a great enhancement, making tool execution more robust and predictable. The implementation is solid and well-tested for the most part. My feedback focuses on a few areas: ensuring type safety for FinishReason, a minor refactoring opportunity in tool argument handling, and, most importantly, updating several test mocks in the giskard-checks library to align with the new generator architecture, which is crucial for maintaining test effectiveness.

libs/giskard-agents/src/giskard/agents/generators/_types.py

libs/giskard-agents/src/giskard/agents/tools/tool.py

…r/generator-as-protocol-adapter Made-with: Cursor # Conflicts: # libs/giskard-agents/src/giskard/agents/generators/_types.py # libs/giskard-agents/src/giskard/agents/generators/base.py # libs/giskard-agents/src/giskard/agents/generators/litellm_generator.py # libs/giskard-agents/src/giskard/agents/tools/tool.py # libs/giskard-agents/tests/test_generator_retry.py # libs/giskard-agents/tests/test_tools.py

mattbit

Great improvement on tools and removing the to_litellm methods which don't belong to the tools and chats. This stuff should be handled internally by the Generator implementation.

The "protocol adapter" for the serialization is not necessary, this serialize/deserialize clean up is specific to litellm and internal to the final BaseGenerator implementation.
We shouldn't impose a structure on how these transformations should be implemented (e.g. by message/by tools), as it will be different for every provider.
We should move what's needed in the LiteLLMGenerator (as internal helpers) and leave the BaseGenerator untouched.

I agree that in some cases having standardized serialization for chat completions could help, but right now it's not needed. It's also not necessarily compatible with different APIs. Especially we see providers APIs diverge from the chat completion standard, so it could be counterproductive to impose such abstraction.

libs/giskard-agents/src/giskard/agents/generators/base.py

…enerationParams.merge - Move serialize_tools, serialize_messages, deserialize_response from BaseGenerator into LiteLLMGenerator as private helpers - Simplify _call_model signature to (messages, params) -> (Message, FinishReason) - Add GenerationParams.merge() for param resolution instead of _resolve_params - Update _complete to use GenerationParams.merge() as template method - Use PEP 695 type alias for FinishReason - Update generator-adapter cursor rule to reflect new architecture - Adapt all test mocks to new _call_model contract Made-with: Cursor

@OverRide

- Add @OverRide to all test mock methods overriding BaseGenerator._call_model - Annotate unannotated fixture/test params (mock_response, persona, context, kwargs) - Remove unused generator fixture param from test_workflow_error_handling - Assign unused call results to _ to satisfy reportUnusedCallResult - Rename _calls/_call_count to public attrs in test-only mock generators Made-with: Cursor

mattbit

All good!

Hartorn marked this pull request as draft February 26, 2026 17:08

github-actions bot added the Scope: Agents label Feb 26, 2026

Hartorn self-assigned this Feb 26, 2026

gemini-code-assist bot reviewed Feb 26, 2026

View reviewed changes

Hartorn and others added 7 commits February 27, 2026 10:15

test(agents): restore sleep-time assertions in retry tests via asynci…

0e25fe8

…o.sleep mock Patch asyncio.sleep instead of overriding _before_sleep to verify exponential backoff and max_delay capping without production hooks.

test(agents): tighten serialization test assertions per review feedback

166601d

Narrow _retry_mw helper to LiteLLMRetryMiddleware and assert middleware round-trips correctly in test_chat_workflow_serialization.

Merge branch 'main' into feat/generator-middleware-pipeline

198bdb9

Merge remote-tracking branch 'origin/feat/generator-middleware-pipeli…

297665c

…ne' into refactor/generator-as-protocol-adapter Made-with: Cursor # Conflicts: # libs/giskard-agents/src/giskard/agents/generators/litellm_generator.py

Hartorn changed the base branch from main to feat/generator-middleware-pipeline March 2, 2026 09:10

gemini-code-assist bot reviewed Mar 2, 2026

View reviewed changes

libs/giskard-agents/src/giskard/agents/tools/tool.py Outdated Show resolved Hide resolved

mattbit and others added 4 commits March 3, 2026 18:59

Merge branch 'main' into feat/generator-middleware-pipeline

3c5c709

Merge branch 'main' into feat/generator-middleware-pipeline

5e69aed

Merge branch 'feat/generator-middleware-pipeline' of github.com:Giska…

191dab6

…rd-AI/giskard-oss into refactor/generator-as-protocol-adapter

github-actions bot added the Scope: Checks label Mar 9, 2026

Hartorn force-pushed the refactor/generator-as-protocol-adapter branch 2 times, most recently from 6d1e7d3 to d54d577 Compare March 9, 2026 13:28

gemini-code-assist bot reviewed Mar 9, 2026

View reviewed changes

libs/giskard-agents/src/giskard/agents/tools/tool.py Outdated Show resolved Hide resolved

Hartorn force-pushed the refactor/generator-as-protocol-adapter branch from d54d577 to 8278eea Compare March 9, 2026 13:50

gemini-code-assist bot reviewed Mar 9, 2026

View reviewed changes

libs/giskard-agents/src/giskard/agents/generators/_types.py Outdated Show resolved Hide resolved

libs/giskard-agents/src/giskard/agents/tools/tool.py Show resolved Hide resolved

Base automatically changed from feat/generator-middleware-pipeline to main March 11, 2026 08:47

Hartorn marked this pull request as ready for review March 11, 2026 09:19

Hartorn changed the title ~~[ON-HOLD] feat(agents): generator-as-protocol-adapter and tool coercion~~ feat(agents): generator-as-protocol-adapter and tool coercion Mar 11, 2026

mattbit requested changes Mar 11, 2026

View reviewed changes

libs/giskard-agents/src/giskard/agents/generators/base.py Outdated Show resolved Hide resolved

mattbit and others added 3 commits March 11, 2026 12:06

Merge branch 'main' into refactor/generator-as-protocol-adapter

4548a22

mattbit approved these changes Mar 11, 2026

View reviewed changes

mattbit merged commit c69fe1a into main Mar 11, 2026
23 checks passed

mattbit deleted the refactor/generator-as-protocol-adapter branch March 11, 2026 13:18

Uh oh!

Conversation

Hartorn commented Feb 26, 2026

Known limitation

Description

Related Issue

Type of Change

Checklist

Uh oh!

gemini-code-assist bot commented Feb 26, 2026

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Hartorn commented Mar 2, 2026

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Hartorn commented Mar 9, 2026

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Hartorn commented Mar 9, 2026

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

mattbit left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

mattbit left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

2 participants