Develop and ship production AI agents, including orchestration frameworks and eval harnesses. Design and implement tool-use runtimes and multi-step workflows for agent systems. Evaluate and improve retrieval systems that agents utilize for enhanced performance.

Requirements

  • Experience in building in-house agent frameworks or cloud-agents runtime.
  • Expertise in eval infrastructure for agents, including offline and online systems.
  • Proficiency with tool-use, MCP, or orchestration layers.
  • Knowledge of memory, retrieval, or state-passing primitives for agents.