Meet Harness-1: A 20B Retrieval Subagent Trained With Reinforcement Learning Inside a Stateful Search Harness on gpt-oss-20b
MarkTechPost
UIUC and Chroma's Harness-1 is a 20B retrieval subagent trained with reinforcement learning inside a stateful search harness.
The harness maintains the bookkeeping — candidate pool, importance-tagged curated set, evidence graph, verification records — while the policy decides what to search, curate, verify, and when to stop. It reaches 0.
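The split between harness bookkeeping and policy decisions can be illustrated with a small sketch. Everything here is hypothetical: the `SearchHarness` class, its method names, the toy `CORPUS`, and the scripted policy are illustrative stand-ins, not Harness-1's actual interface or training setup. In the real system the policy is a learned model; here a fixed script plays that role so the example runs on its own.

```python
from dataclasses import dataclass, field

# Toy corpus standing in for a real retrieval backend (hypothetical data).
CORPUS = {
    "d1": "harness keeps a candidate pool of retrieved documents",
    "d2": "the policy decides when to stop searching",
    "d3": "unrelated note about cooking pasta",
}

IMPORTANCE_RANK = {"high": 0, "medium": 1, "low": 2}


@dataclass
class SearchHarness:
    """Hypothetical harness: owns all state, exposes actions to a policy."""
    candidate_pool: dict = field(default_factory=dict)   # doc_id -> text
    curated: dict = field(default_factory=dict)          # doc_id -> importance tag
    evidence_graph: dict = field(default_factory=dict)   # doc_id -> linked doc_ids
    verifications: list = field(default_factory=list)    # (doc_id, claim, supported)
    done: bool = False

    def search(self, query):
        # Naive keyword match; a real harness would call a search index.
        terms = query.lower().split()
        hits = [d for d, text in CORPUS.items()
                if any(t in text.split() for t in terms)]
        for d in hits:
            self.candidate_pool[d] = CORPUS[d]
        return hits

    def curate(self, doc_id, importance):
        # Only documents already in the candidate pool can be curated.
        if doc_id not in self.candidate_pool:
            raise KeyError(f"{doc_id} is not in the candidate pool")
        self.curated[doc_id] = importance

    def link(self, a, b):
        # Record an undirected edge in the evidence graph.
        self.evidence_graph.setdefault(a, set()).add(b)
        self.evidence_graph.setdefault(b, set()).add(a)

    def verify(self, doc_id, claim):
        # Substring check as a placeholder for a real verification step.
        supported = claim.lower() in self.candidate_pool.get(doc_id, "")
        self.verifications.append((doc_id, claim, supported))
        return supported

    def stop(self):
        # End the episode and return curated ids, most important first.
        self.done = True
        return sorted(self.curated, key=lambda d: IMPORTANCE_RANK[self.curated[d]])


def scripted_policy(h):
    """Stand-in for the learned policy: chooses actions, holds no state."""
    h.search("candidate pool")
    h.curate("d1", "high")
    h.search("policy stop")
    h.curate("d2", "medium")
    h.link("d1", "d2")
    h.verify("d1", "candidate pool")
    return h.stop()


harness = SearchHarness()
print(scripted_policy(harness))
```

The point of the design, as sketched, is that the policy never stores anything itself: every search result, importance tag, graph edge, and verification outcome lives in the harness, so the policy's job reduces to choosing the next action.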
This is a summary. For the full story, read the original article at MarkTechPost.