<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/"><channel><title>Agentic AI on blog.yfzhou</title><link>https://blog.yfzhou.fyi/tags/agentic-ai/</link><description>Recent content in Agentic AI on blog.yfzhou</description><generator>Hugo -- 0.140.2</generator><language>en-us</language><lastBuildDate>Mon, 18 May 2026 11:00:00 +0800</lastBuildDate><atom:link href="https://blog.yfzhou.fyi/tags/agentic-ai/index.xml" rel="self" type="application/rss+xml"/><item><title>What we learned building sandbox for document agent</title><link>https://blog.yfzhou.fyi/posts/doc-sandbox/</link><pubDate>Mon, 18 May 2026 11:00:00 +0800</pubDate><guid>https://blog.yfzhou.fyi/posts/doc-sandbox/</guid><description>&lt;p>&lt;em>Cross-posted from the &lt;a href="https://raycaster.ai/blog/sandbox-for-document-agent">Raycaster blog&lt;/a>; I&amp;rsquo;ve spent the last several months building this, and here is my take.&lt;/em>&lt;/p>
&lt;p>2025 brought us the new idiom for building AI (beginning with Manus and Claude Code): give it tools to operate a computer. This is a break from the past default approach represented by ChatGPT, which is LLM + a menu of bespoke API connections to plug in to various systems of record.&lt;/p>
&lt;p>Our first attempt at a document agent was to ingest documents, parse them into plaintext pages, expose search/read/write tools, and let the LLM operate over virtual directories of artifacts and pages backed by a SQL database.&lt;/p></description></item><item><title>Test-Driven Development with an LLM for Fun and Profit</title><link>https://blog.yfzhou.fyi/posts/tdd-llm/</link><pubDate>Thu, 16 Jan 2025 23:03:30 +0800</pubDate><guid>https://blog.yfzhou.fyi/posts/tdd-llm/</guid><description>&lt;p>Welcome to the very first post in a new blog! Here I will discuss software development, SRE work, and other fun stuff. Sometimes an idea is just too good to pass up. I hope this blog will motivate me to turn sparks and little pieces into general knowledge in writing the words down.&lt;/p>
&lt;p>The other day I was discussing &lt;a href="https://www.tabbyml.com">Tabby&lt;/a> with a coworker. We talked about whether we should consider AI-autocompleted code harmful and ditch everyone&amp;rsquo;s newfound habit due to LLM&amp;rsquo;s inherent unreliability and their tendency toward spaghetti code, throwing traditional software engineering principles like DRY out the window. I disagreed: what if we could have a framework that integrates AI development tooling while also making everything better and more reliable instead? This instantly reminds me of Test-Driven Development, or TDD, which I think is great when combined with the use of a Large Language Model.&lt;/p></description></item></channel></rss>