<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[Liberty]]></title><description><![CDATA[James' Blog!

(Views expressed here are not necessarily the views of Manifold Markets.)]]></description><link>https://jamesgrugett.com</link><image><url>https://substackcdn.com/image/fetch/$s_!ICn0!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e7015b9-384b-414d-929c-40e59c893c8f_1024x1024.png</url><title>Liberty</title><link>https://jamesgrugett.com</link></image><generator>Substack</generator><lastBuildDate>Sun, 12 Apr 2026 14:00:35 GMT</lastBuildDate><atom:link href="https://jamesgrugett.com/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[James Grugett]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[jahooma@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[jahooma@substack.com]]></itunes:email><itunes:name><![CDATA[James]]></itunes:name></itunes:owner><itunes:author><![CDATA[James]]></itunes:author><googleplay:owner><![CDATA[jahooma@substack.com]]></googleplay:owner><googleplay:email><![CDATA[jahooma@substack.com]]></googleplay:email><googleplay:author><![CDATA[James]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[What I learned building an AI coding agent for a year]]></title><description><![CDATA[We thought we'd be the best within months. 
Here's what went wrong &#8212; and why I'm more excited than ever.]]></description><link>https://jamesgrugett.com/p/what-i-learned-building-an-ai-coding</link><guid isPermaLink="false">https://jamesgrugett.com/p/what-i-learned-building-an-ai-coding</guid><dc:creator><![CDATA[James]]></dc:creator><pubDate>Sat, 05 Jul 2025 06:57:21 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!Kqnr!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b5dae55-efb0-45ac-a6ee-48079e8d2302_1024x862.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Kqnr!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b5dae55-efb0-45ac-a6ee-48079e8d2302_1024x862.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Kqnr!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b5dae55-efb0-45ac-a6ee-48079e8d2302_1024x862.png 424w, https://substackcdn.com/image/fetch/$s_!Kqnr!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b5dae55-efb0-45ac-a6ee-48079e8d2302_1024x862.png 848w, https://substackcdn.com/image/fetch/$s_!Kqnr!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b5dae55-efb0-45ac-a6ee-48079e8d2302_1024x862.png 1272w, https://substackcdn.com/image/fetch/$s_!Kqnr!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b5dae55-efb0-45ac-a6ee-48079e8d2302_1024x862.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!Kqnr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b5dae55-efb0-45ac-a6ee-48079e8d2302_1024x862.png" width="1024" height="862" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0b5dae55-efb0-45ac-a6ee-48079e8d2302_1024x862.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:862,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1149439,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://jamesgrugett.com/i/167566096?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b5dae55-efb0-45ac-a6ee-48079e8d2302_1024x862.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Kqnr!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b5dae55-efb0-45ac-a6ee-48079e8d2302_1024x862.png 424w, https://substackcdn.com/image/fetch/$s_!Kqnr!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b5dae55-efb0-45ac-a6ee-48079e8d2302_1024x862.png 848w, https://substackcdn.com/image/fetch/$s_!Kqnr!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b5dae55-efb0-45ac-a6ee-48079e8d2302_1024x862.png 1272w, https://substackcdn.com/image/fetch/$s_!Kqnr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b5dae55-efb0-45ac-a6ee-48079e8d2302_1024x862.png 1456w" 
sizes="100vw" fetchpriority="high"></picture></div></a></figure></div><p>It&#8217;s been a full year of trying to build the best coding agent!</p><p>I didn&#8217;t know that my world was about to change last July 4th, at a hackathon where I first prototyped a CLI coding tool that became <a href="https://codebuff.com">Codebuff</a>. 
What a ride it&#8217;s been!</p><p>From leaving <a href="https://manifold.markets/">Manifold</a>, to doing YC F24, to hiring, to competing with Claude Code, all the while averaging ~70 hours weekly by working most weekends &#8212; it&#8217;s been a lot!</p><p>We may not have won the first round, but I&#8217;m more fired up and excited for the future than ever.</p><h1>Our bet</h1><p>We got so many things right initially:</p><ul><li><p><strong>CLI first.</strong> Scoping down to just a command-line tool helped us focus on the core of a coding agent.</p></li><li><p><strong>Inject more context.</strong> Immediately reading a dozen files related to the user prompt gave us a huge advantage over competitors.</p></li><li><p><strong>No permission checks.</strong> We were in full YOLO mode from the very beginning, which was positively heretical then.</p></li><li><p><strong>Premium tool.</strong> It makes sense to spend more when developer salaries are the alternative.</p></li><li><p><strong>Knowledge files.</strong> We came up with the idea of knowledge.md files that are checked in to your codebase. Codebuff would automatically update these files as it learned.</p></li></ul><p>Most of these are standard or becoming standard in coding agents today!</p><h1>What didn&#8217;t work out</h1><p>For the first 10 months, we always thought we were weeks away from breaking out and growing exponentially. During YC, we even did grow exponentially, to $5k MRR.</p><p>We regularly got people saying it was the best coding agent. But it wasn&#8217;t always reliable.</p><p>Our file editing strategy was flaky for months, much worse than Cursor&#8217;s, which used a custom model to rewrite files.</p><p>Even after we adopted <a href="https://relace.ai">Relace</a>&#8217;s fast rewriter model, our product still had a long tail of issues that made ~5-10% of tasks fail. 
Some of these issues just take time to isolate and fix, but we could have prioritized better.</p><p>Without reliability, we could not have high retention. Without high retention, Codebuff could not grow.</p><h1>What we should have done</h1><p>Here&#8217;s what I&#8217;d do differently after an extensive retrospective.</p><h4><strong>Build end-to-end evals and run them nightly</strong></h4><p>This would have given us regular, quantified feedback on how Codebuff performs as a coding agent. It would have helped solve reliability issues AND allowed us to test hypotheses on how to further improve our product.</p><p>Because we did not have this, we spent way too much time manually testing Codebuff after every change or when evaluating whether to switch models.</p><h4><strong>Cut every feature that is not core</strong></h4><p>We thought we had scoped down a lot by sticking to the CLI, but we should have cut even more. Elon Musk was right when he said you must first &#8220;delete the part!&#8221;</p><p>Here are a few features we should have cut earlier:</p><ul><li><p>Magic detection of whether the input is supposed to be a terminal command or prompt</p></li><li><p>Automatic knowledge file updates, which we tweaked for months before largely scrapping</p></li><li><p>A pseudo-terminal library (node-pty) for color output &amp; aliases, which was recently named our biggest black-hole feature ever</p></li></ul><h4><strong>Get the whole team improving the core product</strong></h4><p>I took on too much of the core system and left my cofounder to deal with other tasks that may not have been as impactful. It helps focus and morale to get all hands in the game.</p><h4><strong>Live in the future</strong></h4><p>Never stop thinking about how to disrupt your current product. What is the next thing? What experiments can we try today to make it work?</p><h4><strong>Monthly retrospectives</strong></h4><p>One bit of process that could have helped us achieve the above is monthly retrospective meetings. 
Schedule these on your calendar and set aside an hour for everyone to answer these questions and discuss them:</p><ul><li><p>What should we double down on?</p></li><li><p>What should we cut?</p></li><li><p>What should we explore next?</p></li></ul><h1>Next steps for Codebuff</h1><p>In the last couple months, we&#8217;ve done more reflection and exploration as competitors such as Claude Code have entered the market with similar ideas.</p><p>(Incidentally, I believe Claude Code succeeded in part by having a more focused bet: client-side only, search-replace file editing only, agentic-RAG only.)</p><p>We&#8217;ve been dreaming of the next thing, and now I&#8217;m confident we know what it is.</p><h4><strong>Our new multi-agent product is live!</strong></h4><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!I362!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e46e100-cd49-4e7a-ab4c-bc5143c1acf7_1098x280.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!I362!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e46e100-cd49-4e7a-ab4c-bc5143c1acf7_1098x280.png 424w, https://substackcdn.com/image/fetch/$s_!I362!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e46e100-cd49-4e7a-ab4c-bc5143c1acf7_1098x280.png 848w, https://substackcdn.com/image/fetch/$s_!I362!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e46e100-cd49-4e7a-ab4c-bc5143c1acf7_1098x280.png 1272w, 
https://substackcdn.com/image/fetch/$s_!I362!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e46e100-cd49-4e7a-ab4c-bc5143c1acf7_1098x280.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!I362!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e46e100-cd49-4e7a-ab4c-bc5143c1acf7_1098x280.png" width="1098" height="280" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5e46e100-cd49-4e7a-ab4c-bc5143c1acf7_1098x280.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:280,&quot;width&quot;:1098,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:73672,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://jamesgrugett.com/i/167566096?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e46e100-cd49-4e7a-ab4c-bc5143c1acf7_1098x280.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!I362!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e46e100-cd49-4e7a-ab4c-bc5143c1acf7_1098x280.png 424w, https://substackcdn.com/image/fetch/$s_!I362!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e46e100-cd49-4e7a-ab4c-bc5143c1acf7_1098x280.png 848w, https://substackcdn.com/image/fetch/$s_!I362!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e46e100-cd49-4e7a-ab4c-bc5143c1acf7_1098x280.png 
1272w, https://substackcdn.com/image/fetch/$s_!I362!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e46e100-cd49-4e7a-ab4c-bc5143c1acf7_1098x280.png 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">Our multi-agent framework, launched two days ago, is already improving our eval scores!</figcaption></figure></div><p>I&#8217;m happy to say that, as of two days ago, we&#8217;ve soft-launched our multi-agent architecture, where agents spawn other agents with different roles.</p><p>The reception so far has been overwhelmingly positive even 
though this is the very beginning. My cofounder says we&#8217;re just scratching the surface of what is possible in this framework: &#8220;it feels like an infinite world of possibilities,&#8221; he says.</p><p>I agree &#8212; <a href="https://codebuff.com">check it out!</a> And stay tuned for a bigger launch soon!</p><h1>Predictions for the next year</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://manifold.markets/JamesGrugett/ai-coding-agent-forecasts-from-my-b" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZGQU!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ebdf59b-187f-428b-88f5-0673d9a2d2d3_1516x1296.png 424w, https://substackcdn.com/image/fetch/$s_!ZGQU!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ebdf59b-187f-428b-88f5-0673d9a2d2d3_1516x1296.png 848w, https://substackcdn.com/image/fetch/$s_!ZGQU!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ebdf59b-187f-428b-88f5-0673d9a2d2d3_1516x1296.png 1272w, https://substackcdn.com/image/fetch/$s_!ZGQU!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ebdf59b-187f-428b-88f5-0673d9a2d2d3_1516x1296.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZGQU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ebdf59b-187f-428b-88f5-0673d9a2d2d3_1516x1296.png" width="1456" height="1245" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4ebdf59b-187f-428b-88f5-0673d9a2d2d3_1516x1296.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1245,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:290368,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://manifold.markets/JamesGrugett/ai-coding-agent-forecasts-from-my-b&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://jamesgrugett.com/i/167566096?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ebdf59b-187f-428b-88f5-0673d9a2d2d3_1516x1296.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ZGQU!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ebdf59b-187f-428b-88f5-0673d9a2d2d3_1516x1296.png 424w, https://substackcdn.com/image/fetch/$s_!ZGQU!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ebdf59b-187f-428b-88f5-0673d9a2d2d3_1516x1296.png 848w, https://substackcdn.com/image/fetch/$s_!ZGQU!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ebdf59b-187f-428b-88f5-0673d9a2d2d3_1516x1296.png 1272w, https://substackcdn.com/image/fetch/$s_!ZGQU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4ebdf59b-187f-428b-88f5-0673d9a2d2d3_1516x1296.png 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">Follow along <a href="https://manifold.markets/JamesGrugett/ai-coding-agent-forecasts-from-my-b">on Manifold</a> and place your bets!</figcaption></figure></div><p>If we got so many things right about what was coming for coding agents last year, can we do it again? 
I think so!</p><p>Here are my forecasts:</p><p><strong>The multi-agent paradigm will win.</strong> Our experience is that it&#8217;s possible to rapidly improve capabilities by delegating tasks to specialized agents.</p><p><strong>&#8220;Live learning&#8221; will be standard.</strong> Having the coding agent learn as it does tasks is extremely powerful.</p><p><strong>Coding agents will flip the initiative.</strong> We&#8217;ll see a shift from the user always initiating prompts, to the coding agent more often coming up with tasks for the user, e.g. to review key decisions.</p><p><strong>Coding agents will close the loop.</strong> Instead of just proposing code changes, they will also use the product itself to perform QA and evals, and commit the changes autonomously.</p><p><strong>Recursively improving coding agents will start working.</strong> And all the top coding agents will be a flavor of this.</p><p><strong>xAI will gain a sizable lead.</strong> The multi-polar era will recede as xAI gains a decisive lead in model quality and intelligence.</p><p><strong>The best model will not matter as much as today.</strong> Instead, it will be the network of agents that distinguishes the best product.</p><h1>It&#8217;s been a blast</h1><p>Thanks for reading, and cheers to another year of:</p><p>Big ideas, grinding, new employees, office snacks, customers that want to acquire us, offsites in Tokyo, afternoon breaks for running or basketball, and late night coding sessions.</p><p>May the best coding agent win!</p><p>James</p><p><strong>P.S. Come help us build the world&#8217;s best coding agent!</strong></p><p>You can join as a founding engineer and possibly have a stake in the first 10 trillion dollar startup once agents rule the world. Email <a href="mailto:james@codebuff.com">james@codebuff.com</a>. 
We also offer referral bonuses!</p>]]></content:encoded></item><item><title><![CDATA[Taking on the giants — an AI coding startup on the brink of success]]></title><description><![CDATA[We're winning users off of Cursor and Windsurf. Why does it feel like we are losing?]]></description><link>https://jamesgrugett.com/p/taking-on-the-giants-an-ai-coding</link><guid isPermaLink="false">https://jamesgrugett.com/p/taking-on-the-giants-an-ai-coding</guid><dc:creator><![CDATA[James]]></dc:creator><pubDate>Mon, 31 Mar 2025 01:11:24 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!dWuj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc304c9c6-b93a-40f7-85f9-6334298d8661_1536x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Meet my codegen startup, <a href="https://codebuff.com">Codebuff</a>.</p><p>Every day, there&#8217;s a new AI launch and we feel further behind. Gemini 2.5 Pro. The new DeepSeek V3. Claude Code.</p><p>Every day, there&#8217;s a new bug report (or three or five). Our npm package is bricked, throwing an error on startup. A service we are using is down. 
We deleted a user&#8217;s code, and they didn&#8217;t have a backup &#8212; losing 7 hours of work.</p><p>But also, every day, there&#8217;s a new comment on our <a href="https://discord.gg/rpNfzEve">Discord</a>, on <a href="https://x.com/CodebuffAI">Twitter</a>, or on Bookface, saying they like our product best, it&#8217;s faster, it solved something another service couldn&#8217;t.</p><h1>The rollercoaster is real</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!dWuj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc304c9c6-b93a-40f7-85f9-6334298d8661_1536x1024.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!dWuj!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc304c9c6-b93a-40f7-85f9-6334298d8661_1536x1024.png 424w, https://substackcdn.com/image/fetch/$s_!dWuj!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc304c9c6-b93a-40f7-85f9-6334298d8661_1536x1024.png 848w, https://substackcdn.com/image/fetch/$s_!dWuj!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc304c9c6-b93a-40f7-85f9-6334298d8661_1536x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!dWuj!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc304c9c6-b93a-40f7-85f9-6334298d8661_1536x1024.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!dWuj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc304c9c6-b93a-40f7-85f9-6334298d8661_1536x1024.png" width="576" height="384.13186813186815" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c304c9c6-b93a-40f7-85f9-6334298d8661_1536x1024.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:971,&quot;width&quot;:1456,&quot;resizeWidth&quot;:576,&quot;bytes&quot;:3816583,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://jamesgrugett.com/i/160216227?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc304c9c6-b93a-40f7-85f9-6334298d8661_1536x1024.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!dWuj!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc304c9c6-b93a-40f7-85f9-6334298d8661_1536x1024.png 424w, https://substackcdn.com/image/fetch/$s_!dWuj!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc304c9c6-b93a-40f7-85f9-6334298d8661_1536x1024.png 848w, https://substackcdn.com/image/fetch/$s_!dWuj!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc304c9c6-b93a-40f7-85f9-6334298d8661_1536x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!dWuj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc304c9c6-b93a-40f7-85f9-6334298d8661_1536x1024.png 
1456w" sizes="100vw" fetchpriority="high"></picture></div></a><figcaption class="image-caption">This is us</figcaption></figure></div><p>Our company is 7 months old. We are three people. We have &gt;100 paying subscribers, but are struggling to make the product reliable enough to scale further.</p><p>It turns out that making something &#8220;just work&#8221; means fixing 1,000 papercuts, one at a time. 
That means debugging websocket connections, staring at logs to understand which of 10 steps went wrong in applying a file edit, and responding to angry users to get more insight on what failed this time.</p><p>(And if you&#8217;re me, you might need to deploy a hotfix for your hotfix for your hotfix haha.)</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!9bBp!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e707398-7540-4499-bd25-5fa00b30a125_1224x650.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!9bBp!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e707398-7540-4499-bd25-5fa00b30a125_1224x650.png 424w, https://substackcdn.com/image/fetch/$s_!9bBp!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e707398-7540-4499-bd25-5fa00b30a125_1224x650.png 848w, https://substackcdn.com/image/fetch/$s_!9bBp!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e707398-7540-4499-bd25-5fa00b30a125_1224x650.png 1272w, https://substackcdn.com/image/fetch/$s_!9bBp!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e707398-7540-4499-bd25-5fa00b30a125_1224x650.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!9bBp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e707398-7540-4499-bd25-5fa00b30a125_1224x650.png" width="604" height="320.7516339869281" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6e707398-7540-4499-bd25-5fa00b30a125_1224x650.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:650,&quot;width&quot;:1224,&quot;resizeWidth&quot;:604,&quot;bytes&quot;:130039,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://jamesgrugett.com/i/160216227?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e707398-7540-4499-bd25-5fa00b30a125_1224x650.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!9bBp!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e707398-7540-4499-bd25-5fa00b30a125_1224x650.png 424w, https://substackcdn.com/image/fetch/$s_!9bBp!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e707398-7540-4499-bd25-5fa00b30a125_1224x650.png 848w, https://substackcdn.com/image/fetch/$s_!9bBp!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e707398-7540-4499-bd25-5fa00b30a125_1224x650.png 1272w, https://substackcdn.com/image/fetch/$s_!9bBp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e707398-7540-4499-bd25-5fa00b30a125_1224x650.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><h1>Cursor and/or the labs will crush you</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!TCWD!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4241c550-3d8e-4470-ab10-edc20ddf73d0_2616x1632.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!TCWD!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4241c550-3d8e-4470-ab10-edc20ddf73d0_2616x1632.png 424w, 
https://substackcdn.com/image/fetch/$s_!TCWD!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4241c550-3d8e-4470-ab10-edc20ddf73d0_2616x1632.png 848w, https://substackcdn.com/image/fetch/$s_!TCWD!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4241c550-3d8e-4470-ab10-edc20ddf73d0_2616x1632.png 1272w, https://substackcdn.com/image/fetch/$s_!TCWD!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4241c550-3d8e-4470-ab10-edc20ddf73d0_2616x1632.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!TCWD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4241c550-3d8e-4470-ab10-edc20ddf73d0_2616x1632.png" width="1456" height="908" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4241c550-3d8e-4470-ab10-edc20ddf73d0_2616x1632.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:908,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:4215050,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://jamesgrugett.com/i/160216227?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4241c550-3d8e-4470-ab10-edc20ddf73d0_2616x1632.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" 
srcset="https://substackcdn.com/image/fetch/$s_!TCWD!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4241c550-3d8e-4470-ab10-edc20ddf73d0_2616x1632.png 424w, https://substackcdn.com/image/fetch/$s_!TCWD!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4241c550-3d8e-4470-ab10-edc20ddf73d0_2616x1632.png 848w, https://substackcdn.com/image/fetch/$s_!TCWD!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4241c550-3d8e-4470-ab10-edc20ddf73d0_2616x1632.png 1272w, https://substackcdn.com/image/fetch/$s_!TCWD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4241c550-3d8e-4470-ab10-edc20ddf73d0_2616x1632.png 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">Codebuff (left) vs Claude Code (right). We fulfilled the user&#8217;s prompt 5 times faster</figcaption></figure></div><p>Meanwhile, competitors are forging ahead. Is there room for another coding agent startup when billions in funding have already been distributed?</p><p>We have the best code output by some measures (speed x quality). Moreover, I can see how we can stay ahead, for at least a while.</p><ul><li><p>Cursor/Windsurf are limited by $20/month plans and aren&#8217;t optimizing for a fully capable coding agent.</p></li><li><p>Claude Code can only use Claude. The best codegen will use many models.</p></li><li><p>Quickly injecting better code context will continue to be an advantage, even as models get smarter.</p></li></ul><p>I recently met another codegen founder, who was seemingly resigned to the &#8220;bitter lesson&#8221;.</p><p>He says new models will come out and they will be better than whatever you are working on. Just stop trying to compete on code quality. Compete on infra and marketing; those will be the only enduring advantages.</p><p>I don&#8217;t agree. I continue to believe even very smart models will improve with more context.</p><p>Even models trained to be more agentic, where Sonnet 3.7 is just the start, will have a similar opportunity for improvement.</p><p>Historically, startups are the most responsive to customer needs. 
I think a startup will beat the labs at this task of cobbling together all the right pieces of context.</p><h1>You need to do more marketing</h1><p>While building any startup, you talk to enough people that you develop an intuition for what everyone else thinks you are lacking.</p><p>Commonly, they will tell you that you just need to get your product out in front of more people.</p><p>Like, you need to personally go to events and tell people to use your product. You need to post on social media. Or, you need to email Bill Gates (something my dad suggested for my first startup haha).</p><p>Maybe.</p><p>But founders have limited time and need to prioritize. Assuming you have enough users providing feedback daily (a big assumption), focusing on building a high-quality product that people can&#8217;t live without is often the better strategy.</p><p>The returns to marketing are sharply limited if your user retention sucks. Conversely, marketing is way easier if you have a killer product.</p><p>So yes, we need to do more marketing, but we also need to fix a high proportion of our bugs.</p><h1>Why is your team so small???</h1><p>So say the VCs, hoping we will take more investment and dilute ourselves beyond what is necessary.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!bGsE!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc83a2ac1-d2f6-4b8a-9a97-54d7e65b2a0f_1202x642.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!bGsE!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc83a2ac1-d2f6-4b8a-9a97-54d7e65b2a0f_1202x642.png 424w, 
https://substackcdn.com/image/fetch/$s_!bGsE!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc83a2ac1-d2f6-4b8a-9a97-54d7e65b2a0f_1202x642.png 848w, https://substackcdn.com/image/fetch/$s_!bGsE!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc83a2ac1-d2f6-4b8a-9a97-54d7e65b2a0f_1202x642.png 1272w, https://substackcdn.com/image/fetch/$s_!bGsE!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc83a2ac1-d2f6-4b8a-9a97-54d7e65b2a0f_1202x642.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!bGsE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc83a2ac1-d2f6-4b8a-9a97-54d7e65b2a0f_1202x642.png" width="580" height="309.783693843594" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c83a2ac1-d2f6-4b8a-9a97-54d7e65b2a0f_1202x642.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:642,&quot;width&quot;:1202,&quot;resizeWidth&quot;:580,&quot;bytes&quot;:122662,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://jamesgrugett.com/i/160216227?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc83a2ac1-d2f6-4b8a-9a97-54d7e65b2a0f_1202x642.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" 
srcset="https://substackcdn.com/image/fetch/$s_!bGsE!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc83a2ac1-d2f6-4b8a-9a97-54d7e65b2a0f_1202x642.png 424w, https://substackcdn.com/image/fetch/$s_!bGsE!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc83a2ac1-d2f6-4b8a-9a97-54d7e65b2a0f_1202x642.png 848w, https://substackcdn.com/image/fetch/$s_!bGsE!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc83a2ac1-d2f6-4b8a-9a97-54d7e65b2a0f_1202x642.png 1272w, https://substackcdn.com/image/fetch/$s_!bGsE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc83a2ac1-d2f6-4b8a-9a97-54d7e65b2a0f_1202x642.png 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">We (intentionally!) raised less than our peers in YC</figcaption></figure></div><p>It&#8217;s human nature to over-hire. Nearly every first-time founder gets it wrong.</p><p>Your company is more impressive the more people it has. It can get more done, too. It&#8217;s a given that more hours worked collectively means better results, right?</p><p>Wrong.</p><p>If the scope of what you need to build is one hairy bit of software, adding employees is unlikely to speed that up, since most code should have only one owner.</p><p>It&#8217;s also dangerous to lose a focused vision. New employees, even great ones, might want to take the company in a different direction. Even if they&#8217;re right, they&#8217;re wrong. Your startup has to go one way with full conviction, or it will fail.</p><p>All that said, hiring our first employee did seem to take some of the weight off our backs, given that we had one more person we could trust to handle user issues. It helps that he&#8217;s a cracked engineer!<br><br>If you&#8217;re interested in joining us to build the best coding agent, email me (james@codebuff.com)!</p><h1>The grind continues</h1><p>I work every day. Frequently, for the entire day. I like it. 
It&#8217;s rewarding albeit tiring.</p><p>But those sweet moments when users are happy,</p><p>when they&#8217;re rooting for us, when they say we&#8217;re about to blow up,</p><p>and when the roller coaster races, faster and faster, up the incline toward a new peak&#8212;</p><p>make the struggle all worth it.</p><p></p><div><hr></div><p></p><p>Today I took a break and went to the blog club at <a href="https://moxsf.com/">a local coworking space</a> to hang out with friends, resulting in this post.</p><p>Thanks for reading!</p>]]></content:encoded></item><item><title><![CDATA[Announcing Manicode, v0]]></title><description><![CDATA[The best AI coder that ever ran from your terminal]]></description><link>https://jamesgrugett.com/p/announcing-manicode-v0</link><guid isPermaLink="false">https://jamesgrugett.com/p/announcing-manicode-v0</guid><dc:creator><![CDATA[James]]></dc:creator><pubDate>Tue, 13 Aug 2024 22:43:07 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a2732c3-e88d-4877-9685-620a59e4164e_848x318.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div 
class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ztIG!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a2732c3-e88d-4877-9685-620a59e4164e_848x318.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ztIG!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a2732c3-e88d-4877-9685-620a59e4164e_848x318.png 424w, https://substackcdn.com/image/fetch/$s_!ztIG!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a2732c3-e88d-4877-9685-620a59e4164e_848x318.png 848w, https://substackcdn.com/image/fetch/$s_!ztIG!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a2732c3-e88d-4877-9685-620a59e4164e_848x318.png 1272w, https://substackcdn.com/image/fetch/$s_!ztIG!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a2732c3-e88d-4877-9685-620a59e4164e_848x318.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ztIG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a2732c3-e88d-4877-9685-620a59e4164e_848x318.png" width="848" height="318" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6a2732c3-e88d-4877-9685-620a59e4164e_848x318.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:318,&quot;width&quot;:848,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:55402,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ztIG!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a2732c3-e88d-4877-9685-620a59e4164e_848x318.png 424w, https://substackcdn.com/image/fetch/$s_!ztIG!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a2732c3-e88d-4877-9685-620a59e4164e_848x318.png 848w, https://substackcdn.com/image/fetch/$s_!ztIG!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a2732c3-e88d-4877-9685-620a59e4164e_848x318.png 1272w, https://substackcdn.com/image/fetch/$s_!ztIG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a2732c3-e88d-4877-9685-620a59e4164e_848x318.png 1456w" sizes="100vw" fetchpriority="high"></picture></div></a><figcaption class="image-caption">A Manicode testimonial from last night</figcaption></figure></div><p>Try it like this:</p><p><code>&gt; npm install -g manicode</code></p><p><code>&gt; manicode</code></p><p>Hello. It&#8217;s been a month and a half since my post <a href="https://jamesgrugett.com/p/software-automation-will-make-us">&#8220;Software automation will make us into Crazed-Super-Scientist Barons&#8221;</a>.</p><p>In that piece, I postulated that the killer use case for LLMs is generating code. When software becomes one-tenth as costly to write, it will unleash the &#8220;Crazed-Super-Scientist Barons&#8221;, i.e. 
entrepreneurs who use this newfound power to build amazing things.</p><p>I didn&#8217;t imagine that I would be building a coding tool that could help make this a reality!</p><h1>AI Grant application</h1><div id="prediction-market-iframe" class="prediction-market-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://manifold.markets/embed/JamesGrugett/will-manicode-be-accepted-into-ai-g&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/86fa12be-7ccd-4c8c-9b2b-cd06fcaf0a5a_600x315.png&quot;}" data-component-name="PredictionMarketToDOM"><iframe id="iframe-prediction-market" class="prediction-market-iframe" src="https://manifold.markets/embed/JamesGrugett/will-manicode-be-accepted-into-ai-g" width="560px" height="405px" frameborder="0"></iframe></div><p>Here&#8217;s an excerpt from my application to the AI Grant accelerator, which does a good job of explaining Manicode (more discussion below!):</p><p>Run <code>manicode</code> in your terminal. Ask it to do any coding task. It will make changes to your files.</p><p>...and it will do a really good job. Why?</p><ol><li><p><strong>It has full access to read and write your files, run terminal commands, and scrape the web</strong></p><p>It can: grab files it needs for context, edit multiple files at once (no copy-pasting), run the type checker, run tests, install dependencies, and search for documentation.</p><p>These abilities are key to doing a good job and will only become more powerful as LLMs continue to level up.</p></li><li><p><strong>It uses so-called "knowledge" files</strong></p><p>LLMs perform so much better with extra context!</p><p>With Manicode, we've come up with the idea of checking in <code>knowledge.md</code> files in any directory and writing down extra bits of context, like which 3 files you need to edit in order to create a new endpoint. Or which patterns are being deprecated and which should be used. 
Or which directories can import from other directories.</p><p>Every codebase has lots of implicit knowledge like this that you have to impart to your engineers. Once written down, it makes Claude really fly! It's truly a night and day difference.</p></li><li><p><strong>It's synchronous, and you can give feedback</strong></p><p>You're chatting with it. It takes ~30 seconds to get back to you and then you can tell it what you want to do next or what it did wrong.</p><p>This keeps Manicode on track and aligned.</p></li><li><p><strong>It learns</strong></p><p>The flow of using Manicode is:</p><ul><li><p>Ask it to do something</p></li><li><p>If it fails, point out its error</p></li><li><p>Manicode fixes the error and automatically writes down how it can improve for next time in a knowledge file</p></li><li><p>You push the commit, and now Manicode has become even more capable when the next engineer runs it in the codebase.</p></li></ul><p>This is the magic loop that will make Manicode productive for experienced engineers in giant codebases.</p></li><li><p><strong>We're unafraid to spend for better results</strong></p><p>We can use as many parallel API calls with as much context as we can to produce the best code, because we know that the alternative is human labor, which is much more expensive.</p></li></ol><p><strong>We're targeting the largest market for software engineers</strong></p><ul><li><p>It's a tool for daily use by experts (not just junior engineers)</p></li><li><p>It's for software maintainers (not just people starting new projects)</p></li></ul><p><strong>We're starting with a console application, because it's simple and has great distribution</strong></p><ul><li><p>Every developer knows how to install new packages with npm or pip.</p></li><li><p>Most developers already have the terminal accessible: as a pane in your vscode window, for example.</p></li></ul><p><strong>The timing is right</strong></p><p>Claude Sonnet 3.5 passed some bar of coding competence, 
and the form factor of a fully capable agent that can change any file works now, whereas before you could only reliably edit one function at a time.</p><p><strong>There is a moat after all</strong></p><p>Handling every tech stack well, knowing when to run its tests and type checker, integrating with Git, Linear, Slack, and email, supporting database migrations, and so on. You can build hundreds or thousands of pieces of special-case prompt magic so that it always just works the first time. A startup arriving at this 6 months late wouldn't catch up.</p><p><strong>Intro video</strong></p><div id="youtube2-ZzT4HIhnzio" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;ZzT4HIhnzio&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/ZzT4HIhnzio?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><p><strong>Demo video</strong></p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;65123f9f-6601-4de1-aec8-418178ff4e05&quot;,&quot;duration&quot;:null}"></div><div><hr></div><h1>Why Manicode is the right product</h1><blockquote><div class="pullquote"><p>"Wow, manicode is pretty great. I think by the end of the hour I'll have all the schema changes." 
- Manifold dev while I was writing this Substack piece</p></div></blockquote><p><strong>Manicode is just an LLM wrapper.</strong> But I think this is what works best.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!TiA4!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d3d86b9-e7f8-41ef-b35a-09430468b63b_886x499.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!TiA4!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d3d86b9-e7f8-41ef-b35a-09430468b63b_886x499.jpeg 424w, https://substackcdn.com/image/fetch/$s_!TiA4!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d3d86b9-e7f8-41ef-b35a-09430468b63b_886x499.jpeg 848w, https://substackcdn.com/image/fetch/$s_!TiA4!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d3d86b9-e7f8-41ef-b35a-09430468b63b_886x499.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!TiA4!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d3d86b9-e7f8-41ef-b35a-09430468b63b_886x499.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!TiA4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d3d86b9-e7f8-41ef-b35a-09430468b63b_886x499.jpeg" width="886" height="499" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9d3d86b9-e7f8-41ef-b35a-09430468b63b_886x499.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:499,&quot;width&quot;:886,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:58591,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!TiA4!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d3d86b9-e7f8-41ef-b35a-09430468b63b_886x499.jpeg 424w, https://substackcdn.com/image/fetch/$s_!TiA4!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d3d86b9-e7f8-41ef-b35a-09430468b63b_886x499.jpeg 848w, https://substackcdn.com/image/fetch/$s_!TiA4!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d3d86b9-e7f8-41ef-b35a-09430468b63b_886x499.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!TiA4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d3d86b9-e7f8-41ef-b35a-09430468b63b_886x499.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The best code will continue to be generated by the best LLM on the market for the foreseeable future.</p><p>You could build a smaller, custom model. But it probably won&#8217;t have seen as much code as OpenAI&#8217;s or Anthropic&#8217;s models. It&#8217;s hard to compete with billions of dollars of investment!</p><p><strong>Manicode gets user feedback quickly.</strong></p><p>LLMs are not quite good enough to go off on their own and continue to make progress.</p><p>They get stuck: sometimes they can&#8217;t fix a type error, or a failing test. They go off in a wrong direction: they pick the wrong design and keep building on it.</p><p>The best experience thus comes from a quicker feedback loop, where the human can direct the AI every 30 seconds on what to do next or what it did wrong. 
This is why Manicode is a synchronous conversation with the AI.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/nearcyan/status/1823276039805837683" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!mcOL!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc560abc1-035a-4662-9858-d71a7c82800d_1192x430.png 424w, https://substackcdn.com/image/fetch/$s_!mcOL!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc560abc1-035a-4662-9858-d71a7c82800d_1192x430.png 848w, https://substackcdn.com/image/fetch/$s_!mcOL!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc560abc1-035a-4662-9858-d71a7c82800d_1192x430.png 1272w, https://substackcdn.com/image/fetch/$s_!mcOL!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc560abc1-035a-4662-9858-d71a7c82800d_1192x430.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!mcOL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc560abc1-035a-4662-9858-d71a7c82800d_1192x430.png" width="1192" height="430" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c560abc1-035a-4662-9858-d71a7c82800d_1192x430.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:430,&quot;width&quot;:1192,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:78612,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://x.com/nearcyan/status/1823276039805837683&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!mcOL!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc560abc1-035a-4662-9858-d71a7c82800d_1192x430.png 424w, https://substackcdn.com/image/fetch/$s_!mcOL!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc560abc1-035a-4662-9858-d71a7c82800d_1192x430.png 848w, https://substackcdn.com/image/fetch/$s_!mcOL!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc560abc1-035a-4662-9858-d71a7c82800d_1192x430.png 1272w, https://substackcdn.com/image/fetch/$s_!mcOL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc560abc1-035a-4662-9858-d71a7c82800d_1192x430.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" 
stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>Manicode fills context well</strong></p><p>Manicode&#8217;s genuinely new idea is to write extra context as knowledge.md files and check them into your codebase, side by side with the actual code. Knowledge is anything you write that makes the actual agent work!</p><p>Other than that, Manicode knows the directory structure and chooses other relevant files to include automatically. This makes a big difference, especially because it can use the knowledge files to help it pick relevant files.</p><p><strong>Manicode has full access</strong></p><p>It can read and write to your files, as well as run potentially risky commands in your terminal without any confirmation from the user.</p><p>That sounds incredibly scary, but it is actually much less risky in reality, especially if you have version control.</p><p>What command could it run that would mess things up that much? 
The extra abilities only make it more useful.</p><p>This quality of doing something that normal people think "goes too far" or seems unsafe is correlated with good startup ideas, because it means fewer people are likely to have thought of it. (E.g., for Airbnb: You let random strangers sleep in your house? Or for Manifold: You let anyone ask and judge the resolution of their own question?)</p><h1>The race is on</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/garrytan/status/1822715982999433488" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!fZOS!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa67b0fc3-f56f-4d81-a68a-cbc18cf13880_1190x826.png 424w, https://substackcdn.com/image/fetch/$s_!fZOS!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa67b0fc3-f56f-4d81-a68a-cbc18cf13880_1190x826.png 848w, https://substackcdn.com/image/fetch/$s_!fZOS!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa67b0fc3-f56f-4d81-a68a-cbc18cf13880_1190x826.png 1272w, https://substackcdn.com/image/fetch/$s_!fZOS!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa67b0fc3-f56f-4d81-a68a-cbc18cf13880_1190x826.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!fZOS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa67b0fc3-f56f-4d81-a68a-cbc18cf13880_1190x826.png" width="1190" height="826" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a67b0fc3-f56f-4d81-a68a-cbc18cf13880_1190x826.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:826,&quot;width&quot;:1190,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:154391,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://x.com/garrytan/status/1822715982999433488&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!fZOS!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa67b0fc3-f56f-4d81-a68a-cbc18cf13880_1190x826.png 424w, https://substackcdn.com/image/fetch/$s_!fZOS!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa67b0fc3-f56f-4d81-a68a-cbc18cf13880_1190x826.png 848w, https://substackcdn.com/image/fetch/$s_!fZOS!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa67b0fc3-f56f-4d81-a68a-cbc18cf13880_1190x826.png 1272w, https://substackcdn.com/image/fetch/$s_!fZOS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa67b0fc3-f56f-4d81-a68a-cbc18cf13880_1190x826.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" 
stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The most valuable application for LLMs is right in front of us.</p><p>There are probably a hundred-plus startups vying to win with billions in investment. There could hardly be more at stake. The winner could be the next FAANG star that brings the creative power of LLMs to the world.</p><p>Bet on Manicode.</p><p>P.S. 
<a href="https://discord.gg/mcWTGjgTj3">Join the Discord</a>!</p>]]></content:encoded></item><item><title><![CDATA[The Future of Prediction Markets]]></title><description><![CDATA[My talk at Manifest 2024]]></description><link>https://jamesgrugett.com/p/the-future-of-prediction-markets</link><guid isPermaLink="false">https://jamesgrugett.com/p/the-future-of-prediction-markets</guid><dc:creator><![CDATA[James]]></dc:creator><pubDate>Sat, 03 Aug 2024 00:26:35 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!1di6!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f734615-63bf-450d-a3e0-d2514ec10d59_1017x695.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Watch the video <a href="https://www.youtube.com/watch?v=R1OoI6YQAgg">here</a>, or proceed to read the lightly edited transcript.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!1di6!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f734615-63bf-450d-a3e0-d2514ec10d59_1017x695.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!1di6!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f734615-63bf-450d-a3e0-d2514ec10d59_1017x695.png 424w, https://substackcdn.com/image/fetch/$s_!1di6!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f734615-63bf-450d-a3e0-d2514ec10d59_1017x695.png 848w, https://substackcdn.com/image/fetch/$s_!1di6!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f734615-63bf-450d-a3e0-d2514ec10d59_1017x695.png 1272w, 
https://substackcdn.com/image/fetch/$s_!1di6!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f734615-63bf-450d-a3e0-d2514ec10d59_1017x695.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!1di6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f734615-63bf-450d-a3e0-d2514ec10d59_1017x695.png" width="654" height="446.9321533923304" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2f734615-63bf-450d-a3e0-d2514ec10d59_1017x695.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:695,&quot;width&quot;:1017,&quot;resizeWidth&quot;:654,&quot;bytes&quot;:1508431,&quot;alt&quot;:&quot;This is pretty close to what Manifest looked like haha&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="This is pretty close to what Manifest looked like haha" title="This is pretty close to what Manifest looked like haha" srcset="https://substackcdn.com/image/fetch/$s_!1di6!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f734615-63bf-450d-a3e0-d2514ec10d59_1017x695.png 424w, https://substackcdn.com/image/fetch/$s_!1di6!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f734615-63bf-450d-a3e0-d2514ec10d59_1017x695.png 848w, 
https://substackcdn.com/image/fetch/$s_!1di6!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f734615-63bf-450d-a3e0-d2514ec10d59_1017x695.png 1272w, https://substackcdn.com/image/fetch/$s_!1di6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f734615-63bf-450d-a3e0-d2514ec10d59_1017x695.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">This is pretty close to what Manifest looked like haha.</figcaption></figure></div><h1>The talk</h1><p>Welcome! 
I'm so glad you're all here. You've all come from faraway places. Welcome to Manifest! I guess this is the first talk. Manifest this year is twice as big as last year. It's going to be crazy. I hope you guys have a great time. I hope you have twice as much fun. I think my talk is not going to be twice as good as last year, but yeah, welcome.</p><p>My name is James. I'm one of the co-founders of Manifold. It's nice to meet you if we haven't met already. I'm going to dive right into my talk, which is on the future of prediction markets. I hope that you find it interesting.</p><p>I'm most interested in the potential use cases for prediction markets. I'm going to outline four of them in my talk. Basically, prediction markets are going places these days. They're getting more popular. We even have people writing articles about prediction markets. Some of them are saying things like, "Oh, they're not going to grow very much more" or "They're actually not as good as polls." But when you have these kinds of articles coming out that are criticizing them, I think that's actually a good sign. That means we're onto something. 
So I'm happy about that.</p><p>The original idea of Manifold was to take prediction markets and have each one be run by a creator: one person would ask the question, set the resolution criteria, allow people to trade, and earn trading fees. So they would run the whole thing. They provide a lot of value by doing that, and then they would earn from the trading fees. So it turns it into almost like a mini business. That was the original idea that we proposed to Scott Alexander.</p><p>But we decided to move into play money. So it was never quite the case that you could create any question and, if it got popular enough, actually earn income from it. However, Manifold has recently announced that we are going to introduce cash prizes using the sweepstakes model. I'm happy to say that in about a month - it's not launched yet - you will be able to earn cash prizes and perhaps run markets like a business and possibly earn a profit by being a creator of markets. So I'm really excited for that.</p><p>Also at this Manifest, you will be able to learn more about sweepstakes, including through a live theatrical performance and musical that will go over the details. I hope you will attend that. I think it'll be in the park in like a couple of hours. So look forward to that.</p><p>And then without further ado, I'll continue to enlighten you on the four use cases that I think are really valuable for prediction markets.</p><h1>I. Running a market profitably</h1><p>The first one I've sort of outlined already, which is to run a market in a way that's net profitable. On Manifold so far, in the play money world, we actually were not running zero-sum markets. In fact, we were printing a bunch of Mana. We were giving out lots of bonuses because that made it a lot easier for someone who creates a market to earn Mana from doing it. 
So we would give you a bonus for every unique trader that trades on your market.</p><p>It was common for people to make a profit by creating markets on all these topics. Post-pivot, we have eliminated all those bonuses, and it's very sad. All the markets are actually zero-sum now, and we're charging trading fees. So when users trade, a little bit of their bet goes to the creator of the market. We are trying to bring back the original vision of Manifold, which is that the creator can make a profit on their market.</p><p>Let me tell you an anecdote. Just yesterday, you know that Starship succeeded at taking off and re-entering, and it didn't explode. That's amazing. It's really significant, but it's significant for another reason, which is that one of our users, Chris J Billington, created a market on whether the Starship would not explode. He subsidized it with 50,000 of his own Mana, and in this zero-sum environment, he was able to earn a profit for the first time as a creator because lots of people bet on it. He managed to close the market before the spaceship launched, so that meant he was able to get back his liquidity.</p><p>The way liquidity works is a little complicated, but if the probability gets bet to an extreme, then it actually ends up eating up the liquidity, and that money goes to the traders that bet on it. But if you close the market early, you can get it back. That's kind of a novel thing, because in the play money version of Manifold, people are just not that concerned about play money. But in this world, you need it to justify adding a large amount of subsidy. This was basically $50, or 50,000 Mana, but I hope that we can scale that. So this is like a proof of concept: something that works with a little bit of subsidy, $50, is something that we can magnify, and we can eventually have $50,000 as a subsidy.</p><p>If you can manage liquidity right, then you can sort of turn the market into a profitable endeavor. 
So I think that was something very significant that just happened yesterday.</p><p>Imagine if you have these engines for prediction markets where each one can actually become profitable. Then that will inspire a lot of people in a decentralized way to create markets on their niche, on what they're knowledgeable about. I think that is a great thing, and that's the first thing that I'm very jazzed about.</p><h1>II. Buying information</h1><p>Okay, the second thing I'm jazzed about for prediction markets: not every topic can support enough traders or enough betting volume to earn enough fees. So what if you have a question that's not very popular? What if it's a niche question? What if it's some obscure scientific fact that you want a forecast on, but maybe it's not quite profitable?</p><p>In that case, prediction markets are actually still useful. Here's how: basically, you flip the script and you say, instead of trying to earn a profit, I am going to purchase information. I am going to subsidize this market. So you put in however much money you think it's worth to answer this question, and then you just create the market. Then in a crowdsourced way, lots of people from around the world see that, and they bet on it if they think it's profitable. Through this magic, the prediction market will create a forecast to answer your question.</p><p>So the second case is basically buying information. I think that's really cool.</p><p>I've done this, for example. I wanted to know what the daily active users for Manifold would be in July, so in the future. I set it up. I asked this question using our experimental numeric distribution format, which I think is also novel. Essentially, it allows people to bet on this complex continuous distribution. It's actually not continuous; we break it into little buckets, but it's approximately continuous. 
Users can choose a range and then bet an amount within that range, and it will adjust the distribution to be higher where they bet.</p><p>I paid 10,000 Mana, and I got as a result this nice distribution which said, "Here's the expected daily active users for Manifold in July." That's an amazing service. I think that's useful. I think that's worth $10, and I think it might be worth more. The more Mana you put in, the more subsidy there is, and the more effort traders will put in to fix all the cases in that distribution.</p><p>Maybe they think that the chance is basically centered around where our current users are, and then there's this long tail where maybe we're going to grow a lot. I'm happy about that case, but it basically said there was like a 5% chance that we would grow by 50% or more. If I were interested in those tails and I wanted it to be more precise, it would actually just work to add more subsidy to the market so that people have more incentive to bet it to be accurate.</p><p>There are tons of examples within Manifold because we use markets all the time for everything. We're buying information, or we're doing a brainstorming session where we're getting users to propose features to us, or we're getting users to find bugs for us. There are many formats, including ones where you can ask an open-ended question and get free response answers.</p><p>The T-shirt design for Manifest that you have was created through a market, which was basically a contest that says, "Who will design the best T-shirt?" Another example that's pretty cool is that every month, the Manifold team does a retrospective of what went well and what didn't go so well in the last month. I create two markets for that. These are free response markets that mostly the Manifold team uses where we submit answers of what we thought went especially well and what we thought maybe didn't. 
This helps us improve.</p><p>Basically, I will just look at those answers and then subjectively be like, "I think this is pretty important" and "This one's not that important," and I come up with weights for them. Then I just resolve the market according to that. So it provides an incentive for people to submit and bet on answers that they think I will judge as relatively important. It's just one more use case for how we use markets internally. I think that the ability to earn cash prizes and subsidize stuff is going to be a major deal, which might unlock a lot of use cases for other companies to use these as well.</p><h1>III. Economic hedging</h1><p>Okay, the third use case for prediction markets that I think is valuable and will be interesting going forward is economic hedging. Usually, when you make a bet, you're betting because you think you're going to make money in expectation, like you're going to get more out of it than you put in. Sometimes you don't even need that in order for it to be worthwhile for you.</p><p>That works because people have utility functions that are basically risk-averse. They might want to hedge the downside so that maybe they're losing money in expectation, but they're also making sure that the worst case is not so bad. This is kind of like an insurance market.</p><p>I created a market for myself because I ride an electric unicycle, and it's very dangerous. So I created a market like, "Will I have an accident or will I have an injury?" The market itself can first find the right price, so it discovers what is the right price for my insurance, essentially. It'll be like, "Okay, I broke my collarbone last year right after last Manifest riding an electric skateboard, so I switched to a unicycle because I thought it was safer." So you have a base rate.</p><p>I created that market, and I started. I was like, "I'll buy yes at 4% that I will have an accident in the rest of 2024." 
Then people started betting it up, and now it's like a 10% chance that I'll have an accident. So I ended up just posting a limit order to buy yes at 10%. People will fill that. I've gotten some of it filled already. Then I have insurance: I actually make a profit if I do end up breaking another bone.</p><p>Economic hedging is probably less useful personally. I think the main use cases are usually for businesses. There are tons of use cases for this. There's like, "Will the weather be bad on this day that makes the event not work?" or "Will Trump be elected, and then somehow that changes something for you in your business?" or "Taiwan is invaded, and that's not good for your business for some reason." So you can hedge all of those outcomes.</p><p>What I will say is that the user-created model is basically an amazing combination with this economic hedging use case because you can figure out what you want to hedge, and then you create exactly that question and hedge it. Like I could create exactly the question I wanted on my electric unicycle. But if you're a business, I think that's like a superpower. If you're going through traditional finance and stuff, it's not that easy. So economic hedging is pretty cool.</p><h1>IV. Matchmaking</h1><p>Okay, the fourth use case that I think is really valuable is matchmaking. I would put this in a generalized way. There's basically hiring, where you're matching a job seeker to a company. There's networking, like who should I meet, who should I talk to. There are friendships, like who would I really vibe with. And there's dating, of course. I mean, obviously.</p><p>After last Manifest, Robin Hanson gave a speech, and he said, "You guys are doing cool stuff, but you're kind of not getting to the really valuable use cases. You need to think long and hard about where prediction markets are going to be the most valuable. Is it really about predicting whether the ball pit is going to materialize at Manifest?" 
He suggested that hiring markets, helping companies hire employees, are obviously really valuable.</p><p>I heard that, and then I was like, "I know exactly what to build," and that was Manifold Love. I don't know if you guys know it, but I'll explain very briefly: the idea is that you create a public profile, a dating profile. You upload photos, you answer questions, and then you bet on who among these profiles is going to date who.</p><p>Secondly, I'd say that there were definitely some issues where people were not that interested in browsing on behalf of someone else. People are definitely more interested in themselves and browsing for themselves and betting on their own prospects. But I think the addition of sweepstakes and cash prizes could help incentivize people to go out and find matches for other people. So I think it's possible that Manifold Love could be rebooted at some point. I know if Stephen were here, my brother, he would be like, "No, no, no, we're not doing that at all. Just forget about Manifold Love." But yeah, I think it's really cool.</p><p>Also, we are planning to actually do the hiring use case. Manifold is not really hiring at the moment, but we might soon. When we are, I will create a market on who we will hire. I will subsidize it with lots of money so that you'll make at least $10,000, maybe more, by betting on who we're going to hire. I actually think that this is a promising use case. I think that crowdsourcing who we would hire makes sense: people have good ideas, they know people, and the mechanism is correct. If someone thinks we're going to hire a candidate, it's profitable in expectation for them to bet on it, so they will. That will surface to us the most promising candidates. We will just look at them and see which ones have the highest probability of being hired, then we'll interview those people. I think it's going to work. 
So we will dogfood that, and if it's sufficiently successful, then we can just try to make that work for everyone, try to run that as a product for other companies. I think that's really exciting.</p><h1>V. AI</h1><p>Okay, so those are four use cases. This is a talk about the future of prediction markets, so I have to mention AI at some point. Basically, my take on AI is that it will take all of those use cases and it will supercharge them because AI is actually going to make all of those way better.</p><p>Let me run through them again. So we have running a profitable market. Imagine that it's an AI that creates the market. It's the AI that has to judge whether it happened. I actually think AIs are going to be great judges, like an impartial source. Instead of relying on a human that's emotional and maybe they didn't read the evidence correctly, or they woke up and were sick that day, you don't know. But if it's an AI, you're like, "Okay, that's objective." I think they're going to be great market makers and resolvers. They can do it for cheap, which is really like another superpower.</p><p>So it means that we can have markets on everything because it's going to be so cheap for AIs to create and resolve markets. You can ask the AI questions about the resolution criteria, and it will always be there to respond and give you clarifications. So I think it's going to be a better service, and it'll be cheaper. That's amazing because we actually can just support markets on everything. Then you can just type into Manifold any question, and we already have a lot of questions - we have like 100,000 - but then we'll just have like 100 million or something. There'll just be so many questions.</p><p>That's just the first use case - running markets as a business. It's going to be profitable for those AIs in particular because they just don't need to be paid much.</p><p>Then there's another use case, which is doing research or subsidizing markets. 
Why would an AI pay to create a market and subsidize it? I think we're going to enter this world of AI agents that are trying to understand the world. They're trying to make progress on their own. The world is complex, and the AI might not know everything. I think there could be domain expert AI agents that know about certain things.</p><p>Basically, what I think will happen is that prediction markets will be like a native technology of AIs. When there's something they don't know, they will ask a question using APIs, and they will subsidize it. Then other AIs will answer, will bet on it, and so it can, in a matter of seconds, sort of figure something out about the world. This is all just information technology. It's just whatever it wants to learn, it can do. There will be these AI agents that perform this role because it's profitable to bet on these markets.</p><p>They will be running businesses, and then they will be hedging their business economically using prediction markets because it just makes sense. It just produces value. So I think that prediction markets are actually a really good match with AI. I think humans are actually not a good match for prediction markets when you think about it. It doesn't come naturally, like thinking in probabilities, figuring out exactly how much to bet. That's like only weird humans do that. It's not something that all of us just natively think in this way economically and in probabilities. But computers are so good at this, and they're already trading on markets. Like most of the volume in stock markets is all algorithms and stuff. So basically, I want to submit that prediction markets, and in particular user-created prediction markets, will be a useful tool for AIs to do a bunch of things using all the use cases I outlined.</p><p>One ending anecdote might be if, in the future, you have your personal AI and it knows everything about you. But people are protective of their data, so actually only your AI has that data. 
It's kind of like it sees your whole life and it knows everything you say. So it has a really good grasp on you personally. That AI could go out and bet in prediction markets on who you will marry, on which jobs you will take. You will benefit from this because you will get all these nice forecasts of what you should be doing. It will make your life a lot better. I think that that would be a truly amazing world.</p>]]></content:encoded></item><item><title><![CDATA[Software automation will make us into Crazed-Super-Scientist Barons]]></title><description><![CDATA[10x cheaper and faster software development will be here in ~1 year via AI agents. 
How will the world change?]]></description><link>https://jamesgrugett.com/p/software-automation-will-make-us</link><guid isPermaLink="false">https://jamesgrugett.com/p/software-automation-will-make-us</guid><dc:creator><![CDATA[James]]></dc:creator><pubDate>Mon, 01 Jul 2024 03:25:32 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/92e2e965-48a2-4c6b-8be1-7c9b9ccb44dc_4032x3024.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>We&#8217;re quickly moving up the <a href="https://worrydream.com/LadderOfAbstraction/">abstraction ladder</a> for software development! <a href="https://thezvi.substack.com/p/on-claude-35-sonnet">Claude 3.5 Sonnet</a> is more evidence that the cutting edge is continuing to improve.</p><p>I propose 5 levels of automation for software development, akin to the <a href="https://www.synopsys.com/automotive/autonomous-driving-levels.html">self-driving car levels</a>.</p><h1>Levels of Software Automation</h1><p>In the last few years, we&#8217;ve moved from no automation, to auto-completing lines of code, to writing whole functions:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!8Ebi!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9beed302-c592-4b89-b9f9-66dde980bee0_1746x530.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!8Ebi!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9beed302-c592-4b89-b9f9-66dde980bee0_1746x530.png 424w, https://substackcdn.com/image/fetch/$s_!8Ebi!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9beed302-c592-4b89-b9f9-66dde980bee0_1746x530.png 848w, https://substackcdn.com/image/fetch/$s_!8Ebi!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9beed302-c592-4b89-b9f9-66dde980bee0_1746x530.png 1272w, https://substackcdn.com/image/fetch/$s_!8Ebi!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9beed302-c592-4b89-b9f9-66dde980bee0_1746x530.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!8Ebi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9beed302-c592-4b89-b9f9-66dde980bee0_1746x530.png" width="1456" height="442" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9beed302-c592-4b89-b9f9-66dde980bee0_1746x530.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:442,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:107786,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!8Ebi!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9beed302-c592-4b89-b9f9-66dde980bee0_1746x530.png 424w, https://substackcdn.com/image/fetch/$s_!8Ebi!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9beed302-c592-4b89-b9f9-66dde980bee0_1746x530.png 848w, https://substackcdn.com/image/fetch/$s_!8Ebi!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9beed302-c592-4b89-b9f9-66dde980bee0_1746x530.png 1272w, https://substackcdn.com/image/fetch/$s_!8Ebi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9beed302-c592-4b89-b9f9-66dde980bee0_1746x530.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft 
pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">We are currently at level II. See <a href="https://www.cursor.com/">Cursor</a> for the state of the art.</figcaption></figure></div><p>For experienced engineers, levels <strong>I</strong> and <strong>II</strong> are tools that result in only a modest speedup, currently between 1-2x.</p><p>However, I think we&#8217;re on the brink of a major shake-up.</p><h1>Level III automation</h1><p>The next level of automation, where a human guides the AI toward implementing whole features, will be totally different. 
Software will become much, much cheaper.</p><p>A feature that might take 3 hours of concentrated work today could be done in 15 minutes by spec&#8217;ing it out in a paragraph and leaving a few comments on an AI agent&#8217;s proposed changes.</p><p>This is the vision of the startup <a href="https://mentat.ai/">Mentat.ai</a>, which claims the highest score on a software engineering benchmark.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://twitter.com/bio_bootloader/status/1806342922893394290" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!NbLy!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feafa9600-7fd9-4356-b5fb-0baa853fd65d_1668x946.jpeg 424w, https://substackcdn.com/image/fetch/$s_!NbLy!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feafa9600-7fd9-4356-b5fb-0baa853fd65d_1668x946.jpeg 848w, https://substackcdn.com/image/fetch/$s_!NbLy!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feafa9600-7fd9-4356-b5fb-0baa853fd65d_1668x946.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!NbLy!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feafa9600-7fd9-4356-b5fb-0baa853fd65d_1668x946.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!NbLy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feafa9600-7fd9-4356-b5fb-0baa853fd65d_1668x946.jpeg" width="562" height="318.8269230769231" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/eafa9600-7fd9-4356-b5fb-0baa853fd65d_1668x946.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:826,&quot;width&quot;:1456,&quot;resizeWidth&quot;:562,&quot;bytes&quot;:84469,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:&quot;https://twitter.com/bio_bootloader/status/1806342922893394290&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!NbLy!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feafa9600-7fd9-4356-b5fb-0baa853fd65d_1668x946.jpeg 424w, https://substackcdn.com/image/fetch/$s_!NbLy!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feafa9600-7fd9-4356-b5fb-0baa853fd65d_1668x946.jpeg 848w, https://substackcdn.com/image/fetch/$s_!NbLy!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feafa9600-7fd9-4356-b5fb-0baa853fd65d_1668x946.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!NbLy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feafa9600-7fd9-4356-b5fb-0baa853fd65d_1668x946.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" 
stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">MentatBot&#8217;s impressive <a href="https://twitter.com/bio_bootloader/status/1806342922893394290">benchmark result</a>.</figcaption></figure></div><p>I tried it out yesterday, with little success.</p><p>First, I created <a href="https://github.com/manifoldmarkets/manifold/issues/2693">an issue</a> on our open source repo, and tagged &#8220;@MentatBot&#8221; to trigger it:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://github.com/manifoldmarkets/manifold/issues/2693" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!DP7S!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96f54c7c-6c53-41c7-8f62-c490a25b81ef_1840x800.png 424w, 
https://substackcdn.com/image/fetch/$s_!DP7S!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96f54c7c-6c53-41c7-8f62-c490a25b81ef_1840x800.png 848w, https://substackcdn.com/image/fetch/$s_!DP7S!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96f54c7c-6c53-41c7-8f62-c490a25b81ef_1840x800.png 1272w, https://substackcdn.com/image/fetch/$s_!DP7S!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96f54c7c-6c53-41c7-8f62-c490a25b81ef_1840x800.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!DP7S!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96f54c7c-6c53-41c7-8f62-c490a25b81ef_1840x800.png" width="1456" height="633" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/96f54c7c-6c53-41c7-8f62-c490a25b81ef_1840x800.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:633,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:172735,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://github.com/manifoldmarkets/manifold/issues/2693&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!DP7S!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96f54c7c-6c53-41c7-8f62-c490a25b81ef_1840x800.png 424w, 
https://substackcdn.com/image/fetch/$s_!DP7S!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96f54c7c-6c53-41c7-8f62-c490a25b81ef_1840x800.png 848w, https://substackcdn.com/image/fetch/$s_!DP7S!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96f54c7c-6c53-41c7-8f62-c490a25b81ef_1840x800.png 1272w, https://substackcdn.com/image/fetch/$s_!DP7S!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96f54c7c-6c53-41c7-8f62-c490a25b81ef_1840x800.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line 
x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">I asked the AI agent bot to do some work for me.</figcaption></figure></div><p>The resulting <a href="https://github.com/manifoldmarkets/manifold/pull/2694">Pull Request</a> seemed impressive at first, but on closer inspection, almost every change it made was a little bit wrong. The bot edited some of the right files, but didn&#8217;t call the helper function that it created elsewhere. It also edited some wrong files that were more like library code. It created type errors that it couldn&#8217;t fix, and didn&#8217;t always follow my instructions.</p><p>Still, MentatBot is a promising early stab at the problem, currently powered by GPT-4o (they hope to upgrade to Claude 3.5 Sonnet soon).</p><p>With another year of improvements to base LLMs, plus further <a href="https://situational-awareness.ai/from-gpt-4-to-agi/">unhobbling</a> via efforts of startups to chain LLM calls productively, I can imagine us at automation level <strong>III</strong> within a year (50% chance) or within two years (75% chance).</p><p>Below is a market I created on roughly the criteria for level III automation by July 2025:</p><div id="prediction-market-iframe" class="prediction-market-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://manifold.markets/embed/JamesGrugett/will-ai-agents-be-able-to-code-a-sm&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/96caf5e9-2c06-40a6-bfb9-a2c7282db3e2_600x315.png&quot;}" data-component-name="PredictionMarketToDOM"><iframe id="iframe-prediction-market" class="prediction-market-iframe" src="https://manifold.markets/embed/JamesGrugett/will-ai-agents-be-able-to-code-a-sm" width="560px" height="405px" frameborder="0"></iframe></div><p>The world will change appreciably with a 10x speedup in software creation.</p><p>There are 4.4M software engineers in the US. They collectively earn approximately $500B per year. 
If we&#8217;re able to do all that work with 10% of the engineers, that naively implies ~$450 billion per year in value created.</p><p>Of course, decreasing the cost of software by 90% will dramatically increase demand, as economists know. That&#8217;s why the value created from level <strong>III</strong> automation is likely much larger, though hard to predict. With an explosion in use cases for cheap software, the value created could be in the trillions annually.</p><p>Suddenly, software will become more polished. Bank apps will take less time to load. There will be fewer bugs in day-to-day usage.</p><p>Most importantly, there&#8217;ll be an explosion of startups. Niches that were not profitable before will be filled. We&#8217;ll have more personalized software, generated even for individuals. And of course faster software development will feed into better AI.</p><p>However, the key unlock of level III automation is not cost savings. It&#8217;s iteration speed.</p><h1>Crazed-Super-Scientist Barons</h1><p>It&#8217;s well-known <a href="https://manifold.markets/">Manifold</a> lore that we push changes at a breakneck pace, sometimes to the detriment of our users.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://twitter.com/ManifoldMarkets/status/1701426706865897787" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!-EVD!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29f4e68b-51a5-4a1a-924d-4bd560c3392d_1458x272.png 424w, https://substackcdn.com/image/fetch/$s_!-EVD!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29f4e68b-51a5-4a1a-924d-4bd560c3392d_1458x272.png 848w, 
https://substackcdn.com/image/fetch/$s_!-EVD!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29f4e68b-51a5-4a1a-924d-4bd560c3392d_1458x272.png 1272w, https://substackcdn.com/image/fetch/$s_!-EVD!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29f4e68b-51a5-4a1a-924d-4bd560c3392d_1458x272.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!-EVD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29f4e68b-51a5-4a1a-924d-4bd560c3392d_1458x272.png" width="1456" height="272" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/29f4e68b-51a5-4a1a-924d-4bd560c3392d_1458x272.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:272,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://twitter.com/ManifoldMarkets/status/1701426706865897787&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!-EVD!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29f4e68b-51a5-4a1a-924d-4bd560c3392d_1458x272.png 424w, https://substackcdn.com/image/fetch/$s_!-EVD!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29f4e68b-51a5-4a1a-924d-4bd560c3392d_1458x272.png 848w, 
https://substackcdn.com/image/fetch/$s_!-EVD!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29f4e68b-51a5-4a1a-924d-4bd560c3392d_1458x272.png 1272w, https://substackcdn.com/image/fetch/$s_!-EVD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29f4e68b-51a5-4a1a-924d-4bd560c3392d_1458x272.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">We printed this on a <a href="https://twitter.com/ManifoldMarkets/status/1701426706865897787">poster in our office</a>. It&#8217;s become a source of pride.</figcaption></figure></div><p>For moving this quickly, as a small full-time team of 6, Manifold was said to be a &#8220;fiefdom run by crazed scientist barons.&#8221;</p><p>I ran with this idea and proposed that all organizations would be more effective if they operated on this model. See my &#8220;<a href="https://manifold.markets/JamesGrugett/will-the-mad-scientists-theory-of-g">Mad Scientists Theory of Governance</a>&#8221; market for further elaboration!</p><p>Now, imagine what will happen when you hand us another 10x speedup. I added &#8220;Super-&#8221; to the phrase, but it&#8217;s hard to envision where exactly that will lead us.</p><p>In day-to-day work, we receive a giant volume of requests for bug fixes and features. There are so many ideas and experiments to try that we are incredibly bottlenecked on execution.</p><p>Going ten times faster could shift the bottleneck from execution to getting feedback (and deciding what to do next). You need to test product changes on users to see whether your idea was good, and that takes time in the real world.</p><p>Running A/B tests can take a while to give statistically significant results. However, qualitative feedback can be richer and faster. 
I predict startups like ours will collect more individual feedback from users, because they will have the capacity to act on it. (Just like Discord has been critical for increasing user feedback in our journey so far.)</p><p>If today it takes two years to find product-market fit for a new product, then crazed-super-scientist barons should be able to do it in a few months.</p><p>We&#8217;ll thus see an acceleration of the serial-entrepreneur phenomenon, including more parent companies that spin up dozens of products. Such could be the future of Manifold. Our name allows for it, at least!</p><h1>The speed limit of progress</h1><p>Innovation is currently driven by small teams pushing hard against the frontier. This sets the global speed limit for progress.</p><p>In the next 1-2 years, AI will increase that speed limit by a factor of ten, at least for software startups. Exciting times!</p><p>As AI continues to develop, and the level of automation increases, the speed limit will continue to be pushed back across all fields. I look forward to this world of abundant frontier advances!</p>]]></content:encoded></item><item><title><![CDATA[How to give a talk]]></title><description><![CDATA[My learnings from Manifest, as a beginning speaker]]></description><link>https://jamesgrugett.com/p/how-to-give-a-talk</link><guid isPermaLink="false">https://jamesgrugett.com/p/how-to-give-a-talk</guid><dc:creator><![CDATA[James]]></dc:creator><pubDate>Fri, 21 Jun 2024 07:58:05 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/4e55056c-959c-4d03-8a4d-6eafe658f474_1792x1024.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Some talks are wayyy better than others.</p><p>At Manifest, I went to talks where the audience was deeply engaged with every word the speaker said.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://jamesgrugett.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading my first post on <em>Liberty</em>! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>Was the audience paying attention because they found the content interesting? 
Or maybe the speaker just had <em>charisma</em> &#8212; <a href="https://www.eisenhowerlibrary.gov/sites/default/files/file/what_is_leadership.pdf">a special X-factor</a> that most humans could never hope to attain? (I don&#8217;t think that&#8217;s it!)</p><p>No, I think giving a good talk can be learned simply. Here&#8217;s the formula that worked for me.</p><h1>1. Delete your slides</h1><p>This is the big one, and it&#8217;s counterintuitive. You will do infinitely better if you follow this one point.</p><p>Basically, slides are distracting. If people are trying to read your slide <em>and</em> listen, then they will stop listening. If they can read through the point you&#8217;re about to make, then they will do that, and then they will stop listening.</p><p>Just like good UX shows you one thing at a time, let your audience focus on what you are saying. Without a distraction, they will have no choice but to hang on every word.</p><p>In addition, having no slides forces you to talk naturally to your audience. You&#8217;ll make eye contact like a normal human. You&#8217;ll see if people are following along or if they are confused. It&#8217;ll force you to prepare, too, because you need to know what you&#8217;re going to say.</p><p>If you really need to show some visuals, then it&#8217;s OK to use slides, but <a href="https://www.howtogiveatalk.com/blog/principle-1-dont-put-words-on-slides">don&#8217;t put words on them</a>. If you&#8217;re introducing a complex diagram, <a href="https://www.howtogiveatalk.com/blog/principle-2-use-black-slides">put a slide with literally nothing on it</a> first so you can introduce the idea and prime the audience.</p><h1>2. Ham it up!</h1><p>The more passion you inject into your speech, the more engrossed the audience will be.</p><p>This is approximately a linear relationship. It can go from 0, if you are totally lifeless when speaking, to unbounded highs if you are really cranking it.</p><p>There&#8217;s no highest number, right? 
Likewise, there is no limit to the level of passion which will improve your talk!</p><p>This was the rule at the talks I attended at Manifest. Partly, passion is just correlated with speakers having something interesting to say.</p><p>However, you can ruin a talk by speaking with very little confidence or droning on in a monotone.</p><p>But when speakers get really into it, with hand gestures and emphasis, and the slow spell-ing-out-of-syll-a-bles&#8230; then, you can&#8217;t help but cheer along!</p><h1>3. Practice each section</h1><p>Split your talk into sections and practice them aloud, preferably with an audience of one or two for feedback<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>.</p><p>My talk this year had 7 sections, approximately 5 minutes in length each. I practiced each at least twice.</p><p><em>Especially</em> practice the transitions between sections! Then you&#8217;ll never get lost.</p><p>I feel like preparing a talk is much less intimidating when you break it down this way. It&#8217;s hard to keep a half-hour talk in memory. But, if it&#8217;s just one idea at a time, you&#8217;ll be fine.</p><h1>4. Iterate on an outline</h1><p>Creating anything is an iterative process. Nothing starts out good. Neither will the content of your talk.</p><p>But it&#8217;s a waste to write a whole essay &#8212; thousands of words &#8212; and then realize that the topic was slightly wrong, or the structure was off, and so you need to rewrite it.</p><p>Instead, use the natural abstraction for a piece of thought &#8212; a hierarchical outline! 
And iterate on it.</p><p>You should be able to whip up a 30-minute talk in record time by blasting out an outline, and then alternating between speaking a section out loud and refining that section in your outline based on what sounds good.</p><p>After a few iterations, you&#8217;ll also have your talk memorized enough to be ready for prime time.</p><p>Speaking of prime time&#8230;</p><h1>5. Chug a Red Bull and do this vocal exercise</h1><p>I&#8217;ve given two recorded talks in my life and I&#8217;m two-for-two on chugging Red Bulls right before. It works! I&#8217;m telling you!</p><p>Allow about 30 minutes for the caffeine to kick in.</p><p>Next, point your index fingers, and press them into the middle of your left and right cheeks. Then, bite down and blow air out of your lips &#8212; go &#8220;brrr&#8221; like you&#8217;re imitating a propeller plane.</p><p>Got it? OK, now do this while singing up and down an octave-and-a-half scale. And repeat. Yes, just like that! (Or, review the video below for help, hahaha!)</p><p>By warming up your voice, you&#8217;ll sound clearer and more easily convey emotion. (Source: my mom is an opera singer.)</p><div id="youtube2-fQfaqE8UHHM" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;fQfaqE8UHHM&quot;,&quot;startTime&quot;:&quot;1386&quot;,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/fQfaqE8UHHM?start=1386&amp;rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><h1>Conclusion</h1><p>Use these tips to go forth and give your own talk!</p><p>Have confidence in yourself. You are interesting. 
The world needs to know what you have to say!</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Thank you, Quinn, for saving my talk by asking if I needed a buddy to listen and give feedback on it.</p></div></div>]]></content:encoded></item></channel></rss>