<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[Sapien Loop]]></title><description><![CDATA[Staying in the Engineering and AI Loop for Sapiens]]></description><link>https://sapienloop.substack.com</link><image><url>https://substackcdn.com/image/fetch/$s_!CUl7!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff83f393d-8082-4398-a74d-6072e30c3a93_608x608.png</url><title>Sapien Loop</title><link>https://sapienloop.substack.com</link></image><generator>Substack</generator><lastBuildDate>Thu, 18 Jun 2026 04:05:16 GMT</lastBuildDate><atom:link href="https://sapienloop.substack.com/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Sudhakar Velamoor]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[sapienloop@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[sapienloop@substack.com]]></itunes:email><itunes:name><![CDATA[Sudhakar Velamoor]]></itunes:name></itunes:owner><itunes:author><![CDATA[Sudhakar Velamoor]]></itunes:author><googleplay:owner><![CDATA[sapienloop@substack.com]]></googleplay:owner><googleplay:email><![CDATA[sapienloop@substack.com]]></googleplay:email><googleplay:author><![CDATA[Sudhakar Velamoor]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[Cognitive Surrender and Cogent Engineering]]></title><description><![CDATA["Co-intelligence" to "Co-existence"]]></description><link>https://sapienloop.substack.com/p/cognitive-surrender-and-cogent-engineering</link><guid isPermaLink="false">https://sapienloop.substack.com/p/cognitive-surrender-and-cogent-engineering</guid><dc:creator><![CDATA[Sudhakar Velamoor]]></dc:creator><pubDate>Wed, 10 Jun 2026 17:57:07 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!-2fv!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff073ac0b-5eb3-48c8-a5ca-09004af3882d_696x474.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Last week, Anthropic said  <a href="https://www.anthropic.com/institute/recursive-self-improvement">AI is close to building itself</a>. Before this occurs, they say, we should pause AI to prevent dire consequences to humanity. The trend continues from before. We have heard this before in the form of a <a href="https://futureoflife.org/open-letter/pause-giant-ai-experiments">letter</a>,  <a href="https://www.reuters.com/technology/musk-experts-urge-pause-training-ai-systems-that-can-outperform-gpt-4-2023-03-29/">signed by Elon Musk</a>, who subsequently proceeded to accelerate Grok&#8217;s training schedule, instead of pausing.  Anthropic&#8217;s IPO is around the corner, and so are OpenAI and SpaceX IPOs. This is a 4 trillion plus market event based on valuations that focus on a future of mankind where, supposedly, entire industries could be replaced or wiped out. The optimism surrounding AI companies has led to the devaluation of SaaS companies and their earnings multiples.  Naturally, in leading up to an IPO, convincing both their investors and the larger public around how impressive this technology is key to Anthropic&#8217;s success, and they are doing an excellent job of it. Whether you fully subscribe to &#8220;AGI is coming in 2 years&#8221; or to &#8220;AGI is already here&#8221; - my thesis is that we&#8217;ll all intuitively know when AGI really arrives, but I don&#8217;t know as much about this as <a href="https://jack-clark.net/2026/05/26/import-ai-458-reckoning-with-the-future-and-a-singularity-story/">Jack Clark</a> does, and his thoughts in the link here are definitely worth a read.</p><p>This post isn&#8217;t really about the speed at which AI is showing recursive self improvement, but about how humans seem to be succumbing to &#8220;cognitive surrender&#8221;, a term coming out of <a href="https://papers.ssrn.com/sol3/papers.cfm?abstract_id=6097646">research from Wharton</a>. This paper is an excellent read for everyone who loved reading &#8220;Thinking Fast and Slow&#8221; by Daniel Kahneman.  In my own life, I have found these 2-system mechanisms of explaining human cognition to be somewhat limiting, but very useful in a leadership context. Often when trying to explain a prediction that I might make intuitively (System 1), I reach for my System 2 analytical methods to reason through so I could explain it to myself and to others. Sometimes this is hard when there are decisions to be made quickly. Other times, you could take your time with these decisions, and take the System 2 path entirely. </p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://sapienloop.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Sapien Loop! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>Well - the paper and research introduces a new system, System 3 (Artificial), which is an AI system that sits &#8220;in silico&#8221; and helps you with decisions. When there&#8217;s a conflict between System 1 &amp; System 2, usually people review their actions based on a detailed analysis from System 2 and come back and adjust their intuitions. The flow is illustrated by Shaw and Nave (authors of the paper).</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!-2fv!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff073ac0b-5eb3-48c8-a5ca-09004af3882d_696x474.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!-2fv!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff073ac0b-5eb3-48c8-a5ca-09004af3882d_696x474.png 424w, https://substackcdn.com/image/fetch/$s_!-2fv!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff073ac0b-5eb3-48c8-a5ca-09004af3882d_696x474.png 848w, https://substackcdn.com/image/fetch/$s_!-2fv!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff073ac0b-5eb3-48c8-a5ca-09004af3882d_696x474.png 1272w, https://substackcdn.com/image/fetch/$s_!-2fv!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff073ac0b-5eb3-48c8-a5ca-09004af3882d_696x474.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!-2fv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff073ac0b-5eb3-48c8-a5ca-09004af3882d_696x474.png" width="696" height="474" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f073ac0b-5eb3-48c8-a5ca-09004af3882d_696x474.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:474,&quot;width&quot;:696,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:67447,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://sapienloop.substack.com/i/200691224?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff073ac0b-5eb3-48c8-a5ca-09004af3882d_696x474.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!-2fv!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff073ac0b-5eb3-48c8-a5ca-09004af3882d_696x474.png 424w, https://substackcdn.com/image/fetch/$s_!-2fv!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff073ac0b-5eb3-48c8-a5ca-09004af3882d_696x474.png 848w, https://substackcdn.com/image/fetch/$s_!-2fv!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff073ac0b-5eb3-48c8-a5ca-09004af3882d_696x474.png 1272w, https://substackcdn.com/image/fetch/$s_!-2fv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff073ac0b-5eb3-48c8-a5ca-09004af3882d_696x474.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The key new term is &#8220;Cognitive surrender&#8221;, which is entirely a new phenomenon studied by the authors and the findings are astonishing. The participants,  who didn&#8217;t know ahead of time whether it was going to be accurate or not, relegated their decisions to AI 93% of the time when it was deemed accurate and 80% when it was deemed faulty. The more interesting find is that System 3 <em>facilitated</em> cognitive surrender. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!hEhK!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2d69a890-d104-4671-9802-ead7593eada2_1049x915.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!hEhK!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2d69a890-d104-4671-9802-ead7593eada2_1049x915.png 424w, https://substackcdn.com/image/fetch/$s_!hEhK!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2d69a890-d104-4671-9802-ead7593eada2_1049x915.png 848w, https://substackcdn.com/image/fetch/$s_!hEhK!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2d69a890-d104-4671-9802-ead7593eada2_1049x915.png 1272w, https://substackcdn.com/image/fetch/$s_!hEhK!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2d69a890-d104-4671-9802-ead7593eada2_1049x915.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!hEhK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2d69a890-d104-4671-9802-ead7593eada2_1049x915.png" width="1049" height="915" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2d69a890-d104-4671-9802-ead7593eada2_1049x915.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:915,&quot;width&quot;:1049,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:78958,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sapienloop.substack.com/i/200691224?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2d69a890-d104-4671-9802-ead7593eada2_1049x915.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!hEhK!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2d69a890-d104-4671-9802-ead7593eada2_1049x915.png 424w, https://substackcdn.com/image/fetch/$s_!hEhK!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2d69a890-d104-4671-9802-ead7593eada2_1049x915.png 848w, https://substackcdn.com/image/fetch/$s_!hEhK!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2d69a890-d104-4671-9802-ead7593eada2_1049x915.png 1272w, https://substackcdn.com/image/fetch/$s_!hEhK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2d69a890-d104-4671-9802-ead7593eada2_1049x915.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>And when compounded by time pressure, AI accuracy benefited human cognition and decision making owing to the accuracy vs. speed benefits of AI. </p><p>What is the take away from all this? In my own experience, I&#8217;ve followed the principles of &#8220;Co-intelligence&#8221; as suggested by Ethan Mollick of: inviting AI to the table, being the human in the loop, treating AI like a person and assuming it&#8217;s at the worst point in its life cycle, I have been skeptical of the results from AI and worked to ensure accuracy. However, Agentic AI is changing this entire approach from Co-intelligence to &#8220;<a href="https://co-existence.ai/">Co-existence</a>&#8221;, where increasing amounts of decision making is being delegated to the AI vs. being a human in the loop. </p><p>Knowing all of the above, Agentic Coding is a place where cognitive surrender I believe has already happened very quickly, as the agent can verify the actions that it takes through a feedback loop. And when the AI is accurate, we have less of a need to go back and look at it. As long as it &#8220;just works&#8221;, we worry less about the internals of a device. It&#8217;s easier to surrender when you don&#8217;t really have enough expertise in an area and you&#8217;re looking for answers quickly. </p><p>The agentic AI thesis is that whole companies can be put out of business quickly. Let us say that I believe there&#8217;s a market need for a search engine that doesn&#8217;t use AI, however small that market may be. To expand this market hypothesis, in a recent experiment, I asked AI to create a simple search engine using <a href="https://en.wikipedia.org/wiki/PageRank">PageRank</a> and <a href="https://en.wikipedia.org/wiki/Bloom_filter">Bloom Filters</a>.  The goal of this search engine would be to create a very simple interface without any AI assisted search. While this is just me kicking around the tires to see if &#8220;goal oriented&#8221; AI recursion can do the job for me entirely. I&#8217;ve used a lot of search engines over the years, and worked with enterprise search engines such as Elasticsearch and Solr, and while I&#8217;m not a domain expert on scaling search engines, I wondered how far I could go. It was done with Antigravity &amp; Gemini 3.5 Flash with some clear guidelines for the AI around engineering practices. I&#8217;m sure Claude or ChatGPT will claim that they could do a better job, but models and code agents are supposed to be commodities in the long run, so you can&#8217;t blame the consumer yet.</p><p>I accepted the implementation entirely. It was written in <a href="https://golang.org">Go</a>, and knowing that building the search index could take time, concurrency was a key focus. The AI knew and suggested how to build it using <a href="https://dbdb.io/db/boltdb">BoltDB</a>, a fast key-value store implementation, inverted search index, and a bloom filter with mutex locks to ensure safe database operations. I tried the search engine out, and I saw some interesting base results. Why there&#8217;s a UFO reference in a note from 2001 on XML signatures by W3.org (a standards body for the world wide web) begs some questions, but I digress.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ttlG!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffbdd35f8-a532-4e8c-b04c-639d4cacf128_2630x986.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ttlG!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffbdd35f8-a532-4e8c-b04c-639d4cacf128_2630x986.png 424w, https://substackcdn.com/image/fetch/$s_!ttlG!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffbdd35f8-a532-4e8c-b04c-639d4cacf128_2630x986.png 848w, https://substackcdn.com/image/fetch/$s_!ttlG!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffbdd35f8-a532-4e8c-b04c-639d4cacf128_2630x986.png 1272w, https://substackcdn.com/image/fetch/$s_!ttlG!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffbdd35f8-a532-4e8c-b04c-639d4cacf128_2630x986.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ttlG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffbdd35f8-a532-4e8c-b04c-639d4cacf128_2630x986.png" width="1456" height="546" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fbdd35f8-a532-4e8c-b04c-639d4cacf128_2630x986.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:546,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1515191,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sapienloop.substack.com/i/200691224?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffbdd35f8-a532-4e8c-b04c-639d4cacf128_2630x986.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ttlG!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffbdd35f8-a532-4e8c-b04c-639d4cacf128_2630x986.png 424w, https://substackcdn.com/image/fetch/$s_!ttlG!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffbdd35f8-a532-4e8c-b04c-639d4cacf128_2630x986.png 848w, https://substackcdn.com/image/fetch/$s_!ttlG!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffbdd35f8-a532-4e8c-b04c-639d4cacf128_2630x986.png 1272w, https://substackcdn.com/image/fetch/$s_!ttlG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffbdd35f8-a532-4e8c-b04c-639d4cacf128_2630x986.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!cKM0!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5b6a43b-16ac-428f-9b6a-c2d8c0d1ff5e_2618x390.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!cKM0!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5b6a43b-16ac-428f-9b6a-c2d8c0d1ff5e_2618x390.png 424w, https://substackcdn.com/image/fetch/$s_!cKM0!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5b6a43b-16ac-428f-9b6a-c2d8c0d1ff5e_2618x390.png 848w, https://substackcdn.com/image/fetch/$s_!cKM0!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5b6a43b-16ac-428f-9b6a-c2d8c0d1ff5e_2618x390.png 1272w, https://substackcdn.com/image/fetch/$s_!cKM0!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5b6a43b-16ac-428f-9b6a-c2d8c0d1ff5e_2618x390.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!cKM0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5b6a43b-16ac-428f-9b6a-c2d8c0d1ff5e_2618x390.png" width="1456" height="217" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e5b6a43b-16ac-428f-9b6a-c2d8c0d1ff5e_2618x390.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:217,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:530974,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sapienloop.substack.com/i/200691224?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5b6a43b-16ac-428f-9b6a-c2d8c0d1ff5e_2618x390.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!cKM0!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5b6a43b-16ac-428f-9b6a-c2d8c0d1ff5e_2618x390.png 424w, https://substackcdn.com/image/fetch/$s_!cKM0!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5b6a43b-16ac-428f-9b6a-c2d8c0d1ff5e_2618x390.png 848w, https://substackcdn.com/image/fetch/$s_!cKM0!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5b6a43b-16ac-428f-9b6a-c2d8c0d1ff5e_2618x390.png 1272w, https://substackcdn.com/image/fetch/$s_!cKM0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5b6a43b-16ac-428f-9b6a-c2d8c0d1ff5e_2618x390.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>Then I dove into the code a little bit, and even though I again knew not enough about the details of a search engine, I found something interesting. To understand this further, a Bloom filter ideally uses multiple hash functions, but with the <a href="https://en.wikipedia.org/wiki/Fowler%E2%80%93Noll%E2%80%93Vo_hash_function">FNV hash</a>, you can avoid having to do <code>k</code> hash functions that increase time complexity. One of the parameters of the FNV hash is a prime number. And when I looked at the code for this implementation, I saw this constant called</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!1brg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a3dfd87-ca80-40d9-8224-2b2c69298d2d_816x58.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!1brg!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a3dfd87-ca80-40d9-8224-2b2c69298d2d_816x58.png 424w, https://substackcdn.com/image/fetch/$s_!1brg!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a3dfd87-ca80-40d9-8224-2b2c69298d2d_816x58.png 848w, https://substackcdn.com/image/fetch/$s_!1brg!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a3dfd87-ca80-40d9-8224-2b2c69298d2d_816x58.png 1272w, https://substackcdn.com/image/fetch/$s_!1brg!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a3dfd87-ca80-40d9-8224-2b2c69298d2d_816x58.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!1brg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a3dfd87-ca80-40d9-8224-2b2c69298d2d_816x58.png" width="600" height="42.64705882352941" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1a3dfd87-ca80-40d9-8224-2b2c69298d2d_816x58.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:58,&quot;width&quot;:816,&quot;resizeWidth&quot;:600,&quot;bytes&quot;:16979,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sapienloop.substack.com/i/200691224?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a3dfd87-ca80-40d9-8224-2b2c69298d2d_816x58.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!1brg!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a3dfd87-ca80-40d9-8224-2b2c69298d2d_816x58.png 424w, https://substackcdn.com/image/fetch/$s_!1brg!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a3dfd87-ca80-40d9-8224-2b2c69298d2d_816x58.png 848w, https://substackcdn.com/image/fetch/$s_!1brg!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a3dfd87-ca80-40d9-8224-2b2c69298d2d_816x58.png 1272w, https://substackcdn.com/image/fetch/$s_!1brg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a3dfd87-ca80-40d9-8224-2b2c69298d2d_816x58.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>Anyone with elementary school math knows that&#8217;s not a prime number, so I asked the AI, &#8220;why is this a composite number?&#8221; and out came this answer.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!d9En!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1eccd7a-5a18-4058-af35-80a5ae35fe42_3360x534.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!d9En!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1eccd7a-5a18-4058-af35-80a5ae35fe42_3360x534.png 424w, https://substackcdn.com/image/fetch/$s_!d9En!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1eccd7a-5a18-4058-af35-80a5ae35fe42_3360x534.png 848w, https://substackcdn.com/image/fetch/$s_!d9En!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1eccd7a-5a18-4058-af35-80a5ae35fe42_3360x534.png 1272w, https://substackcdn.com/image/fetch/$s_!d9En!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1eccd7a-5a18-4058-af35-80a5ae35fe42_3360x534.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!d9En!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1eccd7a-5a18-4058-af35-80a5ae35fe42_3360x534.png" width="1456" height="231" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d1eccd7a-5a18-4058-af35-80a5ae35fe42_3360x534.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:231,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1178442,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sapienloop.substack.com/i/200691224?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1eccd7a-5a18-4058-af35-80a5ae35fe42_3360x534.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!d9En!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1eccd7a-5a18-4058-af35-80a5ae35fe42_3360x534.png 424w, https://substackcdn.com/image/fetch/$s_!d9En!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1eccd7a-5a18-4058-af35-80a5ae35fe42_3360x534.png 848w, https://substackcdn.com/image/fetch/$s_!d9En!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1eccd7a-5a18-4058-af35-80a5ae35fe42_3360x534.png 1272w, https://substackcdn.com/image/fetch/$s_!d9En!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1eccd7a-5a18-4058-af35-80a5ae35fe42_3360x534.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>The point of the above anecdote, in conjunction with cognitive surrender, is hopefully clear. Relegated decisions (cognitive surrender) such as the above (granted I&#8217;m not ready to give Google a run for their money yet) could have risks buried deep in the codebase. I&#8217;m sure a more seasoned domain expert in search probably could have more substantial findings. Spotting these could be another AI reviewer&#8217;s job, insofar as domain expertise has been extracted and embedded into an AI. With larger and larger LLMs, the hope is that it will be, but what about all the proprietary knowledge inside of orgs around domains? How much surrender has already happened inside of frontier labs like Open AI and Anthropic? No one outside could know concretely, but based on their articles, the indication significant percentage, especially if any company&#8217;s output raises by 8X in terms of software. Cognitive surrender is here, and we all have to prepare by asking ourselves questions about where we should, and where we should not. The answers to those questions will help guide people toward their purpose, and avoid building a false sense of confidence based on a surrendered self.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://sapienloop.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Sapien Loop! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Time is on my side, yes it is!]]></title><description><![CDATA[Memory, clocks and synchronization.]]></description><link>https://sapienloop.substack.com/p/time-is-on-my-side-yes-it-is</link><guid isPermaLink="false">https://sapienloop.substack.com/p/time-is-on-my-side-yes-it-is</guid><dc:creator><![CDATA[Sudhakar Velamoor]]></dc:creator><pubDate>Sun, 31 May 2026 14:56:34 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!ERkd!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F558dc647-08dd-4c8a-ad93-3fedf5988692_1616x620.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>That <em>Rolling Stones</em> song has been playing in my head recently as I&#8217;ve looked at VLAs - Vision, Language, Action models that use Vision and Language Models (VLMs) as the backbone of how to do things in the physical world. If reading origin papers is your thing, <a href="https://arxiv.org/pdf/2303.03378v1">here it is</a>. This <a href="https://www.pi.website/research/memory">video</a> from Physical Intelligence shows how time is a key factor in the physical world, and how memory is as well. </p><p>Over the past few updates of Claude / Open AI and other frontier models, memory has been a key upgrade, including inside of code agents. Common hacks earlier included explicit instruction to your agent to capture anything about your style and need back into an AGENTS or CLAUDE file. </p><p>To all VLA builders and researchers, here&#8217;s something that I hope all of these models have - a sense of time. Even if I have memory, if I don&#8217;t know that my toast needs to be done before breakfast time, there&#8217;s a breakdown. If there aren&#8217;t  any deadlines for my todos in long horizon tasks, I could keep doing them assuming people will wait for me, when they likely won&#8217;t. Every LLM, VLM, VLA type frontier model must have access to a clock on the side of the user it is serving. [As an aside, here&#8217;s a <a href="https://synthsentience.substack.com/p/what-ai-cant-read-clocks-actually">post</a> on how analog clock reading hasn&#8217;t been great across frontier models]</p><p>A clock is within arm&#8217;s reach of models. Just ask for the time and you get a nice answer: </p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ERkd!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F558dc647-08dd-4c8a-ad93-3fedf5988692_1616x620.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ERkd!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F558dc647-08dd-4c8a-ad93-3fedf5988692_1616x620.png 424w, https://substackcdn.com/image/fetch/$s_!ERkd!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F558dc647-08dd-4c8a-ad93-3fedf5988692_1616x620.png 848w, https://substackcdn.com/image/fetch/$s_!ERkd!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F558dc647-08dd-4c8a-ad93-3fedf5988692_1616x620.png 1272w, https://substackcdn.com/image/fetch/$s_!ERkd!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F558dc647-08dd-4c8a-ad93-3fedf5988692_1616x620.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ERkd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F558dc647-08dd-4c8a-ad93-3fedf5988692_1616x620.png" width="508" height="195.03571428571428" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/558dc647-08dd-4c8a-ad93-3fedf5988692_1616x620.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:559,&quot;width&quot;:1456,&quot;resizeWidth&quot;:508,&quot;bytes&quot;:64763,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://sapienloop.substack.com/i/199919607?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F558dc647-08dd-4c8a-ad93-3fedf5988692_1616x620.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ERkd!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F558dc647-08dd-4c8a-ad93-3fedf5988692_1616x620.png 424w, https://substackcdn.com/image/fetch/$s_!ERkd!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F558dc647-08dd-4c8a-ad93-3fedf5988692_1616x620.png 848w, https://substackcdn.com/image/fetch/$s_!ERkd!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F558dc647-08dd-4c8a-ad93-3fedf5988692_1616x620.png 1272w, https://substackcdn.com/image/fetch/$s_!ERkd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F558dc647-08dd-4c8a-ad93-3fedf5988692_1616x620.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p>Just the way I can ask for time, any good agentic harness should provide access to a clock, along with a running time that the agent manages within tasks. LLMs do not have a sense of time, and in that, they are somewhat eternal. The power of an LLM is that eternal ability to be present and in the moment for you. However, while that magic is good for creative pursuits that define co-intelligence, it&#8217;s not great for autonomous agentic tasking.  There&#8217;s a perfectly capable CPU clock available on the AI side of this interaction. Humans haven&#8217;t been able to tell time without clocks or bells for a long time now. We don&#8217;t make stuff up (at least not about time), but use tools, and that should be the standard for LLMs as well. </p><p>Yet, I don&#8217;t quite see evidence of this from frontier models that they use clocks well in longer horizon tasks with multi-step workflows.  Take for example, asking an LLM to prepare me for a talk that&#8217;s happening 6 months from now on VLA. There should be a pause on this from the model because it doesn&#8217;t quite make sense. Who&#8217;s the audience? How much are they willing to learn about something from 6 months ago? How much more work is going to be done in the next 6 months in VLA? The questions pile up without good answers. But give this prompt to any LLM - </p><blockquote><p>You&#8217;re an expert in Frontier models, VLMs and VLAs. Prepare me for a talk that&#8217;s happening 6 months from now on VLA and VLMs. Help me put together a story.  I need a presentation that&#8217;s current and exciting for the audience with the latest reports.&nbsp;</p></blockquote><p>and it starts to faithfully execute without any questions. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!AxkY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d4941df-c432-4e9f-820b-721f2e23e5d4_1600x804.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!AxkY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d4941df-c432-4e9f-820b-721f2e23e5d4_1600x804.png 424w, https://substackcdn.com/image/fetch/$s_!AxkY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d4941df-c432-4e9f-820b-721f2e23e5d4_1600x804.png 848w, https://substackcdn.com/image/fetch/$s_!AxkY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d4941df-c432-4e9f-820b-721f2e23e5d4_1600x804.png 1272w, https://substackcdn.com/image/fetch/$s_!AxkY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d4941df-c432-4e9f-820b-721f2e23e5d4_1600x804.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!AxkY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d4941df-c432-4e9f-820b-721f2e23e5d4_1600x804.png" width="1456" height="732" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5d4941df-c432-4e9f-820b-721f2e23e5d4_1600x804.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:732,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:164439,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sapienloop.substack.com/i/199919607?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d4941df-c432-4e9f-820b-721f2e23e5d4_1600x804.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!AxkY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d4941df-c432-4e9f-820b-721f2e23e5d4_1600x804.png 424w, https://substackcdn.com/image/fetch/$s_!AxkY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d4941df-c432-4e9f-820b-721f2e23e5d4_1600x804.png 848w, https://substackcdn.com/image/fetch/$s_!AxkY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d4941df-c432-4e9f-820b-721f2e23e5d4_1600x804.png 1272w, https://substackcdn.com/image/fetch/$s_!AxkY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d4941df-c432-4e9f-820b-721f2e23e5d4_1600x804.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>I could put a stopper on it, such as &#8220;Ask me enough questions until we&#8217;ve reached a shared understanding&#8221;.  </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Y2LM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa83f6653-95d5-4b27-bee8-e23eb1d7fdf1_1538x1462.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Y2LM!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa83f6653-95d5-4b27-bee8-e23eb1d7fdf1_1538x1462.png 424w, https://substackcdn.com/image/fetch/$s_!Y2LM!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa83f6653-95d5-4b27-bee8-e23eb1d7fdf1_1538x1462.png 848w, https://substackcdn.com/image/fetch/$s_!Y2LM!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa83f6653-95d5-4b27-bee8-e23eb1d7fdf1_1538x1462.png 1272w, https://substackcdn.com/image/fetch/$s_!Y2LM!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa83f6653-95d5-4b27-bee8-e23eb1d7fdf1_1538x1462.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Y2LM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa83f6653-95d5-4b27-bee8-e23eb1d7fdf1_1538x1462.png" width="1456" height="1384" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a83f6653-95d5-4b27-bee8-e23eb1d7fdf1_1538x1462.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1384,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:218623,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sapienloop.substack.com/i/199919607?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa83f6653-95d5-4b27-bee8-e23eb1d7fdf1_1538x1462.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Y2LM!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa83f6653-95d5-4b27-bee8-e23eb1d7fdf1_1538x1462.png 424w, https://substackcdn.com/image/fetch/$s_!Y2LM!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa83f6653-95d5-4b27-bee8-e23eb1d7fdf1_1538x1462.png 848w, https://substackcdn.com/image/fetch/$s_!Y2LM!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa83f6653-95d5-4b27-bee8-e23eb1d7fdf1_1538x1462.png 1272w, https://substackcdn.com/image/fetch/$s_!Y2LM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa83f6653-95d5-4b27-bee8-e23eb1d7fdf1_1538x1462.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The list above keeps going (a total of 20). The fact that VLA is evolving rapidly over the past year and will continue to evolve a lot more in the next 6 months is not a factor that goes into the LLM agent decision initially, until prodded. I love the eagerness of Gen AI to solve problems, but until the user and their AI are aligned, zero-shot results are going to be somewhat sub-par, and will require further iterations.</p><p>Why might the above be? LLM pre-training without associated time history of where things are located in time, not just in the vector space of tokens and words makes you feel like it&#8217;s all happening at the same time. Likely a lot of this gets addressed in post-training and RLHF, but not entirely. For the next generation of LLMs (not just VLAs), the ability to use the concept of time more natively should be key.  Loading more data and praying for emergent behavior is the same as using hope as a strategy. </p><p>There is some research in this area, and but seems pretty early stage. The most quoted <a href="https://arxiv.org/pdf/2406.09170">paper</a> seems to have examples such as the following:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!_bQU!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34ea4a60-fc04-4770-af50-4b5cdb180a6e_870x356.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!_bQU!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34ea4a60-fc04-4770-af50-4b5cdb180a6e_870x356.png 424w, https://substackcdn.com/image/fetch/$s_!_bQU!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34ea4a60-fc04-4770-af50-4b5cdb180a6e_870x356.png 848w, https://substackcdn.com/image/fetch/$s_!_bQU!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34ea4a60-fc04-4770-af50-4b5cdb180a6e_870x356.png 1272w, https://substackcdn.com/image/fetch/$s_!_bQU!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34ea4a60-fc04-4770-af50-4b5cdb180a6e_870x356.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!_bQU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34ea4a60-fc04-4770-af50-4b5cdb180a6e_870x356.png" width="870" height="356" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/34ea4a60-fc04-4770-af50-4b5cdb180a6e_870x356.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:356,&quot;width&quot;:870,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:97229,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sapienloop.substack.com/i/199919607?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34ea4a60-fc04-4770-af50-4b5cdb180a6e_870x356.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!_bQU!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34ea4a60-fc04-4770-af50-4b5cdb180a6e_870x356.png 424w, https://substackcdn.com/image/fetch/$s_!_bQU!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34ea4a60-fc04-4770-af50-4b5cdb180a6e_870x356.png 848w, https://substackcdn.com/image/fetch/$s_!_bQU!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34ea4a60-fc04-4770-af50-4b5cdb180a6e_870x356.png 1272w, https://substackcdn.com/image/fetch/$s_!_bQU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34ea4a60-fc04-4770-af50-4b5cdb180a6e_870x356.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>There&#8217;s another paper <a href="https://arxiv.org/pdf/2311.17667v2">here</a> - which has a couple of other benchmarks, but there&#8217;s a <em>lot</em> more to think about and if there&#8217;s anyone out there doing a lot of work on this, please forgive my ignorance, but these benchmarks don&#8217;t seem to go below the surface, and LLMs still don&#8217;t seem to be doing great in this area. Here are some other references to papers in this space.</p><ul><li><p><a href="https://arxiv.org/pdf/2601.05300">Paper 1</a></p></li><li><p><a href="https://direct.mit.edu/tacl/article/doi/10.1162/tacl_a_00459/110012/Time-Aware-Language-Models-as-Temporal-Knowledge">Paper 2</a></p></li><li><p><a href="https://aclanthology.org/2025.acl-short.46/">Paper 3</a></p></li></ul><p>A lot of them focus on math, reasoning and historical facts. Helpful, but how do I inject a sense of momentum into my model (not the momentum optimization inside a deep neural net, but an economic and market definition of how things build up over time).</p><p>If you think that&#8217;s not enough of an example, here&#8217;s another example of temporal reasoning failure.  I&#8217;m in Boston, commuting to the suburbs. There are advisories around any delays available publicly on the local transit site (mbta.com). And here&#8217;s my convo with Google Gemini. The local time is 6:05 pm. I&#8217;ve cut off some preambles. </p><blockquote><p><br>Gemini : the 320 pm train is delayed by 5-15 minutes and the next train is also likely delayed. Were you trying to take those?<br>Me: What timezone are you in?<br>Gemini: based on my location I&#8217;m in the Eastern Daylight Time zone. <br>Me: what time you got?<br>Gemini: 6:07 pm (mind you by this time a couple minutes passed)<br>Me: why are you telling me about trains from 320 pm when the current time is 6:07 pm?<br>Gemini: you are correct to point that out. That&#8217;s my mistake. Based on my current info - the 430 pm is delayed and the next train could be delayed as well. Are you planning to take those?<br>&#8212;- I repeat the time question again and get the same answer &#8212;<br>Me: Filter out any train timings prior to the current time. (Tool use hint)<br>Gemini: yes. Let me do that. Based on my current info - the 430 pm train is delayed and the next train is delayed as well. Are you planning to take those? <br>Me: please report to Google HQ that you didn&#8217;t do a great job on this one. <br></p></blockquote><p>This is another one from Meta AI - where the time is just there on the message for the taking - but the AI hasn&#8217;t been fed that information by default. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!_OTt!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf618c30-f247-40dc-b8d2-53ad3f96c4fd_1206x2205.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!_OTt!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf618c30-f247-40dc-b8d2-53ad3f96c4fd_1206x2205.jpeg 424w, https://substackcdn.com/image/fetch/$s_!_OTt!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf618c30-f247-40dc-b8d2-53ad3f96c4fd_1206x2205.jpeg 848w, https://substackcdn.com/image/fetch/$s_!_OTt!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf618c30-f247-40dc-b8d2-53ad3f96c4fd_1206x2205.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!_OTt!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf618c30-f247-40dc-b8d2-53ad3f96c4fd_1206x2205.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!_OTt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf618c30-f247-40dc-b8d2-53ad3f96c4fd_1206x2205.jpeg" width="336" height="614.3283582089553" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/cf618c30-f247-40dc-b8d2-53ad3f96c4fd_1206x2205.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:2205,&quot;width&quot;:1206,&quot;resizeWidth&quot;:336,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;text, letter&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="text, letter" title="text, letter" srcset="https://substackcdn.com/image/fetch/$s_!_OTt!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf618c30-f247-40dc-b8d2-53ad3f96c4fd_1206x2205.jpeg 424w, https://substackcdn.com/image/fetch/$s_!_OTt!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf618c30-f247-40dc-b8d2-53ad3f96c4fd_1206x2205.jpeg 848w, https://substackcdn.com/image/fetch/$s_!_OTt!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf618c30-f247-40dc-b8d2-53ad3f96c4fd_1206x2205.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!_OTt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcf618c30-f247-40dc-b8d2-53ad3f96c4fd_1206x2205.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>My conclusions based on what I have experienced so far and read, would be the following </p><ol><li><p>There is a significant opportunity to incorporate a user clock inside agentic LLM workflows, including a deadline, calendar and other artifacts. I understand nobody wants to be pressured, and time factor on LLMs is not really a thing because the inference speed is sufficient, but deadlines are always important for humans in the context of planning. We are still not quite there in terms of accepting zero-shotted code directly into production, and at least a question or two to understand deadlines is in order in the system prompt of these agents.  </p></li><li><p>There is a need to evaluate and benchmark not only temporal reasoning, but temporal agentic planning and action. Does temporal data not need to be part of the LLM at all? I find that very odd, especially because if you are human, you need that for sure, and if the LLM / VLM training is not incorporating that as a path to AGI, I&#8217;d be worried.</p></li><li><p>Memory is a helpful factor here especially as you remember to go back and tick off things that you planned to complete during a long horizon task, or remember the user&#8217;s preferences for all of their tasks. </p></li></ol><p>I have more ideas on this front, and will explore more in a future post. But I would like to end this with a quote from my other favorite song &#8220;Time&#8221; by <em>Pink Floyd </em></p><blockquote><p>The time is gone, the song is over<br>Thought I&#8217;d something more to say</p></blockquote><p></p>]]></content:encoded></item><item><title><![CDATA[Long Horizon Agentic Tasks and the VibeSlop onslaught]]></title><description><![CDATA[Coding agent performance on long horizon tasks has been part of every possible frontier model demo and benchmark recently.]]></description><link>https://sapienloop.substack.com/p/long-horizon-agentic-tasks-and-the</link><guid isPermaLink="false">https://sapienloop.substack.com/p/long-horizon-agentic-tasks-and-the</guid><dc:creator><![CDATA[Sudhakar Velamoor]]></dc:creator><pubDate>Thu, 28 May 2026 11:22:11 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!NErk!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5aabc0de-2940-4612-ae92-cee187f007d7_2668x642.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Coding agent performance on long horizon tasks has been part of every possible frontier model demo and benchmark recently.  &#8220;Look, ma! I created a completely new  OS kernel from scratch!&#8221; says Gemini, but also Claude and GPT.  Some success with complex task planning and execution on that plan over significant periods of time without supervision has gained a lot of traction. Ralph loops, agentic loops and other methods of keeping things going through a feeder system of either supervised or unsupervised product backlogs have been studied since the first release of Claude Code. Some trends are emerging.  Strong code agent harnesses are one such trend. The right tooling to interact with your environment (Bash, Read, Write, Git etc) while going through the tasks with a strong feedback loop has helped see real gains. Objectives, reflections (thinking) and feedback loops make it more likely that the open ended task that the agent and the LLM are handed is achievable. But does this produce VibeSlop (Vibe Coding + AI slop)? And if it does, how do we control it? What happens when the volume of code suddenly explodes to a level that it takes much longer to review the code than it does to write using AI? I have more questions than answers, but likely everyone does as well  </p><p>Building software is a domain driven exercise, apart from the core algorithms that were invented and continue to get invented over time. Here&#8217;s a potential framework that can help with VibeSlop trickle based on generated code classification. We can broadly classify such AI generated code into</p><ul><li><p><em><strong>The Muscle Memory Snippet</strong></em>: Code generated to help with testing, organizing and viewing existing artifacts as well as creating them. This is baffling at first but there&#8217;s so much code in tool use while an agent is running that it contributes a significant amount. Example below of a session.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!NErk!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5aabc0de-2940-4612-ae92-cee187f007d7_2668x642.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!NErk!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5aabc0de-2940-4612-ae92-cee187f007d7_2668x642.png 424w, https://substackcdn.com/image/fetch/$s_!NErk!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5aabc0de-2940-4612-ae92-cee187f007d7_2668x642.png 848w, https://substackcdn.com/image/fetch/$s_!NErk!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5aabc0de-2940-4612-ae92-cee187f007d7_2668x642.png 1272w, https://substackcdn.com/image/fetch/$s_!NErk!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5aabc0de-2940-4612-ae92-cee187f007d7_2668x642.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!NErk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5aabc0de-2940-4612-ae92-cee187f007d7_2668x642.png" width="1456" height="350" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5aabc0de-2940-4612-ae92-cee187f007d7_2668x642.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:350,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1107947,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://sapienloop.substack.com/i/199210213?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5aabc0de-2940-4612-ae92-cee187f007d7_2668x642.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!NErk!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5aabc0de-2940-4612-ae92-cee187f007d7_2668x642.png 424w, https://substackcdn.com/image/fetch/$s_!NErk!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5aabc0de-2940-4612-ae92-cee187f007d7_2668x642.png 848w, https://substackcdn.com/image/fetch/$s_!NErk!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5aabc0de-2940-4612-ae92-cee187f007d7_2668x642.png 1272w, https://substackcdn.com/image/fetch/$s_!NErk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5aabc0de-2940-4612-ae92-cee187f007d7_2668x642.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p>Proposal:  There maybe opportunities to cache and reuse snippets, but prompt caching is likely a strong enough path. I call it muscle memory because this is something that engineers build into their workflow and some of them get really, really good at it. </p></li><li><p><em><strong>The Bespoke App Factory</strong></em>: Planning a trip? A new travel itinerary visual can help. Why communicate with docs when you can with Apps? And I don&#8217;t mean that in any sarcastic way, but when Apps are cheap, you can generate them on-demand for the right impact. Just be careful to not overdo this.</p><p></p><p>Proposal:  Given the token count spent and the human effort to get these artifacts generated (pretty low end), you may think about throwing them away after they&#8217;re useful life expires. The really good ones, like a flip book of pictures from a family trip or some well researched notes, maybe AI fodder for <em><strong>long term memory</strong></em>. </p></li><li><p><em><strong>The Consultant Prototype</strong></em>: An initial full prototype that&#8217;s been vibe coded using ad hoc prompts and some skills. An agent harness may or may not be used, and the user is iterating on an idea. There are a <em>lot</em> more of these now than 1 year ago, and growing exponentially.</p><p></p><p>Proposal:  Since the above is largely demo and illustration material, it could be used by code agents as a guide for building the next revision. If a prototype is really high quality (and I haven&#8217;t seen this happen personally), it is something that can become a reference implementation. VibeSlop creep is very high here, so be prepared to throw this out, no matter how much time might be invested in this. There could be value in extracting the conversation bits for developing new product requirements. </p></li><li><p><em><strong>The Agentic New Product</strong></em>: A strong initial product from scratch, which has been specified by using &#8220;grill-me&#8221; skills, specs, full fidelity UX wireframes in Figma, PRD, TRD, architecture guidelines and so on. Tasks specified by a user down to quite a bit of detail. </p><p></p><p>Proposal:  This is the future of software engineering as I see it. A lot of reusable skills, process guidelines, security requirements and the organizational knowledge gets extracted and placed in the hands of the machine and needs to be referenced all the time. While Vibeslop can be reduced by providing much clearer guidelines, beware the tech stack and the domain where this is applied. A lot more training data maybe needed to ensure success if there&#8217;s a completely new paradigm being figured out. As a long running process, context pollution, rot and confusion can take over here unless you break the work into atomic units. All agile software principles apply here along with expertise on how bigger chunks of work get broken down into smaller ones.</p></li><li><p><em><strong>The Agentic Maintainer</strong></em>: Existing production app where updates are made incrementally as features, bugs, hotfixes and so on.</p><p></p><p>Production Updates for features should ideally follow the Agentic Product Dev experience above. For bug fixes and hotfixes, you need some serious human in the loop especially around the domain that your software operates in. But what happens when the original code was written by the machine? Could it be VibeSlop? Is VibeSlop unique to machines? I personally don&#8217;t believe so. The lessons from human software builds over decades apply here. Ongoing code audits, scans and levels of vigilance that you use to manage growth phase companies are significantly more valuable when evaluating machine coded software. There will be a lot of companies that emerge in this area, but the incumbents who do these scans and the principles behind them stay. The great news about agentic AI coded apps is if you had the foresight, you could have documented every decision to the last bit of detail. But even lacking that, LLMs and code agents are great for understanding any code base and explaining that to you in whatever form that suits you (diagrams, live apps, full on profiling and dependency tool builds). I strongly recommend that PR review comments add the right organizational context for such bug and hot fixes.</p></li></ul><p>The great news is that newer variants of Claude Code, GitHub Copilot, Codex, Antigravity, Pi and others provide quite a bit of the above principles in the harness (planning, tasking, implementation against tasking, TDD and other such goal setting and achieving). But also, using agents for a first pass on the code reviews is a great way to help scale up some of the work. Also when you have access to multiple models, use one model to review another model&#8217;s work.</p><p>2025 was the first year of agentic AI in the mainstream. We have to assume that by that definition, these code agents today are the worst in their jobs, and they will improve exponentially. World models could be a new way to assess consequences prior to diving into implementation, but also needs significant domain knowledge transfer to machines. So while VibeSlop is a thing out there, applying the framework above for decisions and maintaining a strong set of organizational hierarchical rules for code agents (<code>CLAUDE.md</code> or <code>AGENTS.md)</code> will help cut it down in the longer term. </p>]]></content:encoded></item><item><title><![CDATA[Agentic Search]]></title><description><![CDATA[Search as a phenomenon has been key to online human behavior for almost 3 decades, and while it may happen in the background now, the quest for pulling something up and having it in your fingertips is not going away.]]></description><link>https://sapienloop.substack.com/p/agentic-search</link><guid isPermaLink="false">https://sapienloop.substack.com/p/agentic-search</guid><dc:creator><![CDATA[Sudhakar Velamoor]]></dc:creator><pubDate>Sun, 24 May 2026 23:47:20 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!CUl7!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff83f393d-8082-4398-a74d-6072e30c3a93_608x608.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Search as a phenomenon has been key to online human behavior for almost 3 decades, and while it may happen in the background now, the quest for pulling something up and having it in your fingertips is not going away.  When searching for facts, humans need clear, concise answers quickly.  With Agentic Search, that interaction could expand to create a <em>Curb your enthusiasm</em> moment of &#8220;stop and chat&#8221;,  where people have to wonder if there&#8217;s indeed going to be an expanded follow up to a simple question of &#8220;What&#8217;s the capital of France?&#8221; that they need to wait for.  The follow up at the tail of every Google AI mode answer or a ChatGPT interaction is akin to a teaser to the next video in a feed and the area where ad monetization could play a key role. And while this is great for business, it could re-create some of the doom-scrolling challenges of the past with social media addiction and attention grab. Agentic &#8220;everything&#8221; is probably the way forward, but searching for facts should be a quick endeavor, and I hope that it stays that way.</p>]]></content:encoded></item></channel></rss>