<?xml version="1.0" encoding="utf-8"?><feed xmlns="http://www.w3.org/2005/Atom" ><generator uri="https://jekyllrb.com/" version="3.10.0">Jekyll</generator><link href="https://anirudharamesh.github.io/feed.xml" rel="self" type="application/atom+xml" /><link href="https://anirudharamesh.github.io/" rel="alternate" type="text/html" /><updated>2026-04-07T00:29:21-04:00</updated><id>https://anirudharamesh.github.io/feed.xml</id><title type="html">Anirudha Ramesh</title><subtitle>Research Engineer at InstaDeep specializing in AI, Machine Learning, Genomics, Drug Discovery, and Robotics. Sharing insights on AI research and applications.</subtitle><author><name>Anirudha Ramesh</name><email>anirudha.ramesh50@gmail.com</email></author><entry><title type="html">The Birth of a New Era of Distributed Collective Intelligence</title><link href="https://anirudharamesh.github.io/blog/collective-distributed-autoresearch/" rel="alternate" type="text/html" title="The Birth of a New Era of Distributed Collective Intelligence" /><published>2026-03-25T00:00:00-04:00</published><updated>2026-03-25T00:00:00-04:00</updated><id>https://anirudharamesh.github.io/blog/collective-distributed-autoresearch</id><content type="html" xml:base="https://anirudharamesh.github.io/blog/collective-distributed-autoresearch/"><![CDATA[<p>Flexible compute usually means scaling up. More GPUs, bigger clusters, elastic cloud provisioning. The assumption is that more hardware = better results, and that the way to participate meaningfully is to have access to serious resources.</p>

<p>But there’s another dimension to flexibility that gets overlooked. Not just scaling the hardware, but scaling the <em>effort</em> across people and machines, regardless of what each one has.</p>

<p>When you’re optimizing something, a model, a system, a process, you’re usually doing it alone, on whatever hardware you have. But what if you could share the problem with others, pool your efforts, and optimize together? Each person’s improvements become starting points for the next, regardless of what machine they’re running on.</p>

<p>Not distributed compute in the traditional sense, but distributed <em>optimization</em>. A shared candidate pool where anyone can pull a problem, improve it, and push the result back. The system routes effort to wherever it’s most productive, and everyone’s work compounds.</p>

<hr />

<p>I built an initial framework for this and tested it on OpenAI’s <a href="https://openai.com/index/parameter-golf/">Parameter Golf</a> challenge, deliberately on a M4 MacBook Air (16GB), the most constrained Apple Silicon, as opposed to the intended 8xH100 setup. If the idea holds, the hardware shouldn’t matter much.</p>

<p>The setup: share a pool of candidate solutions. An AI researcher agent runs an autoresearch loop. A bandit selects which candidate to improve, trains it for 10 minutes, evaluates, keeps improvements, reverts regressions. Periodically scan for new candidates from other participants and add them to the pool. Share improved configs back.</p>

<p>Everyone’s work compounds.</p>

<p><img src="/images/parameter-golf/collective_final_v2.png" alt="System Diagram" /></p>

<p>84 experiments over ~14 hours, across 14 candidates, 6 of which were ported directly from other participants’ submissions. It was the best submission in its compute category at time of submission.</p>

<hr />

<p>Three phases emerged naturally, without being designed for:</p>

<ol>
  <li>
    <p><strong>Environment-specific fixes</strong> (3.2 → 2.1 bpb). The baseline was configured for H100s. The warmdown schedule was decaying the learning rate from step 1. Fixing this single parameter was worth -0.98 bpb, the biggest win of the entire run. Reducing batch size to fit more training steps in the wallclock budget was the second.</p>
  </li>
  <li>
    <p><strong>Hyperparameter tuning</strong> (2.1 → 1.98 bpb). Learning rate, momentum, sequence length. Systematic but diminishing returns.</p>
  </li>
  <li>
    <p><strong>Architecture changes</strong> (1.98 → 1.93 bpb). SwiGLU activation and mirrored recurrence (weight sharing across layers) broke through the hyperparameter plateau.</p>
  </li>
</ol>

<p><img src="/images/parameter-golf/pr_optimization_journey.png" alt="Optimization Journey" />
<em>Scores from partial validation used during iteration for speed. Full eval: val_bpb=1.9263.</em></p>

<hr />

<p>The most interesting result wasn’t any single technique. It was cross-pollination.</p>

<p>SwiGLU was discovered on one candidate. Mirrored recurrence was found on another, ported from <a href="https://github.com/openai/parameter-golf/pull/84">PR #84</a>. Neither was the best on its own. Combining them on a third candidate beat both.</p>

<p><img src="/images/parameter-golf/technique_matrix.png" alt="Technique Matrix" /></p>

<p>The winning candidate was the only one combining <em>all</em> the winning techniques, and it found them by having access to a diverse pool where good ideas could be identified and recombined. You don’t need a single brilliant run. You need a diverse pool where improvements from different sources compose.</p>

<hr />

<p>One person, one MacBook, overnight. But there’s nothing about it that requires this.</p>

<p>If multiple people shared their optimization problems into collective pools, each pulling from and contributing back, improvements would compound across participants. You don’t even need the same hardware. Someone on a laptop explores one part of the space, someone on a cluster explores another, and the system routes effort to wherever it’s most productive.</p>

<p>That’s what flexible compute should actually mean. Not elastic provisioning of more of the same, but a system where different people, different hardware, and different ideas compound into something none of them would reach alone.</p>

<hr />

<p>Working on some immediate improvements to this, useful here and importantly beyond:</p>

<ol>
  <li>Evolving meta-strategy, hyperagents-style, where the search strategy itself adapts.</li>
  <li>Actively partitioning agent effort to suit strengths and reduce token use. Hyperparameter optimization, for instance, need not be agent-driven when other HPO methods are stronger and more efficient in this search.</li>
  <li>Expanding and testing the collective candidate pool with other players and collaborators, who can have their own self-improvement setups.</li>
</ol>

<p>Full submission with all configs: <a href="https://github.com/openai/parameter-golf/pull/597">PR #597</a></p>

<hr />

<h3 id="appendix-what-worked-and-what-didnt-parameter-golf">Appendix: What worked and what didn’t (Parameter Golf)</h3>

<p><strong>Worked:</strong> smaller batches = more steps (the single most important insight for constrained hardware), mirrored recurrence (4 unique blocks reused across 8 layers, halves parameters while maintaining effective depth), SwiGLU over relu² (consistent improvement across every candidate), and the bandit itself (correctly concentrated 41 of 84 experiments on the eventual winner).</p>

<p><strong>Didn’t work:</strong> deeper models (too slow per step), very small batches (gradient noise outweighs step count benefit), BigramHash and ValueResidual (added complexity without clear gain on constrained hardware).</p>]]></content><author><name>Anirudha Ramesh</name><email>anirudha.ramesh50@gmail.com</email></author><category term="AI" /><category term="Machine Learning" /><category term="Distributed Learning" /><category term="Autoresearch" /><summary type="html"><![CDATA[Flexible compute, redefined.]]></summary></entry><entry><title type="html">Background Everything: The Missing Predictive Coding and Caching Layer in AI</title><link href="https://anirudharamesh.github.io/blog/background-everything/" rel="alternate" type="text/html" title="Background Everything: The Missing Predictive Coding and Caching Layer in AI" /><published>2026-03-01T00:00:00-05:00</published><updated>2026-03-01T00:00:00-05:00</updated><id>https://anirudharamesh.github.io/blog/missing-predictive-coding-caching-layer</id><content type="html" xml:base="https://anirudharamesh.github.io/blog/background-everything/"><![CDATA[]]></content><author><name>Anirudha Ramesh</name><email>anirudha.ramesh50@gmail.com</email></author><category term="Upcoming" /><summary type="html"><![CDATA[]]></summary></entry><entry><title type="html">We Solved Execution. Now What?</title><link href="https://anirudharamesh.github.io/blog/we-solved-execution-now-what/" rel="alternate" type="text/html" title="We Solved Execution. Now What?" /><published>2026-02-20T00:00:00-05:00</published><updated>2026-02-20T00:00:00-05:00</updated><id>https://anirudharamesh.github.io/blog/we-solved-execution-now-what</id><content type="html" xml:base="https://anirudharamesh.github.io/blog/we-solved-execution-now-what/"><![CDATA[<p>Yes yes, solved is generous. But close enough that the real constraint has shifted. People on my team and in my circles who’d spend a lot of time writing code to implement new things, scaffolding projects, processing data and logs, and managing setup and deployments barely have to anymore.</p>

<p>If I had to distill AI progress into a single formula, it would have more variables than most people are paying attention to (and perhaps even more than I’ve put down here).</p>

<p>y here represents meaningful AI-driven output, how quickly human ambition translates into real results.</p>

<div class="language-plaintext highlighter-rouge"><div class="highlight"><pre class="highlight"><code>y = f(compute, hardware, energy, execution, tasks, feedback)
Δy ≤ min(Δcompute, Δhardware, Δenergy, Δexecution, Δtasks, Δfeedback)
</code></pre></div></div>

<p>To hit exponentiation in output y, you need all inputs to scale. Let any one stagnate and y is capped. For a long time, the first three dominated the conversation. More GPUs, faster chips, more energy. Reasonable, since these were clearly the constraints being hit. But buried in that formula are variables that’ve been quietly left alone. Feedback loops, for one. Some are already instant, test cases pass or fail the moment code runs. But much of it, deployment, distribution, user behavior, still depends on time. And then there are tasks.</p>

<hr />

<p>Historically, execution was done by people. Software engineering, financial modeling, research, the things that took time because they required humans to do them. Project timelines, even with every other input in place, were bottlenecked by engineering hours. People were the execution units, and they didn’t scale easily.</p>

<p>That bottleneck is breaking. Since the tail end of 2025, and more concretely the start of 2026, agentic AI has matured to the point where Claude Code running on Opus 4.5, and recent versions of Codex running on GPT’s latest models, can be considered genuine executors of work. Setting up agentic orchestration workflows is perhaps the highest-delta effort-per-time investment one can make right now, if they haven’t already.</p>

<p>Once set up, the game shifts to context management and parallelism. Executors can own portions of workflows on their own. And while we aren’t yet in a world where human validation isn’t needed, we are certainly at the edge of one where human <em>micro-validation</em> isn’t. How often are SWEs who’ve adopted AI tools early actually peering into the code their AI writes, compared to six months ago?</p>

<p>It is clear that we are confronted with a new reality of execution units being more available, in a flexible and easily scalable manner. This is no longer a reality of tomorrow. It is a reality of today.</p>

<p>Humans don’t primarily need to be tasked with executing “tasks” anymore, but rather leading purposes.</p>

<hr />

<p>And yet, at a policy and social level, the conversation has barely moved past compute, hardware, and energy. Task scaling was left at the side stage, perhaps because we did not anticipate needing to worry about it so soon. Now that we actually have the executors, this gap is hard to ignore. Clear paths on how to find new roots, new core ideas, new tasks, aren’t obvious, and effort here remains minimal at the corporate level.</p>

<p>There are of course bastions within certain frontier labs who’ve been advocating for this. I was lucky to attend Tim Rocktäschel’s talk at ICLR last year, where he argued for moving away from fixed-objective optimization entirely, towards open-ended systems: ones that generate an endless sequence of novel stepping stones rather than converge on a predefined goal. The automation of innovation itself. Other “AI-scientist” companies form similar theses as well, but they’re all largely focused towards scientific acceleration, and not meeting today’s corporate world where it is.</p>

<p>Back then, despite finding this fascinating, I did not anticipate the urgency of innovation needed here. The world looks very different now.</p>

<hr />

<p>This urgency is two-fold.</p>

<p>If we are to continue pushing along the exponential, and if we believe it to be existentially important to do so, task scaling is essential. This is a mid-to-long-term issue.</p>

<p>Our current social order will also likely be challenged, and massively so, if we fail to task scale. Companies that can’t scale their outputs proportionally to their reduced reliance on human execution are left with more people than they have work for. This is an immediate concern and calls for urgent action.</p>

<hr />

<p>The path worth choosing is scaling tasks with people involved, not around them.</p>

<blockquote>
  <p>“Every job has tasks and a purpose. AI automates tasks, but it can’t replace your purpose. A software engineer’s purpose isn’t to write code; it’s to solve problems. The more time AI saves on coding, the more time engineers have to find the problems worth solving.” — Jensen Huang, 2026 <em>(slightly paraphrased)</em></p>
</blockquote>

<p>This is the crux of it. Task scaling alone, pointing more AI at more work, is not enough. The other side of the coin is scaling the breadth of human purpose in parallel. As AI absorbs execution, the surface area of what people choose to pursue needs to expand with it.</p>

<p>If it doesn’t, what we’re left with looks less like abundance and more like displacement.</p>

<p>Task scaling without purpose scaling is just unemployment. Purpose scaling without task scaling is idealism with no execution behind it. These two things have to rise together.</p>

<p>This is also likely to win in the short term, and it seems to be the path of least resistance towards abundance without requiring radical social reimagination. A world where more people are leading broader purposes, enabled by AI to execute more of what they imagine, is a more interesting world to be in.</p>

<p>Execution was always just the step between a good idea and seeing it matter. We now have more execution capacity than at any point in history. The question is whether we can generate enough good ideas, enough tasks, enough purposes, to match it.</p>

<p>Now it is time to solve for purpose.</p>]]></content><author><name>Anirudha Ramesh</name><email>anirudha.ramesh50@gmail.com</email></author><category term="AI" /><category term="Execution" /><category term="Purpose" /><category term="Task Scaling" /><summary type="html"><![CDATA[AI solved execution. The bottleneck now is purpose.]]></summary></entry><entry><title type="html">Should Robots Be Able to Synthesize Aids on the Fly? Tool Synthesis as a Regularizer</title><link href="https://anirudharamesh.github.io/blog/tool-synthesis-as-regularizer/" rel="alternate" type="text/html" title="Should Robots Be Able to Synthesize Aids on the Fly? Tool Synthesis as a Regularizer" /><published>2026-01-18T00:00:00-05:00</published><updated>2026-01-18T00:00:00-05:00</updated><id>https://anirudharamesh.github.io/blog/tool-synthesis-as-regularizer</id><content type="html" xml:base="https://anirudharamesh.github.io/blog/tool-synthesis-as-regularizer/"><![CDATA[<blockquote>
  <p><em>When the world is too diverse to learn, change the world.</em></p>
</blockquote>

<p>In robotics, there is a lot of emphasis on learning to live and adapt in the world we’ve created. This involves learning to generalize to all the structures we function in, and the various tools we utilize. This makes a lot of sense, as after all, the world is already built, and learning to live in it is ‘simpler’ than reinventing it.</p>

<p>However, there is a well known long-tail in learning that applies to both visual / world understanding, and operation. Robots are expected to generalize across a combinatorial space of tasks, tools, and environments, many of which are rare but operationally important.</p>

<p>For example,</p>

<blockquote>
  <p>A robot struggles to reliably read a reflective gauge or poorly lit indicator. Instead of learning to perceive across extreme lighting and material conditions, it fabricates a clip-on diffuser or alignment jig that constrains the viewing geometry.</p>
</blockquote>

<p>Or, in manipulation:</p>

<blockquote>
  <p>A robot needs to turn a small, recessed dial using an end-effector that lacks sufficient grip or clearance. Rather than learning a brittle, high-precision manipulation behavior, it fabricates a simple adapter that converts the dial into a familiar, easy-to-grasp geometry.</p>
</blockquote>

<p>What if instead of learning to adapt to a breadth that includes this long-tail in its totality, we can regularize and simplify this learning task?</p>

<p>One way to address this is to shift some of the burden away from learning itself. Rather than requiring a robot to master every rare interaction, we can allow it to create simple aids that transform unfamiliar tasks into familiar ones.</p>

<p>This is not just a practical observation, but a principled one. In open-world robotics, learning requires generalizing over a combinatorial explosion of task × tool × environment. The No Free Lunch theorem tells us that without restricting this space, broad generalization is impossible.
Tool synthesis introduces such an assumption. It assumes that many hard or rare tasks can be transformed, via simple physical interventions, into a smaller set of familiar interactions. This does not make learning universally easier; it deliberately narrows the class of problems the robot aims to solve. In doing so, it trades breadth for structure, reducing the effective space the robot must learn over.</p>

<p>A simple human analogy makes this clear. If you want to dust the ceiling but don’t have a duster long enough to reach it, you can either balance atop a stool and perform a delicate maneuver, or you can build an attachment that extends the duster. The latter simplifies the task by changing the tool, not the behavior.</p>

<p>The ability to synthesize tools or aids in this fashion can act as a regularizer over the behaviors robots need to learn, simplify learning while allowing them to adapt effectively.</p>

<p>Among the ways a robot might modify its environment, on-demand fabrication is particularly appealing because it allows task-specific structure without requiring an exhaustive inventory of tools. With this notion, access to 3D-printing seems natural, even essential to unlocking a robot’s capabilities.</p>

<p>The idea of 3D printing structures, both large and small, is not new. It has been explored in contexts ranging from construction, to self-assembling robotic systems, to aiding efforts to conquer the stars via space-based manufacturing, such as NASA’s Archinaut project, a now shelved exploratory effort aimed at in-space fabrication and assembly.</p>

<p>The role in commercial robotics has however perhaps been skimmed past. It’s time to re-evaluate this, and incorporate it into how we attempt to learn and scale robot learning.</p>

<p>Tool synthesis should be treated not as an edge case, but as a core capability alongside perception, planning, and control.</p>]]></content><author><name>Anirudha Ramesh</name><email>anirudha.ramesh50@gmail.com</email></author><category term="Robotics" /><category term="Machine Learning" /><category term="Tool Synthesis" /><category term="3D Printing" /><category term="AI" /><summary type="html"><![CDATA[Exploring how on-demand tool fabrication can regularize robot learning by transforming rare, difficult tasks into familiar interactions—reducing the combinatorial complexity of open-world robotics.]]></summary></entry><entry><title type="html">Let there be cute robots! The sillier path for the creation of data flywheels in robotics</title><link href="https://anirudharamesh.github.io/blog/cute-robots-data-flywheels/" rel="alternate" type="text/html" title="Let there be cute robots! The sillier path for the creation of data flywheels in robotics" /><published>2026-01-17T00:00:00-05:00</published><updated>2026-01-17T00:00:00-05:00</updated><id>https://anirudharamesh.github.io/blog/cute-robots-data-flywheels</id><content type="html" xml:base="https://anirudharamesh.github.io/blog/cute-robots-data-flywheels/"><![CDATA[<blockquote>
  <p><em>Data is the fossil fuel of AI.</em>
~ Ilya Sutskever, 2024</p>
</blockquote>

<p>We know how the story goes for LLMs for text, VLMs and Large Vision Models with images and videos. Audio is and has been generated en-mass via call centers, conferences, and as an accompaniment of videos.</p>

<p>By and large, across these domains, there’s both a large quantity and a large variation in domains, and form factors, enabling <em>general purpose</em> learning.</p>

<p>Robotics is different.</p>

<h3 id="the-strange-case-of-robotics">The strange case of robotics</h3>

<p>Robotics needs a combination of,</p>

<ol>
  <li><strong>World “Understanding”</strong> involving
    <ul>
      <li>physics</li>
      <li>object-world and object-object relations</li>
    </ul>
  </li>
  <li><strong>Perception</strong> within the context of understood world</li>
  <li><strong>Ability to plan and interact</strong> with <em>its world</em> subject to <em>its own form factor</em> (i.e. “affordances”)</li>
  <li><strong>Task Understanding / Instructional Directive</strong></li>
</ol>

<p><strong>The promise of autonomous driving.</strong> With Waymos now taking around a third of trips in San Francisco, looking through the lens of autonomous driving may give us idea on how to build these pipelines for general purpose robotics.</p>

<p>However, even upon a high-level inspection, it is clear that going down the autonomous vehicle path would require big big $$$</p>

<p>AV companies spend billions in developing and over a decade of commercial effort (collecting millions of hours of first-person driving videos along the way) to develop the first commercially viable iterations of their product.</p>

<p>This is in-spite the fact that this problem is tightly constrained as compared to general purpose robotics. Here,</p>

<ol>
  <li><strong>World understanding</strong>: road videos encode expected behavior, constraints, and predictable actions</li>
  <li><strong>Perception</strong>: video is specifically suited to roads, often even to specific cities via progressive rollouts</li>
  <li><strong>Form factor</strong>: fixed. It is always a car, operating in a known environment</li>
  <li><strong>Tasks and instructional directives</strong>: fixed and well-defined</li>
</ol>

<p>To <em>solve</em> autonomous driving, we constrain both the form-factor, and the world it’ll operate in.</p>

<p>General purpose robotics however necessitates loosening these constraints. To this end, we now need data that will help with general world understanding, as well as the ability to plan and interact with a general form factor.</p>

<p>i.e.,</p>

<blockquote>
  <p>f_av(S) = a</p>
</blockquote>

<blockquote>
  <p>⇒ f_general(S, self) = a</p>
</blockquote>

<p>Fortunately, for the large part we already have the data needed for world understanding through the internet.</p>

<p>What we lack is data to learn the ability to plan and interact within a robot’s own form factor, and to adapt to new instructional directives.</p>

<p>If we go down the AV path, we’d collect millions of hours of data, across a range of form factors, across numerous environments and different people / methodologies of instructional directive (which is very difficult to direct centrally).</p>

<p>This requires massive capital investment, and akin to the promise of autonomous driving, lays the path towards general purpose robotics on a long, familiar, and expensive trajectory.</p>

<p>This capital burden likely needs to be taken up by the companies trying to create these robots, and not the general consumer, as if a consumer is sold a serious robot for some task, the consumer would want it to work well. On the flip side,</p>

<p><em>It’s okay for a robot to not be very useful, as long as it does not cost a lot!</em></p>

<hr />

<h2 id="the-road-less-traveled-what-if-we-could-do-this-some-other-way">The road less traveled. What if we could do this some other way?</h2>

<p>I take inspiration from some consumer trends from the past few years,</p>

<ol>
  <li>
    <p><strong>Explosion of labubus and collectible figurines.</strong></p>

    <p>There is great appetite in the consumer market for ‘cute’ collectibles — with people willing to pay $100s for securing the right collectible (which costs $ to make), which by itself does nothing but look cute. The collectibles differ in a range of ways from just primarily color (labubus) to differing poses (smiskis) to completely different form factors.</p>
  </li>
  <li>
    <p><strong>Pokemon Go.</strong></p>

    <p>Consumers enjoy community gaming and competition, even when interlaced with the physical world. ‘Training’ is gamified.</p>
  </li>
</ol>

<p>Bringing these together, I believe we could spark this data flywheel with significantly lesser investment from creators (costs shared with consumers right from the start) with cheap, small, cute robots styled as collectibles that have some capacity to learn / interact from human instructional directive through gamified ‘training’.</p>

<p>This also pushes away from the burden of robots needing to immediately be useful, with the primary purpose here just being cute and adequately responsive to human interaction.</p>

<p>Gamified training can allow both free form instructional directive (through vision, audio, text, some specific software) in general environments humans operate in, as well as specific tasks / subtasks (jacket on-off / sweeping from karate kid) designed to generate learning specific skills.</p>

<hr />

<h3 id="a-reasonable-skepticism">A reasonable skepticism</h3>

<p>Choosing to walk down this road comes with challenges. It is nontrivial to create robots that are sufficiently cheap, meaningfully diverse in form factor at scale, while still being capable of learning or responding well enough to make the interaction engaging. Human-provided feedback, even when gamified, is noisy and inconsistent. Physical interaction also does not scale as easily as digital signals.</p>

<p>There is also the question of transfer. What, if anything, carries over from small, toy-like robots to more capable systems is not guaranteed.</p>

<p>This raises a deeper question of whether physical instantiation needs to be the primary source of data at all. Perhaps most learning can happen in virtual or simulated settings, with occasional physical embodiments serving to calibrate the sim-to-real gap rather than define the data distribution.</p>

<p>The bet here is not that this path avoids these challenges, but that it offers a cheaper, more iterable way to explore them than the capital-intensive routes robotics has historically relied on.</p>

<hr />

<p>Regardless, the path less traveled is appealing because we can still run this experiment many a times over and still not have to invest even a fraction of what we had to within AV.</p>

<p>Besides, wouldn’t it be a wonderful to live in a world with cute, silly robots?</p>]]></content><author><name>Anirudha Ramesh</name><email>anirudha.ramesh50@gmail.com</email></author><category term="Robotics" /><category term="Machine Learning" /><category term="Data Collection" /><category term="Consumer Robotics" /><category term="AI" /><summary type="html"><![CDATA[What if instead of billion-dollar autonomous driving approaches, we could kickstart robotics data collection with cheap, cute collectible robots that consumers actually want to interact with?]]></summary></entry><entry><title type="html">Learning to Detect by Learning to Predict</title><link href="https://anirudharamesh.github.io/posts/2012/08/Learning-to-Detect-by-Learning-to-Predict/" rel="alternate" type="text/html" title="Learning to Detect by Learning to Predict" /><published>2022-05-20T00:00:00-04:00</published><updated>2022-05-20T00:00:00-04:00</updated><id>https://anirudharamesh.github.io/posts/2012/08/future-post</id><content type="html" xml:base="https://anirudharamesh.github.io/posts/2012/08/Learning-to-Detect-by-Learning-to-Predict/"><![CDATA[<p>The ability to identify and localize objects in an environment by humans and other animals is powered by the agent’s ability to predict where an object is likely to be in the environment. We explore the use of this idea as a prior to improve detection of objects in the scene, which can be seen as the conditional posterior.</p>

<p>Please find the entire blog [here] (https://sites.google.com/andrew.cmu.edu/vlr-ldlp/).</p>]]></content><author><name>Anirudha Ramesh</name><email>anirudha.ramesh50@gmail.com</email></author><category term="project" /><summary type="html"><![CDATA[The ability to identify and localize objects in an environment by humans and other animals is powered by the agent’s ability to predict where an object is likely to be in the environment. We explore the use of this idea as a prior to improve detection of objects in the scene, which can be seen as the conditional posterior.]]></summary></entry></feed>