跟读练习: Most devs don't understand what agents are - 通过YouTube学习英语口语

OpenAI just introduced AgentKit, a complete set of tools for developers and enterprises to build, deploy and optimize agents.

⏸ 已暂停

速度:

106 句

如果句子过短或过长，请点击 Edit 进行调整。

OpenAI just introduced AgentKit, a complete set of tools for developers and enterprises to build, deploy and optimize agents.

This is cool, fairly exciting announcement, developers can now design workflows visually, embed agentic UIs faster.

Now I don't really care about this product, I'm not sure I'll end up using it really.

But what I am kind of interested in is how they're talking about agents.

Because if we look at this, this is not an agent to me.

We have here a set of deterministic steps.

We start, then we enter a jailbreak guardrail, which by the way just filters for malicious inputs I assume.

Then we have another LLM call here which just routes the input to one of three separate agents.

This to me is not an agent builder, this is a workflow.

I thought we as a kind of AI engineering typescripty community had landed on some definitions for what agents were

and what workflows were.

But it turns out no we didn't because OpenAI seems to have a different definition from the one that we've been using.

I want to talk through this debate so that you understand what agents are, what workflows are and why the distinction even matters.

And the best place to start is with Anthropic's famous article building effective agents.

This came out in December last year and it basically codified what an agent was and what a workflow was.

This is how Anthropic defines an agent, it's essentially a loop.

We'll talk more about what this loop is in a minute and what makes this agentic. But

if we zoom up to a workflow example here we can see a very similar example to what OpenAI just put out.

Instead of a loop here we have a directional flow, we have predetermined code paths.

And this is what Anthropic calls a workflow.

And it's kind of funny too that this famous article really talking about, you know, titled building effective agents actually talks through like six different kinds of workflows.

Let's go to TLDraw where we've got a bit of dark mode where we can actually dive into some of these concepts.

An agent is a loop where the LLM decides when to stop.

That loop is essentially multiple LLM calls one after the other.

Now if you call an LLM multiple times with the same information it's not going to do anything useful.

And so to make this an agent you kind of need to give it new information each time.

The way that works is the LLM calls tools.

It basically says execute this piece of code for me and then tell me what happened when that piece of code ran.

Just to dive into this for a minute it kind of looks like this.

Let's imagine our system has access to a tool called write file where it can write files to the file system.

The user can say to the agent, write a new file called gitignore.

Then the assistant comes back with a message here saying, okay, call this tool with this content and this path.

On our local machine then we execute the tool and we send the result back to the LLM.

And so this flow becomes a loop where the LLM is gaining more information each time.

This beautiful loop is what drives things like clawed code, coding agents, all the stuff that you're kind of used to using.

The key thing then is that the agent then decides when it's had enough.

So the agent can either continue to call tools or it can say stop.

At which point it will emit a special token

that just says stop and we can catch that in the frontend and no longer call the LLM again and again.

Workflows are of course much easier to define.

There's no loop here, it's just predetermined steps one after another.

You take one LLM call, you pass its result to another LLM call and you pass that result to another LLM call.

You might have some deterministic logic in these steps, like If the LLM call returns one thing, do one thing.

If it returns another, do another.

But all of those code paths are known ahead of time and written in code.

Workflows are neat, by the way, because you get opportunities to optimise the system.

For instance, you can have parallel workflows where you have multiple LLM calls at the same time.

We might take in or produce a chunk of text, split it into two parts, get the LLM to summarise each part of it, and then pass the results of those to another LLM call where we summarise the summaries.

Because the path to the solution is known up front, we can optimise it in all sorts of ways, which make workflows really, really powerful.

And by the way, if I had to pick between agents and workflows, like one that I could take to a desert island, I would probably pick workflows.

But that's just me because I'm a natural contrarian.

So let's sum up then.

Agent and workflow.

What are the differences?

What are they good at?

Well, the first thing to say that to qualify in this category, you need multiple LLM calls.

Like a single LLM call all by itself doesn't really qualify as either an agent or a workflow.

It's just a frickin' API call.

We don't need an extra definition for that.

To me, the key difference is who decides when to stop the program.

With an agent, as we saw, it is the LLM really.

The LLM can say, OK, I've done the work, let's now stop.

Whereas in a workflow it is predetermined steps that are known up front.

Now the reason that this entire distinction matters is that agents and workflows are good for different things.

An agent is really good when the path to the solution is unclear

or when you need to be able to generalise it to lots and lots of different tasks.

Coding Coding assistants are a really, really, really good example of this.

Because the coding assistant in Clawcode or Cursor doesn't know what kind of codebase it's going to go into, it doesn't know what kind of bug you're going to throw at it, and so it needs to be able to adapt on the fly.

In other words, agents are really, really good at improvising.

But workflows are much better when the path to the solution is known up front.

When you need to do the same thing a thousand times, you always want a workflow.

Because as we saw with the parallelizable steps you can basically optimise it in all sorts of different ways.

Whereas an agent you really leave the optimisation up to the agent itself.

Agent is like jazz, you know, it's all improvisation, all feel.

And workflows are like classical music where you can spend ages optimising the upfront set up

so that the final output is as good as it can be.

The next thing to say though is that agents and workflows are a spectrum, not a hard definition.

Most systems out there you will see will be somewhere on this gradient between agent and workflow.

For instance, a pure agent where the LLM is solely in charge of deciding when to stop, well, I don't want to deploy that because that thing is going to eventually run forever.

And so most agents have a max steps counter, in other words a deterministic stop in the code to prevent the agent running infinitely.

This is so common that tools like the AISDK actually have a max steps parameter to their agents.

Going further down we have agents that contain workflows.

Many agents are able to call workflows from within tools.

Which by the way allows you to build really really smart systems

because you get the generalizability of the agent and then you're able to optimise the tools that that agent has.

Finally, of course, you can have workflows that contain loops.

This might be that you produce some text and you evaluate it multiple times to refine the output continuously.

The difference here, of course, is does the LLM itself have the ability to break the loop early?

For me that's a sign that it's an agent rather than a workflow.

But these terms are on a spectrum

and most systems out there will use some combination of each or have agents within workflows or workflows within agents.

And so the definitions are useful because they allow you to think about problems in terms of patterns.

And so it kind of hurts me a little bit when I see as agent workflows grow more complex.

Ah, what did we do to deserve this?

This is just so confusing.

Now of course I'm annoyed I suppose because I'm interested in agents versus workflows as like a pedagogical tool, as a teaching tool.

Because I do find the definitions useful for communicating what you're trying to build and the trade-offs between them.

But also there's a sense that everyone's using the word without there necessarily being a good definition behind it.

I only hope that this definition will spread

100

that the anthropic definition of just two calls in a loop will be what people land on.

101

Now if you're digging what I'm putting out then you will love AIHero.dev.

102

I'm going to be releasing something soon which is going to mash together AI

103

and TypeScript and give you the ability to ship really powerful AI applications with the language

104

that you know and you know that I love.

105

Thanks so much for joining along folks.

106

I will see you very soon.

什么是跟读法？

跟读法 (Shadowing) 是一种有科学依据的语言学习技巧，最初开发用于专业口译员的培训，并由多语言者Alexander Arguelles博士普及。这个方法简单而强大：您在听英语母语原声的同时立即大声重复——就像是一个延迟1-2秒紧跟说话者的影子。与被动听力或语法练习不同，跟读法强迫您的大脑和口腔肌肉同时处理并模仿真实的讲话模式。研究表明它能显着提高发音准确性，语调，节奏，连读，听力理解和口语流利度——使其成为雅思口语备考和真实英语交流最有效的方法之一。

☕ 请我们喝杯咖啡

由于您的支持，ShadowingEnglish 保持完全免费。服务器和 AI 费用高昂——您的咖啡将帮助我们继续前行！🙏

通过 PayPal 捐赠

ShadowingEnglish.com – 英语跟读练习

使用跟读技巧流利地说英语。听正宗的YouTube视频，逐句跟读，建立真实的发音和流利度——全球雅思学习者都在使用。

跟读练习: Most devs don't understand what agents are - 通过YouTube学习英语口语

上下文背景

日常交流的五個關鍵短語

逐步跟讀指導

什么是跟读法？

跟读练习: Most devs don't understand what agents are - 通过YouTube学习英语口语

下载应用

上下文背景

日常交流的五個關鍵短語

逐步跟讀指導

什么是跟读法？