From: aidotengineer
Introduction to AI Engineering and Agents
AI engineering is an evolving discipline that aims to mark the state of the art in the industry [01:05:00]. Past discussions at AI Engineer Summits have covered the rise of the AI engineer, the three types of AI engineers, and the maturation and spread of the discipline across different fields [01:10:00].
Evolution of the Discipline
There’s ongoing debate over the identity of the AI engineer [01:33:00]:
- Machine Learning (ML) View: An AI engineer is mostly an ML engineer with some prompting [01:41:00].
- Software Engineering (SE) View: An AI engineer is primarily a software engineer calling a few Large Language Model (LLM) APIs [01:47:00].
However, it’s expected that AI engineering will emerge as its own distinct discipline over time, growing beyond the current perception of being 90% software engineering and 10% AI [01:54:00]. Differences in language, such as ML engineers saying “test-time compute” versus AI engineers saying “inference-time compute,” highlight this distinction [02:24:00].
Defining an AI Agent
Defining “agent” is a monumental task, with various perspectives:
- Machine Learning Perspective: Agents are often discussed in the context of reinforcement learning environments, focusing on actions achieving goals [05:41:00].
- Software Engineering Perspective: This view can be reductive, sometimes equating an agent to a simple “for loop” [05:49:00].
- Crowdsourced Definitions: Simon Willison crowdsourced over 300 definitions, highlighting common themes [06:00:00]. Key characteristics include:
- Goal-oriented behavior [06:17:00]
- Tool use [06:17:00]
- Control flow [06:20:00]
- Long-running processes [06:20:00]
- Delegated authority [06:22:00]
- Small multi-step task completion [06:23:00]
- OpenAI’s Definition: OpenAI introduced a new definition for agents, indicating ongoing work in this area [06:52:00].
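The “for loop” view above, combined with the crowdsourced characteristics (goal-oriented behavior, tool use, control flow), can be sketched in a few lines. This is a hypothetical minimal illustration: the `llm` function is a scripted stub standing in for a real model API, and `TOOLS` is an invented registry, not any specific framework’s interface.

```python
# Minimal sketch of the "agent = for loop" view: call a model, execute any
# requested tool, feed the result back, and stop at a final answer.
# The model here is a scripted stub standing in for a real LLM API.

def llm(messages):
    """Hypothetical model: requests a tool once, then gives a final answer."""
    if not any(m["role"] == "tool" for m in messages):
        return {"tool": "search", "args": {"query": "AI engineering"}}
    return {"answer": "AI engineering is becoming its own discipline."}

TOOLS = {
    "search": lambda query: f"Top result for {query!r}",  # toy tool
}

def run_agent(goal, max_steps=5):
    messages = [{"role": "user", "content": goal}]
    for _ in range(max_steps):          # the titular "for loop"
        reply = llm(messages)
        if "answer" in reply:           # goal reached: exit the control flow
            return reply["answer"]
        result = TOOLS[reply["tool"]](**reply["args"])  # delegated tool use
        messages.append({"role": "tool", "content": result})
    return "Gave up after max_steps."

print(run_agent("What is AI engineering?"))
```

Even this toy version exhibits most of the crowdsourced characteristics: a goal, tool use, control flow, and bounded multi-step task completion.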
Why Agents are Gaining Traction Now
The current momentum for agents, in contrast to previous years, is attributed to several factors:
- Increased Capabilities: Agent capabilities are starting to reach human baselines [07:28:00]. This includes:
- Better reasoning [07:39:00]
- Improved tool use [07:41:00]
- Better tools [07:42:00]
- Model Diversity: The model market has diversified significantly; OpenAI’s share has fallen from roughly 95% two years ago to about 50% now [07:50:00]. The emergence of new frontier model labs poses challenges to established players like OpenAI [08:02:00].
- Reduced Cost of Intelligence: The cost of GPT-4-level intelligence has decreased by 1,000 times in the last 18 months [08:14:00]. Similar cost reductions are being observed for o1-level intelligence [08:24:00].
- RL Fine-Tuning Options: The availability of reinforcement learning (RL) fine-tuning options is contributing to agent development [08:28:00].
- Outcome-Based Charging: A shift towards charging for outcomes rather than just costs is emerging [08:43:00].
- Advancements in Multi-Agents and Hardware: Progress in multi-agent systems and faster inference due to improved hardware are also key factors [08:48:00].
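The “1,000× in 18 months” cost figure above implies a striking compounding rate. The back-of-envelope calculation below is illustrative, not from the talk, and assumes a constant exponential decline:

```python
# Back-of-envelope: if GPT-4-level cost fell 1,000x over 18 months at a
# constant exponential rate, the monthly cheapening factor r satisfies
# r ** 18 == 1000.
r = 1000 ** (1 / 18)          # ~1.47x cheaper every month
per_half_year = r ** 6        # == 1000 ** (1/3), i.e. ~10x every six months
print(f"~{r:.2f}x cheaper per month, ~{per_half_year:.0f}x per half year")
```

Put differently, the same budget buys roughly ten times as much GPT-4-level inference every six months under this trend.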
Challenges and Opportunities in AI Adoption and Development
Current Challenges and Insights in Developing AI Coding Agents and Agents in General
While there’s a strong push for 2025 to be the “Year of Agents” by major figures like Satya Nadella, Sam Altman, and Greg Brockman [04:20:00], skepticism exists. Some were initially told to remove “agents” from their branding, only to be told to put it back later [05:09:00].
A significant challenge in developing AI agents is that speakers often come from backgrounds where they build agent frameworks for a living rather than putting agents into production [03:26:00]. This has led to a new rule at conferences: “no more vendor pitches” [03:36:00], which makes curation harder, as speakers then have less incentive to share production-level insights [03:46:00].
Opportunities and Use Cases
- Product-Market Fit: Coding agents and support agents are showing product-market fit [09:12:00]. Deep research also has product-market fit [09:15:00].
- Emerging Use Cases: Several other use cases are up and coming [09:17:00].
- The “Everything + Agent” Formula: It has been observed that “everything plus agent works,” such as:
- Agent + RAG (Retrieval-Augmented Generation) [04:03:00]
- Agent + Sentiment [04:04:00]
- Agent + Search [04:05:00]

This formula is seen as a “simple formula for making money in 2025” [04:07:00].
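The “Agent + RAG” combination above can be sketched minimally. The snippet below is a hypothetical illustration: toy keyword-overlap retrieval stands in for an embedding-based vector store, and the generation step is stubbed rather than calling a real model.

```python
# Minimal "Agent + RAG" sketch: retrieve relevant context, then answer
# grounded in it. Retrieval here is toy word overlap; a real system would
# use embeddings and a vector store.
import string

CORPUS = [
    "Coding agents are showing product-market fit.",
    "Deep research tools also have product-market fit.",
    "Users prefer to book their own flights.",
]

def tokens(text):
    """Lowercase, strip punctuation, split into a set of words."""
    cleaned = text.lower().translate(str.maketrans("", "", string.punctuation))
    return set(cleaned.split())

def retrieve(query, k=1):
    """Rank documents by word overlap with the query; return the top k."""
    q = tokens(query)
    ranked = sorted(CORPUS, key=lambda d: len(q & tokens(d)), reverse=True)
    return ranked[:k]

def agent_with_rag(query):
    context = retrieve(query)
    # A real agent would pass `context` to an LLM; generation is stubbed here.
    return f"Answer (grounded in: {context[0]})"

print(agent_with_rag("coding agents product-market fit"))
```

The same scaffold generalizes to the other combinations: swap the retrieval step for a sentiment classifier or a web-search tool and the agent loop is unchanged.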
However, there are “anti-use cases” that should be avoided, such as agents for booking flights or Instacart orders, as users often prefer to handle these tasks themselves [09:23:00].
Future Prospects in AI and Agent-based Technologies
The growth of AI products, particularly agents, is strongly tied to reasoning capabilities [10:41:00]. OpenAI reported 400 million users, 33% growth in three months [09:46:00]. The growth of ChatGPT, from zero to 400 million users in 2.5 years, shows a clear trend [09:51:00].
ChatGPT’s usage growth stalled for a year when it didn’t ship any agentic models [10:05:00]. However, the introduction of the o1 models has doubled ChatGPT usage, and it’s projected to hit a billion users by the end of the year, quintupling its user base from September last year [10:21:00]. This means one eighth of the world’s population could be using ChatGPT by year-end [10:49:00].
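The figures above hang together arithmetically, as a quick sanity check shows (an illustrative calculation, not from the talk; the ~8 billion world-population figure is an assumption):

```python
# Sanity-checking the reported ChatGPT growth numbers (illustrative only).
users_now = 400e6                  # reported users today
prior = users_now / 1.33           # 33% growth in 3 months -> ~300M before
projected = 1e9                    # projected users by end of year
september_base = projected / 5     # "quintupling" implies ~200M last September
world_share = projected / 8e9      # assumed ~8B people -> 1/8 of the world
print(round(prior / 1e6), september_base / 1e6, world_share)
```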
The job of an AI engineer is evolving towards building agents, similar to how ML engineers build models and software engineers build software [11:00:00].