From: redpointai

The concept of defensibility in AI applications, particularly at the application layer, is a recurring theme among industry experts. While early discussions focused on proprietary models or unique datasets, the conversation has evolved to emphasize different aspects such as user experience, product velocity, and most importantly, network effects [00:00:33].

Shifting Focus from Models to Applications

Initially, the value of model companies was perceived to stem from their ability to be the sole provider of a model for a period, allowing them to build products on top of it. However, this period of exclusive advantage has proven to be much shorter than anticipated [00:04:00]. The focus has largely shifted to what engineers can achieve above the model layer to augment its capabilities [00:07:06]. The overwhelming consensus is that application development, sometimes colloquially referred to as “GPT wrappers,” is now the most interesting area [00:07:07].

Product-Market Fit First

A key principle is to prioritize building something users desire before worrying about differentiation [00:07:17]. This contrasts with earlier assumptions that inherent advantages like Apple’s position would guarantee success in consumer AI, which was not the case [00:10:07].

Sources of Defensibility

Network Effects

Network effects are considered paramount for long-term defensibility [00:39:16]. Companies often prioritize the single-player experience, neglecting the multiplayer aspects that can foster strong network effects [00:39:22].

A prime example is Chai Research, a character AI competitor that, despite not having its own proprietary models, has outlasted competitors by building a network of users submitting models [00:40:00]. This approach turns the company into a marketplace, leveraging the “choke point” at the protocol layer between users and model providers [00:40:39].

Brand and Velocity

Rapid brand recognition and product velocity are crucial [00:41:17]. Companies that quickly become synonymous with an entire category gain significant advantage in customer access [00:41:20]. This allows them to command higher Average Contract Values (ACVs), even doubling them, due to the premium associated with being a leading brand [00:41:41].

The ability to move quickly, develop a broad product, and adapt to new model releases every few months is an “existential event” for companies [00:42:12]. This emphasis on execution, user experience, and speed is reminiscent of successful traditional SaaS companies, relying on a multitude of small, compounding advantages rather than a single revolutionary feature [00:42:30].

Unique Data and Stateful AI

While general-purpose models tend to dominate, specific use cases leveraging unique datasets can offer defensibility [00:17:34]. Examples include robotics, biology, and material science, where proprietary data or processes (like wet labs) are essential [00:17:35].

Furthermore, “stateful AI” or memory in AI applications, akin to databases, is seen as a potentially interesting area for investment and defensibility [00:16:03].

Implications for Investment

  • Application Layer over Infrastructure (sometimes): The application layer is generally considered more interesting than bare-metal infrastructure, as it allows companies to charge for utility rather than just cost, avoiding the “cost plus” pricing model often seen in infrastructure [00:47:14].
  • Targeting Cost Centers: Early AI applications often found success in areas where businesses were already comfortable outsourcing to BPOs (Business Process Outsourcing) due to cost-cutting objectives [00:32:28]. These areas, like customer support, implicitly signal a willingness to accept some performance cuts for cost savings [00:32:32].
  • Focus on Revenue Growth: The next wave of AI applications is expected to focus more on revenue growth rather than just cost cutting [00:32:06]. Applications that demonstrably increase top-line revenue, such as those in AI go-to-market, will likely prove even more defensible, as businesses are willing to pay more for clear ROI [00:32:57].
  • Tolerance for Imperfection: In certain applications, such as voice AI for scheduling or intake, 100% precision is not required for a great product [00:33:19]. Even with 75% effectiveness, significant value and increased revenue can be generated, as many existing processes have much lower success rates [00:33:30]. This allows for viable products even as model reliability continues to improve [00:33:46].

Overhyped and Underhyped Areas

  • Overhyped: Agents frameworks are seen as overhyped, with the market still seeking stability rather than a proliferation of frameworks [00:10:42]. The focus should be on establishing protocols rather than embedding them within frameworks [00:11:55].
  • Underhyped: Stateful AI/memory and Private Cloud Compute (PCC) are considered underhyped. PCC, especially for handling larger LLMs while maintaining privacy and on-device security, is seen as having significant potential due to the need for multi-tenant architectures with single-tenant guarantees in GPU-constrained environments [00:12:45].

Unanswered Questions

A major unanswered question is the extent to which Reinforcement Learning (RL) can be successfully applied to non-verifiable domains, such as legal contracts or outbound sales [00:50:22]. If RL remains limited to verifiable domains, it could lead to a future where fully autonomous AI excels in areas like coding and math, while human involvement remains necessary for tasks like writing sales emails [00:51:00].

Another key question revolves around the scalability of AI, specifically concerning hardware and availability of GPUs. The “rule of nines” suggests that achieving higher reliability (e.g., from 90% to 99%) requires an order of magnitude increase in compute every 2-3 years, raising concerns about continued reliance on a few dominant hardware providers like Nvidia [00:51:34].

Finally, the emerging need for “agent authentication” – a new form of SSO for agents – remains an open question, as current systems are not designed to differentiate between human and AI actions on websites [00:54:41].