From: aidotengineer

Douwe Kiela, CEO of Contextual AI, shares insights from his experience in enterprise AI, focusing on the challenges and opportunities in scaling AI agents and solutions for businesses [00:00:16].

The Enterprise AI Paradox

While there is a significant opportunity with AI, estimated to add $4.4 trillion to the global economy according to McKinsey, many enterprises face frustration [00:00:54]. A Forbes study indicates that only one in four businesses actually derive value from their AI investments, leading to questions about ROI [00:01:26].

This situation echoes Moravec’s Paradox from robotics, where tasks seemingly hard for humans (like beating chess grandmasters) are easier for computers, while tasks easy for humans (like vacuuming a house or self-driving) are much harder [00:01:44]. In enterprise AI, language models excel at complex tasks like code generation and mathematical problems [00:02:28]. However, they struggle with “context,” which humans effortlessly provide based on expertise and intuition [00:02:41].

The Context Paradox and Differentiated Value

The “context paradox” is crucial for unlocking ROI with AI [00:03:15]. Current general-purpose AI assistants often focus on convenience [00:03:25]. To achieve differentiated value and business transformation, enterprises must become proficient at handling internal context [00:03:36].

Key Lessons for Building and Scaling AI Systems

Drawing from experience at Contextual AI, several best practices for building AI systems have emerged:

1. Focus on Systems, Not Just Models

Language models typically constitute only about 20% of a complete enterprise AI deployment [00:04:40]. Retrieval Augmented Generation (RAG) has become the standard method for enabling generative AI to operate effectively on an organization’s proprietary data [00:05:02]. An effective RAG pipeline around a “mediocre” language model will outperform an “amazing” model with a poor RAG pipeline [00:05:23]. The focus should be on the entire system that solves the problem, not just the model [00:05:35].
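To make the "system, not model" point concrete, here is a minimal retrieve-then-generate sketch in Python. It is purely illustrative and assumes nothing about Contextual AI's platform: the toy lexical `retrieve`, the `build_prompt` step, and the stubbed `generate` call are hypothetical stand-ins for the retrieval, grounding, and generation stages that surround the model in a real deployment.

```python
# Minimal RAG sketch (hypothetical, not any specific vendor's API).
from dataclasses import dataclass


@dataclass
class Document:
    doc_id: str
    text: str


def retrieve(query: str, docs: list[Document], k: int = 3) -> list[Document]:
    """Toy lexical retriever: rank documents by term overlap with the query."""
    terms = set(query.lower().split())
    return sorted(docs, key=lambda d: len(terms & set(d.text.lower().split())), reverse=True)[:k]


def build_prompt(query: str, context: list[Document]) -> str:
    """Ground the model on retrieved enterprise context before asking the question."""
    context_block = "\n\n".join(f"[{d.doc_id}] {d.text}" for d in context)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context_block}\n\n"
        f"Question: {query}"
    )


def generate(prompt: str) -> str:
    """Placeholder for the actual language model call in a real deployment."""
    return f"(model output for a {len(prompt)}-character prompt)"


def rag_answer(query: str, docs: list[Document]) -> str:
    return generate(build_prompt(query, retrieve(query, docs)))
```

Even in this toy version, most of the code is retrieval and grounding plumbing rather than the model call itself, which reflects the point that the model is only a small fraction of the deployed system.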

2. Prioritize Specialization Over AGI

Enterprise expertise is a valuable fuel for AI [00:05:47]. While Artificial General Intelligence (AGI) has its applications, solving specific, difficult, domain-specific problems within an enterprise is best achieved through specialization, allowing the AI to better capture and utilize institutional knowledge [00:06:11].

3. Leverage Data as Your Moat

An enterprise’s data is its long-term asset, even more so than its people [00:06:56]. The challenge is enabling AI to work on noisy data at scale [00:07:26]. Successfully doing so allows for the creation of differentiated value and a competitive moat [00:07:32].

4. Design for Production from Day One

Pilots are relatively easy to build [00:07:56]. Scaling an AI system from a pilot (e.g., 10 users and a few documents) to production (e.g., thousands of users, millions of documents, 20,000 use cases) is significantly harder [00:08:20]. Existing open-source tools often struggle at this scale [00:08:31]. Enterprise requirements like security and compliance add further complexity [00:08:50]. Therefore, design for production from the outset to bridge the gap effectively [00:09:01].

5. Prioritize Speed Over Perfection

For successful production rollouts of RAG agents, speed is paramount [00:09:10]. Deploy solutions to real users early, even if they are only “barely functional” [00:09:22]. This enables iterative improvement (“hill climbing”) to achieve a “good enough” state. Delaying for perfection can make the transition from pilot to production very difficult [00:09:42].

6. Empower Engineers to Focus on Value

Engineers should focus on delivering business value and competitive advantage, not on mundane tasks like optimizing chunking strategies or basic prompt engineering [00:10:10]. State-of-the-art platforms for RAG agents can abstract away these lower-level concerns [00:10:55].
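As a sense of the kind of plumbing meant here, below is a rough sketch of a fixed-size document chunker with overlap, the sort of low-level concern the talk suggests a platform should own rather than individual engineers hand-tuning it. The function name and default window sizes are arbitrary choices for illustration.

```python
def chunk_text(text: str, chunk_size: int = 200, overlap: int = 40) -> list[str]:
    """Split text into overlapping word windows (sizes here are arbitrary defaults)."""
    words = text.split()
    chunks, step = [], chunk_size - overlap
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        if start + chunk_size >= len(words):
            break
    return chunks
```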

7. Make AI Easy to Consume

A common pitfall is that even when AI solutions run in production, internal usage can be zero [00:11:15]. Solutions must be integrated into existing workflows to maximize real production usage [00:11:55].

8. Design for “Wow” Moments and Sticky Usage

User adoption and stickiness are key [00:12:15]. Design user onboarding experiences to deliver “wow” moments quickly, where users immediately grasp the value and potential of the AI [00:12:31]. For example, a Qualcomm engineer found a 7-year-old hidden document through the system, answering long-standing questions and changing their workflow [00:12:45].

9. Manage Inaccuracy Through Observability

Accuracy is table stakes for AI [00:13:24]. Enterprises are increasingly concerned with managing the remaining 5-10% of cases where the AI is inaccurate [00:13:41]. This requires robust observability, proper audit trails (especially in regulated industries), and attribution [00:13:54]. In RAG systems, clear attribution of answers to source documents is vital for dealing with inaccuracies [00:14:17].
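One way to make attribution concrete, sketched below under the assumption of a generic RAG system rather than any particular product, is to return every answer together with pointers to the supporting source passages. The `Attribution` and `AttributedAnswer` types are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Attribution:
    doc_id: str    # which source document supported the claim
    span: str      # the supporting passage pulled from that document
    score: float   # retriever or attribution confidence


@dataclass
class AttributedAnswer:
    answer: str
    attributions: list[Attribution] = field(default_factory=list)

    def audit_record(self) -> dict:
        """Flatten the answer into a log-friendly record for an audit trail."""
        return {
            "answer": self.answer,
            "sources": [{"doc_id": a.doc_id, "score": a.score} for a in self.attributions],
        }
```

Logging records of this shape alongside each response is one simple way to provide the audit trail and traceability that regulated deployments require.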

10. Be Ambitious

Many AI projects fail not from aiming too high, but too low [00:14:51]. Projects that merely answer basic questions about HR policies, for instance, often lack significant ROI and become gimmicks [00:15:00]. Enterprises should aim for ambitious goals that, if solved, deliver substantial ROI and drive business transformation [00:15:12]. The current era of AI presents a unique opportunity to effect significant societal and business change [00:15:36].

By understanding the “context paradox” and applying these lessons—focusing on systems, specializing for expertise, and being ambitious—enterprises can turn the challenges of scaling AI agents into significant opportunities [00:16:03].