Pros and cons of building proprietary AI models

From: redpointai

Building proprietary AI models involves creating and maintaining an organization’s own artificial intelligence models rather than relying exclusively on commercial or third-party solutions. This approach comes with distinct advantages and disadvantages, particularly concerning customization, control, and cost [00:30:08].

Pros of Building Proprietary AI Models

Tailored Characteristics and Performance

One significant advantage is the ability to achieve specific model characteristics that off-the-shelf commercial models may not offer [00:30:08]. For companies like Replit, low latency is crucial for features like code suggestions, which may not be consistently met by general-purpose commercial models [00:30:10].

Cost Efficiency

While the initial investment might seem substantial (e.g., $100,000 for training a 3B parameter model at Replit [00:29:37]), building a proprietary model can be more cost-effective in the long run than being a continuous customer of commercial models, especially if the features need to be part of a free offering [00:28:56] [00:28:46]. Small models, in particular, can be both capable and affordable to train and deploy [00:29:17] [00:29:27].

Control Over Data and Training

Proprietary models offer complete control over the training data and process. This is vital because LLMs are fundamentally a function of the data they are fed, making the data akin to the “source code” [01:12:17] [00:35:12]. Control over the data means:

Quality Improvement: The ability to curate high-quality, diverse, and fresh data, even performing multiple training epochs on the same high-quality data for better performance [01:16:35].
Domain Specificity: Training on specific types of data (e.g., application code from Replit’s user base, rather than just infrastructure code from GitHub) can yield better results for particular use cases [01:18:14].
Security: Understanding the training process and data mitigates significant security risks, such as hidden “backdoors” that could be activated by certain prompts [00:36:51] [00:35:50].

Strategic and Talent Development

Building custom AI models for enterprises allows companies to develop internal AI talent and position themselves as an “AI company,” which can be a strong strategic consideration [00:29:45] [00:30:37].

Cons of Building Proprietary AI Models

Capital and Resource Intensity

While it can be cost-effective for specific use cases like Replit’s, training large, general-purpose models requires substantial capital expenditure on compute and data annotation [00:29:40] [00:14:32]. The rapid affordability of commercial models like GPT-3.5 makes the decision to train internally less universally rational for many companies [00:27:59].

Limited True Openness and Dependence

The concept of “open source AI models” is debated. If a model’s training process cannot be reproduced, or if the compiler for its source code is unavailable, it’s not truly open source [00:31:58] [00:32:00]. This means companies relying on open source models might still be dependent on the “goodwill” of the releasing entity (e.g., Meta for Llama) and their continued investment [00:32:30]. Strategically, long-term dependence on such external factors without control over the underlying data or training process can be risky [00:33:22].

Competition with Giants

Companies building proprietary models face stiff competition from major players like Microsoft and OpenAI, who have vast resources, established install bases, sales teams, and continuous advancements in their models [00:52:01]. It’s challenging for smaller entities to match the scale, compute, and comprehensive data collection of these large labs [00:54:37].

Staying Ahead of Model Advancements

The rapid pace of AI model development means a proprietary model, once built, risks being leapfrogged by subsequent releases from major labs [00:54:56]. This requires continuous investment in research and development to maintain competitiveness [00:56:18].

Hybrid Approach

Many companies adopt a hybrid strategy, using proprietary models for core, latency-sensitive features and leveraging commercial models for other use cases that don’t require the same specific characteristics [00:30:10] [00:31:10]. This allows companies to “start from the problem, from the customer pain point, explore potential solutions, run the numbers,” and make strategic investments where it makes the most sense [00:30:24].

The landscape for AI models is not yet set, with ongoing debates about the long-term viability of relying solely on commercial APIs versus the strategic advantage of internal model development [00:33:55].

Tubegraph

Explorer

Table of Contents