From: redpointai

AI’s impact on software development has been profound, with coding emerging as one of the most effective use cases for AI to date, as evidenced by products like GitHub Copilot’s million-plus paying users [00:00:09]. This shift raises questions about how AI will reshape programming jobs and the future of software engineering [00:00:37].

Current State of AI in Coding

Today, AI tools primarily serve as “inner loop accelerants” for software development [00:07:01]. The “inner loop” refers to the frequent, day-to-day cycle a developer goes through to write, test, and validate code [00:07:16]. These accelerants work best on tasks that are commonplace or closely resemble code that has been written before [00:07:57].

Sourcegraph, a leader in this space, offers two main products that illustrate this:

  • Code Search Engine: Helps human developers understand vast codebases by enabling search, navigation (go to definition, find references), and context acquisition [00:04:09].
  • Cody (AI Coding Assistant): Provides inline completion (autocompleting code as the developer types), a chat interface for high-level questions and code generation, and a menu of commands for specific, often toilsome actions such as writing doc strings or unit tests [00:04:50].

A key differentiator for Cody is its “context and awareness about your code base,” leveraging Sourcegraph’s ability to supply relevant context to both human developers and AI models [00:05:37]. This is crucial because applying AI within a specific codebase requires awareness of its unique environment, frameworks, and deployment practices [00:05:54].

Future of AI in Coding – Towards Automation

The next frontier for AI in coding is increasing automation: moving beyond inner-loop accelerants toward multi-step, bot-driven development [00:08:32]. While tools like Devin show promise, they still require human oversight [00:08:55].

Achieving this requires:

  • Feedback Loops: Trial and error with code changes, observing results, and learning from mistakes [00:10:30].
  • Virtual Execution Environments: To try things, make changes, and observe results in a controlled setting [00:11:02].
  • Context Fetchers: Extremely effective mechanisms for retrieving relevant code pieces, crucial for reducing the “cycles” an AI needs to reach a correct answer [00:11:07].

The goal is to reduce the time and cost of reaching a correct solution, moving AI-generated code from demo-level functionality to production reliability [00:11:50]; a sketch of such a loop appears below.
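The shape of such a loop can be sketched in a few lines. The helpers below (fetch_context, llm, sandbox) are hypothetical placeholders rather than Sourcegraph APIs; the point is only the propose → execute → observe → retry structure described above.

```python
# Hypothetical sketch of a bot-driven coding loop: a context fetcher feeds an
# LLM, a virtual execution environment runs the tests, and failures are fed
# back into the next attempt. None of these helpers are real Sourcegraph APIs.

def solve_issue(issue: str, llm, fetch_context, sandbox, max_cycles: int = 5):
    """Try to produce a patch that makes the test suite pass."""
    feedback = ""
    for cycle in range(max_cycles):
        # 1. Context fetcher: pull only the code relevant to this issue.
        snippets = fetch_context(issue, limit=10)

        # 2. Ask the model for a patch, including feedback from prior failures.
        patch = llm.generate(
            f"Issue:\n{issue}\n\nRelevant code:\n{snippets}\n\n"
            f"Previous failures:\n{feedback}\n\nReturn a unified diff."
        )

        # 3. Virtual execution environment: apply the patch and run the tests.
        result = sandbox.apply_and_test(patch)
        if result.passed:
            return patch  # verified change, ready for human review

        # 4. Feedback loop: learn from the failure on the next cycle.
        feedback = result.error_output

    return None  # could not converge within the cycle budget
```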

Impact of Model Advancements

Advances in large language models (LLMs), such as the progression from GPT-3.5 to GPT-4 and Claude 3, have significantly enabled new functionalities and improved reliability for AI coding assistants [00:13:11]. Newer models demonstrate better ability to integrate search results and additional code contexts into working code examples [00:14:44].

These advancements allow for:

  • More Reliable Chat Interactions: Leading to more consistent “wow moments” for users [00:14:32].
  • Zero-Shot Application Building: The ability to create an application from scratch using specific libraries and APIs with minimal human intervention [00:15:29].

Long context windows help with questions that tie together many different concepts, but they are not a silver bullet [00:16:11]. Simply shoving an entire codebase into the context window doesn’t guarantee the best results; the best application architectures combine large context windows with tailored information retrieval, meaning Retrieval Augmented Generation (RAG) remains crucial [00:17:08].
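A minimal sketch of that combination, assuming a generic ranked retriever and token counter rather than any particular vendor API, is to retrieve first and then pack only as much as fits a context budget:

```python
# Sketch: combine retrieval with a large context window instead of dumping
# the whole codebase into the prompt. `retriever` and `count_tokens` are
# assumed stand-ins, not a specific vendor API.

def build_prompt(question: str, retriever, count_tokens, budget_tokens: int = 100_000):
    header = f"Answer using the repository context below.\n\nQuestion: {question}\n\n"
    used = count_tokens(header)
    chunks = []
    for snippet in retriever.search(question):   # ranked, most relevant first
        cost = count_tokens(snippet.text)
        if used + cost > budget_tokens:          # stop at the context budget
            break
        chunks.append(f"# {snippet.path}\n{snippet.text}")
        used += cost
    return header + "\n\n".join(chunks)
```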

Role and Expectations of Software Developers

AI tools impact different types of developers differently:

  • Junior Developers: Tend to enjoy inline completions more, viewing them as a “pedagogical tool” that provides a starting point for tasks they might not know how to do well [00:23:01].
  • Senior Engineers: Often prefer chat over inline completions; some dislike completions when they are not consistently smart enough or when they disrupt their flow [00:21:48], feeling they could write better code faster themselves [00:22:58].

AI, particularly LLMs, functions like a “bullshitting machine” or pattern matcher, which is helpful for boilerplate or common tasks [00:23:16]. This offloads the “toil and drudgery” from human developers, allowing them to focus on more creative and differentiated aspects of their work [01:08:06].

Strategic Considerations for AI Application Developers

AI Development Philosophy

Sourcegraph’s philosophy for building a successful AI product for developers emphasizes adding the most end-user value and quickly moving the needle for user experience [00:25:11]. Their “dumb thing first” approach involves starting with the simplest possible solution (e.g., classical keyword search for RAG) to establish a baseline before exploring more complex methods like embeddings or vector databases [00:25:29].
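As an illustration of that philosophy, a keyword-overlap retriever like the one below can serve as the baseline before embeddings or a vector database are introduced (a generic sketch, not Sourcegraph’s implementation):

```python
# Baseline retriever: rank files by simple keyword overlap with the query.
# Deliberately "dumb": no embeddings, no vector database.
import re
from collections import Counter

def tokenize(text: str) -> Counter:
    return Counter(re.findall(r"[A-Za-z_]\w+", text.lower()))

def keyword_search(query: str, files: dict[str, str], k: int = 5):
    """files maps path -> contents; return the k best-matching paths."""
    q = tokenize(query)

    def score(body: str) -> int:
        toks = tokenize(body)
        return sum(min(count, toks[word]) for word, count in q.items())

    ranked = sorted(files, key=lambda path: score(files[path]), reverse=True)
    return ranked[:k]
```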

Evaluation Metrics

Product metrics, such as acceptance rate and overall usage volume, are considered the most authoritative measures of an AI feature’s success [00:19:56]. While offline benchmarks are used, direct user feedback and real-world engagement are prioritized because they reflect actual value and user experience, which benchmarks may not fully capture [00:19:01].
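For concreteness, an acceptance-rate metric is straightforward to derive from completion telemetry; the event schema below is hypothetical:

```python
# Hypothetical telemetry events: each records whether a suggested completion
# was shown and whether the user accepted it.
events = [
    {"user": "a", "shown": True, "accepted": True},
    {"user": "a", "shown": True, "accepted": False},
    {"user": "b", "shown": True, "accepted": True},
]

shown = sum(e["shown"] for e in events)
accepted = sum(e["accepted"] for e in events)
acceptance_rate = accepted / shown if shown else 0.0   # here 2/3 ≈ 0.67
usage_volume = len({e["user"] for e in events})        # distinct active users
```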

Organizational Structure

Sourcegraph’s engineering organization is structured with a distinct “model layer” team focused on fine-tuning models for specific use cases (e.g., language-specific code generation like Rust or Ruby) and defining quantitative benchmarks [00:26:07]. The rest of the product engineering is divided into code search and Cody (AI) teams, with an expectation that these will integrate more closely over time due to synergies [00:36:10].

Cost and Pricing

Inference cost is a consideration, but Sourcegraph prioritizes adding value, expecting costs to decrease over time [00:31:16]. Their pricing model is “active user per month,” aiming to align with the value customers receive [00:33:03].

The Evolution of AI Developer Tools and Hardware

There is a growing trend towards local inference models, driven by factors such as the following (a minimal sketch of local inference appears after the list):

  • Availability: Enables usage in environments without network connectivity (e.g., airplanes) [00:44:47].
  • Privacy: Keeping computation contained within a local machine [00:45:31].
  • Cost: Potentially cheaper than cloud inference [00:45:26].
  • Latency: As inference times improve on local hardware, network round trips become the bottleneck, making local execution crucial for latency-sensitive developer workflows [00:46:13].
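A minimal sketch of local inference, assuming an Ollama server on its default local port with a code model already pulled (the model name and endpoint here are assumptions, not a recommendation):

```python
# Sketch: call a locally hosted model instead of a cloud API. Assumes an
# Ollama server at its default address with a code model pulled locally.
import requests

def local_complete(prompt: str, model: str = "codellama") -> str:
    resp = requests.post(
        "http://localhost:11434/api/generate",   # nothing leaves the machine
        json={"model": model, "prompt": prompt, "stream": False},
        timeout=120,
    )
    resp.raise_for_status()
    return resp.json()["response"]

print(local_complete("Write a Python function that reverses a string."))
```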

Managing Code Quality with AI

The “Cody Ignore” feature allows users to exclude certain files from being used as context for AI generation [00:29:21]. Initially intended to keep sensitive files out of prompts, it is now also requested as a way to exclude “bad code” so that AI suggestions do not imitate it, giving engineering leaders a lever to influence code quality [00:29:46].

The Role of AI in the Future of Coding and Developer Ecosystems

The release of ChatGPT was a significant accelerant for Sourcegraph, shifting the winds and pushing them to invest more in AI coding tools like Cody [00:40:12]. Companies with unique access to or ways of structuring valuable data (like Sourcegraph with codebases) are seen as primary beneficiaries of the current AI wave [00:41:24].

The Agentic Future

Sourcegraph is actively experimenting with full automation (outer loop) using existing context providers combined with an execution loop to solve specific problems [00:58:18]. These efforts also improve inner loop functionalities, for instance, by developing better verification mechanisms like test generation [00:58:43].

Open Ecosystem and Building Blocks

The company aims to enable an open ecosystem where “any developer out there” can build on top of AI [01:01:32]. They want to provide high-quality context providers as cross-cutting “building blocks” that can be plugged into various new AI agent UIs or user experiences, rather than monopolizing every cool new idea [01:00:09]. Features like Cody’s custom commands empower users to define context, prompts, and output rendering for AI tasks [01:01:53].

This vision fosters an open future for software creation, preserving freedom of choice for developers and companies regarding models, code hosts, and technology stacks [01:11:08]. The goal is to avoid a future where large players vertically integrate the market onto proprietary development platforms [01:12:13].

Milestones and Future Outlook

Significant milestones for AI in coding would include fully automated systems solving classes of problems that cannot be addressed in production today, such as automatically fixing a high percentage of simple bugs based on production logs [01:03:55].
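One way to picture that milestone is a pipeline from a production stack trace to a candidate patch that is only surfaced if the tests pass; everything below is a hypothetical sketch, not a description of an existing system:

```python
# Hypothetical pipeline: production log -> suspect location -> candidate fix,
# surfaced only if the test suite still passes afterwards.
import re

def locate_failure(log: str):
    """Pull file and line out of a Python-style traceback line."""
    m = re.search(r'File "(?P<file>[^"]+)", line (?P<line>\d+)', log)
    return (m["file"], int(m["line"])) if m else None

def propose_fix(log: str, read_source, llm, sandbox):
    loc = locate_failure(log)
    if loc is None:
        return None
    path, line = loc
    snippet = read_source(path, line - 20, line + 20)   # surrounding context
    patch = llm.generate(
        f"Production error:\n{log}\n\nCode near {path}:{line}:\n{snippet}\n"
        "Propose a minimal fix as a unified diff."
    )
    return patch if sandbox.apply_and_test(patch).passed else None
```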

The number of engineers is expected to grow, but the definition and day-to-day experience of software development will change drastically [01:06:08]. AI will automate much of the “toil and drudgery,” allowing developers to spend more time on creative, valuable, and “magical” aspects of building software [01:08:06].

While AGI (Artificial General Intelligence) is considered overhyped, especially the idea that simply scaling Transformers will achieve it [01:14:12], there’s significant value in building things complementary to AI and LLMs [01:15:25]. Formal specifications and formal languages are underhyped, as they remain essential for describing precise requirements that natural language cannot adequately convey, much like math is necessary for precision beyond natural language [01:15:58]. This suggests that programming languages will continue to exist and become an increasingly important complement to AI [01:17:01].
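A small illustration of the point about formal specifications: “sort the list” in natural language leaves key requirements implicit, whereas the property below states precisely what any correct implementation must satisfy:

```python
# A precise, checkable specification of "sort": the output must be ordered
# and be a permutation of the input. Natural language alone leaves both
# properties implicit.
from collections import Counter

def satisfies_sort_spec(inp: list[int], out: list[int]) -> bool:
    ordered = all(out[i] <= out[i + 1] for i in range(len(out) - 1))
    permutation = Counter(out) == Counter(inp)
    return ordered and permutation

assert satisfies_sort_spec([3, 1, 2], sorted([3, 1, 2]))
```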

The pace of advancement in model output quality and efficiency continues to be a positive surprise, reducing the need for extensive workarounds [01:18:14]. The growth of open-source models is also seen as a positive development, offering advantages like fine-tuning capabilities and access to internal model states (e.g., attention weights) for application-level improvements [01:20:04].
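As an example of that last point, an open-weights model lets the application layer read internal states directly; the sketch below uses the Hugging Face Transformers library with a small open model purely as an illustration:

```python
# Sketch: open-source models expose internals (here, attention weights) that
# closed APIs do not. Uses a small open model purely for illustration.
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModel.from_pretrained("gpt2", output_attentions=True)

inputs = tokenizer("def add(a, b): return a + b", return_tensors="pt")
outputs = model(**inputs)

# One attention tensor per layer, shaped (batch, heads, seq_len, seq_len).
print(len(outputs.attentions), outputs.attentions[0].shape)
```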

The future of AI application development also holds vast opportunities in rethinking how work is done in specific knowledge domains (vertical SaaS) and in consumer applications, potentially leading to “concierge as a service” paradigms where users simply tell the AI what to do [01:24:12].