From: redpointai

AI for coding has evolved rapidly, with companies like Sourcegraph leading the way by building tools that significantly boost developer productivity through deep context awareness.

The Role of AI in Coding Today [00:00:07]

Coding is recognized as one of the most effective use cases for AI to date, as evidenced by GitHub Copilot’s more than one million paying users [00:00:11]. AI’s primary impact currently lies in accelerating the “inner loop” of software development [00:07:01]: the cycle a developer goes through many times a day of understanding a task, writing code, testing it, and repeating [00:07:18]. Tools that assist with these frequent, commonplace tasks, such as code completion or context-aware chat, are in heavy day-to-day use [00:07:39].

Sourcegraph’s Approach: Cody, the AI Coding Assistant [00:03:50]

Sourcegraph, a leader in the space, offers two main products for developers:

  1. Code Search Engine: This tool helps human developers understand vast codebases by allowing them to search, navigate definitions, and find references, thereby acquiring necessary context [00:04:09].
  2. Cody: An AI coding assistant that provides [00:04:50]:
    • Inline completion: Autocompletes thoughts as code is typed [00:04:56].
    • Chat: Enables asking high-level questions about code and generating code for specific tasks [00:05:00].
    • Menu of commands: Targets specific, often tedious, actions like writing doc strings or high-quality unit tests [00:05:09].

Context-Awareness as a Differentiator [00:05:28]

Cody’s key differentiator is its ability to perform these functions with the benefit of context and awareness about the user’s specific codebase [00:05:37]. Unlike general models that might provide “median” answers from open-source code, Cody understands the specific frameworks, libraries, and production environments relevant to a developer’s project [00:06:06]. This deep contextual understanding is crucial for applying AI effectively within an existing codebase [00:05:58].

Advancements in AI Models and Retrieval [00:13:10]

The progression of large language models (LLMs) from GPT-3.5 to GPT-4 and Claude 3 has significantly enhanced Cody’s reliability [00:14:21]. Newer models show improved ability to incorporate given context and integrate it into working code examples [00:14:41]. This allows for more consistent “wow moments” in daily usage, such as zero-shot application building using specific libraries and APIs [00:14:51].

The Role of Retrieval Augmented Generation (RAG) [00:17:02]

Sourcegraph heavily utilizes RAG to tie queries to the customer’s actual codebase [00:06:16]. While larger context windows in LLMs are helpful, they are not a complete solution. The best application architectures will always combine large context windows with more tailored information retrieval mechanisms [00:17:08]. Sourcegraph has extensively evolved its search techniques for RAG, even finding that classical keyword search, combined with clever chunking strategies, was highly effective early on, often outperforming naive vector database approaches [00:52:08].
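
To make the retrieval step concrete, below is a minimal sketch of keyword-based retrieval over chunked source files, in the spirit of the classical-search approach described above. It illustrates the general technique only, not Sourcegraph’s implementation: the line-window chunking, the TF-IDF-style scoring, and all function names are assumptions chosen for brevity.

```python
# Minimal sketch of keyword-based retrieval over chunked source files for RAG.
# Not Sourcegraph's implementation; chunk size, scoring, and names are assumptions.
import math
import re
from collections import Counter

def chunk_file(path: str, text: str, window: int = 30, overlap: int = 10):
    """Split a file into overlapping line-based chunks, keeping file/line metadata."""
    lines = text.splitlines()
    step = max(window - overlap, 1)
    chunks = []
    for start in range(0, max(len(lines), 1), step):
        body = "\n".join(lines[start:start + window])
        if body.strip():
            chunks.append({"path": path, "start_line": start + 1, "text": body})
    return chunks

def tokenize(text: str):
    """Crude code-aware tokenizer: pull out identifiers and words, lowercased."""
    return [t.lower() for t in re.findall(r"[A-Za-z_][A-Za-z0-9_]*", text)]

def score(query: str, chunk: dict, corpus_size: int, doc_freq: Counter) -> float:
    """TF-IDF-style keyword score, standing in for a classical search ranker."""
    chunk_terms = Counter(tokenize(chunk["text"]))
    total = 0.0
    for term in set(tokenize(query)):
        if term in chunk_terms:
            idf = math.log((corpus_size + 1) / (doc_freq[term] + 1)) + 1.0
            total += chunk_terms[term] * idf
    return total

def retrieve(query: str, chunks: list, k: int = 5):
    """Return the top-k chunks to splice into the LLM prompt as context."""
    doc_freq = Counter()
    for c in chunks:
        doc_freq.update(set(tokenize(c["text"])))
    ranked = sorted(chunks, key=lambda c: score(query, c, len(chunks), doc_freq), reverse=True)
    return ranked[:k]
```

The top-ranked chunks, along with their file paths and line numbers, would then be prepended to the user’s question in the prompt; swapping this ranker for an embedding-based one changes the scoring, not the overall shape of the RAG pipeline.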

The Path to Full Automation (Agents) [00:08:32]

While current AI tools excel at accelerating the inner loop, the next frontier is full automation through agents. This involves systems that can perform multi-step, bot-driven development where the AI drives the process, and humans primarily advise [00:08:40].

Achieving this requires:

  • Capable, self-correcting language models: To make, evaluate, and refine code changes toward a desired pull request [00:09:48].
  • Context acquisition: The ability to fetch detailed information about the surrounding code and existing patterns [00:10:14].
  • Recollection of actions: A feedback loop where the AI learns from trials, errors, and observed results [00:10:28].
  • Virtual execution environment: To try things, make changes, and observe results in a controlled setting [00:11:02].
  • Efficiency: Reducing the number of “cycles” needed to reach a correct answer, which relies on better observation, learning, prediction, and context retrieval [00:11:21].

Code offers a natural environment for this exploration due to its clearer “did this work?” evaluation [00:12:13]. Sourcegraph is actively building systems targeting specific problems amenable to full automation, combining existing context providers with an execution loop [00:58:18]. This work, including developing subroutines like test generation, benefits both outer-loop automation and inner-loop assistance [00:58:48].
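
As a rough illustration of the loop described above, here is a hedged sketch of an agent that gathers context, proposes a change, executes it in a sandbox, and feeds the observed result back into the next attempt. Every helper in it (gather_context, propose_patch, apply_and_test) is a hypothetical placeholder rather than a Sourcegraph API, and the cycle budget stands in for the efficiency concern noted above.

```python
# Hedged sketch of an automated "outer loop" agent: gather context, propose a
# change, run it in a sandbox, observe the result, and retry. All helpers below
# are hypothetical placeholders, not real Sourcegraph components.
from dataclasses import dataclass, field

@dataclass
class Attempt:
    patch: str
    passed: bool
    log: str

@dataclass
class AgentState:
    task: str
    context: str = ""
    history: list = field(default_factory=list)  # recollection of prior attempts

def gather_context(task: str, repo_path: str) -> str:
    """Placeholder: fetch relevant code via code search / RAG (see the retrieval sketch above)."""
    raise NotImplementedError

def propose_patch(state: AgentState) -> str:
    """Placeholder: ask an LLM for a diff, conditioned on context and past failed attempts."""
    raise NotImplementedError

def apply_and_test(patch: str, repo_path: str) -> tuple[bool, str]:
    """Placeholder: apply the patch in an isolated sandbox and run the test suite."""
    raise NotImplementedError

def run_agent(task: str, repo_path: str, max_cycles: int = 5) -> str | None:
    """Loop until the tests pass or the cycle budget is exhausted."""
    state = AgentState(task=task, context=gather_context(task, repo_path))
    for _ in range(max_cycles):
        patch = propose_patch(state)
        passed, log = apply_and_test(patch, repo_path)
        state.history.append(Attempt(patch, passed, log))  # feedback loop
        if passed:
            return patch  # candidate change for a pull request
    return None  # escalate to a human reviewer
```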

Model Strategy and Evaluation [00:24:54]

Sourcegraph’s philosophy is to “do the dumb thing first” [00:25:29], prioritizing ease of implementation and rapid iteration. This meant focusing on RAG initially rather than complex fine-tuning [00:27:26]. However, fine-tuning is now being explored to address language-specific code generation where base models underperform (e.g., Rust, Ruby) [00:28:06].

For model evaluation, Sourcegraph emphasizes product metrics over traditional benchmarks [00:19:56]. They prioritize metrics like:

  • Acceptance rate and overall volume for code completions [00:41:59].
  • Engagement and usage for explicit actions like chat and test generation [00:42:33].

This user-centric approach allows them to quickly understand actual value, as user acceptance directly reflects utility [00:20:09]. When new models like GPT-5 are released, Sourcegraph’s plan is to enable them quickly in Cody, allow users to choose, and observe product usage metrics to gauge effectiveness [00:17:37].
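
As an illustration of how such a product metric might be computed, the sketch below derives a completion acceptance rate from a stream of telemetry events. The event schema (completion_shown / completion_accepted records keyed by a completion ID) is an assumption made for the example, not Sourcegraph’s actual telemetry format.

```python
# Sketch of computing a completion acceptance rate from product telemetry.
# The event schema here is assumed for illustration only.

def acceptance_rate(events: list[dict]) -> float:
    """acceptance rate = accepted completions / completions shown."""
    shown, accepted = set(), set()
    for e in events:
        if e["type"] == "completion_shown":
            shown.add(e["completion_id"])
        elif e["type"] == "completion_accepted":
            accepted.add(e["completion_id"])
    return len(accepted & shown) / len(shown) if shown else 0.0

# Example: 2 of 3 shown completions were accepted.
events = [
    {"type": "completion_shown", "completion_id": "a"},
    {"type": "completion_accepted", "completion_id": "a"},
    {"type": "completion_shown", "completion_id": "b"},
    {"type": "completion_shown", "completion_id": "c"},
    {"type": "completion_accepted", "completion_id": "c"},
]
print(f"{acceptance_rate(events):.3f}")  # 0.667
```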

Impact on Developers [00:20:31]

AI coding tools show varying benefits across developer experience levels [00:21:09]:

  • Junior developers tend to find inline completions highly beneficial, viewing them as a pedagogical tool that provides useful starting points [00:23:01].
  • Senior engineers often prefer chat-based interactions, as inline completions can be disruptive if not consistently precise [00:21:48].

Ultimately, AI helps automate the “toil and drudgery” of coding, freeing up developers to focus on higher-value, more creative tasks that differentiate their work [01:08:06].

The Future of AI in Coding and Developer Ecosystems [01:05:46]

The number of engineers is expected to grow, but the definition of “engineering” and the day-to-day experience of a software developer will change drastically [01:06:11]. The “outer loop” of software development, encompassing the full lifecycle from ideation to deployment, is where more automation will eventually happen [00:07:07].

Key trends include:

  • Local Inference Models: A growing demand for running models locally (e.g., with Ollama or LM Studio), driven by offline availability (e.g., on an airplane), privacy, and cost [00:43:58]. As GPUs and models get faster, local inference could reduce latency even further, which is critical for developer experience [00:46:01]; a minimal sketch of local inference appears after this list.
  • Evolving Product Roadmaps: Sourcegraph prioritizes adding value over prematurely optimizing costs, anticipating that inference costs will decrease over time [00:31:33].
  • Open Ecosystems: Sourcegraph believes in enabling an open ecosystem where developers have freedom of choice regarding models, code hosts, and technology stacks [01:11:08]. They aim to provide building blocks (like Cody’s custom commands for prompt engineering) that allow any developer to leverage AI, not just a specialized class of AI developers [01:01:53]. This contrasts with a future where large players might vertically integrate and force adoption of proprietary platforms [01:12:16].
  • Complementary Technologies (Underhyped): Formal specifications and languages are seen as underhyped; they are not replaced by AI but work in tandem with it, since natural language often lacks the precision needed for complex tasks like programming [01:15:32].
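
For the local-inference point above, here is a minimal sketch of querying a locally running model through Ollama’s HTTP API on its default port. It assumes Ollama is installed and a code-capable model has already been pulled; the model name below is only an example.

```python
# Minimal sketch of local inference via Ollama's HTTP API (default port 11434).
# Assumes Ollama is running locally and the named model has been pulled.
import json
import urllib.request

def ask_local_model(prompt: str, model: str = "codellama") -> str:
    """Send a non-streaming generate request to the local Ollama server."""
    payload = json.dumps({"model": model, "prompt": prompt, "stream": False}).encode()
    req = urllib.request.Request(
        "http://localhost:11434/api/generate",
        data=payload,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["response"]

if __name__ == "__main__":
    # Works offline (e.g., on an airplane): no request leaves the machine.
    print(ask_local_model("Explain what a binary search does in one sentence."))
```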

Milestones for AI in Coding [01:03:04]

A significant milestone for AI in coding would be when whole classes of problems, such as simple bug fixes derived from production logs, can be solved by fully automated systems running in production, with the automatically resolved share moving from near zero to something like 80% [01:03:52]. This requires continuous improvements in model quality and controllability [01:04:36].

Conclusion

The development of AI coding assistants like Cody, with their emphasis on context-awareness and effective use of RAG, is transforming the daily experience of software developers. The future promises further automation, moving towards comprehensive AI agents, while maintaining an open ecosystem that empowers all developers. This shift will allow humans to focus on the unique, creative aspects of software creation, leaving repetitive tasks to AI [01:08:06].