Grammarlys AI development and application

From: redpointai

Grammarly is an AI assistant app for writing that boasts over 30 million daily active users and has raised over $400 mi ll i o n, w i t ha rece n t v a l u a t i o n o f$ 13 billion [00:00:14]. The company has been building AI productivity tools long before the recent wave of generative AI [00:00:25].

Vision for AI in Communication

Rahul Roy-Chowdhury, CEO of Grammarly, envisions a future where human-to-human communication is more meaningful and forms deeper connections because the drudgery of day-to-day work is handled by AI [00:01:37]. This means potentially less email and fewer documents, allowing humans to focus on creativity, synthesizing ideas, and connecting [00:01:51]. The goal is not to create more content, but to make existing communication better, more memorable, evocative, and precise [00:02:11].

Currently, the average person switches contexts 1,200 times in an average workday [00:02:27]. AI’s promise is to enable a “flow state” where individuals can focus on what they do best, making each conversation measurably more valuable [00:02:39]. While there’s a risk of AI generating and consuming more content, leading to a “dystopian” scenario, Rahul advocates for human agency to bring about a world where AI augments human communication, rather than outsourcing the uniquely human aspects of writing and thinking [00:03:09].

Grammarly Product Evolution

Grammarly has been active for 15 years, launching in 2009, and has evolved by riding multiple technology waves, from rules-based systems and natural language processing (NLP) to deep learning models, and now large language models (LLMs) and generative AI (GenAI) [00:04:13]. Their approach is to identify user problems and then apply the best available technology to solve them [00:04:32].

Communication Lifecycle Stages

Grammarly conceptualizes the communication lifecycle in four stages [00:04:47]:

Ideation and Conceptualization [00:04:56]
Composition (writing down ideas) [00:04:58]
Revision and Polishing [00:05:04]
Comprehension (recipient understanding) [00:05:08]

Historically, Grammarly focused primarily on the revision phase, helping users correct grammar, ensure clarity, adhere to style guides, strike the right tone, and achieve brevity [00:05:23].

Impact of LLMs and Future Direction

LLMs are enabling Grammarly to “turbocharge” the value provided to users in two main ways [00:05:55]:

Tying Communication to Business Outcomes: Suggestions will become more strategically aligned to desired outcomes. Correctness and polish will increasingly be auto-applied, allowing Grammarly to focus on helping users achieve goals, such as drumming up enthusiasm for an event or clarifying calls to action in an email to a board [00:06:04]. This aligns with generative AI for business applications.
Engagement Across the Entire Lifecycle: Grammarly is moving beyond just revision to assist with ideation, composition, and comprehension [00:07:22]. For example, it can summarize long email threads and identify action items [00:07:40].

AI Development Process

Grammarly takes a responsible approach to developing and deploying AI features, especially given the importance of communication use cases [00:08:31]. They do not simply “throw a model over the wall” but undertake significant work to fine-tune models for specific use cases, conduct quality and safety evaluations, and integrate user feedback [00:08:52].

Model Evaluation and Quality

Unlike use cases where 60-80% accuracy might be acceptable (e.g., marketing), Grammarly’s high-stakes communication requires much greater precision [00:09:21]. They determine the necessary accuracy levels through various methods [00:10:00]:

User Response Tracking: Monitoring how users accept or reject suggestions and their engagement with features [00:10:02]. This continuous feedback loop helps fine-tune quality [00:10:16].
Human Evals: Conducting side-by-side evaluations where linguistic experts rate LLM outputs against human-generated content to determine preference [00:10:28].
Experiments and Iteration: Features are initially launched to a small percentage of users [00:10:52]. If engagement is low or rejection rates are high, they go back to the drawing board [00:10:55].
Contextual Dependency: Quality bars are not one-size-fits-all but are dependent on the specific use case [00:10:42].

Edge Cases and Safety

One example of learning from user feedback is Grammarly’s tone detector [00:11:28]. While generally helpful for adjusting tone (e.g., sounding more positive), they learned to suppress suggestions in sensitive contexts, such as police reports about serious crimes, where a “sound more positive” suggestion would be inappropriate [00:11:43]. This highlights the importance of understanding specific scenarios to ensure helpful suggestions and suppress unhelpful ones [00:12:16].

Ensuring model safety is a priority that cannot be “punted” to the future [00:18:45]. Grammarly performs extensive post-processing, fine-tuning, and custom safety evaluations to provide a safe environment for users [00:19:02]. They use external benchmarks relevant to their use cases, internal safety evals based on extensive user feedback regarding false positives, and side-by-side comparisons by linguistic experts [00:28:59].

Model Selection and Optimization

Grammarly uses a combination of closed-source and open-source models, typically having about half a dozen in production [00:24:08]. Most models are fine-tuned on Grammarly’s user data for precision in specific use cases [00:24:18]. The goal is to distill models down to the smallest and most efficient size possible for a given use case without decreasing quality, balancing cost and latency [00:24:33]. Low latency is crucial for a better user experience and achieving a “flow state” [00:25:11].

Personalization and Organizational Customization

Grammarly leverages its massive user data, processing 75 billion user events daily, to fine-tune and train models for different use cases and personalize experiences [00:25:54]. This high-quality, contextual data is considered a unique advantage [00:26:22].

For individuals, Grammarly helps users sound more like themselves, with a future goal of automating this voice fine-tuning [00:26:40]. For organizations, it ensures adherence to style guides, brand tones, and corporate values, enforcing compliance across all internal and external communications [00:26:54]. This involves ingesting organization-specific knowledge and automating rules that might otherwise be manual and out of the communication flow [00:27:44].

Future Capabilities: Multi-Step Reasoning

A highly anticipated capability for future models is improved multi-step reasoning [00:21:24]. This would enable “agentic workflows” where Grammarly could help orchestrate and reason through complex, multi-step communication flows, such as drafting a board email that requires integrating information from various teams (marketing, engineering, product) and adhering to specific communication attributes (brief, succinct, confident) [00:21:28]. This capability could be a “game-changer” for reducing the drudgery of work by automating the synthesis and summarization of context [00:21:43].

Competitive Landscape and Moats

Grammarly welcomes competition, viewing it as bringing attention to the problem space of communication assistance and increasing interest in their product [00:31:04]. Their key differentiators (moats) include [00:31:40]:

Proprietary User Data: The quality and scale of their user data, which allows for continuous product improvement through cyclical feedback loops [00:31:43].
Ubiquitous Presence: Grammarly operates across a fragmented landscape of tools (e.g., Gmail, Microsoft Word, Slack, Salesforce, Greenhouse), offering a uniform AI stack for communication wherever people work [00:32:09]. Their focus is on enhancing existing investments in various tools, not on pushing their own platforms [00:33:01].

Organizational Structure of AI Team

Grammarly employs a dual approach to structuring its AI team [00:34:10]:

Core Research Group: This team focuses on longer-term initiatives, exploring future capabilities like on-device AI inference based on the trajectory of model efficiency [00:34:35]. They look 18-24 months out, building necessary infrastructure and addressing data collection gaps [00:34:41].
Embedded AI Engineers: AI engineers are integrated into each product and feature team, working alongside front-end and back-end engineers to launch full-stack features [00:34:55].

On-device AI is becoming increasingly capable for simple use cases, and rapid efficiency gains in models suggest it will soon be viable for more complex ones, offering benefits like lower latency, reduced cost of inference, and improved user experience [00:35:16].

Enterprise AI Adoption

Grammarly sees AI as a profound transformation for the workplace, akin to the shift from on-premise to cloud [00:36:03]. It’s a journey, not a one-time deployment, requiring trust in vendor partners [00:36:26].

While there’s much excitement and experimentation with AI in enterprises, measurable productivity gains have been somewhat “elusive” outside of a few core use cases like software engineering and code generation [00:37:21]. Grammarly aims to demonstrate tangible value: the average Grammarly user in an organization saves 19 days per year, a significant productivity unlock [00:38:03]. This focus on measurability and repeatability is crucial for proving AI’s impact [00:38:33].

Shift to Enterprise Business

Historically, Grammarly was primarily a direct-to-consumer business [00:48:52]. A few years ago, they launched an Enterprise business, which is now their fastest-growing segment [00:49:00]. Rahul initially thought consumer and enterprise would remain separate but has since changed his mind, realizing the distinction is artificial [00:49:11]. Many users buy Grammarly for work, blurring the lines between a “consumer sale” and an “Enterprise deployment of one” [00:49:24]. Grammarly is building the company around a seamless customer journey, from free versions to premium, then self-served team licenses, and finally larger enterprise deals [00:49:35].

AI in Education

AI presents a unique moment for education, offering a powerful new tool that must be incorporated responsibly into pedagogical methods [00:39:51]. Initially, there was a tendency to “ban AI,” but this has largely dissipated [00:40:58]. Educators are now eager to partner with industry to equip graduates with critical AI skills for the workforce [00:41:02].

Grammarly is committed to being a responsible partner in this transformation [00:41:14]:

Citing AI Use: They launched a feature that allows users to cite their use of AI in a work product [00:41:18]. This helps differentiate between a student who merely generates an essay using AI (Student A) and one who engages with the AI tool for feedback and improvement, deepening their understanding (Student B) [00:41:30].
Authorship Tool: Another upcoming feature, “Authorship,” provides provenance for every piece of content in a document, indicating if parts were written manually, cut and pasted, or AI-generated [00:42:27]. This transparency empowers educators and students to set their own acceptable usage guidelines [00:43:04].

AI acts as a “leveler” and “democratizer of skills,” especially for students globally who lack access to extensive educational resources [00:44:47]. It enables them to study with assistance where otherwise they might not study at all, opening up new possibilities [00:45:04]. This supports AI powered tutoring tools and integration of AI in language fluency and pronunciation.

Rahul Roy-Chowdhury’s Views on AI

Overhyped: Chat interfaces, which he views as “subpar command line interfaces” that hopefully disappear [00:46:41].
Underhyped: AI’s potential as a tool to upskill and uplevel people globally, serving as a “force multiplier” for skill development and a “democratizer” of skills [00:46:51]. Studies show AI is most impactful for individuals in the bottom half of ability in certain tasks [00:47:29].
Biggest Surprise in Building AI Features: The strong resonance and user impact of the tone detector and tone AI feature [00:47:57].
Most Exciting AI Startup (Outside Grammarly’s Space): AlphaFold, for its game-changing impact on drug discovery and improving healthcare outcomes through precise research [00:50:07]. This relates to building AI applications for the legal industry (as mentioned previously with ModMed, an e-health company using Grammarly strategically).

The web browser of the future, influenced by AI, will likely involve synthesizing information, remembering things across different places, and surfacing content at the right moments, potentially solving issues like “too many tabs” [00:45:50]. This points to the future potential and development of AI assistance APIs.

Tubegraph

Explorer

Table of Contents