From: jimruttshow8596
Many leading researchers hold the view that current deep neural networks (DNNs) and related machine learning (ML) algorithms are fundamentally unsuited for creating human-level Artificial General Intelligence (AGI) [01:13:04]. These DNNs constitute most of the AI that receives attention today, including systems like DALL-E 2, GPT-3, and Jasper.ai [01:29:06].
While some researchers, like Gary Marcus and Douglas Hofstadter, believe current DNNs are entirely misguided as a path to AGI, and the mainstream increasingly believes only minor tweaks are needed, Ben Goertzel holds a more moderate view. He suggests DNNs, in some variation, could serve as significant components of an AGI architecture, but that they are missing many key aspects necessary for human-level intelligence [01:38:06].
Core Criticisms of Deep Neural Networks
Shallow Pattern Recognition
A primary criticism is that current deep neural networks, despite their name, primarily identify “shallow patterns” within large datasets [01:47:06]. They function more like sophisticated lookup tables, recording and indexing vast numbers of detailed patterns and then extrapolating from these to deal with new situations [01:48:06], [02:06:06].
For instance, when given a puzzle about fitting a big table through a small door, a transformer neural net suggested using a table saw [01:58:06]. This error highlights a lack of understanding of the underlying reality; the AI associated “table saw” with “sawing tables” based on linguistic patterns, not a conceptual model of what a table saw actually is [02:00:06], [02:03:06]. This limitation is widespread across various machine learning algorithms [02:06:06].
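The failure mode described above can be illustrated with a deliberately simplified sketch. This is not an actual transformer; it is a toy “answerer” (the function names and candidate list are invented for illustration) that picks a tool purely by surface word overlap with the prompt, with no model of what any tool actually does:

```python
# Toy illustration of surface-level association (NOT a real transformer):
# the "model" scores candidate tools only by shared words with the prompt.

def word_overlap(a: str, b: str) -> int:
    """Count distinct words the two strings share (case-insensitive)."""
    return len(set(a.lower().split()) & set(b.lower().split()))

def surface_associate(prompt: str, candidates: list[str]) -> str:
    """Pick the candidate with the highest surface word overlap."""
    return max(candidates, key=lambda c: word_overlap(prompt, c))

tools = ["table saw", "furniture dolly", "screwdriver"]
prompt = "how do I fit a big table through a small door"
print(surface_associate(prompt, tools))  # -> table saw
```

Because “table saw” shares the word “table” with the puzzle, it wins on linguistic co-occurrence alone, mirroring how a system without a conceptual model of tables or saws can produce a confidently wrong answer.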
Lack of Abstraction and Generalization
The approach of DNNs, which leverages massive data and processing power to recognize and extrapolate from particular patterns, struggles to generalize to domains that do not demonstrate those specific patterns [02:14:06]. This is known as a “knowledge representation issue” [02:29:06]. The knowledge is represented as a catalog of weighted particulars, with no inherent attempt to abstract fundamental principles [02:36:06].
The ability to find concise abstractions of experience is critical for generalization to new or different domains [02:51:06]. Humans, for example, can play a comparatively modest number of war-game sessions (e.g., 2,500–5,000) across many titles and extract generalizations that let them quickly adapt to new, different games [02:58:06], [03:08:06]. In contrast, systems like AlphaGo rely on hundreds of millions of game iterations [03:39:06]. The related capability of learning from very little data is often called “one-shot learning” or “few-shot learning,” something current DNN architectures generally lack, unlike humans or even smart animals like dogs [02:53:06].
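The distinction between a catalog of weighted particulars and a concise abstraction can be sketched in a few lines. This is an illustrative assumption, not a real DNN: a nearest-neighbor “memorizer” is contrasted with a learner that has extracted the simple hidden rule (here, y = 2x) behind the same training data:

```python
# Toy contrast: stored particulars vs. an extracted abstraction.
# Training data secretly follows the rule y = 2 * x.
train = {1: 2, 2: 4, 3: 6, 4: 8}

def memorizer(x: int) -> int:
    """Nearest-neighbor lookup: extrapolates from stored particulars."""
    nearest = min(train, key=lambda k: abs(k - x))
    return train[nearest]

def abstractor(x: int) -> int:
    """Has found the concise rule behind the data."""
    return 2 * x

# Inside the training range both look equally competent...
print(memorizer(3), abstractor(3))      # -> 6 6
# ...but far outside it, only the abstraction generalizes.
print(memorizer(100), abstractor(100))  # -> 8 200
```

The memorizer matches the abstractor wherever the catalog is dense, which is why pattern-lookup systems can appear intelligent on in-distribution benchmarks while failing to transfer to genuinely new domains.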
Missing Creative and Imaginative Leaps
Current DNN architectures tend to bypass the aspects of human intelligence that enable “creative, imaginative leaps” [02:18:06]. While they excel at “repermuting elements from existing images” in image-generation systems like DALL-E, they do not innovate in the way artists like Matisse or Picasso did, who fundamentally rethought art [02:59:06]. The prevailing economic models in the AI industry favor systems that combine existing elements to maximize known metrics, rather than fostering unpredictable creativity [02:47:06], [02:55:06].
Commercial Bias Against AGI Research
The AI industry has largely self-organized to leverage DNNs for commercial value, focusing on narrow applications where systems repeat well-understood operations to maximize defined metrics [02:47:06]. For example, military applications prioritize AI that “obeys doctrine” over AI that can “create, imagine and improvise” [02:07:06]. Similarly, advertising AI focuses on maximizing user clicks, which doesn’t require imaginative capabilities [02:26:06]. This focus on short-term financial returns disincentivizes long-term AGI research and development [02:40:06].
“The economics of modern industry suits itself really, really well to AIs that are good at combining already existing elements to maximize known metrics, and what this means is that the industry of AI doesn’t have that much motivation to flail around doing AGI R&D” [02:50:06]
Research Attention Span
A contributing factor to the lack of progress in AGI is a perceived lack of attention span among younger researchers [02:40:06]. Many are accustomed to running learning algorithms on datasets and getting immediate, “cool” results [02:50:06]. AGI research, however, requires patience and may not yield meaningful feedback for days, weeks, or even years [02:50:06].
Conclusion
While deep neural networks have achieved impressive practical success in narrow AI tasks, their fundamental reliance on shallow pattern recognition, difficulty with true abstraction and generalization, and the economic incentives that favor short-term, predictable outputs, make them ill-suited on their own for achieving human-level AGI [02:38:06]. Progress towards AGI will likely require exploring other, less mainstream approaches or significantly upgrading current DNNs to incorporate more biologically realistic learning mechanisms and structured semantic representations [00:59:06].