From: jimruttshow8596
The field of Artificial Intelligence (AI) is advancing rapidly, particularly in the realm of Artificial General Intelligence (AGI). The pace of change is likened to the emergence of personal computers in the late 1970s and early 1980s, but happening “10 times faster” [00:01:25]. This exponential acceleration, as projected by Ray Kurzweil, is occurring unevenly across the various areas of AI research [00:02:07].
Dr. Ben Goertzel, a leading authority on AGI and instigator of the OpenCog project, believes the unfolding of AGI is accelerating [00:02:52]. He puts a high probability on AGI arriving within five years, and has raised his earlier 50/50 estimate to roughly 60/40 in favor [00:03:26].
Defining Artificial General Intelligence (AGI)
Defining AGI is a complex and highly debated topic, similar to how biology lacks a universally agreed-upon definition for “life” [00:21:39].
One perspective, rooted in algorithmic information theory and statistical decision theory, views AGI as the ability to achieve a vast variety of goals in diverse environments [00:22:05]. Marcus Hutter and Shane Legg (a co-founder of DeepMind) formalized this as a weighted average of how well a reinforcement learning system achieves all computable reward functions [00:22:36]. By that definition, however, humans are, in Goertzel’s phrase, “complete retards” at optimizing arbitrary reward functions [00:23:25].
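Formally, the Legg–Hutter “universal intelligence” measure (paraphrased here; see their 2007 paper for the exact statement) scores a policy $\pi$ by a complexity-weighted sum of its expected reward across all computable environments:

$$
\Upsilon(\pi) = \sum_{\mu \in E} 2^{-K(\mu)}\, V_\mu^{\pi}
$$

where $E$ is the set of computable reward-bearing environments, $K(\mu)$ is the Kolmogorov complexity of environment $\mu$ (so simpler environments receive more weight), and $V_\mu^{\pi}$ is the expected cumulative reward that $\pi$ earns in $\mu$.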
Another, more philosophical approach, such as Weaver’s theory of open-ended intelligence, views intelligence in terms of complex self-organizing systems that maintain their existence and boundaries while transforming themselves [00:24:26].
Human-Level General Intelligence
When discussing human-level or human-like AGI, the focus shifts to specific human capabilities [00:24:50]. While IQ tests offer a measure, they are considered imperfect for assessing true human intelligence [00:25:46]. More multifactorial views, like Gardner’s theory of multiple intelligences (musical, literary, physical, existential, logical), offer a closer approximation [00:26:07]. Ultimately, the field of psychology doesn’t provide a rigorous, data-driven assessment of human intelligence [00:26:21].
The Turing Test, which assesses an AI’s ability to imitate a human in conversation, was never considered a strong measure of general intelligence, as “fooling people can be disturbingly easy” [00:26:38]. With current AI systems approaching its capabilities without true AGI, it is no longer taken seriously as an AGI benchmark [00:27:16].
Limitations of Current AI: Large Language Models (LLMs)
Large Language Models (LLMs) in their current form (transformer networks trained to predict the next token) are not expected to lead to full human-level AGI [00:04:53]. However, they perform many impressive and useful functions and can serve as valuable components of AGI systems [00:05:14].
The fundamental limitations of LLMs stem from their architecture, which primarily recognizes surface-level patterns in data [00:32:37]. This leads to several key weaknesses:
Hallucination Problem
LLMs are known for “hallucinating” or making up facts, especially when asked obscure questions [00:09:42]. While models like GPT-4 have improved, this remains a challenge [00:09:58].
[!NOTE] Proposed Solutions for Hallucination
- Probing the Network: It may be possible to solve hallucination by analyzing the network’s internal activation patterns to detect when it’s hallucinating, allowing for filtering [00:11:32].
- Entropy/Paraphrasing: Correct answers tend to have different entropy than incorrect ones [00:14:00]. Asking an LLM to paraphrase a query multiple times and comparing the consistency of answers can help detect hallucinations [00:14:33].
While these solutions are useful for practical applications, they don’t necessarily advance AGI, as human hallucination avoidance stems from a “reality discrimination function” and reflective self-modeling [00:12:19].
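As a rough illustration of the paraphrasing idea above (a generic sketch, not a method endorsed in the episode), the helper below asks a model the same question several ways and treats low agreement among the answers as a hallucination warning; `ask_llm` is a hypothetical stand-in for whatever LLM client is available.

```python
from collections import Counter
from typing import Callable

def consistency_score(question: str,
                      ask_llm: Callable[[str], str],
                      n_variants: int = 5) -> float:
    """Estimate how consistently a model answers rewordings of one question.

    ask_llm is any function that sends a prompt to a language model and
    returns its reply (hypothetical stand-in for a real client). Fabricated
    answers tend to drift across rewordings, so a low score is a warning
    that the model may be hallucinating.
    """
    # Generate paraphrases of the question, then answer every variant.
    variants = [question] + [
        ask_llm(f"Reword this question without changing its meaning: {question}")
        for _ in range(n_variants - 1)
    ]
    answers = [ask_llm(v).strip().lower() for v in variants]

    # Agreement = share of answers that match the most common answer.
    most_common_count = Counter(answers).most_common(1)[0][1]
    return most_common_count / len(answers)
```

A real detector would compare answers semantically (via embeddings or an entailment model) rather than by exact string match; the string version just keeps the idea visible.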
Lack of Complex Multi-step Reasoning
LLMs struggle with complex, multi-step reasoning required for tasks like writing an original science paper [00:30:11]. While they can “turn the crank” on advanced math given an initial idea, they cannot originate novel scientific concepts or discern the “aesthetic” quality of mathematical definitions that lead to useful theorems [00:39:53]. This limitation is tied to their fundamentally derivative and imitative character [00:33:20].
Lack of Original Artistic Creativity
LLMs also exhibit a “banality” in their output, as they average over existing human utterances [00:34:17]. While clever prompting can push them somewhat beyond that average and produce results comparable to a professional journeyman’s first draft (e.g., movie scripts, blues guitar solos), they cannot achieve the groundbreaking creativity of an Einstein, Thelonious Monk, or Jimi Hendrix [00:35:30]. They cannot invent new musical styles or fundamentally surprising scientific theories [00:31:40].
Human intelligence, particularly the ability to abstract, is guided by “agentic nature” – the need to survive, reproduce, and self-transform within an environment [00:42:25]. This agentic drive leads to the development of heuristics and abstractions that allow for adaptation to new situations [00:44:34].
Different Paths to AGI Development
The pursuit of AGI is currently a genuine race among large companies [00:20:06]. Different approaches are being explored:
Neural Net Universe (e.g., DeepMind, Google)
One promising direction involves enhancing existing neural network architectures [00:48:11]:
- Increased Recurrence: Adding more recurrence into Transformer networks, similar to LSTMs, could foster deeper abstractions [00:47:13] (see the sketch after this list).
- Alternative Training Methods: Replacing or complementing backpropagation with methods like predictive coding could improve training for complex recurrent networks [00:47:56].
- Hybrid Architectures: Combining elements like AlphaZero (for planning) with neural knowledge graphs (as in the Differentiable Neural Computer) and recurrent Transformers could be powerful [00:48:38]. Google and DeepMind are ideally suited for this due to their expertise in these areas [00:48:47].
- Minimum Description Length Learning: Yoshua Bengio’s group is exploring neural nets explicitly designed to learn abstractions through minimum description length principles, coupled with Transformers [00:49:49].
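As a toy sketch of the “increased recurrence” idea above (illustrative only, not DeepMind’s or Goertzel’s design), the block below re-applies a single shared Transformer layer several times, Universal-Transformer style, so the same weights are used recurrently over depth:

```python
import torch
import torch.nn as nn

class RecurrentTransformerBlock(nn.Module):
    """Toy illustration: apply one shared Transformer layer repeatedly.

    Reusing the same layer across several steps gives the network a simple
    form of recurrence over depth, one way to encourage deeper, more
    abstract intermediate representations than a single feed-forward pass.
    """
    def __init__(self, d_model: int = 256, nhead: int = 4, steps: int = 4):
        super().__init__()
        self.layer = nn.TransformerEncoderLayer(d_model, nhead, batch_first=True)
        self.steps = steps

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, sequence, d_model); the same weights are applied `steps` times.
        for _ in range(self.steps):
            x = self.layer(x)
        return x

tokens = torch.randn(2, 10, 256)           # dummy batch of embedded tokens
out = RecurrentTransformerBlock()(tokens)  # same shape: (2, 10, 256)
```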
OpenCog/Hyperon Approach
Dr. Goertzel’s OpenCog Hyperon project represents a different approach, prioritizing a self-modifying, self-rewriting metagraph at its core [00:55:54].
[!INFO] OpenCog Hyperon’s Core Philosophy
- Weighted Labeled Metagraph: The central component is a highly flexible graph structure where links can connect multiple nodes, point to other links or subgraphs, and be typed and weighted [00:54:59].
- Knowledge Representation: This metagraph represents various forms of knowledge (episodic, declarative, procedural, attentional, sensory) and cognitive operations (reinforcement learning, logical reasoning, sensory pattern recognition) [00:55:22].
- Meta Programs: Learning programs themselves are represented as sub-metagraphs within the larger graph, enabling them to act on, transform, and rewrite chunks of the same metagraph they reside in [00:55:54].
- Reflection: Unlike LLMs, OpenCog is highly oriented towards reflection, recognizing patterns within its own mind, processes, and execution traces, and representing those patterns internally [00:57:07].
- Integration of AI Paradigms: This framework naturally accommodates historical AI paradigms like logical inference and evolutionary programming, as well as new approaches like “mutually rewriting sets of rewrite rules” [00:58:13].
- LLMs as Supporting Actors: LLMs can exist on the periphery of this system, feeding into and interacting with the metagraph, but are not the central hub [00:58:41].
This approach is considered “least humanlike” but offers a “really short” path from human-level AGI to superhuman AGI because the system is designed for self-rewriting its own code [00:59:41]. It is also well-suited for scientific discovery and artistic creativity due to its support for logical reasoning and evolutionary learning [01:00:18].
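For concreteness, here is a minimal, hypothetical data-structure sketch of a weighted labeled metagraph in the sense described above. It is not Hyperon’s actual Atomspace API, only an illustration of links that are typed, weighted, and able to target nodes or other links:

```python
from dataclasses import dataclass
from typing import Union

@dataclass(frozen=True)
class Node:
    """A named vertex in the metagraph."""
    name: str

@dataclass(frozen=True)
class Link:
    """A typed, weighted hyperedge.

    Unlike an ordinary graph edge, its targets may be nodes *or* other
    links, so links can point at links (and hence at whole subgraphs).
    """
    link_type: str
    targets: tuple["Atom", ...]
    weight: float = 1.0

Atom = Union[Node, Link]

# Tiny example: a weighted "Inheritance" link, plus a second link that
# talks *about* the first one -- the reflective step a metagraph allows.
cat = Node("cat")
animal = Node("animal")
inherit = Link("Inheritance", (cat, animal), weight=0.9)
believed = Link("BelievedBy", (inherit, Node("system")), weight=0.7)
```

Because links can reference other links, programs stored this way can themselves be targets of further links, which is what makes the self-referential, self-rewriting behavior described above expressible.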
Challenges and Future Outlook
A primary challenge for the OpenCog Hyperon project is scalability of infrastructure [01:00:45]. Just as powerful multi-GPU servers were crucial for the advancement of LLMs, a scalable processing infrastructure is needed to validate the OpenCog approach [01:01:24]. The project is developing a pipeline from its native language, MeTTa, to highly efficient back ends such as Rholang (a language designed for concurrent execution across many CPU cores) and hypervector math, eventually targeting specialized hardware such as associative processing units (APUs) [01:02:46].
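As background on the “hypervector math” mentioned above (a generic sketch of hyperdimensional computing, not Hyperon’s implementation): symbols become very high-dimensional random vectors, and structure is built from cheap elementwise operations, which map naturally onto massively parallel hardware such as associative processors.

```python
import numpy as np

rng = np.random.default_rng(0)
DIM = 10_000  # hypervectors are very high-dimensional, e.g. ~10k components

def random_hv() -> np.ndarray:
    """A random bipolar (+1/-1) hypervector representing one symbol."""
    return rng.choice([-1, 1], size=DIM)

def bind(a: np.ndarray, b: np.ndarray) -> np.ndarray:
    """Associate two hypervectors (elementwise product); result is dissimilar to both."""
    return a * b

def bundle(*hvs: np.ndarray) -> np.ndarray:
    """Superpose hypervectors (elementwise majority); result is similar to each input."""
    return np.sign(np.sum(hvs, axis=0))

def similarity(a: np.ndarray, b: np.ndarray) -> float:
    """Cosine similarity; near 0 for unrelated hypervectors, higher for related ones."""
    return float(a @ b) / (np.linalg.norm(a) * np.linalg.norm(b))

# Encode "color=red" and "shape=ball" into one record, then query it.
color, red, shape, ball = (random_hv() for _ in range(4))
record = bundle(bind(color, red), bind(shape, ball))
print(similarity(bind(record, color), red))   # high: unbinding recovers "red"
print(similarity(bind(record, color), ball))  # near zero
```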
The hope is that this new infrastructure will enable ancient AI paradigms like logical reasoning and evolutionary programming to operate at scale, and provide a flexible environment for experimenting with novel AI algorithms [01:04:00]. While the Hyperon project may not have advanced as rapidly as LLMs, it is meeting its technical milestones ahead of schedule, with more funding and better tooling available now than in previous decades [01:05:15]. LLMs themselves are proving helpful for various aspects of non-LLM AI projects, contributing to an overall acceleration in the field [01:05:29].