Innovative approaches in AI research

From: jimruttshow8596

Artificial General Intelligence (AGI) is an informal term referring to computer systems capable of performing tasks considered intelligent when done by humans, especially those they were not specifically programmed or trained for [01:48:00]. This contrasts with “narrow AI,” which excels at highly particular tasks based on specific programming or data-driven training [02:38:00]. While humans can “take a leap” into loosely connected domains and improvise, current AI, such as AlphaFold for protein folding, struggles with generalization beyond its training data [02:49:00].

Limitations of Current Deep Neural Networks

Deep Neural Networks (DNNs) and other Machine Learning (ML) algorithms, while absorbing most of the AI world’s attention and achieving practical success in narrow AI [02:31:00], are fundamentally unsuited for human-level AGI [13:07:07]. While some researchers believe DNNs are completely misdirected for AGI [14:56:00], others think they are almost there with a few tweaks [15:18:00]. A more moderate view suggests DNNs can serve as significant components of an AGI architecture, but they are missing many key aspects required for human-level intelligence [15:41:00].

A core limitation is that current DNNs behave much like “very clever lookup tables” [17:23:00], recording and indexing patterns in a subtle way that considers their overlap and contextual usefulness [17:01:01]. However, they primarily look at “shallow patterns” in data [17:47:00]. For example, a neural net asked how to fit a large table through a small door might suggest using a “table saw” because it recognizes the words, not understanding that a table saw is a tool for cutting wood, not tables [18:20:00]. This highlights their failure to build an underlying model of reality [20:00:00].

Current AI systems leverage huge amounts of data and processing power to recognize highly particular patterns and extrapolate from them [20:59:00]. This approach struggles to generalize to domains that do not exhibit those specific patterns [21:17:00]. The knowledge representation is a “large catalog of weighted and interdependent and contextualized particulars” with no attempt to abstract [21:33:00]. The ability to find concise abstractions of experience is crucial for generalization [21:55:00]. Humans, by contrast, can make broad generalizations from small data sets, like learning war game strategies from a few thousand sessions across many titles [23:08:00]. This “one-shot learning” or “few-shot learning” is missing in current architectures [25:55:00].

The AI industry’s current focus is largely driven by commercial value, where repeating well-understood operations to maximize metrics is prioritized [27:40:00]. Creative, imaginative, and unpredictable AI is less desired for tasks like ad clicks or military doctrine, which benefits highly leveraged deep neural nets [28:19:00]. This economic pressure has led to AGI research remaining on the margins [30:02:00].

Three Viable Paths to True AGI

Despite the challenges, several alternative approaches offer promising avenues for achieving AGI.

1. Cognitive Level Approach: Hybrid Neural Symbolic, Evolutionary Metagraph-Based AGI

This approach, exemplified by the OpenCog project, aims to emulate the human mind’s high-level cognitive functions using advanced computer science algorithms [33:30:00]. Instead of direct biological simulation, it takes inspiration from the human mind’s functions like perception, action, planning, working memory, long-term memory, and social reasoning [35:06:00].

A key concept is “cognitive synergy,” where distinct cognitive functions within the AI architecture transparently assist each other’s internal processing [36:38:00]. This is achieved by centering the system on a large distributed Knowledge Graph (a hypergraph or metagraph) [37:57:00]. Different AI algorithms (for perception, action, memory, reasoning, learning) act on this common graph, with new mathematical methods making them subcases of a single meta-algorithm [39:32:00].

This approach addresses criticisms of “good old-fashioned AI” (GOFAI) by:

Using logic-based knowledge representation but incorporating uncertainty (fuzzy, probabilistic logic) and allowing for contradictions [43:57:00].
Not depending on hand-coding common sense knowledge, instead relying on learning [44:45:00].
Integrating evolutionary learning, both implicitly (e.g., attention-driven premise selection where importance acts like fitness) and explicitly (e.g., genetic algorithms for procedure learning or creativity) [47:20:00]. The core system’s “autopoiesis” (self-organization) and evolutionary dynamics are fundamental [50:22:00].

2. Brain Level Approach: Large-Scale Non-linear Dynamical Brain Simulation

This path involves simulating the brain’s non-linear dynamics, distinct from current deep neural networks [51:25:00]. While understanding the human brain is intrinsically interesting, current measuring instruments are insufficient to fully reverse-engineer its complex processes, particularly abstraction formation [52:48:00]. The brain involves more than just neurons; glia, astrocytes, cellular diffusion, and potentially quantum biology also play roles [54:11:00].

Despite these limitations, there’s a lack of large-scale brain simulations aimed at producing intelligent behavior [54:46:00]. Pioneering work by Gerald Edelman and Eugene Izhikevich on chaos theory-based neuron models, more biologically realistic than those in modern DNNs, showed how disparate neurons could be holistically bound by sub-threshold charge leakage [55:31:00].

Recent developments, such as Alex Ororbia’s predictive coding-based learning mechanism, offer a potential “backpropagation killer” that could work with more biologically realistic neurons (e.g., Izhikevich neurons that account for sub-threshold spreading of activation) and glia [57:41:00]. This could lead to neural nets with better generalization and more compact models, allowing for cleaner interfaces with symbolic systems like OpenCog [01:10:00].

A major hurdle for full brain simulation is the inadequacy of Von Neumann (serial) computing architectures [01:39:00]. The brain’s power comes from its inherent parallelism [01:51:00]. While GPUs provided a degree of parallelization for DNNs [01:02:01], more specialized parallel hardware is needed for complex brain simulations [01:04:02]. The development of specialized chips (e.g., for Izhikevich neurons or graph pattern matching) is becoming more viable [01:05:06], potentially leading to “AGI boards” integrating various specialized processors [01:30:00].

3. Chemistry Level Approach: Massively Distributed AI-Optimized Artificial Chemistry Simulation

This approach originates from artificial life research, extending beyond genetic algorithms to simulate entire artificial organisms with artificial metabolisms and genomes within simulated worlds [01:16:32]. Recognizing the complexity of biological systems (DNA triggering RNA, methylation, epigenomics, protein catalysis), it seeks to capture the spirit of biochemistry [01:17:33].

Inspired by work like Walter Fontana’s algorithmic chemistry, this involves creating “little list codelets” or programs that act on each other to produce new programs in complex reaction chains [01:19:05]. The motivation is to evolve the underlying “chemistry” or representation itself, potentially finding a more expressive basis for intelligence than biological evolution’s arbitrary chemistry on Earth [01:21:03].

Two sub-approaches exist:

Realistic chemistry simulation: Simulating actual chemistry/biochemistry (e.g., Bruce Damer’s EvoGrid project for the origin of life) [01:22:46]. This requires immense compute resources [01:26:21].
Abstracted algorithmic chemistry: Making an algorithmic chemistry that doesn’t try to be a real chemistry simulation, potentially requiring fewer compute resources [01:26:59]. This could involve modern programming languages like Meta (Meta-type talk) used in OpenCog Hyperion [01:27:48].

A crucial enhancement is using an AI observer system (machine learning) to study the evolving chemical soup, identify promising “vats” or configurations, and guide the evolution [01:29:10]. This “directed chemical evolution” leads to hybrid architectures where the algorithmic chemistry is accelerated by machine learning and proto-AGI [01:30:38]. This approach also has a beautiful decentralized aspect, envisioning millions of people running virtual algorithmic chemistry simulations analyzed and refreshed by a central AGI system [01:32:04].

Again, the challenge of truly parallel hardware for simulating physical chemistry or analogous algorithmic chemistry remains [01:33:38]. However, creative exploration into nanometer-scale computing infrastructures could open new avenues [01:36:12].

Conclusion: Supporting Diverse AGI Research

All three approaches are interesting and warrant far more attention and resources than they currently receive [01:25:01]. While the cognitive-level approach (hybrid AGI) is currently seen as the most likely path to AGI, it can still integrate ideas from other paradigms, such as biologically realistic neural nets for perception or algorithmic chemistry for creative idea generation [01:38:36].

The amount of funding needed for AGI research is substantial but pales in comparison to global spending on less impactful endeavors [01:40:05]. Increased funding, potentially on the scale of hundreds of billions of dollars, could massively accelerate AGI R&D [01:41:04]. This could come from governments improving research funding distribution [01:52:00], or through a cultural shift towards citizen science and open-source development for AGI, as more people recognize AGI’s viability and the limitations of big tech’s current approaches [01:47:01]. Such a shift could be part of broader positive cultural changes for the greater good [01:49:40].

Tubegraph

Explorer

Table of Contents