The evolution of neural networks and machine learning

From: mk_thisisit

Artificial Intelligence (AI) is rapidly becoming the central topic of conversation globally, with machines being built that are demonstrably smarter than humans [00:00:00]. While AI is expected to solve many existing problems, it will also create new ones [00:00:24].

Neural Networks: Beyond Statistics

Wojtek Zaręba, a co-creator of ChatGPT, refutes the idea that AI is “only statistics” [00:01:12]. He argues that current models exhibit abilities that go beyond simple statistical recall:

Reasoning and Generalization [00:01:34]: AI models can solve problems requiring reasoning that they have never explicitly seen, demonstrating an ability to generalize intelligently across very different data [00:01:56].
- Example: A model trained on images of numbers with a black background can still recognize those numbers if a single black pixel is changed to white [00:03:07]. This occurs despite the new data being “completely different” from the training data [00:03:56].
Empirical vs. Theoretical Understanding [00:02:27]: There is an empirical understanding of how neural networks achieve such results, but a deep theoretical understanding of why this happens is lacking [00:02:36]. For instance, unlike many traditional models (e.g., decision trees), transformers do not “fall apart” when moved away from training data [00:04:37].

Limitations and Contextual Understanding in AI

Despite advancements, AI still faces significant limitations, particularly in contextual understanding:

Spatial and Cultural Context: While humans perceive the world spatially and understand context, models often struggle. An example is an autonomous taxi that drove into a sacred zone during a Corpus Christi procession in San Francisco’s Chinatown. The model knew to stop at a distance from people but did not understand the cultural significance of the “sacred zone” [00:05:35].
Multimodal Data: The lack of contextual understanding in models trained solely on image data highlights the need for multimodal inputs (images, text, video, sound, voice) for a deeper comprehension of the world [00:06:57].
Sensory Experience: Models currently lack “senses” akin to humans, and their data acquisition differs fundamentally [00:08:34].
Training Differences (Human vs. Machine Intelligence):
- Human Brain: Training and testing are integrated into one continuous process. Humans learn from their own real-world experiences, including positive and negative outcomes [00:09:42].
- Computers: Typically, training and testing are separate processes. Models learn primarily from vast datasets collected by others [00:09:37]. For example, a robot doesn’t learn to avoid stairs because it fell; it learns because someone recorded such an event and fed it the data [00:10:32].
- Reinforcement Learning: An exception where models learn from their own experience, such as those that mastered Rubik’s Cube manipulation or games like Dota and StarCraft [00:10:59].
Limits of Physics and Knowledge: A fundamental limitation is that models cannot be fed data about phenomena we don’t understand (e.g., the source of gravity, quantum effects, or how human consciousness is born) [00:12:07]. Creating a faithful copy of reality in a model is challenging if reality’s functions are unknown to us [00:12:47].

OpenAI’s Classification of AI Development Levels

OpenAI considers two main processes for imparting knowledge to models:

Large-scale Data Collection: Training models to predict the next word in vast datasets (like the internet) enables them to understand what humanity has already understood [00:13:00].
Reinforcement Learning: Rewarding models for correct behavior in limited domains. This approach allowed models to discover moves in Go that humans had not in thousands of years [00:13:41].

Based on these, OpenAI classifies AI development into five levels:

Level 1: Conversational (Turing Test):
- Models can hold conversations and pass the Turing Test, making it difficult for a person to distinguish between talking to a human or a computer [00:14:56]. Current language models are approaching this level, with distinctions sometimes only noticeable through response delays [00:15:22].
Level 2: Reasoning:
- Models can solve problems requiring approximately 10 minutes of complex reasoning, like non-trivial mathematical or scientific tasks [00:15:50]. This level requires deep understanding beyond immediate answers [00:16:24].
Level 3: Agents:
- Models act as “agents” capable of performing multi-hour or multi-day tasks in the real world [00:17:02]. For example, being instructed to build a website, the agent would handle domain acquisition, coding, deployment, and even communication (e.g., sending design options or email updates) [00:17:15]. This is a significant leap from current models like ChatGPT, which quickly get lost with multi-step actions [00:18:00].
Level 4: Scientist:
- Models function as scientists, dedicating months to deep thought, re-examining existing assumptions, and inventing new things [00:19:02]. This implies questioning fundamental beliefs, similar to Albert Einstein realizing that time might not be constant [00:19:35].
Level 5: Organizational:
- AI becomes competent enough to autonomously run entire organizations, managing planning, analysis, and decision-making for entities like a 1000-person company [00:20:17].

Reaching Level 5 is predicted to occur in less than 10 years, reflecting an accelerating pace of technological innovation analogous to human history, where stages of development (e.g., cities, industrialization, computers, internet) have progressively shortened [00:23:31].

The Concept of Consciousness and AI

The discussion extends to whether AI can achieve consciousness:

Definition of Consciousness: The human brain receives information as electrical signals (bits) and creates an “immersive simulation of reality” [00:35:10]. Consciousness is described as our experience of this simulation [00:35:18].
Philosophical Zombie: This thought experiment describes a being that behaves exactly like a conscious person but lacks internal subjective experience [00:35:30]. The question is whether AI could possess this “internal cinema” [00:35:46].
Hypothesis on Self-Awareness: It is hypothesized that at some point, a model within a simulation might begin to simulate its own existence, leading to a form of self-awareness [00:37:42]. This “click” could be when the model starts to understand its own participation in changing reality [00:38:09].
Building Consciousness: If consciousness is understood, it can be built [00:39:55]. If it’s not understood, it’s difficult to build. The idea that consciousness results from quantum effects (as suggested by Roger Penrose) is noted [00:40:06].
Thought Experiments on AI Consciousness:
1. Train a model on data excluding any mention of consciousness. If it then spontaneously discusses having such experiences, it could hint at consciousness [00:40:27].
2. Connect AI to a human brain. If the human’s consciousness expands, it doesn’t necessarily mean the AI itself is conscious (similar to how psychedelics can expand consciousness without being conscious themselves) [00:41:25].
AI as a “Power Bank” for the Brain: The possibility of models serving as a “power bank” for the brain is considered plausible [00:42:15].

Challenges and Future of AI

Defective Nature of Current AI: While the human brain operates on ~20 watts, current language models require exponentially more power (10^9 watts) [00:27:40]. This suggests that current AI is “defective” compared to biological efficiency [00:28:01].
- This is likened to comparing a bird to an airplane: birds are efficient and acrobatic, while airplanes are heavy but can carry hundreds of people across oceans [00:28:09]. AI models might be similar: less efficient but capable of massive tasks [00:28:46].
Evolution and Data Efficiency: The human brain’s efficiency is partly due to DNA, which contains information on how to effectively use reality, a product of billions of years of evolution [00:29:43]. Evolution itself is a computationally powerful process that has discovered general intelligence multiple times (humans, elephants, ravens, dolphins) [00:31:01]. Current AI models require huge datasets for initial training, but then gain the “incredible property” of learning quickly within a single conversation [00:32:14]. The ultimate goal is a model that can solve new, complex problems (like global warming) with minimal new data [00:33:13].
Hallucinations in Models: Current training methods (predicting the next word, then post-training with human preferences) lead models to always provide an answer, even if they don’t know it. Humans tend to reward models for providing answers, even if they’re guesses, rather than admitting ignorance [00:43:45].
- Solution: Train models to express certainty (probabilities) and understand the boundary between their knowledge and ignorance [00:45:50].
Memory in Neural Networks: Like a character with short-term memory loss (e.g., in “Memento”), current neural networks remember within a single conversation, but then the memory disappears [00:51:22].
- Future: Models need to “live” longer, either by having a context window long enough to encompass a “whole life of experience” or by incorporating new algorithms for continuous learning and weight updates based on new interactions [00:51:38]. This would give them more “sense of self” and continuity [00:53:10].
Energy and Computing Power: Currently, the limitation is a lack of computing power. In the future, it might shift to a lack of energy [00:53:50]. However, just as computers have evolved from room-sized machines to powerful mobile phones, AI will likely undergo significant optimization to become more economical [00:54:17].

Phases of AI Development (Artificial Intelligence Evolution)

Wojtek Zaręba outlines three phases of AI development:

Product Phase: Current stage where companies create and integrate AI products into most software [00:55:18].
Geopolitical Phase: Countries recognize that investment in AI is crucial for their geopolitical position [00:55:29]. Within a year and a half (perhaps 2025-2026), AI will likely be the main topic of global conversation [00:56:32]. This phase will see many “agents” doing different things, potentially impacting the labor market [00:56:52].
Superintelligence Phase: Machines become significantly smarter than humans [00:58:25]. At this stage, international cooperation will become critical [00:58:41]. Such superintelligence could create new chips, deeply understand scientific literature, invent new things, and even run virtual companies [00:59:23].

Risks and Mitigation

AI brings inherent risks:

Misuse (Misi): AI can be used for negative purposes, such as deepfakes, military applications, or creating biological pandemics. A key concern is that AI could significantly increase the number of people capable of synthesizing dangerous viruses [01:17:13]. AI can also assist in hacking, nuclear, or chemical weapon development [01:18:55].
- Mitigation Efforts (OpenAI): OpenAI uses a framework called “PR” to assess model capabilities across categories like biological risk, chemical, nuclear, cybersecurity, and persuasion (convincing people) [01:20:21]. They define levels (low, medium, high, critical) and work to reduce risk levels (e.g., from high to medium or low) [01:21:46].
AI Race: Dangers arise from organizations intensely competing to develop AI [01:22:43].
Accidents: Unintentional negative outcomes due to inattention [01:22:58].
Alignment/Control (Misalignment): Ensuring that powerful AI models with broad skills actually “listen to us” and behave as expected [01:24:27].

Zaręba is optimistic about minimizing negative applications, believing specific protective steps can be taken [01:19:45].

Societal Impact and Worldcoin

Impact on Society: AI’s integration into society will be non-trivial [00:00:26]. While some might resist (like the Luddite movement destroying textile machines due to job fears), historical trends show that technological development is difficult to stop, and it has often addressed societal problems [01:06:23].
Worldcoin: This project, co-founded by OpenAI’s Sam Altman, aims to create a system for distributing future AI-generated prosperity and addressing issues like distinguishing humans from bots [01:08:44]. It identifies individuals uniquely through iris scans (stored cryptographically) to ensure fair distribution and prevent one person from taking the value of many [01:14:13]. The project emphasizes decentralization to prevent centralized attacks [01:15:42].
- This initiative reflects a potential shift from “zero-sum games” (where one gains at another’s expense) to a “positive-sum game” where prosperity is vastly expanded, changing the dynamics of what is considered right or wrong [01:12:04].

Wojtek Zaręba’s Journey and Philosophy

Wojtek Zaręba, a co-founder of OpenAI, hails from Kluczbork, Poland [01:27:55]. He entered the field when AI was less developed, knowing many of the pioneers from conferences [01:28:08]. His academic background (PhD in AI from New York) and strong publications positioned him to join OpenAI when Greg Brockman approached him [01:28:29].

He attributes his belief in AI’s potential to a 2012 ImageNet competition where Geoffrey Hinton’s team demonstrated neural networks’ superior results, vastly outperforming other approaches [01:31:18]. This convinced him of the technology’s capabilities [01:33:04].

Zaręba led the robotics group at OpenAI, where they successfully trained a network to solve a Rubik’s Cube with a robot hand, a complex task that could not be directly programmed [01:34:18]. His work also contributed significantly to projects like Copilot, which uses AI to generate code, aiding millions of programmers [01:37:17].

He defines himself primarily as a scientist, also acknowledging elements of a futurologist and philosopher [01:39:32]. His personal philosophy emphasizes kindness, close relationships, and the importance of love, drawing from long-term studies on human well-being [01:40:28]. He has financially supported his old high school, establishing a lab and scholarships to provide opportunities for smart students who might lack them [01:44:21].

His dream for the future of AI is to unlock its incredible opportunities for humanity, hoping for decisions that foster happiness and peace, pushing towards a better collective future [01:47:03].

Tubegraph

Explorer

Table of Contents