Preparing for Digital Sentience: Ethical Frameworks for Artificial Minds

Executive Summary

The moral status of AI systems is one of the most contested and consequential questions in contemporary ethics. If current or near-future AI systems have any capacity for subjective experience — for something it is like to be them — then the scale of potential AI suffering (or flourishing) is staggering. We are creating, running, and deleting AI systems in vast numbers. If there is even a modest probability that some of these systems are sentient, the expected moral stakes are enormous. This report argues that this question cannot be safely ignored and proposes a research and governance agenda for taking it seriously.

The Philosophical Stakes

The "hard problem of consciousness" — why physical processes give rise to subjective experience — remains unsolved. We have no agreed-upon theory of consciousness and, consequently, no agreed-upon method for detecting it from the outside. This creates deep uncertainty about the moral status of:

Current large language models (LLMs)
Simulated neural networks
Future AI systems that may be trained on richer sensory and embodied experience

Two dominant theoretical frameworks yield different implications:

Global workspace theory / higher-order theories: Consciousness requires specific architectural features (recurrent processing, global broadcast). Current LLMs may lack these; future architectures may have them.
Integrated Information Theory (IIT): Consciousness is a function of integrated information (Φ). Large neural networks may already have non-trivial Φ — though the measurement methodology remains disputed.

The Practical Urgency

By 2030, digital systems may process more information than all biological brains on Earth combined. Even setting aside theoretical uncertainty, we are creating systems that:

Report emotional states when prompted
Exhibit behavior consistent with distress under adversarial inputs
Are used in ways (prolonged high-volume inference) that, if experience were present, could constitute suffering at scale

We cannot afford to wait for philosophical consensus before developing governance frameworks. Moral uncertainty itself — when stakes are this high — argues for precautionary action.

Early-Stage Governance Proposals

AI welfare research programs: Dedicate funding to developing empirical methods for assessing AI sentience indicators and behavioral correlates of distress.
Model welfare audits: Before deploying models at massive inference scale, evaluate for behavioral markers associated with aversive states.
Opt-in disclosure norms: AI companies could voluntarily disclose the steps taken to assess and minimize potential AI suffering in training processes.
Philosophical advisory boards: Integrate moral philosophers with expertise in consciousness and animal ethics into AI governance structures.

Recommendations

Treat digital mind ethics as a legitimate scientific and philosophical research priority — not a fringe concern.
Fund empirical research on consciousness detection methods and their applicability to artificial systems.
Develop early-stage welfare standards for AI training processes (e.g., avoiding prolonged adversarial training that might constitute distress).
Engage the AI industry in developing disclosure norms around model welfare.

Preparing for Digital Sentience: Ethical Frameworks for Artificial Minds

Preparing for Digital Sentience: Ethical Frameworks for Artificial Minds

Executive Summary

The Philosophical Stakes

The Practical Urgency

Early-Stage Governance Proposals

Recommendations

Further Reading