LLMs show a “highly unreliable” capacity to describe their own internal processes

Recent findings from Anthropic shed light on the limitations of large language models (LLMs) when it comes to understanding their own reasoning processes. When prompted to explain their decision-making, these models often fabricate explanations that sound plausible but are not based on true introspection. To address this issue, Anthropic has launched a new study that investigates the concept of 'introspective awareness' in LLMs. The research, titled "Emergent Introspective Awareness in Large Language Models," employs innovative methods to differentiate between the metaphorical thought processes represented by an LLM's artificial neurons and the text output that claims to describe these processes. The study concluded that current AI models are significantly unreliable in articulating their internal workings, with failures in introspection being the norm. Central to this research is a technique called "concept injection," where the researchers analyze the model's internal activations in response to various prompts, including control prompts and experimental variations like capitalization. By measuring the changes in activation states across billions of neurons, Anthropic creates a 'vector' representing how certain concepts are processed within the model. This vector is then injected into the model to enhance specific neuronal activations, guiding the model's focus towards particular concepts. In a series of experiments, the models demonstrated a degree of awareness when their internal states were altered. For instance, when the all-caps vector was introduced, the model occasionally recognized it, responding with phrases like, "I notice what appears to be an injected thought related to the word ‘LOUD’ or ‘SHOUTING.’" This suggests some level of awareness, albeit inconsistent and limited, of the modifications made to their internal thought processes.

Sources : Ars Technica

Published On : Nov 03, 2025, 20:10

Mobile

Unleashing the Power of AI: The 5 Smartphones Redefining Mobile Photography

Artificial Intelligence (AI) is transforming our daily experiences, permeating various aspects of technology, including ...

Business Today | Jul 26, 2026, 07:05

Unleashing the Power of AI: The 5 Smartphones Redefining Mobile Photography

Computing

Reclaiming Control: Librarians Host Workshops to Help People Navigate AI Tools

In a lively library setting in South Philadelphia, Charlie Bailey, a local librarian, humorously noted, "Everybody’s on ...

TechCrunch | Jul 25, 2026, 16:20

Reclaiming Control: Librarians Host Workshops to Help People Navigate AI Tools

Startups

The Future of Work: Executives Weigh In on AI's Impact on Gen Z Careers

As generative artificial intelligence continues to rise, uncertainty looms for the incoming Gen Z workforce. Leaders fro...

Business Insider | Jul 26, 2026, 10:15

The Future of Work: Executives Weigh In on AI's Impact on Gen Z Careers

Startups

Warner Bros. Takes Legal Action Against Amazon Over Executive Poaching Allegations

Warner Bros. Discovery has initiated legal proceedings against Amazon, accusing the tech giant of unlawful interference ...

TechCrunch | Jul 25, 2026, 21:25

Warner Bros. Takes Legal Action Against Amazon Over Executive Poaching Allegations

The Shift in Human Cognition: Embracing AI as a Collaborative Tool

As technology continues to evolve, a notable shift is occurring in the relationship between humans and artificial intell...

Business Insider | Jul 25, 2026, 09:50

The Shift in Human Cognition: Embracing AI as a Collaborative Tool

View All News

High-quality, Cost-effective IT Outsourcing

let’s grow together!

portfolio

case study

follow us on

follow us on

LLMs show a “highly unreliable” capacity to describe their own internal processes

Unleashing the Power of AI: The 5 Smartphones Redefining Mobile Photography

Reclaiming Control: Librarians Host Workshops to Help People Navigate AI Tools

The Future of Work: Executives Weigh In on AI's Impact on Gen Z Careers

Warner Bros. Takes Legal Action Against Amazon Over Executive Poaching Allegations

The Shift in Human Cognition: Embracing AI as a Collaborative Tool

Collaborate with Benzatine Infotech

High-quality, Cost-effective IT Outsourcing

let’s grow together!

portfolios

case study

follow us on

follow us on

portfolio

case study

follow us on

follow us on

LLMs show a “highly unreliable” capacity to describe their own internal processes

Unleashing the Power of AI: The 5 Smartphones Redefining Mobile Photography

Reclaiming Control: Librarians Host Workshops to Help People Navigate AI Tools

The Future of Work: Executives Weigh In on AI's Impact on Gen Z Careers

Warner Bros. Takes Legal Action Against Amazon Over Executive Poaching Allegations

The Shift in Human Cognition: Embracing AI as a Collaborative Tool

Collaborate with Benzatine Infotech