Skip to main navigation Skip to search Skip to main content

Distinguishing Concreteness Differences in LLM Representations via Linear Probing

Research output: Chapter in Book/Conference proceedingConference contributionpeer-review

Abstract

Large language models encode rich semantic information, but how concreteness is represented across layers remains unclear. We examine layer-wise linear separability of concreteness by training linear probes on hidden representations from two opensource model families at multiple scales: Qwen3 and Gemma3-
Instruct. Using human concreteness ratings, we build balanced prompt datasets with four difficulty levels: an extreme abstract–concrete contrast and three finer boundary comparisons at the abstract end, mid-range, and concrete end. Probes achieve high accuracy on the extreme contrast in shallow layers, showing that endpoint differences are strongly linearly separable. For finer distinctions, performance follows a stable hierarchy: midrange concreteness is easiest to separate, abstract-end distinctions are hardest, and concrete-end distinctions are intermediate. Across models, accuracy rises rapidly in early layers, peaks in
middle layers, and declines in later layers. Together, these findings clarify how the linear accessibility of concreteness varies across LLM layers.
Original languageEnglish
Title of host publicationProceedings of CogSci 2026, the Annual Meeting of the Cognitive Science Society. Rio de Janeiro, July 2026
Publication statusAccepted/In press - 8 Apr 2026

Keywords

  • Large Language Model
  • Abstract Concept
  • Linear Probing

Fingerprint

Dive into the research topics of 'Distinguishing Concreteness Differences in LLM Representations via Linear Probing'. Together they form a unique fingerprint.

Cite this