Abstract
Large language models encode rich semantic information, but how concreteness is represented across layers remains unclear. We examine layer-wise linear separability of concreteness by training linear probes on hidden representations from two opensource model families at multiple scales: Qwen3 and Gemma3-
Instruct. Using human concreteness ratings, we build balanced prompt datasets with four difficulty levels: an extreme abstract–concrete contrast and three finer boundary comparisons at the abstract end, mid-range, and concrete end. Probes achieve high accuracy on the extreme contrast in shallow layers, showing that endpoint differences are strongly linearly separable. For finer distinctions, performance follows a stable hierarchy: midrange concreteness is easiest to separate, abstract-end distinctions are hardest, and concrete-end distinctions are intermediate. Across models, accuracy rises rapidly in early layers, peaks in
middle layers, and declines in later layers. Together, these findings clarify how the linear accessibility of concreteness varies across LLM layers.
Instruct. Using human concreteness ratings, we build balanced prompt datasets with four difficulty levels: an extreme abstract–concrete contrast and three finer boundary comparisons at the abstract end, mid-range, and concrete end. Probes achieve high accuracy on the extreme contrast in shallow layers, showing that endpoint differences are strongly linearly separable. For finer distinctions, performance follows a stable hierarchy: midrange concreteness is easiest to separate, abstract-end distinctions are hardest, and concrete-end distinctions are intermediate. Across models, accuracy rises rapidly in early layers, peaks in
middle layers, and declines in later layers. Together, these findings clarify how the linear accessibility of concreteness varies across LLM layers.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of CogSci 2026, the Annual Meeting of the Cognitive Science Society. Rio de Janeiro, July 2026 |
| Publication status | Accepted/In press - 8 Apr 2026 |
Keywords
- Large Language Model
- Abstract Concept
- Linear Probing
Fingerprint
Dive into the research topics of 'Distinguishing Concreteness Differences in LLM Representations via Linear Probing'. Together they form a unique fingerprint.Projects
- 1 Active
-
eTALK: [eTALK] embodied Thought for Abstract Language Knowledge
Cangelosi, A. (PI)
26/06/24 → 25/06/29
Project: Research
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver