Newest Papers

Validating LLMs in social science: Epistemic threats and emerging norms

Meera Desai, Dallas Card, Abigail Z. Jacobs

arXiv·2026

Large language models (LLMs) are reshaping social science methodology. Researchers increasingly prompt language models to generate quantitative measurements of social concepts, for example labeling data or simulating survey responses. Yet LLMs pose methodological challenges including bias, hallucination, and brittleness across contexts, with unclear threats to validity. Standard practices and norms for addressing these challenges are still emerging. We collect and systematically analyze validation practices in a comprehensive corpus of papers from eight flagship social science journals that use LLMs as measurement instruments. We find that LLM-generated measurements frequently play a central role in empirical analyses, yet validation practices are inconsistent and limited. We outline complementary strategies for more robust validation, pointing toward better norms and standards around the use of LLMs in social science.

No ratings yet

Internal Pluralism and the Limits of Pairwise Comparisons

Bailey Flanigan, Michelle Si

arXiv·2026

Local pairwise comparisons are a standard tool for learning how people want decision rules to work, e.g., in participatory design or alignment. However, their use builds in two strong assumptions: that local comparisons are sufficient evidence about how a person wants an automated decision rule to behave, and that people can always answer those comparisons decisively. We investigate how these assumptions may be compromised under internal pluralism: the idea that an individual evaluates decision rules according to multiple authoritative priorities about how the rule should behave. We provide a formal model of such pluralistic preferences over decision rules, which then lets us identify two distinct failures of forced local pairwise comparison data. First, priorities such as proportionality, egalitarianism, and equal treatment are inherently global: what they imply in one case can depend on what happens elsewhere, so local comparisons may fail to capture them. Second, even when priorities are representable locally, tension between strongly-held priorities can generate internal conflict, producing potentially costly behavioral distortions when comparisons are forced. We then use our model to investigate the alternative -- allowing people to report indecision -- and our findings suggest that doing so can considerably reduce the number of queries needed to learn preferences accurately. We conclude by describing how our model points toward preference-learning methods that elicit these priorities directly, yielding more faithful and interpretable accounts of what people value.

No ratings yet

How To Write About Film

sam bodrojan

Substack·2026

an introduction

★ 5.0 (1)

A Derivation Of The Transformer Architecture

Brandon Sandhu

Google Drive·2026

The paper develops an intuitive, mathematical understanding of tokenization, embeddings, queries, keys, values, self-attention, multi-head attention, MLPs, residual connections, and backpropagation, with the aim of making these concepts more accessible without sacrificing mathematical rigor. Prerequisites are basic linear algebra, multivariable calculus, probability theory, and some information theory. Note: Positional encodings are intentionally omitted to simplify the presentation and focus on understanding the core architecture, rather than constructing a fully functional Transformer.

No ratings yet

Position: RL Researchers Need to Distinguish Between Solving Simulators and Using Simulators as a Proxy

Matthew Vandergrift, Esraa Elelimy, Martha White

arXiv·2026

One goal in reinforcement learning (RL) research is to understand general-purpose sequential decision-making, using benchmark simulators as a proxy for learning in deployment settings. When running experiments, however, the goal of achieving high performance in the simulator can mutate into focusing exclusively on solving the simulator. To achieve high scores, researchers may adopt solutions exclusively meant for solving simulators, rather than learning while the agent is deployed outside a simulator. Solving simulators is also worthy of investigation, but it is a fundamentally different RL research question. In this paper, we argue that RL researchers need to distinguish between two use cases of simulators: solving simulators and using simulators as a proxy for learning in deployment. We first discuss how these two use-cases are importantly different, in terms of constraints on how the agent can use the simulator, which algorithms are appropriate, and which evaluation metrics are appropriate. We then highlight several issues and misleading conclusions that can occur by not making the distinction between these two settings clear, supported with examples and simple experiments. This work is a call to the community to begin clearly distinguishing how they are using simulators in their work, hopefully sparking further discussion on which empirical practices work best in each setting.

No ratings yet

Forecasting With LLMs: Improved Generalization Through Feature Steering

Humzah Merchant, Bradford Levy

arXiv·2026

Successful forecasting involves identifying patterns between historical and future states of the world which generalize to future observations. We apply LLMs to a variety of forecasting tasks and inspect their internal states using sparse autoencoders to understand whether they appear to rely on time-specific pieces of knowledge versus generalizable patterns. Our analyses identify features associated with both time-aware reasoning and look-ahead-biased reasoning. We then apply the LLMs to an entirely different domain and intervene on these features. We find that amplifying time-awareness features substantially reduces look-ahead bias on forecasting prompts while preserving general reasoning performance. In contrast, steering the candidate look-ahead-bias features does not produce an effect. These results suggest that interpretable temporal features can be used to causally shift LLMs toward more historically grounded reasoning.

No ratings yet

The perpetual present-tense

All you need is PostgreSQL

Ageism is not just about age: Introducing the concept of generalized discrimination experiences.

everyone is a monster to someone

Where to Find the Colors Your Screen Can’t Show You – Ryan Moulton's Articles

Does anything I write matter anymore?