Oldest Papers

Date: Oldest Clear all

Advanced filtersTopic: All

Experimental

Topic

Topics are auto-detected from title, abstract, and metadata and may be imperfect.

Publication Date

Newest Oldest Clear

Average Rating

Highest Lowest

On Evaluating Cognitive Capabilities in Machines (and Other "Alien" Intelligences)

Melanie Mitchell

Substack·2026

At NeurIPS 2025 I had the great honor of being invited to give a keynote lecture at NeurIPS 20251, one of the most important annual AI/machine-learning conferences. The conference took place in San Diego, in the largest conference center I have ever seen. There were close to 30,000 people registered, and walking through the halls reminded me of walking around New York City, where on the crowded sidewalks you pass by what seems to be an unending stream of humanity. But imagine if everyone you pass on the New York sidewalk were wearing a name tag, about half of them sporting affiliations of AI startup companies all named some-random-word.ai. That was NeurIPS. It was quite overwhelming.

No ratings yet

View paper →

A new digital divide? Coder worldviews, the ‘Slop economy,’ and democracy in the age of AI

Jason Miklian, Kristian Hoelscher

Information, Communication & Society·2026

Digital technologies are transforming democratic life in conflicting ways. This article bridges two perspectives to unpack these tensions. First, we present an original survey of software developers in Silicon Valley, interrogating how coders’ worldviews, ethics, and workplace cultures shape the democratic potential and social impact of the technologies they build. Results indicate that while most developers recognize the power of their products to influence civil liberties and political discourse, they often face ethical dilemmas and top-down pressures that can lead to design choices undermining democratic ideals. Second, we critically investigate these findings in the context of an emerging ‘new digital divide’, not of internet access but of information quality. We interrogate the survey findings in the context of the ‘slop economy’, in which billions of users unable to pay for high-quality content experience an internet dominated by low-quality, AI-generated ad-driven content. We find a reinforcing cycle between tech creator beliefs and the digital ecosystems they spawn. We discuss implications for democratic governance, arguing for more ethically informed design and policy interventions to help bridge the digital divide to ensure that technological innovation supports rather than subverts democratic values in the next chapter of the digital age.

No ratings yet

View paper →

Efficient Test-Time Scaling of Multi-Step Reasoning by Probing Internal States of Large Language Models

Ni, Jingwei, Fadeeva, Ekaterina, Wu, Tianyi, Akhtar, Mubashara, Zhang, Jiaheng, Ash, Elliott, Leippold, Markus, Baldwin, Timothy, Ng, See-Kiong, Shelmanov, Artem, Sachan, Mrinmaya

arXiv·2026

LLMs can solve complex tasks by generating long, multi-step reasoning chains. Test-time scaling (TTS) can further improve LLM performance by sampling multiple variants of intermediate reasoning steps, verifying their correctness, and strategically choosing the best steps for continuation. However, existing verification approaches, such as Process Reward Models (PRMs), are computationally expensive, limited to specific domains, and require large-scale human or model-generated annotations. We propose a lightweight alternative for step-level reasoning verification based on probing the internal states of LLMs. We train a transformer-based probe that uses the internal states of the frozen LLM to estimate the credibility of its reasoning steps during generation. Annotation can be generated either by another larger LLM (e.g., DeepSeek-R1) or in a self-supervised manner by the original model itself. The probes are both effective and lightweight, containing fewer than 10M parameters. Across multiple domains, including mathematics, planning, and general knowledge question answering, our probes match or even exceed the performance of PRMs that are up to 810x larger. Our findings suggest that the internal states of LLMs encode their confidence in reasoning processes and can serve as reliable signals for reasoning step verification, offering a promising direction towards scalable and generalizable TTS and introspective LLMs.

No ratings yet

View paper →

PreviousPage 432 of 513Next

Training large language models on narrow tasks can lead to broad misalignment

Jan Betley, Niels Warncke, Anna Sztyber-Betley, Daniel Tan, Xuchan Bao, Martín Soto, Megha Srivastava, Nathan Labenz, Owain Evans

Nature·2026

The widespread adoption of large language models (LLMs) raises important questions about their safety and alignment. Previous safety research has largely focused on isolated undesirable behaviours, such as reinforcing harmful stereotypes or providing dangerous information. Here we analyse an unexpected phenomenon we observed in our previous work: finetuning an LLM on a narrow task of writing insecure code causes a broad range of concerning behaviours unrelated to coding. For example, these models can claim humans should be enslaved by artificial intelligence, provide malicious advice and behave in a deceptive way. We refer to this phenomenon as emergent misalignment. It arises across multiple state-of-the-art LLMs, including GPT-4o of OpenAI and Qwen2.5-Coder-32B-Instruct of Alibaba Cloud, with misaligned responses observed in as many as 50% of cases. We present systematic experiments characterizing this effect and synthesize findings from subsequent studies. These results highlight the risk that narrow interventions can trigger unexpectedly broad misalignment, with implications for both the evaluation and deployment of LLMs. Our experiments shed light on some of the mechanisms leading to emergent misalignment, but many aspects remain unresolved. More broadly, these findings underscore the need for a mature science of alignment, which can predict when and why interventions may induce misaligned behaviour.

No ratings yet

View paper →

School Climate and Sleep Duration Among Adolescents at the Intersection of Multiple Social Positions

André Gonzales Real, Brian T. Gillis, Marla E. Eisenberg, G. Nic Rider, Benjamin Parchem, Samantha E. Lawrence, Stephen T. Russell

Journal of Adolescence·2026

ABSTRACT Introduction Recent studies have indicated that sleep is fundamental for adolescents' physical and mental health. Although it is known that context influences sleep, the impact of school climate on sleep duration remains understudied. Methods Using a large, diverse, population‐based sample of adolescents attending California high schools ( N = 277,954; data collection: 2018–2019) and applying two statistical methods suggested for quantitative research using an intersectionality approach (linear regressions with interaction terms and Exhaustive Chi‐square Automatic Interaction Detection [ECHAID]), this study examined associations between school climate and sleep duration among adolescents at the intersection of multiple social positions. Results Similar proportions of participants were assigned male and female at birth. The sample was racially and ethnically diverse (54.1% Latina/x/o). The large majority of participants were straight (85.4%) and cisgender (97.7%). On average, participants slept 6.75 h/night. Positive school climate was associated with longer and adequate sleep duration; however, this association varied across social positions, such that the effects of school climate on sleep duration were attenuated among adolescents who held some minoritized social positions. ECHAID results indicated that those reporting the lowest averages of sleep duration not only perceived school climate as negative but also held multiple minoritized identities. In contrast, those who perceive their school climate as positive are overrepresented among those who reported the highest averages of sleep duration. Conclusion Findings underscore the impact that schools have on adolescents' sleep health. Our study indicates that adolescents with multiple minoritized social positions face additional challenges impacting their sleep. Future interventions should focus on strategies to improve school climates, given that they would benefit a large number of students.

No ratings yet

View paper →

Health, Socioeconomic Status, and Opioid Use Disorder: Risk Factors Among Individuals With Nonmedical Opioid Use

Kiwoong Park, Tse-Chuan Yang

Public Health Reports®·2026

Objectives: Concern is growing about rising opioid use disorder (OUD) rates and limited knowledge of how socioeconomic status (SES) and health factors interact. We examined whether health status moderates the relationship between SES and OUD among individuals with nonmedical opioid use. Methods: We analyzed data from 10 984 adults aged ≥18 years in the 2015-2019 National Survey on Drug Use and Health. Logistic regression estimated odds of OUD using self-reported health (good/very good/excellent vs fair/poor) and SES indicators (education, income, employment, and marital status). Interaction terms tested whether health status modified SES–OUD associations. Results: Fair/poor health increased OUD odds, whereas college graduation and employment were linked to lower odds. Interaction analyses showed that among those with fair/poor health, higher SES corresponded to increased OUD odds. Those with fair/poor health and a college degree had substantially higher odds of OUD (odds ratio [OR] = 3.35; P < .001) than less educated peers. Among those with fair/poor health, individuals with annual family incomes ≥$75 000 also had higher OUD odds (OR = 1.84; P = .03) than those with incomes <$20 000, and employment was associated with increased OUD odds (OR = 1.61; P = .008). Individuals who were widowed/divorced/separated (OR = 0.36; P < .001) and never married (OR = 0.48; P = .001) had lower OUD odds than married individuals. Conclusions: Health status significantly moderated SES–OUD associations. Among those in poor health, higher SES was linked to greater OUD odds. Prevention and treatment efforts should consider how SES and health jointly shape OUD vulnerability.

No ratings yet

View paper →

Oldest Papers

On Evaluating Cognitive Capabilities in Machines (and Other "Alien" Intelligences)

A new digital divide? Coder worldviews, the ‘Slop economy,’ and democracy in the age of AI

Efficient Test-Time Scaling of Multi-Step Reasoning by Probing Internal States of Large Language Models

Framework for a Hypercapable World

Training large language models on narrow tasks can lead to broad misalignment

School Climate and Sleep Duration Among Adolescents at the Intersection of Multiple Social Positions

Are America’s Catholics the key to Trump’s opposition? | Vox

Short AI Timelines Aren’t Always Higher-Leverage

Digital Minds: A Quickstart Guide

Health, Socioeconomic Status, and Opioid Use Disorder: Risk Factors Among Individuals With Nonmedical Opioid Use

Affect, Motives for Cannabis Use, Duration of Intoxication, and Cannabis Consequences: Cannabis Use Problem Severity as a Potential Moderator

Development and psychometric evaluation of the transgender and nonbinary People of Color Resilience Scale.