Tag: #alignment

Articles related to alignment

technology

Preprint Claims Language Models Undergo 'Alignment Transition' Around 3.5 Billion Parameters

An unreviewed arXiv preprint argues that language models trade truth for reasoning below ~3.5B parameters and improve on both above that size; code and a dashboard have been released.

#ai, #alignment, #language-models, #arxiv