Language Modelling - Search News

Tencent’s new AI technique teaches language models ‘parallel thinking’

The Parallel-R1 framework uses reinforcement learning to teach models how to explore multiple reasoning paths at once, ...

Tech Xplore on MSN

AlloyGPT: Leveraging a language model to aid alloy discovery

Additive manufacturing of alloys has enabled the creation of machine parts that meet the complex requirements needed to ...

36m on MSN

China’s DeepSeek Unveils New AI Model That Could Halve Usage Cost

DeepSeek called the model an advancement in its next-generation lineup of AI.

18h on MSN

China's DeepSeek releases 'intermediate' AI model on route to next generation

Chinese AI developer DeepSeek has released its "experimental" latest model, which it said was more efficient to train and ...

13h

Big AI firms pump money into world models as LLM advances slow

One near-term application of world models is in the entertainment industry, where they can create interactive and realistic ...

Beyond Autoregression: A New Model For Text Generation

There are also trade-offs in creativity. Because the energy critic favors low-energy (i.e., high-probability) text, the model ...

NextBigFuture

Analog in-memory Computing Attention Mechanism for Fast and Energy-efficient Large Language Models

A Nature paper describes an innovative analog in-memory computing (IMC) architecture tailored for the attention mechanism in ...

DeepSeek's new V3.2-Exp model cuts API pricing in half to less than 3 cents per 1M input tokens

MMLU-Pro holds steady at 85.0, AIME 2025 slightly improves to 89.3, while GPQA-Diamond dips from 80.7 to 79.9. Coding and agent benchmarks tell a similar story, with Codeforces ratings rising from ...

10h

Anthropic launches Claude Sonnet 4.5, claims it's the world's best coding model

Anthropic has released Claude Sonnet 4.5, a new large language model that excels at coding tasks and outperforms competitors' ...

16h

AI Data Model Protection: How To Secure Your AI Tool Against Attacks

One of the biggest risks to any AI tool is data integrity. Cybersecurity is built on the CIA triad of confidentiality, ...

The Next Platform

MythWorx Mashes Up Neuromorphic And GenAI To Take On Model Giants

There is constant chatter surrounding the promise of generative AI, agentic AI, and – eventually – artificial general ...

Quanta Magazine

To Understand AI, Watch How It Evolves

Naomi Saphra thinks that most research into language models focuses too much on the finished product. She’s mining the ...
