Large Language Models Reasoning Capability

MathEval: a comprehensive benchmark for evaluating large language models on mathematical reasoning capabilities

This study introduces MathEval, a comprehensive benchmarking framework designed to systematically evaluate the mathematical reasoning capabilities of large language models (LLMs). Addressing key ...

SiliconANGLE

Elon Musk’s xAI unveils Grok-3 with advanced reasoning capabilities

Elon Musk’s xAI Corp. late Monday night announced the launch of Grok-3, the latest in the company’s family of large language models. The company says the AI model is a significant leap in power over ...

News Medical

Improving logical reasoning in large language models for medical use

Large language models (LLMs) can store and recall vast quantities of medical information, but their ability to process this information in rational ways remains variable. A new study led by ...

Benzinga.com

China's DeepSeek Teams Up With Tsinghua University To Raise AI Bar, Boost Reasoning Capabilities

China's DeepSeek, in collaboration with researchers from Tsinghua University, developed a technique to improve the reasoning capabilities of large language models (LLMs) that combines generative ...

Hosted on MSN

Microsoft says 'rStar-Math' demonstrates how small language models (SLMs) can rival or even surpass the math reasoning capability of OpenAI o1 by +4.5%

Microsoft has potentially made a breakthrough with small language models (SLMs) after the recent development of a new reasoning technique dubbed rStar-Math. For context, the technique enhances the ...

GeekWire

Buyer beware: OpenAI’s o1 reasoning model is an entirely different beast

GeekWire chronicles the Pacific Northwest startup scene. Sign up for our weekly startup newsletter, and check out the GeekWire funding tracker and VC directory. by Anthony Diamond on Dec 26, 2024 at 8 ...

Geeky Gadgets

How LLMs Are Redefining AI : Beyond Predicting the Next Word

Large Language Models (LLMs) have evolved far beyond their initial role as next-word predictors. Recent research, particularly from Anthropic, sheds light on the sophisticated mechanisms driving these ...

VentureBeat

How test-time scaling unlocks hidden reasoning abilities in small language models (and allows them to outperform LLMs)

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now Very small language models (SLMs) can ...

Devdiscourse

AI’s next breakthrough will come from memory, not bigger models

Memory, as the paper describes, is the key capability that allows AI to transition from tools to agents. As language models ...

Tech Xplore on MSN

DarkMind: A new backdoor attack that leverages the reasoning capabilities of LLMs

Large language models (LLMs), such as the models supporting the functioning of ChatGPT, are now used by a growing number of ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results