News

Additionally, the model’s hallucination rate has been reduced, contributing to more reliable and consistent output.
LMArena, a popular benchmark for large language models, has been accused of giving preferential treatment to AIs made by big ...
In particular, that marathon refactoring claim reportedly comes from Rakuten, a Japanese tech services conglomerate that ...
Qwen 2.5 Coder/Max is currently the top open-source model for coding, with the highest HumanEval (~70–72%), LiveCodeBench (70 ...
The Bengaluru startup noted that Sarvam-M sets a new benchmark for models of its size in Indian languages, as well as in math ...
Founded in July 2023 by Vivek Raghavan and Pratyush Kumar, Sarvam aims to make Generative AI accessible at scale in India. In ...
Anthropic this week unveiled its latest large language model (LLM), which can act as both a chatbot and an AI assistant. Its special sauce -- coding -- seems ...
Sarvam AI claims that the advanced Sarvam-M model outperforms Meta's LLaMA-4 Scout on most benchmarks and is comparable to ...
M, a 24-billion-parameter hybrid language model boasting strong performance in math, programming, and Indian languages.
Despite criticism over whether the model is “good enough” to compete globally, Sarvam-M’s launch has significantly raised the profile of Indian efforts in the AI space. The model is now publicly ...
DeepSeek has gone viral. Chinese AI lab DeepSeek broke into the mainstream consciousness this week after its chatbot app rose ...
Researchers identified two consistent failure modes in LLM reasoning: overcomplication and overlooking. In the ...