#language-model

[ follow ]
#ai
fromInfoQ
10 months ago
Artificial intelligence

MiniMax Releases M1: A 456B Hybrid-Attention Model for Long-Context Reasoning and Software Tasks

Data science
fromTheregister
3 weeks ago

PrismML debuts 1-bit LLM in bid to free AI from the cloud

PrismML's Bonsai 8B is a 1-bit language model that outperforms larger models, enhancing AI efficiency for mobile applications.
fromInfoQ
10 months ago
Artificial intelligence

MiniMax Releases M1: A 456B Hybrid-Attention Model for Long-Context Reasoning and Software Tasks

fromFuturism
9 months ago

OpenAI Is Reportedly About to Release GPT-5

"Along with the new reporting, several users have GPT-5 being tested in the wild, noted, in a sign that developers are putting the finishing touches on the LLM before a public release."
Artificial intelligence
fromMedium
9 months ago

Alibaba-Backed Moonshot Unveils Kimi K2: Open-Source AI Model Outperforms ChatGPT and Claude in...

Moonshot's new open-source language model, Kimi K2, outperforms competitors like GPT-4.1 and Claude Opus 4 in coding tasks, optimized for software development.
Artificial intelligence
Artificial intelligence
fromInfoQ
9 months ago

Mistral Voxtral is an Open-Weights Competitor to OpenAI Whisper and Other ASR Tools

Mistral's Voxtral integrates advanced speech recognition and language understanding, offering deployment flexibility with openly available model weights.
Artificial intelligence
fromInfoQ
10 months ago

Microsoft Introduces Mu: A Lightweight On-Device Language Model for Windows Settings

Microsoft's Mu model enhances local processing for real-time system control, reducing cloud dependency.
#dolphin-communication
fromInfoQ
11 months ago

Mistral Unveils Medium 3: Enterprise-Ready Language Model

The transition to Mistral Medium 3 allows enterprises to benefit from strong performance and flexible deployment while managing costs effectively with competitive pricing.
Artificial intelligence
[ Load more ]