النزاعات

الذكاء الاصطناعي

أشباه الموصلات

TSMC Samsung Intel SMIC ASML HBM SK Hynix Micron Arm

مراكز بيانات الذكاء الاصطناعي

Hyperscaler capex Power demand Nvidia (GPUs)AMD Stargate Cooling & water

مختبرات الذكاء الاصطناعي

OpenAI Anthropic Google DeepMind Meta AI xAI Mistral DeepSeek Alibaba (Qwen)

إصدارات النماذج وقدراتها

التقييمات والاختبارات المرجعية

الوكلاء واستخدام الأدوات

السلامة والمواءمة

المفتوح مقابل المغلق

المعايير والبروتوكولات

الكم والمواد

التقنية الحيوية والأحياء التركيبية

الشركات الناشئة

ML engineering press

حسب الانحياز · 1 قراءات عبر هذه النسخة

MarkTechPost · United States · MiniMax ships M3, a Chinese open-weight model claiming frontier coding at one-twentieth the attention cost

Technical writeup of M3's MiniMax Sparse Attention (MSA), which selects relevant key-value blocks to cut per-token compute to one-twentieth at 1M-token context, with native multimodal input and computer use for agentic coding.

“MSA cuts per-token compute to one-twentieth at 1M-token context, with over 9x faster prefill and 15x faster decoding than the prior generation.”

المصدر ↗