~ai-venturebeat | Bookmarks (565)
-
Hidden costs in AI deployment: Why Claude models may be 20-30% more expensive than GPT in enterprise settings
It is a well-known fact that different model families can use different tokenizers. However, there has...
-
Astronomer’s $93M raise underscores a new reality: Orchestration is king in AI infrastructure
Astronomer secures $93 million in Series D funding to solve the AI implementation gap through data...
-
Microsoft launches Phi-4-Reasoning-Plus, a small, powerful, open weights reasoning model!
The release demonstrates that with carefully curated data and training techniques, small models can deliver strong...
-
Salesforce takes aim at ‘jagged intelligence’ in push for more reliable AI
Salesforce unveils groundbreaking AI research tackling "jagged intelligence," introducing new benchmarks, models, and guardrails to make...
-
UiPath’s new orchestrator guides AI agents to follow your enterprise’s rules
UiPath's agent orchestration layer Maestro moves prompts through three layers: the agent, a human and the...
-
The ‘era of experience’ will unleash self-learning AI agents across the web—here’s how to prepare
AI visionaries predict an 'Era of Experience' where AI learns autonomously, and it will have important...
-
Qwen swings for a double with 2.5-Omni-3B model that runs on consumer PCs, laptops
The Qwen2.5-Omni-3B model is licensed for non-commercial use only under Alibaba Cloud’s Qwen Research License Agreement.
-
Breaking the ‘intellectual bottleneck’: How AI is computing the previously uncomputable in healthcare
How University of Texas Medical Branch is using AI to identify patients at high cardiovascular risk,...
-
OpenAI rolls back ChatGPT’s sycophancy and explains what went wrong
Many organizations may also begin shifting toward open-source alternatives that they can host and tune themselves.
-
Structify raises $4.1M seed to turn unstructured web data into enterprise-ready datasets
Brooklyn-based Structify emerges from stealth with $4.1 million in seed funding to transform how businesses prepare...
-
No more window switching: Mastercard’s Agent Pay transforms how enterprises use AI search
Mastercard is working with AI companies and banks to allow AI platforms and agents to facilitate...
-
Meta unleashes Llama API running 18x faster than OpenAI: Cerebras partnership delivers 2,600 tokens per second
Meta partners with Cerebras to launch its new Llama API, offering developers AI inference speeds up...
-
Meta’s first dedicated AI app is here with Llama 4 — but it’s more consumer than productivity or business oriented
This mainstream exposure will likely accelerate a shift in what people expect not just from consumer...
-
Tripp launches Kōkua AI as mental wellness coach across multiple platforms
Tripp launched Kōkua AI, a mental wellness guide designed to deliver real-time, personalized emotional support across...
-
xMEMS extends micro cooling fan-on-a-chip tech to AI data centers
xMEMS Labs, a pioneer of MEMS-based chips, announced that its innovative µCooling fan-on-a-chip tech will be...
-
Alibaba launches open source Qwen3 model that surpasses OpenAI o1 and DeepSeek R1
Qwen3’s open-weight release under an accessible license marks an important milestone, lowering barriers for developers and...
-
Ex-OpenAI CEO and power users sound alarm over AI sycophancy and flattery of users
Crucially, the turbulence also nudges many organizations to explore open-source models they can host, monitor, and...
-
Beyond A2A and MCP: How LOKA’s Universal Agent Identity Layer changes the game
The LOKA protocol, a proposed standard for AI agents from Carnegie Mellon University researchers, will give...
-
30 seconds vs. 3: The d1 reasoning framework that’s slashing AI response times
The d1 framework boosts diffusion LLMs with novel reinforcement learning, unlocking efficient, problem-solving AI possibilities.
-
Writer releases Palmyra X5, delivers near GPT-4.1 performance at 75% lower cost
Writer unveils Palmyra X5: The enterprise AI model that processes 1,500 pages at once, costs 75%...
-
Does RAG make LLMs less safe? Bloomberg research reveals hidden dangers
RAG is supposed to make enterprise AI more accurate, but it could potentially also make it...
-
Is your AI product actually working? How to develop the right metric system
Metrics are critical for determining AI product performance. But where to begin? Here's a framework to...
-
DeepSeek’s success shows why motivation is key to AI innovation
How did DeepSeek attain such cost savings while American companies could not? Let's dive into the technical...
-
Liquid AI is revolutionizing LLMs to work on edge devices like smartphones with new ‘Hyena Edge’ model
Hyena Edge’s success positions Liquid AI as one of the emerging players to watch in the...