Hidden costs in AI deployment: Why Claude models may be 20-30% more expensive than GPT in enterprise settings
It is a well-known fact that different model families can use different tokenizers. However, there has...
Astronomer’s $93M raise underscores a new reality: Orchestration is king in AI infrastructure
Astronomer secures $93 million in Series D funding to solve the AI implementation gap through data...
Microsoft launches Phi-4-Reasoning-Plus, a small, powerful, open weights reasoning model!
The release demonstrates that with carefully curated data and training techniques, small models can deliver strong...
Salesforce takes aim at ‘jagged intelligence’ in push for more reliable AI
Salesforce unveils groundbreaking AI research tackling "jagged intelligence," introducing new benchmarks, models, and guardrails to make...
UiPath’s new orchestrator guides AI agents to follow your enterprise’s rules
UiPath's agent orchestration layer Maestro moves prompts through three layers: the agent, a human and the...
The ‘era of experience’ will unleash self-learning AI agents across the web—here’s how to prepare
AI visionaries predict an 'Era of Experience' where AI learns autonomously, and it will have important...
Qwen swings for a double with 2.5-Omni-3B model that runs on consumer PCs, laptops
The Qwen2.5-Omni-3B model is licensed for non-commercial use only under Alibaba Cloud’s Qwen Research License Agreement.
Breaking the ‘intellectual bottleneck’: How AI is computing the previously uncomputable in healthcare
How University of Texas Medical Branch is using AI to identify patients at high cardiovascular risk,...
OpenAI rolls back ChatGPT’s sycophancy and explains what went wrong
Many organizations may also begin shifting toward open-source alternatives that they can host and tune themselves.
Structify raises $4.1M seed to turn unstructured web data into enterprise-ready datasets
Brooklyn-based Structify emerges from stealth with $4.1 million in seed funding to transform how businesses prepare...
No more window switching: Mastercard’s Agent Pay transforms how enterprises use AI search
Mastercard is working with AI companies and banks to allow AI platforms and agents to facilitate...
Meta unleashes Llama API running 18x faster than OpenAI: Cerebras partnership delivers 2,600 tokens per second
Meta partners with Cerebras to launch its new Llama API, offering developers AI inference speeds up...
Meta’s first dedicated AI app is here with Llama 4 — but it’s more consumer than productivity or business oriented
This mainstream exposure will likely accelerate a shift in what people expect not just from consumer...
Tripp launches Kōkua AI as mental wellness coach across multiple platforms
Tripp launched Kōkua AI, a mental wellness guide designed to deliver real-time, personalized emotional support across...
xMEMS extends micro cooling fan-on-a-chip tech to AI data centers
xMEMS Labs, a pioneer of MEMS-based chips, announced that its innovative µCooling fan-on-a-chip tech will be...
Alibaba launches open source Qwen3 model that surpasses OpenAI o1 and DeepSeek R1
Qwen3’s open-weight release under an accessible license marks an important milestone, lowering barriers for developers and...
Ex-OpenAI CEO and power users sound alarm over AI sycophancy and flattery of users
Crucially, the turbulence also nudges many organizations to explore open-source models they can host, monitor, and...
Beyond A2A and MCP: How LOKA’s Universal Agent Identity Layer changes the game
The LOKA protocol, a proposed standard for AI agents from Carnegie Mellon University researchers, will give...
30 seconds vs. 3: The d1 reasoning framework that’s slashing AI response times
The d1 framework boosts diffusion LLMs with novel reinforcement learning, unlocking efficient, problem-solving AI possibilities.
Writer releases Palmyra X5, delivers near GPT-4.1 performance at 75% lower cost
Writer unveils Palmyra X5: The enterprise AI model that processes 1,500 pages at once, costs 75%...
Does RAG make LLMs less safe? Bloomberg research reveals hidden dangers
RAG is supposed to make enterprise AI more accurate, but it could potentially also make it...
Is your AI product actually working? How to develop the right metric system
Metrics are critical for determining AI product performance. But where to begin? Here's a framework to...
DeepSeek’s success shows why motivation is key to AI innovation
How did DeepSeek attain such cost-savings while American companies could not? Let's dive into the technical...
Liquid AI is revolutionizing LLMs to work on edge devices like smartphones with new ‘Hyena Edge’ model
Hyena Edge’s success positions Liquid AI as one of the emerging players to watch in the...
The new AI calculus: Google’s 80% cost edge vs. OpenAI’s ecosystem
Explore the Google vs OpenAI AI ecosystem battle post-o3. Deep dive into Google's huge cost advantage...
Is that really your boss calling? Jericho Security raises $15M to stop deepfake fraud that’s cost businesses $200M in 2025 alone
Pentagon-backed Jericho Security raises $15 million to combat deepfake fraud that has already cost North American...
Intel’s new CEO signals streamlining efforts but does not spell out exact layoff numbers
Lip-Bu Tan, the new CEO of Intel, sent out a blunt message to employees saying the...
Zencoder buys Machinet to challenge GitHub Copilot as AI coding assistant consolidation accelerates
Zencoder acquires Machinet to strengthen its position in the rapidly consolidating AI coding assistant market, expanding...
Ethically trained AI startup Pleias releases new small reasoning models optimized for RAG with built-in citations
Pleias emphasizes the models’ suitability for integration into search-augmented assistants, educational tools, and user support systems.
Evil Geniuses and Theta Labs launch AI chatbot based on esports mascot Meesh
Evil Geniuses, the well-known esports organization and brand, has launched its Meesh AI chatbot in a...
Google adds more AI tools to its Workspace productivity apps
Google expanded Gemini's features, adding the popular podcast-style feature Audio Overviews to the platform.
Former DeepSeeker and collaborators release new method for training reliable AI agents: RAGEN
RAGEN stands out not just as a technical contribution but as a conceptual step toward more...
Amazon’s SWE-PolyBench just exposed the dirty secret about your AI coding assistant
Amazon launches SWE-PolyBench, a groundbreaking multi-language benchmark that exposes critical limitations in AI coding assistants across...
OpenAI makes ChatGPT’s image generation available as API
Enterprises can now make Studio Ghibli-inspired images through OpenAI's API.
From friction to flow: Why Swissport scrapped its VPN maze for Cato’s SASE fabric
Swissport ditches legacy tech, deploying a global, Zero Trust SASE architecture with Cato Networks securing 26,000...
Microsoft just launched powerful AI ‘agents’ that could completely transform your workday — and challenge Google’s workplace dominance
Microsoft unveils new AI reasoning agents and Copilot features to transform workplace productivity, with Chief Product...
$42.1 million poured into startup offering energy-efficient solutions for costly and unwieldy operational data and AI workloads
The funding infusion sharpens a mission to make hyperscale analytics radically cheaper and greener at the...
More accurate coding: Researchers adapt Sequential Monte Carlo for AI-generated code
Researchers from MIT, Yale, McGill University and others found that adapting the Sequential Monte Carlo algorithm...
SWiRL: The business case for AI that thinks like your best problem-solvers
Training LLMs on trajectories of reasoning and tool use makes them superior at multi-step reasoning tasks.
Batch data processing is too slow for real-time AI: How open-source Apache Airflow 3.0 solves the challenge with event-driven data orchestration
Open-source data orchestration gets a major rewrite to help support inference and enterprise AI
A new, open source text-to-speech model called Dia has arrived to challenge ElevenLabs, OpenAI and more
With a focus on expressive quality, reproducibility, and open access, Dia adds a distinctive new voice...
Relyance AI builds ‘x-ray vision’ for company data: Cuts AI compliance time by 80% while solving trust crisis
Relyance AI's new Data Journeys platform gives enterprises unprecedented visibility into data flows, reducing AI compliance...
eSelf will bring private AI tutors to students worldwide
eSelf, a startup that focuses on conversational AI agents, is partnering with an educational group to bring private...
VentureBeat spins out GamesBeat, accelerates enterprise AI mission
VentureBeat today announced the spinout of GamesBeat as a standalone company – a strategic move that...
Watch: Google DeepMind CEO and AI Nobel winner Demis Hassabis on CBS’ ’60 Minutes’
The segment ended with a meditation on the future: a world where AI tools could transform...
Anthropic just analyzed 700,000 Claude conversations — and found its AI has a moral code of its own
Anthropic's groundbreaking study analyzes 700,000 conversations to reveal how AI assistant Claude expresses 3,307 unique values...
TBD VC unveils $35M venture fund to back Israeli deep tech startups
TBD VC, a new early-stage venture capital firm, has announced a $35 million fund to back...
Aethir launches AI Unbundled industry alliance for Web3 AI development
Aethir, a provider of decentralized GPU cloud compute, announced the launch of AI Unbundled, an industry-wide...
2027 AGI forecast maps a 24-month sprint to human-level AI
The newly published AI 2027 scenario offers a detailed 2- to 3-year forecast for the future...
Identity as the new perimeter: NOV’s approach to stopping the 79% of attacks that are malware-free
NOV’s CIO led a cyber strategy fusing Zero Trust, AI, and airtight identity controls to cut...