~ai-venturebeat | Bookmarks (565)

Hidden costs in AI deployment: Why Claude models may be 20-30% more expensive than GPT in enterprise settings

venturebeat.com

It is a well-known fact that different model families can use different tokenizers. However, there has been limited analysis on how the process of “tokenization” itself varies across these tokenizers. Do all tokenizers result in the same number of tokens for a given input text? If not, how different are the generated tokens? How significant […]

ai ai deployment ai ml and deep learning anthropic api
1
Astronomer’s $93M raise underscores a new reality: Orchestration is king in AI infrastructure

venturebeat.com

Astronomer secures $93 million in Series D funding to solve the AI implementation gap through data orchestration, helping enterprises streamline complex workflows and operationalize AI initiatives at scale.

ai business data infrastructure enterprise analytics programming & development
1
Microsoft launches Phi-4-Reasoning-Plus, a small, powerful, open weights reasoning model!

venturebeat.com

The release demonstrates that with carefully curated data and training techniques, small models can deliver strong reasoning performance.

ai ai ml and deep learning ai reasoning business conversational ai
1
Salesforce takes aim at ‘jagged intelligence’ in push for more reliable AI

venturebeat.com

Salesforce unveils groundbreaking AI research tackling "jagged intelligence," introducing new benchmarks, models, and guardrails to make enterprise AI agents more intelligent, trusted, and consistently reliable for business use.

ai automation data infrastructure enterprise analytics programming & development
1
UiPath’s new orchestrator guides AI agents to follow your enterprise’s rules

venturebeat.com

UiPath's agent orchestration layer Maestro moves prompts through three layers: the agent, a human and the robotic process automation system.

agent2agent ai ai agent ai agents ai orchestration
1
The ‘era of experience’ will unleash self-learning AI agents across the web—here’s how to prepare

venturebeat.com

AI visionaries predict an 'Era of Experience' where AI learns autonomously, and it will have important implications for application design.

agent2agent ai ai computer use ai ml and deep learning ai research
1
Qwen swings for a double with 2.5-Omni-3B model that runs on consumer PCs, laptops

venturebeat.com

The Qwen2.5-Omni-3B model is licensed for non-commercial use only under Alibaba Cloud’s Qwen Research License Agreement.

ai ai ml and deep learning alibaba business china
1
Breaking the ‘intellectual bottleneck’: How AI is computing the previously uncomputable in healthcare

venturebeat.com

How University of Texas Medical Branch is using AI to identify patients at high cardiovascular risk, flag for stroke and catch 'basic stuff.'

ai ai in healthcare ai ml and deep learning claude claude ai
1
OpenAI rolls back ChatGPT’s sycophancy and explains what went wrong

venturebeat.com

Many organizations may also begin shifting toward open-source alternatives that they can host and tune themselves.

ai ai ml and deep learning ai sycophancy chatbots chatgpt
1
Structify raises $4.1M seed to turn unstructured web data into enterprise-ready datasets

venturebeat.com

Brooklyn-based Structify emerges from stealth with $4.1 million in seed funding to transform how businesses prepare data for AI, promising to save data scientists from the task that consumes 80% of their time.

ai business data infrastructure enterprise analytics programming & development
1
No more window switching: Mastercard’s Agent Pay transforms how enterprises use AI search

venturebeat.com

Mastercard is working with AI companies and banks to allow AI platforms and agents to facilitate transactions.

agent pay agentic search agentic workflows ai ai agent
1
Meta unleashes Llama API running 18x faster than OpenAI: Cerebras partnership delivers 2,600 tokens per second

venturebeat.com

Meta partners with Cerebras to launch its new Llama API, offering developers AI inference speeds up to 18 times faster than traditional GPU solutions, challenging OpenAI and Google in the fast-growing AI services market.

ai ai apis data infrastructure enterprise analytics programming & development
1
Meta’s first dedicated AI app is here with Llama 4 — but it’s more consumer than productivity or business oriented

venturebeat.com

This mainstream exposure will likely accelerate a shift in what people expect not just from consumer apps, but from workplaces...

ai ai apps ai image generation ai images ai ml and deep learning
1
Tripp launches Kōkua AI as mental wellness coach across multiple platforms

venturebeat.com

Tripp launched Kōkua AI, a mental wellness guide designed to deliver real-time, personalized emotional support across multiple platforms.

ai business gamesbeat gaming business kōkua ai
1
xMEMS extends micro cooling fan-on-a-chip tech to AI data centers

venturebeat.com

xMEMS Labs, a pioneer of MEMS-based chips, announced that its innovative µCooling fan-on-a-chip tech will be expanded to AI data centers.

ai data centers game development gamesbeat gaming business
1
Alibaba launches open source Qwen3 model that surpasses OpenAI o1 and DeepSeek R1

venturebeat.com

Qwen3’s open-weight release under an accessible license marks an important milestone, lowering barriers for developers and organizations.

ai ai ml and deep learning alibaba alibaba qwen business
1
Ex-OpenAI CEO and power users sound alarm over AI sycophancy and flattery of users

venturebeat.com

Crucially, the turbulence also nudges many organizations to explore open-source models they can host, monitor, and fine-tune themselves.

ai ai ml and deep learning business chatbots chatgpt
1
Beyond A2A and MCP: How LOKA’s Universal Agent Identity Layer changes the game

venturebeat.com

The LOKA protocol, a proposed standard for AI agents from Carnegie Mellon University researchers, will give identities and intentions to agents.

agent2agent agentic protocols ai ai agent ai agents
1
30 seconds vs. 3: The d1 reasoning framework that’s slashing AI response times

venturebeat.com

The d1 framework boosts diffusion LLMs with novel reinforcement learning, unlocking efficient, problem-solving AI possibilities.

ai ai ml and deep learning ai research belladati business
1
Writer releases Palmyra X5, delivers near GPT-4.1 performance at 75% lower cost

venturebeat.com

Writer unveils Palmyra X5: The enterprise AI model that processes 1,500 pages at once, costs 75% less than GPT-4, and enables affordable autonomous agents for businesses seeking automation ROI.

ai automation data infrastructure enterprise analytics programming & development
1
Does RAG make LLMs less safe? Bloomberg research reveals hidden dangers

venturebeat.com

RAG is supposed to make enterprise AI more accurate, but according to new research it could also make it less safe.

ai ai guardrails ai safety bloomberg business
1
Is your AI product actually working? How to develop the right metric system

venturebeat.com

Metrics are critical for determining AI product performance. But where to begin? Here's a framework to apply across various use cases.

ai ai ml and deep learning ai research datadecisionmakers generative ai
1
DeepSeek’s success shows why motivation is key to AI innovation

venturebeat.com

How did DeepSeek attain such cost savings while American companies could not? Let's dive into the technical details.

ai ai ml and deep learning ai research datadecisionmakers deepseek
1
Liquid AI is revolutionizing LLMs to work on edge devices like smartphones with new ‘Hyena Edge’ model

venturebeat.com

Hyena Edge’s success positions Liquid AI as one of the emerging players to watch in the evolving AI model landscape.

ai ai ml and deep learning business conversational ai edge ai
1