AIs

How Sakana AI’s new evolutionary algorithm builds powerful AI models without expensive retraining

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now A new evolutionary technique from Japan-based AI lab Sakana AI enables developers to augment the capabilities of AI models without costly training and fine-tuning processes. The technique, called Model…

Moonshot AI’s Kimi K2 outperforms GPT-4 in key benchmarks — and it’s free

ellonjohns3 months ago015 mins

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now Moonshot AI, the Chinese artificial intelligence startup behind the popular Kimi chatbot, released an open-source language model on Friday that directly challenges proprietary systems from OpenAI and Anthropic with…

Sakana AI’s TreeQuest: Deploy multi-model teams that outperform individual LLMs by 30%

ellonjohns3 months ago012 mins

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now Japanese AI lab Sakana AI has introduced a new technique that allows multiple large language models (LLMs) to cooperate on a single task, effectively creating a “dream team” of…

Between utopia and collapse: Navigating AI’s murky middle future

ellonjohns3 months ago020 mins

Join the event trusted by enterprise leaders for nearly two decades. VB Transform brings together the people building real enterprise AI strategy. Learn more In the blog post The Gentle Singularity, OpenAI CEO Sam Altman painted a vision of the near future where AI quietly and benevolently transforms human life. There will be no sharp break,…

AI’s Struggle to Read Analogue Clocks May Have Deeper Significance

ellonjohns4 months ago019 mins

A new paper from researchers in China and Spain finds that even advanced multimodal AI models such as GPT-4.1 struggle to tell the time from images of analog clocks. Small visual changes in the clocks can cause major interpretation errors, and fine-tuning only helps with familiar examples. The results raise concerns about the reliability of…

How Patronus AI’s Judge-Image is Shaping the Future of Multimodal AI Evaluation

ellonjohns5 months ago011 mins

Multimodal AI is transforming the field of artificial intelligence by combining different types of data, such as text, images, video, and audio, to provide a deeper understanding of information. This approach is similar to how humans process the world around them using multiple senses. For example, AI can examine medical images in healthcare while considering…

DeepSeek jolts AI industry: Why AI’s next leap may not come from more data, but more compute at inference

ellonjohns6 months ago013 mins

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More The AI landscape continues to evolve at a rapid pace, with recent developments challenging established paradigms. Early in 2025, Chinese AI lab DeepSeek unveiled a new model that sent shockwaves through the AI industry and resulted…

Emergence AI’s new system automatically creates AI agents rapidly in realtime based on the work at hand

ellonjohns6 months ago013 mins

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Another day, another announcement about AI agents. Hailed by various market research reports as the big tech trend in 2025 — especially in the enterprise — it seems we can’t go more than 12 hours or…

Contextual AI’s new AI model crushes GPT-4o in accuracy — here’s why it matters

ellonjohns7 months ago011 mins

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Contextual AI unveiled its grounded language model (GLM) today, claiming it delivers the highest factual accuracy in the industry by outperforming leading AI systems from Google, Anthropic and OpenAI on a key benchmark for truthfulness. The…

Anthropic’s Claude 3.7 Sonnet takes aim at OpenAI and DeepSeek in AI’s next big battle

ellonjohns7 months ago012 mins

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Anthropic just fired a warning shot at OpenAI, DeepSeek and the entire AI industry with the launch of Claude 3.7 Sonnet, a model that gives users unprecedented control over how much time an AI spends “thinking”…

Highlights

Power Tips #145: EIS applications for EV batteries

ExpressVPN review 2025: Fast speeds and a low learning curve

AI system learns from many types of scientific information and runs experiments to discover new materials

Track, Prioritize & Win: The Complete GEO Playbook For AI Search Visibility

Category Collection