
Modeling

SEA-LION v4: Multimodal Language Modeling for Southeast Asia
AI Singapore (AISG) has released SEA-LION v4, an open-source multimodal language model developed in collaboration with Google and based on the Gemma 3 (27B) architecture. The model is designed to support Southeast Asian languages, including those with limited digital resources, and provides both text and image understanding capabilities. SEA-LION v4 uses a commercially permissive license…

Mistral AI Releases Devstral 2507 for Code-Centric Language Modeling
Mistral AI, in collaboration with All Hands AI, has released updated versions of its developer-focused large language models under the Devstral 2507 label. The release includes two models—Devstral Small 1.1 and Devstral Medium 2507—designed to support agent-based code reasoning, program synthesis, and structured task execution across large software repositories. These models are optimized for performance…

This AI paper from DeepSeek-AI Explores How DeepSeek-V3 Delivers High-Performance Language Modeling by Minimizing Hardware Overhead and Maximizing Computational Efficiency
The growth in developing and deploying large language models (LLMs) is closely tied to architectural innovations, large-scale datasets, and hardware improvements. Models like DeepSeek-V3, GPT-4o, Claude 3.5 Sonnet, and LLaMA-3 have demonstrated how scaling enhances reasoning and dialogue capabilities. However, as their performance increases, so do computing, memory, and communication bandwidth demands, placing substantial strain…

3 Questions: Modeling adversarial intelligence to exploit AI’s security vulnerabilities
If you’ve watched cartoons like Tom and Jerry, you’ll recognize a common theme: An elusive target avoids his formidable adversary. This game of “cat-and-mouse” — whether literal or otherwise — involves pursuing something that ever-so-narrowly escapes you at each try. In a similar way, evading persistent hackers is a continuous challenge for cybersecurity teams. Keeping…

Modeling Extremely Large Images with xT
As computer vision researchers, we believe that every pixel can tell a story. However, there seems to be a writer’s block settling into the field when it comes to dealing with large images. Large images are no longer rare—the cameras we carry in our pockets and those orbiting our planet snap pictures so big and…