DeepSeek Launches V3.2-Exp: Revolutionizing Long Text Processing with DSA
Chinese AI startup DeepSeek has launched an experimental model, V3.2-Exp, featuring a novel sparse attention mechanism called DeepSeek Sparse Attention (DSA). The model, available on HuggingFace and supported by vLLM, promises significant efficiency gains in training and inference on long contexts.
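For readers who want to try the model, the rough shape of running it through vLLM's offline API looks like the sketch below. This is a minimal illustration, not an official recipe: the model identifier, tensor-parallel size, and context length are assumptions, so check the HuggingFace model card and vLLM release notes before use.

```python
# Minimal sketch of serving V3.2-Exp through vLLM's offline API.
# The model id, tensor_parallel_size, and max_model_len below are
# illustrative assumptions, not values confirmed by DeepSeek or vLLM docs.
from vllm import LLM, SamplingParams

llm = LLM(
    model="deepseek-ai/DeepSeek-V3.2-Exp",  # assumed HuggingFace repo id
    tensor_parallel_size=8,                 # adjust to the local GPU count
    max_model_len=65536,                    # illustrative long-context setting
    trust_remote_code=True,
)

params = SamplingParams(temperature=0.6, max_tokens=512)
outputs = llm.generate(["Summarize the following document: ..."], params)
print(outputs[0].outputs[0].text)
```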
DSA, introduced in late September 2025, works by attending only to the parts of a long context that are most relevant to each query rather than to every token, which cuts the compute required for long-context attention. Benchmark results are promising: on Codeforces, V3.2-Exp scores 2121 points, slightly ahead of the previous V3.1-Terminus model's 2046, while on MMLU-Pro both models score an identical 85.0, suggesting that DSA preserves output quality while improving efficiency.
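To make the select-then-attend idea concrete, here is a minimal PyTorch sketch of the general pattern behind this kind of sparsity: score all past tokens cheaply, keep only the top-k most relevant ones, and run full attention over that small subset. This is a generic illustration, not DeepSeek's actual DSA implementation; the dimensions and the value of k are arbitrary.

```python
# Illustrative top-k sparse attention for a single query position.
# A generic sketch of the select-then-attend idea, not DeepSeek's DSA kernels.
import torch
import torch.nn.functional as F

def sparse_attend(query, keys, values, k=64):
    """query: (d,), keys/values: (seq_len, d). Attend to the top-k keys only."""
    seq_len, d = keys.shape
    # 1) Cheap relevance score for every past token.
    scores = keys @ query                                # (seq_len,)
    # 2) Keep only the k most relevant positions.
    k = min(k, seq_len)
    top_scores, top_idx = scores.topk(k)                 # (k,), (k,)
    # 3) Full softmax attention restricted to the selected tokens.
    weights = F.softmax(top_scores / d ** 0.5, dim=-1)   # (k,)
    return weights @ values[top_idx]                     # (d,)

# Toy usage: one query over a 4096-token context, attending to 64 tokens.
ctx, dim = 4096, 128
q = torch.randn(dim)
K = torch.randn(ctx, dim)
V = torch.randn(ctx, dim)
out = sparse_attend(q, K, V, k=64)
print(out.shape)  # torch.Size([128])
```

Because only k tokens enter the attention step, the cost per query grows with k rather than with the full context length, which is where the efficiency gain on long texts comes from.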
DeepSeek has also released open-source kernels, including TileLang implementations for research and high-performance CUDA kernels in DeepGEMM and FlashMLA, to help developers make the most of sparse attention. Reference inference code for running the model locally is available, though it needs to be adapted to the target GPU configuration and expert-parallel settings.
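The exact knobs depend on the reference implementation, but the kind of adjustment involved typically looks like the snippet below. The parameter names here are purely hypothetical and are not taken from DeepSeek's repository; the real config keys live in the repository's own config files.

```python
# Hypothetical illustration of the settings that usually need adjusting
# when running a large mixture-of-experts model locally. These names are
# NOT from DeepSeek's code; consult the repository's configs for the real keys.
local_config = {
    "tensor_parallel_size": 8,    # number of GPUs sharing each layer
    "expert_parallel_size": 8,    # how routed experts are spread across GPUs
    "max_context_length": 65536,  # longest context the local hardware can hold
    "dtype": "bfloat16",          # precision supported by the local GPUs
}
```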
With DeepSeek Sparse Attention, V3.2-Exp handles long texts markedly more efficiently while matching the output quality of its predecessor and edging ahead of it on some benchmarks. The open-source kernels released alongside the model should make it easier for developers to adopt the technique.