DeepSeek V4 arrives with a 1 million token window, targets US rivals, and promises to shake up the global artificial intelligence race.

Written by Fabio Lucas Carvalho | Published on 25/04/2026 at 00:02 | Updated on 02/05/2026 at 13:40

DeepSeek V4 raises the stakes in the global artificial intelligence race by combining a 1 million token context window with gains in coding, reasoning, and agent-oriented tasks, along with a strategy built on lower costs, domestic hardware, and open-source models to pressure competitors in the United States.

DeepSeek unveiled preview versions of its V4 AI model, opening a new stage in the global artificial intelligence race and directly targeting American platforms at a time of accelerating industry growth. The Chinese company is betting on a 1 million token context window, gains in coding, reasoning, and agent-oriented tasks, and a strategy focused on reducing operational costs.

The launch comes in a week marked by advances from US rivals and by White House accusations that China is copying American AI systems on a large scale. Against this backdrop, DeepSeek seeks to expand its footprint with the V4 Flash and V4 Pro series, which arrive with architectural updates, optimization improvements, and a declared focus on efficiency.

DeepSeek bets on a 1 million token context window

One of the main elements of the V4 model is the so-called Hybrid Attention Architecture, presented as a way to improve context retention in long conversations. The technology also seeks to reduce memory loss in prolonged interactions, a relevant point for more complex uses of artificial intelligence.

Efficiency becomes a central axis of the strategy

DeepSeek keeps efficiency as one of its main differentiators in the competition with US rivals. The trillion-parameter system uses a Mixture of Experts approach, which activates only a fraction of the parameters for each task.

This design reduces inference costs compared with traditional dense models, in which all parameters are typically activated on every request. The choice supports the company’s attempt to compete on performance without relying on as expensive an operational structure.

The pressure for efficiency is growing because AI systems are becoming increasingly expensive to operate. DeepSeek tries to position itself as an alternative in a market where performance, cost, and scale have become decisive factors for companies and developers.
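To make the Mixture of Experts idea concrete, here is a minimal sketch of generic top-k routing in plain Python. It is not DeepSeek's implementation, whose routing details are not described here; the expert count, the value of k, the gate scores, and every function name below are hypothetical choices made for illustration only.

```python
import math

def softmax(scores):
    """Convert raw gate scores into probabilities that sum to 1."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def route_top_k(gate_scores, k):
    """Return (expert_index, weight) pairs for the k highest-scoring experts.

    Weights are renormalized so the selected experts' weights sum to 1.
    """
    probs = softmax(gate_scores)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    return [(i, probs[i] / norm) for i in top]

def moe_forward(x, experts, gate_scores, k):
    """Run only the selected experts and blend their outputs.

    Experts that are not selected are never called, which is where
    the compute saving comes from.
    """
    out = 0.0
    for idx, weight in route_top_k(gate_scores, k):
        out += weight * experts[idx](x)
    return out

def active_fraction(num_experts, k):
    """Share of experts that do work for a single input."""
    return k / num_experts

# Hypothetical toy setup: 8 experts, 2 active per input.
NUM_EXPERTS, TOP_K = 8, 2
# Each toy "expert" just scales its input by a different factor.
experts = [lambda x, s=i + 1: s * x for i in range(NUM_EXPERTS)]
# In a real model these scores come from a learned gating network.
gate_scores = [0.1, 2.0, -1.0, 0.3, 1.5, 0.0, -0.5, 0.2]

selected = route_top_k(gate_scores, TOP_K)
output = moe_forward(3.0, experts, gate_scores, TOP_K)
fraction = active_fraction(NUM_EXPERTS, TOP_K)
```

In this toy setup only two of the eight experts run for the input, so a quarter of the expert capacity is exercised per request. A dense model corresponds to `k` equal to the number of experts.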

Chinese chips move to the center of the race

DeepSeek’s models were also designed to run on domestic hardware, which raises the importance of Chinese infrastructure to the company’s progress. Costs are expected to fall further when clusters equipped with Huawei Technologies Co.’s Ascend 950 chips come into operation later this year.

This change could reduce dependence on US chip manufacturers and strengthen China’s AI infrastructure. The strategy emerges amid the competition for computing capacity, considered essential for training, operating, and scaling more advanced models.

Market reaction was swift after the announcement. Shares of Semiconductor Manufacturing International Corp. and Hua Hong Semiconductor rose, while shares of competing AI companies declined.

The move suggests investors are betting on growing demand for chips made in China. DeepSeek, however, stated that service capacity for the V4 Pro series remains limited by computing resource constraints.

Negotiations indicate expansion plans

The company is also in talks with Tencent Holdings Ltd. and Alibaba Group Holding Ltd. for its first round of funding. The negotiations signal infrastructure expansion plans, at a time when computational capacity limits part of the V4 Pro’s offering.

The launch of the V4 version follows the R1 model, which stirred the artificial intelligence market and prompted a re-evaluation of investments in cutting-edge systems. DeepSeek stated that R1 delivered competitive performance at a fraction of the cost of leading American models.

Since then, the debate over investments has regained momentum. Technology companies in the United States are expected to invest approximately US$650 billion in 2026 in AI infrastructure and data centers, attempting to balance performance gains and long-term costs.

DeepSeek increases pressure on closed models

DeepSeek states that version 4 deepens the strategy initiated with R1, with advancements in scalability and efficiency. The company also continues to position open-source models as alternatives to closed systems, attracting developers and companies seeking more control over their tools.

The launch, however, occurs under scrutiny. American authorities accused DeepSeek of using restricted chips, while Anthropic alleged misuse of its Claude system.

DeepSeek did not disclose training costs or hardware details for version 4. Nevertheless, V4 intensifies the competition over lower costs, scalable performance, and hardware flexibility, points that place the Chinese company at the center of the next phase of the global artificial intelligence race.

With information from Interesting Engineering
