DeepSeek Gives 75% Low cost on its Reasoning Mannequin; New R2 Mannequin to Launch Earlier than Might

DeepSeek, the Chinese language AI startup, introduced on Wednesday that it’s going to supply reductions for its API platform throughout non-peak hours – from 16:30 – 00:30 daily.

Throughout this off-peak interval, the associated fee for DeepSeek-V3 is decreased by 50% to $0.035 per million tokens for enter (cache hit), $0.135 for enter (cache miss), and $0.550 for output.

In the meantime, DeepSeek-R1 sees a 75% discount, with costs dropping to $0.035 per million tokens for enter (cache hit), $0.135 for enter (cache miss), and $0.550 for output.

Compared, OpenAI’s o1 reasoning mannequin prices $60 for output and $15 for 1 million enter API tokens.

That stated, DeepSeek’s API just lately encountered a collection of issues. At one level, its standing indicated that it skilled downtime for 10 steady days. On February 6, it was additionally reported that DeepSeek quickly suspended API top-ups for builders.

DeepSeek to Launch R2 Mannequin Earlier than Might

On Tuesday, Reuters reported that DeepSeek plans to launch its subsequent reasoning mannequin, the DeepSeek R2, ‘as early as attainable’. The corporate initially deliberate to launch it in early Might however is now contemplating an earlier timeline.

The mannequin is claimed to supply ‘higher coding’ and motive in languages past English.

The competitors in China’s AI ecosystem is heating up, and just lately, Alibaba launched a preview of the Qwen QwQ-Max reasoning mannequin, and the corporate additionally dedicated to a $52 billion funding in AI infrastructure over the subsequent three years.

Moreover its plans to launch new fashions, DeepSeek is eager to open-sourcing its applied sciences. The startup introduced an open-source week, the place it’ll launch 5 new open supply repositories.

On Wednesday, the corporate introduced its third launch known as DeepGEMM, an FP8 GEMM library optimised for dense and Combination of Consultants (MoE) computations. The library is claimed to ship greater than 1350 FP8 TFLOPS on NVIDIA Hopper GPUs.

Just lately, the startup launched its DeepSeek-R1 and DeepSeek-V3 fashions, making a storm throughout the AI ecosystem.

These fashions provided state-of-the-art efficiency whereas being skilled and deployed at a fraction of the price of their opponents whereas additionally being accessible as open supply.

The put up DeepSeek Gives 75% Low cost on its Reasoning Mannequin; New R2 Mannequin to Launch Earlier than Might appeared first on Analytics India Journal.

DeepSeek Gives 75% Low cost on its Reasoning Mannequin; New R2 Mannequin to Launch Earlier than Might

DeepSeek to Launch R2 Mannequin Earlier than Might

Latest stories

CMS Uses Machine Learning to Fully Reconstruct LHC Collisions

LANL: AI Accelerates Elucidation of Nuclear Forces with Explosive Neutron...

PNNL: Integrating AI into Biological Research

Rick Stevens on the Genesis Mission and the Future of...

Inside the DOE’s 26 AI Challenges for Genesis Mission

You might also like...

CMS Uses Machine Learning to Fully Reconstruct LHC Collisions

LANL: AI Accelerates Elucidation of Nuclear Forces with Explosive Neutron Star Data

PNNL: Integrating AI into Biological Research