Skip to main content

DeepSeek to Launch V3 with Advanced Modelling

Munazza Shaheen
1 minute read
DeepSeek and Tsinghua University Partner to Advance Self-Improving AI Models
Image: DeepSeek and Tsinghua University Partner to Advance Self-Improving AI Models

In collaboration with Tsinghua University, DeepSeek is developing large language models (LLMs) based on new methods to enable better and faster results. According to the researchers from the university, Deepseek has developed a technique that combines generative reward modelling (GRM) and self-principled critique tuning. 

The company announced that the new DeepSeek-GRM model outperformed existing models, having  “achieved competitive performance” with strong public reward models. Reward modelling is a process that guides an LLM about human preferences. The company also intends to make GRM models open source. 

DeepSeek was reported by Reuters to be releasing its R2 model this month. However, there is no confirmation from the Chinese-based start-up. 

DeepSeek rocked the AI community with its cost-efficient R1 model. It was founded in 2023 by Liang Wenfeng. Last month, the AI startup upgraded its V3 model, named DeepSeek-V3-0324. The company claims that the model will be offering

“enhanced reasoning capabilities, optimised front-end web development and upgraded Chinese writing proficiency”.

In February, DeepSeek open-sourced five of its code repositories. This initiative allowed the developers and reviewers to contribute to its software development. The start-up envisions  “sincere progress with full transparency”.

As AI technology is continuously evolving and there is cut-throat competition between the emerging AI companies, including OpenAI and Anthropic, time will decide who will gain a bigger share of the market. It will depend on speed, accuracy, diversity, and real-time information. Most importantly, the performance and price of the models will determine the global trust and acceptance of the AI chatbots. 

Share

Pick your channel

Spotted an error?Report a correction →

About the Author

Munazza Shaheen
Munazza ShaheenReviewedScore 50
@munazzaWriter

Munazza Shaheen is an AI and technology researcher at TECHi with a deep interest in machine learning, automation, and emerging tech trends. Her work focuses on exploring the impact of artificial intelligence on industries, ethical AI development, and future innovations. She actively follows advancements in deep learning, robotics, and AI-driven solutions, contributing insights into how technology is shaping the world.

Comments