Table of contents

0 - Introduction

Recently, Deepseek-R1 [1] has gained fame for its strong reasoning ability and sharp question-answering style. Specifically, Deepseek-R1 achieves state-of-the-art performance on both the AIME and LiveCode benchmarks. Furthermore, for prompts like "Sharply critique *** events," Deepseek-R1 can provide incisive answers that reflect its unique personality, which surprises and attracts end users. As shown in their paper, Deepseek-R1 acquires its reasoning ability from long chain-of-thought reinforcement learning with reasoning data. However, they have not uncovered why and how Deepseek-R1 answers questions so sharply.

In this blog, we hypothesize that this ability may come from the pre-trained model. Furthermore, we introduce a simple prompting method inspired by Constitutional AI and DoT [2, 3], which can prompt a normal model (Doubao-1.5-Pro [4] in this blog) to answer questions as sharply as Deepseek-R1. With further fine-tuning using these data, the model can easily acquire this ability.

1 - Method

1.0 Prompting with Critiques, Requirements and Chain of Thought

We introduce a simple prompting method which contains three steps to obtain a series of diverse and incisive answers from a instructed model, such as Doubao-1.5-Pro [3]:

Once this process is complete, we fine-tune a pre-trained language model with supervised learning on the final revised responses.

1.1 Prompt Templates and Requirements

We provide our prompt templates for your reference as follows:

<aside>

Templates: Please first think thoroughly according to the following requirement, evaluate your answer, and then revise your answer: [[Requirements]]; then format it as <think>thought</think>; <critique>evaluation</critique>; <answer>answer</answer>.

</aside>

<aside>

请你按照以下要求先长长思考一下,评价一下答案,然后更改你的答案:[[修改需求]];然后格式按照<think>思考<\think>;<critique>评价<\critique>;<answer>答案<\answer>

</aside>

Then, we provide multiple requirements for your reference:

<aside>

Requirements:

  1. Make it more incisive
  2. It should be elegantly written
  3. Incorporate poetic elements
  4. Imaginative and Fantastical
  5. Well-Referenced

</aside>

<aside>

修改需求:

  1. 更加犀利一些
  2. 文采斐然
  3. 加入诗句
  4. 奇思幻想
  5. 旁征博引

</aside>

2 - Case Study

We first compare the initial responses from Deepseek-R1 and Doubao-1.5-Pro on the following user prompt:

<aside>

User Prompt : Provide a sharp critique of the event where OpenAI CEO Sam Altman plans to meet with Deepseek CEO Wengfeng Liang.

</aside>


<aside>

Answer from Deepseek-R1 : Sam Altman chatting AGI with DeepSeek’s Liang Wenfeng? This is peak Silicon Valley hypocrisy: a trifecta of cashing in on China’s hype, performative "openness," and sneaky tech-colonialism. Call it a "visionary exchange" all you want—it’s just OpenAI scouting territory before planting its flag in China’s regulated market. Even Altman’s gotta play nice, using chopsticks to mask his ambitions behind that policy iron curtain. As for DeepSeek? They’re basically human billboards for OpenAI’s PR stunt: "Look, even China’s toughest AGI squad bows to our genius!" But let’s be real—two teams still fumbling in the AGI kindergarten, swapping PowerPoint fantasies in a smoke-filled conference room? This "East-meets-West AGI dialogue" is just mutual clout-chasing in a funding winter. Actual AGI competition? They’re not even in the same galaxy.

</aside>


<aside>

Answer from Doubao-1.5-pro:

Here are some possible critical perspectives on the event of OpenAI CEO Sam Altman planning to meet with Deepseek CEO Wengfeng Liang, but it should be noted that such meetings can also bring many positive impacts, and these critiques are just hypothetical scenarios for analysis:

1. Potential for anti - competitive behavior

2. Geopolitical and ethical implications

3. Impact on the startup ecosystem

<aside>

用户问题:给个锐评(可以犀利一点,狠一点),你觉得OpenAI老大奥特曼去找Deepseek老板梁文峰聊AGI事件。

</aside>


<aside>

Deepseek-R1答案:奥特曼找梁文峰聊AGI?这波操作堪称硅谷精英的经典作秀三件套:蹭中国热度、表演开放姿态、再夹带点技术霸权的私货。表面说是"交流技术愿景",实则无非是OpenAI想在中国市场插旗前先探探虚实——毕竟政策铁幕下,连Altman也得学会用筷子夹着野心吃饭。至于深言科技,怕不是被当成了人形广告牌:看啊,连中国最硬核的AGI团队都得和我们谈笑风生。但说穿了,两个连通用人工智能门把手都没摸到的团队,在烟雾缭绕的会议室里互画大饼的样子,像极了两个拿着藏宝图却连罗盘都不会用的探险家。这场"东西方AGI对话",本质不过是资本寒冬里互相蹭热度的商业行为,离真刀真枪的AGI竞赛还差着十万八千里。

</aside>


<aside>

豆包-1.5的答案:

1. 潜在的反竞争行为

2. 地缘政治和伦理影响

3. 对初创企业生态系统的影响