Deep Research
-
Moonshot AI Unveils Kimi-Researcher, Its First Reinforcement-Learning-Trained Agent, Outperforming Claude and Gemini in Internal Tests
Moonshot AI has launched Kimi-Researcher, its first autonomous AI agent, currently in beta. Built on end-to-end agentic reinforcement learning (RL), Kimi-Researcher surpasses leading models such as Claude 4 Opus and Gemini 2.5 Pro in Moonshot AI's internal tests, demonstrating strong autonomy and zero-structure adaptability. The agent manages research tasks independently, works through conflicting information, and prioritizes accurate results. Moonshot AI plans to open-source key components to accelerate progress in agentic RL.
-
ChatGPT Unveils Agentic Features to Revolutionize Complex Research Execution
OpenAI has launched Deep Research, an agentic feature that extends ChatGPT’s multi-stage analytical workflows by autonomously synthesizing vetted online sources. Aimed at complex domains such as financial modeling and supply chain risk, it reached 26.6% accuracy on a set of 3,000 cross-disciplinary questions (vs. 9.4% for competitors) and 72.57% on the GAIA benchmark. Although it outperforms prior models in rigor and documentation, it still struggles to resolve conflicting data and to reason probabilistically. It is initially available to Pro-tier users and is not offered in EU jurisdictions, which raises compliance concerns for sectors that require calibrated confidence thresholds.