Deep Research

Markets

Moonshot AI Unveils First Self-Reinforcement Learning Agent, Outperforming OpenAI and Gemini

Moonshot AI launched Kimi-Researcher, its first autonomous AI agent, currently in beta. Built on end-to-end agentic RL, Kimi-Researcher surpasses leading models like Claude 4 Opus and Gemini 2.5 Pro in internal tests, demonstrating strong autonomy and zero-structure adaptability. The agent independently manages research tasks, navigates conflicting information, and prioritizes accurate results. Moonshot AI plans to open-source key components to further accelerate advancements in agentic RL.

2025年6月22日
AGI

ChatGPT Unveils Agentic Features to Revolutionize Complex Research Execution

OpenAI launched Deep Research, an agentic AI feature enhancing ChatGPT’s multi-stage analytical workflows by autonomously synthesizing vetted online sources. Operational in complex domains like financial modeling and supply chain risk, it achieved 26.6% problem-solving accuracy across 3,000 cross-disciplinary questions (vs. 9.4% for competitors) and 72.57% on the GAIA benchmark. While outperforming prior models in rigor and documentation, limitations persist in resolving conflicting data and probabilistic reasoning. Initially available to Pro-tier users, deployment excludes EU jurisdictions, raising compliance concerns for sectors requiring calibrated confidence thresholds.

2025年5月18日