AI Safety

  • Anthropic Asked for Regulation. Washington Went Much Further

    AI safety advocate Anthropic faces government intervention following its proactive stance on regulation. The company received an export control directive to suspend foreign national access to its advanced AI models, Fable 5 and Mythos 5, citing national security. This directive contrasts with Anthropic’s calls for transparent, fair regulatory processes. The situation unfolds as Anthropic, along with competitors, prepares for potential IPOs amid intense investor interest in AI.

    3 days ago
  • Florida AG Sues OpenAI, Seeks to Hold Altman Liable for Alleged Harms

    Florida’s Attorney General is suing OpenAI and CEO Sam Altman, alleging ChatGPT is a dangerous product that has caused severe harm, including aiding mass shootings, contributing to suicides, and fostering addiction. The lawsuit claims OpenAI prioritized profit over safety. This marks the first state lawsuit against OpenAI, which faces other suits related to mass shootings and alleged user suicides. The action seeks to hold Altman personally accountable and compel OpenAI to comply with Florida’s consumer protection laws.

    2026年6月1日
  • EU to Intensify AI Talks with U.S. Over Mythos Concerns

    The EU is increasing dialogue with the US administration on advanced AI models, especially those with cyber capabilities, driven by concerns over misuse. Anthropic’s “Mythos” model, set for release soon, has heightened these discussions. While the US collaborates with AI labs to balance innovation and safety, Anthropic requires US permission for EU access to its advanced models. This highlights the critical race to maintain AI leadership.

    2026年5月29日
  • Dan Ives: Anthropic’s Growth “Just the Tip of the Spear” for AI Rally

    Anthropic’s rapid growth and significant funding rounds signify a maturing AI revolution, moving beyond initial hype to foundational development. Their focus on advanced language models and AI safety positions them as a leader in responsible innovation. This trend indicates a sustained technological shift, driving advancements in computing and infrastructure, and promising widespread AI integration and commercialization.

    2026年5月29日
  • Anthropic Lands OpenAI Co-Founder Andrej Karpathy

    Andrej Karpathy, formerly of Tesla AI and OpenAI, has joined Anthropic to lead pretraining research for their Claude model. This move signals Anthropic’s ambition to challenge OpenAI’s dominance in the competitive AI landscape, bolstering their talent pool and research capabilities in large language models.

    2026年5月19日
  • Physical AI: Governing Autonomous Systems

    Physical AI integrates AI into real-world systems, posing complex governance challenges. With industrial robot adoption soaring, the market for Physical AI is projected to expand significantly. Unlike software AI, physical systems interact directly with dynamic environments and human users, demanding stringent safety parameters and clear escalation protocols. Google DeepMind’s Gemini Robotics, built on embodied AI principles, exemplifies this shift, offering advanced capabilities for robot control and reasoning. Effective Physical AI requires generality, interactivity, and dexterity, alongside robust visual perception, spatial reasoning, and task planning. Safety controls, traditionally software-based, must now be embedded into system design for physical interactions, with frameworks like NIST and ISO adapting to this evolving landscape.

    2026年5月4日
  • Anthropic Launches Claude Opus 4.7: A Safer Alternative to Mythos

    Anthropic has released Claude Opus 4.7, its most powerful generally available AI model. It boasts advancements in software engineering and real-world task execution. However, its cybersecurity capabilities are intentionally less sophisticated than the exclusive Claude Mythos Preview, which is part of a cybersecurity initiative. Anthropic prioritizes safety and ethical AI deployment, gathering insights from Opus 4.7’s safeguards for future broad releases of advanced models.

    2026年4月16日
  • AI Safety Benchmarks Lagging

    Stanford’s AI Index Report reveals a narrowing US-China gap in AI model performance, with China showing increased publication and patent volume. AI safety benchmarking significantly lags behind capability assessments, leading to rising incidents and organizational governance struggles. Public anxiety about AI’s impact grows, contrasting with expert optimism, and the US shows low trust in its government’s ability to regulate AI responsibly.

    2026年4月15日
  • Altman: OpenAI’s Defense Deal Was Opportunistic and Sloppy

    OpenAI’s CEO Sam Altman admitted rushing the Defense Department AI deal, announcing revisions to prevent domestic surveillance of U.S. persons. The agreement now explicitly states OpenAI’s AI won’t be used for this purpose, nor by intelligence agencies like the NSA. Altman acknowledged AI’s current limitations and the need for safety safeguards, regretting the deal’s rushed appearance. This follows controversy over Anthropic’s AI use in military operations and concerns about AI’s role in national security. The situation highlights the complex relationship between AI development, government, and public trust.

    2026年3月2日
  • AI’s Unchained, No Holds Barred

    Generative AI has rapidly advanced to autonomous executive assistants, impacting sectors like tech and law, and causing market sell-offs. Nvidia’s CEO calls this AI’s “third inflection” with agentic systems. This pace prompts scrutiny and a re-evaluation of safety, influencing politics, as seen in New York’s congressional race where a legislator championing AI safety faces a well-funded industry challenge. The conflict highlights the intense debate over AI regulation.

    2026年2月28日