AI infrastructure

  • Huawei Ascend 384 Supernode Debuts, Outperforming NVIDIA and AMD’s Previous Generation

    At WAIC 2025, Huawei debuted the Atlas 900 A3 SuperPoD, built on the Ascend 384 Supernode. The supernode uses advanced bus technology to provide high-bandwidth, low-latency interconnection among 384 NPUs, addressing communication bottlenecks in large AI clusters. Huawei’s CloudMatrix 384 (CM384) AI cluster, built around Ascend chips, delivers 300 PFLOPS of dense BF16 compute, reportedly surpassing NVIDIA’s GB200 NVL72. Analysts suggest Huawei’s scaled solution surpasses current market offerings from NVIDIA and AMD.

    July 26, 2025
  • OpenAI and Oracle Partner on Stargate AI Data Center

    OpenAI is significantly expanding its AI infrastructure through a massive deal with Oracle to build new data centers across the US as part of its Stargate initiative. The centers will require 4.5 gigawatts of power, house over two million chips, and support wider access to advanced AI. The deal advances OpenAI’s pledge to invest heavily in US AI infrastructure, potentially exceeding $500 billion, and is projected to create over 100,000 jobs. Collaborations also include SoftBank and Microsoft, highlighting the extensive industrial effort behind AI development.

    July 22, 2025
  • CoreWeave Announces Intention to Offer $1.5 Billion of Senior Notes

    CoreWeave (Nasdaq: CRWV) announced a private offering of $1.5 billion in senior notes due 2031, guaranteed by its subsidiaries. The AI hyperscaler intends to use the proceeds to repay debt and cover expenses, bolstering its position in the expanding AI infrastructure market. The offering targets qualified institutional buyers and non-U.S. persons, adhering to securities regulations. CoreWeave provides cloud solutions for accelerated computing; it cautions investors about forward-looking statements and advises them to do their own due diligence and consult its SEC filings.

    July 21, 2025
  • Nvidia’s Near-Death Experiences: Remembering the Two Close Calls

    Nvidia CEO Jensen Huang, despite the company’s $4 trillion market cap, feels constant pressure, stating Nvidia is “30 days from going out of business.” He attributes this to the rapid pace of technological obsolescence in the chip and AI industries. Nvidia faced near-bankruptcy twice: once due to the Dreamcast chipset failure and again from a graphics card design flaw leading to recalls. Huang’s commitment to innovation fuels his drive to guide Nvidia far into the future.

    July 21, 2025
  • Microsoft Announces Second Major Layoff This Year, Potentially Affecting 9,000 Roles

    Microsoft announced a second major round of job cuts this year, potentially impacting up to 9,000 roles across departments, geographies, and levels. This move aims to streamline operations and reduce costs amidst increasing AI infrastructure expenses and broader industry trends of workforce adjustments. These cuts represent less than 4% of Microsoft’s global workforce.

    July 2, 2025
  • IREN Names Anthony Lewis Chief Capital Officer, Taps Him for Capital Markets Strategy

    IREN Limited has appointed Anthony Lewis as its new Chief Capital Officer to bolster its capital markets strategy and fuel aggressive growth in the AI infrastructure sector. Lewis brings over two decades of financial markets experience, including a significant tenure at Macquarie Group, to shape IREN’s capital structure and financing initiatives.

    July 1, 2025
  • DataCanvas Releases Alaya NeW Cloud 2.0 Intelligent Computing Cloud to Empower Diverse Industries

    DataCanvas launched Alaya NeW Cloud 2.0, a next-gen intelligent computing platform featuring serverless architecture and reinforcement learning. It offers on-demand AI infrastructure, targeting compute-intensive applications with a user-friendly toolchain. By offering a “pay-as-you-go” model and streamlining AI development, the platform aims to reduce costs, provide efficient performance, and make AI computing accessible to developers worldwide. The platform also introduces the world’s first reinforcement learning intelligent computing service, AgentiCTRL, for improved efficiency.

    June 16, 2025
  • Broadcom Unveils Tomahawk 6: First 102.4 Tbps Superchip, Capable of Driving 100,000 GPUs

    Broadcom unveiled Tomahawk 6, the world’s first 102.4 Tbps data center switch chip. Designed for AI, it doubles existing switch performance, supporting up to 100,000 GPUs and boosting GPU cluster utilization. The chip offers a flexible architecture with features such as 100G/200G SerDes interfaces and CPO support. Tomahawk 6 promises to reduce AI training costs, with further energy efficiency improvements planned by the end of 2025.

    June 3, 2025
  • NVIDIA and WiMi HoloAccel Spearhead AI Robotics Industrialization Through GPU and Full-Stack AI Integration

    At Computex 2025, NVIDIA CEO Jensen Huang repositioned the company as an AI infrastructure architect, unveiling advancements in consumer GPUs, data center solutions, industrial metaverse tools, and robotics. Key highlights included the Blackwell architecture for AI training/inference and Isaac GR00T N1.5, a foundation model for humanoid robots. Huang emphasized physical AI as the next industrial revolution, citing partnerships with firms like Boston Dynamics as well as Morgan Stanley’s projection of a $60T humanoid market by 2050. NVIDIA aims to dominate through its full-stack ecosystem, bridging AI models, autonomous systems, and simulation tools to power smart factories and cities.

    May 26, 2025
  • Kunpeng & Ascend Developer Conference 2025 Successfully Concludes in Beijing

    The Kunpeng & Ascend Developer Conference 2025 in Beijing showcased Huawei’s AI infrastructure innovations under the theme “Innovation Through Collaboration.” Key releases included Kunpeng’s modular AI+ Solution Suite, open-source platform openFuyao for hybrid computing, and Ascend’s SuperNode Architecture, boosting cluster training efficiency. Upgraded tools like the MindSpeed RL Suite and CATLASS Operator Library aim to accelerate AI development. Huawei reported 665,000 global developers and 8,800 partners, with real-world deployments like RAG Solution 1.0 yielding 40% latency reductions. Emphasizing ecosystem growth and open standards, Huawei positions itself as a foundational force in industrial AI transformation through scalable tooling and infrastructure.

    May 24, 2025