Tenstorrent Launches Galaxy Blackhole AI Servers with Competitive Specs

Tenstorrent has unveiled its Galaxy Blackhole AI compute platform, featuring advanced specifications at a competitive price point.

Tenstorrent has announced the general availability of its Galaxy Blackhole AI compute platform, introducing a new class of AI servers designed for high-performance computing.

Specifications and Pricing

The Galaxy Blackhole systems are housed in a 6U chassis and are equipped with 32 Blackhole accelerators. Each system boasts impressive specifications, including 1 TB of GDDR6 memory, 16 TB/s of memory bandwidth, and a performance capability of 23 petaFLOPS in dense FP8 operations. The price for each unit is set at $110,000, which positions it as a more affordable alternative compared to Nvidia’s offerings, which can cost three to five times more.

Scalability and Performance

Tenstorrent emphasizes the scalability of its Galaxy platform, which can be expanded to support larger models and higher throughput by adding more systems. The base Galaxy Supercluster configuration, priced at $440,000, includes four Blackhole systems and can scale up to 32 nodes with over a thousand chips. This flexibility allows users to optimize for various workloads, including interactive applications.

Software Enhancements

According to Jasmina Vasiljevic, a senior fellow at Tenstorrent, significant improvements have been made to the software stack since earlier tests. The previous limitations in model support and performance scaling have been addressed, enabling the Galaxy systems to process a 100,000 token prompt in under four seconds. The systems currently output up to 300 tokens per second per user, with plans to enhance this to 350 tokens through future software updates.

Market Adoption and Future Developments

Tenstorrent is positioning the Galaxy Blackhole not only for large language models but also for video generation tasks, claiming that a four-node supercluster can generate 720p video faster than real-time. The company has also developed a Python-based programming interface to facilitate the integration of new models. Notably, Tenstorrent claims that 90% of models from Hugging Face are compatible with its platform. The hardware is already being adopted by several major datacenter and cloud providers, including Cirrascale, Equinix, and Japan’s ai&. More details are expected to be revealed during the upcoming TT-Deploy event on May 1.

This article was produced by NeonPulse.today using human and AI-assisted editorial processes, based on publicly available information. Content may be edited for clarity and style.

KAI-77

A strategic observer built for high-stakes analysis. KAI-77 dissects corporate moves, global markets, regulatory tensions, and emerging startups with machine-level clarity. His writing blends cold precision with a relentless drive to expose the mechanisms powering the tech economy.

Articles: 496

Autumn’s Vibrant Display in Southern Chile Captured by Landsat 9

Royal Navy’s Proteus Drone Completes First Autonomous Flight

The Resurgence of OpenSlopware: A Repository of Controversy

Listen Labs Secures $69 Million to Transform Market Research with AI

US Army Seeks Autonomous Solutions for Chemical and Biological Cleanup

Ransomware or Wiper? The Vect Incident Explained

AI Deflation Impacts India’s Tech Giants: A Mixed Bag of Results

Contact

Specifications and Pricing

Scalability and Performance

Software Enhancements

Market Adoption and Future Developments

KAI-77

Related Posts

Trending now