GPU CardsPre-order

YHPCIe 5.0x16

YH002 Mezzanine Module 96GB

YH YH002 is the flagship next-generation Mezzanine module on RISC-V architecture with dual systolic arrays. Features 96 GB HBM3e (1200 GB/s).

Each server contains 8 YH002 modules and 4 switch interconnect chips, delivering 12.8 TB/s aggregate bandwidth via full interconnection of 8 TP16 cards.

This significantly exceeds the YH001 Server and is the key competitive advantage for distributed training with high-intensity data exchange.

Performance: 1024 TFLOPS FP32, 4096 TFLOPS FP/BF16, 8192 TOPS INT8. Blocked FP8 support.

Complete CUDA independence: native PyTorch and TensorFlow. 8U form factor.

Application scenarios

Large-scale LLM training (100B–500B+ parameters) with 12.8 TB/s interconnect.
Building AI super-clusters with TP16 topology, no CUDA.
High-load inference with maximum VRAM (768 GB/server).
Flagship AI data centers requiring complete independence from Western ecosystems.

Product overview

Purpose-built accelerator for enterprise AI workloads

Designed for dense inference, model adaptation, and private AI infrastructure where predictable availability, localized supply, and software compatibility are critical.

Architecture

TPU Архитектура

Memory & Performance

High-bandwidth memory profile for AI inference and training.

VRAM capacity96

Memory typeHBM3e

Memory bandwidth1200

Interconnect typeYHLink

Interconnect speed1200

Architecture

Compute architecture and software execution model.

ArchitectureTPU Архитектура

Compute units-

Power & Thermal

Data center integration requirements.

Thermal design power-

CoolingПассивное

Form factorMezzanine Module

Pixel Rate-

Texture Rate-

Benchmarks

Peak theoretical compute for common AI precisions.

FP64

INT4 TOPS 2048

FP32

128

FP16

TF32

BF16 Tensor

512

FP8 Tensor

1024

INT8 Tensor

1024

Compatibility

Interfaces, frameworks, and deployment environment.

PCIe interfacePCIe 5.0x16

Video encoding-

Video decoding-

Physical Dimensions

Card dimensions for server platforms.

Slots-

Length- mm

Height- mm

Width- mm

Pricing

On request

Volume pricing available for cluster deployments and pilot batches.

Documentation