Add flux1-schnell-bf16-blackwell engines

Files changed (3) hide show

README.md ADDED Viewed

+# FLUX1-SCHNELL-BF16-BLACKWELL
+TensorRT-RTX engines for FLUX1-SCHNELL optimized for Blackwell architecture with BF16 quantization.
+## Specifications
+- **Model**: FLUX1-SCHNELL
+- **Base Model**: black-forest-labs/FLUX.1-schnell
+- **Architecture**: Blackwell
+- **Quantization**: BF16
+- **Batch Size**: 1 (optimized)
+- **Resolution**: 1024x1024 (optimized)
+- **Estimated Size**: 12.0 GB
+## Performance Estimates
+Based on architecture and quantization:
+- **Memory Usage**: ~12.0GB VRAM
+- **Speed**: ~2.5s (H200)
+## Files
+- `engines/`: TensorRT engine files (.plan)
+- `config.json`: Configuration metadata
+- `README.md`: This file
+## Usage
+```python
+from imageai_server.tensorrt.nvidia_sdxl_pipeline import NVIDIASDXLPipeline
+pipeline = NVIDIASDXLPipeline()
+pipeline.load_engines(
+    engine_dir="./engines",
+    framework_model_dir="./framework",
+    onnx_dir="./onnx"
+)
+pipeline.activate_engines()
+images, time_ms = pipeline.infer(
+    prompt="a beautiful landscape",
+    height=1024,
+    width=1024
+)
+```
+## Requirements
+- **GPU**: Blackwell architecture (Compute Capability 8.0+)
+- **VRAM**: 12.0GB+
+- **TensorRT-RTX**: 1.0.0.21+
+- **CUDA**: 12.0+

config.json ADDED Viewed

+{
+  "model_type": "flux1-schnell",
+  "architecture": "blackwell",
+  "quantization": "bf16",
+  "batch_size": 1,
+  "height": 1024,
+  "width": 1024,
+  "base_model": "black-forest-labs/FLUX.1-schnell",
+  "expected_engines": [
+    "clip.plan",
+    "t5.plan",
+    "transformer.plan",
+    "vae.plan"
+  ],
+  "created_at": 1754966251.0015864,
+  "tensorrt_version": "10.13.2.6",
+  "tensorrt_rtx_version": "1.0.0.21",
+  "size_estimate_gb": 12.0,
+  "build_system": "comprehensive_offline_builder"
+}

engines/ENGINES_NEEDED.txt ADDED Viewed

+Engines for flux1-schnell-bf16-blackwell need to be built.
+This is a placeholder structure. To complete:
+1. Build actual flux1-schnell engines with bf16 quantization
+2. Copy .plan files to this directory
+3. Remove this placeholder file
+Expected files:
+- clip.plan
+- t5.plan
+- transformer.plan
+- vae.plan
+Build command:
+python build_tensorrt_engines.py --single flux1-schnell blackwell bf16