julianzu9612
/

RFDETR-Soccernet

+# RF-DETR SoccerNet - Complete Inference Guide
+This guide provides comprehensive instructions for using the RF-DETR SoccerNet model for soccer video analysis.
+## 🚀 Quick Start
+### Installation
+```bash
+# Clone the repository
+git clone https://huggingface.co/julianzu9612/RFDETR-Soccernet
+cd RFDETR-Soccernet
+# Install dependencies
+pip install -r requirements.txt
+# Run the example
+python example.py
+```
+### Basic Usage
+```python
+from inference import RFDETRSoccerNet
+# Initialize model (auto-detects GPU/CPU)
+model = RFDETRSoccerNet()
+# Process a single image
+df = model.process_image("your_image.jpg", confidence_threshold=0.5)
+# Process a video
+df = model.process_video("your_video.mp4", confidence_threshold=0.5)
+```
+## 📊 Output Format
+All methods return a pandas DataFrame with these columns:
+| Column | Type | Description |
+|--------|------|-------------|
+| `frame` | int | Frame number (for videos) |
+| `timestamp` | float | Time in seconds (for videos) |
+| `class_name` | str | Object class (ball, player, referee, goalkeeper) |
+| `class_id` | int | Numeric class ID (0-3) |
+| `x1, y1, x2, y2` | float | Bounding box coordinates |
+| `width, height` | float | Bounding box dimensions |
+| `confidence` | float | Detection confidence (0.0-1.0) |
+| `center_x, center_y` | float | Bounding box center coordinates |
+| `area` | float | Bounding box area in pixels² |
+## 🎯 Detection Classes
+| Class ID | Class Name | Color (BGR) | Description |
+|----------|------------|-------------|-------------|
+| 0 | ball | (0, 0, 255) | Soccer ball |
+| 1 | player | (0, 255, 0) | Field players |
+| 2 | referee | (255, 255, 0) | Referees and officials |
+| 3 | goalkeeper | (0, 255, 255) | Goalkeepers |
+## ⚙️ Configuration Options
+### Model Initialization
+```python
+# Default initialization (auto-detects CUDA/CPU)
+model = RFDETRSoccerNet()
+# Specify device explicitly
+model = RFDETRSoccerNet(device="cuda")  # or "cpu"
+# Custom model path
+model = RFDETRSoccerNet(model_path="path/to/your/checkpoint.pth")
+```
+### Image Processing
+```python
+df = model.process_image(
+    image_path="image.jpg",
+    confidence_threshold=0.5  # Only detections above 50% confidence
+)
+```
+### Video Processing
+```python
+df = model.process_video(
+    video_path="video.mp4",
+    confidence_threshold=0.5,    # Confidence threshold
+    frame_skip=1,               # Process every N frames (1 = all frames)
+    max_frames=None,            # Limit frames (None = all frames)
+    save_results=False,         # Auto-save results to files
+    output_dir=None             # Directory for saved results
+)
+```
+## 📈 Performance Optimization
+### Hardware Recommendations
+| Hardware | Expected FPS | Memory Usage | Batch Size |
+|----------|--------------|--------------|------------|
+| RTX 4070 8GB | 10-15 FPS | 6GB VRAM | 1-2 |
+| RTX 3080 10GB | 12-18 FPS | 7GB VRAM | 2-4 |
+| A100 40GB | 25-35 FPS | 8GB VRAM | 4-8 |
+| CPU (16 cores) | 2-4 FPS | 4GB RAM | 1 |
+### Speed vs Accuracy Trade-offs
+```python
+# Maximum accuracy (slower)
+df = model.process_video("video.mp4", confidence_threshold=0.3, frame_skip=1)
+# Balanced (recommended)
+df = model.process_video("video.mp4", confidence_threshold=0.5, frame_skip=2)
+# Maximum speed (less accurate)
+df = model.process_video("video.mp4", confidence_threshold=0.7, frame_skip=5)
+```
+### Batch Processing
+```python
+import os
+from pathlib import Path
+# Process multiple images
+image_dir = Path("images/")
+results = []
+for image_path in image_dir.glob("*.jpg"):
+    df = model.process_image(str(image_path))
+    df['source_file'] = image_path.name
+    results.append(df)
+# Combine all results
+all_results = pd.concat(results, ignore_index=True)
+```
+## 🎨 Visualization
+### Basic Visualization
+```python
+import cv2
+import pandas as pd
+def draw_detections(image_path, df, output_path):
+    """Draw bounding boxes on image."""
+    # Load image
+    image = cv2.imread(image_path)
+    # Colors for each class (BGR)
+    colors = {
+        'ball': (0, 0, 255),        # Red
+        'player': (0, 255, 0),      # Green
+        'referee': (255, 255, 0),   # Yellow
+        'goalkeeper': (0, 255, 255) # Cyan
+    }
+    # Draw each detection
+    for _, det in df.iterrows():
+        x1, y1, x2, y2 = int(det['x1']), int(det['y1']), int(det['x2']), int(det['y2'])
+        class_name = det['class_name']
+        confidence = det['confidence']
+        color = colors.get(class_name, (255, 255, 255))
+        # Draw bounding box
+        cv2.rectangle(image, (x1, y1), (x2, y2), color, 2)
+        # Draw label
+        label = f"{class_name}: {confidence:.2f}"
+        cv2.putText(image, label, (x1, y1-10),
+                   cv2.FONT_HERSHEY_SIMPLEX, 0.7, color, 2)
+    # Save result
+    cv2.imwrite(output_path, image)
+# Usage
+df = model.process_image("examples/sample_soccer_frame.jpg")
+draw_detections("examples/sample_soccer_frame.jpg", df, "result.jpg")
+```
+### Advanced Visualization with Matplotlib
+```python
+import matplotlib.pyplot as plt
+import matplotlib.patches as patches
+from PIL import Image
+def plot_detections(image_path, df):
+    """Plot detections using matplotlib."""
+    # Load image
+    img = Image.open(image_path)
+    # Create plot
+    fig, ax = plt.subplots(1, figsize=(12, 8))
+    ax.imshow(img)
+    # Colors for classes
+    colors = {'ball': 'red', 'player': 'green', 'referee': 'yellow', 'goalkeeper': 'cyan'}
+    # Plot each detection
+    for _, det in df.iterrows():
+        x1, y1 = det['x1'], det['y1']
+        width, height = det['width'], det['height']
+        class_name = det['class_name']
+        confidence = det['confidence']
+        # Create rectangle
+        rect = patches.Rectangle((x1, y1), width, height,
+                               linewidth=2, edgecolor=colors.get(class_name, 'white'),
+                               facecolor='none')
+        ax.add_patch(rect)
+        # Add label
+        ax.text(x1, y1-5, f"{class_name}: {confidence:.2f}",
+               color='white', fontsize=10, weight='bold',
+               bbox=dict(boxstyle='round,pad=0.3', facecolor=colors.get(class_name, 'white')))
+    ax.set_title("RF-DETR SoccerNet Detections")
+    ax.axis('off')
+    plt.tight_layout()
+    plt.show()
+# Usage
+df = model.process_image("examples/sample_soccer_frame.jpg")
+plot_detections("examples/sample_soccer_frame.jpg", df)
+```
+## ⚽ Advanced Analytics
+### Ball Possession Analysis
+```python
+# Load video detections
+df = model.process_video("match.mp4")
+# Analyze ball possession (players within 100 pixels of ball)
+possession_df = model.analyze_ball_possession(df, distance_threshold=100)
+# Results include:
+# - frame: Frame number
+# - timestamp: Time in seconds
+# - player_x, player_y: Player center coordinates
+# - ball_x, ball_y: Ball center coordinates
+# - distance_to_ball: Distance in pixels
+# - ball_confidence, player_confidence: Detection confidences
+print(f"Possession events: {len(possession_df)}")
+print(f"Average distance to ball: {possession_df['distance_to_ball'].mean():.1f} px")
+```
+### Formation Analysis
+```python
+def analyze_formation(df, frame_number):
+    """Analyze player formation for a specific frame."""
+    frame_data = df[df['frame'] == frame_number]
+    players = frame_data[frame_data['class_name'] == 'player']
+    if len(players) < 5:
+        return "Insufficient players detected"
+    # Calculate formation metrics
+    positions = players[['center_x', 'center_y']].values
+    # Find centroid
+    centroid_x = positions[:, 0].mean()
+    centroid_y = positions[:, 1].mean()
+    # Calculate spread
+    spread = np.sqrt(((positions - [centroid_x, centroid_y])**2).sum(axis=1)).mean()
+    return {
+        'frame': frame_number,
+        'num_players': len(players),
+        'centroid': (centroid_x, centroid_y),
+        'formation_spread': spread,
+        'player_positions': positions.tolist()
+    }
+# Analyze formation for frame 100
+formation = analyze_formation(df, 100)
+print(f"Formation analysis: {formation}")
+```
+### Game Statistics
+```python
+def calculate_game_stats(df):
+    """Calculate comprehensive game statistics."""
+    stats = {
+        'total_frames': df['frame'].nunique(),
+        'duration_seconds': df['timestamp'].max() - df['timestamp'].min(),
+        'total_detections': len(df),
+        'detections_per_second': len(df) / (df['timestamp'].max() - df['timestamp'].min()),
+        'class_distribution': df['class_name'].value_counts().to_dict(),
+        'average_confidence': df['confidence'].mean(),
+        'ball_detection_rate': len(df[df['class_name'] == 'ball']) / df['frame'].nunique(),
+        'players_per_frame': df[df['class_name'] == 'player'].groupby('frame').size().mean()
+    }
+    return stats
+# Calculate stats
+stats = calculate_game_stats(df)
+for key, value in stats.items():
+    print(f"{key}: {value}")
+```
+## 💾 Export and Integration
+### Export Formats
+```python
+# CSV for spreadsheet analysis
+model.save_results(df, "detections.csv", format="csv")
+# JSON for web applications
+model.save_results(df, "detections.json", format="json", include_metadata=True)
+# Parquet for big data processing
+model.save_results(df, "detections.parquet", format="parquet")
+```
+### Integration with Other Tools
+#### Pandas Integration
+```python
+# Filter high-confidence ball detections
+balls = df[(df['class_name'] == 'ball') & (df['confidence'] > 0.7)]
+# Group by frame and calculate averages
+frame_stats = df.groupby('frame').agg({
+    'confidence': 'mean',
+    'class_name': 'count'
+}).rename(columns={'class_name': 'detection_count'})
+# Time-based analysis
+df['time_bin'] = (df['timestamp'] // 10) * 10  # 10-second bins
+time_analysis = df.groupby(['time_bin', 'class_name']).size().unstack(fill_value=0)
+```
+#### OpenCV Integration
+```python
+import cv2
+def create_annotated_video(video_path, df, output_path):
+    """Create video with detection annotations."""
+    cap = cv2.VideoCapture(video_path)
+    fps = int(cap.get(cv2.CAP_PROP_FPS))
+    width = int(cap.get(cv2.CAP_PROP_FRAME_WIDTH))
+    height = int(cap.get(cv2.CAP_PROP_FRAME_HEIGHT))
+    # Create video writer
+    fourcc = cv2.VideoWriter_fourcc(*'mp4v')
+    out = cv2.VideoWriter(output_path, fourcc, fps, (width, height))
+    frame_count = 0
+    while cap.isOpened():
+        ret, frame = cap.read()
+        if not ret:
+            break
+        # Get detections for this frame
+        frame_detections = df[df['frame'] == frame_count]
+        # Draw detections (add your visualization code here)
+        # ... drawing code ...
+        out.write(frame)
+        frame_count += 1
+    cap.release()
+    out.release()
+```
+## 🔧 Troubleshooting
+### Common Issues
+#### 1. CUDA Out of Memory
+```python
+# Reduce batch size or use CPU
+model = RFDETRSoccerNet(device="cpu")
+# Or process smaller chunks
+df = model.process_video("video.mp4", max_frames=100)
+```
+#### 2. Low Detection Accuracy
+```python
+# Lower confidence threshold
+df = model.process_video("video.mp4", confidence_threshold=0.3)
+# Check image quality and lighting
+# Ensure video resolution is adequate (720p+ recommended)
+```
+#### 3. Memory Issues with Large Videos
+```python
+# Process in chunks
+chunk_size = 1000
+all_results = []
+for start_frame in range(0, total_frames, chunk_size):
+    chunk_df = model.process_video("video.mp4",
+                                  max_frames=chunk_size,
+                                  frame_skip=start_frame)
+    all_results.append(chunk_df)
+final_df = pd.concat(all_results, ignore_index=True)
+```
+### Performance Monitoring
+```python
+import time
+import psutil
+import torch
+def monitor_inference(model, video_path):
+    """Monitor inference performance."""
+    start_time = time.time()
+    start_memory = psutil.Process().memory_info().rss / 1024 / 1024  # MB
+    if torch.cuda.is_available():
+        start_gpu_memory = torch.cuda.memory_allocated() / 1024 / 1024  # MB
+    # Run inference
+    df = model.process_video(video_path, max_frames=100)
+    end_time = time.time()
+    end_memory = psutil.Process().memory_info().rss / 1024 / 1024
+    print(f"Processing time: {end_time - start_time:.2f}s")
+    print(f"Memory usage: {end_memory - start_memory:.2f} MB")
+    print(f"Frames processed: {df['frame'].nunique()}")
+    print(f"FPS: {df['frame'].nunique() / (end_time - start_time):.2f}")
+    if torch.cuda.is_available():
+        end_gpu_memory = torch.cuda.memory_allocated() / 1024 / 1024
+        print(f"GPU memory usage: {end_gpu_memory - start_gpu_memory:.2f} MB")
+# Usage
+monitor_inference(model, "test_video.mp4")
+```
+## 📚 Additional Resources
+- **Model Card**: Detailed performance metrics and training information
+- **Example Scripts**: Complete working examples in `example.py`
+- **Sample Data**: Test images and detection results in `examples/`
+- **GitHub Issues**: Report bugs and request features
+- **Documentation**: Full API reference and tutorials
+## 🏆 Best Practices
+1. **Data Quality**: Use high-quality video (720p+) with good lighting
+2. **Confidence Thresholds**: Start with 0.5, adjust based on results
+3. **Performance**: Use GPU for real-time processing, CPU for batch jobs
+4. **Memory**: Monitor memory usage for long videos
+5. **Validation**: Always validate results on sample data first
+6. **Export**: Save results in appropriate format for downstream analysis
+## 📞 Support
+For technical support, bug reports, or feature requests:
+- **Repository**: [Hugging Face Model Page](https://huggingface.co/julianzu9612/RFDETR-Soccernet)
+- **Issues**: Use the Issues tab for bug reports
+- **Community**: Join discussions in the Community tab