PixelProbe

Overview

PixelProbe is a comprehensive media file corruption detection tool with a modern web interface. It helps you identify and manage corrupted video, image, and audio files across your media libraries.

Why PixelProbe?

Protect Your Media: Automatically detect corrupted files before they cause playback issues
Save Time: Batch scan entire media libraries instead of checking files individually
Prevent Data Loss: Identify failing drives by detecting corruption patterns
Professional Grade: Uses industry-standard tools (FFmpeg, ImageMagick) for accurate detection
Set and Forget: Schedule automated scans to continuously monitor your media health

Features

Media Support

Comprehensive video format support (MP4, MKV, AVI, MOV, WebM, FLV, etc.)
Image format detection (JPEG, PNG, GIF, BMP, TIFF, WebP, etc.)
Audio file validation (MP3, FLAC, WAV, AAC, OGG, etc.)
Large file support (tested with 50GB+ Blu-ray remux files)

Detection Capabilities

FFmpeg-based deep video analysis
ImageMagick and PIL image validation
Smart warning system that separates minor issues from critical corruption
Multi-stage detection with configurable thresholds
Automatic retry logic for transient failures

Scanning Features

Parallel multi-threaded scanning: Configurable worker threads (10-24 workers recommended) with thread-safe database access
Real-time progress: Live updates with ETA calculations and phase tracking
Multiple scan types: Full scan, orphan cleanup, file change detection
Scheduled automated scans: Cron expressions or simple intervals for hands-free monitoring
Smart exclusions: Configure paths and file extensions to skip
Phase-based scanning: Discovery → Database → Validation workflow
Bulk operations: Rescan multiple files, deep analysis, batch actions

Web Interface

Modern responsive design with dark/light theme support
Real-time scan progress with WebSocket updates
Advanced filtering and search capabilities
Bulk file selection and management
Mobile-optimized touch interface
Detailed file corruption reports

System Features

PostgreSQL database: Reliable ACID-compliant data storage
Redis-backed task queue: Background processing with Celery workers
Docker deployment: Multi-container architecture (web, workers, database, queue)
REST API: Comprehensive OpenAPI/Swagger documentation
Monitoring & Reports: Real-time statistics, PDF/JSON exports, complete audit trail
Performance optimized: Production-tested with millions of files

Security & Authentication

Multi-user support: Role-based access control with admin privileges
Secure password storage: Bcrypt hashing with a minimum password length of 8 characters
API token authentication: Generate and manage tokens for programmatic access
Session management: Cookie-based sessions with CSRF protection, configurable timeout
First-run setup wizard: Secure admin account creation on initial deployment
Audit logging: Complete security event tracking

Screenshots

Authentication & User Management

Login Screen

Secure login interface with:

Username/password authentication
Remember me functionality
Dark mode support
First-run setup detection

User Management

Comprehensive user administration:

Create new users with email and admin privileges
View and manage existing users
Delete user accounts (admin only)
Role-based access control

API Token Management

Programmatic access management:

Generate API tokens with descriptions
Optional expiration dates
View and revoke existing tokens
Bearer token authentication

Password Management

Secure password changes:

Current password verification
New password confirmation
Minimum 8 character requirement
Bcrypt hashing for security

Desktop Interface

Light Mode

Features visible in the interface:

Statistics Dashboard: Real-time display of 1M+ scanned files with health status
Sidebar Navigation: Quick access to Dashboard, API Documentation, Tools (Start Scan, Cleanup, Check Changes, Schedules, Exclusions), System (Stats, Reports, Build Info), and Account (User Management, API Tokens, Change Password)
File Results Table: Sortable columns for status, file path, size, type, scan date with bulk actions
Filtering Options: All Files, Corrupted Only, Warnings Only, Healthy Only
Action Buttons: Mark as Good, Rescan, Download, Export functionality

Dark Mode

Dark mode features:

High Contrast Theme: Green accent colors for better visibility in low-light environments
Full Feature Parity: All light mode features available with optimized dark color scheme
Persistent Theme: Settings toggle remembers preference across sessions
Account Section: Shows logged-in user (admin) with logout option

Mobile Interface

The mobile interface is fully responsive and touch-optimized:

Adaptive layout that works on all screen sizes
Touch-friendly buttons and controls
Collapsible sidebar navigation
Card-based design for scan results on mobile

Advanced Features

Scan Reports

Comprehensive scan reporting with history and analytics:

View all past scan operations with detailed statistics
Filter by scan type (full scan, rescan, deep scan, cleanup, file changes)
Export reports as JSON for data analysis or PDF for documentation

Scheduled Scanning

Create and manage automated scan schedules:

Support for both cron expressions and simple intervals
Multiple scan types: Normal Scan, Orphan Cleanup, File Changes
View next run times and last execution status

Quick Start

Using Docker (Recommended)

Clone the repository:

git clone https://github.com/ttlequals0/PixelProbe.git
cd PixelProbe

Configure environment variables:

cp .env.example .env

Edit .env and set required variables:

# Generate a secure secret key
python -c "import secrets; print(secrets.token_hex(32))"

# Edit .env file with your values
SECRET_KEY=your-generated-secret-key-here
MEDIA_PATH=/path/to/your/actual/media/directory
SCAN_PATHS=/media

Start the application:
```
docker-compose up -d
```
Access the web interface: Open http://localhost:5001 in your browser.

Initial Setup (IMPORTANT - First Run Only):

On first run, you must create the admin account via the setup endpoint:
```
# Create admin user with your chosen password
curl -X POST http://localhost:5001/api/auth/setup \
  -H "Content-Type: application/json" \
  -d '{"password":"YourSecurePassword123"}'
```
Or visit http://localhost:5001/login and follow the first-run setup wizard.
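
For scripted deployments, the same first-run call can be prepared from Python. The following is a minimal sketch using only the standard library; it mirrors the curl command above. The helper name `build_setup_request` is hypothetical, and the local length check simply reflects the 8-character minimum listed under Security & Authentication. The server remains the authority on password rules and on the response format.

```python
import json
import urllib.request

MIN_PASSWORD_LENGTH = 8  # minimum listed under Security & Authentication


def build_setup_request(base_url: str, password: str) -> urllib.request.Request:
    """Build (but do not send) the first-run admin setup request."""
    if len(password) < MIN_PASSWORD_LENGTH:
        raise ValueError(
            f"password must be at least {MIN_PASSWORD_LENGTH} characters"
        )
    # Same JSON body as the curl example: {"password": "..."}
    body = json.dumps({"password": password}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/api/auth/setup",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


request = build_setup_request("http://localhost:5001", "YourSecurePassword123")
print(request.get_method(), request.full_url)
```

To actually send it, pass the returned object to `urllib.request.urlopen(request)` while PixelProbe is running. Building the request separately keeps the password check and URL handling testable without a live server.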

Security Note: No default admin account exists. You must explicitly create it on first run.

Start scanning:
- After login, click "Scan All Files" to begin analyzing your media library
- Configure exclusions and schedules as needed

Docker Image Versions

PixelProbe is available on Docker Hub as ttlequals0/pixelprobe:

ttlequals0/pixelprobe:latest - Latest stable release

Requirements

Important: PixelProbe requires PostgreSQL. SQLite is no longer supported.

Quick Migration from SQLite

Backup your data:

cp /path/to/instance/pixelprobe.db /path/to/instance/pixelprobe.db.backup

Update Docker Compose - Add PostgreSQL and Redis services (only the PostgreSQL-related settings are shown here):

services:
  postgres:
    image: postgres:15-alpine
    environment:
      POSTGRES_DB: pixelprobe
      POSTGRES_USER: pixelprobe
      POSTGRES_PASSWORD: ${POSTGRES_PASSWORD}
    volumes:
      - postgres_data:/var/lib/postgresql/data

  mediachecker:
    image: ttlequals0/pixelprobe:2.4.0
    environment:
      POSTGRES_HOST: postgres
      POSTGRES_PASSWORD: ${POSTGRES_PASSWORD}
      # ... other settings

Run migration:

docker-compose up -d postgres
sleep 15

# Migrate existing SQLite data
docker run --rm \
  --network pixelprobe_pixelprobe-network \
  -v "/path/to/instance/pixelprobe.db:/app/pixelprobe.db:ro" \
  -e POSTGRES_HOST=postgres \
  -e POSTGRES_PASSWORD="$POSTGRES_PASSWORD" \
  ttlequals0/pixelprobe:2.4.0 \
  python migrate_to_postgres.py --sqlite-path /app/pixelprobe.db

For detailed migration instructions, see MIGRATION_v2.2.0.md.

Documentation

Quick Links

Docker Setup Guide - Complete Docker Compose setup with container explanations
System Architecture - Container architecture, Celery queues, and data flow
API Documentation - Complete REST API reference
Architecture Overview - Application layers and design
Performance Tuning - Optimization strategies
Developer Guide - Development setup and guidelines

API Client Examples

Python Client - Full-featured Python client with CLI
Node.js Client - JavaScript/Node.js client implementation
Bash Client - Shell script client using curl and jq

Configuration

Environment Variables

PixelProbe uses environment variables for all configuration. Copy .env.example to .env and customize:

Required Variables:

SECRET_KEY - Secure secret key for Flask sessions
MEDIA_PATH - Host path to your media files (for Docker volume mounting)

Optional Variables:

SCAN_PATHS - Comma-separated directories to monitor inside container (default: /media)
TZ - Timezone (default: UTC)
MAX_WORKERS - Parallel file scanning workers (default: 10, recommended: 10-24)
- Controls parallelism within each scan task
- Higher values = faster scans but more CPU/memory usage
- Each worker creates 1 database connection
- Total connections = 60 (main app) + MAX_WORKERS
BATCH_SIZE - Files per batch during discovery (default: 100)
CELERY_CONCURRENCY - Concurrent Celery tasks (default: 4)
- Controls how many scan tasks can run simultaneously
- Independent from MAX_WORKERS
PERIODIC_SCAN_SCHEDULE - Automated scanning schedule
CLEANUP_SCHEDULE - Automated cleanup schedule
EXCLUDED_PATHS - Paths to ignore during scanning
EXCLUDED_EXTENSIONS - File extensions to ignore

See .env.example for complete configuration options with examples.
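
The connection arithmetic in the MAX_WORKERS notes matters when sizing PostgreSQL's `max_connections`. As a rough worked example, assuming only the figures stated above (60 connections for the main app plus one per worker), and using a hypothetical helper name:

```python
APP_CONNECTIONS = 60  # main-app figure quoted in the MAX_WORKERS notes


def estimated_db_connections(max_workers: int = 10) -> int:
    """Estimate total database connections for a given MAX_WORKERS value."""
    if max_workers < 1:
        raise ValueError("max_workers must be at least 1")
    return APP_CONNECTIONS + max_workers


for workers in (10, 16, 24):
    total = estimated_db_connections(workers)
    print(f"MAX_WORKERS={workers}: about {total} connections")
```

With the default of 10 workers this gives 70 connections, and 84 at the top of the recommended range. Both fit under PostgreSQL's stock `max_connections` of 100, though other clients sharing the database reduce that headroom. Whether concurrent Celery tasks multiply the per-worker figure is not stated here, so treat the result as a lower bound.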

Multiple Scan Paths

You can configure multiple directories to scan:

Method 1: Docker Compose with Multiple Volumes

environment:
  - SCAN_PATHS=/movies,/tv-shows,/backup
volumes:
  - /mnt/movies:/movies
  - /mnt/tv-shows:/tv-shows
  - /mnt/backup:/backup

Method 2: Single Volume with Subdirectories

export MEDIA_PATH=/mnt/all-media  # Contains subdirs: movies/, tv/, backup/
# docker-compose.yml uses: SCAN_PATHS=/media/movies,/media/tv,/media/backup
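
To illustrate the comma-separated format, here is a minimal sketch of how a `SCAN_PATHS` value could be split into individual directories. This is not PixelProbe's actual parsing code: `parse_scan_paths` is a hypothetical helper, and trimming whitespace and skipping empty entries are assumptions.

```python
import os
from typing import List, Optional


def parse_scan_paths(raw: Optional[str] = None) -> List[str]:
    """Split a comma-separated SCAN_PATHS value into a list of directories."""
    if raw is None:
        # Fall back to the documented default of /media.
        raw = os.environ.get("SCAN_PATHS", "/media")
    # Assumption: surrounding whitespace and empty entries are ignored.
    return [part.strip() for part in raw.split(",") if part.strip()]


print(parse_scan_paths("/media/movies,/media/tv,/media/backup"))
```

The paths are container paths, so each one needs a matching volume mount as shown in the two methods above.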

Usage

Web Interface

Access the Dashboard: Navigate to http://localhost:5001
Start a Scan: Click "Scan All Files" to begin scanning your media directories
View Results: Results appear in the table below with corruption status
Filter Results: Use the filter buttons to show only corrupted or healthy files
File Actions:
- Rescan: Re-examine a specific file
- Download: Download the file to your local machine
Schedules: Manage automated scan schedules with multiple scan types
Exclusions: Interactive management of paths and extensions to exclude
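
As an illustration of what path and extension exclusions do, the sketch below decides whether a file would be skipped. It is not PixelProbe's implementation: `is_excluded` is a hypothetical helper, and both the directory-prefix matching for paths and the case-insensitive matching for extensions are assumptions.

```python
from pathlib import PurePosixPath
from typing import Iterable


def is_excluded(
    file_path: str,
    excluded_paths: Iterable[str],
    excluded_extensions: Iterable[str],
) -> bool:
    """Return True if the file matches an excluded extension or directory."""
    path = PurePosixPath(file_path)
    # Normalize extensions to lowercase with a single leading dot.
    extensions = {"." + ext.lower().lstrip(".") for ext in excluded_extensions}
    if path.suffix.lower() in extensions:
        return True
    for excluded in excluded_paths:
        excluded_path = PurePosixPath(excluded)
        # Match the path itself or any file below the excluded directory.
        if path == excluded_path or excluded_path in path.parents:
            return True
    return False


print(is_excluded("/media/tmp/clip.mp4", ["/media/tmp"], ["nfo"]))
```

Comparing whole path components, rather than raw string prefixes, keeps `/media/tmp` from accidentally excluding `/media/tmpfiles`.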

API Documentation

PixelProbe provides a comprehensive REST API with OpenAPI/Swagger documentation.

Interactive API Documentation

Swagger UI: Available at /api/v1/docs when logged in
OpenAPI Spec: Full API specification with request/response schemas
Try it out: Test endpoints directly from the documentation

Authentication Endpoints

GET /api/auth/status - Check authentication status
POST /api/auth/login - User login
POST /api/auth/logout - User logout
POST /api/auth/setup - First-run admin setup
GET /api/auth/users - List all users (admin only)
POST /api/auth/users - Create new user (admin only)
DELETE /api/auth/users/{id} - Delete user (admin only)
PUT /api/auth/users/{id}/password - Change user password
GET /api/auth/tokens - List user's API tokens
POST /api/auth/tokens - Create new API token
DELETE /api/auth/tokens/{id} - Revoke API token

Scanning Endpoints

GET /api/stats - Get scanning statistics
GET /api/scan-results - Get paginated scan results with filtering
POST /api/scan - Start a directory scan
POST /api/scan/parallel - Start parallel scan with Celery
GET /api/scan-status - Get current scan progress
POST /api/cancel-scan - Cancel running scan
POST /api/rescan-file - Rescan specific file
POST /api/deep-scan - Perform deep analysis

Maintenance Endpoints

POST /api/cleanup - Remove orphaned database entries
GET /api/cleanup-status - Get cleanup operation status
POST /api/file-changes - Detect file system changes
GET /api/file-changes-status - Get file changes scan status
POST /api/reset-for-rescan - Reset files for rescanning
POST /api/reset-files-by-path - Reset specific files by path

Schedule Management

GET /api/schedules - List all scan schedules
POST /api/schedules - Create new schedule
PUT /api/schedules/{id} - Update schedule
DELETE /api/schedules/{id} - Delete schedule
GET /api/schedule-types - Get available schedule types

System Configuration

GET /api/exclusions - Get path and extension exclusions
PUT /api/exclusions - Update exclusions
GET /api/ignored-errors - Get ignored error patterns
POST /api/ignored-errors - Add ignored error pattern
DELETE /api/ignored-errors/{id} - Remove error pattern

Data Export

POST /api/export - Export scan results (CSV, JSON, Excel)
GET /api/reports - List generated reports
GET /api/reports/{filename} - Download specific report

API Authentication

Session Authentication (Web UI)

// Log in with a JSON request; credentials: 'include' lets the browser keep the session cookie
fetch('/api/auth/login', {
    method: 'POST',
    headers: {'Content-Type': 'application/json'},
    body: JSON.stringify({
        username: 'admin',
        password: 'your_password'
    }),
    credentials: 'include'
});

Bearer Token Authentication (API)

import requests

# Create API token via web UI or API
headers = {
    'Authorization': 'Bearer your_api_token_here'
}

# Make authenticated API request
response = requests.get(
    'http://localhost:5001/api/stats',
    headers=headers
)

cURL Examples

# Log in and save the session cookie
curl -X POST http://localhost:5001/api/auth/login \
  -H "Content-Type: application/json" \
  -d '{"username":"admin","password":"password"}' \
  -c cookies.txt

# Use session cookie for requests
curl http://localhost:5001/api/stats -b cookies.txt

# Or use API token
curl http://localhost:5001/api/stats \
  -H "Authorization: Bearer your_api_token_here"

Command Line Usage

from media_checker import PixelProbe

checker = PixelProbe()

# Scan a single file
result = checker.scan_file('/path/to/media/file.mp4')
print(f"Corrupted: {result['is_corrupted']}")

# Scan multiple directories
results = checker.scan_directories(['/path/to/media1', '/path/to/media2'])
for result in results:
    if result['is_corrupted']:
        print(f"Corrupted file: {result['file_path']}")
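
Building on the result dictionaries shown above, a small helper can condense a scan into counts. This sketch assumes only the `is_corrupted` and `file_path` keys used in that example; `summarize_results` and the sample data are hypothetical and stand in for real `scan_directories` output.

```python
from typing import Dict, Iterable, List


def summarize_results(results: Iterable[dict]) -> Dict[str, object]:
    """Count healthy and corrupted entries and collect the corrupted paths."""
    corrupted: List[str] = []
    total = 0
    for result in results:
        total += 1
        if result["is_corrupted"]:
            corrupted.append(result["file_path"])
    return {
        "total": total,
        "healthy": total - len(corrupted),
        "corrupted": len(corrupted),
        "corrupted_paths": sorted(corrupted),
    }


# Sample data in the same shape as the results above (not real scan output).
sample = [
    {"file_path": "/media/a.mp4", "is_corrupted": False},
    {"file_path": "/media/b.mkv", "is_corrupted": True},
]
print(summarize_results(sample))
```

Because the helper accepts any iterable, it can consume results one at a time without holding a full list in memory.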

Supported File Formats

Video Formats

Common: MP4, MKV, AVI, MOV, WMV, FLV, WebM, M4V
HEVC/H.265: HEVC, H265
Professional: ProRes, MXF, DNxHD, DNxHR
Broadcast: MTS, M2TS, AVCHD
Legacy: MPG, MPEG, VOB, RM, RMVB

Image Formats

Common: JPEG, PNG, GIF, BMP, TIFF, WebP
Apple: HEIC, HEIF
RAW Formats: CR2, CR3, NEF, NRW, ARW, DNG, ORF, RW2, PEF, RAF

Audio Formats

Lossy: MP3, AAC, M4A, WMA, OGG, OGA, Opus
Lossless: FLAC, WAV, AIFF, APE, WV
High-Resolution: DSF, DFF (DSD)
Dolby/DTS: AC3, DTS

Development

Development Setup

Clone the repository:

git clone https://github.com/ttlequals0/PixelProbe.git
cd PixelProbe

Use development compose file:

docker-compose -f docker-compose.dev.yml up -d

Testing

# Install test dependencies
pip install -r requirements-test.txt

# Run all tests
pytest

# Run with coverage report
pytest --cov=pixelprobe --cov-report=html

# Run specific test categories
pytest tests/unit/           # Unit tests only
pytest tests/integration/    # Integration tests only

Contributing

Fork the repository
Create a feature branch
Make your changes
Add tests if applicable
Submit a pull request

Troubleshooting

Database Errors After Updates

If you encounter "no such table: scan_results" errors after upgrading:

# Quick fix
docker exec pixelprobe python tools/fix_database_schema.py

Common Issues

FFmpeg/ImageMagick not found:

Ensure FFmpeg and ImageMagick are installed and in PATH
On Ubuntu/Debian: sudo apt-get install ffmpeg imagemagick
On macOS: brew install ffmpeg imagemagick

Permission errors:

Ensure the application has read access to your media directories
Check file permissions and ownership

Performance issues with large libraries:

Increase MAX_WORKERS (default: 10, try 16-24 for powerful systems)
Monitor system resources during scanning
Use SSD storage for the database if possible
Adjust CELERY_CONCURRENCY if running multiple scans simultaneously

Getting Help

Check logs first: docker logs pixelprobe
Search existing issues: GitHub Issues
Create new issue: Include logs and system info

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

FFmpeg for video analysis
ImageMagick for image processing
PIL/Pillow for Python image handling
Inspired by media integrity checkers and corruption detectors

Support

For issues, questions, or contributions, please visit the GitHub repository.
