Scratch Pad

Thoughts on generative models, video quality, and the intersection of perception and computation.

HDR-Q: Quality-Aware HDR Video Assessment via Multimodal LLMs

Presentation slides for HDR-Q (CVPR 2026), the first multimodal LLM for HDR video quality assessment. It features HAPO (HDR-Aware Policy Optimization) with contrastive KL, dual-entropy regularization, and SigLIP-2 HDR-aware encoding.

View slides →
For best viewing, adjust browser zoom to fit your display. Slides auto-enter fullscreen.

Rectified-CFG++: Geometry-Aware Guidance for Rectified Flow Models

Presentation slides for Rectified-CFG++ (NeurIPS 2025), a predictor-corrector guidance method that fixes classifier-free guidance (CFG) artifacts on flow models such as Flux, SD3, and Lumina-Next, with theoretical guarantees and no extra training cost.

View slides →
For best viewing, adjust browser zoom to fit your display. Slides auto-enter fullscreen.