Scratch Pad

Thoughts on generative models, video quality, and the intersection of perception and computation.

HDR-Q: Quality-Aware HDR Video Assessment via Multimodal LLMs

Presentation slides for HDR-Q (CVPR 2026) — the first multimodal LLM for HDR video quality assessment, featuring HAPO (HDR-Aware Policy Optimization) with contrastive KL, dual-entropy regularization, and SigLIP-2 HDR-aware encoding.

View slides →
For best viewing, adjust browser zoom to fit your display. Slides auto-enter fullscreen.

Rectified-CFG++: Geometry-Aware Guidance for Rectified Flow Models

Presentation slides for Rectified-CFG++ (NeurIPS 2025), a predictor-corrector guidance method that fixes classifier-free guidance (CFG) artifacts on flow models such as Flux, SD3, and Lumina-Next, with theoretical guarantees and no extra training cost.

View slides →

Prepping for a Research Scientist, GenAI Position — A Pointer Notebook

A long revision notebook for Research Scientist / GenAI interview loops focused on image generation, perceptual quality, and video processing. It covers diffusion and flow models, transformer internals (attention variants, RoPE, KV cache), the text-to-image design space, evaluation metrics, color and HDR, classical CV, RL alignment, and the coding tier, all in one place.

Read more →