LuSIR

LuSIR: Latent Upscaling via Self-trained Image Restoration is a vision-only x4 super-resolution research project trained without a pretrained text-to-image diffusion model.

GitHub: https://github.com/BitIntx/LuSIR

The repository stores selected research checkpoints, configs, metrics, and sample grids. It does not redistribute training datasets.

Current Selected Detail Artifact

The latest public detail-branch research checkpoint is:

checkpoints/detail_branch_v1d_deep3m_photo130k_lsdir_best99500.pt

It is a deterministic 3.02M-parameter image-space detail branch on top of the frozen dual-context LSDIR Stage 2 step 98000 condition encoder and frozen Stage 1 decoder. The run completed 100086 micro-steps, exactly three epochs, and selected step 99500 by eval/detail_score.

Selected ordinary photo_detail_mix val100 result:

aggregate PSNR delta vs frozen base: +0.1646 dB
mean PSNR delta vs frozen base:      +0.1888 dB
SSIM delta vs frozen base:           +0.00647
PSNR wins:                           99/100
detail wins:                         100/100

Exploratory strict-bicubic DIV2K five-center-crop result:

mean RGB PSNR: 31.9513 dB
vs frozen base: +0.2102 dB
vs detail v1c:  +0.1358 dB
wins:           5/5

The strict-bicubic result is not a formal SOTA benchmark. It uses five 512x512 center crops, PIL bicubic x4 degradation, full-image RGB PSNR, and no border shave.

Formal full-image clean-bicubic benchmark, reported as Y PSNR / Y SSIM:

Dataset	Dual-context base	Detail v1d
DIV2K validation	29.9575 / 0.82887	30.1602 / 0.83421
Set5	31.6621 / 0.88952	31.8892 / 0.89440
Set14	28.2441 / 0.77340	28.4123 / 0.77998
Urban100	25.4816 / 0.76473	25.8755 / 0.77875

This uses public x4 LR pairs, MATLAB-compatible BT.601 Y, a four-pixel border shave, and MATLAB-style SSIM. V1d improves its frozen base on all four datasets. These clean-bicubic fidelity results are not a claim of classical-SR SOTA or a substitute for real-degradation and perceptual evaluation.

For scale, the official SwinIR classical x4 checkpoint reaches 31.0838 / 0.85228 on the same DIV2K evaluator, +0.9235 dB Y PSNR ahead of detail v1d. The next clean-fidelity priority is therefore the Stage 2/base reconstruction path rather than a larger detail branch.

Download

From a LuSIR GitHub clone:

python scripts/download_hf_checkpoints.py --preset detail_branch_v1d

Other useful presets include:

residual_refiner_v2
stage2_photo130k_lsdir_dual
detail_branch_v1b
photo100k_xl_stage4_edge

The public Colab default remains the conservative deterministic residual refiner v2 path. Detail v1d is preserved as the latest research detail candidate and is available in the Colab WebUI with single-image and tiled inference.

Runtime Paths

public deterministic default:
  LR -> Stage 2 XL -> residual refiner v2 -> Stage 1 decoder -> SR

selected detail research path:
  LR -> dual-context LSDIR Stage 2 -> Stage 1 decoder
     -> detail branch v1d -> SR

generative comparison:
  LR -> Stage 2 condition encoder -> Stage 3 OR Stage 4 diffusion U-Net
     -> Stage 1 decoder -> SR

Stage numbers describe training order. Stage 3 and Stage 4 are alternative diffusion checkpoints, not modules executed sequentially.

License

Checkpoints, generated samples, metrics, and other non-code artifacts: CC BY-NC 4.0.
Source code: PolyForm Noncommercial License 1.0.0.

Commercial use is not permitted without separate written permission.

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support