JPG Artifacts & Noise Remover (V2)

This is a lightweight model designed to remove JPEG compression artifacts, digital noise, and slight blur from images. Ideally suited for anime/illustration art, but works reasonably well on photos.

Version 2 Update:

New Architecture: Switched to a NAFNet-based UNet (State-of-the-Art blocks for restoration).
Larger Dataset: Trained on ~40,000 high-quality images from Danbooru2024.
Improved Training: Added Perceptual Loss, HFEN, and blur degradation handling.

Which version to pick? V2 generally restores better, but it is also cautious and may decide an image is already clean. If V2 feels too conservative for a specific image, try the V1 weights (best_ema_15E.safetensors) and keep the result you like more.

Force processing (ComfyUI node): If V2 decides an image is already clean, enable force_process and raise force_noise_std (default 0.02; 0.05 for stubborn cases). This injects tiny noise so the model always runs, while the final output is still blended with the original.

Examples

How to use in ComfyUI

The model is designed to work with the JPG & Noise Remover ComfyUI Node.

Install the Node: Follow instructions in the GitHub Repository.
Download Weights: Download the .pt or .safetensors v2 file from this repository.
Place Weights: Put the file in ComfyUI/models/artifacts_remover/.
Select Model: Select the new weight file in the node settings. Ensure base_ch is set to 64.

Training Details (V2)

The goal was to create a restorer that not only removes noise but preserves fine structural details without "plastic" smoothing.

Dataset

Trained on 40,000 images from the Danbooru2024 dataset (anime/illustration style).

Architecture: NAFNet-based UNet

The model uses a U-Net structure but replaces standard convolutional blocks with NAFBlocks (Nonlinear Activation Free Blocks).

SimpleGate: Replaces complex activation functions with element-wise multiplication.
LayerNorm2d: Stabilizes training.
Simplified Channel Attention (SCA): captures global context efficiently.
Base Channels: 64

Degradation Pipeline

The model is trained on "on-the-fly" generated pairs. The degradation pipeline is more aggressive than V1:

Blur: Random downscale-upscale (probability 50%, scale down to 0.85x) to simulate soft blur.
JPEG Compression: Quality 10 - 85.
Gaussian Noise: Standard deviation 0 - 20.0.
Identity: 2% of images are left clean to teach the model not to alter good images.

Loss Function

A composite loss function was used to balance pixel accuracy and perceptual quality:

Pixel Loss: Charbonnier Loss (0.7) + MixL1SSIM (0.3).
Perceptual Loss (VGG19): Weight 0.05. Helps generating realistic textures.
HFEN (High-Frequency Error Norm): Weight 0.1. Enforces edge reconstruction.
Gradient/Edge Loss: Weight 0.2.
Identity Loss: Weight 0.02 (applied on clean images).

Training Config

Optimizer: AdamW (lr=2e-4, wd=1e-4).
Scheduler: Cosine Annealing (2000 warmup steps).
Precision: BFloat16 (AMP).
Patch Size: 288x288.
Batch Size: 4 (Accumulated to effective 8).

Limitations

Primarily trained on anime/2D art.
May struggle with extremely heavy motion blur (since it trained mostly on slight downscale blur).

Downloads last month: 65