raoumer/srrescgan – Run with an API on Replicate

Intelligent image scaling to 4x resolution

Run with an API

Run this model in Node.js with one line of code:

npx create-replicate --model=raoumer/srrescgan

or set up a project from scratch

Install Replicate’s Node.js client library:

npm install replicate

Set the REPLICATE_API_TOKEN environment variable:

export REPLICATE_API_TOKEN=<paste-your-token-here>

Find your API token in your account settings.

Import and set up the client:

import Replicate from "replicate";

const replicate = new Replicate({
  auth: process.env.REPLICATE_API_TOKEN,
});

Run raoumer/srrescgan using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.

const output = await replicate.run(
  "raoumer/srrescgan:2bc3f9d57d00e92d00974650f3f2404499f16f26ac8c3e7f4876aaa0fa5a0cc6",
  {
    input: {
      image: "https://replicate.delivery/mgxm/b187805d-4458-461e-b928-dc30cc90e93e/bird.png"
    }
  }
);

// To access the file URL:
console.log(output.url()); //=> "http://example.com"

// To write the file to disk:
import { writeFile } from "node:fs/promises";
await writeFile("my-image.png", output);

To learn more, take a look at the guide on getting started with Node.js.

Install Replicate’s Python client library:

pip install replicate

Set the REPLICATE_API_TOKEN environment variable:

export REPLICATE_API_TOKEN=<paste-your-token-here>

Find your API token in your account settings.

Import the client:

import replicate

Run raoumer/srrescgan using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.

output = replicate.run(
    "raoumer/srrescgan:2bc3f9d57d00e92d00974650f3f2404499f16f26ac8c3e7f4876aaa0fa5a0cc6",
    input={
        "image": "https://replicate.delivery/mgxm/b187805d-4458-461e-b928-dc30cc90e93e/bird.png"
    }
)
print(output)

To learn more, take a look at the guide on getting started with Python.
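
The Python snippet above only prints the output. A minimal sketch of saving the upscaled image to disk follows, assuming the client returns either a file-like object with a read() method (newer client versions) or a plain URL string (older versions). The helper names save_output and upscale_to_file are hypothetical and not part of the Replicate client.

```python
import urllib.request
from pathlib import Path


def save_output(output, path):
    """Write a prediction output to `path`; return the number of bytes written."""
    if hasattr(output, "read"):
        # Assumption: file-like output exposing read() (newer client versions).
        data = output.read()
    elif isinstance(output, str):
        # Assumption: plain URL string (older client versions), so download it.
        with urllib.request.urlopen(output) as response:
            data = response.read()
    else:
        raise TypeError(f"unsupported output type: {type(output).__name__}")
    Path(path).write_bytes(data)
    return len(data)


def upscale_to_file(image_url, path, model_ref):
    """Hypothetical wrapper: run the model, then save the result to `path`."""
    # Imported lazily so save_output can be used without the client installed.
    import replicate

    output = replicate.run(model_ref, input={"image": image_url})
    return save_output(output, path)
```

Pass the full model reference shown above (owner/name:version) as model_ref. Calling upscale_to_file requires REPLICATE_API_TOKEN to be set.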

To call the HTTP API directly, set the REPLICATE_API_TOKEN environment variable:

export REPLICATE_API_TOKEN=<paste-your-token-here>

Find your API token in your account settings.

Run raoumer/srrescgan using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.

curl -s -X POST \
  -H "Authorization: Bearer $REPLICATE_API_TOKEN" \
  -H "Content-Type: application/json" \
  -H "Prefer: wait" \
  -d $'{
    "version": "2bc3f9d57d00e92d00974650f3f2404499f16f26ac8c3e7f4876aaa0fa5a0cc6",
    "input": {
      "image": "https://replicate.delivery/mgxm/b187805d-4458-461e-b928-dc30cc90e93e/bird.png"
    }
  }' \
  https://api.replicate.com/v1/predictions

To learn more, take a look at Replicate’s HTTP API reference docs.
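
The same request can be built with Python's standard library. This is a sketch that mirrors the endpoint, headers, and JSON body of the curl command above. The function name build_prediction_request is hypothetical.

```python
import json
import urllib.request

API_URL = "https://api.replicate.com/v1/predictions"


def build_prediction_request(token, version, image_url):
    """Build (but do not send) the POST request used in the curl example."""
    payload = {"version": version, "input": {"image": image_url}}
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
            # Ask the API to hold the connection until the prediction finishes.
            "Prefer": "wait",
        },
        method="POST",
    )
```

To send it, pass the request to urllib.request.urlopen and parse the response body with json.load. The response schema is not shown on this page, so inspect it before relying on specific fields.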

You can run this model locally using Cog. First, install Cog:

brew install cog

If you don’t have Homebrew, there are other installation options available.

Run this to download the model and run it in your local environment:

cog predict r8.im/raoumer/srrescgan@sha256:2bc3f9d57d00e92d00974650f3f2404499f16f26ac8c3e7f4876aaa0fa5a0cc6 \
  -i 'image="https://replicate.delivery/mgxm/b187805d-4458-461e-b928-dc30cc90e93e/bird.png"'

To learn more, take a look at the Cog documentation.

Alternatively, run the model with Docker: start the container, then send a prediction request to its local HTTP server:

docker run -d -p 5000:5000 r8.im/raoumer/srrescgan@sha256:2bc3f9d57d00e92d00974650f3f2404499f16f26ac8c3e7f4876aaa0fa5a0cc6
curl -s -X POST \
  -H "Content-Type: application/json" \
  -d $'{
    "input": {
      "image": "https://replicate.delivery/mgxm/b187805d-4458-461e-b928-dc30cc90e93e/bird.png"
    }
  }' \
  http://localhost:5000/predictions

To learn more, take a look at the Cog documentation.
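
The local server's JSON response is not shown above. The sketch below assumes it returns the upscaled image inline as a base64 data URI in an output field. That is an assumption, so check it against a real response. The helper name decode_data_uri is hypothetical.

```python
import base64


def decode_data_uri(uri):
    """Split a base64 data URI into (mime_type, raw_bytes)."""
    if not uri.startswith("data:"):
        raise ValueError("not a data URI")
    header, _, encoded = uri[len("data:"):].partition(",")
    if not header.endswith(";base64"):
        raise ValueError("only base64-encoded data URIs are handled here")
    # An empty media type defaults to text/plain per the data URI scheme.
    mime_type = header[: -len(";base64")] or "text/plain"
    return mime_type, base64.b64decode(encoded)
```

With a parsed response dictionary named result (hypothetical), the image bytes would be decode_data_uri(result["output"])[1], which can then be written to a .png file.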

Run time and cost

This model costs approximately $0.0063 to run on Replicate, or 158 runs per $1, but this varies depending on your inputs. It is also open source and you can run it on your own computer with Docker.
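
As a quick check of the arithmetic, the sketch below derives the runs-per-dollar figure from the approximate per-run cost quoted above and estimates the cost of a batch. The per-run cost varies with input, so treat the results as rough estimates. The function names are illustrative.

```python
import math

COST_PER_RUN_USD = 0.0063  # approximate per-run cost quoted above


def runs_per_dollar(cost_per_run=COST_PER_RUN_USD):
    """Whole number of runs that fit in one dollar."""
    return math.floor(1 / cost_per_run)


def estimated_cost(num_runs, cost_per_run=COST_PER_RUN_USD):
    """Rough total cost in USD for `num_runs` predictions."""
    return round(num_runs * cost_per_run, 4)
```

For example, runs_per_dollar() gives 158, and upscaling 1,000 images would cost roughly $6.30 at the quoted rate.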

This model runs on CPU hardware. Predictions typically complete within 64 seconds. The predict time for this model varies significantly based on the inputs.

Readme

Intelligent image scaling to 4x resolution. An official PyTorch implementation of the SRResCGAN model as described in the paper Deep Generative Adversarial Residual Convolutional Networks for Real-World Super-Resolution. This work participated in the CVPRW NTIRE 2020 Real-World Super-Resolution (RWSR) challenge.

Abstract

Most current deep learning based single image super-resolution (SISR) methods focus on designing deeper / wider models to learn the non-linear mapping between low-resolution (LR) inputs and the high-resolution (HR) outputs from a large number of paired (LR/HR) training data. They usually take as assumption that the LR image is a bicubic down-sampled version of the HR image. However, such degradation process is not available in real-world settings i.e. inherent sensor noise, stochastic noise, compression artifacts, possible mismatch between image degradation process and camera device. It reduces significantly the performance of current SISR methods due to real-world image corruptions. To address these problems, we propose a deep Super-Resolution Residual Convolutional Generative Adversarial Network (SRResCGAN) to follow the real-world degradation settings by adversarial training the model with pixel-wise supervision in the HR domain from its generated LR counterpart. The proposed network exploits the residual learning by minimizing the energy-based objective function with powerful image regularization and convex optimization techniques. We demonstrate our proposed approach in quantitative and qualitative experiments that generalize robustly to real input and it is easy to deploy for other down-scaling operators and mobile/embedded devices.

BibTeX

@InProceedings{Umer_2020_CVPR_Workshops,
    author = {Muhammad Umer, Rao and Luca Foresti, Gian and Micheloni, Christian},
    title = {Deep Generative Adversarial Residual Convolutional Networks for Real-World Super-Resolution},
    booktitle = {The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops},
    month = {June},
    year = {2020}
    }

SRResCGAN Architecture

[Figure: overall representative diagram]

[Figure: SR generator network]

Quantitative Results

Dataset (HR/LR pairs)	SR method	#Params	PSNR↑	SSIM↑	LPIPS↓	Artifacts
Bicubic	EDSR	43M	24.48	0.53	0.6800	Sensor noise (σ = 8)
Bicubic	EDSR	43M	23.75	0.62	0.5400	JPEG compression (quality = 30)
Bicubic	ESRGAN	16.7M	17.39	0.19	0.9400	Sensor noise (σ = 8)
Bicubic	ESRGAN	16.7M	22.43	0.58	0.5300	JPEG compression (quality = 30)
CycleGAN	ESRGAN-FT	16.7M	22.42	0.55	0.3645	Sensor noise (σ = 8)
CycleGAN	ESRGAN-FT	16.7M	22.80	0.57	0.3729	JPEG compression (quality = 30)
DSGAN	ESRGAN-FS	16.7M	22.52	0.52	0.3300	Sensor noise (σ = 8)
DSGAN	ESRGAN-FS	16.7M	20.39	0.50	0.4200	JPEG compression (quality = 30)
DSGAN	SRResCGAN (ours)	380K	25.46	0.67	0.3604	Sensor noise (σ = 8)
DSGAN	SRResCGAN (ours)	380K	23.34	0.59	0.4431	JPEG compression (quality = 30)
DSGAN	SRResCGAN+ (ours)	380K	26.01	0.71	0.3871	Sensor noise (σ = 8)
DSGAN	SRResCGAN+ (ours)	380K	23.69	0.62	0.4663	JPEG compression (quality = 30)
DSGAN	SRResCGAN (ours)	380K	25.05	0.67	0.3357	Unknown (validation set)
DSGAN	SRResCGAN+ (ours)	380K	25.96	0.71	0.3401	Unknown (validation set)
DSGAN	ESRGAN-FS	16.7M	20.72	0.52	0.4000	Unknown (test set)
DSGAN	SRResCGAN (ours)	380K	24.87	0.68	0.3250	Unknown (test set)
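
For readers unfamiliar with the PSNR column, here is a minimal sketch of the conventional definition, PSNR = 10 · log10(MAX² / MSE), over flat lists of 8-bit pixel values. The authors' exact evaluation protocol (color space, border cropping) is not stated here, so this will not necessarily reproduce the numbers in the table.

```python
import math


def psnr(reference, estimate, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two equal-length pixel lists."""
    if not reference or len(reference) != len(estimate):
        raise ValueError("inputs must be non-empty and of equal length")
    # Mean squared error over all pixel values.
    mse = sum((r - e) ** 2 for r, e in zip(reference, estimate)) / len(reference)
    if mse == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(max_value ** 2 / mse)
```

Higher is better: an estimate that is off by one intensity level at every pixel scores about 48.13 dB, while identical images score infinity.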

The NTIRE2020 RWSR Challenge Results (Track-1)

Team	PSNR↑	SSIM↑	LPIPS↓	MOS↓
Impressionism	24.67 (16)	0.683 (13)	0.232 (1)	2.195
Samsung-SLSI-MSL	25.59 (12)	0.727 (9)	0.252 (2)	2.425
BOE-IOT-AIBD	26.71 (4)	0.761 (4)	0.280 (4)	2.495
MSMers	23.20 (18)	0.651 (17)	0.272 (3)	2.530
KU-ISPL	26.23 (6)	0.747 (7)	0.327 (8)	2.695
InnoPeak-SR	26.54 (5)	0.746 (8)	0.302 (5)	2.740
ITS425	27.08 (2)	0.779 (1)	0.325 (6)	2.770
MLP-SR (ours)	24.87 (15)	0.681 (14)	0.325 (7)	2.905
Webbzhou	26.10 (9)	0.764 (3)	0.341 (9)	-
SR-DL	25.67 (11)	0.718 (10)	0.364 (10)	-
TeamAY	27.09 (1)	0.773 (2)	0.369 (11)	-
BIGFEATURE-CAMERA	26.18 (7)	0.750 (6)	0.372 (12)	-
BMIPL-UNIST-YH-1	26.73 (3)	0.752 (5)	0.379 (13)	-
SVNIT1-A	21.22 (19)	0.576 (19)	0.397 (14)	-
KU-ISPL2	25.27 (14)	0.680 (15)	0.460 (15)	-
SuperT	25.79 (10)	0.699 (12)	0.469 (16)	-
GDUT-wp	26.11 (8)	0.706 (11)	0.496 (17)	-
SVNIT1-B	24.21 (17)	0.617 (18)	0.562 (18)	-
SVNIT2	25.39 (13)	0.674 (16)	0.615 (19)	-
AITA-Noah-A	24.65 (-)	0.699 (-)	0.222 (-)	2.245
AITA-Noah-B	25.72 (-)	0.737 (-)	0.223 (-)	2.285
Bicubic	25.48 (-)	0.680 (-)	0.612 (-)	3.050
ESRGAN Supervised	24.74 (-)	0.695 (-)	0.207 (-)	2.300
Visual Results

Validation-set (Track-1)

You can download all the SR results of our method on the validation set from Google Drive: SRResCGAN, SRResCGAN+.

Test-set (Track-1)

You can download all the SR results of our method on the test set from Google Drive: SRResCGAN, SRResCGAN+.

Real-World Smartphone images (Track-2)

You can download all the SR results of our method on the smartphone images from Google Drive: SRResCGAN, SRResCGAN+.

Run with Docker

Assuming your low-resolution images are in the folder ./input, the following commands save the high-resolution (4x) results to the folder ./output.

GPU

The GPU image requires an NVIDIA GPU compatible with CUDA 11.0.

docker run -it --rm --gpus all \
    -v "$PWD/input":/code/LR \
    -v "$PWD/output":/code/sr_results_x4 \
    us-docker.pkg.dev/replicate/raoumer/srrescgan:gpu

CPU

docker run -it --rm \
    -v "$PWD/input":/code/LR \
    -v "$PWD/output":/code/sr_results_x4 \
    us-docker.pkg.dev/replicate/raoumer/srrescgan:cpu
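
To launch either variant from a script, the sketch below builds the same docker run command as an argument list for subprocess. The image name, tags, and mount points are taken from the commands above. The function name build_docker_command is hypothetical, and the -it flag is dropped on the assumption that the script runs non-interactively.

```python
from pathlib import Path

IMAGE = "us-docker.pkg.dev/replicate/raoumer/srrescgan"


def build_docker_command(input_dir="input", output_dir="output", use_gpu=False):
    """Return the docker run argument list for the CPU or GPU image."""
    # Docker bind mounts need absolute host paths.
    input_path = Path(input_dir).resolve()
    output_path = Path(output_dir).resolve()
    command = ["docker", "run", "--rm"]
    if use_gpu:
        command += ["--gpus", "all"]
    command += [
        "-v", f"{input_path}:/code/LR",
        "-v", f"{output_path}:/code/sr_results_x4",
        f"{IMAGE}:{'gpu' if use_gpu else 'cpu'}",
    ]
    return command
```

Run it with subprocess.run(build_docker_command(use_gpu=True), check=True). Passing a list rather than a shell string avoids quoting problems with paths that contain spaces.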

Code Acknowledgement

The training and testing code is partly based on ESRGAN, DSGAN, and deep_demosaick.